Collaborative Optimization of High-Resolution Representation and Miss-Sensitive Supervision for Aero-Engine Micro-Crack Detection
Abstract
1. Introduction
- To alleviate the degradation of edge and texture information of micro-cracks in deep networks, a P1/P2 shallow high-resolution crack perception branch is introduced, enabling crack endpoints, edges, and local textures to participate in detection prediction at higher spatial resolution, thereby improving the model’s sensitivity to small-scale cracks.
- To address the severe imbalance between defect regions and background regions in borescope images, Focal Loss is incorporated as a hard-sample reweighting mechanism to reduce the dominance of easy background samples during training and enhance the model’s attention to low-confidence cracks and cracks under complex backgrounds.
- Considering that missed detections are more critical than false alarms in aero-engine crack inspection, Tversky Loss is extended to the object-level detection matching process. By assigning higher weights to false negatives, a miss-sensitive constraint mechanism is constructed to improve the recall capability for real crack regions.
- To improve learning on low-contrast, highly reflective, and easily confused crack samples, a Hard Mining-based self-enhanced learning strategy is designed. Based solely on training-set prediction results, missed cracks, low-confidence cracks, and category-confused cracks are selectively sampled and repeatedly trained to strengthen the model’s learning capability for difficult crack samples.
2. Dataset and Defect Characteristic Analysis
3. Crack Detection Method
3.1. High-Resolution Crack Perception
3.1.1. Micro-Crack Feature Analysis
3.1.2. Shallow High-Resolution Crack Representation
3.1.3. Multi-Scale Crack Prediction and Decoupled Detection Head
3.2. Miss-Sensitive Supervision
3.2.1. Hard-Sample Reweighting Based on Focal Loss
3.2.2. Recall Enhancement Constraint Based on Tversky Loss
3.2.3. Focal–Tversky Joint Optimization
3.3. Hard-Crack Self-Enhanced Learning
- (1)
- Missed cracks;
- (2)
- Low-confidence cracks;
- (3)
- Crack-as-gouge confusion samples.
4. Results and Discussion
4.1. Experimental Settings
4.2. Evaluation Metrics
4.3. Results
- (1)
- YOLO11 detection baseline;
- (2)
- YOLO11 segmentation baseline;
- (3)
- YOLO11 with P1/P2 high-resolution detection branches;
- (4)
- P1/P2 with Focal Loss;
- (5)
- P1/P2 with Focal–Tversky joint supervision;
- (6)
- The complete model with P1/P2, Focal Loss, Tversky Loss, and Hard Mining. The results are shown in Table 3.
5. Conclusions
- Expanding aero-engine borescope defect datasets to include more defect categories, operating conditions, and background environments;
- Introducing crack-skeleton constraints, curvilinear structural priors, or detection–segmentation fusion mechanisms to better characterize the geometric morphology of micro-cracks;
- Further optimizing hard-sample selection thresholds and repeated-sampling weights to achieve improved Precision–Recall balance;
- Combining lightweight network design and inference acceleration strategies to improve deployment capability on practical borescope inspection devices and aero-engine maintenance systems.
- Including comprehensive comparisons with YOLOv10, RT-DETR, and Transformer-based detectors to further validate the generalizability of the proposed method.
- Addressing the limitation of statistical robustness analysis, future work will conduct multiple independent training runs under different random seeds and report the mean ± standard deviation of evaluation metrics. In addition, statistical hypothesis testing (e.g., t-test) will be introduced to rigorously evaluate the significance of performance improvements and further improve the reliability and reproducibility of the proposed method under stochastic training conditions.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Abdulrahman, Y.; Eltoum, M.A.M.; Ayyad, A.; Moyo, B.; Zweiri, Y. Aero-Engine Blade Defect Detection: A Systematic Review of Deep Learning Models. IEEE Access 2023, 11, 53048–53061. [Google Scholar] [CrossRef]
- Shang, H.; Sun, C.; Liu, J.; Chen, X.; Yan, R. Deep Learning-Based Borescope Image Processing for Aero-Engine Blade In-Situ Damage Detection. Aerosp. Sci. Technol. 2022, 123, 107473. [Google Scholar] [CrossRef]
- Aust, J.; Pons, D. Taxonomy of Gas Turbine Blade Defects. Aerospace 2019, 6, 58. [Google Scholar] [CrossRef]
- Xu, X.; Li, F.; Xiong, L.; He, C.; Peng, H.; Zhao, Y.; Song, G. FDC-YOLO: A Blur-Resilient Lightweight Network for Engine Blade Defect Detection. Algorithms 2025, 18, 725. [Google Scholar] [CrossRef]
- Shi, T.; Boutry, N.; Xu, Y.; Géraud, T. Local Intensity Order Transformation for Robust Curvilinear Object Segmentation. IEEE Trans. Image Process. 2022, 31, 2557–2569. [Google Scholar] [CrossRef]
- Liu, S.; Liang, W.; Zhang, Y.; Wang, Y.; Liu, Y. Aero-engine blade detection and tracking using networked borescopes. Int. J. Sens. Netw. 2025, 47, 148–161. [Google Scholar] [CrossRef]
- Shang, H.; Yang, Q.; Sun, C.; Chen, X.; Yan, R. Superpixel Perception Graph Neural Network for Intelligent Defect Detection of Aero-Engine Blade. J. Manuf. Syst. 2024, 77, 112–126. [Google Scholar] [CrossRef]
- Mohammed Eltoum, M.A.; Iqbal, E.; Zweiri, Y.; Moyo, B.; Abdulrahman, Y. BladeSynth: A High-Quality Rendering-Based Synthetic Dataset for Aero Engine Blade Defect Inspection. Sci. Data 2025, 12, 1268. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2017; pp. 2980–2988. [Google Scholar] [CrossRef]
- Chen, Z.; Zhu, Q.; Zhou, X.; Deng, J.; Song, W. Experimental study on YOLO-based leather surface defect detection. IEEE Access 2024, 12, 32830–32848. [Google Scholar] [CrossRef]
- Srinivasan, R.; Dhavale, S.; Agrawal, P. Deep Learning–Based Automated Aero-Engine Blade Inspection, Damage Tracking, and Maintenance Decision Support. Aerosp. Sci. Technol. 2026, 178, 112350. [Google Scholar] [CrossRef]
- Wang, C.-Y.; Bochkovskiy, A.; Liao, H.-Y.M. YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2023; pp. 7464–7475. [Google Scholar] [CrossRef]
- Wang, A.; Chen, H.; Liu, L.; Chen, K.; Lin, Z.; Han, J.; Ding, G. YOLOv10: Real-Time End-to-End Object Detection. Adv. Neural Inf. Process. Syst. 2024, 37, 107984–108011. [Google Scholar] [CrossRef]
- Carion, N.; Massa, F.; Synnaeve, G.; Usunier, N.; Kirillov, A.; Zagoruyko, S. End-to-End Object Detection with Transformers. In Proceedings of the Computer Vision–ECCV 2020; Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 213–229. [Google Scholar] [CrossRef]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision; IEEE: New York, NY, USA, 2021; pp. 9992–10002. [Google Scholar] [CrossRef]
- Kannusamy, P.; Gayathri, D.; Mirdula, S.; Bhoopalan, R.; Manikandan, D.; Ramaswamy, K. High-Precision Non-Destructive Blade Surface Inspection via Self Learning Transformer Networks. Sci. Rep. 2026, 16, 10288. [Google Scholar] [CrossRef]
- Zhao, Y.; Lv, W.; Xu, S.; Wei, J.; Wang, G.; Dang, Q.; Liu, Y.; Chen, J. DETRs Beat YOLOs on Real-Time Object Detection. In Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2024; pp. 16965–16974. [Google Scholar] [CrossRef]
- Wang, C.-Y.; Yeh, I.-H.; Mark Liao, H.-Y. YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information. In Proceedings of the Computer Vision–ECCV 2024; Leonardis, A., Ricci, E., Roth, S., Russakovsky, O., Sattler, T., Varol, G., Eds.; Springer Nature: Cham, Switzerland, 2025; pp. 1–21. [Google Scholar] [CrossRef]
- Lin, T.-Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2017; pp. 2999–3007. [Google Scholar] [CrossRef]
- Salehi, S.S.M.; Erdogmus, D.; Gholipour, A. Tversky Loss Function for Image Segmentation Using 3D Fully Convolutional Deep Networks. In Proceedings of the Machine Learning in Medical Imaging; Wang, Q., Shi, Y., Suk, H.-I., Suzuki, K., Eds.; Springer International Publishing: Cham, Switzerland, 2017; pp. 379–387. [Google Scholar] [CrossRef]








| Category | Description | Legend | Amount |
|---|---|---|---|
| Crack | A linear opening that can easily be seen and which can cause the material to break | ![]() | 185 |
| Gouge | A large rough cut of large depth with some removal of material caused because a sharp object has hit the part | 201 |
| Input: Training set Dtrain, stage model M, IoU threshold τiou, low-confidence threshold τlow, repetition coefficient r | |
| Output: Enhanced training set D’train | |
| Step | Procedure |
| 1 | Initialize model M |
| 2 | For t = 1 to T, use model M to perform inference on the training set Dtrain |
| 3 | For each ground-truth crack target, compute the maximum IoU mi with the prediction set |
| 4 | If mi < τiou, label the sample as missed crack. |
| 5 | If mi ≥ τiou and conf < τlow ,label the sample as low-confidence crack. |
| 6 | If the predicted category is gouge while the ground-truth category is crack, label the sample as crack-as-gouge. |
| 7 | Add the identified hard samples into the hard-sample pool H. |
| 8 | Perform repeated sampling or sample duplication on H according to repetition coefficient r. |
| 9 | Update training set Dtrain ← D’train and update model M |
| 10 | Obtain the enhanced training set D’train for subsequent model training. |
| No. | Method | Precision | Recall | mAP@0.5 | mAP@0.5:0.95 | F1 | Crack mAP@0.5:0.95 | Gouge mAP@0.5:0.95 |
|---|---|---|---|---|---|---|---|---|
| 1 | YOLOv11-det baseline | 0.8909 | 0.8598 | 0.9549 | 0.6121 | 0.8751 | 0.5310 | 0.6932 |
| 2 | YOLOv11-seg baseline | 0.9350 | 0.9409 | 0.9753 | 0.6165 | 0.9379 | 0.4735 | 0.4501 |
| 3 | P1/P2 | 0.8993 | 0.9345 | 0.9610 | 0.6291 | 0.9166 | 0.6180 | 0.6756 |
| 4 | P1/P2 + Focal | 0.9912 | 0.9477 | 0.9829 | 0.6104 | 0.9689 | 0.6065 | 0.6734 |
| 5 | P1/P2 + Focal + Tversky | 0.9743 | 0.9744 | 0.9849 | 0.6317 | 0.9743 | 0.6212 | 0.6824 |
| 6 | P1/P2 + Focal + Tversky + Hard Mining | 0.9981 | 0.9606 | 0.9781 | 0.6938 | 0.9790 | 0.6869 | 0.7007 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Li, Z.; Liu, J.; Wang, H.; Liu, Z.; Zhang, F.; Bai, N.; Hou, J.; Yang, Y.; Cui, L. Collaborative Optimization of High-Resolution Representation and Miss-Sensitive Supervision for Aero-Engine Micro-Crack Detection. J. Imaging 2026, 12, 294. https://doi.org/10.3390/jimaging12070294
Li Z, Liu J, Wang H, Liu Z, Zhang F, Bai N, Hou J, Yang Y, Cui L. Collaborative Optimization of High-Resolution Representation and Miss-Sensitive Supervision for Aero-Engine Micro-Crack Detection. Journal of Imaging. 2026; 12(7):294. https://doi.org/10.3390/jimaging12070294
Chicago/Turabian StyleLi, Zixuan, Jiaxin Liu, Hongwei Wang, Zhaoming Liu, Feng Zhang, Ning Bai, Jing Hou, Yongliang Yang, and Long Cui. 2026. "Collaborative Optimization of High-Resolution Representation and Miss-Sensitive Supervision for Aero-Engine Micro-Crack Detection" Journal of Imaging 12, no. 7: 294. https://doi.org/10.3390/jimaging12070294
APA StyleLi, Z., Liu, J., Wang, H., Liu, Z., Zhang, F., Bai, N., Hou, J., Yang, Y., & Cui, L. (2026). Collaborative Optimization of High-Resolution Representation and Miss-Sensitive Supervision for Aero-Engine Micro-Crack Detection. Journal of Imaging, 12(7), 294. https://doi.org/10.3390/jimaging12070294


