Application of Wavelet Convolution and Scale-Based Dynamic Loss for Multi-Scale Damage Detection of Mining Conveyor Belt
Abstract
1. Introduction
1.1. Background
1.2. Limitations of Existing Methods
1.3. Motivation and Contributions
- A novel damage detection framework integrating large receptive field convolutions and dynamically adjusts bounding box loss function to adapt to multi-scale damage in mining environments
- Quantitative validation of the proposed model’s enhanced accuracy and superior anti-disturbance capacity in comparison to existing traditional detection models.
- An integrated software visual display system is developed to enable end-to-end mining conveyor belt damage detection for practical industrial application.
2. Materials and Methods
2.1. Large Receptive Field Convolutions
2.2. Scale-Sensitive Loss Functions
3. Methodology
3.1. Overview of the Proposed Model
3.2. Wavelet Transform Convolution (WTConv)
3.3. Integration of WTConv into YOLOv11
- Backbone WTConv: Captures low-frequency global context (large blocks) and high-frequency details (cracks) from multi-scale feature maps.
- Neck WTConv: Refines fused features to enhance small-target (micro-holes) visibility.
3.4. Scale-Based Dynamic Loss Function
4. Experiments
4.1. Dataset
4.2. Evaluation Metrics
4.3. Experimental Setup
- Hardware: Intel Core i9-12900K CPU, RTX 3090 GPU (24 GB VRAM), 64 GB RAM.Software: PyTorch 2.1, Python 3.9.Training hyperparameters: Epochs = 100, batch size = 16, optimizer = AdamW (weight decay = 1 × 10−4), initial learning rate = 1 × 10−4 (cosine annealing to 1 × 10−5).
4.4. Results and Analysis
4.4.1. Effect of Damage Detection
4.4.2. Ablation Study
4.5. Software Development
- (1)
- Top Basic Information Area. This area displays the interface title and the current system timestamp, confirming that this tool is a dedicated monitoring system for damage inspection of mining conveyor belts.
- (2)
- Model and Parameter Configuration Area. This section supports the configuration of detection-related parameters. We can also adjust the camera.
- (3)
- Real-Time Detection Display Area. This area consists of two sub-windows: live video streaming and detected object frame, both synchronously presenting the on-site view of the mining conveyor belt.
- (4)
- Control and Status Prompt Area. This part is used to control the stop and start of the conveyor belt.
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| CNN | Conventional Neural Networks |
| YOLO | You Only Look Once |
| WT | Wavelet Transform |
| WTConv | Wavelet Transform Convolution |
| SD Loss | Scale-based Dynamic Loss |
| IoU | Intersection over Union |
| GIoU | Generalized Intersection over Union |
| CIoU | Complete Intersection over Union |
| AP | Average Precision |
| mAP | mean Average Precision |
| GFLOPs | Giga Floating Point Operations |
| TCN | Temporal Convolutional Network |
References
- Rzeszowska, A.; Błażej, R.; Jurdziak, L. Non-Destructive Testing for Conveyor Belt Monitoring and Diagnostics: A Review. Appl. Sci. 2025, 15, 13272. [Google Scholar] [CrossRef]
- Sha, L.; Zhang, W.; Zhou, J.; Peng, C.; Yu, Z. Review of Non-Destructive Testing Techniques for Conveyor Belt Damage. NDT 2025, 3, 27. [Google Scholar] [CrossRef]
- Wang, Q.; Wang, M.; Sun, J.; Chen, D.; Shi, P. Review of Surface-defect Detection Methods for Industrial Products Based on Machine Vision. IEEE Access 2025, 13, 90668–90697. [Google Scholar] [CrossRef]
- Li, X.; Huang, X.; Zhang, C.; Zhang, L. Fault detection network for X-ray imaging steel cord conveyor belt based on improved YOLOv11. Nondestruct. Test. Eval. 2025, 1–20. [Google Scholar] [CrossRef]
- Ling, J.; Fu, Z.; Yuan, X. Lightweight coal mine conveyor belt foreign object detection based on improved Yolov8n. Sci. Rep. 2025, 15, 10361. [Google Scholar] [CrossRef]
- Peng, F.; Hao, K.; Lu, X. Camera-Adaptive Foreign Object Detection for Coal Conveyor Belts. Appl. Sci. 2025, 15, 4769. [Google Scholar] [CrossRef]
- Huang, J.; Ji, L.; Ma, X.; Ye, M. BeltCrack: The First Sequential-image Industrial Conveyor Belt Crack Detection Dataset and Its Baseline with Triple-domain Feature Learning. arXiv 2025, arXiv:2506.17892. [Google Scholar]
- Liu, W.; Tao, Q.; Wang, N.; Xiao, W.; Pan, C. YOLO-STOD: An industrial conveyor belt tear detection model based on Yolov5 algorithm. Sci. Rep. 2025, 15, 1659. [Google Scholar] [CrossRef]
- Yang, L.; Chen, G.; Liu, J.; Guo, J. Wear State Detection of Conveyor Belt in Underground Mine Based on Retinex-YOLOv8-EfficientNet-NAM. IEEE Access 2024, 12, 25309–25324. [Google Scholar] [CrossRef]
- Wang, R.; Li, Y.; Yang, F.; Wang, Z.; Dong, J.; Yuan, C.; Lu, X. PLC based laser scanning system for conveyor belt surface monitoring. Sci. Rep. 2024, 14, 27914. [Google Scholar] [CrossRef]
- Wang, H.; Kou, Z.; Wang, Y. GES-YOLO: A Light-Weight and Efficient Method for Conveyor Belt Deviation Detection in Mining Environments. Machines 2025, 13, 126. [Google Scholar] [CrossRef]
- Ni, Y.; Cheng, H.; Hou, Y.; Guo, P. Study of conveyor belt deviation detection based on improved YOLOv8 algorithm. Sci. Rep. 2024, 14, 26876. [Google Scholar] [CrossRef] [PubMed]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 1137–1149. [Google Scholar] [CrossRef] [PubMed]
- Khanam, R.; Hussain, M. YOLOv11: An overview of the key architectural enhancements. arXiv 2024, arXiv:2410.17725. [Google Scholar] [CrossRef]
- Ding, X.; Zhang, X.; Han, J.; Ding, G. Scaling up your kernels to 31×31: Revisiting large kernel design in CNNs. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 11963–11975. [Google Scholar]
- Chen, H.; Chu, X.; Ren, Y.; Zhao, X.; Huang, K. Pelk: Parameter-efficient large kernel convnets with peripheral convolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 16–22 June 2024; pp. 5557–5567. [Google Scholar]
- He, J.; Erfani, S.; Ma, X.; Bailey, J.; Chi, Y.; Hua, X.S. Alpha-IoU: A family of power Intersection over Union losses for bounding box regression. arXiv 2021, arXiv:2110.13675. [Google Scholar]
- Zhang, M.; Zhang, Y.; Zhou, M.; Jiang, K.; Shi, H.; Yu, Y.; Hao, N. Application of lightweight convolutional neural network for damage detection of conveyor belt. Appl. Sci. 2021, 11, 7282. [Google Scholar] [CrossRef]
- Finder, S.E.; Amoyal, R.; Treister, E.; Freifeld, O. Wavelet convolutions for large receptive fields. In European Conference on Computer Vision; Springer Nature: Cham, Switzerland, 2024; pp. 363–380. [Google Scholar]
- Yang, J.; Liu, S.; Wu, J.; Su, X.; Hai, N.; Huang, X. Pinwheel-shaped convolution and scale-based dynamic loss for infrared small target detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, USA, 25 February–4 March 2025; Volume 39, pp. 9202–9210. [Google Scholar]
- Xin, Q.; Hu, S.; Liu, S.; Zhao, L.; Zhang, Y.-D. An attention-based wavelet convolution neural network for epilepsy EEG classification. IEEE Trans. Neural Syst. Rehabil. Eng. 2022, 30, 957–966. [Google Scholar] [CrossRef]
- Sun, Z.; Lin, Y.; Li, Y.; Lin, Z. Crossed wavelet convolution network for few-shot defect detection of industrial chips. Sensors 2025, 25, 4377. [Google Scholar] [CrossRef]
- Zakria, Z.; Deng, J.; Kumar, R.; Khokhar, M.S.; Cai, J.; Kumar, J. Multiscale and direction target detecting in remote sensing images via modified YOLO-v4. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2022, 15, 1039–1048. [Google Scholar] [CrossRef]
- Chen, Y.; Zhou, M.; Hu, F.; Gao, L.; Wang, K. YOLOv8-LDH: A lightweight model for detection of conveyor belt damage based on multispectral imaging. Measurement 2025, 245, 116675. [Google Scholar] [CrossRef]
- Liu, W.; Tao, Q.; Pei, H. Yolo-EV2: An Industrial Mining Conveyor Belt Tear Detection Model Based on Improved Yolov5 Algorithm for Efficient Backbone Networks. In Proceedings of the 2024 International Conference on Cyber-Physical Social Intelligence (ICCSI), Doha, Qatar, 8–12 November 2024; pp. 1–5. [Google Scholar]
- Khalfaoui-Hassani, I.; Pellegrini, T.; Masquelier, T. Dilated convolution with learnable spacings. arXiv 2021, arXiv:2112.03740. [Google Scholar]
- Wang, G.; Yang, Z.; Sun, H.; Zhou, Q.; Yang, Z. AC-SNGAN: Multi-class data augmentation for damage detection of conveyor belt surface using improved ACGAN. Measurement 2024, 224, 113814. [Google Scholar] [CrossRef]
- Zhou, K.; Jiang, S. Forest fire detection algorithm based on improved YOLOv11n. Sensors 2025, 25, 2989. [Google Scholar] [CrossRef]
- Hidayat, Z.S.; Wijaya, Y.A.; Kurniawan, R. Optimizing YOLOv8 for autonomous driving: Batch size for best mean average precision (mAP). J. Tek. Inform. (JUTIF) 2024, 5, 1147–1153. [Google Scholar] [CrossRef]
- Everingham, M.; Van Gool, L.; Williams, C.K.I.; Winn, J.; Zisserman, A. The Pascal Visual Object Classes (VOC) Challenge. Int. J. Comput. Vis. 2010, 88, 303–338. [Google Scholar] [CrossRef]
- Duan, K.; Bai, S.; Xie, L.; Qi, H.; Huang, Q.; Tian, Q. CenterNet: Keypoint triplets for object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27–28 October 2019; pp. 6569–6578. [Google Scholar]
- Zhang, S.; Wen, L.; Bian, X.; Lei, Z.; Li, S.Z. Single-shot refinement neural network for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June2018; pp. 4203–4212. [Google Scholar]
- Lu, X.; Kang, X.; Nishide, S.; Ren, F. Object detection based on SSD-ResNet. In Proceedings of the 2019 IEEE 6th International Conference on Cloud Computing and Intelligence Systems (CCIS), Singapore, 19–21 December 2019; pp. 89–92. [Google Scholar]
- Zhou, W.; Min, X.; Hu, R.; Long, Y.; Luo, H. FasterX: Real-time object detection based on edge GPUs for UAV applications. arXiv 2022, arXiv:2209.03157. [Google Scholar]







| Split | Images | Block (Instance) | Crack (Instance) | Foreign (Instance) | Hole (Instance) |
|---|---|---|---|---|---|
| Training | 1876 | 6212 | 226 | 1596 | 234 |
| Validation | 234 | 748 | 24 | 212 | 31 |
| Test | 235 | 737 | 28 | 176 | 32 |
| Model | Architecture | Block | Crack | Foreign | Hole | mAP50 |
|---|---|---|---|---|---|---|
| Fast R-CNN [13] | Two-stage | 0.621 | 0.785 | 0.923 | 0.256 | 0.646 |
| SSD [25] | Single-stage | 0.638 | 0.801 | 0.937 | 0.269 | 0.661 |
| Centernet [31] | Single-stage | 0.675 | 0.834 | 0.952 | 0.289 | 0.688 |
| YOLOv11 [14] | Single-stage | 0.692 | 0.871 | 0.971 | 0.307 | 0.703 |
| YOLO-EV2 [32] | Single-stage | 0.689 | 0.867 | 0.968 | 0.301 | 0.706 |
| YOLOv8-LDH [24] | Single-stage | 0.695 | 0.873 | 0.972 | 0.312 | 0.713 |
| RefineDet [33] | Single-stage | 0.653 | 0.812 | 0.945 | 0.278 | 0.672 |
| WT_YOLO | Single-stage | 0.724 | 0.927 | 0.983 | 0.317 | 0.738 |
| Model | WTconv | SD Loss | mAP50 | mAP50-95 | P | R | GFLOPs |
|---|---|---|---|---|---|---|---|
| YOLO V11 | 0.703 | 0.434 | 0.788 | 0.612 | 6.6 | ||
| √ | 0.730 | 0.453 | 0.813 | 0.642 | 6.4 | ||
| √ | 0.735 | 0.478 | 0.841 | 0.642 | 6.6 | ||
| √ | √ | 0.738 | 0.451 | 0.834 | 0.654 | 6.4 |
| Model | WTConv | SD Loss | Block | Crack | Foreign | Hole | Processing Time (ms/Image) |
|---|---|---|---|---|---|---|---|
| YOLO V11 | 0.692 | 0.871 | 0.971 | 0.307 | 28.5 | ||
| √ | 0.712 | 0.901 | 0.981 | 0.327 | 30.2 | ||
| √ | 0.730 | 0.870 | 0.983 | 0.358 | 29.1 | ||
| √ | √ | 0.721 | 0.927 | 0.982 | 0.342 | 30.8 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Xie, F.; Wang, J.; Gordin, S.A.; Ermakov, A.N.; Varnavskiy, K.A. Application of Wavelet Convolution and Scale-Based Dynamic Loss for Multi-Scale Damage Detection of Mining Conveyor Belt. Mining 2026, 6, 8. https://doi.org/10.3390/mining6010008
Xie F, Wang J, Gordin SA, Ermakov AN, Varnavskiy KA. Application of Wavelet Convolution and Scale-Based Dynamic Loss for Multi-Scale Damage Detection of Mining Conveyor Belt. Mining. 2026; 6(1):8. https://doi.org/10.3390/mining6010008
Chicago/Turabian StyleXie, Fangwei, Jianfei Wang, Sergey Alexandrovich Gordin, Aleksandr Nikolaevich Ermakov, and Kirill Aleksandrovich Varnavskiy. 2026. "Application of Wavelet Convolution and Scale-Based Dynamic Loss for Multi-Scale Damage Detection of Mining Conveyor Belt" Mining 6, no. 1: 8. https://doi.org/10.3390/mining6010008
APA StyleXie, F., Wang, J., Gordin, S. A., Ermakov, A. N., & Varnavskiy, K. A. (2026). Application of Wavelet Convolution and Scale-Based Dynamic Loss for Multi-Scale Damage Detection of Mining Conveyor Belt. Mining, 6(1), 8. https://doi.org/10.3390/mining6010008

