Research on Coal and Gangue Recognition Model Based on CAM-Hardswish with EfficientNetV2
Abstract
:1. Introduction
2. Coal and Gangue Recognition Network Model
2.1. EffcientNetV2 Network Model
2.2. EffcientNetV2 Network Based on CAM-Hardswish
2.3. Setting of Network Structure Hyperparameters
3. Image Data Preprocessing
3.1. Acquisition of Image Data
3.2. Coal and Gangue Image Data Preprocessing
3.2.1. Image Processing of Coal and Gangue Based on Pipeline
3.2.2. Histogram Equalization Operation
4. Experimental Results and Analysis
4.1. Training of CNN Model
4.2. Evaluation Index
4.3. Comparative Analysis of Model Performance
4.4. Comparative Analysis of Model Training Speeds
4.5. Comparison between EfficientNetV2-CAMHardswish and Mainstream Model
5. Conclusions
- (1)
- Compared with the SE module in the original model, the CAM channel attention mechanism module that was introduced in this study adopted a parallel connection method that combined average pooling and max pooling. This module could efficiently extract essential image information. This solved the problem that the single pooling operation of the SE module caused the image to lose information. In order to further improve the performance of the coal and gangue recognition network, the ReLu activation function was replaced by the Hardswish activation function, which improved the recognition accuracy.
- (2)
- The original EfficientNetV2 network suffered from excessive video memory usage, which led to a decrease in training speed and efficiency. The proposed coal and gangue recognition network in this study significantly enhanced its training speed, reduced video memory usage, and shortened the training time without compromising model accuracy.
- (3)
- This study resulted in an improved EfficientNetV2 network that showcases excellent performance in recognizing coal and gangue. It can achieve a recognition rate of 98.24%, which is a significant improvement of 3.98% over the previously tested network. The improved model reduced the inference time by 6.6 ms compared with its predecessor. The effectiveness of the improved model was validated.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Gao, L. Analysis on composition characteristics and resource utilization status of coal gangue. Jiangxi Coal Sci. Technol. 2022, 176, 233–235+238. [Google Scholar]
- Wang, X.; Niu, Y. Review of research on coal gangue: Classification, hazard and comprehensive utilization. Ind. Miner. Process. 2023. Available online: https://kns.cnki.net/kcms/detail/32.1492.TQ.20230228.1822.002.html (accessed on 7 June 2023).
- Cao, X.; Li, Y.; Wang, P. Coal and gangue recognition method research status and prospect. J. Ind. Autom. 2020, 46, 38–43. [Google Scholar]
- Li, L.; Zhu, J.; Liu, L. Coal and gangue recognition method based on density difference study. J. Coal Technol. 2022, 41, 181–184. [Google Scholar]
- Huo, P.; Zeng, H.; Huo, K. Research on coal gangue density recognition system based on image processing. Coal Prep. Technol. 2015, 249, 69–73. [Google Scholar]
- Pan, Y.; Zeng, Z.; Zhang, E. Research on Recognition of coal gangue in X-ray Detection Based on MATLAB and Image Gray Value. Coal Technol. 2017, 36, 307–309. [Google Scholar]
- Guo, Y.; He, L.; Liu, P. Coal gangue dual-energy X-ray image dimensional analysis recognition method. J. China Coal Soc. 2021, 46, 300–309. [Google Scholar]
- Kiljan, P.; Moczulski, W.; Kalinowski, K. Initial Study into the Possible Use of Digital Sound Processing for the Development of Automatic Longwall Shearer Operation. Energies 2021, 14, 2877. [Google Scholar] [CrossRef]
- Wu, F.; Yang, Z.; Mo, X.; Wu, Z.; Tang, W.; Duan, J.; Zou, X. Detection and counting of banana bunches by integrating deep learning and classic image-processing algorithms. Comput. Electron. Agr. 2023, 209, 107827. [Google Scholar] [CrossRef]
- Tang, Y.; Huang, Z.; Chen, Z.; Chen, M.; Zhou, H.; Zhang, H.; Sun, J. Novel visual crack width measurement based on backbone double-scale features for improved detection automation. Eng. Struct. 2023, 274, 115158. [Google Scholar] [CrossRef]
- Li, N.; Gong, X.; Jia, P. Segmentation method for low quality images of coal and gangue based on Retinex and local texture features with multifractal. J. Electron. Imaging 2022, 31, 061820. [Google Scholar] [CrossRef]
- Li, N.; Gong, X. An Image Preprocessing Model of Coal and Gangue in High Dust and Low Light Conditions Based on the Joint Enhancement Algorithm. Comput. Intel. Neurosc. 2021, 2021, 2436486. [Google Scholar] [CrossRef] [PubMed]
- Bloice, M.; Stocker, C.; Holzinger, A. Augmentor: An image augmentation library for machine learning. arXiv 2017, arXiv:1708.04680. [Google Scholar] [CrossRef]
- Wu, G.; Liang, X.; Hu, J.; Zhang, X. Coal gangue recognition method based on image processing and convolutional Neural Network. Microcomput. Appl. 2021, 37, 100–103. [Google Scholar]
- He, K.; Zhang, X.; Ren, S. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Woo, S.; Park, J.; Lee, J. Cbam: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 7132–7141. [Google Scholar]
- Gao, Y.; Zhang, B.; Lang, L. Based on in-depth study of coal gangue identification technology and implementation. Coal Sci. Technol. 2021, 49, 202–208. [Google Scholar]
- Howard, A.; Sandler, M.; Chu, G. Searching for mobilenetv3. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 Octobe–2 November 2019; pp. 1314–1324. [Google Scholar]
- Ramachandran, P.; Zoph, B.; Le, Q. Searching for activation functions. arXiv 2017, arXiv:1710.05941. [Google Scholar]
- Pan, X.; Cao, Y.; Jia, R. Neural network architecture search development review. J. Xi’an Univ. Posts Telecommun. 2022, 27, 43–63. [Google Scholar]
- Hong, H.; Zheng, L.; Zhu, J. Automatic recognition of coal and gangue based on convolution neural network. arXiv 2017, arXiv:1712.00720. [Google Scholar]
- Li, D.; Zhang, Z.; Xu, Z. An image-based hierarchical deep learning framework for coal and gangue detection. IEEE Access 2019, 7, 184686–184699. [Google Scholar] [CrossRef]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv 2020, arXiv:2010.11929. [Google Scholar]
- Liu, Z.; Mao, H.; Wu, C. A convnet for the 2020s. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (IEEE/CVF), New Orleans, LA, USA, 18–24 June 2022; pp. 11976–11986. [Google Scholar]
- Tan, M.; Le, Q. Efficientnet: Rethinking model scaling for convolutional neural networks. In Proceedings of the International Conference on Machine Learning (ICML), Long Beach, CA, USA, 9–15 June 2019; pp. 6105–6114. [Google Scholar]
- Tan, M.; Le, Q. Efficientnetv2: Smaller models and faster training. In Proceedings of the International Conference on Machine Learning (ICML), Virtual Event, 18–24 July 2021; pp. 10096–10106. [Google Scholar]
Stage | Block | Channels | Stride | Layers |
---|---|---|---|---|
0 | Conv3×3 | 24 | 2 | 1 |
1 | Fused-MBConv3×3 | 24 | 1 | 2 |
2 | Fused-MBConv3×3 | 48 | 2 | 2 |
3 | Fused-MBConv3×3 | 64 | 2 | 2 |
4 | MBConv3×3 | 128 | 2 | 3 |
5 | MBConv3×3 | 160 | 1 | 6 |
6 | MBConv3×3 | 256 | 2 | 10 |
7 | Conv1×1&Pooling&FC | 1280 | 1 or 2 | 1 |
Confusion Matrix | True Label | ||
---|---|---|---|
Positive | Negative | ||
Predicted label | Positive | TP | FP |
Negative | FN | TN |
Model | Attention Mechanism | Activation Function | Accuracy /% | Recall /% | F1 Score | Training Time /(h’min) | Inference Time /ms |
---|---|---|---|---|---|---|---|
EV2-SE | SE | ReLu | 94.26% | 90.4% | 0.942 | 1 h’6 min | 25.7 |
EV2-CBAM | CBAM | ReLu | 97.45% | 96.9% | 0.973 | 1 h’31 min | 34 |
EV2-CAMReLu | CAM | ReLu | 97.87% | 98.6% | 0.984 | 1 h’35 min | 28 |
EV2-CAM Hardswish | CAM | Hardswish | 98.24% | 99.1% | 0.989 | 48 min | 19.1 |
Model | Accuracy /% | Recall /% | F1 Score | Training Time /(h’min) | Inference Time /ms |
---|---|---|---|---|---|
EfficientNetV2-S | 94.26% | 90.4% | 0.942 | 1 h’6 min | 25.7 |
EfficientNetV2-M | 94.57% | 93.8% | 0.962 | 2 h’13 min | 38.2 |
EfficientNetV2-L | 95.0% | 92.9% | 0.96 | 3 h’45 min | 54.8 |
EfficientNetV2-CAM-Hardswish | 98.24% | 99.1% | 0.989 | 48 min | 19.1 |
Model | Accuracy /% | Recall /% | F1 Score | Training Time /(h’min) | Inference Time /ms |
---|---|---|---|---|---|
Vision Transformer | 94.31 | 94.2 | 0.952 | 1 h’17 min | 8.2 |
MobileNetV3-large | 94.57 | 94.1 | 0.956 | 30 min | 8.5 |
ConvNeXt | 94.0 | 93.8 | 0.954 | 1 h’8 min | 6.5 |
EfficientNetV2-CAMHardswish | 98.24 | 99.1 | 0.989 | 48 min | 19.1 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, N.; Xue, J.; Wu, S.; Qin, K.; Liu, N. Research on Coal and Gangue Recognition Model Based on CAM-Hardswish with EfficientNetV2. Appl. Sci. 2023, 13, 8887. https://doi.org/10.3390/app13158887
Li N, Xue J, Wu S, Qin K, Liu N. Research on Coal and Gangue Recognition Model Based on CAM-Hardswish with EfficientNetV2. Applied Sciences. 2023; 13(15):8887. https://doi.org/10.3390/app13158887
Chicago/Turabian StyleLi, Na, Jiameng Xue, Sibo Wu, Kunde Qin, and Na Liu. 2023. "Research on Coal and Gangue Recognition Model Based on CAM-Hardswish with EfficientNetV2" Applied Sciences 13, no. 15: 8887. https://doi.org/10.3390/app13158887
APA StyleLi, N., Xue, J., Wu, S., Qin, K., & Liu, N. (2023). Research on Coal and Gangue Recognition Model Based on CAM-Hardswish with EfficientNetV2. Applied Sciences, 13(15), 8887. https://doi.org/10.3390/app13158887