Mineral Identification Based on Multi-Label Image Classification
Abstract
:1. Introduction
2. Dataset
3. Method
3.1. Feature Extraction Network
3.2. Transformer Decoder
3.3. Asymmetric Loss Function
4. Experiments
4.1. Evaluation Metric
4.2. Experimental Results
4.2.1. Feature Extraction Network Selection
4.2.2. Loss Function Selection
4.2.3. Experimental Results with ViT and ASL
4.2.4. Visualization of the Class Activation Mapping
4.2.5. Comparison with Other Mineral Identification Methods
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Lou, W.; Zhang, D.; Bayless, R.C. Review of mineral recognition and its future. Appl. Geochem. 2020, 122, 104727. [Google Scholar] [CrossRef]
- Hao, H.; Gu, Q.; Hu, X. Research Advances and Prospective in Mineral Intelligent Identification Based on Machine Learning. Earth Sci. 2021, 46, 3091–3106. [Google Scholar] [CrossRef]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
- Zeng, X.; Xiao, Y.; Ji, X.; Wang, G. Mineral Identification Based on Deep Learning That Combines Image and Mohs Hardness. Minerals 2021, 11, 506. [Google Scholar] [CrossRef]
- Peng, W.H.; Bai, L.; Shang, S.W.; Tang, X.J.; Zhang, Z.Y. Common mineral intelligent recognition based on improved InceptionV3. Geol. Bull. China 2019, 38, 2059–2066. [Google Scholar] [CrossRef]
- Liu, C.; Li, M.; Zhang, Y.; Han, S.; Zhu, Y. An Enhanced Rock Mineral Recognition Method Integrating a Deep Learning Model and Clustering Algorithm. Minerals 2019, 9, 516. [Google Scholar] [CrossRef] [Green Version]
- Brempong, E.A.; Agangiba, M.; Aikins, D. MiNet: A Convolutional Neural Network for Identifying and Categorising Minerals. Ghana J. Technol. 2020, 5, 86–92. [Google Scholar] [CrossRef]
- Guo, Y.; Zhou, Z.; Lin, H.; Liu, X.; Chen, D.; Zhu, J.; Wu, J. The mineral intelligence identification method based on deep learning algorithms. Earth Sci. Front. 2020, 27, 39–47. [Google Scholar] [CrossRef]
- Li, M.; Liu, C.; Zhang, Y. A Deep Learning and Intelligent Recognition Method of Image Data for Rock Mineral and its Implementation. Geotecton. Miner. 2020, 44, 203–211. [Google Scholar]
- Jia, L.; Yang, M.; Meng, F.; He, M.; Liu, H. Mineral Photos Recognition Based on Feature Fusion and Online Hard Sample Mining. Minerals 2021, 11, 1354. [Google Scholar] [CrossRef]
- Tsoumakas, G.; Katakis, I. Multi-Label Classification: An Overview. Int. J. Data Warehous. Min. 2009, 3, 1–13. [Google Scholar] [CrossRef]
- Tarekegn, A.N.; Giacobini, M.; Michalak, K. A review of methods for imbalanced multi-label classification. Pattern Recognit. 2021, 118, 107965. [Google Scholar] [CrossRef]
- Zhang, M.; Zhou, Z. A Review on Multi-Label Learning Algorithms. IEEE Trans. Knowl. Data Eng. 2014, 26, 1819–1837. [Google Scholar] [CrossRef]
- Wei, Y.; Xia, W.; Lin, M.; Huang, J.; Ni, B.; Dong, J.; Zhao, Y.; Yan, S. HCP: A Flexible CNN Framework for Multi-Label Image Classification. IEEE Trans. Softw. Eng. 2016, 38, 1901–1907. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Lin, W.-Z.; Fang, J.-A.; Xiao, X.; Chou, K.-C. iLoc-Animal: A multi-label learning classifier for predicting subcellular localization of animal proteins. Mol. BioSystems 2013, 9, 634–644. [Google Scholar] [CrossRef] [PubMed]
- Xiao, X.; Wu, Z.-C.; Chou, K.-C. iLoc-Virus: A multi-label learning classifier for identifying the subcellular localization of virus proteins with both single and multiple sites. J. Theor. Biol. 2011, 284, 42–51. [Google Scholar] [CrossRef] [PubMed]
- Salvatore, C.; Castiglioni, I. A Wrapped Multi-label Classifier for the Automatic Diagnosis and Prognosis of Alzheimer’s Disease. J. Neurosci. Methods 2018, 302, 58–65. [Google Scholar] [CrossRef] [PubMed]
- Shao, Z.; Zhou, W.; Deng, X.; Zhang, M.; Cheng, Q. Multilabel remote sensing image retrieval based on fully convolutional network. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 318–328. [Google Scholar] [CrossRef]
- A Mineral Database. Available online: https://www.mindat.org/ (accessed on 20 July 2022).
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition(CVPR), Las Vegas, NV, USA, 16 June–1 July 2016; pp. 770–778. [Google Scholar]
- Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.C. MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 4510–4520. [Google Scholar] [CrossRef]
- Kolesnikov, A.; Beyer, L.; Zhai, X.; Puigcerver, J.; Yung, J.; Gelly, S.; Houlsby, N. Big Transfer (BiT): General Visual Representation Learning. In Proceedings of the 2020 ECCV European Conference on Computer Vision, Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2007; Volume 12350, pp. 491–507. [Google Scholar] [CrossRef]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S. An image is worth 16x16 words: Transformers for image recognition at scale. In Proceedings of the 2021 The International Conference on Learning Representations (ICLR), Online, 4–8 May 2021. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. In Proceedings of the 31st Annual Conference on Neural Information Processing Systems (NIPS), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar] [CrossRef]
- Ba, J.L.; Kiros, J.R.; Hinton, G.E. Layer Normalization. arXiv 2016, arXiv:1607.06450. [Google Scholar] [CrossRef]
- Carion, N.; Massa, F.; Synnaeve, G.; Usunier, N.; Kirillov, A.; Zagoruyko, S. End-to-end object detection with transformers. In Proceedings of the 2020 ECCV European Conference on Computer Vision, Online, 23–28 August 2020; pp. 213–229. [Google Scholar] [CrossRef]
- Ben-Baruch, E.; Ridnik, T.; Zamir, N.; Noy, A.; Zelnik-Manor, L. Asymmetric Loss For Multi-Label Classification. In Proceedings of the 2021 IEEE International Conference on Computer Vision(ICCV), Montreal, BC, Canada, 11–17 October 2021; pp. 82–91. [Google Scholar] [CrossRef]
- Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. In Proceedings of the IEEE International Conference on Computer Vision(ICCV), Venice, Italy, 22–29 October 2017; pp. 2999–3007. [Google Scholar] [CrossRef] [Green Version]
- Loshchilov, I.; Hutter, F. Decoupled weight decay regularization. In Proceedings of the 2019 The International Conference on Learning Representations (ICLR), New Orleans, LA, USA, 6–9 May 2019. [Google Scholar] [CrossRef]
- Cubuk, E.D.; Zoph, B.; Shlens, J.; Le, Q.V. Randaugment: Practical automated data augmentation with a reduced search space. In Proceedings of the 2020 IEEE Conference on Computer Vision and Pattern Recognition(CVPR), Seattle, WA, USA, 14–19 June 2020; pp. 702–703. [Google Scholar]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision(ICCV), Montreal, BC, Canada, 11–17 October 2021; pp. 10012–10022. [Google Scholar] [CrossRef]
- Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-cam: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE International Conference on Computer Vision(ICCV), Venice, Italy, 22–29 October 2017; pp. 618–626. [Google Scholar]
Number of Minerals in an Image | Number of This Kind of Images | |
---|---|---|
1 | 142,508 | |
2 | 34,619 | |
3 | 5641 | |
4 | 809 | |
5 | 104 | |
6 | 7 | |
Total | 183,688 |
Mineral | Quantities | Mineral Name | Quantities | ||
---|---|---|---|---|---|
1 | agate | 3357 | 19 | hematite | 8012 |
2 | albite | 5101 | 20 | magnetite | 2765 |
3 | almandine | 2072 | 21 | malachite | 10,256 |
4 | anglesite | 1926 | 22 | marcasite | 2144 |
5 | azurite | 8733 | 23 | opal | 3532 |
6 | beryl | 9282 | 24 | orpiment | 740 |
7 | cassiterite | 3343 | 25 | pyrite | 12,224 |
8 | chalcopyrite | 5408 | 26 | quartz | 55,059 |
9 | cinnabar | 1655 | 27 | rhodochrosite | 4550 |
10 | copper | 5460 | 28 | ruby | 828 |
11 | demantoid | 755 | 29 | sapphire | 1010 |
12 | diopside | 1718 | 30 | schorl | 3262 |
13 | elbaite | 5699 | 31 | sphalerite | 9796 |
14 | epidote | 4427 | 32 | stibnite | 2637 |
15 | fluorite | 28,290 | 33 | sulfur | 1958 |
16 | galena | 8513 | 34 | topaz | 3758 |
17 | gold | 4600 | 35 | torbernite | 1124 |
18 | halite | 963 | 36 | wulfenite | 7711 |
Total | 232,467 |
Backbone | mAP (%) | Number of Params (M) | Training Memory Cost (MB) |
---|---|---|---|
MobileNet-V2 | 72.38 | 2.27 | 678.59 |
Big Transfer-m | 79.55 | 42.56 | 2002.74 |
ResNet-101 | 81.76 | 42.57 | 1527.08 |
ViT-B/16 | 85.26 | 85.67 | 1847.14 |
Loss Function | mAP (%) | Parameter |
---|---|---|
Binary Cross-Entropy | 84.52 | - |
Focal Loss [28] | 84.79 | |
Asymmetric Loss | 85.26 |
Model | mAP (%) |
---|---|
ResNet-50 + Feature Fusion [10] | 72.13 |
ViT + Transformer Decoder (ours) | 85.26 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wu, B.; Ji, X.; He, M.; Yang, M.; Zhang, Z.; Chen, Y.; Wang, Y.; Zheng, X. Mineral Identification Based on Multi-Label Image Classification. Minerals 2022, 12, 1338. https://doi.org/10.3390/min12111338
Wu B, Ji X, He M, Yang M, Zhang Z, Chen Y, Wang Y, Zheng X. Mineral Identification Based on Multi-Label Image Classification. Minerals. 2022; 12(11):1338. https://doi.org/10.3390/min12111338
Chicago/Turabian StyleWu, Baokun, Xiaohui Ji, Mingyue He, Mei Yang, Zhaochong Zhang, Yan Chen, Yuzhu Wang, and Xinqi Zheng. 2022. "Mineral Identification Based on Multi-Label Image Classification" Minerals 12, no. 11: 1338. https://doi.org/10.3390/min12111338