Advancing Image Classification and Segmentation Through Machine Learning: Architectures and Applications
1. Introduction
2. Overview of the Published Articles
3. Conclusions
Acknowledgments
Conflicts of Interest
List of Contributions
- Ungur, V.; Popa, C.A. OpenMamba: Introducing state space models to open-vocabulary semantic segmentation. Appl. Sci. 2025, 15, 9087.
- Bolocan, V.O.; Nicu-Canareica, O.; Mitoi, A.; Costache, M.G.; Manolescu, L.S.C.; Medar, C.; Jinga, V. Deep learning for adrenal gland segmentation: Comparing accuracy and efficiency across three convolutional neural network models. Appl. Sci. 2025, 15, 5388.
- Xu, S.; Dong, H.; Zhang, C.; Wang, C. Fast normalization for bilinear pooling via eigenvalue regularization. Appl. Sci. 2025, 15, 4155.
- Jiménez-Gaona, Y.; Vivanco-Galván, O.; Castillo-Malla, D.; Vivanco-Gualán, I.; Díaz-Guzmán, P. VITA-D: A radiomic web tool for predicting vitamin D deficiency levels. Appl. Sci. 2025, 15, 1798.
- Tariku, G.; Ghiglieno, I.; Sanchez Morchio, A.; Facciano, L.; Birolleau, C.; Simonetto, A.; Serina, I.; Gilioli, G. Deep-learning-based land cover mapping in Franciacorta wine growing area. Appl. Sci. 2025, 15, 871.
- Zhao, D.; Zhang, W.; Wang, Y. Research on personnel image segmentation based on MobileNetV2 H-Swish CBAM PSPNet in search and rescue scenarios. Appl. Sci. 2024, 14, 10675.
References
- Minaee, S.; Boykov, Y.; Porikli, F.; Plaza, A.; Kehtarnavaz, N.; Terzopoulos, D. Image segmentation using deep learning: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44, 3523–3542. [Google Scholar] [CrossRef] [PubMed]
- Alzubaidi, L.; Zhang, J.; Humaidi, A.J.; Al-Dujaili, A.; Duan, Y.; Al-Shamma, O.; Santamaría, J.; Fadhel, M.A.; Al-Amidie, M.; Farhan, L. Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. J. Big Data 2021, 8, 53. [Google Scholar] [CrossRef] [PubMed]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An image is worth 16 × 16 words: Transformers for image recognition at scale. In Proceedings of the International Conference on Learning Representations, Vienna, Austria, 3–7 May 2021. [Google Scholar]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021; pp. 10012–10022. [Google Scholar]
- Kirillov, A.; Mintun, E.; Ravi, N.; Mao, H.; Roll, C.; Gustafson, L.; Xiao, T.; Whitehead, S.; Berg, A.C.; Lo, W.-Y.; et al. Segment anything. In Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 1–6 October 2023; pp. 4015–4026. [Google Scholar]
- Hatamizadeh, A.; Tang, Y.; Nath, V.; Yang, D.; Myronenko, A.; Landman, B.; Roth, H.R.; Xu, D. Unetr: Transformers for 3d medical image segmentation. In Proceedings of the 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 3–8 January 2022; pp. 574–584. [Google Scholar]
- Yuan, Q.; Shen, H.; Li, T.; Li, Z.; Li, S.; Jiang, Y.; Xu, H.; Tan, W.; Yang, Q.; Wang, J.; et al. Deep learning in environmental remote sensing: Achievements and challenges. Remote Sens. Environ. 2020, 241, 111716. [Google Scholar] [CrossRef]
- Liang, F.; Wu, B.; Dai, X.; Li, K.; Zhao, Y.; Zhang, H.; Zhang, P.; Vajda, P.; Marculescu, D. Open-vocabulary semantic segmentation with mask-adapted clip. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17–24 June 2023; pp. 7061–7070. [Google Scholar]
- Maaz, M.; Shaker, A.; Cholakkal, H.; Khan, S.; Zamir, S.W.; Anwer, R.M.; Shahbaz Khan, F. Edgenext: Efficiently amalgamated cnn-transformer architecture for mobile vision applications. In European Conference on Computer Vision; Springer Nature: Cham, Switzerland, 2022; pp. 3–20. [Google Scholar]
- Han, D.; Wang, Z.; Xia, Z.; Han, Y.; Pu, Y.; Ge, C.; Song, J.; Song, S.; Zheng, B.; Huang, G. Demystify mamba in vision: A linear attention perspective. In Proceedings of the 38th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 10–15 December 2024; Volume 37, pp. 127181–127203. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Liu, F.; Huo, J. Advancing Image Classification and Segmentation Through Machine Learning: Architectures and Applications. Appl. Sci. 2026, 16, 2921. https://doi.org/10.3390/app16062921
Liu F, Huo J. Advancing Image Classification and Segmentation Through Machine Learning: Architectures and Applications. Applied Sciences. 2026; 16(6):2921. https://doi.org/10.3390/app16062921
Chicago/Turabian StyleLiu, Fan, and Jian Huo. 2026. "Advancing Image Classification and Segmentation Through Machine Learning: Architectures and Applications" Applied Sciences 16, no. 6: 2921. https://doi.org/10.3390/app16062921
APA StyleLiu, F., & Huo, J. (2026). Advancing Image Classification and Segmentation Through Machine Learning: Architectures and Applications. Applied Sciences, 16(6), 2921. https://doi.org/10.3390/app16062921

