This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Metaheuristic Approaches to Enhance Voice-Based Gender Identification Using Machine Learning Methods
by
Şahin Yıldırım
Şahin Yıldırım 1,*
and
Mehmet Safa Bingöl
Mehmet Safa Bingöl 2
1
Department of Mechatronics Engineering, Erciyes University, 38039 Kayseri, Turkey
2
Department of Mechatronics Engineering, Nigde Omer Halisdemir University, 51240 Nigde, Turkey
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(23), 12815; https://doi.org/10.3390/app152312815 (registering DOI)
Submission received: 30 October 2025
/
Revised: 19 November 2025
/
Accepted: 24 November 2025
/
Published: 3 December 2025
Abstract
Nowadays, classification of a person’s gender by analyzing characteristics of their voice is generally called voice-based identification. This paper presents an investigation on systematic research of metaheuristic optimization algorithms regarding machine learning methods to predict voice-based gender identification performance. Furthermore, four types of machine learning methods—Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), and Artificial Neural Network (ANN)—are employed to predict voice-based gender identification. On the other hand, initially, the dataset is preprocessed using raw data and normalized with z-score and min–max normalization methods. Second, six different hyperparameter optimization approaches, including four metaheuristic optimization algorithms (Artificial Bee Colony (ABC), Particle Swarm Optimization (PSO), Grey Wolf Optimizer (GWO), and Artificial Fish Swarm Algorithm (AFSA)), along with random search and Tree-structured Parzen Estimator (TPE), are used to optimize the hyperparameters of the machine learning methods. A rigorous 5 × 10-fold cross-validation strategy is implemented to ensure robust model evaluation and minimize overfitting. A comprehensive evaluation was conducted using 72 different model combinations, assessed through accuracy, precision, recall, and F1-score metrics. The statistical significance of performance differences among models was assessed through a paired t-test and ANOVA for multiple group comparisons. In addition, external validation was performed by introducing noise into the dataset to assess model robustness under real-world noisy conditions. The results proved that metaheuristic optimization significantly outperforms traditional manual hyperparameter tuning approaches. Therefore, the optimal model, combining min–max normalization with RF optimized via the PSO algorithm, achieved an accuracy of 98.68% and an F1-score of 0.9869, representing competitive performance relative to the existing literature. This study demonstrated valuable insights into metaheuristic optimization for voice-based gender identification and presented a deployable model for forensic science, biometric security, and human–computer interaction. The results revealed that metaheuristic optimization algorithms demonstrated superior performance compared to traditional hyperparameter tuning methods and significantly improved the accuracy of voice-based gender identification systems.
Share and Cite
MDPI and ACS Style
Yıldırım, Ş.; Bingöl, M.S.
Metaheuristic Approaches to Enhance Voice-Based Gender Identification Using Machine Learning Methods. Appl. Sci. 2025, 15, 12815.
https://doi.org/10.3390/app152312815
AMA Style
Yıldırım Ş, Bingöl MS.
Metaheuristic Approaches to Enhance Voice-Based Gender Identification Using Machine Learning Methods. Applied Sciences. 2025; 15(23):12815.
https://doi.org/10.3390/app152312815
Chicago/Turabian Style
Yıldırım, Şahin, and Mehmet Safa Bingöl.
2025. "Metaheuristic Approaches to Enhance Voice-Based Gender Identification Using Machine Learning Methods" Applied Sciences 15, no. 23: 12815.
https://doi.org/10.3390/app152312815
APA Style
Yıldırım, Ş., & Bingöl, M. S.
(2025). Metaheuristic Approaches to Enhance Voice-Based Gender Identification Using Machine Learning Methods. Applied Sciences, 15(23), 12815.
https://doi.org/10.3390/app152312815
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article metric data becomes available approximately 24 hours after publication online.