1. Introduction
The human voice is one of the most significant physiological characteristics employed in biometric identification systems, carrying rich information about an individual’s gender, age, and emotional state. Voice-based gender identification has therefore emerged as an increasingly important research field. Voice signals have distinctive characteristics, such as frequency, tone, timbre, acoustic properties, and spectral parameters, that serve as key indicators of gender. Voice-based gender identification is the process of determining a person’s gender by analyzing these voice characteristics, and it plays a critical role in numerous contemporary application areas [
1,
2,
3,
4]. Beyond gender classification, voice-related research has encompassed speaker identification enhancement, acoustic evaluation prediction for students, and millidecade spectral analysis for ocean sound identification and wind speed quantification [
5,
6,
7].
Methods employed in voice-based gender classification can be categorized into three primary approaches. The first approach encompasses traditional machine learning algorithms, including Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). Traditional machine learning methods are particularly suited to tasks with smaller datasets and simpler data structures. The second approach comprises deep learning architectures, such as Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM). Deep learning models can handle more complex data such as images, audio, and time series. The third approach involves ensemble models that incorporate algorithms such as AdaBoost and Gradient Boosting. These methods leverage the collective predictive power of multiple models to enhance performance.
Ertam proposed a new Deeper LSTM model for gender classification using the Voice Gender dataset [
8]. The proposed method achieved 98.4% accuracy in gender classification. Tursunov et al. presented a new CNN model for age and gender classification from voice data in their study [
9]. The model was trained and evaluated on two datasets, Common Voice and a locally developed Korean speech corpus, yielding accuracies ranging from 73% to 97%. Jorrin-Coz et al. performed age and gender classification using CNN and LSTM [
10]. They used three different datasets for this purpose. No dataset-specific hyperparameter optimization was performed; all three datasets were tested with similar models. The model accuracies obtained ranged from 98.86% to 99.86%.
Kwasny and Hemmerling investigated different embedder architectures such as deep neural network-based x-vector and d-vector for gender classification and age estimation systems using speech signals [
11]. Furthermore, by pretraining these models on the speaker identification task using the VoxCeleb1 dataset and then fine-tuning them for age estimation and gender classification via transfer learning, the authors achieved state-of-the-art results in gender classification (99.60% accuracy) and in age estimation on the TIMIT dataset. Ioannis et al. proposed a new semi-supervised self-labeled algorithm for voice-based gender classification in their study [
12]. They used two different datasets: the Voice Gender dataset and the Deterding dataset. The proposed method achieved accuracies of 98.42% and 93.94% on these datasets, respectively.
Yücesoy focused on ensemble models to automatically identify age and gender [
13]. He used five different machine learning algorithms: SVM, E-TREE, Random Forest (RF), KNN, and Logistic Regression (LR), applied to the Mozilla Common Voice dataset. The highest accuracy rate he achieved was 97.41%. In another study, Yücesoy again performed gender identification based on voice features [
14]. Four different algorithms were used: KNN, LDA, CNN, and MLP, with 4 main features and 12 hybrid features derived from them. In gender identification experiments using the hybrid features on the Turkish subset of the Common Voice dataset, accuracy was increased by 0.3% to 1.73%.
Buket and Jingcheng introduced a new feature set based on pitch-range (PR) for age and gender classification [
15]. The proposed PR features were evaluated on the aGender database using KNN and SVM classifiers. The results obtained showed that the PR features provided the highest accuracy rates, particularly with the SVM classifier. Eman et al. proposed a stacked ensemble learning model that uses four base classifiers (KNN, SVM, SGD, and LR) for gender identification and Linear Discriminant Analysis (LDA) as a top classifier [
16]. The model was trained on the Voice Gender dataset. The results revealed that the proposed stacked model outperformed traditional machine learning models, achieving an accuracy rate of 99.64%. A summary of the reviewed literature on voice-based gender identification using the Voice Gender dataset is presented in
Table 1.
Despite the promising results achieved in recent studies, a significant gap exists in the systematic optimization of model hyperparameters. Most existing approaches rely on manual tuning, which is time-consuming. This study addresses this critical gap by systematically integrating four metaheuristic optimization algorithms (ABC, PSO, GWO, AFSA) with machine learning methods to automate hyperparameter selection and enhance both model performance and consistency in voice-based gender identification.
This simulation-based study focuses on developing a high-performance and robust model by integrating metaheuristic optimization techniques with machine learning methods for the voice-based gender identification problem. For this purpose, a comprehensive simulation study is performed on the Voice Gender dataset. Z-score and min–max normalization techniques are applied during the data preprocessing stage, followed by the creation of classification models using four machine learning algorithms—SVM, RF, KNN, and ANN—whose hyperparameters are systematically optimized by the metaheuristic algorithms ABC, PSO, GWO, and AFSA. The results are evaluated in detail using accuracy, precision, recall (sensitivity), and F1-score metrics.
2. Materials and Methods
In recent years, machine learning methods have been applied to many types of identification problems. In this study, four different machine learning models were used to identify gender from voice. This section first provides information about the dataset, then explains the normalization methods, machine learning methods, and optimization algorithms used. For clarity, a graphical abstract of the proposed scheme for identifying voice gender is outlined in
Figure 1.
The Voice Gender dataset used in this study was created by Kory Becker in 2016. It contains a total of 3168 samples, balanced across two classes: female and male. When creating the dataset, voice samples collected from four different sources were preprocessed for acoustic analysis, and 20 features were obtained for each voice sample [
19,
20]. The acoustic features and their value ranges for the dataset are given in
Table 2.
Normalization techniques address scale disparities in data. This study employs Z-score and min–max normalization as preprocessing techniques [
21]. Z-score normalization serves as a critical preprocessing component, transforming data to achieve a mean of zero and a standard deviation of one. Z-score normalization is computed using Equation (1).
Here, x′ denotes the normalized value, x the original data point, μ the mean, and σ the standard deviation.
Min–max normalization is a common normalization method that transforms the values in a dataset into a specific range, typically the [0, 1] range. Min–max normalization is computed using Equation (2).
Here, x′ represents the normalized value, x represents the original data, xmin represents the minimum value, and xmax represents the maximum value.
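As a concrete illustration of Equations (1) and (2), the two normalization schemes can be sketched in plain Python (a minimal sketch; the function names are illustrative, and a library such as scikit-learn would typically be used in practice):

```python
def z_score(values):
    # Equation (1): x' = (x - mu) / sigma, giving mean 0 and std 1
    n = len(values)
    mu = sum(values) / n
    sigma = (sum((x - mu) ** 2 for x in values) / n) ** 0.5
    return [(x - mu) / sigma for x in values]

def min_max(values):
    # Equation (2): x' = (x - x_min) / (x_max - x_min), mapping into [0, 1]
    x_min, x_max = min(values), max(values)
    return [(x - x_min) / (x_max - x_min) for x in values]
```

Note that min–max normalization depends directly on the observed extremes, which is why it is sensitive to outliers, a point that becomes important in the noise-robustness experiments reported later.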
This study employs four fundamental machine learning algorithms commonly used for classification: SVM, RF, KNN, and ANN [
22,
23]. SVM, developed by Vapnik and Cortes [
24], is applicable to both regression and classification tasks. SVMs used in classification problems are based on the principle of finding an optimal hyperplane that maximizes the margin between classes. SVMs can effectively solve nonlinear classification problems thanks to their kernel functions [
SVMs aim to find the optimal hyperplane that best separates the points in the dataset.
The kernel method increases the dimensionality of the raw data, transforming nonlinear problems into linearly separable ones in a higher-dimensional space. Denoted as K(x, y) for n-dimensional input vectors x and y, the kernel function corresponds to a mapping function f that projects inputs from n to m dimensions [
26]. The mathematical formulation of support vectors is expressed in Equation (3).
The objective is to determine parameters w and b such that the hyperplane separates the classes while maximizing the margin 1/‖w‖₂. Here, w denotes the weight vector defining the hyperplane, b the bias term, xi the training samples, and yi the class labels (±1). RF was developed by Breiman [
27]. RF is a method that uses an ensemble learning approach in classification problems and achieves high accuracy rates by combining the predictions of multiple decision trees [
28]. When making predictions for classification, an observation passes through each decision tree in the forest, and the final prediction from RF is made based on majority voting of all decision tree results. The fundamental RF formulation is presented in Equation (4).
Here, Ŷ represents the final predicted class, hi(x) represents the prediction made by the i-th decision tree for the x input vector, n represents the total number of trees in the forest, and mode represents the most frequently occurring value, the majority vote.
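The majority-vote rule in Equation (4) amounts to taking the mode of the individual tree predictions; assuming the per-tree predictions are already available, it can be sketched as:

```python
from collections import Counter

def rf_predict(tree_predictions):
    # Equation (4): Y_hat = mode(h_1(x), ..., h_n(x))
    # tree_predictions: list of class labels, one per decision tree
    return Counter(tree_predictions).most_common(1)[0][0]
```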
KNN is a widely employed nonparametric classification algorithm, valued for its implementation simplicity. KNN operates on the principle of proximity principle and assumes that similar data points tend to be close to each other in space. Predictions for new data samples are made based on their proximity to the nearest neighbors in the training set [
29]. KNN performance depends critically on the distance metric and K value [
30]. The algorithm’s foundation rests on distance computation and class assignment. Equations (5) and (6) present the KNN formulation using the Euclidean distance metric.
In Equation (5), d(x, xi) denotes the Euclidean distance between the test point x and the training point xi, p represents the feature dimensionality, xj represents the j-th feature of the test point, and xij represents the j-th feature of the i-th training point. In Equation (6), Ŷ represents the predicted class, c represents the class label, Nk(x) represents the set of k nearest neighbors of the test point x, yi represents the class label of the i-th neighbor, and I represents the indicator function (1 if the condition is true, 0 otherwise).
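Equations (5) and (6) can be sketched together as a minimal plain-Python KNN classifier (illustrative only; in practice a library implementation such as scikit-learn's `KNeighborsClassifier` would be used):

```python
from collections import Counter

def euclidean(x, xi):
    # Equation (5): d(x, xi) = sqrt(sum_j (x_j - x_ij)^2)
    return sum((a - b) ** 2 for a, b in zip(x, xi)) ** 0.5

def knn_predict(x, train_X, train_y, k=3):
    # Equation (6): majority class among the k nearest neighbors of x
    neighbors = sorted(zip(train_X, train_y),
                       key=lambda pair: euclidean(x, pair[0]))[:k]
    return Counter(label for _, label in neighbors).most_common(1)[0][0]
```

Because every term inside the distance sum is squared raw feature differences, a feature spanning [0, 1000] dominates one spanning [0, 1], which is exactly why KNN benefits so strongly from normalization in the results reported later.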
ANN is a machine learning method developed based on biological nerve cells and the human brain [
31]. It demonstrates high success in solving complex classification problems. ANNs perform well in identifying patterns or trends in data. Therefore, they are frequently used in classification problems [
32]. ANNs comprise an input layer, one or more hidden layers, and an output layer. Equation (7) presents the fundamental neuron formulation.
Here, xi denotes the input values, wi the corresponding weights, n the number of inputs, and b the bias term. This weighted sum is subsequently passed through an activation function, as shown in Equation (8).
Here, f denotes the activation function and y the neuron output, which is propagated to the subsequent layer.
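Equations (7) and (8) describe a single neuron; a minimal sketch follows, using a sigmoid activation (the choice of activation function is an assumption for illustration, as the text does not specify one):

```python
import math

def neuron(x, w, b):
    # Equation (7): weighted sum z = sum_i w_i * x_i + b
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    # Equation (8): y = f(z); here f is the sigmoid activation
    return 1.0 / (1.0 + math.exp(-z))
```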
Random search is a method that was systematically studied and popularized by Bergstra and Bengio in 2012 for hyperparameter optimization. Random search evaluates each combination by randomly sampling from the hyperparameter space and selects the parameters that yield the best results. A significant advantage over grid search is that it explores the hyperparameter space more efficiently. Especially in cases where some hyperparameters have a much greater impact on model performance than others, random search can experiment with these important parameters more extensively. It can deliver much faster and more effective results than grid search in high-dimensional hyperparameter spaces. It is widely used in hyperparameter optimization for machine learning models, especially when computational resources are limited and a large number of hyperparameters need to be adjusted [
33,
34,
35].
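The random search procedure described above can be sketched as follows (the search space bounds and the objective function are toy placeholders, not the actual search spaces used in this study):

```python
import random

def random_search(objective, space, n_trials=50, seed=42):
    # Sample hyperparameter combinations uniformly at random; keep the best.
    rng = random.Random(seed)
    best_params, best_score = None, float("-inf")
    for _ in range(n_trials):
        params = {name: rng.uniform(lo, hi) for name, (lo, hi) in space.items()}
        score = objective(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score

# Toy objective with a peak at C = 10, gamma = 0.5 (illustrative only).
space = {"C": (0.1, 100.0), "gamma": (0.001, 1.0)}
best, score = random_search(
    lambda p: -((p["C"] - 10) ** 2) - ((p["gamma"] - 0.5) ** 2), space
)
```

In a real pipeline, `objective` would train a model with the sampled hyperparameters and return its cross-validation accuracy.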
The Tree-structured Parzen Estimator (TPE) is a sophisticated Bayesian optimization algorithm developed by Bergstra et al. in 2011 [
34]. Unlike traditional Bayesian optimization, TPE models the p(x|y) distribution. That is, it separately estimates the probability of hyperparameters yielding good and bad results. The algorithm divides previously tested hyperparameters into two groups: those that yield good results and those that yield poor results. Using these two distributions, it selects the most promising hyperparameters for the next trial. It balances exploration and exploitation using the Expected Improvement (EI) criterion. TPE’s tree-like structure allows it to effectively handle categorical and conditional hyperparameters. It can find optimal or near-optimal hyperparameters with significantly fewer trials compared to Random Search and Grid Search. It is particularly preferred in hyperparameter optimization for deep learning models where costly evaluations are involved and in auto machine learning applications [
36,
37].
The metaheuristic algorithms employed—ABC, PSO, GWO, and AFSA—are bioinspired optimization methods. ABC, developed by Karaboğa in 2005 [
38], is a swarm intelligence optimization algorithm inspired by honeybee foraging behavior. The algorithm simulates three bee roles—employed, onlooker, and scout—to explore the search space for optimal solutions. Its simplicity, minimal parameter requirements, and robust global optimization performance have led to widespread adoption in machine learning and engineering applications [
39].
PSO, developed by Kennedy and Eberhart in 1995, is a swarm intelligence optimization technique inspired by the collective behavior of bird flocks and fish schools [
40]. Each particle represents a candidate solution, updating its trajectory based on personal best and global best experiences. PSO’s popularity stems from its fast convergence, straightforward mathematical formulation, and inherent parallelizability [
41].
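The PSO update rule (inertia plus personal-best and global-best attraction) can be sketched on a toy one-dimensional objective; the coefficient values follow those reported later in the paper (w = 0.7, c1 = c2 = 1.5), while the objective itself is a placeholder:

```python
import random

def pso_minimize(f, lo, hi, n_particles=30, n_iters=50,
                 w=0.7, c1=1.5, c2=1.5, seed=1):
    rng = random.Random(seed)
    xs = [rng.uniform(lo, hi) for _ in range(n_particles)]  # positions
    vs = [0.0] * n_particles                                # velocities
    pbest = xs[:]                                           # personal bests
    gbest = min(xs, key=f)                                  # global best
    for _ in range(n_iters):
        for i in range(n_particles):
            # Velocity update: inertia + cognitive pull + social pull
            vs[i] = (w * vs[i]
                     + c1 * rng.random() * (pbest[i] - xs[i])
                     + c2 * rng.random() * (gbest - xs[i]))
            xs[i] = min(max(xs[i] + vs[i], lo), hi)         # clamp to bounds
            if f(xs[i]) < f(pbest[i]):
                pbest[i] = xs[i]
        gbest = min(pbest, key=f)
    return gbest
```

In hyperparameter optimization, each particle position encodes a hyperparameter vector and `f` returns the (negated) cross-validation accuracy.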
GWO, proposed by Mirjalili et al. in 2014, is a metaheuristic algorithm inspired by gray wolf hunting behavior and social hierarchy [
42]. The algorithm models a four-tier leadership structure (alpha, beta, delta, omega) and hunting phases (encircling, hunting, attacking prey) to navigate the search space toward optimal solutions. GWO’s local optima avoidance capability has driven its widespread adoption across engineering domains.
AFSA, developed by Li et al. in 2002, is a swarm intelligence algorithm inspired by fish schooling behaviors, including foraging, swarming, and following [
43]. Artificial fish navigate the search space as autonomous agents, exhibiting prey-searching, swarming, following, and random behaviors to locate global optima while avoiding local traps. AFSA’s strengths include parallel search capabilities, local optima avoidance, and adaptability to complex optimization landscapes.
The t-test is a parametric hypothesis test assessing the statistical significance of mean differences between groups. Three variants exist: independent-sample, paired-sample, and one-sample t-tests. In machine learning research, it facilitates model performance comparisons, feature set effect evaluation, and algorithm effectiveness analysis across datasets. The test distinguishes chance variation from meaningful differences through p-value computation [44].
ANOVA is a statistical test used to compare the means of three or more groups simultaneously. ANOVA calculates the F-statistic by dividing the between-group variance by the within-group variance and determines whether there is a significant difference between groups based on this value. The test is divided into types such as one-way ANOVA, two-way ANOVA, and repeated-measures ANOVA. Unlike the t-test, which can only compare two groups, ANOVA analyzes multiple groups simultaneously, keeping the false-positive error under control. In machine learning research, it is used to evaluate the effect of different hyperparameter combinations, compare the performance of multiple algorithms, or analyze the results of different feature selection methods [45].
This study employs four performance metrics: accuracy, precision, recall, and F1-score. Accuracy measures the proportion of correct classifications, indicating overall performance. Precision quantifies the ratio of true positives among predicted positives, reflecting false-positive control. Recall measures the proportion of actual positives correctly identified, indicating the model’s sensitivity. The F1-score, defined as the harmonic mean of precision and recall, balances these complementary metrics. This composite measure is particularly valuable for assessing model performance on imbalanced datasets. Equations (9)–(12) present the formulations for accuracy, precision, recall, and F1-score, respectively [
46,
47].
Here, TP represents true positive, FP represents false positive, TN represents true negative, and FN represents false negative.
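Equations (9)–(12) can be written directly in terms of the confusion-matrix counts:

```python
def metrics(tp, fp, tn, fn):
    # Equations (9)-(12): accuracy, precision, recall, F1-score
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)  # also called sensitivity
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1
```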
3. Results
This study evaluated 48 model combinations comprising four machine learning algorithms (SVM, RF, KNN, ANN) and four metaheuristic optimizers (ABC, PSO, GWO, AFSA) for voice-based gender identification. The Voice Gender dataset developed by Becker was used, consisting of 3168 samples and 20 features with a balanced distribution of female and male classes. Experiments were conducted using three different data preprocessing methods: raw data, z-score normalization, and min–max normalization. K-fold cross-validation is a method used to reliably evaluate the accuracy of a model: the data are divided into k folds, training and validation are performed for each fold, and the results from all folds are combined to evaluate overall performance. Mean aggregation refers to averaging the results of multiple runs. This study implemented 5 × 10-fold cross-validation with mean aggregation to enhance generalizability, mitigate overfitting, and improve evaluation robustness [
48].
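The 5 × 10-fold protocol repeats a 10-fold split five times and averages the fold scores; the index bookkeeping can be sketched as follows (the shuffling scheme and the `evaluate` callback are illustrative placeholders):

```python
import random

def kfold_indices(n_samples, k=10, seed=0):
    # Shuffle sample indices, then deal them into k roughly equal folds.
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def cross_validate(evaluate, n_samples, k=10, repeats=5):
    # evaluate(train_idx, test_idx) -> score for one fold
    scores = []
    for r in range(repeats):
        for test_idx in kfold_indices(n_samples, k, seed=r):
            test_set = set(test_idx)
            train_idx = [i for i in range(n_samples) if i not in test_set]
            scores.append(evaluate(train_idx, test_idx))
    return sum(scores) / len(scores)  # mean aggregation over k * repeats folds
```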
The study was conducted on a computer with an i7-7700 CPU (3.6 GHz), 16 GB RAM, an AMD Radeon R7 450 4 GB graphics card, and Windows 10 Pro (22H2). In the metaheuristic optimization studies, the population size was set to 30 and the number of iterations to 50. Maximum Function Evaluations (MFE) is a metric, common in optimization, indicating how many objective function evaluations an algorithm performs while searching the solution space. MFE is calculated by multiplying the population size by the maximum iteration count; the MFE value in this study was therefore 30 × 50 = 1500.
Table 3 presents the hyperparameter search spaces for each algorithm.
In the ABC algorithm, the population size is set to 30 and the maximum number of iterations is set to 50. The limit value is set to 20. Increasing the limit value in the ABC algorithm results in a greater exploitation search and slower convergence; decreasing the limit value results in more exploration and faster diversification.
For PSO, population size and iterations were set to 30 and 50, respectively. The inertia weight (w) was set to 0.7 to balance exploration (higher w) and exploitation (lower w). Cognitive and social coefficients (c1, c2) were both set to 1.5, balancing individual experience reliance and swarm influence to achieve effective exploration–exploitation trade-off. For GWO, population size and iterations were 30 and 50, respectively. The convergence parameter (a) decreases linearly from 2 to 0 across iterations, governing the search agent coefficient A ∈ [−a, a]. The coefficient C ∈ [0, 2] provides random emphasis on prey position. The condition |A| < 1 indicates exploitation phase, while |A| ≥ 1 denotes exploration, with C preventing local optima entrapment.
In the AFSA, the population size was selected as 30 and the maximum number of iterations as 50. The visual field value, which represents the fish’s vision radius or search area, was set to 0.5, and the step size, which represents the movement distance, was set to 0.3. As the visual field value increases, exploration increases, while as it decreases, exploitation increases. Therefore, a visual field value of 0.5 was chosen to provide a balanced search. As step size increases, movement speed increases, while a decrease provides slow and precise movement.
Table 4 summarizes the optimization algorithm parameters.
Table 5 presents Random Search results using 5 × 10-fold cross-validation.
The best model obtained in the random search algorithm was achieved with the SVM and min–max data combination. The parameters for this superior model are BoxConstraint (C) = 22.74, KernelScale (gamma) = 0.36, and kernel function rbf.
Table 6 presents TPE results using 5 × 10-fold cross-validation.
The best model obtained in the TPE algorithm was achieved with the SVM and z-score data combination. The parameters for this superior model are BoxConstraint (C) = 16.2819, KernelScale (gamma) = 3.4208, and kernel function rbf.
In experiments conducted on raw data, different classification algorithms and optimization methods have demonstrated varying levels of performance.
Table 7 details all results obtained with raw data.
When raw data was used, RF and ANN algorithms showed high performance at around 93%, while the KNN algorithm only achieved an accuracy rate of 81.21%. The SVM algorithm was generally successful and showed stable performance in the range of 92.38–92.55%. The highest performance in raw data was achieved with the RF+AFSA combination, with an accuracy rate of 93.21%.
Table 8 details all results obtained using z-score normalized data.
Z-score normalization yielded substantial improvements across all algorithms. The most notable improvement was seen in the KNN algorithm, which reached 98.09% compared to the raw data, recording an increase of approximately 17%. The SVM algorithm also showed significant improvement, performing in the range of 98.44–98.49%. The RF and ANN algorithms showed stable performance in the range of 98.35% to 98.62%. The highest performance for Z-score normalization was achieved with the RF+PSO combination, with an accuracy rate of 98.62%.
Table 9 presents min–max normalization results.
Both SVM and RF algorithms achieved high accuracy rates in the range of 98.40–98.68% with min–max normalization. The RF+PSO combination achieved the highest accuracy rate (98.68%). The KNN algorithm showed stable performance in the range of 98.34–98.36%, while ANN yielded results in the range of 98.40–98.46%. The highest performance for min–max normalization was achieved with the RF+PSO combination, with an accuracy rate of 98.68%.
Table 10 and
Table 11 present algorithm performance statistics and normalization effects, respectively.
The KNN algorithm benefited the most from normalization. KNN, which showed an average accuracy of only 81.07% on raw data, increased to 98.05% with z-score normalization and to 98.35% with min–max normalization, recording a total performance increase of 17.28%. This is due to the distance-based nature of KNN. When calculating distances between examples, the KNN algorithm creates unbalanced weights in calculations due to features with different scales (e.g., a feature ranging from 0 to 1 versus one ranging from 0 to 1000). Normalization eliminated this problem by bringing all features to the same scale, revealing KNN’s true potential.
The ANN algorithm was also significantly affected by normalization, with the average accuracy increasing from 88.40% in the raw data to 98.38% with z-score and 98.43% with min–max normalization, representing a 10.03% increase. This substantial improvement demonstrates that ANN’s internal activation functions and weight update mechanisms benefit greatly from normalized input features. The gradient descent optimization process converges more efficiently when all features are on the same scale, allowing the network to learn more effectively and avoid issues related to vanishing or exploding gradients that can occur with unnormalized data.
The SVM algorithm also showed significant improvement with normalization, with the average accuracy increasing from 92.47% in the raw data to 98.47% with both z-score and min–max normalization, representing a 6.00% increase. Since SVM’s kernel functions and decision boundary calculations are also distance- and scale-based, normalization has facilitated the algorithm’s process of finding optimal hyperparameters. The consistency of performance across both normalization methods (98.47%) indicates that SVM benefits primarily from having features on a comparable scale rather than from any specific normalization technique.
The RF algorithm, on the other hand, showed a modest increase of 5.51% (from 93.14% to 98.65%). While this improvement is more substantial than initially expected, it remains the smallest among all algorithms. Since the tree-based decision-making structure divides features based on threshold values, feature scales have less impact on the split points of decision trees compared to distance-based algorithms. The improvement observed suggests that even though RF is relatively robust to feature scaling, normalization still provides benefits, possibly by helping the algorithm make more balanced feature selections during the tree construction process. Therefore, while RF can perform well even with raw data, normalization enhances its performance and should not be overlooked. The general performance statistics of optimization algorithms are given in
Table 12.
The optimization algorithms demonstrated remarkably similar performance levels, with all four algorithms achieving nearly identical results. The PSO algorithm showed the highest average accuracy of 94.56% with a standard deviation of 4.84, and achieved the best overall result (98.68%) with the RF algorithm under min–max normalization. The AFSA performed almost identically with an average accuracy of 94.51% and a standard deviation of 4.88, demonstrating stable performance across all combinations. The ABC algorithm achieved an average accuracy of 94.49% with a standard deviation of 4.85, proving its reliability with consistent results. The GWO algorithm showed comparable performance with an average accuracy of 94.46% and a standard deviation of 4.91, indicating that its swarm hierarchy-based search strategy is equally effective.
The minimal differences between optimization algorithms (maximum 0.10% difference in average accuracy) suggest that the choice of optimization algorithm has limited impact on final model performance. All four algorithms demonstrated similar standard deviations (4.84–4.91), indicating consistent behavior across different classification algorithms and normalization methods. This consistency contrasts sharply with the significant impact of normalization methods, which could improve performance by up to 17.28% for distance-based algorithms like KNN.
Statistical significance tests were performed using paired t-tests and one-way ANOVA with a significance threshold of α = 0.05 to validate the observed performance differences. T-test results are presented in
Table 13.
Paired t-tests comparing raw data against normalized data revealed highly significant differences (
p < 0.001) for all algorithms. The KNN algorithm showed the most dramatic improvement with t-statistics of −310.60 (z-score) and −330.52 (min–max), corresponding to mean improvements of +16.98% and +17.28%, respectively. ANN demonstrated substantial improvement (t ≈ −187, mean difference > +10%), while SVM and RF showed t-statistics of approximately −145 with mean improvements of +6.00% and +5.51%. Comparison between z-score and min–max normalization showed that min–max performs slightly better for KNN (
p < 0.001, +0.30%), RF (
p = 0.012, +0.07%), and ANN (
p = 0.044, +0.05%), while SVM showed no significant difference (
p = 1.000). ANOVA results are presented in
Table 14.
One-way ANOVA tests revealed no statistically significant differences among the four optimization algorithms (ABC, PSO, GWO, AFSA) across all preprocessing methods (F-values < 0.095, all p > 0.96). Similarly, comparison of classification algorithms (SVM, RF, KNN, ANN) showed no significant differences (F = 1.6497, p = 0.192). These findings indicate that when properly preprocessed and optimized, algorithm selection has minimal impact on final performance.
In order to test the robustness of the proposed model, salt–pepper noise (5%) was added to an external test set consisting of 250 women and 250 men randomly selected from the dataset. The test accuracy and F1-score results for this noisy dataset are presented in
Table 15. The confusion matrices obtained from the test results are shown in
Figure 2.
The raw data approach achieved near-optimal classification with only 21 misclassifications out of 500 samples, demonstrating balanced performance across both genders (12 FN for males, 9 FP for females). Z-score normalization exhibited moderate degradation with 43 total errors, maintaining reasonable specificity for males (236/250 correct) but showing increased sensitivity to female misclassification (29 FP). Most strikingly, min–max normalization produced a catastrophic confusion pattern where 241 male samples were misclassified as female, resulting in a severely imbalanced predictor that defaulted to female classification in 98% of cases. This asymmetric failure mode reveals that salt–pepper noise, when combined with min–max scaling, disproportionately distorts male voice features by compressing their discriminative characteristics into the noise-dominated normalization range.
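The salt–pepper corruption applied in this robustness test can be sketched as follows (a 5% corruption probability per feature; using each feature's minimum as the "pepper" value and maximum as the "salt" value is an assumption for illustration, since the exact corruption scheme is not specified in the text):

```python
import random

def salt_pepper(features, low, high, p=0.05, seed=0):
    # With probability p, replace a feature with its minimum ("pepper")
    # or maximum ("salt") bound, chosen with equal probability.
    rng = random.Random(seed)
    noisy = []
    for x, lo, hi in zip(features, low, high):
        if rng.random() < p:
            noisy.append(lo if rng.random() < 0.5 else hi)
        else:
            noisy.append(x)
    return noisy
```

Because such noise injects extreme values, it directly corrupts the xmin and xmax statistics that min–max normalization depends on, consistent with the failure mode observed above.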
4. Conclusions and Discussion
This study investigated the effectiveness of metaheuristic optimization algorithms for tuning the hyperparameters of machine learning methods in voice-based gender identification. Four machine learning algorithms were combined with four metaheuristic optimizers; these algorithms and optimizers were outlined in
Section 2. Moreover, a rigorous 5 × 10-fold cross-validation protocol with mean aggregation was implemented to ensure robust evaluation and minimize the risk of overfitting. Furthermore, the KNN algorithm exhibited the most substantial benefit, with accuracy surging from 80.75% on raw data to 98.37% with normalization, an improvement of approximately 17.5% that underscores the algorithm’s sensitivity to feature scaling. This pronounced enhancement stems from KNN’s distance-based decision mechanism, where unnormalized features with disparate scales create biased distance calculations. SVM demonstrated a 6.00% improvement (92.47% to 98.47%), while ANN showed a 10.03% gain (88.40% to 98.43%), confirming that gradient-based optimization and kernel functions benefit substantially from standardized input distributions. Conversely, RF exhibited the smallest improvement at 5.51% (93.14% to 98.65%), consistent with its tree-based architecture’s inherent robustness to feature scaling, though normalization still provided measurable benefits through more balanced feature selection during tree construction.
Among the metaheuristic algorithms, performance differences proved minimal and statistically insignificant. ABC achieved 94.49% average accuracy with σ = 4.85, PSO reached 94.56% (σ = 4.84), GWO attained 94.46% (σ = 4.91), and AFSA obtained 94.51% (σ = 4.88). One-way ANOVA confirmed no significant differences among optimizers (F < 0.095, p > 0.96 across all preprocessing methods), indicating that metaheuristic choice exerts a negligible impact when proper data preprocessing is applied. Paired t-tests revealed highly significant differences between raw and normalized data (p < 0.001 for all algorithms).
External validation using salt–pepper noise (5% corruption probability) on a balanced test set (250 males, 250 females) revealed critical insights into preprocessing robustness. The catastrophic failure of min–max normalization under salt–pepper noise (50.8% accuracy vs. 95.8% for raw data) is attributed to its unbounded sensitivity to outliers, where extreme noise values corrupt xmin and xmax, shifting the entire normalized distribution and invalidating training learned boundaries. This effect disproportionately impacted male voices (241/250 misclassified as female), whose narrower frequency ranges were compressed into female classification regions. Z-score normalization maintained moderate robustness (91.4%) through bounded outlier influence, while raw data surprisingly achieved optimal noise resistance via random forest’s bootstrap aggregation and majority voting mechanisms.
This research makes the following methodological contributions: systematic multi-algorithm comparison under identical conditions; rigorous statistical validation via t-tests and ANOVA; external noise robustness assessment; comprehensive evaluation of six hyperparameter optimization strategies; and a production-ready model for practical deployment in forensic science, biometric security, voice assistants, call centers, and human–computer interaction systems. Future research will investigate robust normalization techniques (e.g., median-based robust scalers) for noisy environments, deep learning architectures, cross-dataset generalization, and real-time processing optimization for edge deployment scenarios.