Machine Learning Classifiers for Voice Health Assessment Under Simulated Room Acoustics †
Abstract
:1. Introduction
2. Materials and Methods
2.1. Voice Samples Under Simulated Room Acoustics
2.2. Machine Learning for Voice Disorder Screening
3. Results
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Hegde, S.; Shetty, S.; Rai, S.; Dodderi, T. A Survey on Machine Learning Approaches for Automatic Detection of Voice Disorders. J. Voice 2019, 33, 947.e11–947.e33. [Google Scholar] [CrossRef] [PubMed]
- Idrisoglu, A.; Dallora, A.L.; Anderberg, P.; Berglund, J.S. Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review. J. Med. Internet Res. 2023, 25, e46105. [Google Scholar] [CrossRef]
- Yousef, A.M. Laryngeal Mechanisms and Vocal Folds Function in Adductor Laryngeal Dystonia During Connected Speech. Ph.D. Dissertation, Michigan State University, East Lansing, MI, USA, 2023. [Google Scholar]
- Al-Hussain, G.; Shuweihdi, F.; Alali, H.; Househ, M.; Abd-Alrazaq, A. The Effectiveness of Supervised Machine Learning in Screening and Diagnosing Voice Disorders: Systematic Review and Meta-Analysis. J. Med. Internet Res. 2022, 24, e38472. [Google Scholar] [CrossRef]
- Yousef, A.M.; Deliyski, D.D.; Zacharias, S.R.; Naghibolhosseini, M. Detection of Vocal Fold Image Obstructions in High-Speed Videoendoscopy during Connected Speech in Adductor Spasmodic Dysphonia: A Convolutional Neural Networks Approach. J. Voice 2024, 38, 951–962. [Google Scholar] [CrossRef] [PubMed]
- Saidi, P.; Almasganj, F. Voice Disorder Signal Classification Using M-Band Wavelets and Support Vector Machine. Circuits Syst. Signal Process. 2015, 34, 2727–2738. [Google Scholar] [CrossRef]
- Sharma, R.K.; Gupta, A.K. Processing and Analysis of Human Voice for Assessment of Parkinson Disease. J. Med. Imaging Health Inform. 2016, 6, 63–70. [Google Scholar]
- Yousef, A.M.; Deliyski, D.D.; Zacharias, S.R.; Naghibolhosseini, M. Deep-Learning-Based Representation of Vocal Fold Dynamics in Adductor Spasmodic Dysphonia during Connected Speech in High-Speed Videoendoscopy. J. Voice 2025, 39, 570.e1–570.e15. [Google Scholar] [CrossRef]
- Yousef, A.M.; Deliyski, D.D.; Zayernouri, M.; Zacharias, S.R.; Naghibolhosseini, M. Deep Learning-Based Analysis of Glottal Attack and Offset Times in Adductor Laryngeal Dystonia. J. Voice 2023. epub ahead of print. [Google Scholar] [CrossRef]
- Leite, D.R.A.; de Moraes, R.M.; Lopes, L.W. Different Performances of Machine Learning Models to Classify Dysphonic and Non-Dysphonic Voices. J. Voice 2022, in press. [Google Scholar] [CrossRef]
- Rehman, M.U.; Shafique, A.; Jamal, S.S.; Gheraibia, Y.; Usman, A.B. Voice Disorder Detection Using Machine Learning Algorithms: An Application in Speech and Language Pathology. Eng. Appl. Artif. Intell. 2024, 133, 108047. [Google Scholar] [CrossRef]
- Chen, C.-C.; Hsu, W.-C.; Lin, T.-H.; Chen, K.-D.; Tsou, Y.-A.; Liu, Y.-W. Classification of Vocal Cord Disorders: Comparison Across Voice Datasets, Speech Tasks, and Machine Learning Methods. In Proceedings of the 2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Taipei, Taiwan, 31 October–3 November 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1868–1873. [Google Scholar]
- Deliyski, D.D.; Evans, M.K.; Shaw, H.S. Influence of Data Acquisition Environment on Accuracy of Acoustic Voice Quality Measurements. J. Voice 2005, 19, 176–186. [Google Scholar] [CrossRef] [PubMed]
- Mealings, K.; Maggs, L.; Buchholz, J.M. The Effects of Classroom Acoustic Conditions on Teachers’ Health and Well-Being: A Scoping Review. J. Speech Lang. Hear. Res. 2024, 67, 346–367. [Google Scholar] [CrossRef]
- Marsano-Cornejo, M.-J.; Roco-Videla, Á. Variation of the Acoustic Parameters: F0, Jitter, Shimmer and Alpha Ratio in Relation with Different Background Noise Levels. Acta Otorrinolaringol. Engl. Ed. 2023, 74, 219–225. [Google Scholar] [CrossRef]
- Al-Nasheri, A.; Muhammad, G.; Alsulaiman, M.; Ali, Z. Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions. J. Voice 2017, 31, 3–15. [Google Scholar] [CrossRef]
- Muhammad, G.; Alsulaiman, M.; Ali, Z.; Mesallam, T.A.; Farahat, M.; Malki, K.H.; Al-Nasheri, A.; Bencherif, M.A. Voice Pathology Detection Using Interlaced Derivative Pattern on Glottal Source Excitation. Biomed. Signal Process. Control 2017, 31, 156–164. [Google Scholar] [CrossRef]
- Rani, K.U.; Holi, M.S. GMM Classifier for Identification of Neurological Disordered Voices Using MFCC Features. IOSR J. VLSI Signal Process. 2015, 4, 44–51. [Google Scholar]
- Uloza, V.; Verikas, A.; Bacauskiene, M.; Gelzinis, A.; Pribuisiene, R.; Kaseta, M.; Saferis, V. Categorizing Normal and Pathological Voices: Automated and Perceptual Categorization. J. Voice 2011, 25, 700–708. [Google Scholar] [CrossRef] [PubMed]
- Yousef, A.M.; Deliyski, D.D.; Zacharias, S.R.; de Alarcon, A.; Orlikoff, R.F.; Naghibolhosseini, M. A Hybrid Machine-Learning-Based Method for Analytic Representation of the Vocal Fold Edges during Connected Speech. Appl. Sci. 2021, 11, 1179. [Google Scholar] [CrossRef]
- Agarwal, A.; Chandrayan, S.; Sahu, S.S. Prediction of Parkinson’s Disease Using Speech Signal with Extreme Learning Machine. In Proceedings of the 2016 international conference on electrical, electronics, and optimization techniques (ICEEOT), Chennai, India, 3–5 March 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 3776–3779. [Google Scholar]
- Teixeira, J.P.; Fernandes, P.O.; Alves, N. Vocal Acoustic Analysis–Classification of Dysphonic Voices with Artificial Neural Networks. Procedia Comput. Sci. 2017, 121, 19–26. [Google Scholar] [CrossRef]
- Maryn, Y.; Roy, N. Sustained Vowels and Continuous Speech in the Auditory-Perceptual Evaluation of Dysphonia Severity. J. Soc. Bras. Fonoaudiol. 2012, 24, 107–112. [Google Scholar] [CrossRef]
- Castillo-Allendes, A.; Codino, J.; Cantor-Cutiva, L.C.; Nudelman, C.J.; Rubin, A.D.; Barsties V Latoszek, B.; Hunter, E.J. Clinical Utility and Validation of the Acoustic Voice Quality and Acoustic Breathiness Indexes for Voice Disorder Assessment in English Speakers. J. Clin. Med. 2023, 12, 7679. [Google Scholar] [CrossRef] [PubMed]
- Yousef, A.M.; Hunter, E.J. Sensitivity of Acoustic Voice Quality Measures in Simulated Reverberation Conditions. Bioengineering 2024, 11, 1253. [Google Scholar] [CrossRef] [PubMed]
- Yousef, A.M.; Hunter, E.J. Male-Female Specific Changes in Voice Parameters under Varying Room Acoustics. Proc. Meet. Acoust. 2024, 55, 060001. [Google Scholar] [PubMed]
- Boersma, P. Praat, a System for Doing Phonetics by Computer. Glot Int. 2001, 5, 341–345. [Google Scholar]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V. Scikit-Learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Yousef, A.M.; Hunter, E.J. How Do Room Acoustics Impact Machine Learning Accuracy in Voice Disorder Detection? In Proceedings of the 1st International Online Conference on Bioengineering Session Biosignal Processing, Online, 16–18 October 2024; MDPI: Basel, Switzerland, 2024. [Google Scholar]
- Yousef, A.M.; Hunter, E.J. Effect of Data Collection and Environment on Machine Learning Performance in Screening Dysphonia. In Proceedings of the the 1st International Online Conference on Bioengineering, Session Biosignal Processing, Online, 16–18 October 2024; MDPI: Basel, Switzerland, 2024. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yousef, A.M.; Hunter, E.J. Machine Learning Classifiers for Voice Health Assessment Under Simulated Room Acoustics. Eng. Proc. 2024, 81, 16. https://doi.org/10.3390/engproc2024081016
Yousef AM, Hunter EJ. Machine Learning Classifiers for Voice Health Assessment Under Simulated Room Acoustics. Engineering Proceedings. 2024; 81(1):16. https://doi.org/10.3390/engproc2024081016
Chicago/Turabian StyleYousef, Ahmed M., and Eric J. Hunter. 2024. "Machine Learning Classifiers for Voice Health Assessment Under Simulated Room Acoustics" Engineering Proceedings 81, no. 1: 16. https://doi.org/10.3390/engproc2024081016
APA StyleYousef, A. M., & Hunter, E. J. (2024). Machine Learning Classifiers for Voice Health Assessment Under Simulated Room Acoustics. Engineering Proceedings, 81(1), 16. https://doi.org/10.3390/engproc2024081016