Article

Exploring Acoustic Correlates of Depression and Preliminary Screening Models Using XGBoost and SHAP

1. School of AI Convergence, Global Cyber University, Cheonan 31228, Republic of Korea
2. Department of Psychology, Chung-Ang University, Seoul 06974, Republic of Korea
3. Department of Psychology and Psychotherapy, College of Health Science, Dankook University, Cheonan 31116, Republic of Korea
* Author to whom correspondence should be addressed.
Behav. Sci. 2025, 15(12), 1648; https://doi.org/10.3390/bs15121648
Submission received: 23 September 2025 / Revised: 22 November 2025 / Accepted: 27 November 2025 / Published: 30 November 2025

Abstract

This exploratory study investigated whether voice-derived acoustic features reflect depressive symptom severity and whether they carry preliminary predictive signal for distinguishing individuals with Major Depressive Disorder (MDD) from healthy controls (HC). Using the publicly available MODMA dataset (23 MDD; 29 HC), 6553 acoustic features were extracted with openSMILE. Spearman correlation and group-difference analyses identified several MFCC-derived spectral features as moderately and systematically associated with PHQ-9 scores, indicating their potential relevance as severity-linked acoustic markers. To complement these findings, a supplementary severity-based classification using a PHQ-9 ≥ 10 threshold showed that a logistic regression model trained on the top five correlated MFCC features achieved a cross-validated AUC of 0.78 (SD = 0.15), supporting their association with clinically defined symptom burden. Four machine learning pipelines were further evaluated for an exploratory MDD–HC classification task. Among them, the PCA + XGBoost model demonstrated the most stable generalization (test AUC = 0.60), although predictive performance remained limited within the constraints of the small and high-dimensional dataset. SHAP analysis highlighted MFCC-derived features as key contributors to model decisions, providing transparent interpretability. Overall, the study presents preliminary evidence linking acoustic characteristics to depressive symptoms and outlines a reproducible analytical workflow, while underscoring the need for substantially larger and more diverse datasets to establish clinically meaningful predictive validity.

1. Introduction

Major Depressive Disorder (MDD) is one of the most prevalent mental health conditions worldwide, affecting approximately 280 million individuals and imposing substantial socioeconomic burdens, including impaired functioning, reduced quality of life, and increased suicide risk (World Health Organization, n.d.). The urgency for early detection has intensified since the COVID-19 pandemic, which led to a more than 25% increase in global depression and anxiety prevalence (World Health Organization, 2022). Traditional diagnostic assessments such as the Patient Health Questionnaire-9 (PHQ-9) are clinically validated and widely used, yet they rely on subjective self-report and remain limited by recall bias and accessibility constraints (Kroenke et al., 2001). These limitations have stimulated growing interest in speech-derived biomarkers as objective, non-invasive, and scalable indicators of mental health status (Cummins et al., 2015; Hönig et al., 2014; Dogrucu et al., 2020; I.-M. Chen et al., 2022).
Human speech conveys rich nonverbal information reflecting affective, cognitive, and physiological states. Beyond clinical observation, speech has also been widely used in computational mental health and affective computing research (Schuller et al., 2009). Numerous studies have documented acoustic alterations in individuals with depression, including increased jitter and shimmer (Hönig et al., 2014), disrupted prosody (Dumpala et al., 2023), and systematic spectral differences between patients and healthy controls (Menne et al., 2024). Recent findings further indicate that automated speech analysis can detect subtle depressive markers even in non-clinical populations (König et al., 2022). Mobile and digital health research has further demonstrated the feasibility of remote monitoring, linking speech diaries, sentiment, and vocal behavior to depressive symptom severity (Ben-Zeev et al., 2015; Nickels et al., 2021).
Recent work using the publicly available MODMA dataset has advanced speech-based depression screening, employing traditional machine-learning models and deep neural networks (X. Chen & Pan, 2021; Wasserzug et al., 2022; Gheorghe et al., 2023). While these studies achieved promising binary classification performance, they largely focused on diagnosis-based discrimination (MDD vs. HC) and provided limited examination of continuous, clinically validated severity scales such as the PHQ-9. Additionally, many existing approaches rely on complex “black-box” architectures that hinder interpretability, a major barrier to clinical adoption (Al Hanai et al., 2018; Dogrucu et al., 2020).
To address these gaps, the present study pursued two complementary objectives. First, we examined correlations and group-level associations between acoustic features and PHQ-9 scores to identify clinically grounded acoustic markers of depressive symptomatology (Objective 1). Second, we evaluated an exploratory binary screening task distinguishing individuals with MDD from healthy controls using the same acoustic feature set (Objective 2), incorporating three feature-selection strategies—Recursive Feature Elimination (RFE), Principal Component Analysis (PCA), and Random Forest–based selection—along with Shapley Additive Explanations (SHAP) for interpretability.
To strengthen the analytical execution of Objective 1 and more directly link acoustic markers to clinically meaningful symptom burden, we additionally performed a supplementary severity-based classification analysis using a PHQ-9 ≥ 10 threshold. This analysis tested whether the acoustic features most strongly correlated with PHQ-9 scores also contain predictive information for distinguishing moderate or greater depressive symptoms from mild or minimal symptoms. Given the exploratory nature and limited sample size of the MODMA dataset (n = 52), the present study did not attempt regression or fine-grained ordinal modeling of continuous PHQ-9 scores. Instead, PHQ-9 served as the basis for severity-focused correlation and group analyses (Objective 1), and the binary diagnostic label (MDD vs. HC) was reserved for the screening-oriented predictive modeling in Objective 2.

2. Methods

2.1. Participants and Dataset

This study employed the publicly available Multi-modal Open Dataset for Mental-disorder Analysis (MODMA), which was specifically designed for research on voice-based mental health prediction (Zhu et al., 2022). The dataset includes voice recordings and psychological assessment data from 52 adult participants recruited at the Second Hospital of Lanzhou University in China. Among these participants, 23 were clinically diagnosed with Major Depressive Disorder (MDD), while the remaining 29 individuals with no history of psychiatric illness served as healthy controls (HC).
Table 1 summarizes the demographic and clinical characteristics of the participants. No significant differences were observed between groups in age or gender distribution. However, the HC group had significantly more years of education than the MDD group (p < 0.001). As expected, PHQ-9 scores were substantially higher in the MDD group than in the HC group (p < 0.001).
Each participant provided speech samples under four standardized conditions, which consisted of a self-introduction, the reading of predefined words, the description of a picture, and a short segment of free speech. All recordings were collected in a controlled acoustic environment and stored in high-quality .wav format. Alongside the speech data, the dataset also contains demographic information such as age, gender, and education level, as well as responses to validated psychological assessments.
Although MODMA is publicly accessible, the use of human subject data in this research was reviewed and approved by the Institutional Review Board (IRB) of Dankook University (Approval No. DKU-2024-11-030-002), ensuring that the study adhered to ethical standards.

2.2. Psychological Scales and Target Definition

The Patient Health Questionnaire-9 (PHQ-9) was employed to evaluate the severity of depressive symptoms. The PHQ-9 is a widely validated and clinically accepted self-report instrument that is routinely used for both screening and monitoring depression (Kroenke et al., 2001). In this study, PHQ-9 scores served two complementary purposes. First, they provided a standardized and clinically reliable measure of depression severity, enabling the assessment of symptom burden across participants. Second, they were used as a reference for correlation analyses with acoustic features, thereby linking vocal markers to a validated psychological scale.
In line with the dual aims of the study, PHQ-9 scores served as the reference for symptom severity in correlation and group-level analyses, whereas the binary diagnosis label (MDD vs. HC) was used as the supervised target for exploratory screening models. Because of the small sample size and single-site design (n = 52), we did not attempt to construct a regression or ordinal model predicting PHQ-9 scores; such severity modeling is reserved for future work with larger and more heterogeneous datasets.

2.3. Acoustic Feature Extraction and Preprocessing

Acoustic features were extracted from the voice recordings using openSMILE (Open Speech and Music Interpretation by Large-space Extraction), a widely adopted toolkit for speech and paralinguistic analysis (Eyben et al., 2010). The emo_large.conf configuration, which has been validated in emotion and clinical voice research, was applied to comprehensively capture relevant parameters. The extracted feature set included Mel-Frequency Cepstral Coefficients (MFCCs), pitch, jitter, shimmer, and additional indicators such as formants (vocal resonance), harmonicity (voice quality), and log-energy (vocal intensity).
For each participant, the extracted features from multiple utterances were averaged into a single representative vector, thereby reducing intra-individual variability and ensuring consistency in acoustic characterization. Prior to analysis, samples containing missing values were excluded, and demographic variables were encoded (e.g., gender coded as 0 = male and 1 = female). All acoustic features were then standardized to a common scale to facilitate comparability across participants. The resulting processed dataset was subsequently used for correlation analysis with PHQ-9 scores as well as for training the predictive models.
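The per-participant averaging and standardization described above can be sketched as follows; the `participant_id` column name and the wide one-column-per-feature layout are illustrative assumptions, not the MODMA release format.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler

def build_participant_matrix(features: pd.DataFrame) -> pd.DataFrame:
    """Average per-utterance openSMILE features into one vector per participant,
    drop participants with missing values, and standardize each feature."""
    per_person = features.groupby("participant_id").mean()
    per_person = per_person.dropna()  # exclude samples with missing values
    scaled = StandardScaler().fit_transform(per_person)
    return pd.DataFrame(scaled, index=per_person.index,
                        columns=per_person.columns)
```

The resulting matrix (one standardized row per participant) feeds both the correlation analyses and the predictive models.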

2.4. Correlation and Severity-Based Analyses

To examine the relationships between acoustic features and depressive symptom severity (Objective 1), we first conducted Spearman’s rank correlation analysis between all 6553 extracted acoustic features and PHQ-9 scores. Spearman’s ρ was selected as a robust non-parametric measure capable of capturing monotonic, potentially non-linear associations. Features with |ρ| ≥ 0.3 were considered to reflect at least moderate associations. To reduce false positives due to multiple comparisons, p-values were adjusted using the Benjamini–Hochberg False Discovery Rate (FDR) procedure. Based on this analysis, the five acoustic features showing the strongest correlations with PHQ-9 scores were selected for subsequent interpretation. In addition, independent two-sample t-tests and Cohen’s d effect sizes were computed to assess corresponding group-level differences between MDD and HC participants.
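A minimal sketch of this screening step, with the Benjamini–Hochberg step-up procedure implemented directly; the cutoffs |ρ| ≥ 0.3 and q = 0.05 follow the text, while the function name and return format are ours.

```python
import numpy as np
from scipy.stats import spearmanr

def screen_features(X, y, rho_min=0.3, q=0.05):
    """Spearman correlation of each column of X (n_samples, n_features) with
    PHQ-9 scores y, with Benjamini-Hochberg FDR control at level q."""
    n_feat = X.shape[1]
    rho = np.empty(n_feat)
    pval = np.empty(n_feat)
    for j in range(n_feat):
        rho[j], pval[j] = spearmanr(X[:, j], y)

    # Benjamini-Hochberg step-up: find largest k with p_(k) <= q * k / m
    m = n_feat
    order = np.argsort(pval)
    thresh = q * (np.arange(1, m + 1) / m)
    passed = pval[order] <= thresh
    k = np.max(np.nonzero(passed)[0]) + 1 if passed.any() else 0
    significant = np.zeros(m, dtype=bool)
    significant[order[:k]] = True

    keep = significant & (np.abs(rho) >= rho_min)
    return rho, pval, keep
```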
To more directly connect acoustic markers with clinically defined symptom burden and to complement the continuous correlation analysis, we also performed a supplementary severity-based classification using a PHQ-9 threshold of 10 (indicative of moderate-or-greater depressive symptoms). For this analysis, we trained a logistic regression classifier with standardization (StandardScaler + Logistic Regression) using only the five MFCC-derived features that demonstrated the strongest correlations with PHQ-9 scores. Model performance was evaluated with stratified 5-fold cross-validation repeated three times (15 folds total), yielding cross-validated estimates of AUC (primary metric), along with accuracy, precision, and recall. This severity-based analysis provides an additional perspective on whether acoustic features associated with continuous PHQ-9 severity also exhibit discriminative value when PHQ-9 scores are coarsely categorized, thereby strengthening the analytical execution of Objective 1. Detailed numerical results for this supplementary analysis are reported in the Results section.
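The severity-based analysis corresponds to the following scikit-learn sketch, assuming `X` holds the five selected MFCC features and `y_binary` encodes PHQ-9 ≥ 10; hyperparameters beyond those stated in the text are library defaults.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import RepeatedStratifiedKFold, cross_validate
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

def severity_cv(X, y_binary, seed=42):
    """StandardScaler + Logistic Regression under stratified 5-fold CV
    repeated 3 times (15 folds), reporting mean and SD per metric."""
    model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
    cv = RepeatedStratifiedKFold(n_splits=5, n_repeats=3, random_state=seed)
    scores = cross_validate(model, X, y_binary, cv=cv,
                            scoring=["roc_auc", "accuracy", "precision", "recall"])
    return {m: (scores[f"test_{m}"].mean(), scores[f"test_{m}"].std())
            for m in ["roc_auc", "accuracy", "precision", "recall"]}
```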

2.5. Model Development and Evaluation

To construct predictive models for depression classification, four machine-learning pipelines were developed, each incorporating a distinct feature-selection or dimensionality-reduction strategy. The first pipeline combined Recursive Feature Elimination (RFE) with Extreme Gradient Boosting (XGBoost). RFE iteratively removed less informative features, while XGBoost—a gradient boosting method effective for structured high-dimensional data—captured potentially non-linear relationships. The second pipeline applied Principal Component Analysis (PCA) followed by XGBoost classification. PCA was selected to reduce the 6553-dimensional feature space into orthogonal components, thereby mitigating multicollinearity, suppressing high-variance noise, and decreasing the risk of overfitting. The third pipeline employed Random Forest (RF) to estimate feature importance and used the top-ranked features to train an XGBoost classifier, leveraging the ensemble model’s robustness to feature redundancy.
For comparison, a baseline Logistic Regression model was constructed using all 6553 acoustic features without feature selection. Logistic Regression, with strong L2 regularization and a linear decision boundary, served as a stability-focused benchmark in the small-sample, high-dimensional setting. All models were trained and validated using stratified 5-fold cross-validation repeated three times to reduce variance associated with random data partitions. Prior to training, all features were standardized using StandardScaler. Because class imbalance can exacerbate variance and bias in small datasets, the Synthetic Minority Oversampling Technique (SMOTE) was applied conservatively, with oversampling ratios restricted to avoid generating unrealistic synthetic patterns while still mitigating extreme imbalance.
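The structure of the PCA-based pipeline can be sketched as follows. As an assumption for illustration, `GradientBoostingClassifier` stands in for `xgboost.XGBClassifier` (which slots into the same position), the SMOTE step is omitted because it requires the imbalanced-learn package, and the 0.85 explained-variance threshold is illustrative.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

def pca_boosting_pipeline(variance=0.85, seed=42):
    """Scale features, project onto principal components retaining `variance`
    of total variance, then fit a gradient-boosted tree classifier."""
    return make_pipeline(
        StandardScaler(),
        PCA(n_components=variance),  # float selects components by variance kept
        GradientBoostingClassifier(random_state=seed),
    )
```

The RFE- and RF-based pipelines differ only in replacing the PCA step with a feature-selection step.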
Model hyperparameters were optimized using grid search with cross-validation, and final performance was evaluated on an independent test set. Evaluation metrics included the Area Under the Receiver Operating Characteristic Curve (AUC-ROC) as the primary metric, along with accuracy, precision, recall, F1 score, and confusion matrices. This multi-metric evaluation provided a comprehensive assessment of predictive behavior under conditions prone to overfitting. The supervised task in this study was MDD versus HC classification, representing an exploratory screening scenario rather than direct severity prediction. This design choice reflects both the study’s exploratory aims and the limited statistical power available for modeling PHQ-9 scores as a continuous or ordinal variable.
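Hyperparameter tuning and held-out evaluation as described can be sketched as follows; the parameter grid, test-split fraction, and stand-in classifier are illustrative assumptions rather than the study's exact settings.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import confusion_matrix, roc_auc_score
from sklearn.model_selection import (GridSearchCV, RepeatedStratifiedKFold,
                                     train_test_split)
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

def tune_and_evaluate(X, y, seed=42):
    """Grid-search hyperparameters under repeated stratified CV (AUC as the
    primary metric), then evaluate on an independent held-out split."""
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=seed)
    pipe = Pipeline([
        ("scale", StandardScaler()),
        ("pca", PCA(n_components=0.85)),
        ("clf", GradientBoostingClassifier(random_state=seed)),
    ])
    grid = {"clf__n_estimators": [50, 100], "clf__max_depth": [2, 3]}
    cv = RepeatedStratifiedKFold(n_splits=5, n_repeats=3, random_state=seed)
    search = GridSearchCV(pipe, grid, scoring="roc_auc", cv=cv)
    search.fit(X_tr, y_tr)
    proba = search.predict_proba(X_te)[:, 1]
    return {
        "best_params": search.best_params_,
        "test_auc": roc_auc_score(y_te, proba),
        "confusion": confusion_matrix(y_te, search.predict(X_te)),
    }
```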

2.6. Model Interpretation and Feature Contribution Analysis

To enhance transparency of the binary classification framework, Shapley Additive Explanations (SHAP) were applied to the best-performing model, the PCA + XGBoost pipeline. SHAP is a model-agnostic interpretability method grounded in cooperative game theory that quantifies the marginal contribution of each feature to the probability of predicting Major Depressive Disorder (MDD = 1) versus healthy control (HC = 0). Importantly, SHAP does not estimate or model continuous depression severity (e.g., PHQ-9 scores); rather, it explains how individual acoustic features influence the model’s binary decision boundary.
At the global level, SHAP beeswarm plots were generated to visualize the distribution, direction, and magnitude of feature contributions across participants. At the local level, SHAP force plots illustrated how specific acoustic feature values supported or opposed an MDD prediction for an individual participant, providing case-level transparency into the model’s decision process. Because SHAP and the PHQ-9 correlation analyses address distinct aims, namely binary discrimination (Objective 2) and severity association (Objective 1), any overlap between SHAP-prominent features and severity-correlated features is examined in Section 3.4 rather than interpreted here as evidence that the classifier models severity directly.
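SHAP's attribution principle can be illustrated with an exact, brute-force Shapley computation on a toy model (the shap library instead uses efficient approximations such as TreeSHAP). Here "absent" features are mean-imputed from a background sample, one common choice of value function; all names are illustrative.

```python
import numpy as np
from itertools import combinations
from math import factorial

def shapley_values(predict, x, background):
    """Exact Shapley attributions for one prediction.

    predict: f(X) -> (n,) model outputs; x: (d,) instance to explain;
    background: (m, d) reference sample. A coalition S uses x's values on S
    and background means elsewhere. Exponential in d: toy use only.
    """
    d = x.shape[0]
    base = background.mean(axis=0)

    def value(subset):
        z = base.copy()
        z[list(subset)] = x[list(subset)]
        return predict(z[None, :])[0]

    phi = np.zeros(d)
    for i in range(d):
        others = [j for j in range(d) if j != i]
        for k in range(d):  # coalition sizes 0..d-1 excluding feature i
            for S in combinations(others, k):
                w = factorial(k) * factorial(d - k - 1) / factorial(d)
                phi[i] += w * (value(S + (i,)) - value(S))
    return phi
```

For a linear model this reduces to φ_i = w_i (x_i − x̄_i), and the attributions sum to the difference between the explained prediction and the baseline prediction (the efficiency property that SHAP preserves).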

3. Results

3.1. Associations with PHQ-9 Severity

Spearman’s rank correlation analysis identified several acoustic features—particularly those derived from the sixth Mel-Frequency Cepstral Coefficient (MFCC)—as strongly associated with PHQ-9 depression severity. Among the 6553 extracted features, mfcc_sma[6]_linregc2 (ρ = 0.56, FDR = 0.001), mfcc_sma[6]_amean (ρ = 0.55, FDR = 0.001), and mfcc_sma[6]_quartile2 (ρ = 0.55, FDR = 0.001) exhibited the most robust correlations with PHQ-9 scores. Independent two-sample t-tests further revealed significant group-level differences between MDD and HC participants on these MFCC-related features, with large effect sizes (e.g., mfcc_sma[6]_amean: t = −4.16, p < 0.001, Cohen’s d = −1.18). These findings suggest that higher depressive symptom severity is systematically reflected in increased spectral variability and reduced vocal stability.
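The reported group statistics follow the standard formulas; a minimal sketch for a single feature, using a pooled-standard-deviation Cohen's d, is:

```python
import numpy as np
from scipy.stats import ttest_ind

def group_difference(mdd, hc):
    """Independent two-sample t-test and Cohen's d (pooled SD) for one feature."""
    t, p = ttest_ind(mdd, hc)
    n1, n2 = len(mdd), len(hc)
    pooled_sd = np.sqrt(((n1 - 1) * mdd.var(ddof=1) + (n2 - 1) * hc.var(ddof=1))
                        / (n1 + n2 - 2))
    d = (mdd.mean() - hc.mean()) / pooled_sd
    return t, p, d
```

A negative d (as for mfcc_sma[6]_amean above) indicates lower feature values in the MDD group.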
Taken together, the correlation and group-difference analyses consistently highlight MFCC-derived spectral features—especially those reflecting spectral slope and average energy distribution—as meaningful acoustic markers of PHQ-9 severity. These results directly address Objective 1 by demonstrating systematic and clinically interpretable associations between acoustic patterns and validated depression severity scores. Table 2 summarizes the top five PHQ-9–correlated acoustic features.
To further strengthen Objective 1 and evaluate whether severity-linked acoustic markers also support coarse symptom categorization, an additional severity-based classification analysis was conducted using a PHQ-9 threshold of 10 (moderate-or-greater symptoms). A logistic regression classifier trained on the five PHQ-9–correlated MFCC features achieved a cross-validated AUC of 0.78 (SD = 0.15), with an accuracy of 0.73 (SD = 0.16), precision of 0.76 (SD = 0.23), and recall of 0.61 (SD = 0.20) across 15 folds. These findings indicate that the same acoustic features associated with continuous PHQ-9 severity also provide discriminative value for distinguishing moderate-or-greater symptom burden from mild or minimal symptoms, reinforcing their clinical relevance. Table 3 summarizes the severity-based classification results.

3.2. Feature Selection Results

Across all three feature-selection pipelines—RFE + XGBoost, PCA + XGBoost, and RF + XGBoost—MFCC-derived acoustic features consistently emerged as the most informative predictors for distinguishing individuals with MDD from healthy controls. In particular, variables such as mfcc_sma[6]_linregc2 and mfcc_sma[6]_amean were repeatedly ranked among the highest contributors to the binary classification task. These features also appeared among the strongest correlates of PHQ-9 severity in Section 3.1, reflecting a degree of overlap between severity-related acoustic patterns and those used in the diagnostic classification model. However, these overlaps do not indicate that the classifier models PHQ-9 severity; rather, they reflect methodological consistency across distinct analytic goals—severity association (Objective 1) and diagnostic discrimination (Objective 2).
Each feature-selection method contributed complementary strengths. Recursive Feature Elimination (RFE) iteratively removed less informative predictors, retaining features with the greatest utility for the MDD-versus-HC decision boundary. Principal Component Analysis (PCA) reduced the 6553-dimensional feature space into uncorrelated components capturing more than 85% of the total variance; several high-loading components were driven by MFCC-based spectral characteristics indicative of vocal instability. Random Forest–based selection further reinforced these findings by assigning high importance scores to MFCC-derived measures within an ensemble-driven framework that is robust to noise and feature redundancy.
The convergence of results across these independent selection strategies underscores the robustness of MFCC-based spectral characteristics—particularly those capturing spectral slope, variability, and average energy distribution—as discriminative cues for the binary diagnostic task. Importantly, these features were selected to optimize binary MDD-versus-HC classification performance (Objective 2), not to model continuous PHQ-9 severity. Nevertheless, their overlap with severity-correlated features identified in Section 3.1 enhances the interpretability and potential clinical relevance of the predictive framework without conflating diagnostic discrimination with severity estimation.

3.3. Model Performance and Comparative Evaluation

In line with Objective 2, which aimed to explore the feasibility of voice-based binary screening of Major Depressive Disorder (MDD), four predictive pipelines—RFE + XGBoost, PCA + XGBoost, RF + XGBoost, and Logistic Regression—were evaluated using repeated stratified cross-validation and an independent test set. The results are summarized in Table 4, and the ROC curves for each model are presented in Figure 1.
During cross-validation, both RFE + XGBoost and RF + XGBoost achieved relatively high AUC values (0.76 each), suggesting that these pipelines captured complex feature interactions within the training data. However, their test-set AUCs dropped sharply (0.45 and 0.41), indicating substantial overfitting. This pattern likely reflects structural properties of the dataset: an extremely high-dimensional acoustic feature space (6553 features) combined with a limited number of participants (n = 52) increases the risk that recursive elimination procedures or tree-based importance measures identify dataset-specific variance rather than generalizable acoustic markers. Such methods tend to overweight high-variance features and are particularly vulnerable to instability under small-sample, high-dimensional conditions. Logistic Regression, which used all acoustic features without feature selection, achieved modest but more stable performance (test AUC = 0.56). Although its linear decision boundary limits its capacity to model complex interactions, its relative stability underscores the importance of model simplicity and stronger regularization when sample size is constrained.
Among all evaluated models, PCA + XGBoost achieved the most balanced performance, with a cross-validated AUC of 0.75 and the highest test-set AUC (0.60). PCA’s dimensionality reduction mitigated overfitting by transforming correlated acoustic features into orthogonal components, reducing redundancy and compressing the feature space while preserving informative variance. This structural advantage likely enabled the model to generalize more effectively to unseen data. Although the performance gains over Logistic Regression were incremental, PCA + XGBoost demonstrated the strongest generalization across pipelines, highlighting the importance of variance-preserving dimensionality reduction in limited-sample contexts.
Taken together, these findings illustrate a clear trade-off between model complexity and generalization. High-capacity pipelines (RFE- and RF-based) performed well on training folds but failed to generalize due to variance inflation and limited sample support, whereas PCA-based dimensionality reduction yielded more robust test performance. Based on this balance of predictive depth and generalization, the PCA + XGBoost pipeline was selected for subsequent SHAP-based interpretability analyses (Section 3.4). Future work incorporating larger sample sizes, stronger regularization, or representation learning techniques may further mitigate overfitting and improve the reliability of voice-based depression screening models.

3.4. Model Interpretation with SHAP

To improve interpretability of the binary classification framework, Shapley Additive Explanations (SHAP) were applied to the best-performing pipeline (Lundberg & Lee, 2017), PCA + XGBoost. SHAP quantifies the marginal contribution of each acoustic feature to the probability of predicting Major Depressive Disorder (MDD = 1) versus healthy control (HC = 0), providing both global and local explanations of the model’s decision process.
At the global level, SHAP beeswarm plots (Figure 2) showed that MFCC-derived features—particularly those from the sixth coefficient (e.g., mfcc_sma[6]_linregc2, mfcc_sma[6]_amean)—exerted the strongest influence on increasing the model’s predicted probability of MDD. Higher or more unstable spectral values pushed the model output toward the MDD class, whereas more stable patterns supported HC predictions. Importantly, SHAP reflects contributions to the binary discrimination task and does not directly estimate PHQ-9 severity.
At the local level, SHAP force plots demonstrated how participant-specific acoustic values shaped classification outcomes. For example, increased MFCC slope or greater spectral variability contributed positively to MDD predictions, whereas lower variability shifted predictions toward HC. These individualized explanations provide transparent insight into how the classifier arrives at each decision.
It is analytically important to distinguish the roles of SHAP and the PHQ-9 correlation analysis: the latter examines associations with continuous depression severity (Objective 1), whereas SHAP interprets binary diagnostic decisions (Objective 2). Nevertheless, the fact that several MFCC-related features appear prominently in both analyses is not merely methodological coincidence. Rather, it reflects a structural property of the dataset: PHQ-9 scores strongly differentiate MDD from HC in this sample, and therefore acoustic features that encode PHQ-9-linked severity differences naturally become influential for the binary classifier as well. This alignment strengthens the clinical face validity of the model, indicating that its decision-making relies on acoustically measurable markers that correspond to clinically meaningful differences in depressive symptom burden.
Overall, while predictive performance remains modest, the SHAP analysis shows that the classifier depends on psychologically and physiologically plausible vocal cues—particularly severity-linked spectral characteristics—providing a transparent and clinically interpretable basis for future development of responsible voice-based screening models.

4. Discussion

This study examined associations between voice-derived acoustic features and PHQ-9 depression severity, and explored the feasibility of predicting Major Depressive Disorder (MDD) in an exploratory binary screening task. Among the predictive pipelines, the PCA + XGBoost model achieved the most reliable generalization, while SHAP analysis provided interpretable insights into how specific acoustic features contributed to classification. Based on these findings, the discussion highlights the significance of feature selection, the comparative performance of predictive models, their interpretability, and the clinical implications of this approach.

4.1. Significance of Feature Selection Techniques

Feature selection played a central role in clarifying which acoustic characteristics were most meaningfully associated with depressive symptoms and in improving the interpretability of the predictive framework. Consistent with Objective 1, the initial correlation analysis identified several MFCC-derived spectral features—particularly those reflecting spectral slope and average energy—as moderately associated with PHQ-9 severity, echoing prior evidence that depressive states manifest through reduced vocal stability and altered prosodic patterns (Hönig et al., 2014; Dumpala et al., 2023; Menne et al., 2024).
Building on these severity-focused findings, three complementary feature selection techniques—Recursive Feature Elimination (RFE), Principal Component Analysis (PCA), and Random Forest–based importance ranking—were applied to identify features informative for the exploratory binary screening task (Objective 2). Despite methodological differences, all approaches consistently highlighted MFCC-related dimensions, suggesting that these spectral patterns contain stable information relevant for distinguishing MDD from healthy controls.
The convergence between (a) PHQ-9 severity correlations and (b) feature importance across multiple selection pipelines provides a coherent picture of MFCC-derived measures as key acoustic markers. Crucially, these analyses address distinct goals: correlations identify severity-linked acoustic characteristics (Objective 1), while feature selection identifies features relevant for binary discrimination (Objective 2). Their overlap should therefore be interpreted as methodological consistency rather than evidence that the classifier directly models PHQ-9 severity.
In addition to the correlation and group-difference analyses, we conducted a supplementary severity-based classification using a PHQ-9 ≥ 10 threshold to strengthen the analytical execution of Objective 1. The logistic regression model trained on the five MFCC-derived features most strongly associated with PHQ-9 demonstrated moderate discriminative ability (AUC = 0.78), indicating that these severity-linked acoustic markers are not only statistically correlated with PHQ-9 scores but also informative for distinguishing clinically meaningful levels of symptom burden. This supplementary analysis provides convergent evidence supporting the relevance of the identified acoustic markers to depressive symptom severity.
Finally, although MFCC patterns were stable within the present sample, recent cross-linguistic studies indicate that feature robustness may vary across languages and recording contexts. Future research should therefore aim to identify feature subsets that generalize across diverse populations, a challenge highlighted in emerging cross-corpus and cross-lingual depression detection research (Qin et al., 2025; Cummins et al., 2023).

4.2. Comparative Performance of Predictive Models

The comparative evaluation of predictive models highlighted the substantial challenge of achieving generalizable performance when modeling high-dimensional acoustic data under severe sample-size constraints. Although RFE + XGBoost and RF + XGBoost achieved relatively high cross-validated AUC values (0.76 each), their test-set AUCs fell below chance level (0.45 and 0.41), indicating severe overfitting. This overfitting is not an algorithm-specific artifact but a structural consequence of the data configuration. With 6553 acoustic predictors and only 52 participants, the feature-to-sample ratio is extremely imbalanced, creating an ill-posed learning problem in which minor idiosyncrasies of the training data are easily mistaken for meaningful signal. Both RFE and RF-based importance ranking tend to amplify high-variance, sample-specific fluctuations when samples are small: RFE iteratively removes features based on unstable marginal importance estimates, while tree-based feature importance is known to vary substantially when the number of observations is insufficient to stabilize split-selection statistics. As a result, both pipelines captured training-set structure but failed to generalize, consistent with prior reports documenting the instability of feature-selection models in small clinical datasets.
Logistic Regression, despite its limited representational capacity, produced more stable performance (test AUC = 0.56). Its comparative robustness illustrates that strong regularization and linear decision boundaries can mitigate—but not eliminate—the risks of variance inflation when dimensionality far exceeds sample size. Among all pipelines, PCA + XGBoost yielded the most balanced results (test AUC = 0.60). PCA reduced the 6553-dimensional feature space into a set of orthogonal components that captured the dominant variance structure while filtering out high-frequency noise and collinearity. This bottleneck transformation effectively reduced the intrinsic dimensionality of the problem, enabling the downstream classifier to rely on stable, variance-preserving components rather than unstable raw features. The incremental yet consistent improvement of PCA + XGBoost suggests that dimensionality reduction is particularly critical when modeling clinical acoustic datasets with severe feature-to-sample imbalances.
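A minimal sketch of the PCA-bottleneck design follows. It uses synthetic data at a scaled-down dimensionality, and scikit-learn's GradientBoostingClassifier stands in for xgboost's XGBClassifier to keep the example dependency-light; the essential point is that the PCA transform is fitted inside the pipeline on training data only, so the bottleneck cannot leak test-set variance structure into the classifier.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.ensemble import GradientBoostingClassifier  # stand-in for xgboost.XGBClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(2)
n, p = 52, 500                      # scaled-down analogue of 52 participants x 6553 features
X = rng.normal(size=(n, p))
y = (X[:, 0] + rng.normal(scale=1.0, size=n) > 0).astype(int)

X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)

# PCA bottleneck fitted on training folds only, then a boosted-tree classifier
# operating on the orthogonal, variance-preserving components
pipe = make_pipeline(StandardScaler(),
                     PCA(n_components=10),
                     GradientBoostingClassifier(random_state=0))
pipe.fit(X_tr, y_tr)
auc = roc_auc_score(y_te, pipe.predict_proba(X_te)[:, 1])
print(f"Test AUC: {auc:.2f}")
```

Wrapping the scaler, PCA, and classifier in a single pipeline also makes the held-out evaluation honest by construction: every preprocessing step is refit from scratch whenever the pipeline is refit.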
These generalization challenges mirror a broader pattern observed in cross-cohort and cross-corpus speech depression research, in which models trained on small datasets frequently degrade when applied to new populations or recording environments (Qin et al., 2025; Cummins et al., 2023). Together, the results reinforce that, within the constraints of the present dataset, the predictive signal available from acoustic features is weak and easily overshadowed by noise. The relative stability of PCA + XGBoost should therefore be interpreted not as evidence of clinical feasibility, but as a methodological indicator that variance-preserving dimensionality reduction is essential for early-stage exploratory modeling. Clinically meaningful predictive validity will require substantially larger, multilingual, and heterogeneous datasets to stabilize feature learning and reduce susceptibility to overfitting.

4.3. Interpretability and Clinical Implications

A transparent and clinically interpretable prediction process is essential in mental health applications, where clinicians must understand how a model arrives at its decisions. To improve interpretability, SHAP analysis was applied to the best-performing pipeline (PCA + XGBoost). Importantly, SHAP describes the contribution of each acoustic feature to the binary prediction outcome—whether a participant is classified as MDD (1) or HC (0). SHAP does not estimate or model continuous PHQ-9 severity. At the global level, SHAP beeswarm plots indicated that MFCC-related features—especially those derived from the sixth coefficient—had the strongest influence on the model’s probability of predicting MDD. Higher or more variable values in these features shifted predictions toward the MDD class, whereas more stable spectral patterns favored HC predictions. At the local level, SHAP force plots illustrated how individual acoustic feature values supported or opposed an MDD prediction, providing case-level transparency into the model’s decision process.
Although the correlation analysis (Objective 1) and the SHAP analysis (Objective 2) address conceptually distinct questions—severity association versus binary discrimination—the appearance of the same MFCC-based features in both analyses is not a methodological coincidence. The PHQ-9 scores in this dataset sharply differentiate the MDD and HC groups, and therefore acoustic features that encode PHQ-9-linked severity differences naturally become influential for the binary classifier as well. This structural relationship reflects how depression severity aligns with diagnostic group differences in the MODMA dataset. Accordingly, the overlap between severity-correlated features and SHAP-ranked features should be interpreted as evidence of clinical face validity: the classifier’s decision-making relies on the same acoustic characteristics that track clinically meaningful differences in symptom burden. Rather than suggesting that the classifier models PHQ-9 directly, the SHAP findings demonstrate that the model depends on plausible severity-linked vocal cues—such as spectral slope and stability—that are consistent with established clinical voice research.
The clinical implications of these results remain preliminary. While the classifier’s reliance on severity-linked acoustic features enhances interpretability and clinical coherence, overall predictive performance was modest and insufficient for deployment. Interpretable machine-learning approaches may, however, provide value as adjunctive tools when combined with multimodal data or embedded within broader assessment pipelines. Consistent with recent discussions in clinical speech AI (Berisha & Liss, 2024), transparent and severity-aligned decision-making represents an important foundation for future development, even though substantial improvements in predictive accuracy and external validation are necessary before real-world application.

4.4. Limitations and Future Directions

This study has several important limitations that should be acknowledged to properly contextualize the findings. First, the relatively small sample size (n = 52) and single-site data collection substantially limit statistical power and external validity. These constraints are particularly problematic for high-dimensional acoustic modeling, where 6553 features vastly exceed the number of observations, producing an extreme feature-to-sample imbalance. Such configurations are structurally ill-posed: even minor idiosyncrasies of the training data can be mistaken for meaningful signal, making severe overfitting statistically inevitable. This aligns with cross-cohort findings showing degradation of speech-based depression models when applied across heterogeneous populations or acoustic environments (Di et al., 2021; Qin et al., 2025; Gonzalez-Machorro et al., 2025).
Second, the exclusive reliance on acoustic features limits the scope of depressive symptom characterization. Depression is a multimodal condition expressed not only through vocal acoustics but also linguistic content, facial movements, behavioral patterns, wearable sensor signals, and contextual factors. Future research should therefore integrate multimodal information—such as speech transcripts, facial expression analysis, mobile sensing, ecological momentary assessment, or clinical metadata—to construct richer and more ecologically valid digital phenotypes.
Third, although SHAP analysis improves interpretability, the identified relationships remain correlational. SHAP explains how acoustic features influence binary classification decisions in the PCA + XGBoost model but does not provide causal evidence linking these features to PHQ-9 severity or underlying psychopathology. Establishing mechanistic associations will require longitudinal study designs that track dynamic symptom changes across illness trajectories, treatment response, or relapse patterns.
Fourth, and most critically, the predictive performance of the binary classification models was modest (best test AUC = 0.60), reflecting the structural challenges of small-sample, high-dimensional modeling. Methods such as RFE and Random Forest–based feature importance are known to produce unstable feature rankings under limited sample sizes, amplifying high-variance, dataset-specific noise. Although PCA partially mitigated these issues by reducing dimensionality and removing collinearity, the remaining predictive signal remains weak. Addressing these structural causes of overfitting will require substantially larger datasets as well as modern representation-learning strategies—such as transfer learning, pre-trained speech embeddings (e.g., wav2vec 2.0), foundation speech models, self-supervised learning, or domain adaptation techniques—that are specifically designed to overcome distributional shifts and variance inflation in cross-corpus speech modeling (Gómez-Zaragozá et al., 2025; Baevski et al., 2020).
Lastly, because of the limited statistical power, the study did not attempt regression or ordinal modeling of PHQ-9 severity. Reliable modeling of continuous symptom distributions remains an important goal for future work but will require substantially larger and more diverse datasets.
In summary, this study provides an initial foundation for understanding how acoustic features relate to depressive symptoms and for exploring their potential in preliminary screening models. Future investigations using larger samples, multimodal data, rigorous external validation, and longitudinal designs will be essential for advancing acoustic markers toward clinically meaningful and generalizable digital biomarkers. These priorities align with recent systematic reviews emphasizing that data diversity and robust cross-cohort validation are prerequisites for reliable speech-based depression detection systems (Mao et al., 2023).

5. Conclusions

This exploratory study examined whether voice-derived acoustic features reflect depressive symptom severity and contain preliminary signals for distinguishing Major Depressive Disorder (MDD) from healthy controls (HC). MFCC-derived spectral features were consistently associated with PHQ-9 scores and showed discriminative value in a supplementary severity-based classification using a PHQ-9 ≥ 10 threshold, strengthening the analytical execution of Objective 1. Notably, the severity-linked MFCC features identified in correlation analyses were also among the most influential predictors in the binary classification model, as revealed by SHAP. This alignment between severity-related acoustic cues and model decision-making offers preliminary clinical face validity, even though the overall predictive performance remains insufficient for screening applications. Among the predictive pipelines, PCA + XGBoost achieved the most stable generalization, underscoring the methodological value of dimensionality reduction in small, high-dimensional datasets. Nevertheless, reliable predictive capability was not achieved, emphasizing the need for substantially larger, multilingual, and multi-site datasets—and for multimodal and longitudinal designs—to determine whether voice-derived acoustic markers can contribute meaningfully to clinically robust depression detection or monitoring systems.

Author Contributions

K.-H.S., J.S. and S.-M.B. designed the study. K.-H.S. analyzed the data, wrote the first draft of the manuscript, and edited the final version of the paper. S.-M.B. supervised the study and edited the final version of the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea grant number [NRF-2022S1A5A2A03050428].

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of Dankook University (protocol code 2024-11-030-002 and date of approval 19 December 2024).

Informed Consent Statement

Informed consent was obtained from all individual participants included in the study.

Data Availability Statement

MODMA-DATASET. Available online: https://www.kaggle.com/datasets/mimino12/modma-dataset/data (accessed on 12 March 2025).

Conflicts of Interest

The authors declared no potential conflicts of interest with respect to the research, authorship, or publication of this article.

References

  1. Al Hanai, T., Ghassemi, M., & Glass, J. (2018, September 2–6). Detecting depression with audio/text sequence modeling of interviews. Interspeech 2018 (pp. 1716–1720), Hyderabad, India.
  2. Baevski, A., Zhou, H., Mohamed, A., & Auli, M. (2020). wav2vec 2.0: A framework for self-supervised learning of speech representations. Advances in Neural Information Processing Systems, 33, 12449–12460.
  3. Ben-Zeev, D., Scherer, E., Wang, R., Xie, H., & Campbell, A. T. (2015). Next-generation psychiatric assessment: Using smartphone sensors to monitor behavior and mental health. Psychiatric Rehabilitation Journal, 38(3), 218–226.
  4. Berisha, V., & Liss, J. M. (2024). Responsible development of clinical speech AI: Bridging the gap between clinical research and technology. npj Digital Medicine, 7(1), 208.
  5. Chen, I.-M., Chen, Y.-Y., Liao, S.-C., & Lin, Y.-H. (2022). Development of digital biomarkers of mental illness via mobile apps for personalized treatment and diagnosis. Journal of Personalized Medicine, 12, 936.
  6. Chen, X., & Pan, Z. (2021). A convenient and low-cost model of depression screening and early warning based on voice data using for public mental health. International Journal of Environmental Research and Public Health, 18(12), 6441.
  7. Cummins, N., Dineley, J., Conde, P., Matcham, F., Siddi, S., Lamers, F., Carr, E., Lavelle, G., Leightley, D., White, K. M., Oetzmann, C., Campbell, E. L., Simblett, S., Bruce, S., Haro, J. M., Penninx, B. W. J. H., Ranjan, Y., Rashid, Z., Stewart, C., … Hotopf, M. (2023). Multilingual markers of depression in remotely collected speech samples: A preliminary analysis. Journal of Affective Disorders, 341, 128–136.
  8. Cummins, N., Scherer, S., Krajewski, J., Schnieder, S., Epps, J., & Quatieri, T. F. (2015). A review of depression and suicide risk assessment using speech analysis. Speech Communication, 71, 10–49.
  9. Di, Y., Wang, J., Li, W., & Zhu, T. (2021). Using i-vectors from voice features to identify major depressive disorder. Journal of Affective Disorders, 288, 161–166.
  10. Dogrucu, A., Perucic, A., Isaro, A., Ball, D., Toto, E., Rundensteiner, E. A., Agu, E., Davis-Martin, R., & Boudreaux, E. (2020). Moodable: On feasibility of instantaneous depression assessment using machine learning on voice samples with retrospectively harvested smartphone and social media data. Smart Health, 17, 100118.
  11. Dumpala, S. H., Dikaios, K., Rodriguez, S., Langley, R., Rempel, S., Uher, R., & Oore, S. (2023). Manifestation of depression in speech overlaps with characteristics used to represent and recognize speaker identity. Scientific Reports, 13, 11155.
  12. Eyben, F., Wöllmer, M., & Schuller, B. (2010, October 25–29). openSMILE—The Munich versatile and fast open-source audio feature extractor. 18th ACM International Conference on Multimedia (pp. 1459–1462), Florence, Italy.
  13. Gheorghe, M., Mihalache, S., & Burileanu, D. (2023, September 4–8). Using deep neural networks for detecting depression from speech. 2023 31st European Signal Processing Conference (EUSIPCO) (pp. 411–415), Helsinki, Finland.
  14. Gonzalez-Machorro, M., Reichel, U., Hecker, P., Hammer, H., Sagha, H., Eyben, F., Hoepner, R., & Schuller, B. W. (2025). Speech-based depressive mood detection in the presence of multiple sclerosis: A cross-corpus and cross-lingual study. arXiv, arXiv:2508.18092.
  15. Gómez-Zaragozá, L., Marín-Morales, J., Alcañiz, M., & Soleymani, M. (2025, August 17–21). Speech and text foundation models for depression detection: Cross-task and cross-language evaluation. Interspeech 2025 (pp. 5253–5257), Rotterdam, The Netherlands.
  16. Hönig, F., Batliner, A., Nöth, E., Schnieder, S., & Krajewski, J. (2014, September 14–18). Automatic modeling of depressed speech: Relevant features and relevance of gender. Interspeech 2014 (pp. 1248–1252), Singapore.
  17. König, A., Tröger, J., Mallick, E., Mina, M., Linz, N., Wagnon, C., Karbach, J., Kuhn, C., & Peter, J. (2022). Detecting subtle signs of depression with automated speech analysis in a non-clinical sample. BMC Psychiatry, 22, 830.
  18. Kroenke, K., Spitzer, R. L., & Williams, J. B. W. (2001). The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine, 16, 606–613.
  19. Lundberg, S. M., & Lee, S.-I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30, 4765–4774.
  20. Mao, K., Wu, Y., & Chen, J. (2023). A systematic review on automated clinical depression diagnosis. npj Mental Health Research, 2(1), 20.
  21. Menne, F., Dörr, F., Schräder, J., Tröger, J., Habel, U., König, A., & Wagels, L. (2024). The voice of depression: Speech features as biomarkers for major depressive disorder. BMC Psychiatry, 24, 794.
  22. Nickels, S., Edwards, M. D., Poole, S. F., Winter, D., Gronsbell, J. L., Rozenkrants, B., Miller, D. P., Fleck, M., McLean, A., Peterson, B., Chen, Y., Hwang, A., Rust-Smith, D., Brant, A., Campbell, A., Chen, C., Walter, C., Arean, P. A., Hsin, H., … Fromer, M. (2021). Toward a mobile platform for real-world digital measurement of depression: User-centered design, data quality, and behavioral and clinical modeling. JMIR Mental Health, 8(8), e27589.
  23. Qin, S., Zhang, Y., Ma, Y., Li, H., Li, X., Lian, B., Cai, W., Cui, J., & Zhao, X. (2025). A cross-linguistic depression detection method based on speech data. Journal of Affective Disorders, 390, 119739.
  24. Schuller, B., Steidl, S., & Batliner, A. (2009, September 6–10). The INTERSPEECH 2009 emotion challenge. Interspeech 2009 (pp. 312–315), Brighton, UK.
  25. Wasserzug, Y., Degani, Y., Bar-Shaked, M., Binyamin, M., Klein, A., Hershko, S., & Levkovitch, Y. (2022). Development and validation of a machine learning-based vocal predictive model for major depressive disorder. Journal of Affective Disorders, 325, 627–632.
  26. World Health Organization. (2022). COVID-19 pandemic triggers 25% increase in prevalence of anxiety and depression worldwide. Available online: https://www.who.int/news/item/02-03-2022-covid-19-pandemic-triggers-25-increase-in-prevalence-of-anxiety-and-depression-worldwide (accessed on 13 April 2025).
  27. World Health Organization. (n.d.). Depression. Available online: https://www.who.int/health-topics/depression (accessed on 13 April 2025).
  28. Zhu, T., Li, X., Zhou, X., Liu, Z., Li, X., & Hu, B. (2022). MODMA dataset: A multi-modal open dataset for mental-disorder analysis. Scientific Data, 9, 178.
Figure 1. ROC Curves of Predictive Models: (a) RFE + XGBoost (b) PCA + XGBoost (c) RF + XGBoost (d) Logistic Regression.
Figure 2. SHAP Beeswarm Plot for Feature Importance in PCA + XGBoost Model.
Table 1. Demographic and Clinical Characteristics of Participants.
Variable | MDD (n = 23) | HC (n = 29) | p-Value
Gender, n (%) | | |
   Male | 16 (69.6%) | 20 (69.0%) | 1.00
   Female | 7 (30.4%) | 9 (31.0%) |
Age (years) | 30.91 ± 9.74 | 31.55 ± 8.93 | 0.81
Education (years) | 12.87 ± 3.75 | 16.76 ± 2.63 | <0.001
PHQ-9 | 18.09 ± 4.44 | 2.52 ± 2.11 | <0.001
Table 2. Top 5 acoustic features most strongly correlated with PHQ-9 severity.
Feature | Description | Spearman's Rho | p-Value (Spearman) | FDR | t-Test | p-Value (t-Test) | Cohen's d
mfcc_sma[6]_linregc2 | Slope of MFCC (6th Coefficient) | 0.56 | p < 0.001 | 0.001 | −3.93 | p < 0.001 | −1.11
mfcc_sma[6]_amean | Mean of MFCC (6th Coefficient) | 0.55 | p < 0.001 | 0.001 | −4.16 | p < 0.001 | −1.18
mfcc_sma[6]_quartile2 | Second Quartile of MFCC (6th Coefficient) | 0.55 | p < 0.001 | 0.001 | −4.09 | p < 0.001 | −1.16
mfcc_sma[6]_peakMean | Peak Mean of MFCC (6th Coefficient) | 0.54 | p < 0.001 | 0.001 | −3.93 | p < 0.001 | −1.11
mfcc_sma[6]_quartile1 | First Quartile of MFCC (6th Coefficient) | 0.53 | p < 0.001 | 0.001 | −4.02 | p < 0.001 | −1.13
Table 3. Severity-based classification performance using PHQ-9 ≥ 10 (Logistic Regression with the top 5 correlated MFCC features).
Metric | Mean | SD | Notes
AUC | 0.78 | 0.15 | 15-fold CV
Accuracy | 0.73 | 0.16 | 15-fold CV
Precision | 0.76 | 0.23 | Positive = PHQ-9 ≥ 10
Recall | 0.61 | 0.20 | Positive = PHQ-9 ≥ 10
Table 4. Performance Metrics of the Evaluated Predictive Models.
Model | CV Best AUC | Test AUC | Accuracy | Precision | Recall | F1 Score | Confusion Matrix
RFE + XGBoost | 0.76 | 0.45 | 0.50 | 0.44 | 0.57 | 0.50 | [[4, 5], [3, 4]]
PCA + XGBoost | 0.75 | 0.60 | 0.56 | 0.50 | 0.57 | 0.53 | [[5, 4], [3, 4]]
RF + XGBoost | 0.76 | 0.41 | 0.44 | 0.40 | 0.57 | 0.47 | [[3, 6], [3, 4]]
Logistic Regression | 0.74 | 0.56 | 0.44 | 0.38 | 0.43 | 0.40 | [[4, 5], [4, 3]]

Share and Cite

MDPI and ACS Style

Seok, K.-H.; Shin, J.; Bae, S.-M. Exploring Acoustic Correlates of Depression and Preliminary Screening Models Using XGBoost and SHAP. Behav. Sci. 2025, 15, 1648. https://doi.org/10.3390/bs15121648
