Next Article in Journal
Microscopic Pore–Throat Architecture and Fractal Heterogeneity in Tight Sandstone Reservoirs: Insights from the Denglouku Formation, Xujiaweizi Fault Depression
Previous Article in Journal
Interpretable Optimization: Why and How We Should Explain Optimization Models
Previous Article in Special Issue
Deep Learning Classification of Traffic-Related Tweets: An Advanced Framework Using Deep Learning for Contextual Understanding and Traffic-Related Short Text Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Review

Review of Automatic Estimation of Emotions in Speech

by
Douglas O’Shaughnessy
Telecommunication Department, INRS, University of Quebec, 800 de la Gauchetiere West, Montreal, QC H5A 1K6, Canada
Appl. Sci. 2025, 15(10), 5731; https://doi.org/10.3390/app15105731
Submission received: 23 March 2025 / Revised: 13 May 2025 / Accepted: 14 May 2025 / Published: 20 May 2025
(This article belongs to the Special Issue Speech Recognition and Natural Language Processing)

Abstract

Identification of emotions exhibited in utterances is useful for many applications, e.g., assisting with handling telephone calls or psychological diagnoses. This paper reviews methods to identify emotions from speech signals. We examine the information in speech that helps to estimate emotion, from points of view involving both production and perception. As machine approaches to recognize emotion in speech often have much in common with other speech tasks, such as automatic speaker verification and speech recognition, we compare such processes. Many methods of emotion recognition have been found in research on pattern recognition in other areas, e.g., image and text recognition, especially in recent methods for machine learning. We show that speech is very different compared to most other signals that can be recognized, and that emotion identification is different from other speech applications. This review is primarily aimed at non-experts (more algorithmic detail is present in the cited literature), but this presentation has much discussion for experts as well.
Keywords: emotion identification; neural networks; speech analysis; pattern recognition; machine learning emotion identification; neural networks; speech analysis; pattern recognition; machine learning

Share and Cite

MDPI and ACS Style

O’Shaughnessy, D. Review of Automatic Estimation of Emotions in Speech. Appl. Sci. 2025, 15, 5731. https://doi.org/10.3390/app15105731

AMA Style

O’Shaughnessy D. Review of Automatic Estimation of Emotions in Speech. Applied Sciences. 2025; 15(10):5731. https://doi.org/10.3390/app15105731

Chicago/Turabian Style

O’Shaughnessy, Douglas. 2025. "Review of Automatic Estimation of Emotions in Speech" Applied Sciences 15, no. 10: 5731. https://doi.org/10.3390/app15105731

APA Style

O’Shaughnessy, D. (2025). Review of Automatic Estimation of Emotions in Speech. Applied Sciences, 15(10), 5731. https://doi.org/10.3390/app15105731

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop