Next Article in Journal
Seamless Navigation, 3D Reconstruction, Thermographic and Semantic Mapping for Building Inspection
Previous Article in Journal
Robust Nonlinear Tracking Control with Exponential Convergence Using Contraction Metrics and Disturbance Estimation
Article

Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features

1
Department of Computer and Communication Engineering, Ming Chuan University, Taoyuan 333, Taiwan
2
Department of Computer Science and Information Engineering, National Taichung University of Science and Technology, Taichung City 404, Taiwan
*
Author to whom correspondence should be addressed.
Academic Editors: Anastasios Doulamis, Marcin Woźniak and Leon Rothkrantz
Sensors 2022, 22(13), 4744; https://doi.org/10.3390/s22134744
Received: 19 April 2022 / Revised: 16 June 2022 / Accepted: 20 June 2022 / Published: 23 June 2022
(This article belongs to the Section Intelligent Sensors)
In recent years, the use of Artificial Intelligence for emotion recognition has attracted much attention. The industrial applicability of emotion recognition is quite comprehensive and has good development potential. This research uses voice emotion recognition technology to apply it to Chinese speech emotion recognition. The main purpose of this research is to transform gradually popularized smart home voice assistants or AI system service robots from a touch-sensitive interface to a voice operation. This research proposed a specifically designed Deep Neural Network (DNN) model to develop a Chinese speech emotion recognition system. In this research, 29 acoustic characteristics in acoustic theory are used as the training attributes of the proposed model. This research also proposes a variety of audio adjustment methods to amplify datasets and enhance training accuracy, including waveform adjustment, pitch adjustment, and pre-emphasize. This study achieved an average emotion recognition accuracy of 88.9% in the CASIA Chinese sentiment corpus. The results show that the deep learning model and audio adjustment method proposed in this study can effectively identify the emotions of Chinese short sentences and can be applied to Chinese voice assistants or integrated with other dialogue applications. View Full-Text
Keywords: emotion recognition; deep neural network; acoustic features emotion recognition; deep neural network; acoustic features
Show Figures

Figure 1

MDPI and ACS Style

Lee, M.-C.; Yeh, S.-C.; Chang, J.-W.; Chen, Z.-Y. Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features. Sensors 2022, 22, 4744. https://doi.org/10.3390/s22134744

AMA Style

Lee M-C, Yeh S-C, Chang J-W, Chen Z-Y. Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features. Sensors. 2022; 22(13):4744. https://doi.org/10.3390/s22134744

Chicago/Turabian Style

Lee, Ming-Che, Sheng-Cheng Yeh, Jia-Wei Chang, and Zhen-Yi Chen. 2022. "Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features" Sensors 22, no. 13: 4744. https://doi.org/10.3390/s22134744

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop