MDPI Contact

MDPI AG
St. Alban-Anlage 66,
4052 Basel, Switzerland
Support contact
Tel. +41 61 683 77 34
Fax: +41 61 302 89 18

For more contact information, see here.

Advanced Search

You can use * to search for partial matches.

Search Results

1 article matched your search query. Search Parameters:
Keywords = Laws masks

Matches by word:

LAWS (405) , MASKS (69)

View options
order results:
result details:
results per page:
Articles per page View Sort by
Displaying article 1-50 on page 1 of 1.
Export citation of selected articles as:
Open AccessArticle Time-Frequency Feature Representation Using Multi-Resolution Texture Analysis and Acoustic Activity Detector for Real-Life Speech Emotion Recognition
Sensors 2015, 15(1), 1458-1478; doi:10.3390/s150101458
Received: 16 September 2014 / Accepted: 1 December 2014 / Published: 14 January 2015
Cited by 7 | Viewed by 1481 | PDF Full-text (1190 KB) | HTML Full-text | XML Full-text
Abstract
The classification of emotional speech is mostly considered in speech-related research on human-computer interaction (HCI). In this paper, the purpose is to present a novel feature extraction based on multi-resolutions texture image information (MRTII). The MRTII feature set is derived from multi-resolution texture
[...] Read more.
The classification of emotional speech is mostly considered in speech-related research on human-computer interaction (HCI). In this paper, the purpose is to present a novel feature extraction based on multi-resolutions texture image information (MRTII). The MRTII feature set is derived from multi-resolution texture analysis for characterization and classification of different emotions in a speech signal. The motivation is that we have to consider emotions have different intensity values in different frequency bands. In terms of human visual perceptual, the texture property on multi-resolution of emotional speech spectrogram should be a good feature set for emotion classification in speech. Furthermore, the multi-resolution analysis on texture can give a clearer discrimination between each emotion than uniform-resolution analysis on texture. In order to provide high accuracy of emotional discrimination especially in real-life, an acoustic activity detection (AAD) algorithm must be applied into the MRTII-based feature extraction. Considering the presence of many blended emotions in real life, in this paper make use of two corpora of naturally-occurring dialogs recorded in real-life call centers. Compared with the traditional Mel-scale Frequency Cepstral Coefficients (MFCC) and the state-of-the-art features, the MRTII features also can improve the correct classification rates of proposed systems among different language databases. Experimental results show that the proposed MRTII-based feature information inspired by human visual perception of the spectrogram image can provide significant classification for real-life emotional recognition in speech. Full article
(This article belongs to the Section Physical Sensors)

Years

Subjects

Refine Subjects

Journals

Refine Journals

Article Types

Refine Types

Countries

Refine Countries
Back to Top