Next Article in Journal
Research on the Internal Thermal Boundary Conditions of Concrete Closed Girder Cross-Sections under Historically Extreme Temperature Conditions
Previous Article in Journal
Aerodynamic Optimization of a Micro Quadrotor Aircraft with Different Rotor Spacings in Hover
Open AccessArticle

An Optimal Feature Parameter Set Based on Gated Recurrent Unit Recurrent Neural Networks for Speech Segment Detection

1
Computer Engineering, Siirt University, Siirt 56100, Turkey
2
Computer Engineering, Yildiz Technical University, Istanbul 34220, Turkey
*
Author to whom correspondence should be addressed.
Appl. Sci. 2020, 10(4), 1273; https://doi.org/10.3390/app10041273
Received: 5 January 2020 / Revised: 9 February 2020 / Accepted: 9 February 2020 / Published: 13 February 2020
(This article belongs to the Special Issue Intelligent Speech and Acoustic Signal Processing)
Speech segment detection based on gated recurrent unit (GRU) recurrent neural networks for the Kurdish language was investigated in the present study. The novelties of the current research are the utilization of a GRU in Kurdish speech segment detection, creation of a unique database from the Kurdish language, and optimization of processing parameters for Kurdish speech segmentation. This study is the first attempt to find the optimal feature parameters of the model and to form a large Kurdish vocabulary dataset for a speech segment detection based on consonant, vowel, and silence (C/V/S) discrimination. For this purpose, four window sizes and three window types with three hybrid feature vector techniques were used to describe the phoneme boundaries. Identification of the phoneme boundaries using a GRU recurrent neural network was performed with six different classification algorithms for the C/V/S discrimination. We have demonstrated that the GRU model has achieved outstanding speech segmentation performance for characterizing Kurdish acoustic signals. The experimental findings of the present study show the significance of the segment detection of speech signals by effectively utilizing hybrid features, window sizes, window types, and classification models for Kurdish speech. View Full-Text
Keywords: database; deep learning; consonant/vowel/silence; segmentation; speech segment detection database; deep learning; consonant/vowel/silence; segmentation; speech segment detection
Show Figures

Figure 1

MDPI and ACS Style

BATUR DİNLER, Ö.; AYDIN, N. An Optimal Feature Parameter Set Based on Gated Recurrent Unit Recurrent Neural Networks for Speech Segment Detection. Appl. Sci. 2020, 10, 1273.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop