Next Article in Journal
New Approximation Methods Based on Fuzzy Transform for Solving SODEs: I
Previous Article in Journal
Solar Hybrid Micro Gas Turbine Based on Turbocharger
Previous Article in Special Issue
An Effect of White Space on Traditional Chinese Text-Reading on Smartphones
Article Menu

Export Article

Open AccessArticle
Appl. Syst. Innov. 2018, 1(3), 28; https://doi.org/10.3390/asi1030028

Employing Robust Principal Component Analysis for Noise-Robust Speech Feature Extraction in Automatic Speech Recognition with the Structure of a Deep Neural Network

Department of Electrical Engineering, National Chi Nan University, 545, Nantou County, Taiwan
*
Author to whom correspondence should be addressed.
Received: 6 April 2018 / Revised: 11 August 2018 / Accepted: 12 August 2018 / Published: 15 August 2018
(This article belongs to the Special Issue Selected papers from IEEE ICASI 2018)
Full-Text   |   PDF [3826 KB, uploaded 15 August 2018]   |  

Abstract

In recent decades, researchers have been focused on developing noise-robust methods in order to compensate for noise effects in automatic speech recognition (ASR) systems and enhance their performance. In this paper, we propose a feature-based noise-robust method that employs a novel data analysis technique—robust principal component analysis (RPCA). In the proposed scenario, RPCA is employed to process a noise-corrupted speech feature matrix, and the obtained sparse partition is shown to reveal speech-dominant characteristics. One apparent advantage of using RPCA for enhancing noise robustness is that no prior knowledge about the noise is required. The proposed RPCA-based method is evaluated with the Aurora-4 database and a task using a state-of-the-art deep neural network (DNN) architecture as the acoustic models. The evaluation results indicate that the newly proposed method can provide the original speech feature with significant recognition accuracy improvement, and can be cascaded with mean normalization (MN), mean and variance normalization (MVN), and relative spectral (RASTA)—three well-known and widely used feature robustness algorithms—to achieve better performance compared with the individual component method. View Full-Text
Keywords: robust principal component analysis; noise robustness; filter-bank features; mel-frequency cepstral coefficients; deep neural network robust principal component analysis; noise robustness; filter-bank features; mel-frequency cepstral coefficients; deep neural network
Figures

Figure 1a

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).
SciFeed

Share & Cite This Article

MDPI and ACS Style

Hung, J.-W.; Lin, J.-S.; Wu, P.-J. Employing Robust Principal Component Analysis for Noise-Robust Speech Feature Extraction in Automatic Speech Recognition with the Structure of a Deep Neural Network. Appl. Syst. Innov. 2018, 1, 28.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Appl. Syst. Innov. EISSN 2571-5577 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top