Proceeding Paper

Coordination of Speech Recognition Devices in Intelligent Environments with Multiple Responsive Devices †

Department of Languages and Computer Systems, University of Granada, 18071 Granada, Spain
*
Authors to whom correspondence should be addressed.
Presented at the 13th International Conference on Ubiquitous Computing and Ambient Intelligence UCAmI 2019, Toledo, Spain, 2–5 December 2019.
Proceedings 2019, 31(1), 54; https://doi.org/10.3390/proceedings2019031054
Published: 20 November 2019

Abstract

Devices with spoken interfaces are enabling new interaction scenarios and new ways of interacting in ambient intelligence settings. Using several such devices in the same environment makes it possible to compare the inputs gathered by each of them and thus recognize and process user speech more accurately. However, combining multiple devices poses coordination challenges: when the same voice signal is processed by different speech processing units, the outputs may conflict, and the system must decide which source is the most reliable. This paper presents an approach to rank several sources of spoken input in multi-device environments so as to give preference to the input with the highest estimated quality. The voice signals received by the devices are assessed in terms of their calculated acoustic quality and the reliability of the speech recognition hypotheses produced. After this assessment, each input is assigned a unique score that allows the audio sources to be ranked and the best one to be selected for processing by the system. To validate this approach, we performed an evaluation using a corpus of 4608 audio recordings captured in a two-room intelligent environment with 24 microphones. The experimental results show that our ranking approach successfully orchestrates an increasing number of acoustic inputs and obtains better recognition rates than using a single input, in both clear and noisy settings.
Keywords: human–computer interaction; spoken interaction; speech recognition; ambient intelligence; coordination of devices
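
The ranking idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the scoring formula, so the weighted sum below, the `acoustic_weight` parameter, the `SpokenInput` type, and the assumption that both measures are already normalized to [0, 1] are all hypothetical choices made for illustration only.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class SpokenInput:
    """One device's view of the same utterance (hypothetical structure)."""
    device_id: str
    transcript: str          # recognition hypothesis produced for this input
    acoustic_quality: float  # assumed to be normalized to [0, 1]
    asr_confidence: float    # assumed to be normalized to [0, 1]


def score(inp: SpokenInput, acoustic_weight: float = 0.5) -> float:
    """Combine both measures into one score.

    Assumption: a simple weighted sum. The paper may combine them differently.
    """
    if not 0.0 <= acoustic_weight <= 1.0:
        raise ValueError("acoustic_weight must be in [0, 1]")
    return (acoustic_weight * inp.acoustic_quality
            + (1.0 - acoustic_weight) * inp.asr_confidence)


def rank_inputs(inputs: List[SpokenInput],
                acoustic_weight: float = 0.5) -> List[SpokenInput]:
    """Return the inputs ordered from highest to lowest score."""
    return sorted(inputs, key=lambda i: score(i, acoustic_weight), reverse=True)


def select_best(inputs: List[SpokenInput],
                acoustic_weight: float = 0.5) -> SpokenInput:
    """Pick the top-ranked input to be processed by the system."""
    if not inputs:
        raise ValueError("at least one input is required")
    return rank_inputs(inputs, acoustic_weight)[0]


if __name__ == "__main__":
    # Made-up values for three devices that heard the same command.
    candidates = [
        SpokenInput("kitchen", "turn on the light", 0.4, 0.9),
        SpokenInput("hall", "turn on the night", 0.8, 0.3),
        SpokenInput("living", "turn on the light", 0.9, 0.8),
    ]
    best = select_best(candidates)
    print(best.device_id, best.transcript)
```

In this sketch a clean signal with a doubtful hypothesis can rank below a noisier signal whose hypothesis is more reliable, which is the kind of trade-off a combined score is meant to capture. Ties are broken by input order here; a real system would need an explicit tie-breaking rule if scores must be unique.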

Share and Cite

MDPI and ACS Style

Benítez-Guijarro, A.; Callejas, Z.; Noguera, M.; Benghazi, K. Coordination of Speech Recognition Devices in Intelligent Environments with Multiple Responsive Devices. Proceedings 2019, 31, 54. https://doi.org/10.3390/proceedings2019031054

AMA Style

Benítez-Guijarro A, Callejas Z, Noguera M, Benghazi K. Coordination of Speech Recognition Devices in Intelligent Environments with Multiple Responsive Devices. Proceedings. 2019; 31(1):54. https://doi.org/10.3390/proceedings2019031054

Chicago/Turabian Style

Benítez-Guijarro, Antonio, Zoraida Callejas, Manuel Noguera, and Kawtar Benghazi. 2019. "Coordination of Speech Recognition Devices in Intelligent Environments with Multiple Responsive Devices" Proceedings 31, no. 1: 54. https://doi.org/10.3390/proceedings2019031054
