Skip Content
You are currently on the new version of our website. Access the old version .

1,397 Results Found

  • Article
  • Open Access
1 Citations
5,827 Views
18 Pages

Design Space for Voice-Based Professional Reporting

  • Jaakko Hakulinen,
  • Tuuli Keskinen,
  • Markku Turunen and
  • Sanni Siltanen

Speech technology has matured so that voice-based reporting utilizing speech-to-text can be applied in various domains. Speech has two major benefits: it enables efficient reporting and speech input improves the quality of the reports since reporting...

  • Article
  • Open Access
1 Citations
3,320 Views
20 Pages

28 March 2025

Voice data contain a wealth of temporal and spectral information and can be a valuable resource for disease classification. However, traditional methods are often not effective in capturing the key features required for the classification of multiple...

  • Article
  • Open Access
13 Citations
5,990 Views
14 Pages

22 February 2021

With the development of artificial intelligence technology, voice-based intelligent systems (VISs), such as AI speakers and virtual assistants, are intervening in human life. VISs are emerging in a new way, called human–AI interaction, which is diffe...

  • Article
  • Open Access
17 Citations
4,996 Views
26 Pages

Toward an Automatic Quality Assessment of Voice-Based Telemedicine Consultations: A Deep Learning Approach

  • Maria Habib,
  • Mohammad Faris,
  • Raneem Qaddoura,
  • Manal Alomari,
  • Alaa Alomari and
  • Hossam Faris

10 May 2021

Maintaining a high quality of conversation between doctors and patients is essential in telehealth services, where efficient and competent communication is important to promote patient health. Assessing the quality of medical conversations is often h...

  • Article
  • Open Access
988 Views
21 Pages

17 February 2025

The number of new applications addressing human activities in social settings, like groups and organizations, is on the rise. Devising an effective data collection infrastructure is critical for such applications. This paper describes a computational...

  • Article
  • Open Access
7 Citations
6,074 Views
35 Pages

A Voice-Enabled ROS2 Framework for Human–Robot Collaborative Inspection

  • Apostolis Papavasileiou,
  • Stelios Nikoladakis,
  • Fotios Panagiotis Basamakis,
  • Sotiris Aivaliotis,
  • George Michalos and
  • Sotiris Makris

13 May 2024

Quality inspection plays a vital role in current manufacturing practice since the need for reliable and customized products is high on the agenda of most industries. Under this scope, solutions enhancing human–robot collaboration such as voice-...

  • Article
  • Open Access
500 Views
18 Pages

3 December 2025

Nowadays, classification of a person’s gender by analyzing characteristics of their voice is generally called voice-based identification. This paper presents an investigation on systematic research of metaheuristic optimization algorithms regar...

  • Article
  • Open Access
1,068 Views
18 Pages

Voice-Based Assessment of Extrapyramidal Symptoms Using Deep Learning

  • Erandhi M. Liyanage,
  • Kun-Chan Lan,
  • Quang Ha and
  • Sai Ho Ling

11 August 2025

Extrapyramidal symptoms encompass features of Parkinsonism, including bradykinesia, cogwheel rigidity, and resting tremors, which contribute to motor impairments hindering handwriting and speech. In this study, we analyzed voice data captured using a...

  • Review
  • Open Access
2 Citations
5,545 Views
9 Pages

As the global population ages, older adults face growing psychological challenges such as loneliness, cognitive decline, and loss of social roles. Meanwhile, artificial intelligence (AI) technologies, including chatbots and voice-based systems, offer...

  • Article
  • Open Access
1 Citations
3,792 Views
19 Pages

20 April 2019

Voice-based interfaces have become one of the most popular device capabilities, recently being regarded as one flagship user experience of smart consumer devices. However, the lack of common coordination mechanisms might often degrade the user experi...

  • Article
  • Open Access
5 Citations
7,662 Views
12 Pages

30 July 2021

Despite the well-known distracting effects, many drivers still engage in phone use, especially texting and especially among young drivers, with new emerging messaging modes. The present study aims to examine the effects of different answering modes o...

  • Article
  • Open Access
1 Citations
3,885 Views
23 Pages

Effective Acoustic Model-Based Beamforming Training for Static and Dynamic Hri Applications

  • Alejandro Luzanto,
  • Nicolás Bohmer,
  • Rodrigo Mahu,
  • Eduardo Alvarado,
  • Richard M. Stern and
  • Néstor Becerra Yoma

15 October 2024

Human–robot collaboration will play an important role in the fourth industrial revolution in applications related to hostile environments, mining, industry, forestry, education, natural disaster and defense. Effective collaboration requires rob...

  • Review
  • Open Access
2 Citations
9,924 Views
21 Pages

15 July 2025

This survey provides a comprehensive review of the integration of large language models (LLMs) into autonomous robotic systems, organized around four key pillars: locomotion, navigation, manipulation, and voice-based interaction. We examine how LLMs...

  • Article
  • Open Access
3 Citations
3,905 Views
23 Pages

6 August 2024

Online travel booking has become increasingly popular; however, most travel websites do not yet offer voice interaction. This study introduces VoiceBack, an artificial intelligence (AI)-driven voice-based feedback system conceptualized to support bot...

  • Article
  • Open Access
3 Citations
3,593 Views
24 Pages

The college entrance rate of the disabled is gradually increasing, and each university is trying to provide equal rights and opportunities for college students with disabilities. However, students with disabilities still have difficulty adapting to c...

  • Article
  • Open Access
10 Citations
7,265 Views
21 Pages

Voice-based digital assistants are growing in popularity and have been acknowledged as a crucial part of in-car interaction. Currently, academic attention is being paid to various voice assistant scenarios. However, sparse literature focuses on the a...

  • Article
  • Open Access
67 Citations
9,974 Views
17 Pages

A Near Real-Time Automatic Speaker Recognition Architecture for Voice-Based User Interface

  • Parashar Dhakal,
  • Praveen Damacharla,
  • Ahmad Y. Javaid and
  • Vijay Devabhaktuni

In this paper, we present a novel pipelined near real-time speaker recognition architecture that enhances the performance of speaker recognition by exploiting the advantages of hybrid feature extraction techniques that contain the features of Gabor F...

  • Review
  • Open Access
2,361 Views
29 Pages

Parkinson’s disease (PD) is a progressive neurodegenerative disorder characterized by motor and non-motor symptoms, among which vocal impairment is one of the earliest and most prevalent. In recent years, voice analysis supported by machine lea...

  • Article
  • Open Access
1 Citations
2,632 Views
19 Pages

This study presents a hybrid ensemble learning framework for the joint detection and motor severity prediction of Parkinson’s disease (PD) using biomedical voice features. The proposed architecture integrates a deep multimodal fusion model with...

  • Review
  • Open Access
22 Citations
5,969 Views
15 Pages

A Review of Voice-Based Pain Detection in Adults Using Artificial Intelligence

  • Sahar Borna,
  • Clifton R. Haider,
  • Karla C. Maita,
  • Ricardo A. Torres,
  • Francisco R. Avila,
  • John P. Garcia,
  • Gioacchino D. De Sario Velasquez,
  • Christopher J. McLeod,
  • Charles J. Bruce and
  • Antonio J. Forte
  • + 1 author

Pain is a complex and subjective experience, and traditional methods of pain assessment can be limited by factors such as self-report bias and observer variability. Voice is frequently used to evaluate pain, occasionally in conjunction with other beh...

  • Article
  • Open Access
2 Citations
7,082 Views
14 Pages

25 August 2016

Mobile devices can be exploited for enabling people to interact with Internet of Things (IoT) services. The MicroApp Generator [1] is a service-composition tool for supporting the generation of mobile applications directly on the mobile device. The u...

  • Article
  • Open Access
4 Citations
3,828 Views
12 Pages

Voice as a Mouse Click: Usability and Effectiveness of Simplified Hands-Free Gaze-Voice Selection

  • Darisy G. Zhao,
  • Nikita D. Karikov,
  • Eugeny V. Melnichuk,
  • Boris M. Velichkovsky and
  • Sergei L. Shishkin

9 December 2020

Voice- and gaze-based hands-free input are increasingly used in human-machine interaction. Attempts to combine them into a hybrid technology typically employ the voice channel as an information-rich channel. Voice seems to be “overqualified&rdq...

  • Article
  • Open Access
1,434 Views
15 Pages

6 January 2025

In this paper, we present the results of a study on the development of a system to support medical diagnoses based on voice-based medical interviews. The main objective is to develop a tool that improves the process of collecting information from pat...

  • Article
  • Open Access
180 Views
18 Pages

29 January 2026

Various sensors are increasingly being adopted to support intelligent healthcare systems, which address the growing problem of staff shortages in assisted-living communities. In this context, detecting and assessing pain remain critical yet challengi...

  • Article
  • Open Access
1,016 Views
19 Pages

End-Users’ Perspectives on Implementation Outcomes of Digital Voice Assistants Delivering a Home-Based Lifestyle Intervention in Older Obese Adults with Type 2 Diabetes Mellitus: A Qualitative Analysis

  • Costas Glavas,
  • Jiani Ma,
  • Surbhi Sood,
  • Elena S. George,
  • Robin M. Daly,
  • Eugene Gvozdenko,
  • Barbora de Courten,
  • David Scott and
  • Paul Jansons

Managing blood glucose levels and adhering to exercise is challenging for older adults with obesity and type 2 diabetes mellitus (T2DM). Digital voice assistants (DVAs) utilising conversation-based interactions and natural language may overcome barri...

  • Article
  • Open Access
2,859 Views
29 Pages

Voice-Based Early Diagnosis of Parkinson’s Disease Using Spectrogram Features and AI Models

  • Danish Quamar,
  • V. D. Ambeth Kumar,
  • Muhammad Rizwan,
  • Ovidiu Bagdasar and
  • Manuella Kadar

Parkinson’s disease (PD) is a progressive neurodegenerative disorder that significantly affects motor functions, including speech production. Voice analysis offers a less invasive, faster and more cost-effective approach for diagnosing and moni...

  • Review
  • Open Access
49 Citations
32,526 Views
13 Pages

28 February 2023

Voice conversion is a process where the essence of a speaker’s identity is seamlessly transferred to another speaker, all while preserving the content of their speech. This usage is accomplished using algorithms that blend speech processing tec...

  • Article
  • Open Access
2 Citations
2,552 Views
13 Pages

Development of an Industrial Safety System Based on Voice Assistant

  • Jaime Paúl Ayala Taco,
  • Oswaldo Alexander Ibarra Jácome,
  • Jaime Luciano Ayala Pico and
  • Brian Andrés López Castro

24 October 2023

Currently, there are limitations in the human–machine interfaces (HMIs) used in industry, either due to the characteristics of users’ cognitive abilities or interfaces, which hinder communication and interaction between humans and equipme...

  • Article
  • Open Access
12 Citations
4,477 Views
18 Pages

6 December 2022

An important issue in medical robotics is communication between physicians and robots. Speech-based communication is of particular advantage in robot-assisted surgery. It frees the surgeon’s hands; hence, he can focus on the principal tasks. Ma...

  • Article
  • Open Access
5 Citations
5,099 Views
16 Pages

2 October 2022

The development of artificial intelligence technology has made it possible to realize automatic evaluation systems for singing, and relevant research has been able to achieve accurate evaluations with respect to pitch and rhythm, but research on sing...

  • Article
  • Open Access
3 Citations
2,386 Views
10 Pages

18 February 2021

An underwater universal filtered multicarrier (UFMC)-based voice transmission scheme is proposed using a 512-point inverse discrete Fourier transform, utilizing 10 sub-bands, and that each had 20 subcarriers. In this proposed UFMC method, the adaptiv...

  • Article
  • Open Access
26 Citations
4,595 Views
16 Pages

10 March 2023

The Saarbruecken Voice Database (SVD) is a public database used by voice pathology detection systems. However, the distributions of the pathological and normal voice samples show a clear class imbalance. This study aims to develop a system for the cl...

  • Article
  • Open Access
1 Citations
2,569 Views
19 Pages

Feasibility of Big Data Analytics to Assess Personality Based on Voice Analysis

  • Víctor J. Rubio,
  • David Aguado,
  • Doroteo T. Toledano and
  • María Pilar Fernández-Gallego

7 November 2024

(1) Background: As far back as the 1930s, it was already thought that gestures, clothing, speech, posture, and gait could express an individual’s personality. Different research programs, some focused on linguistic cues, were launched, though r...

  • Article
  • Open Access
1 Citations
3,296 Views
33 Pages

2 June 2023

Over the past decades, many machine-learning- and artificial-intelligence-based technologies have been created to deduce biometric or bio-relevant parameters of speakers from their voice. These voice profiling technologies have targeted a wide range...

  • Proceeding Paper
  • Open Access
2 Citations
3,005 Views
7 Pages

Development of an Android-Based, Voice-Controlled Autonomous Robotic Vehicle

  • Abubakar Umar,
  • Mohammed Abdulkadir Giwa,
  • Abduljalal Yusha’u Kassim,
  • Muhammad Usman Ilyasu,
  • Ibrahim Abdulwahab,
  • Ezekiel Ehime Agbon and
  • Matthew T. Ogedengbe

15 November 2023

This research presents the development of an android-based, voice-controlled autonomous robotic vehicle. This article was developed in a way that the robotic vehicle was controlled using voice commands. An android application combined with an android...

  • Article
  • Open Access
40 Citations
10,436 Views
14 Pages

Wheelchair Neuro Fuzzy Control and Tracking System Based on Voice Recognition

  • Mokhles M. Abdulghani,
  • Kasim M. Al-Aubidy,
  • Mohammed M. Ali and
  • Qadri J. Hamarsheh

19 May 2020

Autonomous wheelchairs are important tools to enhance the mobility of people with disabilities. Advances in computer and wireless communication technologies have contributed to the provision of smart wheelchairs to suit the needs of the disabled pers...

  • Article
  • Open Access
4 Citations
3,411 Views
16 Pages

mmSafe: A Voice Security Verification System Based on Millimeter-Wave Radar

  • Zhanjun Hao,
  • Jianxiang Peng,
  • Xiaochao Dang,
  • Hao Yan and
  • Ruidong Wang

29 November 2022

With the increasing popularity of smart devices, users can control their mobile phones, TVs, cars, and smart furniture by using voice assistants, but voice assistants are susceptible to intrusion by outsider speakers or playback attacks. In order to...

  • Article
  • Open Access
11 Citations
4,263 Views
13 Pages

Detection of Major Depressive Disorder Based on a Combination of Voice Features: An Exploratory Approach

  • Masakazu Higuchi,
  • Mitsuteru Nakamura,
  • Shuji Shinohara,
  • Yasuhiro Omiya,
  • Takeshi Takano,
  • Daisuke Mizuguchi,
  • Noriaki Sonota,
  • Hiroyuki Toda,
  • Taku Saito and
  • Shinichi Tokuno
  • + 4 authors

In general, it is common knowledge that people’s feelings are reflected in their voice and facial expressions. This research work focuses on developing techniques for diagnosing depression based on acoustic properties of the voice. In this study, we...

  • Article
  • Open Access
2 Citations
2,800 Views
10 Pages

Spreading-Based Voice Encryption by Means of OVSF Codes

  • Diego Renza,
  • Dora M. Ballesteros and
  • Estibaliz Martinez

21 December 2019

This paper presents a new methodology to encrypt voice signals, in such a way that they simulate being a noise signal. The objective is to obtain a signal that does not generate suspicions about its content, while protecting the message. The process...

  • Systematic Review
  • Open Access
5 Citations
4,134 Views
15 Pages

Evidence-Based Recommendations in Primary Tracheoesophageal Puncture for Voice Prosthesis Rehabilitation

  • Miguel Mayo-Yáñez,
  • Alejandro Klein-Rodríguez,
  • Aldán López-Eiroa,
  • Irma Cabo-Varela,
  • Raquel Rivera-Rivera and
  • Pablo Parente-Arias

Head and neck cancer, the seventh most common cancer worldwide, often affects the larynx, with a higher incidence in men. Total laryngectomy, a common treatment, results in the loss of phonation, and tracheoesophageal voice rehabilitation is the curr...

  • Article
  • Open Access
5 Citations
2,051 Views
12 Pages

Reliability of Universal-Platform-Based Voice Screen Application in AVQI Measurements Captured with Different Smartphones

  • Virgilijus Uloza,
  • Nora Ulozaitė-Stanienė,
  • Tadas Petrauskas,
  • Kipras Pribuišis,
  • Tomas Blažauskas,
  • Robertas Damaševičius and
  • Rytis Maskeliūnas

18 June 2023

The aim of the study was to develop a universal-platform-based (UPB) application suitable for different smartphones for estimation of the Acoustic Voice Quality Index (AVQI) and evaluate its reliability in AVQI measurements and normal and pathologica...

  • Article
  • Open Access
10 Citations
14,517 Views
16 Pages

Voice cloning aims to synthesize the voice with a new speaker’s timbre from a small amount of the new speaker’s speech. Current voice cloning methods, which focus on modeling speaker timbre, can synthesize speech with similar speaker timb...

  • Article
  • Open Access
16 Citations
4,947 Views
20 Pages

Zigbee-Based Low Power Consumption Wearables Device for Voice Data Transmission

  • Asma Shuhail AlShuhail,
  • Surbhi Bhatia,
  • Ankit Kumar and
  • Bharat Bhushan

31 August 2022

Short-range wireless technologies can transmit real-time voice, audio, picture, and video communications. Such networks’ energy usage and transmission reach are crucial, especially for portable and power autonomous devices. Voice over Zigbee te...

  • Article
  • Open Access
33 Citations
5,402 Views
14 Pages

Vocal melody extraction is an important and challenging task in music information retrieval. One main difficulty is that, most of the time, various instruments and singing voices are mixed according to harmonic structure, making it hard to identify t...

  • Article
  • Open Access
1 Citations
1,378 Views
34 Pages

Context-Based Model for Browsing the Web Through Voice

  • Citlalli Selene Avalos Montiel,
  • José G. Rodríguez García,
  • Sonia Mendoza and
  • Dominique Decouchant

20 March 2025

To find useful information on the Web, a user must define the search according to their interests, then they must select and analyze one or more web pages, and finally they must decide which content is most useful to them. This process requires visua...

  • Article
  • Open Access
2 Citations
4,805 Views
15 Pages

16 January 2025

The metaverse, where users interact through avatars, is evolving to closely mirror the real world, requiring realistic object responses based on users’ emotions. While technologies like eye-tracking and hand-tracking transfer physical movements...

  • Communication
  • Open Access
2 Citations
2,211 Views
10 Pages

20 August 2023

Voice spoofing attempts to break into a specific automatic speaker verification (ASV) system by forging the user’s voice and can be used through methods such as text-to-speech (TTS), voice conversion (VC), and replay attacks. Recently, deep lea...

  • Article
  • Open Access
38 Citations
9,821 Views
17 Pages

Steering a Robotic Wheelchair Based on Voice Recognition System Using Convolutional Neural Networks

  • Mohsen Bakouri,
  • Mohammed Alsehaimi,
  • Husham Farouk Ismail,
  • Khaled Alshareef,
  • Ali Ganoun,
  • Abdulrahman Alqahtani and
  • Yousef Alharbi

Many wheelchair people depend on others to control the movement of their wheelchairs, which significantly influences their independence and quality of life. Smart wheelchairs offer a degree of self-dependence and freedom to drive their own vehicles....

  • Article
  • Open Access
5 Citations
2,493 Views
25 Pages

Artificial Intelligence Procedure for the Screening of Genetic Syndromes Based on Voice Characteristics

  • Federico Calà,
  • Lorenzo Frassineti,
  • Elisabetta Sforza,
  • Roberta Onesimo,
  • Lucia D’Alatri,
  • Claudia Manfredi,
  • Antonio Lanata and
  • Giuseppe Zampino

Perceptual and statistical evidence has highlighted voice characteristics of individuals affected by genetic syndromes that differ from those of normophonic subjects. In this paper, we propose a procedure for systematically collecting such pathologic...

  • Article
  • Open Access
4 Citations
3,779 Views
12 Pages

Steganalysis of Inactive Voice-Over-IP Frames Based on Poker Test

  • Jie Liu,
  • Hui Tian,
  • Chin-Chen Chang,
  • Tian Wang,
  • Yonghong Chen and
  • Yiqiao Cai

11 August 2018

This paper concentrates on the detection of steganography in inactive frames of low bit rate audio streams in Voice over Internet Protocol (VoIP) scenarios. Both theoretical and experimental analyses demonstrate that the distribution of 0 and 1 in en...

of 28