Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline

Article Types

Countries / Regions

Search Results (2)

Search Parameters:
Keywords = skin-contact microphones

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
15 pages, 3122 KiB  
Article
Simultaneous Speech and Eating Behavior Recognition Using Data Augmentation and Two-Stage Fine-Tuning
by Toshihiro Tsukagoshi, Masafumi Nishida and Masafumi Nishimura
Sensors 2025, 25(5), 1544; https://doi.org/10.3390/s25051544 - 2 Mar 2025
Viewed by 909
Abstract
Speaking and eating are essential components of health management. To enable the daily monitoring of these behaviors, systems capable of simultaneously recognizing speech and eating behaviors are required. However, due to the distinct acoustic and contextual characteristics of these two domains, achieving high-precision [...] Read more.
Speaking and eating are essential components of health management. To enable the daily monitoring of these behaviors, systems capable of simultaneously recognizing speech and eating behaviors are required. However, due to the distinct acoustic and contextual characteristics of these two domains, achieving high-precision integrated recognition remains underexplored. In this study, we propose a method that combines data augmentation through synthetic data creation with a two-stage fine-tuning approach tailored to the complexity of domain adaptation. By concatenating speech and eating sounds of varying lengths and sequences, we generated training data that mimic real-world environments where speech and eating behaviors co-exist. Additionally, efficient model adaptation was achieved through two-stage fine-tuning of the self-supervised learning model. The experimental evaluations demonstrate that the proposed method maintains speech recognition accuracy while achieving high detection performance for eating behaviors, with an F1 score of 0.918 for chewing detection and 0.926 for swallowing detection. These results underscore the potential of using voice recognition technology for daily health monitoring. Full article
(This article belongs to the Special Issue AI-Based Automated Recognition and Detection in Healthcare)
Show Figures

Figure 1

6 pages, 1173 KiB  
Proceeding Paper
Wearable Impedance-Matched Noise Canceling Sensor for Voice Pickup
by Hee Yun Suh, Helena Hahn and James West
Eng. Proc. 2023, 58(1), 99; https://doi.org/10.3390/ecsa-10-16153 - 15 Nov 2023
Cited by 1 | Viewed by 795
Abstract
Communicating under extreme noise conditions remains challenging in spite of higher-order noise-canceling microphones, throat microphones, and signal processing. Both natural and human-made background ambient noise can disturb the conveyance of information because of high noise levels. Noise cancellation, which is used frequently in [...] Read more.
Communicating under extreme noise conditions remains challenging in spite of higher-order noise-canceling microphones, throat microphones, and signal processing. Both natural and human-made background ambient noise can disturb the conveyance of information because of high noise levels. Noise cancellation, which is used frequently in audio technology, has limits in noise reduction and does not guarantee clear vocal pickup in these severe situations. A contact microphone that is attached directly to the medium of interest has the potential to pick up vocal signals with reduced noise. In this study, an electrostatic transducer with an elastomer layer that is impedance-matched to the human body is used to pick up speech sounds through constant contact on the chin and cheek. By attaching the wearable device directly to the skin, the medium of air is bypassed, and airborne noise is passively canceled. Because of the acoustic impedance-matched layer, the sensor is more sensitive to low frequencies under 500 Hz, so frequency equalization was implemented to flatten the frequency response throughout the vocal range. The perceptual evaluation of speech quality (PESQ) scores of the wearable device with equalization averaged around 2.6 on a scale from –0.5 to 4.5. Speech recordings were also collected in a noise field of 85 dB, and the performance was compared to a cardioid lapel mic, a cardioid dynamic mic, and an omnidirectional condenser mic. The recordings revealed a significantly reduced presence of white noise in the contact sensor. This study provides preliminary results that show potential vocal applications for a wearable impedance-matched sensor. Full article
Show Figures

Figure 1

Back to TopTop