Next Article in Journal
Optimized Design of a Triangular Shear Piezoelectric Sensor Using Non-Dominated Sorting Genetic Algorithm-II(NSGA-II)
Previous Article in Journal
Impact of Playing Position on Competition External Load in Professional Padel Players Using Inertial Devices
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Experimental Study on Noise-Reduced Propagation Characteristics of the Parametric Acoustic Array Field in a Neck Phantom

School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China
*
Authors to whom correspondence should be addressed.
Sensors 2025, 25(3), 802; https://doi.org/10.3390/s25030802
Submission received: 6 December 2024 / Revised: 23 January 2025 / Accepted: 27 January 2025 / Published: 29 January 2025
(This article belongs to the Section Biomedical Sensors)

Abstract

:
The electrolarynx (EL) is a common device for voice reconstruction in laryngectomy patients, but its mechanical sound source generates significant radiation noise, affecting the naturalness and acceptability of the speech. The parametric acoustic array (PAA), which produces directionally propagated difference-frequency sound waves, presents a promising alternative for creating a more natural glottal-like voice source in the trachea while reducing radiation noise. In this study, we developed a tissue-mimicking phantom to simulate human neck tissue and used a single-transducer-based PAA platform to generate modulated ultrasound signals with glottal waveform characteristics. Ultrasonic microphones captured sound signals fromthe trachea and surrounding air, and signal processing was used to isolate the difference-frequency signals. The results demonstrated that difference-frequency signals were successfully detected in the phantom’s trachea, with time-domain waveforms and frequency spectra closely resembling the designed glottal waveform (Pearson correlation coefficient = 0.9438). Additionally, radiation noise produced by the PAA was significantly lower (23 dB, p < 0.0001) compared to the traditional EL. These findings demonstrate the potential of PAA for voice source reconstruction in laryngectomy patients and suggest its capacity to enhance speech rehabilitation outcomes. Further research is required to refine the frequency range and evaluate clinical applicability.

1. Introduction

The electrolarynx (EL) is a critical device for voice reconstruction in patients undergoing total laryngectomy, a surgical procedure that removes the larynx [1]. This device compensates for the loss of vocal cords by using an external sound source, offering benefits like ease of use and the ability to produce continuous speech [2]. Research shows that more than half of laryngectomy patients rely on EL as their primary means of communication within five years post-surgery [3].
The EL functions by placing a vibrating membrane against the neck, allowing the sound produced to travel through surrounding tissues to the pharynx, thereby creating a substitute voice. However, a significant drawback is the mechanical noise (often referred to as “radiation noise”) generated by the device, which can reach 20–25 dB and severely interfere with speech intelligibility [4,5].
Researchers have employed various strategies to eliminate radiation noise, including physical modifications to the EL and signal processing techniques. For instance, Madden et al. [6] replaced the traditional motor with an eccentric motor, achieving a 20% improvement in speech intelligibility by reducing noise. However, maintaining clear and natural speech quality remains a challenge. Norton and Bernstein [7] wrapped the EL in thick foam, which reduced radiation noise by approximately 5 dB. However, this modification made the device cumbersome and less user-friendly. Other studies have focused on signal processing methods, such as spectral subtraction, voice conversion, and adaptive filtering, which have shown promise in enhancing speech quality by minimizing background noise [8,9,10,11]. Although these techniques enhance the clarity of the output, their lack of real-time performance often makes them unsuitable for spontaneous conversation.
The primary source of radiation noise in electrolarynx (EL) devices originates from external sound sources. One potential solution is to relocate the sound source to the oral cavity or pharynx, a method that has been explored by several researchers. For instance, Takahashi et al. [12,13] mounted a vibrating source on a denture; however, the resulting sound failed to produce natural voice quality. Huang et al. [14] developed a sound-generating device that is placed on the upper jaw, using a 3D-printed speaker holder to secure the speaker to a tooth sleeve. This design allows users to modulate sound frequency and amplitude by adjusting lung pressure and mouth shape. Despite its advantages, the tooth sleeve can interfere with speech production and pose challenges related to stability and hygiene. Moreover, foreign objects in the mouth can further complicate articulation. Painter et al. [15] explored an electromagnetic EL device for implantation in neck tissue, but this approach carries the risks associated with surgical procedures.
Parametric acoustic array (PAA) technology offers a promising alternative to address these challenges. This technique, which has been successfully applied in fields such as underwater measurement [16,17], underwater communication [17], and parametric speakers [18,19,20], generates difference-frequency waves from high-frequency sound waves. This capability enables precise and focused sound generation, even in complex environments [21]. Mills et al. [22] demonstrated the feasibility of generating difference frequency signals in soft tissue, highlighting the potential of using PAA for internal voice source reconstruction. While simulations have provided valuable insights, they cannot fully replicate the complexity of real tissue environments. Experimental validation is therefore essential to confirm the feasibility of using PAA in practical applications. To bridge this gap, our study conducts experimental investigations using a tissue-mimicking phantom to explore the feasibility of generating voice sources within the human body through PAA technology.
This study aims to experimentally investigate the use of modulated PAA technology to generate voice sources within a tissue-mimicking phantom that replicates the acoustic characteristics of human neck tissue. By comparing generated voice sources with natural voice and traditional EL outputs, this study seeks to establish the effectiveness of PAA technology in voice reconstruction. The findings aim to provide critical insights into the practical application of PAA technology for voice reconstruction in laryngectomy patients, with the potential to significantly improve their quality of life.

2. Materials and Methods

2.1. Experimental Platform

The experimental platform was designed to investigate the propagation characteristics of the PAA difference-frequency sound field in a neck phantom. The setup comprises two main subsystems: the signal excitation system and the signal acquisition system, with a neck phantom serving as the medium for acoustic wave propagation. The complete experimental setup is illustrated in Figure 1, which includes a schematic diagram (Figure 1a) and a photograph of the physical setup (Figure 1b).

2.1.1. Human Neck Tissue-Mimicking Phantom

In this study, a polyvinyl alcohol (PVA) material (product No. 563900, Sigma-Aldrich) was used to fabricate an acoustic phantom that simulates human neck tissue. The PVA (molecular weight: 130,000, Sigma-Aldrich, Zwijndrecht, The Netherlands) was prepared by dissolving 20% wt PVA in a mixture of 80% wt dimethyl sulfoxide (DMSO, Sigma-Aldrich, Zwijndrecht, The Netherlands) and 20% wt Milli-Q water. The solution underwent a series of freezing and thawing cycles to achieve the desired acoustic properties, which were designed to approximate those of human tissue (speed of sound: 1616 m/s, attenuation coefficient: 1.69 dB/cm, B/A value: 11.7) [23]. The phantom was shaped as a hollow cylinder, with a flat surface on the front for precise placement of the ultrasonic transducer. The height of the phantom was approximately 10 cm, and the outer diameter of the cylinder was designed to match the diameter of a human neck. The diameter of the central hole was chosen to resemble the size of the lower human vocal tract. Additionally, the thickness from the inner hole to the flat surface was made to approximate the thickness of human neck tissue [24]. A circular hole, approximately 1 cm in diameter, was added to the rear of the phantom to allow the insertion of a microphone for measurement.

2.1.2. Signal Excitation System

The signal excitation system was designed to generate stable ultrasonic signals. The excitation signal was produced by an arbitrary waveform generator (Analog Discovery 3, Digilent, WA, USA) and then amplified by a power amplifier (ATA3040, Aigtek, Xi’an, China) to drive the ultrasonic transducer (H2KA050 KA1CD00, Unictron, Taiwan). The transducer, with a center frequency of 50 kHz and a diameter of 5 cm, was positioned at the middle of the flat surface of the phantom, ensuring optimal transmission of acoustic waves into the medium. Figure 1b illustrates the physical setup of the transducer and phantom.

2.2. Experimental Procedure

The experiments were conducted in a quiet room to minimize external noise interference. The signal excitation system transmitted an excitation signal with a peak-to-peak voltage of 90 V, driving the ultrasonic transducer to emit ultrasound waves, which propagated through the neck phantom. The signal acquisition system captured 10 s of continuous audio data at a sampling rate of 204.8 kHz with 24-bit resolution. To assess the impact of the phantom’s acoustic properties, measurements were also taken in air after the phantom was removed. All measurements were repeated three times for reliability.
Subsequently, the ultrasonic transducer was replaced with a commercial EL, and the same acquisition conditions were used to capture the sound emitted by the EL as it propagated through the phantom.

2.3. Excitation Signal

The parametric array excitation signal used in this study was generated using the Amplitude Modulation (AM) method [25], as described by the following equation:
S t = ( A + m ( t ) ) × c o s ( 2 π f c t ) ,
where m ( t ) represents the envelope signal, s ( t ) is the resulting modulated signal, A is the amplitude of the carrier signal, and c o s ( 2 π f c t ) is the carrier signal with frequency f c . The envelope signal was generated using the Liljencrants–Fant (LF) glottal waveform model, which accurately simulates human glottal airflow during phonation [26]. The glottal waveform frequency was set to 200 Hz, with a carrier frequency of 50 kHz corresponding to the central frequency of the ultrasonic transducer. The modulation depth was set to 100%, resulting in full modulation of the carrier signal by the glottal waveform. Figure 2 illustrates the predefined glottal waveform signal and the corresponding AM-modulated excitation signal.

2.4. Signal Processing and Parameter Evaluation

The collected sound signals were subjected to a finite impulse response (FIR) band-pass filter with a passband of 20 Hz to 1000 Hz to extract the difference frequency signals. The extracted signal waveform was compared to the preset glottal waveform in the time domain. To quantify the similarity between the two signals, the Pearson correlation coefficient (r) was calculated. This was performed using MATLAB, with r computed based on the covariance of the variables normalized by their standard deviations:
r = x i x ¯ y i y ¯ x i x ¯ 2 y i y ¯ 2
where x i and y i represent the measured values of AS/TMS and LFS, respectively, and x ¯ and y ¯ are the mean values of AS/TMS and LFS.
Additionally, the autoregressive (AR) power spectral density of the acquired signals was calculated using the AR Burg method, with an order of 190 chosen for its stability and accuracy in spectral estimation.
The sound pressure levels (SPLs) recorded by the external and internal microphones were used to evaluate the intensity of the difference frequency signal and quantify radiation noise. The SPL difference (ΔL) between the external microphone, which measures radiation noise, and the internal microphone, which measures the sound inside the phantom, serves as an indicator of noise leakage. A larger ΔL indicates less radiation noise. The SPL difference was calculated using the following formula:
L = S P L e x t S P L i n t ,
To assess the statistical significance of the SPL differences between the two excitation sources (EL and PAA), a paired t-test was performed using a significance level of p < 0.001 . This stricter threshold was chosen to ensure robust conclusions in the context of this study.

3. Results

3.1. Time-Domain Analysis

The waveforms of the difference-frequency glottal wave obtained after excitation by the parametric array are shown in Figure 3. Figure 3a illustrates the waveform for five periods, with the first row representing the glottal wave obtained through the LF model calculation (envelope signal, LFS). The second and third rows show the difference-frequency signal waveforms captured at the transducer’s axial position in air (air signal, AS) and after propagation through the tissue-mimicking phantom (tissue-mimicking signal, TMS), respectively.
From the time-domain signals, it can be observed that the modulated glottal wave signal, after propagating through both the air and tissue-mimicking media, retains a periodicity corresponding to the fundamental frequency (F0) of the original glottal wave signal. The restored waveforms closely resemble the original glottal wave signal. Pearson correlation analysis revealed a correlation coefficient of 0.9767 between AS and LFS, and 0.9438 between TMS and LFS.
Compared to the LFS waveform, the difference-frequency waveforms (AS and TMS) show some distortion in their waveform shapes. To further analyze the differences, one period from the LFS, AS, and TMS waveforms, as well as the corresponding EL signal, were normalized and aligned by their signal peaks, as shown in Figure 3b. The analysis indicates that the peak of all three waveforms occurs at approximately the 50% point of the signal period. However, compared to LFS, the rise time of AS and TMS is steeper. The trough of the LFS signal is stable, without high-frequency noise or unwanted frequency components. On the right side of the peak, at approximately 70% of the signal period, a second sharp peak appears in both AS and TMS signals, with the TMS signal showing a more pronounced peak.

3.2. Frequency-Domain Analysis

The Burg AR power spectral density (PSD) curves of LFS, AS, and TMS signals are shown in Figure 4. All three signals exhibit a dominant peak at the F0, with harmonic components appearing at integer multiples of the F0 (200 Hz, 400 Hz, 600 Hz, etc.). The energy of the F0 is the highest, while the harmonic components decrease in amplitude as the frequency increases.
The primary energy is concentrated around 200 Hz, with harmonic components becoming progressively weaker at higher frequencies. The fourth harmonic in the TMS signal is notably smaller compared to the AS signal. The frequency spectra of AS and TMS signals show slight attenuation of higher harmonics compared to the LFS signal, with the TMS signal exhibiting more pronounced attenuation in the higher frequency range.

3.3. Radiation Noise Analysis

As shown in Figure 5a, the sound pressure level (SPL) of radiation noise measured with the EL as the excitation source was 81.20 dB (SD: 0.36 dB). In contrast, when using the PAA as the excitation source, the SPL of radiation noise was significantly lower at 24 dB (SD: 0.16 dB), with a difference that was statistically significant (p < 0.0001).
Figure 5b illustrates the ΔL between the external and internal microphones. For the EL, the SPL difference was −16 dB (SD: 0.34 dB), whereas for the PAA, the SPL difference was −23 dB (SD: 0.15 dB). The SPL difference for the PAA was significantly higher than that for the EL (p < 0.0001), indicating that less radiation noise was leaking outside the phantom when using the PAA.

4. Discussion

This study experimentally investigates the propagation of the PAA sound field through tissue-mimicking media to evaluate its potential for reconstructing glottal waveforms. The results confirm that the PAA effectively generates low-frequency difference waves within the tissue-mimicking media, retaining the envelope characteristics of the modulated signal after propagation. Notably, these difference-frequency waves, when transmitted through the medium, exhibit a higher degree of similarity to the human glottal waveform than the excitation signal generated by the EL. This finding demonstrates the feasibility of using the PAA to reconstruct glottal waveforms, offering a potential advantage over traditional EL devices, which produce signals that are significantly different from natural human phonation.
The time–domain analysis revealed that the PAA emitted modulated glottal waveforms (LF model) maintaining periodicity with the same fundamental frequency (200 Hz) as the pre-defined signal, regardless of whether they propagated through air or tissue-mimicking media. The waveforms produced by PAA showed a high degree of similarity to the original glottal waveform, as evidenced by the Pearson correlation coefficients exceeding 0.9. In contrast, the waveform produced by the EL resembled a sharp, impulsive signal, rather than a smoothly modulated glottal waveform. This finding aligns with the simulations reported by Mills et al. [22] but differs in that Mills’ approach used a frequency difference method with two transducers, which could only generate sine waves corresponding to the frequency difference between two excitation signals. This study, however, utilized a single transducer to modulate the excitation signal, allowing for more versatile waveform generation and yielding a signal closer to natural human phonation. Furthermore, the use of one transducer simplifies the design of future devices, making the system more portable and cost-effective.
When comparing the LFS, AS, and TMS waveforms, the peak-to-peak value of the TMS waveform was significantly smaller than that of the AS waveform, possibly due to stronger attenuation of the sound by the tissue-mimicking medium. Additionally, the rise and fall of the TMS waveform were less smooth compared to the LFS waveform, with added peaks and valleys, which could be attributed to the harmonic components present in the original glottal signal. These harmonic components experienced some loss during propagation through the tissue-mimicking medium, a phenomenon that became more pronounced as the medium’s nonlinearity increased. This was further corroborated by the power spectral analysis.
In the frequency domain, the AR power spectral density analysis indicated that, in addition to the F0, the LF model waveform contained several harmonic components. After propagation through both air and tissue-mimicking media, the parametric array sound field retained these harmonic components, with their energy gradually decreasing as the frequency increased, similar to the energy distribution observed in the original LF model. When compared to the EL-generated sound field, the PAA signal exhibited a frequency spectrum that more closely matched that of the LF model, emphasizing its potential to more accurately simulate human phonation.
A key aspect of this study was the evaluation of radiation noise produced by the two excitation sources, using an external microphone placed behind the tissue-mimicking medium. The results presented in Figure 5 indicate a significant difference in radiation noise between the EL and the PAA. The SPL of the radiation noise generated by the PAA was substantially lower than that of the EL, owing to the directional nature of the PAA. Additionally, the simplified neck phantom used in this study may have led to some sound leakage at the openings of the trachea. In a more realistic human model, where the trachea is a closed tube, we would expect even lower radiation noise. Moreover, due to the low energy of the difference-frequency sound generated by the PAA, any leaked radiation noise would likely be imperceptible to the listener.
While this study demonstrates the potential of the PAA for reconstructing glottal waveforms and reducing radiation noise, the glottal waveform frequency obtained (200 Hz) remains below the typical fundamental frequency range of normal human speech (60–500 Hz). This limitation is likely due to the output power of the transducer and the modulation method used. Future research should focus on enhancing the output power and improving the conversion efficiency of the difference-frequency signal to achieve a broader frequency range that better matches the natural human voice. Furthermore, the tissue-mimicking phantom in this study does not fully replicate the complex structure of human tissues, particularly regarding their nonlinear acoustic properties. Future studies should explore more anatomically accurate phantoms or in vivo experiments to further validate the feasibility of the parametric acoustic array for practical clinical applications, such as in improving the quality and naturalness of speech in patients with laryngectomy.
The selection of a 5 cm diameter and a 50 kHz center frequency for the transducer in this study was driven by the specific requirements of PAA technology in generating glottal waves within the human body. The diameter was chosen to accommodate the limited area of the human neck while maximizing the energy output of the difference-frequency wave, as the emitting area directly influences the efficiency of PAA energy conversion. Similarly, the 50 kHz center frequency was selected to balance energy conversion efficiency and directivity, with lower frequencies providing higher efficiency but suffering from poor directivity. While the experimental results confirm the feasibility of generating glottal waves within the phantom, further optimization of transducer parameters remains an essential avenue for improving conversion efficiency. Future work will focus on refining transducer size and center frequency, exploring advanced modulation methods, and employing focused transducers to enhance power density at specific target locations, ensuring a stronger and more localized difference-frequency sound field for applications in the human body.

5. Conclusions

This study provides compelling evidence that PAA technology is capable of reconstructing glottal waveforms with higher fidelity than traditional EL devices, offering significant advantages in both waveform accuracy and radiation noise reduction. Comparative analysis demonstrated a high degree of similarity between the PAA-generated signals and the model glottal waveforms. Additionally, the autoregressive spectral analysis confirmed that the PAA accurately reproduces essential spectral features of the glottal waveform, further supporting its potential for voice rehabilitation.
The use of a single transducer to generate modulated signals makes this method more efficient and practical for future speech rehabilitation technologies. However, improvements in signal strength, frequency range, and system design are necessary to fully meet the demands of natural human speech. Future work will focus on optimizing transducer parameters, exploring advanced modulation techniques, and conducting clinical evaluations to ensure effective translation of this technology into practical applications. This study lays a strong foundation for advancing voice restoration solutions, with the potential to significantly improve the quality of life for individuals with total laryngectomy.

Author Contributions

Conceptualization, L.W.; methodology L.W. and F.L.; data analysis, L.W. and J.Z.; supervision, H.N. and L.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China (12102057).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is unavailable due to privacy or ethical restrictions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Souza, F.G.R.; Santos, I.C.; Bergmann, A.; Thuler, L.C.S.; Freitas, A.S.; Freitas, E.Q.; Dias, F.L. Quality of Life after Total Laryngectomy: Impact of Different Vocal Rehabilitation Methods in a Middle Income Country. Health Qual. Life Outcomes 2020, 18, 92. [Google Scholar] [CrossRef] [PubMed]
  2. Tang, J.A.; McCarroll, L.; Schmalbach, C.E. Voice Restoration and Quality of Life in Larynx Cancer. Otolaryngol. Clin. N. Am. 2023, 56, 361–370. [Google Scholar] [CrossRef] [PubMed]
  3. Mendenhall, W.M.; Morris, C.G.; Stringer, S.P.; Amdur, R.J.; Hinerman, R.W.; Villaret, D.B.; Robbins, K.T. Voice Rehabilitation after Total Laryngectomy and Postoperative Radiation Therapy. J. Clin. Oncol. 2002, 20, 2500–2505. [Google Scholar] [CrossRef] [PubMed]
  4. Niu, H.-J.; Wan, M.-X.; Wang, S.-P.; Liu, H.-J. Enhancement of Electrolarynx Speech Using Adaptive Noise Cancelling Based on Independent Component Analysis. Med. Biol. Eng. Comput. 2003, 41, 670–678. [Google Scholar] [CrossRef]
  5. Kaye, R.; Tang, C.G.; Sinclair, C.F. The Electrolarynx: Voice Restoration after Total Laryngectomy. Med. Devices 2017, 10, 133–140. [Google Scholar] [CrossRef]
  6. Madden, B.; Nolan, M.; Burke, E.; Condron, J.; Coyle, E. Intelligibility of Electrolarynx Speech Using a Novel Actuator. In Proceedings of the IET Irish Signals and Systems Conference (ISSC 2010), Cork, Ireland, 23–24 June 2010; IET: London, UK; pp. 158–162. [Google Scholar]
  7. Norton, R.L.; Bernstein, R.S. Improved Laboratory Prototype Electrolarynx (LAPEL): Using Inverse Filtering of the Frequency Response Function of the Human Throat. Ann. Biomed. Eng. 1993, 21, 163–174. [Google Scholar] [CrossRef]
  8. Boll, S. Suppression of Acoustic Noise in Speech Using Spectral Subtraction. IEEE Trans. Acoust. Speech Signal Process 1979, 27, 113–120. [Google Scholar] [CrossRef]
  9. Kobayashi, K.; Toda, T. Electrolaryngeal Speech Enhancement with Statistical Voice Conversion Based on CLDNN. In Proceedings of the 2018 26th European Signal Processing Conference (EUSIPCO), Rome, Italy, 3–7 September 2018; IEEE: Rome, Italy; pp. 2115–2119. [Google Scholar]
  10. Cole, D.; Sridharan, S.; Moody, M.; Geva, S. Application of Noise Reduction Techniques for Alaryngeal Speech Enhancement. In Proceedings of the IEEE TENCON ’97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications (Cat. No.97CH36162), Brisbane, QLD, Australia, 4 December 1997; Volume 2, pp. 491–494. [Google Scholar]
  11. Violeta, L.P.; Huang, W.-C.; Ma, D.; Yamamoto, R.; Kobayashi, K.; Toda, T. Electrolaryngeal Speech Intelligibility Enhancement through Robust Linguistic Encoders. In Proceedings of the ICASSP 2024—2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Republic of Korea, 14–19 April 2024; pp. 10961–10965. [Google Scholar]
  12. Takahashi, H.; Nakao, M.; Kikuchi, Y.; Kaga, K. Intra-Oral Pressure–Based Voicing Control of Electrolaryngeal Speech with Intra-Oral Vibrator. J. Voice 2008, 22, 420–429. [Google Scholar] [CrossRef]
  13. Takahashi, H.; Nakao, M.; Kikuchi, Y.; Kaga, K. Alaryngeal Speech Aid Using an Intra-Oral Electrolarynx and a Miniature Fingertip Switch. Auris Nasus Larynx 2005, 32, 157–162. [Google Scholar] [CrossRef]
  14. Huang, T.-Y.; Lin, B.-S.; Lien, C.-F.; Yu, W.-H.V.; Peng, Y.-Y.; Lin, B.-S. A Voice-Producing System With Naturalness and Variable Multi-Frequency Vocalization for Patients Who Have Undergone Laryngectomy. IEEE Access 2023, 11, 30619–30627. [Google Scholar] [CrossRef]
  15. Painter, C.; Fredrickson, J.M.; Kaiser, T.; Karzon, R. Human Speech Development for an Implantable Artificial Larynx. Ann. Otol. Rhinol. Laryngol. 1987, 96, 573–577. [Google Scholar] [CrossRef] [PubMed]
  16. Hong, F.; Feng, H.; Huang, M.; Wang, B.; Xia, J. China’s First Demonstration of Cobalt-Rich Manganese Crust Thickness Measurement in the Western Pacific with a Parametric Acoustic Probe. Sensors 2019, 19, 4300. [Google Scholar] [CrossRef] [PubMed]
  17. Tang, S.; Zhu, G.; Yin, J.; Zhang, X.; Han, X. A Modulation Method of Parametric Array for Underwater Acoustic Communication. Appl. Acoust. 2019, 145, 305–313. [Google Scholar] [CrossRef]
  18. Fantinelli De Carvalho, A.C.; Arnela, M.; Burbano-Escolà, R.; Vaquerizo-Serrano, J. A Study of the Ultrasonic and Audible Frequency Response of Commercial Loudspeakers for Use as Parametric Acoustic Arrays. Inter-Noise Noise-Con. Congr. Conf. Proc. 2024, 270, 755–763. [Google Scholar] [CrossRef]
  19. Gan, W.-S.; Yang, J.; Kamakura, T. A Review of Parametric Acoustic Array in Air. Appl. Acoust. 2012, 73, 1211–1219. [Google Scholar] [CrossRef]
  20. Ogami, Y.; Nakayama, M.; Nishiura, T. Virtual Sound Source Construction Based on Radiation Direction Control Using Multiple Parametric Array Loudspeakers. J. Acoust. Soc. Am. 2019, 146, 1314. [Google Scholar] [CrossRef]
  21. Červenka, M.; Bednařík, M. An Algebraic Correction for the Westervelt Equation to Account for the Local Nonlinear Effects in Parametric Acoustic Array. J. Acoust. Soc. Am. 2022, 151, 4046–4052. [Google Scholar] [CrossRef]
  22. Mills, P.; Zara, J. 3D Simulation of an Audible Ultrasonic Electrolarynx Using Difference Waves. PLoS ONE 2014, 9, e113339. [Google Scholar] [CrossRef]
  23. Chen, P.; Pollet, A.M.A.O.; Panfilova, A.; Zhou, M.; Turco, S.; Den Toonder, J.M.J.; Mischi, M. Acoustic Characterization of Tissue-Mimicking Materials for Ultrasound Perfusion Imaging Research. Ultrasound Med. Biol. 2022, 48, 124–142. [Google Scholar] [CrossRef]
  24. Breatnach, E.; Abbott, G.C.; Fraser, R.G. Dimensions of the Normal Human Trachea. AJR Am. J. Roentgenol. 1984, 142, 903–906. [Google Scholar] [CrossRef]
  25. Zhou, Z.; Chen, S. Design of Parametric Acoustic Array Based on Inverse Control Processing. Appl. Acoust. 2022, 195, 108816. [Google Scholar] [CrossRef]
  26. Fant, G. Glottal Flow: Models and Interaction. J. Phon. 1986, 14, 393–399. [Google Scholar] [CrossRef]
Figure 1. Experimental setup for investigating the propagation characteristics of the PAA difference-frequency sound field in a neck phantom. (a) Schematic diagram of the experimental platform; (b) photograph of the physical setup of the system: the ultrasound transducer is placed on the surface of the phantom, with coupling gel applied in the center. A microphone (Microphone 1) is inserted into a hole on the opposite side of the phantom, with its tip positioned at the center of the phantom’s trachea to capture the sound field inside the trachea. Another microphone (Microphone 2) is placed 10 cm from the trachea center on the outside of the phantom, at the same horizontal plane as Microphone 1 and at an approximate 45° angle to Microphone 1, to collect radiated noise.
Figure 1. Experimental setup for investigating the propagation characteristics of the PAA difference-frequency sound field in a neck phantom. (a) Schematic diagram of the experimental platform; (b) photograph of the physical setup of the system: the ultrasound transducer is placed on the surface of the phantom, with coupling gel applied in the center. A microphone (Microphone 1) is inserted into a hole on the opposite side of the phantom, with its tip positioned at the center of the phantom’s trachea to capture the sound field inside the trachea. Another microphone (Microphone 2) is placed 10 cm from the trachea center on the outside of the phantom, at the same horizontal plane as Microphone 1 and at an approximate 45° angle to Microphone 1, to collect radiated noise.
Sensors 25 00802 g001
Figure 2. (a) Predefined glottal waveform and (b) corresponding AM-modulated excitation signal.
Figure 2. (a) Predefined glottal waveform and (b) corresponding AM-modulated excitation signal.
Sensors 25 00802 g002
Figure 3. (a) Time−domain waveforms of LFS, AS, and TMS. (b) Normalized comparison of one period from the LFS, AS, TMS, and EL signals, derived from one cycle in Figure 3a (between the two dotted lines), highlighting waveform distortions and differences in peak and rise times. In contrast, the excitation signal produced by the EL is completely different from the glottal wave signal. The Pearson correlation between the EL signal and the LFS is only 0.0057, indicating minimal similarity.
Figure 3. (a) Time−domain waveforms of LFS, AS, and TMS. (b) Normalized comparison of one period from the LFS, AS, TMS, and EL signals, derived from one cycle in Figure 3a (between the two dotted lines), highlighting waveform distortions and differences in peak and rise times. In contrast, the excitation signal produced by the EL is completely different from the glottal wave signal. The Pearson correlation between the EL signal and the LFS is only 0.0057, indicating minimal similarity.
Sensors 25 00802 g003
Figure 4. Burg AR power spectral density (PSD) curves of (a) LFS, (b) AS, and (c) TMS signals.
Figure 4. Burg AR power spectral density (PSD) curves of (a) LFS, (b) AS, and (c) TMS signals.
Sensors 25 00802 g004
Figure 5. Comparison of radiation noise levels: (a) SPL of radiation noise for EL and PAA excitation sources, and (b) SPL difference (ΔL) between external and internal microphones for EL and PAA (***: p < 0.001).
Figure 5. Comparison of radiation noise levels: (a) SPL of radiation noise for EL and PAA excitation sources, and (b) SPL difference (ΔL) between external and internal microphones for EL and PAA (***: p < 0.001).
Sensors 25 00802 g005
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, L.; Li, F.; Zhou, J.; Niu, H. Experimental Study on Noise-Reduced Propagation Characteristics of the Parametric Acoustic Array Field in a Neck Phantom. Sensors 2025, 25, 802. https://doi.org/10.3390/s25030802

AMA Style

Wang L, Li F, Zhou J, Niu H. Experimental Study on Noise-Reduced Propagation Characteristics of the Parametric Acoustic Array Field in a Neck Phantom. Sensors. 2025; 25(3):802. https://doi.org/10.3390/s25030802

Chicago/Turabian Style

Wang, Li, Fengji Li, Jie Zhou, and Haijun Niu. 2025. "Experimental Study on Noise-Reduced Propagation Characteristics of the Parametric Acoustic Array Field in a Neck Phantom" Sensors 25, no. 3: 802. https://doi.org/10.3390/s25030802

APA Style

Wang, L., Li, F., Zhou, J., & Niu, H. (2025). Experimental Study on Noise-Reduced Propagation Characteristics of the Parametric Acoustic Array Field in a Neck Phantom. Sensors, 25(3), 802. https://doi.org/10.3390/s25030802

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop