Next Article in Journal
Detecting Out-of-Distribution Samples in Complex IoT Traffic Based on Distance Loss
Previous Article in Journal
Integrated Information in Relational Quantum Dynamics (RQD)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Tests of the Influence of DAF (Delayed Auditory Feedback) on Changes in Speech Signal Parameters

by
Dominika Kanty
1,* and
Piotr Staroniewicz
2
1
Department of Electronic and Photonic Metrology, Faculty of Electronics, Photonics and Microsystems, Wroclaw University of Science and Technology, Wybrzeże Wyspiańskiego 27, 50-370 Wrocław, Poland
2
Department of Acoustics, Multimedia and Signal Processing, Faculty of Electronics, Photonics and Microsystems, Wroclaw University of Science and Technology, Wybrzeże Wyspiańskiego 27, 50-370 Wrocław, Poland
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(13), 7524; https://doi.org/10.3390/app15137524 (registering DOI)
Submission received: 12 June 2025 / Revised: 24 June 2025 / Accepted: 1 July 2025 / Published: 4 July 2025

Abstract

Contemporary phonetics and speech therapy continuously seek new techniques and methods that could contribute to improving verbal communication for individuals with speech disorders. One such phenomenon, Delayed Auditory Feedback (DAF), involves the speaker hearing their own voice with a specific delay relative to real-time speech. Although the research presented in this study was conducted on healthy individuals, it offers valuable insights into the mechanisms controlling speech, which may also apply to individuals with speech disorders. This article introduces a novel method and measurement setup, focusing on selected key speech signal parameters. To characterize the impact of Delayed Auditory Feedback (DAF) on fluent speakers, speech signal parameters were measured in 5 women and 5 men during spontaneous speech and reading. Parameters such as speech rate, fundamental frequency, formants, speech duration, jitter, and shimmer were analyzed both during and prior to the application of DAF. The results of this study may find practical applications in the field of telecommunications, especially in improving the efficiency and quality of human communication.

1. Introduction

Motor activities such as speech and articulation are highly complex operations that not only rely on precise neuromuscular coordination but also exemplify how natural language interactions support adaptive behavior in complex cyber-socio-technical systems [1]. Their proper functioning directly depends on the structure and functionality of the speech apparatus and the auditory system [2], which, when working in cooperation, enable accurate articulation of words and sentences. One example of speech disorders is stuttering, which may be caused by genetic mutations [3], atypical brain activity [4], or sensorimotor changes, including alterations in auditory information processing and speech rhythm [5]. Research on speech disorders, as outlined in the new WHO guidelines in ICD-11, emphasizes the need for detailed descriptions and classifications of speech disorders, considering their diverse symptoms and severity. This approach highlights the significance of speech disorders and facilitates a better understanding of these conditions, which is critical in globally improving the quality of life for individuals with communication difficulties [6].
Delayed Auditory Feedback (DAF) is a sensory modification characterized by a precisely defined delay in auditory feedback, which can range up to 500 ms [7]. Numerous studies in the literature have addressed the selection of appropriate delay intervals to elicit the DAF effect. Table 1 presents a selection of relevant articles along with the specific delayed used. For clarity, a 0 ms condition is not listed, as its inclusion is typically implicit. These findings support the rationale behind the delay intervals chosen by the authors, as described in the Study Design section.
The article opted for delays of 180, 250, and 360 ms, as delays below 100 ms did not significantly affect the subjects. In a recent article from 2022, authors Ozker, Dolye, Devinsky, and Flinker examined auditory cortex activation along with the DAF effect for delays of 50, 100, and 200 ms. Activity in the auditory cortex (STG) increased with increasing delay—for a delay of 200 ms, neural activation, prolongation, and speech fluency impairment were most noticeable [16].
The phenomenon of Delayed Auditory Feedback remains the only known research tool capable of inducing stuttering symptoms in healthy individuals. At the same time, several studies suggest that the appropriate use of DAF in speech therapy effectively supports individuals with fluency disorders, enhancing communication quality [15]. A notable advantage of this phenomenon is its potential application in home settings, making therapy more accessible. Regular training sessions with DAF can lead to significant improvements in speech fluency [17].
Speech disorders, such as stuttering (ICD-10), are the most common fluency disorders, affecting approximately 0.76% of the population [18]. It is worth noting that discrepancies in data are reported in the scientific literature—some sources suggest that stuttering may affect between 1% and even 5% of the population, which allows for the assumption that it pertains to a small percentage of individuals [19]. Another disorder currently undergoing clear identification, cluttering (ICD-10 F98.6, ICD-9307.0), is observed in 5–16% of the population [20].
The therapies currently available are tailored to the needs of patients and depend, among other factors, on age and the underlying cause of the disorder. It should be emphasized that methods effective for cluttering may exacerbate symptoms in cases of stuttering, and vice versa. For this reason, the phenomenon of Delayed Auditory Feedback (DAF) should be approached with care, and an appropriately selected research group should be prepared to evaluate whether such a therapy could serve as a beneficial tool for speech therapists in supporting the primary treatment for either group.

2. Tools Used

The study used headphones with foam ear cushions to provide participants with acoustic isolation from potential ambient sounds. These headphones are closed which feature surround sound. In addition, a foam-cushioned microphone was used separately to reduce breath sounds and suppress blast consonants. The microphone used had a cardioid characteristic, sampling at 96 kHz and a bit rate of 24 bits, with a frequency response of 20 Hz to 20 kHz. The cardioid microphone was positioned 10–15 cm from the speaker’s mouth. This range was chosen so that any movement away from the microphone would be minimal and would not have a negative impact on the observations. It is worth noting that for the purposes of this study, headphones with a built-in microphone function would suffice.
Figure 1 shows a diagram of the system used in the study. The upper part of the diagram is responsible for recording audio for later analysis for the purposes of this work, while the lower part is responsible for the auditory delay effect of feedback.
Delay simulation was performed using the DAF application, which allows fine-tuning of the desired delay. This application has the ability to set the delay and record the progress (Figure 2). By its accessibility, it was decided to use this application, as anyone interested in the DAF effect can use it anywhere. Because of its simplicity of use, it is an ideal tool for working on self-improvement in both the level of correctness and intelligibility of speech. After pressing a button, the application starts its work, and everything the user says is repeated after a certain pre-selected time. Each user has the ability to record his or her speech, which is later transferred to a library of recordings, which makes it possible to analyze speech and decide what still needs to be worked on. An additional feature of the application is the ability to record sessions, which allows for later analysis of progress.
Audacity was used to record the speech of the study participants. It is an audio editing and recording program. It allows audio files to be saved in multiple formats and be recorded at multiple sampling rates. In addition, once the study was completed, the program had the ability to extract the marked sound and automatically measure its timing. For phonetic analysis, on the other hand, the Praat program was used, which was developed by researchers from the Institute of Phonetics at the University of Amsterdam—Paul Boersma and David Weenink [21]. The program is intuitive, easy to use, and at the same time allows the provision of much needed information about sound. It not only educates laymen who are taking their first steps on the subject of speech acoustics, but it is also a professional tool for researchers, which allows them to extract a lot of information on signal parameters [22,23].

3. Study Design

The study group consisted of university students (mean age: 22 years), all of whom were native speakers of Polish. The participants had no speech, hearing, vision, or neurological disorders. All were open to participating in the study and provided their consent.
As required by the Polish legislation [24], written informed consent was obtained from all participants prior to their involvement in the study, including consent for the processing of personal data. The participants were provided with comprehensive information regarding the study’s objectives and scope. The research carried no health risks, and participation was strictly voluntary [25].
The investigation examined delays of 0 ms, 180 ms, 250 ms, and 360 ms, where 0 ms represented natural speech. The participants wore headphones that isolated them from external sounds, focusing their attention entirely on auditory sensations. The tasks assigned to the participants included engaging in short conversations, describing a picture, and reading—both in natural speech and under delayed auditory feedback conditions.
Before the study began, the participants were informed about the procedure and provided with guidelines to ensure smooth execution. They were instructed to maintain a consistent distance from the microphone and avoid deviating from the specified distance between their mouths and the transducer. They were asked not to laugh during their speech, to speak loudly and clearly, maintain their natural pace, avoid rustling the paper containing the text, and refrain from tapping or stomping.
Recognizing that the study could induce stress in the participants, a stress-relief tool in the form of an elastic band was provided. This allowed the participants to channel their emotions without disrupting the study. The band did not introduce motor noise, and hand movements were not recorded in the acoustic signal. These measures were designed to prevent extraneous sounds that might interfere with the recorded speech samples. The participants were reassured that speech difficulties arising under delayed auditory feedback were expected, ensuring they did not focus excessively on verbal impairments.
The study began with spontaneous speech to familiarize the participants with the experiment’s structure. This was followed by Delayed Auditory Feedback (DAF) with delays of 180 ms, 250 ms, and 360 ms, applied in random order to prevent the participants from predicting the delay or its level of difficulty. A short conversation was conducted at the start of the test, focusing on natural topics familiar to the participant. The dialog was designed to encourage a conversation rather than interrogate the participant, helping to alleviate stress and “warm up” the speech apparatus. This section was not analyzed, as its purpose was to prepare the participants for subsequent stages of the study.
The next stage involved describing photographs selected to minimize ambiguity about their content. Participants were not informed beforehand about the images and were required to describe them spontaneously as the visual stimuli were presented in a randomized sequence. The images used in the study were frames from cartoons familiar to the participants, ensuring contextual recognition. Each of the photos described was taken into account when analyzing the results. Before this task, the participants were advised to provide descriptions as detailed as possible for a later analysis. This part aimed to assess creativity, unconventional ideas, and the ability to divide attention during spontaneous speech.
The final task involved reading two texts: a poem and a book advertisement. The poem consisted of 110 words and 209 syllables, with a Flesch–Kincaid Grade Level (Fog Index) of 6, indicating that the language was very simple and appropriate for primary school students. The book advertisement contained 139 words and 295 syllables, with a Fog Index of 9, classifying it as simple and accessible. The Fog Index is widely recognized as an international standard for assessing text complexity, including materials written in Polish [26]. Currently, there is a lack of Polish language tools that are as thoroughly tested and validated in the scientific literature. However, existing research confirms that the Fog Index can be effectively applied to Polish language texts and that the results show good correlation with other established readability assessment methods [27]. The texts were chosen to be non-demanding to ensure the participants could read them without difficulty, avoiding errors such as misreading words or substituting them with others.
The participants were instructed to first read the poem, followed by the advertisement. This sequence was intentional, as the poem’s solemn tone encourages slower, more deliberate reading, allowing researchers to examine the focus and attention during the reading tasks. The poem’s structure, which naturally slows down speech, was used to determine whether the imposed delays would further affect the reading pace. In contrast, the book advertisement was a standard text used to measure word articulation speed.

4. Result and Discussion

Articulation rate is a commonly used tool for studying speech disorders [28]. Speech tempo varies across languages. For adult Polish speakers, it averages at 2 syllables per second during slow speech and 4–5 syllables per second during faster conversations [29].
Both female and male participants demonstrated a decrease in speech tempo as the delay increased. The highest average speech tempo was observed under natural speech conditions (0 ms delay), while the lowest occurred at the maximum delay (360 ms). The women exhibited generally slower speech rates compared to the men across all delay conditions, which may suggest differences in adaptive strategies or sensitivity to DAF. The men showed less variability in speech tempo across the different delay conditions, indicating greater stability in coping with delayed auditory feedback.
Speech tempo also varied depending on the complexity of the photographs being described, which could reflect their intricacy or the individual difficulty level the participants experienced in describing them. Notably, as the delay increased, the participants’ creativity and the length of their utterances significantly declined. In the context of verbal creativity, the authors refer to the ability to produce words with a greater number of syllables, enhanced speech fluency, as well as the temporal characteristics and manner of speech production.
The results presented in Figure 3 and Figure 4 indicate that delayed auditory feedback effectively disrupts speech fluency, compelling the participants to adopt a slower and more controlled speaking style while simultaneously impacting their focus. The decrease in speech tempo at higher delays likely stems from the need for more precise control over speech to mitigate disruptions caused by delayed feedback.
The experiment focused on analyzing the fundamental frequency (pitch) of vowels. For all vowels, a general trend was observed: the frequency of the fundamental pitch changed depending on the DAF delay. In the absence of delay (0 ms), the pitch was typically the highest, while increasing the delay caused a reduction in pitch frequency, suggesting that DAF affects voice stability control.
Among the vowels analyzed, the vowel ‘A’ exhibited the highest fundamental frequency, while the vowel ‘U’ had the lowest. These results reflect articulation characteristics and reduced resonance. The vowel ‘I’ demonstrated the highest pitch stability regardless of the delay value, indicating that it is less susceptible to disruptions caused by DAF. The greatest pitch variability was observed for the vowel ‘A,’ showing significant fluctuations. The results for both groups of subjects are presented in Table 2 and Table 3 below.
The data indicate that the group of women tested was more affected by the delay compared to the men, which contrasts with the existing literature [7]. For accurate sound production, vowels should be articulated effortlessly and without strain on the larynx. The typical fundamental frequency range for men is 85–180 Hz, and 165–255 Hz for women. The results from the existing literature and several factors may account for this discrepancy, ranging from the limited sample size to possible differences in task design, stimulus characteristics, or even cultural and linguistic background of the participants. It is also possible that individual variability within the female group was higher, leading to greater sensitivity to the delay condition. Further research with a larger and more diverse sample would be necessary to clarify the underlying causes of this divergence.
DAF-induced delays influence the pitch of the women, causing a reduction in frequency at higher delay values. At a delay of 360 ms, all vowels exhibited a decrease in pitch, which may indicate difficulties in maintaining controlled voice production under the maximum delay conditions. The 180 ms delay appeared to have the least impact on the pitch, suggesting it may represent a threshold value where the DAF effect on voice remains minimal.
Monitoring fundamental frequency is essential for accurate sound production and speech intelligibility. It also reflects emotional changes such as stress, tension, or uncertainty.
Formants are acoustic resonances generated during speech production, representing frequency bands with increased energy in the sound spectrum. These resonances are shaped by the articulatory system, including the throat, oral cavity, and nasal cavity. Special attention is given to the first and second formants: the first is related to the oral cavity, while the second reflects the tongue position. Other formants are associated with individual voice characteristics, such as timbre. Naturally, women exhibit higher formant values compared to men.
The process of formant estimation in Praat is based on spectral analysis of the speech signal and the identification of resonant frequencies corresponding to local maxima in the amplitude spectrum. A crucial part of this analysis involves both autocorrelation and Fourier transformation techniques, applied within moving time windows.
Initially, the speech signal is divided into short time frames. Each frame is then multiplied by a window function, ω ( t ) , using Hamming window, which helps minimize edge effects and improves frequency resolution.
a t = x t m i d T 2 + t μ x     ω t
t m i d —center of the analyzed frame;
T —window length;
t 0 , T —sliding variable within a window;
μ x —average signal value in the analyzed fragment.
Next, the normalized autocorrelation r a τ function is computed, allowing for the identification of dominant periodicities in the signal. The peak values of this function correspond to the dominant resonant frequencies.
r a τ = r a τ = 0 T τ a t a t + τ d t 0 T a 2 t d t
The implementation requires performing a Fourier transform followed by increasing the resolution of the frequency domain.
a ~ ω = a t e i ω t d t
The next step is to compute the inverse Fourier transform of the power spectral density, which enables a transition to the delay domain.
r a τ = a ~ ω 2 e i ω τ d ω 2 π
In the article by Paul Boersma, the creator of Praat, two methods of pitch analysis are described—zero-phased and cepstral—along with a discussion of their differences [30]. However, the current version of Praat employs an algorithm based on the Linear Predictive Coding (LPC) model. This method relies on the autocorrelation of the speech signal and models it as a resonant system, thereby indirectly utilizing spectral information.
Regarding formant stability, the men demonstrate greater susceptibility to changes under DAF conditions, particularly for the vowels ‘I’ and ‘U’. In contrast, women display more stable voice modulation under the same conditions, as shown in the graphs below (Figure 5 and Figure 6).
It is important to note that the obtained results are fully reliable, as no artificial conditions were imposed, requiring the participants to sustain vowels in isolation for several seconds. Instead, the analysis focused on vowel segments extracted from spontaneous speech recorded during the description of photographs. This data collection approach preserves naturalness and enhances the validity of the results within the context of the study’s conditions.
An analysis was conducted on the articulation time of 71 words repeated by each participant. The results showed that for the DAF delays of 250 ms and 360 ms, the articulation time increased for both women and men. In contrast, at the delay of 180 ms, the articulation time was shorter. Notably, the 180 ms delay not only reduced the articulation time but also increased the number of syllables in the overall speech compared to the 0 ms condition (no delay). This observation suggests that the participants were able to articulate more content under a shorter amount of time, indicating improved focus and greater speech efficiency.
The study also examined the number of syllables spoken during the description of the photographs. The results indicate that the number of syllables reflected the participants’ creativity and their ability to focus on the task or susceptibility to distraction. These data enabled the determination of speech tempo, which, for both women and men, was significantly higher at the 180 ms delay compared to the speech without a delay. These findings suggest that a 180 ms DAF delay may have a positive impact on speech fluency, making it a potentially useful tool for therapy aimed at improving speech fluency. It should be noted that the earlier conclusion is based on a sample of typical speakers. Therefore, further research is necessary to confirm these findings within clinical populations affected by speech disorders.
Additionally, jitter and shimmer parameters were analyzed. Jitter refers to irregularities in pitch, while shimmer measures the variation in acoustic wave intensity from one cycle to the next. Both parameters are determined by the characteristics of the vocal folds and are used to assess the extent to which a vowel in speech indicates voice pathology. In Praat, jitter is calculated based on the actual waveform of the voice signal rather than on a simulated model. It represents the average relative difference between successive pitch periods, thus reflecting the extent to which the duration of vocal fold vibrations varies over time.
J i t t e r = 1 N 1 i = 1 N 1 | T i T i + 1 | T ¯
T i —duration of the i-th period;
T ¯ —average duration of periods.
Analogous to jitter, shimmer refers to variations in amplitude between consecutive vocal cycles. It reflects the degree of instability in vocal intensity and is used to assess irregularities in voice signal amplitude over time.
Jitter is considered pathological at values exceeding 1.040%, and at values above 3.810% for shimmer. However, the reliability and utility of these parameters are debated, as they are susceptible to technical factors such as microphone interference, microphone quality, software type, and background noise. These circumstances can significantly influence jitter and shimmer measurements [31].
The analysis of the impact of text complexity and Delayed Auditory Feedback (DAF) on speech tempo indicates that both factors significantly influence reading fluency. The data show that as the delay increases, the speech tempo slows down. Notably, at the 250 ms delay, participants in both groups experienced clear difficulties in articulatory fluency. This finding suggests that delays of this magnitude significantly disrupt the natural speech process, leading to reduced speech tempo and extended reading times.
As illustrated in Figure 7, the men demonstrated better performance under the DAF conditions in terms of speech tempo compared to the women, which may indicate their greater tolerance to disruptions. However, the effect of delay was equally problematic for both groups when reading texts with lower complexity levels (FOG = 6). This suggests that speakers dedicate more attention to articulation control in simpler texts.
Importantly, the observed changes in speech tempo may affect text intelligibility, which is critical for effective communication. The results indicate partial adaptation by both women and men to the conditions of disrupted auditory feedback—particularly at the 360 ms delay—when reading texts with higher complexity levels. This adaptation highlights the potential for individuals to adjust to challenging speech conditions, though the process remains influenced by the complexity of the material being read.

5. Conclusions

The effect of auditory feedback is one of the phenomena that impacts healthy individuals by introducing speech disruptions. However, the ambiguity of the currently available findings prompted the authors to explore this topic further [7]. Auditory feedback significantly affects speech fluency and production, making it a crucial tool in working with individuals who have severe speech disorders as well as those capable of fluent speech (without disfluencies).
In summarizing the collected data, it can be concluded that DAF influences healthy individuals by slowing their speech and requiring greater focus when articulating subsequent words. The authors paid special attention to the results of multiple parameters, as they highlighted the participants’ susceptibility to delays. Observations suggest that the DAF effect significantly impacts the speech tempo and reading time, which could be relevant for speech therapy aimed at improving the articulation of both healthy individuals and those with disfluencies. This paper introduces a novelty by applying DAF in healthy individuals with specific speech parameters. The proposed approach improves the understanding of speech pathologies and may contribute to more effective therapeutic interventions within the target research field.
To expand the research on Delayed Auditory Feedback in the future, it would be beneficial to include a larger group of participants. With the support of speech therapists, extending the study to include younger individuals should be considered. To develop therapy for individuals with disfluencies based on the phenomenon of DAF, it would be valuable to also examine individuals at risk of or already affected by stuttering, Parkinson’s disease, or Alzheimer’s disease [32,33]. It is worth considering expanding the research to include physical parameters such as rapid movements of the face or skin during the examination. It is also worth checking whether these movements, e.g., rapid changes in facial expression, can be correlated or used to study this type of pathology together with DAF parameters.
During the course of the study, certain limitations were identified, including the characteristics of the sample, methodological constraints, measurement precision, and the size of the study group. It would be advisable to address these issues in future research to further improve the methodological approach.
In the future, a more precise investigation of jitter and shimmer parameters using advanced software would be desirable. Additionally, to better understand the impact of the DAF phenomenon on reading, selecting materials with a balanced number of syllables and a broad range of readability indices would help assess how text complexity affects participants.

Author Contributions

Conceptualization, P.S.; methodology, D.K.; validation, P.S. and D.K.; formal analysis, D.K.; investigation, D.K.; resources, D.K.; writing—original draft preparation, D.K.; writing—review and editing, P.S.; visualization, D.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The regulations of the Ethics Committee at Wrocław University of Science and Technology provide for the review of research projects “involving human participants,” although they do not precisely define the notion of participation. Instead, they indicate categories of participants for whom obtaining approval is required. This list is not exhaustive but exemplary (“in particular”), highlighting typical groups of research participants for whom a higher level of care and caution is recommended. At the same time, the regulations limit the review process to projects classified as “non-invasive empirical research,” meaning research that “does not interfere with the participant’s body” [34]. The internal regulations of universities must comply with national legislation; therefore, they are fully consistent with the legal framework in force in Poland.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data supporting the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Tomassi, A.; Falegnami, A.; Romano, E. Talking Resilience: Embedded Natural Language Cyber-Organizations by Design. Systems 2025, 13, 247. [Google Scholar] [CrossRef]
  2. Chon, H.; Kraft, S.J.; Zhang, J.; Loucks, T.; Ambrose, N.G. Individual Variability in Delayed Auditory Feedback Effects on Speech Fluency and Rate in Normally Fluent Adults. J. Speech Lang. Hear. Res. 2013, 56, 489–504. [Google Scholar] [CrossRef]
  3. Drayna, D.; Kang, C. Genetic approaches to understanding the causes of stuttering. J. Neurodev. Disord. 2011, 3, 374–380. [Google Scholar] [CrossRef]
  4. Neef, N.E.; Chang, S.-E. Knowns and unknowns about the neurobiology of stuttering. PLoS Biol. 2024, 22, e3002492. [Google Scholar] [CrossRef]
  5. Büchel, C.; Sommer, M. What Causes Stuttering? PLoS Biol. 2004, 2, e46. [Google Scholar] [CrossRef]
  6. Jastrzębowska, G. Zaburzenia neurorozwojowe. Zmiany w podejściu teoretycznym i diagnostycznym. Logopedia 2019, 48, 27–46. [Google Scholar]
  7. Corey, D.M.; Cuddapah, V.A. Delayed auditory feedback effects during reading and conversation tasks: Gender differences in fluent adults. J. Fluen. Disord. 2008, 33, 291–305. [Google Scholar] [CrossRef]
  8. Butler, R.A.; Galloway, F.T. Factoral Analysis of the Delayed Speech Feedback Phenomenon. J. Acoust. Soc. Am. 1957, 29, 632–635. [Google Scholar] [CrossRef]
  9. Neilson, S.P.; Burke, B.D.; Yates, A.J. (University of New England, Biddeford, ME, USA) Stability of susceptibility to DAF. Unpublished Research Report, 1967.
  10. Gates, A.; Bradshaw, J.L.; Nettleton, N.C. Effect of different delayed auditory feedback intervals on a music performance task. Percept. Psychophys. 1974, 15, 21–25. [Google Scholar] [CrossRef]
  11. Burke, B.D. Susceptibility to Delayed Auditory Feedback and Dependence on Auditory or Oral Sensory Feedback. J. Commun. Disord. 1975, 8, 75–96. [Google Scholar] [CrossRef]
  12. Vrtunski, P.B.; Martinez, M.; Boller, F. Evaluation of Delayed Auditory Feedback (DAF) Effect: Comparison Between Subjective Judgments and Objective Measures. Cortex 1979, 15, 337–341. [Google Scholar] [CrossRef]
  13. Fukawa, T.; Yoshida, S. Sex difference in susceptibility to delayed auditory feedback in oral reading tasks. Shinrigaku Kenkyu Jpn. J. Psychol. 1988, 59, 144–150. [Google Scholar] [CrossRef]
  14. Takaso, H.; Eisner, F.; Wise, R.J.; Scott, S.K. The effect of delayed auditory feedback on activity in the temporal lobe while speaking: A positron emission tomography study. J. Speech Lang. Hear. Res. 2010, 53, 226–236. [Google Scholar] [CrossRef]
  15. Bahadorinejad, A.; Almasganj, F. Delayed Auditory Feedback for Speech Disorders. In Proceedings of the 2012 International Conference on Biomedical Engineering (ICoBE), Penang, Malaysia, 27–28 February 2012; pp. 585–588. [Google Scholar]
  16. Ozker, M.; Doyle, W.; Devinsky, O.; Flinker, A. A cortical network processes auditory error signals during human speech production to maintain fluency. PLoS Biol. 2022, 20, e3001493. [Google Scholar] [CrossRef]
  17. Van Borsel, J.; Reunes, G.; Van den Berh, N. Delayed auditory feedback in the treatment of stuttering: Clients as consumers. Int. J. Lang. Commun. Disord. 2003, 38, 119–129. [Google Scholar] [CrossRef]
  18. Woźniak, T. Zaburzenia płynności mowy-stan badań i praktyki logopedycznej na początku XXI wieku. Pol. Tow. Logop. 2018, 47, 141–156. [Google Scholar]
  19. Luzhnov, P.; Shmatko, V. Development of a speech delay device for patients with logoneurosis. In Proceedings of the 2022 IEEE International Multi-Conference on Engineering, Computer and Information Sciences (SIBIRCON), Yekaterinburg, Russia, 11–13 November 2022; pp. 540–543. [Google Scholar]
  20. Kaźmierczak, M. Giełkot-historia i współczesność. Prace Językoznawcze 2024, 26, 219–233. [Google Scholar] [CrossRef]
  21. Boersma, P.; van Heuven, V. Speak and unSpeak with Praat. Glot Int. 2001, 5, 341–347. [Google Scholar]
  22. Yang, W.; Zhao, X. Research on the Function of Visual Phonetic Software Praat in Vocational English Phonetics Teaching. J. Phys. Conf. Ser. 2021, 1856, 012057. [Google Scholar] [CrossRef]
  23. de Jong, N.H.; Wempe, T. Automatic measurement of speech rate in spoken Dutch. ACLC Work. Pap. 2007, 2, 51–60. [Google Scholar]
  24. Małek-Orłowska, M.; Jach, K. Aspekty normatywne i aktualna sytuacja komisji etyki badań naukowych z udziałem ludzi na polskich uczelniach technicznych. Diametros 2022, 74, 19–35. [Google Scholar] [CrossRef]
  25. Czarkowski, M. Teoria i praktyka działania komisji bioetycznych. In Badania naukowe z udziałem ludzi w biomedycynie. Standardy międzynarodowe; Wolters Kluwer Poland: Warszawa, Poland, 2012; pp. 181–199. [Google Scholar]
  26. Świeczkowski, D.; Kułacz, S. The use of the Gunning Fog Index to evaluate the readability of Polish and English drug leaflets in the context of Health Literacy challenges in Medical Linguistics: An exploratory study. Cardiol. J. 2021, 28, 627–631. [Google Scholar] [CrossRef]
  27. Broda, B.; Ogrodniczuk, M.; Nitoń, B.; Gruszczyński, W. Measuring Readability of Polish Texts: Baseline Experiments. In Proceedings of the Ninth International Conference on Language Resources and Evaluation, Reykjavik, Iceland, 26–31 May 2014; European Language Resources Association: Reykjavik, Iceland, 2014; pp. 573–580. [Google Scholar]
  28. Michalik, M.; Cholewiak, A. Tempo wypowiedzi w oligofazji. Logopedia 2017, 46, 267–283. [Google Scholar]
  29. Woźniak, T.; Soboń, J. Ocena płynności mówienia. Nowa Audiofonia 2015, 4, 9–19. [Google Scholar] [CrossRef]
  30. Boersma, P. Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-to-Noise Ratio of a Sampled Sound. IFA Proc. 1993, 17, 97–110. [Google Scholar]
  31. Brockmann-Bauser, M.; Beyer, D.; Bohlender, J.E. Clinical relevance of speaking voice intensity effects on acoustic jitter and shimmer in children between 5;0 and 9;11 years. Int. J. Pediatr. Otorhinolaryngol. 2014, 78, 2121–2126. [Google Scholar] [CrossRef]
  32. Blanchet, P.G.; Hoffman, P.R. Factors Influencing the Effects of Delayed Auditory Feedback on Dysarthric Speech Associated with Parkinson’s Disease. Commun. Disord. Deaf Stud. Hear. Aids 2014, 2, 106. [Google Scholar]
  33. Michalik, M.; Milewski, S.; Kaczorowska-Bray, K.; Solak, A.; Krajwska, M. Tempo artykulacji i tempo mówienia w otępieniu alzheimerowskim. Logopedia 2019, 48, 231–250. [Google Scholar]
  34. Politechnika Wrocławska. 2021. Available online: https://nauka.pwr.edu.pl/zadania/komisja-ds-etyki-badan-naukowych-politechniki-wroclawskiej/regulamin-komisji-ds-etyki-badan-naukowych (accessed on 16 September 2022).
Figure 1. Diagram of the prepared layout used to conduct the study.
Figure 1. Diagram of the prepared layout used to conduct the study.
Applsci 15 07524 g001
Figure 2. Screenshot of the DAF application during testing with a selected delay of 250 ms.
Figure 2. Screenshot of the DAF application during testing with a selected delay of 250 ms.
Applsci 15 07524 g002
Figure 3. Approximate speech tempo results for women at the given delays for each of fotography (fot).
Figure 3. Approximate speech tempo results for women at the given delays for each of fotography (fot).
Applsci 15 07524 g003
Figure 4. Approximate speech tempo results for men at the given delays for each of fotography (fot).
Figure 4. Approximate speech tempo results for men at the given delays for each of fotography (fot).
Applsci 15 07524 g004
Figure 5. Standard deviation of formants for vowel ‘A’ at different DAF delay levels in women.
Figure 5. Standard deviation of formants for vowel ‘A’ at different DAF delay levels in women.
Applsci 15 07524 g005
Figure 6. Standard deviation of formants for vowel ‘A’ at different DAF delay levels in men.
Figure 6. Standard deviation of formants for vowel ‘A’ at different DAF delay levels in men.
Applsci 15 07524 g006
Figure 7. Average speech tempo for reading texts by study participants at specific delays in woman (w) and men (m).
Figure 7. Average speech tempo for reading texts by study participants at specific delays in woman (w) and men (m).
Applsci 15 07524 g007
Table 1. Overview of selected studies on Delayed Auditory Feedback (DAF) and the delay intervals used. The 0 ms condition, typically serving as a control, was omitted for brevity. The table supports the selection of DAF intervals employed in the current study.
Table 1. Overview of selected studies on Delayed Auditory Feedback (DAF) and the delay intervals used. The 0 ms condition, typically serving as a control, was omitted for brevity. The table supports the selection of DAF intervals employed in the current study.
AuthorsTitleDelayed [ms]
Robert A. Butler, F. Thomas GallowayFactoral Analysis of the Delayed Speech Feedback Phenomenon [8]85, 170, 255, 340, random
S.P. Neilson, B.D. Burke, A.J. YatesStability of Susceptibility to DAF [9]80, 180, 280
Anne Gatest, John L. Bradshaw, Norman C. NettletonEffect of Different Delayed Auditory Feedback Intervals on a Music Performance Task [10] 10, 13, 18, 20, 27, 35, 48, 55, 65, 75, 85, 105
Bryan D. BurkeSusceptibility to Delayed Auditory Feedback and Dependence on Auditory or Oral Sensory Feedback [11]50, 100, 200, 400
P.B. Vrtunski, Mark Martinez, Francois BollerEvaluation of Delayed Auditory Feedback (DAF) Effect: Comparison Between Subjective Judgments and Objective Measures [12]180, 360
T. Fukawa, S. YoshidaSex Difference in Susceptibility to Delayed Auditory Feedback in Oral Reading Tasks [13]150, 200, 300, 400, 500
David M. Corey, Vishnu Anand CuddapahDelayed Auditory Feedback Effects During Reading and Conversation Tasks: Gender Differences in Fluent Adults [7]180, 230, 280, 330, 380
Hideki Takaso, Frank Eisner, Richard J.S. Wise, Sophie K. ScottThe Effect of Delayed Auditory Feedback on Activity in the Temporal Lobe While Speaking: A Positron Emission Tomography Study [14]50, 125, 200
Arghavan Bahadorinejad, Farshad AlmasganjDelayed Auditory Feedback for speech disorders [15]25-75, 75-250
Heechong Chon, Shelly Jo Kraft, Jingfei Zhang, Torrey Loucks, Nicoline G. AmbroseIndividual Variability in Delayed Auditory Feedback Effects on Speech Fluency and Rate in Normally Fluent Adults [2]250
Table 2. Results for vowel ‘A’ in the Polish word “las” (ang. Forest) in women (w).
Table 2. Results for vowel ‘A’ in the Polish word “las” (ang. Forest) in women (w).
Las
DelayLaryngeal ToneFormantsJitterShimmerTime
[ms][Hz]1234[%][%][s]
w10231.88827.602586.282586.283888.090.6443.0840.210
180209.72744.252603.532603.533706.120.5843.6040.192
250225.32734.262650.142650.143793.150.8872.9540.150
360215.23822.902747.962747.963837.261.1805.3190.154
w20209.49852.681521.672582.343895.520.4534.9870.289
180203.25874.091686.672833.924554.520.3745.2950.307
250215.41874.621685.212874.623444.651.3266.7810.351
360210.33878.361683.332714.503967.030.4577.9100.136
w30222.48836.911478.212673.313120.440.65410.6840.245
180205.01979.401579.972724.053440.651.16911.4580.291
250227.87923.111607.132932.373482.391.12314.0650.197
360224.14947.851629.632886.723547.171.42116.1630.238
w40223.69834.721500.412685.584048.660.4735.8310.172
180205.64859.871576.312745.994326.920.7995.0990.156
250264.17811.681590.032603.314575.022.10110.0530.186
360248.93696.191490.292722.674560.491.28911.4310.149
w50212.77845.441545.452567.223221.951.0278.2270.187
180207.58867.921572.102613.943569.740.69510.2300.182
250226.31842.991601.202603.453477.231.21611.6820.196
360229.74861.021677.122760.743967.531.1208.5960.192
Table 3. Results for vowel ’A’ in polish word “las” (ang. Forest) in men (m).
Table 3. Results for vowel ’A’ in polish word “las” (ang. Forest) in men (m).
Las
DelayLaryngeal toneFormantsJitterShimmerTime
[ms][Hz]1234[%][%][s]
m10116.79541.011189.922474.822955.150.9094.9640.158
180113.35573.271179.772563.033529.237.41315.4700.140
250123.39555.421157.772424.122974.536.43810.2500.165
360107.31482.561160.832538.483598.163.88025.2630.242
m20113.79457.011360.082673.053694.221.2006.1200.073
180119.76541.271351.852871.713562.361.2558.0410.163
250104.45472.931545.382768.563550.252.4198.0900.169
360119.03532.411222.552557.553503.260.8024.7610.164
m30157.57567.941048.122465.643795.604.39918.4160.125
180107.16525.671238.332538.773691.863.42114.9570.259
250140.25513.591287.812502.873945.332.35614.7850.169
360169.14579.841311.172528.194032.853.37312.7480.205
m40141.98731.021239.862684.843640.492.1369.9600.104
180159.62650.701231.502577.453252.220.8143.7390.268
250166.80645.711233.042722.863354.400.9746.6470.176
360150.21394.861157.292695.023486.300.7375.7500.159
m50120.74660.721387.292428.003901.290.6357.6920.245
180126.30686.061398.442589.613943.210.57315.4940.183
250134.61643.961379.292509.083983.960.73412.6660.324
360163.91667.371437.482568.833790.441.24713.5380.147
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kanty, D.; Staroniewicz, P. Tests of the Influence of DAF (Delayed Auditory Feedback) on Changes in Speech Signal Parameters. Appl. Sci. 2025, 15, 7524. https://doi.org/10.3390/app15137524

AMA Style

Kanty D, Staroniewicz P. Tests of the Influence of DAF (Delayed Auditory Feedback) on Changes in Speech Signal Parameters. Applied Sciences. 2025; 15(13):7524. https://doi.org/10.3390/app15137524

Chicago/Turabian Style

Kanty, Dominika, and Piotr Staroniewicz. 2025. "Tests of the Influence of DAF (Delayed Auditory Feedback) on Changes in Speech Signal Parameters" Applied Sciences 15, no. 13: 7524. https://doi.org/10.3390/app15137524

APA Style

Kanty, D., & Staroniewicz, P. (2025). Tests of the Influence of DAF (Delayed Auditory Feedback) on Changes in Speech Signal Parameters. Applied Sciences, 15(13), 7524. https://doi.org/10.3390/app15137524

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop