Distribution and Acoustic Characteristics of Filled Pauses in Spontaneous Urdu Speech
Abstract
1. Introduction
2. Review of Relevant Literature
2.1. The Function of Filled Pauses
2.2. Characteristics of Filled Pauses
2.3. Filled Pauses in Urdu
- (1) How are different types of FPs distributed across segmental contexts and utterance positions in spontaneous Urdu speech?
- (2) Do filled pause types differ systematically in their acoustic realization?
- (3) How do utterance position and segmental context condition the distribution and acoustic realization of FPs in Urdu?
3. Corpus and Methods
3.1. Segmentation, Annotation, and Frequency Determination of FPs
3.2. Acoustic Measurements
3.3. Statistical Analysis
4. Results
4.1. Distribution of FPs
4.2. Distribution and Acoustic Characteristics of FP Types
4.3. Contextual Position of FPs
4.4. FPs’ Position in the Utterance
4.5. Acoustics of Filled Pauses in Urdu
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
1. In line with Swerts (1998) and Clark and Fox Tree (2002), FPs occurring with surrounding silence are interpreted as major delays (often associated with um), whereas FPs occurring between words are interpreted as minor delays (often associated with uh).
2. Throughout the manuscript, the term nasal refers specifically to a nasal consonant (i.e., [m]/[n]) and not to nasalization of the vowel.
3. https://github.com/stylerw/styler_praat_scripts (accessed on 5 July 2025).
References
- Belz, M. (2021). The phonetics of “äh” and “ähm”: Acoustic variation of filled particles in German. Metzler.
- Beňuš, Š. (2009, September 6–10). Variability and stability in collaborative dialogues: Turn-taking and filled pauses. INTERSPEECH 2009, 10th Annual Conference of the International Speech Communication Association (pp. 796–799), Brighton, UK.
- Betz, S. (2020). Hesitations in spoken dialogue systems [Doctoral dissertation, Bielefeld University].
- Boersma, P., & Weenink, D. (2015). Praat: Doing phonetics by computer (Version 5.4) [Computer software]. University of Amsterdam. Available online: http://www.praat.org/ (accessed on 20 January 2025).
- Candea, M., Vasilescu, I., & Adda-Decker, M. (2005, September 10–12). Inter- and intra-language acoustic analysis of autonomous fillers. Disfluency in Spontaneous Speech (DiSS 2005) (pp. 47–51), Aix-en-Provence, France.
- Cataldo, V., Schettino, L., Savy, R., Poggi, I., Origlia, A., Ansani, A., Sessa, I., & Chiera, A. (2019). Phonetic and functional features of pauses, and concurrent gestures, in tourist guides’ speech. Audio Archives at the Crossroads of Speech Sciences, Digital Humanities and Digital Heritage, 6, 205–231.
- Christenfeld, N. (1994). Options and ums. Journal of Language and Social Psychology, 13(2), 192–199.
- Clark, H. H., & Fox Tree, J. E. (2002). Using uh and um in spontaneous speaking. Cognition, 84(1), 73–111.
- de Boer, M. M., & Heeren, W. F. (2020). Cross-linguistic filled pause realization: The acoustics of uh and um in native Dutch and non-native English. The Journal of the Acoustical Society of America, 148(6), 3612–3622.
- de Jong, N. H. (2016). Predicting pauses in L1 and L2 speech: The effects of utterance boundaries and word frequency. International Review of Applied Linguistics in Language Teaching, 54, 113–132.
- De Leeuw, E. (2007). Hesitation markers in English, German, and Dutch. Journal of Germanic Linguistics, 19(2), 85–114.
- Eklund, R. (2004). Disfluency in Swedish human–human and human–machine travel booking dialogues [Doctoral dissertation, Linköping University].
- Finlayson, I. R., & Corley, M. (2012). Disfluency in dialogue: An intentional signal from the speaker? Psychonomic Bulletin & Review, 19, 921–928.
- Fischer, K., Niebuhr, O., Novák-Tót, E., & Jensen, L. C. (2017, March 6–9). Strahlt die negative Reputation von Häsitationsmarkern auf ihre Sprecher aus? 43rd Annual Meeting of the German Acoustical Society (DAGA 2017) (pp. 1450–1453), Kiel, Germany.
- Goldman-Eisler, F. (1968). Psycholinguistics: Experiments in spontaneous speech. Academic Press.
- Götz, S. (2013). Fluency in native and nonnative English speech. John Benjamins Publishing Company.
- Horváth, V. (2010). Filled pauses in Hungarian: Their phonetic form and function. Acta Linguistica Hungarica (Since 2017 Acta Linguistica Academica), 57(2–3), 288–306.
- Hughes, V., Wood, S., & Foulkes, P. (2016). Strength of forensic voice comparison evidence from the acoustics of filled pauses. The International Journal of Speech, Language and the Law, 23(1), 99–132.
- Jabeen, F., & Betz, S. (2022, September 18–22). Hesitations in Urdu/Hindi: Distribution and properties of fillers and silences. Interspeech 2022 (pp. 3113–3117), Incheon, Republic of Korea.
- Jabeen, F., & Wagner, P. (2023, August 28–30). Variability in hesitations in Punjabi semi-spontaneous narrative speech: An automatic clustering based analysis. Disfluency in Spontaneous Speech (DiSS) Workshop 2023, Bielefeld, Germany.
- Kirjavainen, M., Crible, L., & Beeching, K. (2022). Can filled pauses be represented as linguistic items? Investigating the effect of exposure on the perception and production of um. Language and Speech, 65(2), 263–289.
- Kjellmer, G. (2003). Hesitation. In defence of er and erm. English Studies, 84(2), 170–198.
- Kosmala, L., & Crible, L. (2022). The dual status of filled pauses: Evidence from genre, proficiency and co-occurrence. Language and Speech, 65(1), 216–239.
- Levelt, W. J. (1983). Monitoring and self-repair in speech. Cognition, 14(1), 41–104.
- Levelt, W. J. (1993). Speaking: From intention to articulation. MIT Press.
- Lickley, R. J. (2015). Fluency and disfluency. In M. A. Redford (Ed.), The handbook of speech production (pp. 445–474). John Wiley.
- Maclay, H., & Osgood, C. E. (1959). Hesitation phenomena in spontaneous English speech. Word, 15(1), 19–44.
- Maekawa, K., & Mori, H. (2017). Comparison of voice quality between the vowels in filled pauses and ordinary lexical items. Journal of the Phonetic Society of Japan, 21(3), 53–62.
- Niebuhr, O., & Fischer, K. (2019, September). Do not hesitate!—Unless you do it shortly or nasally: How the phonetics of filled pauses determine their subjective frequency and perceived speaker performance. In Interspeech 2019 (pp. 544–548). International Speech Communication Association.
- O’Connell, D. C., & Kowal, S. (2005). Uh and um revisited: Are they interjections for signaling delay? Journal of Psycholinguistic Research, 34, 555–576.
- O’Shaughnessy, D. (1992, March). Recognition of hesitations in spontaneous speech. In IEEE international conference on Acoustics, Speech, and Signal Processing (Vol. 1, pp. 521–524). IEEE Computer Society.
- Oviatt, S. (1995). Predicting spoken disfluencies during human-computer interaction. Computer Speech and Language, 9(1), 19–36.
- Paschen, L. (2023, August 28–30). Filled pauses and false starts do not reliably preface longer or more complex utterances across typologically diverse languages. Proceedings of the Disfluency in Spontaneous Speech (DiSS) Workshop 2023 (pp. 13–17), Bielefeld, Germany.
- Schnadt, M. J., & Corley, M. (2006, July 26–29). The influence of lexical, conceptual and planning based factors on disfluency production. Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 28, No. 28), Vancouver, BC, Canada.
- Shriberg, E. (2001). To ‘errrr’ is human: Ecology and acoustics of speech disfluencies. Journal of the International Phonetic Association, 31(1), 153–169.
- Shriberg, E. E. (1994). Preliminaries to a theory of speech disfluencies [Doctoral dissertation, University of California].
- Shriberg, E. E., & Lickley, R. J. (1993). Intonation of clause-internal filled pauses. Phonetica, 50(3), 172–179.
- Smith, V. L., & Clark, H. H. (1993). On the course of answering questions. Journal of Memory and Language, 32(1), 25–38.
- Swerts, M. (1998). Filled pauses as markers of discourse structure. Journal of Pragmatics, 30(4), 485–496.
- Torreira, F., Adda-Decker, M., & Ernestus, M. (2010). The Nijmegen corpus of casual French. Speech Communication, 52(3), 201–212.
- Tottie, G. (2016). Planning what to say: Uh and um among the pragmatic markers. In Outside the clause (pp. 97–122). John Benjamins Publishing Company.
- Tottie, G. (2020). Word-search as word-formation?: The case of “Uh” and “Um”. In Crossing linguistic boundaries: Systemic, synchronic and diachronic variation in English (pp. 29–42). Bloomsbury Academic.
- Watanabe, M., Hirose, K., Den, Y., & Minematsu, N. (2008). Filled pauses as cues to the complexity of upcoming phrases for native and non-native listeners. Speech Communication, 50(2), 81–94.
- Wieling, M., Grieve, J., Bouma, G., Fruehwald, J., Coleman, J., & Liberman, M. (2016). Variation and change in the use of hesitation markers in Germanic languages. Language Dynamics and Change, 6(2), 199–234.




| | (1) | (2) | (3) | (4) | (5) |
|---|---|---|---|---|---|
| Variables | logfp | f0sem | LOB_F1 | LOB_F2 | LOB_Intensity |
| Vocalic | 0.569 *** | 2.054 | −0.645 * | −0.412 | −0.219 |
| | (0.000) | (0.432) | (0.038) | (0.186) | (0.488) |
| Constant | −1.704 *** | 16.236 *** | 0.922 ** | 0.543 | −0.238 |
| | (0.000) | (0.000) | (0.012) | (0.141) | (0.525) |
| Observations | 190 | 190 | 188 | 188 | 188 |
| Number of groups | 16 | 16 | 14 | 14 | 14 |

| | (1) | (2) | (3) | (4) | (5) |
|---|---|---|---|---|---|
| Variables | logfp | f0sem | LOB_F1 | LOB_F2 | LOB_Intensity |
| SW | 0.099 | −0.928 | −0.287 | −0.178 | 0.011 |
| | (0.164) | (0.596) | (0.162) | (0.385) | (0.958) |
| WS | 0.039 | 2.320 | 0.090 | 0.176 | 0.213 |
| | (0.741) | (0.416) | (0.787) | (0.596) | (0.528) |
| WW | 0.060 | −5.495 | −0.507 | −0.355 | 0.613 |
| | (0.688) | (0.136) | (0.260) | (0.430) | (0.181) |
| Medial | −0.209 * | −3.768 | 0.104 | −0.021 | 0.591 * |
| | (0.034) | (0.120) | (0.725) | (0.942) | (0.048) |
| Final | 0.091 | −7.405 * | −0.410 | −0.605 | 0.519 |
| | (0.438) | (0.010) | (0.237) | (0.081) | (0.141) |
| Single | −0.274 ** | −5.639 * | −0.010 | 0.218 | 0.715 * |
| | (0.007) | (0.024) | (0.972) | (0.471) | (0.020) |
| Constant | −1.185 *** | 19.763 *** | 0.186 | 0.116 | −0.618 |
| | (0.000) | (0.000) | (0.580) | (0.730) | (0.071) |
| Observations | 179 | 179 | 177 | 177 | 177 |
| Number of groups | 16 | 16 | 14 | 14 | 14 |
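
The column names in the tables above suggest three transformations of the raw measurements, and the sketch below illustrates one plausible reading of them. It assumes that `logfp` is the natural log of FP duration, that `f0sem` is f0 expressed in semitones relative to a fixed reference, and that the `LOB_` prefix denotes Lobanov (per-speaker z-score) normalization. None of these readings is confirmed by the text available here. The function names, the 50 Hz reference, and the use of the sample standard deviation are illustrative assumptions, not the authors' procedure.

```python
import math
from statistics import mean, stdev


def log_duration(duration_s):
    """Natural log of a duration in seconds (assumed reading of `logfp`)."""
    return math.log(duration_s)


def hz_to_semitones(f0_hz, ref_hz=50.0):
    """Convert f0 in Hz to semitones above a reference frequency.

    The 50 Hz reference is a hypothetical default, not taken from the paper.
    """
    return 12.0 * math.log2(f0_hz / ref_hz)


def lobanov(values_by_speaker):
    """Z-score each speaker's values against that speaker's own mean and SD.

    `values_by_speaker` maps a speaker ID to a list of raw measurements
    (for example F1 in Hz). The sample SD is used here; some implementations
    use the population SD instead.
    """
    normalized = {}
    for speaker, values in values_by_speaker.items():
        speaker_mean = mean(values)
        speaker_sd = stdev(values)  # requires at least two values per speaker
        normalized[speaker] = [(v - speaker_mean) / speaker_sd for v in values]
    return normalized
```

Normalizing within speaker removes differences in vocal tract size before formant values are compared across speakers, which would be consistent with the formant and intensity columns having constants near zero.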
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Zahid, S.; Lee, H.-Y.; Mahmood, M.A. Distribution and Acoustic Characteristics of Filled Pauses in Spontaneous Urdu Speech. Languages 2026, 11, 34. https://doi.org/10.3390/languages11030034

