Speech Segmentation with Prosodic and Statistical Cues Is Language-Specific in Infancy
Abstract
1. Introduction
1.1. Prosody in Speech Segmentation
1.2. Statistical Cues in Speech Segmentation
1.3. Cue Weighting
1.4. Cross-Linguistic Differences in Infant- and Child-Directed Speech
2. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Language | Children | Transcripts | Youngest (Months) | Oldest (Months) | Sentences (SD) | Word Tokens (SD) | Word Types (SD) | Syllable Tokens (SD) | Syllable Types (SD) |
---|---|---|---|---|---|---|---|---|---|
Dutch | 2 | 7 | 19.20 | 36.30 | 291.60 (140.00) | 1306.00 (652.00) | 293.90 (83.60) | 1694.10 (819.00) | 318.40 (74.10) |
English | 29 | 29 | 1.50 | 4.00 | 374.20 (214.90) | 1252.30 (778.90) | 229.00 (98.90) | 1469.30 (910.40) | 258.70 (107.60) |
Estonian | 3 | 14 | 19.80 | 49.10 | 178.80 (123.20) | 776.80 (604.70) | 261.10 (148.10) | 1228.40 (887.40) | 255.10 (113.40) |
German | 2 | 63 | 10.00 | 59.10 | 106.80 (60.00) | 538.10 (267.0) | 24.80 (73.90) | 750.60 (352.20) | 250.90 (75.60) |
Hungarian | 5 | 57 | 32.20 | 59.90 | 251.60 (189.50) | 937.80 (747.20) | 344.70 (227.00) | 1592.20 (1279.7) | 386.00 (215.90) |
Italian | 3 | 14 | 16.10 | 40.30 | 186.50 (162.60) | 926.50 (754.00) | 307.10 (183.70) | 1719.30 (1454.0) | 233.30 (96.10) |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Marimon, M.; Saksida, A.; Höhle, B.; Langus, A. Speech Segmentation with Prosodic and Statistical Cues Is Language-Specific in Infancy. Languages 2025, 10, 240. https://doi.org/10.3390/languages10090240