Examining Speech Perception–Production Relationships Through Tone Perception and Production Learning Among Indonesian Learners of Mandarin
Abstract
1. Introduction
1.1. Transfer of Learning Effects Across Domains
1.2. Learning Effect on Perceptual Cue Weighting
1.3. The Present Study
2. Materials and Methods
2.1. Participants
2.2. General Procedures
2.3. Production Task
2.3.1. Materials
2.3.2. Procedures
2.3.3. Acoustic Measurements and Modeling
2.4. Perception Experiment
2.4.1. Stimuli
2.4.2. Procedures
2.4.3. Statistical Approach
3. Results
3.1. Perception Model Classification Results
3.2. Production Model Classification Results
3.3. Correlation Analysis of Perception and Production Gains
4. Discussion
4.1. L2 Perception–Production Links Established Through F0 Slope, the Critical Perceptual Cue
4.2. L2 Learning Effect on F0 Mean, the Non-Critical Perceptual Cue
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Leung, K.K.W.; Lu, Y.-A.; Wang, Y. Examining Speech Perception–Production Relationships Through Tone Perception and Production Learning Among Indonesian Learners of Mandarin. Brain Sci. 2025, 15, 671. https://doi.org/10.3390/brainsci15070671