Next Article in Journal
Suicide-Related Knowledge among Italian Early Career Psychiatrists and Trainees: Results from a Cross-Sectional Survey
Next Article in Special Issue
Increased Pre-Boundary Lengthening Does Not Enhance Implicit Intonational Phrase Perception in European Portuguese: An EEG Study
Previous Article in Journal
Intermodulation from Unisensory to Multisensory Perception: A Review
Previous Article in Special Issue
Native Listeners’ Use of Information in Parsing Ambiguous Casual Speech
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Explaining L2 Lexical Learning in Multiple Scenarios: Cross-Situational Word Learning in L1 Mandarin L2 English Speakers

1
The MARCS Institute for Brain, Behaviour, and Development, Western Sydney University, Penrith, NSW 2751, Australia
2
Australian Research Council Centre of Excellence for the Dynamics of Language, Canberra, ACT 2601, Australia
3
Department of Linguistics, University of Konstanz, 78457 Konstanz, Germany
*
Author to whom correspondence should be addressed.
Brain Sci. 2022, 12(12), 1618; https://doi.org/10.3390/brainsci12121618
Submission received: 23 May 2022 / Revised: 3 November 2022 / Accepted: 15 November 2022 / Published: 25 November 2022
(This article belongs to the Special Issue Auditory and Phonetic Processes in Speech Perception)

Abstract

:
Adults commonly struggle with perceiving and recognizing the sounds and words of a second language (L2), especially when the L2 sounds do not have a counterpart in the learner’s first language (L1). We examined how L1 Mandarin L2 English speakers learned pseudo English words within a cross-situational word learning (CSWL) task previously presented to monolingual English and bilingual Mandarin-English speakers. CSWL is ambiguous because participants are not provided with direct mappings of words and object referents. Rather, learners discern word-object correspondences through tracking multiple co-occurrences across learning trials. The monolinguals and bilinguals tested in previous studies showed lower performance for pseudo words that formed vowel minimal pairs (e.g., /dit/-/dɪt/) than pseudo word which formed consonant minimal pairs (e.g., /bɔn/-/pɔn/) or non-minimal pairs which differed in all segments (e.g., /bɔn/-/dit/). In contrast, L1 Mandarin L2 English listeners struggled to learn all word pairs. We explain this seemingly contradicting finding by considering the multiplicity of acoustic cues in the stimuli presented to all participant groups. Stimuli were produced in infant-directed-speech (IDS) in order to compare performance by children and adults and because previous research had shown that IDS enhances L1 and L2 acquisition. We propose that the suprasegmental pitch variation in the vowels typical of IDS stimuli might be perceived as lexical tone distinctions for tonal language speakers who cannot fully inhibit their L1 activation, resulting in high lexical competition and diminished learning during an ambiguous word learning task. Our results are in line with the Second Language Linguistic Perception (L2LP) model which proposes that fine-grained acoustic information from multiple sources and the ability to switch between language modes affects non-native phonetic and lexical development.

1. Introduction

Learning a second language (L2) in adulthood is difficult. Adults typically require additional time and exposure to their target L2 compared to younger learners to reach native-like proficiency (e.g., [1,2]), and commonly exhibit prolonged difficulty with pronunciation [3,4,5,6], and lexical access [7,8]. Most researchers agree that these difficulties stem in part from differences between learners’ L1 and L2 systems. In the realm of speech learning, whether an L2 phoneme contrast is easy or difficult for a learner to perceive highly depends on how L1 speech sounds compare to those in the target L2, as advocated by most cross-language and L2 speech learning models (e.g., [9,10,11,12]). For instance, the Second Language Linguistic Perception model (L2LP; [11,12,13,14]) describes possible L2 learning scenarios depending on the acoustic proximity and overlap between L1 and L2 phoneme categories. Within the L2LP model, when two phonemes of an L2 contrast are acoustically closest to a single L1 category, learners face a NEW scenario, and have difficulty perceiving the difference between the “new” L2 contrast (e.g., English /i/-/ɪ/ for L1 Catalan, Japanese, Mandarin, Polish, Portuguese, or Russian speakers, cf. (for a review see [13]). Learners face a SIMILAR scenario when two L2 categories are close matches of an acoustically similar L1 contrast (e.g., English /æ/-/ε/ for L1 Spanish speakers or English /i/-/ɪ/ for L1 Japanese speakers, cf. [14]). In this case, learners can replicate their existing L1 categories in the L2 but adjust their boundary as needed so that they match the L2 contrast. The SIMILAR scenario is suggested to be less problematic for the L2 learner since, unlike the NEW scenario, it does not require creating new phonological categories [12,14,15].
Non-native vowel perception studies support the claim that differences in the acoustic realization of L1 and L2 phonemes influence L2 speech perception (e.g., [16,17]). For instance, Alispahic and colleagues [17] tested Australian English (AusE) and Peruvian Spanish native speakers’ discrimination of Dutch vowel contrasts. They made predictions about which Dutch contrasts would be easy and difficult for AusE monolinguals to discriminate and which would be easy and difficult for Peruvian Spanish monolinguals based on the unique acoustic relationships between L1 and L2 categories for each language. Indeed, they found that performance matched these predictions for both groups of speakers, supporting the idea that acoustic properties impact listeners’ perception of non-native sounds (see [18]) and that listeners use perceptual cues from their L1 when categorizing non-native contrasts.
L2LP additionally proposes that the difficulties and relative ease in perceiving certain L2 contrasts based on the L1–L2 acoustic relationship extend to word learning and recognition [11,12,13,19,20,21,22,23]. Specifically, L2 learning of word pairs differing in a single phonological category—also known as minimal pairs—is predicted and explained by L2 learners’ perceptual difficulty, which in turn is based on the acoustic comparisons of L1 and L2 categories (e.g., [15,20,21]. For instance, [20] tested learning of Dutch minimal and non-minimal pairs in L1 Spanish speakers learning Dutch. Word pairs were separated into easy and difficult categories based on whether these contrasts exist in Spanish and are thus NEW for learners and predicted to be difficult, such as /i-ɪ/—Spanish only has /i/—or whether the L2 contrasts are similar to Spanish and predicted to be easy, such as /ɪ-a/. In contrast with Dutch native speakers who performed equally well for all minimal word pairs, the L1 Spanish speakers performed worse for the difficult minimal pairs compared to the easy minimal pairs, confirming the L2LP proposal that L2 perceptual difficulty influences L2 lexical representations and L2 word learning.
In the present study, we examined L1 Mandarin L2 English learners’ ability to learn English words in an ambiguous cross-situational word learning paradigm (CSWL) containing English sounds that do not exist in their L1 and compared their performance to English monolinguals. Specifically, we tested whether these L2 learners (a) perform equally or worse than English monolinguals in ambiguous word learning situations and if so, whether (b) their performance on minimal pairs may be explained by the relationship between L1 and L2 vowels and consonants, as proposed by most L2 speech learning models, including the L2LP. CSWL paradigms resemble a common real-world word learning situation in which word-objects pairings are presented ambiguously, in the context of other candidate pairings. Across multiple encounters with the words and items, learners can derive the correct word-object pairing through bottom-up statistical tracking mechanisms (e.g., [24]) or top-down hypothesis testing mechanisms (e.g., [25]).
Previous studies have demonstrated that both simultaneous bilinguals and L2 learners (also known as sequential bilinguals) can learn the pseudo words we present in a statistical word learning task. We define simultaneous bilinguals as learners who were exposed to two languages from birth and sequential bilinguals as L2 learners with exposure to the L2 after acquiring a first language, with onset of L2 acquisition during childhood, adolescence or adulthood. In the case of our study, all learners were exposed to English as a foreign language at school and therefore are referred to as L2 learners or sequential bilinguals. While most simultaneous bilinguals acquire proficiency comparable to monolingual speakers of their two languages, L2 learning can yield different levels of proficiency, in the case of the present group and those tested in [26]. The participants in both groups tested here followed university education in English, thus their L2 proficiency was advanced enough to understand English at a tertiary education level. When learning English words in a CSWL task Singaporean English-Mandarin simultaneous bilinguals had higher word-learning accuracy than monolingual English speakers [27]. In contrast, no difference in CSWL between highly proficient L2 English speakers with heterogeneous L1 backgrounds and monolingual English listeners was found in [26]. In both CSWL studies, participants were tested on their ability to learn eight words, four of which differed from one another on their initial consonant, forming consonant minimal pairs (cMP; e.g., BON-TON), and four of which formed vowel minimal pairs (vMP; e.g., DIT-DUT). Pairing a word from one set with one from the other set formed a non-minimal pair (nonMP; e.g., BON-DIT). Even though the simultaneous bilinguals in [27] outperformed monolinguals in accuracy, their reaction time for vMPs was slower to that of monolinguals. This may have been due to difficulties distinguishing the words DIT (/dɪt/) and DEET /dit/, as the vowel /ɪ/ is not found in the Mandarin vowel inventory. Alternatively, a vowel bias in Mandarin could have impacted reaction time, as vowels appear to provide stronger lexical identity in Mandarin compared to consonants [28], unlike in English. This may result in delayed processing of words differing in a single vowel by English-Mandarin bilinguals. The contrasting findings between learner groups, namely bilinguals in [27] versus L2 learners in [26], may be due to their specific linguistic background or to the English variety of the stimuli (American versus Australian English).
We presented L1 Mandarin L2 English learners with the same CSWL task as in [26,27] including the same eight words (i.e., /dit/, /dɪt/, /dʊt/, /dut/, /bɔn/, /pɔn/, /tɔn/ and /dɔn/, and pairings (i.e., nonMPs, vMPs and cMPs), compared their performance to that of AusE monolinguals, and assessed whether the relationship between Mandarin and English vowels and consonants can explain L2 word learning. First, we expected our AusE monolinguals to perform similarly to those tested in [27], with higher performance for nonMPs and cMPs compared to vMPs. In contrast, we expected our L1 Mandarin L2 English learners to have more L1 interference and have less optimal L2 representations than simultaneous bilinguals and therefore predicted they would find vMPs more difficult than cMPs or nonMPs, with their high L2 proficiency leading to similar performance to AusE monolinguals in cMPs and nonMPs. This prediction is in line with the L2LP model and with many other cross-language and L2 speech learning models. Specifically, if L1 Mandarin L2 English learners continue to have L2 representations that are L1-like (as shown in [12]), English words containing the vowels /ɪ/, /ʌ/, /ʊ/ should be particularly difficult to master [29], as these vowels are not found in Mandarin and are acoustically very similar to Mandarin /i/, /a/, /u/, leading to a NEW scenario that has not been resolved despite advanced L2 proficiency. Although many previous studies have shown plasticity for L2 learners in the phonetic/phonology domain, L2 proficiency does not seem to have a clear correlation with mastering new contrasts [13]. Conversely, previous studies have shown English consonants appear to be easier to perceive for Mandarin speakers, suggesting that they constitute a SIMILAR scenario, leading to better L2 performance from the start [30,31].
Finally, we tested a developmental tenet of L2LP which poses that transition from naïve listening to high L2 proficiency results in L2 perceptual and lexical development, such as creating or shifting of phonological categories to better represent the L2 [11,12]. Previous results have been mixed, as no difference between naïve Spanish-speaking listeners and those who had been learning Dutch in an immersive environment has been found [13,20], while some L2 perception studies report a positive effect of L2 experience [32,33] and others find no effect [13,34,35]. L2 immersion in a city where the L2 is spoken is a further opportunity to learn and be surrounded by the L2 in daily life, as opposed to only in a classroom. Within the L2LP framework, language immersion is seen as richer and more impactful language exposure, which should lead to further learning and in turn to higher L2 proficiency. Thus, if immersive experience with the specific target L2, namely AusE, influences performance, L2 word learning accuracy will be higher for L1 Mandarin L2 English learners who are immersed in the target L2 in Sydney, Australia than those who live in their home country (Shanghai, China). Our study therefore differs from previous studies in examining a homogeneous group of L2 learners who have Mandarin as their L1.

2. Materials and Methods

2.1. Participants

Sixty participants took part in the experiment. Thirty-one were AusE monolinguals from Sydney (Mage = 26, 21 females, 10 males), and 29 were L1 Mandarin L2 English participants who were divided in two groups according to their place of residence during testing: 11 lived in Sydney, Australia (MandSyd, Mage = 27.34, 9 females, 2 males) and 18 in Shanghai, China (MandShanghai, Mage =22, 10 females, 8 males). Participants tested in Sydney were undergraduate psychology students or people from the local community recruited through word-of-mouth, advertisements or the university’s participant recruitment system. Mandarin speakers in Sydney were native in Mandarin and indicated to speak and understand (Australian) English at advanced to native level. Participants tested in Shanghai were recruited through word-of-mouth and were all native Mandarin speakers at East China Normal University. They had studied English for an average of 14 years. They used Mandarin-Chinese daily, and English occasionally at university. Specific data regarding the precise number of years of English experience per participant was not collected or is no longer available. Participants received course credit or $10 travel compensation for their participation. Written informed consent was obtained from all participants prior to the start of the experiment, and the study was approved by the Western Sydney University Human Research Ethics Committee.

2.2. Stimuli

2.2.1. Pseudo Spoken Words

Stimuli consisted of eight monosyllabic pseudo words recorded by a female speaker of AusE using infant-directed speech (IDS). The words originate from a prior CSWL study [36] and have been used in other word learning and CSWL studies [26,27,37,38,39,40] with no effect of item on word learning accuracy. For the present study, we chose to use the same IDS stimuli in order to directly compare our results to previous studies, which were aimed at testing word learning in infants versus adults. We also chose IDS stimuli because many previous studies have shown that IDS facilitates word learning in infants learning their native language [41,42] and adult second language learners [43,44,45,46].
Words followed a CVC structure following English phonotactics. Per word, two tokens were selected to match prosodic contours across all words, with one token having a rising prosodic contour whereas the second has a descending prosodic contour. Four words differing from one another on their initial consonant formed consonant minimal pairs and followed a /Cɔn/ structure (cMP; e.g., BON-TON). The other four words followed a /dVt/ structure, forming vowel minimal pairs (vMP) with one another (e.g., DIT-DUT). Pairing a word from one set with one from the other set formed a non-minimal pair (nonMP; e.g., BON-DIT).

2.2.2. Pseudo Visual Referents

Each word was paired with a visual referent, which consisted of colour pictures of pseudo items (see Figure 1). These pictures have been used in prior CSWL studies (e.g., [23,26,27,36,40,47]. Pictures were 210 × 206 pixels.

2.3. Procedure

After obtaining written consent, participants filled out a language background questionnaire. They then completed the CSWL task, comprising a learning phase followed by a test phase. For this, participants were seated in front of a laptop computer with a 17-inch monitor and were asked to wear headphones throughout the experiment. The experiment was run using the software package E-prime (version 2.0, Psychology Software Tools Inc., Sharpsburg, PA, USA).
The learning phase consisted of 36 trials (as in [26]), with each word-referent pairing presented nine times. As in the previous CSWL studies, participants were instructed to look at the images and listen to the sounds but were not informed that this was a word learning experiment. During each trial, two visual referents were presented on the screen on a white background, centered vertically. After the images had been on the screen for 500 ms (to keep the experimental design consistent with prior CSWL studies [26,27,38,40]), the auditory labels corresponding to each referent were played such that the referents were named left-to-right or right-to-left with 500 ms between tokens, without indication of which label belonged to which referent. Trials were randomized for each participant and were controlled to ensure that each image was presented simultaneously with every other image at least once and at most twice (for more specific details regarding the counterbalancing of the trials, see [26]). In total, there were 24 nonMPs pairs, 6 cMP pairs, and 6 vMPs. Trials lasted for 3.5 s leading to a total learning phase of approximately 3 min. Participants did not complete a familiarization test before the testing phase as this would defeat the purpose of the statistical learning paradigm.
Participants were tested directly after the learning phase and were told that they would view two images on the screen and would hear one word. They were instructed to press the left or right ALT key on the keyboard to indicate whether they thought the word corresponded to the left or right image, respectively. Trials in the testing phase used the same visual referent pairs as the learning phase, but the left and right designations of the images were randomized once (similar to [26]). In each trial participants heard four repetitions of the label corresponding to one image (the target word). The first token began 500 ms after presentation of the images, with 500 ms between tokens. Every word appeared as target word four or five times. As in the training phase, there were 36 trials in total with 24 nonMP trials, 6 cMP trials and 6 vMP trials. Trials were separated into three blocks of 12 trials, with block order counterbalanced between participants, and trial order within blocks randomized for each participant. Every trial lasted 6.5 s leading to a total test phase of approximately 4 min. An example of a learning and test trial is presented in Figure 2.

2.4. Statistical Analysis

The data were analyzed using the statistical program R [48] with the brms package (using Stan; [48,49,50]). We used a multilevel Bayesian regression model to analyze participants’ accuracy. We chose to use a Bayesian approach for the statistical analysis due to the advantage and flexibility of using probabilistic statistics with small sample sizes [51], as in the case of our L1 Mandarin L2 English group based in Sydney (N = 11). We have successfully used Bayesian modelling in previous studies where we provide further information and details of two other cases where probabilistic statistics are particularly useful [38,52,53,54].
For the multilevel Bayesian regression model reported below, we used dummy coding (the default in brms) for the factors of Language group and Pair type, with AusE and nonMP as reference levels. Approximate leave-one-out (LOO) cross-validation was used to find the best fitting models, which resulted in a model fitted with Language group and Pair type as fixed effects and Pair type and Trial number as random effects within-participants. We used weakly informative priors [55,56] with a Student-t’s distribution with 3 degrees of freedom, a mean of 0 and a standard deviation of 2.5. We used a Bernoulli distribution to model accuracy responses (which consist of either 0′s and 1′s).
After fitting the model, we proceeded with hypothesis testing. Based on the predictions from the L2LP model, we hypothesized that the L1 Mandarin L2 English groups would perform less accurately than the AusE group for vMPs and cMPs but not for nonMPs. If experience with living in a country were English is spoken and in particular the English variety of the stimuli influences performance, we also expected the L2 group based in Sydney to perform better than the group based in Shanghai. We quantified the evidence for the tested hypotheses by using evidence ratios (ER), which are used to assess the likelihood of the test hypothesis against its alternative. To test our hypotheses, we only consider ERs above 30 (or of 1/30 or beyond) which qualify as “very strong evidence” and ERs of 10–30 (or of 1/10–1/30) which qualify as “strong” evidence (see [57] as cited by [58]). For readers unfamiliar with Bayesian statistics, an ER of >19 is approximately equivalent to an alpha of 0.05 in frequentist null-hypothesis testing [59]. In addition to the ERs, we also report the hypotheses’ posterior probabilities (PP).

3. Results

Accuracy

Figure 3 shows the mean accuracy responses per language group and pair type. We first analyzed participants’ accuracy (correct and incorrect responses) and tested whether participants were able to learn the word-object pairings for each pair type. Bayesian linear models on the Intercept, which are equivalent to frequentist one sample t-tests, revealed very strong evidence of above chance performance for all pair types in the AusE and in the L2 group from Shanghai, as indicated by posterior probabilities (PP) of > 0.98 and ERs of >3999. However, for the L2 group from Sydney we found very strong evidence of above chance performance only for the nonMPs (PP = > 0.999; ER = 3999), while evidence to support above chance performance was weaker for cMPs and vMPs (PPs = 0.92, 0.86; ERs = 11.82, 6.09, respectively). This is likely due to the higher response variability in this group (see Figure 3), which might be related to its smaller sample size (N = 11).
We then used a multilevel Bayesian regression model to estimate the interaction between Language group and Pair type on accuracy scores (see Table 1). We first tested whether there were differences in performance across the three pair types in each of the three language groups. For AusE, we found strong evidence that nonMPs and cMPs were more accurate than vMPs (PPs > 0.95; ERs of > 10), while no evidence for a difference between nonMPs and cMPs was found. These results replicate those reported in [26,27]) using frequentist statistics. Interestingly, no such difference between pair types was found for any of the L2 groups (PPs < 0.65 ERs < 10), suggesting their performance was similar across all three pair types.
The results in Table 1 indicate no performance differences between the two L2 groups (residing in Sydney or Shanghai), which was confirmed by further hypothesis testing where we found no evidence of a between group difference for nonMPs (PP = 0.40; ER = 0.68), cMPs (PP = 0.40; ER = 1.23) or vMPs (PP = 0.32; ER = 0.47). We thus combined the data from the two L2 groups into one L1 Mandarin L2 English group to test our hypothesis that the L1 Mandarin L2 English learners should have lower performance on vMPs than on cMPs or nonMPs because the vowels in the vMPs are not contrastive in their L1 Mandarin.
A second multilevel Bayesian regression model was run to estimate whether monolingual AusE learners (N = 26) indeed performed better than L1 Mandarin L2 English learners (N = 29). As shown in Table 2, we found very strong evidence (for nonMPs and cMPs) and strong evidence (for vMPs) that the AusE group had higher accuracy than the L2 group for all three pair types, which runs contrary to the hypotheses that these L2 learners will have lower performance for vMPs than the other two pair types and that they would only differ from monolingual English speakers in the vMP trials.

4. Discussion

This paper is the first to show that the mechanism of CSWL can be blocked by certain properties of the learner’s L1, which go beyond segmental differences between L1 and L2 vowels and consonants. Overall, participants learned the word-object pairings for all pair types. In line with AusE monolinguals tested in [27], AusE monolinguals here were best at identifying words in a nonMP or cMP context compared to a vMP context. However, contradicting our prediction, this pattern was not found for either L1 Mandarin L2 English group, who were less accurate than the AusE group for all three pair types. This suggests that the phoneme inventory differences between Mandarin and English do not explain their L2 word learning performance. Experience with AusE did not impact performance either, as both L2 groups performed similarly.
In contrast with [26,27], where simultaneous English-Mandarin bilinguals outperformed AusE monolinguals and L2 learners with diverse L1 backgrounds performed similarly to AuE monolinguals, here we found that L1 Mandarin L2 English had lower accuracy than AusE monolinguals. Prior research suggests that bilinguals have an advantage in pseudo word learning due to enhanced phonological memory [60,61,62,63] and executive functioning (e.g., [64]). Conversely, L2 learners have been found to have low sensitivity to L2 phonological contrasts that are absent in their L1 [65]. This may be explained by the idea that L2 learners perceive the sounds of a new language through their native phonological categories (e.g., [9,10,11,33,66], which can lead to L2 word recognition problems and L2 representations that continue to be L1-like [13,20,67,68,69,70,71]. Instead, simultaneous bilinguals are able to fully inhibit each of their languages selectively, while L2 learners, despite their proficiency, have trouble doing so. This has been shown many times in previous studies where language dominance yields to interference, especially for the L2 in sequential bilinguals [43]. For example, the pseudo words making up the vMPs of the present study included vowels that are not present in Mandarin, namely /ɪ/ and /ʊ/, as mentioned in the Introduction. As predicted by the L2LP model [11,13,14], such vowel contrasts are likely to be perceived as the closest acoustically related native vowel, leading to problems with the recognition of words containing those L2 contrasts, as has been shown for similar L2 word recognition cases (e.g., [13,20,21,22,23,69]).
However, absent or L1-like L2 representations in the L1 Mandarin L2 English group cannot explain the current results because they found all pseudo pair types equally difficult to recognize. Rather, we propose that their general word learning difficulty may have resulted from the specific stimuli presented to them. As mentioned in the Methods, participants heard pseudo words produced in infant-directed speech (IDS), which is a speech style often used by mothers and caregivers when speaking to babies and infants and contains more variable pitch relative to adult-directed speech (ADS) [41]. Although many studies have shown that IDS can be beneficial for word learning in infants [42,44] and adults [41,45,46,72], IDS might negatively impact word learning for listeners who have heighted attention to pitch variation, such as tonal language speakers for whom pitch variations signify different lexical items [73].
We propose that heightened discrimination of pitch variation may have resulted in L2LP’s Multiple Category Assimilation (MCA, L2LP; [74]), a scenario where an L2 category is acoustically similar to more than one L1 category, causing learners to perceive different tokens of a single L2 category as belonging to different categories in their L1 [17,22,66]. According to the L2LP proposal, this scenario results in listeners’ perception of contrasts that do not exist in the L2 [12] and is referred to as a SUBSET problem [11,74]. When L2 sounds are a subset of what the learner can actually hear, there is no overt information from the target L2 that would allow the learner to stop hearing the extra category or stop activating irrelevant or spurious lexical items [11,22,46,74] resulting in higher lexical competition and overall less efficient L2 lexicalization and recognition. It is likely that MCA plays a role in the overall lower performance of Mandarin speakers in this study, specifically due to the use of IDS for the stimuli tokens. The IDS-induced pitch variations may have resulted in L1 Mandarin L2 English learners’ perception of the two tokens of each word as two different words, challenging their ability to learn correct word-object pairs in the CSWL task. Importantly, the L2LP model is currently the only model of L2 perceptual and lexical developmental that can explain this type of L2 learning scenarios, starting from perceiving a single L2 category as more than one L1 category [75]. According to the L2LP model, this problem may not arise in simultaneous English-Mandarin bilinguals who may be able to de-activate their tonal language, succeeding at learning English words via CSWL and even surpassing monolingual English speakers, as reported in [26].
The L2LP explanation that the presence of additional pitch variation may be particularly problematic for speakers of a tonal language is further supported by findings that experience influences the perception of nonnative pitch variation. In many cases, tonal language experience is advantageous—for instance, Mandarin listeners outperform AusE listeners when learning a Thai tone distinction differing in pitch height contour [76,77]. In addition to tonal language learners, those who have experience with pitch via musical training have shown better tone discrimination [78], though not lexical tone learning [77]. However, in a recent study using the exact same CSWL paradigm and stimuli as in the present study [54] together with two standard music perception tests, we found that learners with high music perception abilities struggled most with IDS-produced words that had the highest pitch variability, i.e., vMPs. The tonal language speakers tested here struggled across all pair types, which may be due to them consistently using pitch information to discriminate between the exemplars of each word and across words for all pair types.
To confirm whether the additional pitch fluctuations induced by IDS indeed lead to a SUBSET problem and block CSWL in L1 Mandarin L2 English speakers, future research can use words produced in adult-directed-speech (ADS) with minimal pitch variation within and between exemplars of each word to examine whether word learning accuracy improves [54]. As vMPs naturally contain pitch variability, we expect the SUBSET problem to remain when tested with stimuli produced in ADS. If tonal language speakers indeed use suprasegmental information, such as pitch variations, when learning L2 words, their performance should thus improve more for nonMPs and cMPs than for vMPs when the variations in pitch are less prominent, as it is typical of ADS stimuli.
Lastly, we did not find a difference in experience with AusE, as both Mandarin groups performed similarly, providing no evidence that exposure to the L2 via an immersive environment mitigated L2 word learning difficulty. It could be that L2 exposure for the two L1 Mandarin groups is similar because while participants in Shanghai were not typically exposed to English in the community, both groups were students at English-speaking universities. Additionally, the two groups did not sufficiently differ in prior L2 exposure. However, a lack of group difference may also be due to a smaller sample size in the Sydney group, which resulted in higher variability in their results.
A limiting factor in this study is that by using self-reports as a measure of English proficiency, we may not be certain that participants have under- or overestimated their level of English. An English proficiency test administered alongside speech perception tasks may solve this issue. We also acknowledge that the missing details for each participants’ English proficiency is a limitation. However, with the available demographic data, we replicated previous results showing that individual background does not play a role in performance. In a future study, more detailed information should be collected to confirm that this variable indeed does not affect CSWL. It is also important to note that individual differences in cognitive abilities may influence word learning in such word learning paradigms and in previous cross-situational word learning studies. Results from our lab show that cognitive skills, such as visuospatial memory, inhibition, or flexibility were not significant predictors of cross-situational and incidental word learning in four-year-old children [79,80], but this may be different for adults.

5. Conclusions

To conclude, although L1 Mandarin L2 English learners were able to learn the pseudo English words in an ambiguous word learning scenario, their performance was overall lower than that of monolingual English speakers. Given that their performance was equally low for all pair types regardless of L1-L2 phonological relationships, an explanation solely based on the absence of L2 representations in this L2 learners cannot adequately account for the results. The more nuanced L2LP model’s explanation of a potential “subset problem,” in which these L2 leaners may have perceived different tokens of the same word as separate words because of their L1 tonal language background, seems to a more adequate and accurate account. However, further research is needed to confirm this proposal.

Author Contributions

Conceptualization, P.E. and K.E.M.; methodology, P.E. and K.E.M.; formal analysis, E.A.S.; data curation, K.E.M. and E.A.S.; writing—original draft preparation, P.E., K.E.M. and E.A.S.; writing—review and editing, P.E., E.A.S. and K.E.M.; visualization, E.A.S.; supervision, P.E.; funding acquisition, P.E. All authors have read and agreed to the published version of the manuscript.

Funding

Data collection and K.M.’s work were funded by the Australian Research Centre of Excellence for the Dynamics of Language (CE140100041). P.E.’s and E.A.S.’s work were funded by an ARC Future Fellowship (FT160100514) awarded to P.E. Article publication fees were funded by Western Sydney University.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and was approved by the Western Sydney University Human Research Ethics committee (protocol code H11022 in 2017).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Anonymized data is available upon request to the first author.

Acknowledgments

We would like to thank Xiaoluan Liu and Nicole Traynor for their help with data collection in Shanghai and Sydney respectively and the participants for their time and participation.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. DeKeyser, R. The robustness of critical period effects in second language acquisition. Stud. Second Lang. Acquis. 2000, 22, 499–533. [Google Scholar] [CrossRef] [Green Version]
  2. Johnson, J.S.; Newport, E.I. Critical period effects in second language learning: The influence of maturational state on the acquisition of English as a second language. Cogn. Pyschol. 1989, 21, 60–99. [Google Scholar] [CrossRef] [PubMed]
  3. Oyama, S. A sensitive period for the acquisition of a nonnative phonological system. J. Psycholinguist. Res. 1976, 5, 261–283. [Google Scholar] [CrossRef]
  4. Piske, T.; MackKay, I.R.A.; Flege, J.E. Factors affecting degree of foreign accent in an L2: A review. J. Phon. 2001, 29, 191–215. [Google Scholar] [CrossRef] [Green Version]
  5. Seliger, H.W.; Krashen, S.D.; Ladefoged, P. Maturational constraints in the acquisition of second language accent. Lang. Sci. 1975, 36, 20–22. [Google Scholar]
  6. Tahta, S.; Wood, M.; Loewenthal, K. Foreign accents: Factors relating to transfer of accent from the first language to a second language. Lang. Speech. 1981, 24, 265–272. [Google Scholar] [CrossRef]
  7. Jared, D.; Kroll, J.F. Do bilinguals activate phonological representations in one or both of their languages when naming words? J. Mem. Lang. 2001, 44, 2–31. [Google Scholar] [CrossRef]
  8. Kroll, J.F.; Sunderman, G. Cognitive processes in second language learners and bilinguals: The development of lexical and conceptual representations. In The Handbook of Second Language Acquisition; Doughty, C.J., Long, M.H., Eds.; Blackwell Publishing: Oxford, UK, 2003; pp. 104–129. [Google Scholar]
  9. Best, C.T.; Tyler, M.D. Nonnative and second-language speech perception: Commonalities and complementaries. In Language Experience in Second Language Speech Learning: In Honor of James Emil Flege; Bohn, O.-S., Munro, M.J., Eds.; John Benjamins: Amsterdam, The Netherlands, 2007; pp. 13–34. [Google Scholar]
  10. Flege, J.E. Second language speech learning theory, findings and problems. In Speech Perception and Linguistic Experience: Issues in Cross-Language Research; Strange, W., Ed.; York Press: Timonium, MD, USA, 1995; pp. 229–273. [Google Scholar]
  11. Escudero, P. Linguistic Perception and Second Language Acquisition: Explaining the Attainment of Optimal Phonological Categorization; Netherlands Graduate School of Linguistics: Amsterdam, The Netherlands, 2005. [Google Scholar]
  12. Van Leussen, J.-W.; Escudero, P. Learning to perceive and recognize a second language: The L2LP model revised. Front. Psychol. 2015, 6, 1000. [Google Scholar] [CrossRef] [Green Version]
  13. Escudero, P.; Benders, T.; Lipski, S.C. Native, non-native and L2 perceptual cue weighting for Dutch vowels: The case of Dutch, German and Spanish Listeners. J. Phon. 2009, 37, 452–465. [Google Scholar] [CrossRef]
  14. Yazawa, K.; Whang, J.; Kondo, M.; Escudero, P. Language-dependent cue weighting: An investigation of perception modes in L2 learning. Second Lang. Res. 2020, 36, 557–581. [Google Scholar] [CrossRef]
  15. Escudero, P.; Simon, E.; Mulak, K.E. Learning words in a new language: Orthography doesn’t always help. Billing. Lang. Cogn. 2014, 17, 384–395. [Google Scholar] [CrossRef]
  16. Escudero, P.; Vasiliev, P. Cross-language acoustic similarity predicts perceptual assimilation of Canadian English and Canadian French vowels. J. Acoust. Soc. Am. 2011, 130, EL277. [Google Scholar] [CrossRef]
  17. Alispahic, S.; Mulak, K.E.; Escudero, P. Acoustic properties predict perception of unfamiliar Dutch vowels by adult Australian English and Peruvian Spanish listeners. Front. Psychol. 2017, 8, 52. [Google Scholar] [CrossRef] [Green Version]
  18. Lengeris, A. Perceptual assimilation and L2 learning: Evidence from the perception of Southern British English vowels by native speakers of Greek and Japanese. Phonetica 2009, 66, 169–187. [Google Scholar] [CrossRef]
  19. Boersma, P.; Escudero, P. Learning to perceive a smaller L2 vowel inventory. An optimality theory account. In Contrast in Phonology: Theory, Perception, Acquisition; Avery, P., Dresher, B.E., Rice, K., Eds.; De Gruyter Mouton: Berlin, Germany, 2008; pp. 271–301. [Google Scholar]
  20. Escudero, P.; Broersma, M.; Simon, E. Learning words in a third language: Effects of vowel inventory and language proficiency. Lang. Cogn. 2013, 28, 746–761. [Google Scholar] [CrossRef] [Green Version]
  21. Escudero, P. Orthography plays a limited role when learning the phonological forms of new words: The case of Spanish and English learners of novel Dutch vowels. Appl. Psycholinguist. 2015, 36, 7–22. [Google Scholar] [CrossRef]
  22. Elvin, J.; Williams, D.; Escudero, P. Learning to perceive, produce and recognise words in a non-native language. In Linguistic Approaches to Portuguese as an Additional Language; Molsing, K.V., Perna, C.B.L., Ibaños, A.M.T., Eds.; John Benjamins: Amsterdam, The Netherlands, 2020; pp. 61–82. [Google Scholar]
  23. Tuninetti, A.; Mulak, K.; Escudero, P. Cross-situational word learning in two foreign languages: Effects of native and perceptual difficulty. Front. Commun. 2020, 5, 602471. [Google Scholar] [CrossRef]
  24. Yu, C.; Smith, L.B. Rapid word learning under uncertainty via cross-situational statistics. Psychol. Sci. 2007, 18, 414–420. [Google Scholar] [CrossRef] [Green Version]
  25. Trueswell, J.C.; Medina, T.N.; Hafri, A.; Gleitman, L.R. Propose but verify: Fast mapping meets cross-situational word learning. Cogn. Psychol. 2013, 66, 126–156. [Google Scholar] [CrossRef] [Green Version]
  26. Escudero, P.; Mulak, K.E.; Fu, C.S.; Singh, L. More limitations to monolingualism: Bilinguals outperform monolinguals in implicit word learning. Front. Psychol. 2016, 7, 1218. [Google Scholar] [CrossRef] [Green Version]
  27. Escudero, P.; Mulak, K.E.; Vlach, H.A. Cross-situational word learning of minimal word pairs. Cogn. Sci. 2016, 40, 455–465. [Google Scholar] [CrossRef] [PubMed]
  28. Chen, F.; Wong, M.L.Y.; Zhu, S.; Wong, L.L.N. Relative contributions of vowels and consonants in recognizing isolated Mandarin words. J. Phon. 2015, 52, 26–34. [Google Scholar] [CrossRef]
  29. Jia, G.; Strange, W.; Wu, Y.; Collado, J. Perception and production of English vowels by Mandarin speakers: Age-related differences vary with amount of L2 exposure. J. Acoust. Soc. Am. 2006, 119, 1118–1130. [Google Scholar] [CrossRef] [PubMed]
  30. Mi, L.; Tao, S.; Wang, W.; Dong, Q.; Jin, S.-H.; Liu, C. English vowel identification in long-term speech-shaped noise and multi-talker babble for English and Chinese listeners. J. Acoust. Soc. Am. 2013, 113, EL391. [Google Scholar] [CrossRef] [PubMed]
  31. Tao, S.; Chen, Y.; Wang, W.; Dong, Q. English consonant identification in multi-talker babble: Effects of Chinese-native listeners’ English experience. Lang. Speech 2019, 62, 531–545. [Google Scholar] [CrossRef]
  32. Flege, J.E. Perception and production: The relevance of phonetic input to L2 phonological learning. In Cross Currents in Second Language Acquisition and Linguistic Theory; Huebner, T., Ferguson, A.C., Eds.; John Benjamins: Amsterdam, The Netherlands, 1991; pp. 249–290. [Google Scholar]
  33. Flege, J.E.; Bohn, O.-S.; Jan, S. Effects of experience on non-native speakers’ production and perception of English vowels. J. Phon. 1997, 25, 437–470. [Google Scholar] [CrossRef] [Green Version]
  34. Cebrian, J. Input and experience in the perception of an L2 temporal and spectral contrast. In Proceedings of the 15th International Congress of the Ponetics Sciences, Barcelona, Spain, 3–9 August 2003; Recasens, D., Solé, M.J., Romero, J., Eds.; Universitat Autònoma de Barcelona/Causal Productions: Barcelona, Spain, 2003; pp. 2297–2300. [Google Scholar]
  35. Flege, J.E.; Munro, M.J.; Fox, R.A. Auditory and categorical effects on cross-language vowel perception. J. Acoust. Soc. Am. 1994, 95, 3623–3641. [Google Scholar] [CrossRef] [Green Version]
  36. Escudero, P.; Mulak, K.E.; Vlach, H.A. Infants encode phonetic detail during cross-situational word learning. Front. Psychol. 2016, 7, 1419. [Google Scholar] [CrossRef] [Green Version]
  37. Curtin, S.A.; Fennell, C.; Escudero, P. Weighting of vowel cues explains patterns of word-object associative learning. Dev. Sci. 2009, 12, 725–731. [Google Scholar] [CrossRef]
  38. Escudero, P.; Smit, E.A.; Angwin, A. Investigating orthographic versus auditory cross-situational word learning with online and lab-based research. Lang. Learn. 2022; early view. [Google Scholar]
  39. Fikkert, P. Developing representations and the emergence of phonology: Evidence from perception and production. In Laboratory Phonology 10: Variation, Phonetic Detail and Phonological Representation; Fougeron, C., Kühnert, B., D’Imperio, M., Eds.; De Gruyter Mouton: Berlin, Germany, 2010; pp. 227–258. [Google Scholar]
  40. Mulak, K.E.; Vlach, H.A.; Escudero, P. Cross-situational word learning of phonologically overlapping words across degrees of ambiguity. Cogn. Sci. 2019, 42, e12731. [Google Scholar] [CrossRef]
  41. Kuhl, P.K.; Andruski, J.E.; Chistovich, I.A.; Chistovich, L.A.; Kozhevnikova, E.V.; Ryskina, V.L.; Stolyarova, E.I.; Sundberg, U.; Lacerda, F. Cross-language analysis of phonetic units in language addressed to infants. Science 1997, 277, 684–686. [Google Scholar] [CrossRef]
  42. Graf Estes, K.; Hurley, K. Infant-directed prosody helps infants map sounds to meanings. Infancy 2013, 18, 797–824. [Google Scholar] [CrossRef]
  43. Marian, V.; Spivey, M. Bilingual and monolingual processing of competing lexical items. Appl. Psycholinguist. 2003, 24, 173–193. [Google Scholar] [CrossRef] [Green Version]
  44. Ma, W.; Golinkoff, R.M.; Houston, D.M.; Hirsh-Pasek, K. Word learning in infant- and adult-directed speech. Lang. Learn. Dev. 2011, 7, 185–201. [Google Scholar] [CrossRef] [Green Version]
  45. Ellis, N.C. Salience, cognition, language complexity, and complex adaptive systems. Stud. Second Lang. Acquis. 2016, 38, 341–351. [Google Scholar] [CrossRef]
  46. Golinkoff, R.M.; Alioto, A. Infant-directed speech facilitates lexical learning in adults hearing Chinese: Implications for language acquisition. J. Child Lang. 1995, 22, 703–726. [Google Scholar] [CrossRef]
  47. Vlach, H.A.; Sandhofer, C.M. Retrieval dynamics and retention in cross-situational statistical word learning. Cogn. Sci. 2014, 38, 757–774. [Google Scholar] [CrossRef] [Green Version]
  48. R Core Team. R: A Language and Environment for Statistical Computing [Computer Software Manual]; The R Project for Statistical Computing: Vienna, Austria, 2020. [Google Scholar]
  49. Bürkner, P.-C. brms: An R package for Bayesian multilevel models using Stan. J. Stat. Softw. 2017, 80, 1–28. [Google Scholar] [CrossRef] [Green Version]
  50. Bürkner, P.-C. brms: Advanced Bayesian multilevel modelling with the R package brms. R J. 2018, 10, 395–411. [Google Scholar] [CrossRef]
  51. Van de Schoot, R.; Depaoli, S. Bayesian analyses: Where to start and what to report. Eur. J. Health Psychol. 2014, 16, 75–84. [Google Scholar]
  52. Escudero, P.; Jones Diaz, C.; Hajek, J.; Wigglesworth, G.; Smit, E.A. Probability of heritage language use at a supportive early childhood setting in Australia. Front. Educ. 2020, 5, 93. [Google Scholar] [CrossRef]
  53. Smit, E.A.; Milne, A.J.; Dean, R.T.; Weidemann, G. Perception of affect in unfamiliar musical chords. PLoS ONE 2019, 14, e0218570. [Google Scholar] [CrossRef] [PubMed]
  54. Smit, E.A.; Milne, A.J.; Escudero, P. Music perception abilities and ambiguous word learning: Is there cross-domain transfer in nonmusicians? Front. Psychol. 2022, 13, 801263. [Google Scholar] [CrossRef] [PubMed]
  55. Gelman, A.; Hwang, J.; Vehtari, A. Understanding predictive information criteria for Bayesian models. Stat Comput. 2014, 24, 997–1016. [Google Scholar] [CrossRef]
  56. Gelman, A.; Lee, D.; Guo, J. Stan: A probabilistic programming language for Bayesian inference and optimization. J. Educ. Behav. Stat. 2015, 40, 530–543. [Google Scholar] [CrossRef] [Green Version]
  57. Jeffreys, H. The Theory of Probability; OUP: Oxford, UK, 1998. [Google Scholar]
  58. Kruschke, J.K. Rejecting or accepting parameter values in Bayesian estimation. Adv. Methods Pract. Psychol. Sci. 2018, 1, 270–280. [Google Scholar] [CrossRef] [Green Version]
  59. Milne, A.J.; Herff, S.A. The perceptual relevance of balance, evenness, and entropy in musical rhythms. Cognition 2020, 203, 104233. [Google Scholar] [CrossRef]
  60. Adesopa, O.O.; Lavin, T.; Thompson, T.; Ungerleider, C. A systematic review and meta-analysis of the cognitive correlates of bilingualism. Rev. Educ. Res. 2010, 80, 207–245. [Google Scholar] [CrossRef] [Green Version]
  61. Kaushanskaya, M.; Rechtzigel, K. Concreteness effects in bilingual and monolingual word learning. Psychon. Bull. Rev. 2012, 19, 935–941. [Google Scholar] [CrossRef]
  62. Majerus, S.; Poncelet, M.; Van der Linden, M.; Weeks, B.S. Lexical learning in bilingual adults: The relative importance of short-term memory for serial order and phonological knowledge. Cognition 2008, 107, 395–419. [Google Scholar] [CrossRef] [Green Version]
  63. Service, E.; Simola, M.; Metsänheimo, O.; Maury, S. Maturational constraints in the acquisition of second language accent. Eur. J. Cogn. Psychol. 2002, 14, 383–403. [Google Scholar] [CrossRef]
  64. Papagno, C.; Vallar, G. Phonological short-term memory and the learning of novel words: The effect of phonological similarity and item length. Q. J. Exp. 1992, 44, 47–67. [Google Scholar] [CrossRef]
  65. Gor, K. Phonological priming and the role of phonology in nonnative word recognition. Biling. Lang. Cogn. 2018, 21, 437–442. [Google Scholar] [CrossRef]
  66. Elvin, J.; Escudero, P. Perception of Brazilian Portuguese vowels by Australian English and Spanish listeners. In Proceedings of the International Symposium on the Acquisition of Second Language Speech (New Sounds 2013), Concordia Working Papers in Applied Linguistics. Montreal, Canada, 17–19 May 2013; pp. 15–156. Available online: http://doe.concordia.ca/copal/documents/12_Elvin_Escudero_Vol5.pdf (accessed on 1 May 2022).
  67. Chrabaszcz, A.; Gor, K. Quantifying contextual effects in second language processing of phonolexically ambiguous and unambiguous words. Appl. Psycholinguist. 2017, 38, 909–942. [Google Scholar] [CrossRef]
  68. Cutler, A.; Weber, A.; Otake, T. Asymmetric mapping from phonetic to lexical representations in second-language listening. J. Phon. 2006, 34, 269–284. [Google Scholar] [CrossRef] [Green Version]
  69. Escudero, P.; Hayes-Harb, R.; Mitterer, H. Novel second-language words and asymmetric lexical access. J. Phon. 2008, 36, 345–360. [Google Scholar] [CrossRef]
  70. Hayes-Harb, R.; Masuda, K. Development of the ability to lexically encode novel L2 phonemic contrasts. Second Lang. Res. 2008, 24, 5–33. [Google Scholar] [CrossRef] [Green Version]
  71. Weber, A.; Cutler, A. Lexical competition in non-native spoken-word recognition. J. Mem. Lang. 2004, 50, 1–25. [Google Scholar] [CrossRef] [Green Version]
  72. Houston-Price, C.; Law, B. How experiences with words supply all the tools in the toddler’s word—Learning toolbox. In Theoretical and Computational Models of Word Learning: Trends in Psychology and Artificial Intelligence; Gogate, L., Hollich, G., Eds.; IGI Global: Hershey, PA, USA, 2013; pp. 81–108. [Google Scholar]
  73. Han, M.; de Jong, N.H.; Kager, R. Lexical tones in Mandarin Chinese infant-directed speech: Age-related changes in the second year of life. Front. Psychol. 2018, 9, 434. [Google Scholar] [CrossRef]
  74. Escudero, P.; Boersma, P. The subset problem in L2 perceptual development: Multiple-category assimilation by Dutch learners of Spanish. In Proceedings of the 26th Annual Boston University Conference on Language Development, Somerville, MA, USA, 2–4 November 2001; Skarabela, B., Fish, S., Do, A.H.-J., Eds.; Cascadilla Press: Somerville, MA, USA; pp. 208–219. [Google Scholar]
  75. Escudero, P.; Hayes-Harb, R. The Ontogenesis Model may provide a useful guiding framework but lacks explanatory power for the nature and development of L2 lexical representation. Biling. Lang. Cogn. 2021, 25, 212–213. [Google Scholar] [CrossRef]
  76. Ong, J.H.; Burnham, D.; Escudero, P. Distributional learning of lexical tones: A comparison of attended vs. unattended listening. PLoS ONE 2015, 10, e0133446. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  77. Ong, J.H.; Burnham, D.; Escudero, P.; Stevens, C.J. Effect of linguistic and musical experience on distributional learning of nonnative lexical tones. J. Speech Lang. Hear. Res. 2017, 60, 2769–2780. [Google Scholar] [CrossRef] [PubMed]
  78. Ong, J.H.; Wong, P.C.M.; Liu, F. Musicians show enhanced perception, but not production of native lexical tones. J. Acoust. Soc. Am. 2020, 148, 3443–3454. [Google Scholar] [CrossRef]
  79. Pino Escobar, G. Word Learning and Executive Functions in Preschool Children: Bridging the Gap between Vocabulary Acquisition and Domain-General Cognitive Processes. Ph.D. Thesis, Western Sydney University, Penrith, Australia, 2022. [Google Scholar]
  80. Pino Escobar, G.; Tuninetti, A.; Antoniou, M.; Escudero, P. Understanding pre-schoolers’ word learning success in different scenarios: Disambiguation meets statistical learning and eBook reading. Dev. Psychol. 2022; manuscript submitted for publication. [Google Scholar]
Figure 1. The eight pseudo words and their visual referents. The four words in the top row are minimally different in their initial consonant, whereas the words on the bottom are minimally different in their vowel. The vowel used for the consonant minimal pairs is /ɔ/ as in POT. Vowels used for the vowel minimal pairs are /i/ as in BEAT, /ɪ/ as in BIT, /u/ as in BOOT, and /ʊ/ as in PUT. Figure 1 originates from [27].
Figure 1. The eight pseudo words and their visual referents. The four words in the top row are minimally different in their initial consonant, whereas the words on the bottom are minimally different in their vowel. The vowel used for the consonant minimal pairs is /ɔ/ as in POT. Vowels used for the vowel minimal pairs are /i/ as in BEAT, /ɪ/ as in BIT, /u/ as in BOOT, and /ʊ/ as in PUT. Figure 1 originates from [27].
Brainsci 12 01618 g001
Figure 2. Example of a learning (left) and test (right) trial (figure from [27]).
Figure 2. Example of a learning (left) and test (right) trial (figure from [27]).
Brainsci 12 01618 g002
Figure 3. Mean accuracy in percentage per language group and pair type. Error bars indicate the standard error of the mean.
Figure 3. Mean accuracy in percentage per language group and pair type. Error bars indicate the standard error of the mean.
Brainsci 12 01618 g003
Table 1. Hypothesis test results for the evidence ratios of the accuracy models’ interaction between pair type and language group.
Table 1. Hypothesis test results for the evidence ratios of the accuracy models’ interaction between pair type and language group.
HypothesisMean90% CIERPP
AusE
nonMP > cMP0.10[−0.24, 0.45]2.170.68
nonMP > vMP0.35[0.01, 0.68]22.050.96
cMP > vMP0.45[0.01, 0.89]21.060.95
L1 Mand L2 Eng–Shanghai
nonMP > cMP0.21[−0.19, 0.60]4.320.81
nonMP > vMP0.06[−0.34, 0.45]1.480.60
cMP > vMP−0.15[−0.66, 0.36]0.450.31
L1 Mand L2 Eng–Sydney
nonMP > cMP0.05[−0.45, 0.54]1.300.57
nonMP > vMP0.18[−0.32, 0.67]2.650.73
cMP > vMP0.13[−0.50, 0.77]1.720.63
Note: Mean = mean of the effect’s posterior distribution. 90% CI = one-sided 90% credibility intervals. ER = evidence ratio = the odds that the effect is in the direction specified by the hypothesis. PP = the posterior probability of the tested hypothesis.
Table 2. Hypothesis test results for the evidence ratios of the second accuracy models’ interaction between pair type and language group.
Table 2. Hypothesis test results for the evidence ratios of the second accuracy models’ interaction between pair type and language group.
Hypothesis
AusEnglish > Mandarin
Mean90% CIERPP
nonMPs0.78[0.33, 1.22]412.791.00
cMPs1.03[0.46, 1.60]799.001.00
vMPs0.51[−0.03, 1.05]15.200.94
Note: Mean = mean of the effect’s posterior distribution. 90% CI = one-sided 90% credibility intervals. ER = evidence ratio = the odds that the effect is in the direction specified by the hypothesis. PP = the posterior probability of the tested hypothesis.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Escudero, P.; Smit, E.A.; Mulak, K.E. Explaining L2 Lexical Learning in Multiple Scenarios: Cross-Situational Word Learning in L1 Mandarin L2 English Speakers. Brain Sci. 2022, 12, 1618. https://doi.org/10.3390/brainsci12121618

AMA Style

Escudero P, Smit EA, Mulak KE. Explaining L2 Lexical Learning in Multiple Scenarios: Cross-Situational Word Learning in L1 Mandarin L2 English Speakers. Brain Sciences. 2022; 12(12):1618. https://doi.org/10.3390/brainsci12121618

Chicago/Turabian Style

Escudero, Paola, Eline A. Smit, and Karen E. Mulak. 2022. "Explaining L2 Lexical Learning in Multiple Scenarios: Cross-Situational Word Learning in L1 Mandarin L2 English Speakers" Brain Sciences 12, no. 12: 1618. https://doi.org/10.3390/brainsci12121618

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop