1. Introduction
The aim of this paper is to examine the comparability of different language dominance assessments in their ability to identify and classify bilinguals with respect to a dominant language. Language dominance, or “observed asymmetries of skill in or use of one language over the other” (
Birdsong 2014, p. 374), is a factor that is used by researchers to account for and explain variation in bilingual behavior (i.e., production, perception, processing, etc.). For instance, language dominance has been used to account for variation in studies including but not limited to: the production and perception of mid vowel contrasts (
Amengual and Chamorro 2015), differences in voice onset time (VOT) in codeswitching contexts (
Antoniou et al. 2011;
Tsui et al. 2019), use of null/overt pronominal subjects and pre/post overt subjects (
Argyri and Sorace 2007), child bilingual acquisition (
Yip and Matthews 2006), and codeswitching patterns (
Pérez-Leroux et al. 2014). Further, language dominance is also a variable of interest for educators and administrators and clinical research (
Gertken et al. 2014).
However, even though language dominance is an important factor in bilingualism research, there is currently no unified method for assessing language dominance in the field. Rather, it has been operationalized and measured via an array of different methods whose usage depends on the individual researcher/research team, phenomenon under study, and formal perspective/view that the study takes on bilingualism (e.g., psycholinguistic, theoretical, sociolinguistic, etc.). Further, the studies that have examined language dominance as a factor in bilingual behavior have produced contradictory results. To illustrate these differences in measuring language dominance and how it plays a role in bilingual outcomes, we review four studies (
Amengual and Chamorro 2015;
Antoniou et al. 2011;
Argyri and Sorace 2007;
Tsui et al. 2019) that operationalize language dominance in different ways and that find differing results with respect to how language dominance affects a given linguistic phenomenon.
1 In these four studies, we will see that three different measures were used to classify bilinguals as dominant in a given language: (1) based on their self-ratings of proficiency, (2) the language of most exposure and use, and (3) the results of a sociolinguistic questionnaire. The results show both an effect for the dominant language (
Amengual and Chamorro 2015;
Argyri and Sorace 2007), and an effect for the non-dominant language (
Antoniou et al. 2011;
Tsui et al. 2019). Additionally, language dominance will be shown to affect both groups of bilinguals, i.e., bilinguals dominant in L
A and bilinguals dominant in L
B (
Tsui et al. 2019) or only one group of bilinguals (
Argyri and Sorace 2007).
First,
Amengual and Chamorro (
2015) studied the production and perception of Galician mid-vowel contrasts (e.g., /e/~/ε/ and /o/~/ɔ/) in early Spanish/Galician bilinguals. They measured language dominance via a sociolinguistic questionnaire, the Bilingual Language Profile (
Gertken et al. 2014) and classified the bilinguals into two groups: Spanish dominant and Galician dominant. They found that the Spanish dominant bilinguals had a harder time perceiving and producing the Galician mid vowel contrasts in comparison to the Galician dominant bilinguals. In other words, for the Spanish dominant bilinguals, Amengual and Chamorro found an effect of the dominant language (here, Spanish) on the non-dominant language (here, Galician). As
Amengual and Chamorro (
2015) did not examine the Spanish of their participants, it remains to be seen whether a similar effect would have been found in the opposite direction (i.e., an effect of Galician on the Spanish of the Galician dominant bilinguals or on the Spanish of the Spanish dominant bilinguals).
Second,
Antoniou et al. (
2011) and
Tsui et al. (
2019) examined voice onset time (i.e., the length of time between the release of a stop consonant and the onset of voicing) in codeswitching contexts across two different groups of bilinguals.
Antoniou et al. (
2011) examined early Greek/English bilinguals who were all classified as English-dominant based upon their self-ratings (i.e., they rated their mastery of English higher than that of their Greek). They found a unidirectional effect such that when switching from Greek to English, the English VOT values were produced more Greek-like than in non-codeswitching contexts. No such effect was found for the Greek VOT values (i.e., they were not produced as English-like). In other words, in contrast to
Amengual and Chamorro (
2015),
Antoniou et al. (
2011) found an effect for the non-dominant language (here, Greek) on the dominant language (here, English). As
Antoniou et al. (
2011) did not examine Greek dominant bilinguals, it remains to be seen whether a similar effect for the non-dominant language would have been found when switching from English into Greek (i.e., an effect of English on the Greek VOT values).
Tsui et al. (
2019) examined early Cantonese/English bilinguals who were classified into three different dominance groups on the basis of their self-ratings of language proficiency: (1) Cantonese dominant, (2) English dominant, and (3) balanced. Similar to
Antoniou et al. (
2011),
Tsui et al. (
2019) found an asymmetrical effect for the non-dominant language on the dominant language for both the Cantonese and English dominant bilinguals. When switching from English to Cantonese, the Cantonese dominant bilinguals produced Cantonese VOT values as more English-like, whereas when switching from Cantonese to English, the English dominant bilinguals produced English VOT values as more Cantonese-like. No effects were found when switching in the opposite direction for either group. Further, the balanced bilinguals did not display any effects on either their Cantonese or English VOT values. Based upon these results, we can hypothesize that
Antoniou et al. (
2011) would have also found an effect of English on the Greek VOT values if Greek dominant bilinguals had been examined.
Third,
Argyri and Sorace (
2007) examined whether language dominance could account for syntax-pragmatics interface phenomena, specifically the use of (1) null/overt pronominal subjects, (2) pre/post overt subjects, (3) object pronouns and (4) the structure of wh-embedded interrogatives, in early Greek/English bilingual children. The children were all eight years old at the time of testing. Language dominance was measured as the amount of exposure the bilinguals received in each language. In this case, the Greek dominant bilingual children were born and raised in Greece, whereas the English dominant bilingual children were born and raised in the UK. Similar to
Amengual and Chamorro (
2015), Argyri and Sorace found an asymmetrical effect of the dominant language on the non-dominant language, but only for the English dominant bilinguals. That is, the English dominant bilinguals extended their use of preverbal subjects in Greek, in contexts where pragmatically they were not the most felicitous option and also where they were grammatically inappropriate. No such effect was present in the Greek dominant bilinguals.
Argyri and Sorace (
2007) concluded that while language dominance does play a role in predicting crosslinguistic effects, it cannot be the sole factor as in this case no effect was found for Greek on the English of the Greek dominant bilinguals.
A comparison of the methods and results of these four studies raises the question as to why they found different results. Why is it that
Amengual and Chamorro (
2015) and
Argyri and Sorace (
2007) found an effect of the dominant language on the non-dominant language, but
Antoniou et al. (
2011) and
Tsui et al. (
2019) found an effect of the non-dominant language on the dominant language? Does it have to do with the language pair or linguistic phenomenon under study?
Amengual and Chamorro (
2015) examined the phonological phenomenon of Galician mid-vowel contrasts and Argyri and Sorace examined syntax-pragmatics interface phenomena, whereas
Antoniou et al. (
2011) and
Tsui et al. (
2019) examined the phonetic effects of codeswitching. Or could it have to do with the way in which language dominance was operationalized?
Amengual and Chamorro (
2015) used a sociolinguistic questionnaire (the BLP) to group participants into dominance groups, while
Antoniou et al. (
2011) and
Tsui et al. (
2019) both used self-ratings of proficiency, and
Argyri and Sorace (
2007) relied on amount of language exposure. Further, why is it that
Argyri and Sorace (
2007) only found an effect for one of the dominance groups (here, English dominant) but
Tsui et al. (
2019) found an effect for both groups (here, Cantonese dominant and English dominant)? Again, could it have to do with the linguistic phenomenon under study? Or could these differences in results have to do with the way the bilinguals were grouped into different dominance groups based upon the chosen language dominance measure? The combination of variation in language dominance assessments and the different results in studies that use them to examine language dominance (that these four studies highlight) is problematic because we have no way to determine whether any differences in results are due to the specific language dominance assessment used or another variable (e.g., language pair, type of bilingualism, linguistic phenomenon, etc.). This then makes it difficult to synthesize the effects of language dominance on bilingual behavior across different studies, which is not ideal as being able to compare the effect of language dominance across different studies is an essential step in understanding not only the bilingual construct of language dominance, but also bilingual outcomes. As there is such variation in the way that language dominance is measured (see
Section 3 for more details), one question that arises is whether all these methods are comparable and whether we would get the same results regardless of the method. In this paper, we compare four different but commonly used language dominance assessments in order to shed light on this issue. Specifically, we compare results of the Bilingual Language Profile (
Gertken et al. 2014), the Bilingual Dominance Scale (
Dunn and Fox Tree 2009), self-ratings of verbal and written proficiency and a sentence repetition task (
Flege et al. 2002) on a single group of early Spanish/English bilinguals in order to determine if these different assessments classify bilinguals in the same way. In other words, does it make a difference which language dominance assessment is used? We conclude that the language dominance assessment used does make a difference as our results show that the same bilingual will be classified into different groups (here, Spanish dominant, English dominant or balanced) based upon a given assessment. Further, our results suggest that the more “balanced” bilinguals are more difficult to consistently classify into a dominance group as they demonstrate the most variability in classification across the different assessments.
This article is organized as follows: in
Section 2 we define language dominance and discuss the different ways it has been operationalized in the literature.
Section 3 presents three previous studies that have compared different language dominance assessments and introduces our research question. In
Section 4 we outline the methods used in our current study and describe the different assessments under review.
Section 5 and
Section 6 present the results and discussion, respectively. In
Section 7 we conclude the paper and illuminate avenues for future research.
2. What Is Language Dominance and How Has It Been Measured?
A wide-held assumption in bilingualism research is that bilinguals, rarely, if ever, demonstrate equal capacity in both of their languages, most likely as a result of the fact that they use their languages for different purposes, in different situations, and with different people (e.g.,
Grosjean 2016;
Treffers-Daller 2019). This difference in a bilingual’s linguistic ability (i.e., proficiency), language processing ability and/or language use is often characterized as language dominance (see
Treffers-Daller 2019 for a review). That is, bilinguals are thought to be dominant in one of their languages with respect to the other. Recall from the introduction that
Birdsong (
2014) states “in the context of bilingualism, dominance refers to observed asymmetries of skill in, or use of one language over the other” (p. 374). Following
Birdsong (
2014), asymmetries of skill can be observed in linguistic competence, language production, and language processing (what he calls dimensions of dominance), whereas asymmetries of use can be considered the varying situations and contexts in which bilinguals use their two languages (what he calls domains of dominance). Further, these dimensions and domains of language dominance (to use Birdsong’s terminology) are not entirely independent of one another.
Language dominance has been viewed as a link between the sociolinguistic factors of language use and language exposure and the psycholinguistic factors of language processing (e.g.,
Dubiel 2019;
De Bot 2001).
Dubiel (
2019) explains, “the language that bilinguals are exposed to and use more frequently becomes the language in which they can access words without pauses and hesitations, and thus it is the language they are more dominant in.” (p. 96). This connection between sociolinguistic and psycholinguistic factors can be seen clearly in the realm of child bilingualism where language dominance can be defined as “a situation where one of a child’s languages is more advanced or developing faster than the other” (
Yip and Matthews 2006, p. 4). The child’s language that is more advanced or developing faster is the language to which they are the most exposed, which then causes it to be the language the child uses more (
De Bot 2001). When measures are employed to determine the child’s language dominance, the language of most exposure and most use becomes the language in which they perform “better” in terms of psycholinguistic factors such as reaction or response time. In adult bilinguals, support for a similar effect has been found by the Weaker Links Hypothesis (e.g.,
Gollan et al. 2008), and The Activation Threshold Hypothesis (
Paradis 2004). The Weaker Links Hypothesis claims that due to not being able to speak both languages at the same time, bilinguals inevitably use their two languages less than a monolingual uses his/her one language. This reduced use leads to weaker links between semantics and phonology in the bilingual linguistic system, which in turn makes bilinguals slower to recall and produce words than monolinguals. In a similar vein, the Activation Threshold Hypothesis claims that the more a language is used, the lower the activation threshold, i.e., the easier and/or faster it is to comprehend/produce that language. In contrast, languages with a higher activation threshold are subject to attrition or language loss. In these studies, the language of most exposure and use is the language in which bilinguals are able to react or respond faster and produce more infrequent lexical items.
From these definitions, it becomes readily apparent that language dominance is not a simple unidimensional construct. Instead, language dominance can be thought of as an umbrella term for possible and/or observed asymmetries that are a result of the many linguistic, social and cognitive factors and realities of bilingualism. That being said, there is a general consensus in the field that language dominance is something that is relative, continuous and fluid (
Gertken et al. 2014). Language dominance is relative in that we compare dominance in one language (L
A) with dominance in the other (L
B). It is continuous, as opposed to categorical, in that bilinguals are not simply dominant in L
A but more or less dominant in L
A with respect to the other language and with respect to other L
A dominant bilinguals. It is also a fluid construct in that it can change over time as the language use and linguistic abilities of a bilingual change.
Given the multifaceted nature of language dominance, researchers have employed a wide variety of methods to operationalize and measure the observed asymmetries that it encompasses.
Table 1 illustrates 19 different methods that have been previously employed as a measurement of language dominance.
2Table 1 reveals several differences and commonalities between the different measurements. First, a combination of both subjective (i.e., participant providing information about self) and objective (i.e., someone/something else providing information about participants) measures have been used. For example, self-ratings, one of the most commonly used assessments, is subjective as it asks bilinguals to rate their linguistic abilities (typically presented in terms of reading, writing, speaking and listening skills) of both of their languages. A frequent criticism of self-ratings is that bilinguals may be biased for or against one of their languages. A particular group of bilinguals might be inclined to under or overvalue (i.e., rate lower or higher) one of their languages or a specific facet of their languages based upon their language or educational background (see
Section 7 for more discussion of this issue). Other common subjective measures of language dominance take the form of sociolinguistic questionnaires such as the Bilingual Language Profile (BLP) and the Bilingual Dominance Scale (BDS) or questionnaires about language use and exposure. These questionnaires often rely on the bilinguals’ memory of their perceived language use and history.
Objective measures of language dominance can take several forms as well from tasks in which bilinguals are asked to name words as fast as they can (e.g., Boston Naming Task, Multilingual Naming Task, Category Generation Task, Child HALA test) to tasks that directly measure either written or spoken language proficiency (e.g., Semantic/morphosyntax knowledge test, Oral proficiency interview). These objective measures are conducted in both languages and then the language in which a bilingual performs best (i.e., scores higher) is considered his/her dominant language.
Second, even though, as
Gertken et al. (
2014) and
Grosjean (
1998,
2001) argue that language dominance is continuous in nature, we see that it has primarily been treated as a categorical variable. That is, researchers use the results of a language dominance assessment to classify and divide bilinguals into different language dominance groups (L
A dominant, L
B dominant, balanced) instead of placing the bilinguals on a single scale of more or less dominant than each other. Further, it is interesting to note that the two assessments in
Table 1 that view language dominance as a continuous variable, The BLP and BDS, can be used to treat language dominance as a categorical variable as well. Both are questionnaires that provide a score on a given scale (see
Section 4.2 for more details). Researchers can consider language dominance on a continuum by using the scores and/or they can use the scores to group the bilinguals into different dominance groups based upon the cutoff points provided. For example, studies such as
Amengual and Chamorro (
2015) used the scores provided by the BLP to classify participants into different dominance groups, while studies such as
Amengual (
2016) used the scores provided by the BLP to treat language dominance both as a categorical and a continuous variable in two separate analyses.
Third, we see that different measures are used to test language dominance in children versus adults. Assessments such as the Mean Length of Utterance (MLU) and the Child HALA are designed specifically for use with children and are not used to measure language dominance in adults, while assessments such as self-ratings (e.g.,
Antoniou et al. 2011;
Tsui et al. 2019) and perceived accent are used primarily with adults and not with children. Other assessments, such as patterns of language use and exposure (e.g.,
Argyri and Sorace 2007) have been used with both children and adults, the key difference being that bilingual adults report their own patterns of language use and exposure while the parents of bilingual children provide that information instead of the children themselves. Additionally, we note that some assessments (e.g., Multilingual Naming Task, MINT) were developed specifically with bilinguals in mind whereas other assessments (e.g., Boston Naming Task, BNT) were developed for monolinguals and were then later adapted for use with bilinguals.
Given the multitude of assessments used and the differences between them, an important question for researchers in bilingualism is whether or not these different measures capture the construct of language dominance in the same way and on a larger scale, whether they capture language dominance at all. That is, if we want to design a research study that looks at the effects of language dominance on a given linguistic phenomenon or even if we just want to control for language dominance as an intervening variable, does it matter which method we employ to measure language dominance? Would we get different results if we used a subjective versus objective measure? Or if we treated language dominance as a categorical versus continuous variable? And if so, how do we know which assessment we should ultimately use?
In order to begin to answer these questions, we should examine (a) whether these different assessments classify an individual the same way with respect to dominance and (b) whether using a different assessment leads to a different interpretation of the results of bilingual behavior.
3. Comparing Different Language Dominance Assessments
Bedore et al. (
2012) examined language dominance in Spanish/English bilingual children. In their study, language dominance was operationalized in two ways: (1) as relative proficiency (found by comparing a proficiency score in L
A by that of L
B) and (2) the language of most exposure (determined by parental report). Children completed the BESOS (Bilingual English and Spanish Oral Language Screening,
Peña et al. 2010) in both English and Spanish, which included semantics and morphosyntax subtests. As a part of the BESOS, interviews were completed with the parents to determine the language use of the children (combination of input and output). The scores achieved by the children on the semantics and morphosyntax tests were used to classify language dominance in that a child was considered dominant in L
A if s/he scored higher on the semantics/morphosyntax test in L
A when compared to L
B. The parental interviews were also used to calculate language dominance by averaging out the reported input and output of each language; A child was considered dominant in L
A if s/he had a higher percentage of language use in L
A when compared to L
B.
The results of their analysis determined that in general, different ways of operationalizing language dominance can result in different classifications of bilingual children. Specifically, 17.1% of the children were classified differently based on input and output. Further, 51.2% of the children were classified differently based on the results of the morphosyntax versus semantics tests. As
Bedore et al. (
2012) explain “if tested at 60% usage of English, children would appear to be English dominant if given a semantics test but Spanish dominant if given a morphosyntax test” (p. 622). In other words, using the metrics of language use and performance on a semantics test, children would be classified into an English-dominant group but using the metric of a morphosyntax test they would be classified into a Spanish-dominant group. These findings suggest that it does make a difference which assessment is used to determine and classify the language dominance of participants and that this difference could lead to different interpretations of the results.
Gollan et al. (
2012) looked at language dominance in young adult and aging Spanish/English bilinguals, while
Sheng et al. (
2014) examined Mandarin/English child and adult bilinguals. As
Sheng et al. (
2014) was a replication of
Gollan et al. (
2012), both studies operationalized language dominance as relative proficiency calculated from the results of four assessments: self-ratings, scores on oral proficiency interviews (OPI), the Boston Naming Task (BNT) and the Multilingual Naming Task (MINT). Both
Gollan et al. (
2012) and
Sheng et al. (
2014) found mismatches in the dominance classification of participants across the four measures. For instance, participants tended to be classified as English dominant more via the BNT or MINT than the self-ratings and OPI. The researchers concluded that in general, the different assessments are comparable in terms of classifying bilinguals into different groups but that they are unable to capture the degree of dominance.
4 From this, we can infer that the bilinguals who tend to fall more towards the middle of the scale (i.e., they are more balanced) are more difficult to accurately classify across different language measures.
The results of
Bedore et al. (
2012),
Gollan et al. (
2012) and
Sheng et al. (
2014) indicate that in research studies on bilingualism and language dominance it is important which metric is used to operationalize and measure language dominance, as different measures seem to classify bilinguals differently, which in turn can lead to variable interpretation of the results. Together, these studies examined only six of the many different types of assessments previously used in the literature (see
Table 1). As such, their findings highlight the need for more research comparing the differences between language dominance assessments and the reasons behind why different assessments that are all supposedly measuring the same thing, language dominance, classify bilinguals differently with respect to their dominant language. This discussion brings us to the current study which aims to analyze the comparability of a different set of commonly used language dominance assessments. To that effect, we ask the following research question:
RQ: Do different language dominance measures predict the results of one another?
In order to answer this question, we examined four language dominance assessments: The Bilingual Language Profile (
Gertken et al. 2014), the Bilingual Dominance Scale (
Dunn and Fox Tree 2009), self-ratings of verbal and of written ability and a repetition task (
Flege et al. 2002). Following the results of
Bedore et al. (
2012),
Gollan et al. (
2012) and
Sheng et al. (
2014) as well as
Birdsong (
2014)’s definition of language dominance, we hypothesized that dominance measures that do not test the same domain/dimension would not predict the results of one another with respect to dominance classification.
5. Results
Our research question asked whether different language dominance assessments predict the results of one another with respect to the classification of a bilingual’s dominant language. In this section, we first present the dominance score and classification distribution for each assessment. Then, we turn to an analysis of the individual participants to determine how they were classified across the different measures based upon their individual dominance scores.
The Bilingual Language Profile calculates a language dominance score on a scale from −218 (Spanish dominance) to 218 (English dominance). In this experiment all participants scored between −46 and 102 (
M = 15.077,
SD = 45.073). Based on their score, 19 participants were categorized as English dominant, nine were classified as Spanish dominant, and one participant was classified as Balanced. Individual scores are provided in
Appendix A.
The Bilingual Dominance Scale calculates a dominance score on a scale from −32 (Spanish dominance) to 32 (English dominance). All of the participants in this experiment scored between −5 and 18 (
M = −4.727,
SD = 9.424). According to their dominance score, 20 participants were classified as English dominant, eight were classified as Spanish dominant, and one participant was classified as Balanced. Individual scores are provided in
Appendix B.
The self-ratings of proficiency provided two separate scores, one of verbal ability and one of written ability, which will be presented individually. In the self-ratings of verbal ability, all participants scored between 0.64 and 1.09 (
M = 0.90,
SD = 0.10). According to their scores, 17 participants were classified as English dominant, one was classified as Spanish dominant and 11 participants were classified as Balanced. Individual scores are provided in
Appendix C.
In the self-ratings of written ability, participants scored between 0.5 and 1.2 (
M = 0.80,
SD = 0.22). According to the self-ratings of verbal ability, 21 participants were classified as English dominant, two were classified as Spanish dominant and six participants were classified as Balanced. Individual scores are provided in
Appendix D.
In the repetition task, participants scored between 0.92 and 1.19 (
M = 1.00,
SD = 0.074). Based on this assessment, 19 participants were classified as English dominant, nine participants were classified as Spanish dominant, and one was classified as Balanced. Individual scores are provided in
Appendix E.
A visual summary of the dominance classification distribution between the five assessments is provided in
Figure 1.
To corroborate whether or not participants were consistently classified into the same dominance group across the five assessments, the individual dominance classifications of each participant were examined (
Table 3). The results show that 20 out of the 29 participants were classified differently in one or more of the assessments. 13 out of the 29 participants were categorized into two different dominance groups and seven out of the 29 participants were categorized into three different dominance groups.
After establishing the language dominance scores for all assessments, a series of correlations (
n = 10) were run between the BLP scores, the BDS scores, the self-ratings of verbal ability scores, the self-ratings of written ability scores and the repetition task scores. To correct for multiple comparisons, the Holm-Bonferroni correction was used.
Table 4 shows the pairwise comparisons with the adjusted alpha levels.
The results indicated a moderate, positive association between the BLP and the BDS (r (29) = 0.692, p < 0.001). Further, the BLP had a strong, negative association (r (29) = −0.761, p < 0.001) with the self-rating of verbal ability. The BDS and the self-ratings of verbal ability had a moderate, negative association (r (29) = −0.527, p = 0.003), whereas the self-ratings of verbal and written ability had a moderate, positive association (r (29) = 0.534, p = 0.003). No significant correlations were found between the self-ratings of written ability and the BDS (p = 0.210), or the BLP (p = 0.015). Further, no significant correlations were found between the repetition task and any of the other assessments.
6. Discussion
The goal of this study was to investigate whether different language dominance assessments produce comparable results. To answer the research question, 29 early Spanish/English bilinguals completed four dominance assessments: BLP (
Gertken et al. 2014), BDS (
Dunn and Fox Tree 2009), self-ratings of proficiency and a repetition task (
Flege et al. 2002) and the language dominance scores from each assessment were compared to one another. The language dominance measures used in this study varied in regard to the dimensions and domains of dominance they examined and as was predicted, not all of the language dominance measures provided comparable results. This finding supports the conclusions drawn by previous studies (
Bedore et al. 2012;
Gollan et al. 2012;
Sheng et al. 2014), who found a similar result in their analyses of different sets of language dominance assessments.
In our study, the BLP did significantly predict the dominance scores provided by the BDS and the self-ratings of verbal proficiency. A relationship between these three measures is to be expected because the BLP and BDS are both language questionnaires and they determine dominance based upon the same factors: language history, language use, language proficiency, and language attitudes. Further, self-ratings of verbal ability are a part of the BLP and the BDS, so it is to be expected that when self-ratings of verbal ability are considered their own assessment, they relate to the BLP and BDS.
However, unlike the self-ratings of verbal ability, the self-ratings of written ability did not correlate with either the BLP or the BDS. With respect to the BLP, this outcome is unexpected considering self-ratings of written ability are included within the BLP questionnaire and because the self-ratings of written ability correlated with those of verbal ability. In contrast, the BDS only accounts for verbal language proficiency and not written language proficiency like the BLP (
Dunn and Fox Tree 2009). The discrepancy between the dominance scores of the verbal and written self-ratings suggests that self-ratings of verbal ability capture a different facet of a bilingual’s competence than self-ratings of written ability. This might be especially relevant for heritage speakers who are well known for being more proficient in terms of speaking/listening than reading/writing the heritage language.
In this study, the dominance scores from the repetition task did not predict the scores of any other assessment. There are several notable differences between the repetition task and the other assessments that could explain this lack of association. First, the repetition task assessed dominance via a processing/production task while the other tasks assessed language dominance based on their language background/history. Second, given current research on the cost effects of codeswitching it could be the case that the design of the repetition task itself is not ideal to capture the linguistic processing dimension of dominance. In this task, participants heard and repeated sentences in both Spanish and English and the order of the languages was counterbalanced within the repetition task. That is, participants repeated two sentences in English, then two sentences in Spanish, etc. In other words, participants were, in effect, switching between their two languages throughout the task. This was done in order to replicate the repetition task as presented in the original study (
Flege et al. 2002). More recent work on code-switching has found evidence that switching from one language to another leads to processing burdens (see
Van Hell et al. 2015 for review). Bilinguals are thought to be slower to switch from their non-dominant language into their dominant language given that they have to work harder to suppress their dominant language when speaking the non-dominant language (e.g.,
Meuter and Allport 1999).
Given that the underlying assumption behind the repetition task in
Flege et al. (
2002) was that bilinguals would be more likely to repeat sentences faster in their dominant language, we see a clear conflict with the use of the switching between the two languages in the task. If the participants demonstrate the cost effects of language switching, then they would actually be slower to repeat half of the sentences in their dominant language (i.e., the ones directly following a sentence in the opposite language), which would then skew the measurements used to calculate the dominance score and therefore the classification. For the purposes of our study, we aimed to replicate the use of each assessment as close as possible to the way they have been used previously. It remains to be seen whether the output of a repetition task that did not include the convoluting factor of language switching would correlate with that of other language dominance measures. Future work on language dominance could employ a modified version of Flege’s repetition task in which participants first repeat sentences in L
A and then in L
B (and vice versa) or in which the sentences are divided into two sessions so that the participants are not forced to switch between their languages. Third, some important information needed to accurately replicate the analysis of the repetition task was not provided in
Flege et al. (
2002). For instance,
Flege et al. (
2002) mentioned that “non-fluently produced sentences were excluded from analysis” (p. 580) but no criteria about what was considered non-fluently produced sentences were given.
Even though the results of the correlation analysis indicated that the dominance scores provided by the BLP, BDS, and self-ratings of verbal ability are predictive of one another, 20 of the 29 participants were classified into different groups based upon those scores. Even if we remove the repetition task from the analysis (as it did not correlate with any other assessment), 16 of the participants were still classified differently, even across the BLP, BDS, and self-ratings of verbal ability (see
Table 3). The average dominance scores for the 16 participants whose dominance classification changed across the four measures and for the 13 participants whose classification did not change are provided in
Table 5.
We note, descriptively speaking, that the bilinguals who were more inconsistently classified are the bilinguals whose dominance scores tended to fall towards the middle of the scale (i.e., were more balanced). Recall that a perfectly balanced bilingual would score 0 on the BLP and BDS and 1 on the self-ratings. Bilinguals whose scores were at more extreme ends of the scale (i.e., more Spanish dominant or more English dominant) tended to maintain their dominance classification across the five measures. For example, Participant 2001 was classified as English dominant across the board with scores of 102, 18, 0.75 and 0.58 on the BLP, BDS, self-ratings of verbal ability and self-ratings of written ability respectively. In contrast, Participant 2003 was classified as Spanish dominant, English dominant and balanced with scores of −11, 0.916, 0, and 1 on the BLP, self-ratings of written ability, BDS, and self-ratings of verbal ability respectively. These results suggest that more balanced bilinguals are more difficult to consistently classify across different language dominance assessments.
This discrepancy in the individual dominance classification of certain bilinguals echoes the results of
Gollan et al. (
2012) and
Sheng et al. (
2014), who found similar mismatches across four different dominance measures: The Boston Naming Task, The Multilingual Naming Task, self-ratings, and oral proficiency interviews. They concluded that the dominance measures in their studies were unable to capture “degree of dominance”, thus begging the question as to whether the dominance assessments used in our current study were also unable to capture the degree of dominance of a given bilingual (i.e., the relationship of how dominant a bilingual is in a given language in comparison to other bilinguals dominant in that same language).
This is a significant finding as it suggests a difference in treating language dominance as a categorical vs. continuous variable. If you recall, the majority of assessments provided in
Table 1 treat language dominance as a categorical variable. The results of the current study show that treating language dominance as a categorical variable is problematic given that an individual bilingual will not be consistently placed into the same dominance group depending on which assessment is given. When we think about this abstractly, we can see that it makes sense. Consider the hypothetical case of three different Spanish/English bilinguals who were all given the BLP. Bilingual 1 received a score of −5 and was placed into the Spanish dominant group. Bilingual 2 received a score of 5 and was placed into the English dominant group and Bilingual 3 received a score of 67 and was also placed into the English dominant group. In this scenario, there is a good chance that based on their scores Bilinguals 1 and 2 are more similar to each other than Bilinguals 2 and 3. Yet, Bilinguals 2 and 3 will be placed into the same group whereas Bilingual 1 is in a different group. In our study, this hypothetical scenario did, in fact, take place. Following the BLP, Participant 2026, received a score of 8.72, and was classified as English dominant. Participant 2001 received a score of 102 and was also classified as English dominant. In contrast, Participant 2010 was classified as Spanish dominant with a score of −4. When we view these scores on a continuum, Participant 2026’s score is closer to that of Participant 2010 than that of Participant 2001.
Now let’s consider what would happen if these three bilinguals were a part of a research study that was examining the effect of language dominance on a given linguistic phenomenon. Using the dominance scores on the basis of a continuum, we would logically predict that Participant 2026 will behave more similarly to Participant 2010 on a given linguistic task than Participant 2001. Or we might predict that Participant 2026’s behavior should fall somewhere in between that of Participant 2010 and 2001. However, if we use these scores to classify the participants into different dominance groups, Participant 2026 will end up in the same group as Participant 2001. Following this dominance classification, we would predict that Participant 2026 and Participant 2001 should pattern more similarly on a given task than either of them should with Participant 2010 (given that they belong in two different groups). Based on this simplified example, we can see that a given bilingual’s linguistic behavior is predicted to be different based upon whether language dominance is treated as a categorical versus a continuous variable.
Further, a natural consequence of treating language dominance as categorical is the potential heterogeneity of each group. In our example, Participant 2026 and Participant 2001 are both classified as English dominant even though their individual scores fall quite far from each other. Collapsing participants across the group level (i.e., English dominant versus Spanish dominant) could potentially obscure patterns that would be otherwise more visible if the participant’s linguistic behavior was examined based upon a continuum of their dominance scores. This would allow us a more fine-grained analysis of the construct of language dominance in which we could answer questions as to whether the more dominant a bilingual is in LA the more likely they are to behave a certain way with respect to their production, perception or processing of a given linguistic phenomenon.
7. Conclusions
This study provides evidence that not all language dominance measures provide comparable results. Out of the five assessments tested, only the Bilingual Language Profile (BLP), Bilingual Dominance Scale (BDS) and self-ratings of verbal ability were found to predict the scores of one another, and the self-ratings of verbal ability predicted the scores of the self-ratings of written ability. The self-ratings of written ability did not predict the scores of the BLP nor the BDS and the repetition task did not predict the scores of any other assessment. Further, across the five language dominance measures 20 out of the 29 participants in this study were classified differently in one or more measure. Given the fact that language dominance is a construct that represents asymmetries in language ability, processing, and use, these results suggest that language dominance measures that tap into one specific domain/dimension of dominance do not necessarily predict the dominance of another domain/dimension. In other words, it could be the case that a bilingual’s dominant language is not all encompassing but rather his/her dominant language varies as a result of the domain and/or dimension in question. The implication of this result is that when comparing across studies that examine language dominance, we cannot assume, for example, that all Spanish dominant bilinguals are alike. Rather, we must ask the follow-up question “Spanish dominant in what way?” (see (
Grosjean 2016) for more discussion of the domains of language dominance). Thus, when it comes to operationalizing language dominance, we must consider what and how each individual measure tests dominance as it seems to be the case that different measures approach it differently. For instance, the BLP and the BDS are questionnaires and calculate dominance based on language history, language use, language proficiency, and language attitudes. The self-ratings treat proficiency and dominance as interchangeable constructs, and the repetition task measures dominance based on a processing task.
The difference in results between the correlation analysis of the dominance scores and the dominance classification based upon those scores, has implications for treating language dominance as a continuous versus a categorical variable. When language dominance is viewed as a continuous variable (here, the correlation analysis), a relationship was found between three of the assessments: the BLP, the BDS, and the self-ratings of verbal ability, and also between the self-ratings of verbal and written ability However, when language dominance is viewed as a categorical variable (here, the dominance classifications), 69% of the participants were classified into more than one dominance group across the five different assessments, including the BLP, BDS, and self-ratings of verbal ability. In other words, treating language dominance as categorical is problematic because a given bilingual could be classified differently depending on what language dominance assessment is being used. Particularly, the more balanced a bilingual is the greater chance s/he has of being inconsistently classified. Different dominance classifications of a group of bilinguals leads to different predictions as to how said bilinguals will behave on a given linguistic task. The potential heterogeneity of each group based upon these inconsistent classifications feeds different and possibly contradictory results which in turn obstructs successful synthesis of studies that examine the effects of language dominance on linguistic phenomena. Given that these different assessments measure different aspects of language dominance and that these aspects result in different dominance profiles, we can ask several theoretical questions. First, are all these assessments actually measuring language dominance? Second, if they are, then what is language dominance? Can we simply consider a bilingual to be dominant in LA? Or must we specify that a bilingual is dominant in LA with respect to a specific domain/dimension?
Based on the results of the current study it seems to be the case that some dominance measures are better equipped for assessing dominance in particular bilingual populations. When analyzing dominance in early and late bilinguals together,
Flege et al. (
2002) found that the repetition task and self-ratings provided consistent dominance scores. In the current study in which only early bilinguals were tested, the same result did not occur. There was no significant correlation between the dominance scores provided by the repetition task and the self-ratings. Although self-ratings of proficiency are a widely used language dominance measure (
Gertken et al. 2014), this assessment has been found to be less valuable for some bilingual populations (see
Tomoschuk et al. 2019 for discussion). Due to their linguistic environment and the limited formal instruction that Spanish/English heritage bilinguals receive, it could be the case that heritage bilinguals underrate their verbal and writing proficiency in Spanish. As a result, their self-ratings might not accurately be representative of their proficiency which might make this assessment unreliable for determining dominance in early heritage bilinguals.
In sum, in this study, we have shed light on some methodological issues regarding language dominance. For the purpose of replicability and comparability, we suggest that it is important to determine what language dominance is (i.e., work towards a consensus on a unified definition) and to develop a unified methodology for assessing it. In order to do so, it is important to consider the bilingual population and the domain and dimension being tested. We suggest that treating language dominance as a continuous variable, rather than categorical, can help mitigate some of the possible effects of using a variety of assessments across the field.
Future work will test three other dominance measures in early Spanish/English bilinguals including the Multilingual Naming Test (e.g.,
Gollan et al. 2012), a category generation task (e.g.,
Bahrick et al. 1994) and measures of spoken proficiency in the form of oral interviews (e.g., OPI). By assessing more language dominance measurements, we can begin to get an idea of which types of assessments pattern together in order to delve deeper into the construct of language dominance. Additionally, future research will test different types of bilinguals (e.g., late bilinguals) to see if we find the same issues with classifying bilinguals into dominance groups.