1. Introduction
When people make certain kinds of decisions, they are systematically sensitive to the way in which information is presented (
Tversky and Kahneman 1981). For example, when a task presents two alternatives to choose from in which some kind of gain is involved, the majority of participants tend to be conservative and select the choice that involves less potential risk. However, when the same task is presented as involving a loss, participants tend to select the riskier choice. This tendency is irrational, since the way a task is presented/framed should not have an effect on the decision. This difference is known as a framing bias or framing effect (
Tversky and Kahneman 1986,
1991;
Kahneman 2003,
2011;
Kahneman and Frederick 2007).
Subsequent research indicates that when a similar decision task is presented in a foreign language, this bias is reduced or eliminated (
Keysar et al. 2012;
Costa et al. 2014a;
Costa et al. 2017;
Ivaz et al. 2019; see
Polonioli 2018 for critical assessment)—a finding that has been referred to as Foreign Language Effect (FLe).
Keysar et al. (
2012) tested three separate populations of L2 speakers—English-L1/Japanese-L2, Korean L1/English-L2 and English-L1/French-L2—and found a reduced bias when the decision-making tasks were presented in the L2. In a follow-up study,
Costa et al. (
2014a) replicated the finding and extended it to a number of different decision-making tasks in speakers of Spanish-L1/English-L2, Arabic-L1/Hebrew-L2, and English-L1/Spanish-L2. In addition to these two studies on framing effect, a FLe has been found in a number of studies on moral decision-making: people scored higher on their tendency to maximize benefit when presented with dilemmas in their FL than in their L1 (
Costa et al. 2014b;
Geipel et al. 2015), a finding referred to as a moral Foreign Language effect (MFLe).
Three hypotheses have been put forward to account for the FLe both on cognitive biases and moral decision-making. The three hypotheses can be traced back to theories on the perception of risk and benefit, according to which judgment and decision-making happen via two routes or systems (e.g.,
Loewenstein et al. 2001;
Slovic et al. 2004): an intuitive, quick, and automatic route called System 1 and an analytical, slow, and cognitively effortful route called System 2. A decision made via the first route is considered to be based on
affect heuristic, where the positive or negative associations activated by the description of the task predetermine the decision (
Damasio 1994;
Slovic et al. 2002). By contrast, a decision made via the second route is considered to be based on normative principles (e.g., expected utility theory) and on analysis and prediction of potential outcomes.
Specifically, the first hypothesis, the reduced emotionality hypothesis (REH), rests on the idea that L2 words have a weaker and less automatic emotional effect on people, an idea supported by a number of clinical, cognitive, psychophysiological, and neuroimaging studies (for a comprehensive review, see
Pavlenko 2012;
Caldwell-Harris 2014). According to this hypothesis, the weakened emotional effect of the L2 words inhibits the quick affect-based response generated by System 1, leading to less emotional decisions. For example, people’s negative reactions to a harm-causing action (e.g., killing a person to save five) are “blunted” when presented as a moral dilemma in a FL because they are less affected by the emotional component of the dilemma, thereby allowing them to focus on the benefit-maximizing component of the action rather than on causing harm. Similarly, the linguistic description of distressing life-death scenarios in a framing bias scenario such as the Asian disease task does not trigger negative affect to the same extent when presented in the FL, thereby decreasing framing bias. In other words, thinking about saving lives or causing deaths in one’s FL is not as emotional as it is in the L1, hence is less prone to affect-based decision-making (
Keysar et al. 2012;
Costa et al. 2014a on cognitive biases; and
Cipolletti et al. 2016;
Geipel et al. 2015;
Costa et al. 2014b on moral dilemmas).
The
cognitive enhancement hypothesis (CEH) is an alternative to the reduced emotionality hypothesis. This hypothesis explains the FLe by positing that the greater cognitive load involved in FL processing forces one to slow down, thus inhibiting the quick spontaneous response generated by System 1 (
Hayakawa et al. 2017). As a result, processing is “routed” to System 2, which promotes a more analytical approach to decision-making (
Keysar et al. 2012), thereby
enhancing responses to decision-making problems, both for moral decision-making (
Hayakawa et al. 2017) and framing biases (
Costa et al. 2014b)
The third hypothesis, which in this study we will refer to as the
cognitive overload hypothesis (COH), is the exact opposite of the cognitive enhancement hypothesis in that it states that decision-making in one’s FL actually exacerbates biases by overloading the processor. According to
Costa et al. (
2014a, pp. 238–39), “Under conditions of high cognitive load participants’ decisions tend to be more affected by heuristic biases (
Benjamin et al. 2006;
Whitney et al. 2008;
Forgas et al. 2009). That is, when cognitive load taxes System 2, the rational processor cannot check or control the intuitive answers given by System 1. Hence, to the extent that reading in a FL increases cognitive load, one might expect heuristic biases to affect participants’ responses to a larger extent when the problem is set in a FL.”
Hayakawa et al. (
2017) make a similar prediction about FLe on moral decision-making.
In previous studies, the FLe has been mostly tested using FL learners (
Keysar et al. 2012;
Costa et al. 2014a on framing biases;
Costa et al. 2014b;
Geipel et al. 2015;
Hayakawa et al. 2017;
Mills and Nicoladis 2020;
Białek et al. 2019 on moral decision-making). Following
Gass et al. (
2013), foreign language learning is defined as the study of a second language in a formal classroom situation that takes place in a country where the native language is dominant. Crucially, for such FL learners the FL is both the less emotional
and the less proficient language, making it difficult to tease apart the effects of reduced emotionality and cognitive enhancement/overload. Furthermore, the limited populations tested in such studies may occlude the possibility that the FLe is a characteristic unique to FL learners and may be absent in other language populations. Thus, one of the purposes of this article—along with others that test the FLe in, for example, highly proficient acculturated bilinguals, who are not foreign language learners in the traditional sense of the term—is to test the limits of the FLe. This is an important point, because FL speakers represent just a fraction of the world’s bilingual population. Bilingualism encompasses a large and diverse set of speakers, including balanced bilinguals, unbalanced bilinguals who have learned their L2 in a context other than the classroom, and heritage speakers, and conventional studies on FLe cannot clarify what (if any) kind of language effect is present in those bilingual populations.
These studies shed more light on the issue of FLe origin and scope, but while highly proficient acculturated bilinguals in those studies are not FL learners, their first language is still always the dominant one, and thus both more proficient as well as more emotional. Therefore, we complement data from the studies on FL learners and highly acculturated bilinguals with data from a novel and principally different population: heritage speakers. Heritage speakers represent a unique language population that, to the best of our knowledge, has not been tested in the studies investigating FLe on judgment and decision-making. For the purposes of this article, we adopt the definition of heritage speakers as “individuals raised in homes where a language other than English is spoken and who are to some degree bilingual in English and the heritage language” (
Valdés 2000). While heritage speakers may come from different countries, we only recruited heritage speakers of Spanish (HS speakers) in our study, because we did not want to introduce additional variables (e.g., Spanish-speaking country of origin vs. Russian-speaking country of origin, etc.) in the design.
There are two principal differences between FL learners/highly proficient bilinguals in the previous studies and the HS speakers in our study. First, proficiency in heritage speakers’ L1 vs. L2 language is reversed compared to typical bilinguals —HS speakers are typically less proficient in their L1 than in their L2. This pattern is caused by their language acquisition history: they are born into Spanish speaking families and learn Spanish as their L1, but are then exposed to and learn English (L2) because of societal and schooling needs, and receive formal education in their L2 but generally not in their L1. As a result, English becomes their dominant and more proficient language.
Second,
emotional resonance, which “refers to the emotionality elicited by a given problem” (
Costa et al. 2014a, p. 237), should be comparable in HS speakers’ two languages, because both languages are learned in an immersion context—the L1 with family, and the L2 with friends, schoolmates, etc. By contrast, FL learners study their FL in a classroom setting and thus should have a weaker emotional connection to it. In this study, we measure the perceived “emotionality” of Spanish and English in HS speakers using an adapted version of the Emotional Phrases Task from
Caldwell-Harris and Ayçiçegi-Dinn (
2009). We expect to find no differences between Spanish and English, because a number of studies have compared early and late bilinguals and found that the reduced emotionality effect is present in late bilinguals, but it diminishes or disappears in early bilinguals (
Anooshian and Hertel 1994;
Harris 2004;
Harris et al. 2006;
Sutton et al. 2007;
Eilola and Havelka 2011;
Ferré et al. 2010;
Caldwell-Harris et al. 2011;
Ferré et al. 2018;
Ivaz et al. 2019;
Miozzo et al. 2020). That is, if the L2 is acquired earlier in life and in a naturalistic setting, it evokes emotional resonances similar to the L1. For example, Spanish–English early bilinguals who had learned their L2 (English) before puberty and often in a naturalistic environment (studied in the USA and used English regularly) in
Ferré et al. (
2010) recalled emotional words at the same rate in L1 and L2.
Harris (
2004) compared 31 early Spanish-English bilinguals who were born in the US or immigrated to the US before the age of 7 and 21 late bilinguals who arrived in the US at or after the age of 12. Early bilinguals rated themselves as either balanced bilinguals or dominant in English, while late bilinguals’ most proficient and likely most dominant language was Spanish. Results showed that L1 and L2 had similar emotional strength for early bilinguals, and the author concluded that the L1 is only perceived to be more emotional if it is the more proficient language.
Similarly, in
Caldwell-Harris et al. (
2011), late Mandarin–English bilinguals rated the L1 Mandarin to be more emotional, but this language effect disappeared in early bilinguals, who rated the two languages as equally emotional. Moreover,
Caldwell-Harris et al. (
2012) explored perceived language emotionality of L1 Russian L2 English speakers who arrived in the USA at different ages or learned Russian as their second language. A comparison of three groups—Russian native speakers who arrived to the USA before the age of 10 (early arrivals), Russian native speakers who arrived to the USA after the age of 10 (late arrivals), and L1 English speakers who learned Russian as a foreign or second language—showed that perceived emotionality of Russian was the highest for late arrivals, followed by early arrivals, and was the lowest in the L2 group.
In a recent behavioral study,
Ferré et al. (
2018) administered a lexical decision task (LDT) and an affective decision task (ADT) to highly proficient balanced Catalan-Spanish bilinguals in both of their native languages, and to a group of Catalan-Spanish bilinguals in their FL English. Language effect was found only in the FL group (when the tasks were performed in English), but not in the Catalan-Spanish groups. Since Catalan and Spanish were acquired in early childhood in a naturalistic environment, while English was studied as a foreign language, these results support the idea that languages acquired early in life have comparable emotional resonances. The authors also suggest that FL words are not as grounded in sensorimotor experiences as L1 words.
Also relevant is
Ivaz et al. (
2019) on self-bias and non-nativeness vs. foreignness. The authors employed self-bias paradigm to establish whether the foreign language effect came from non-nativeness or foreignness of a language. They tested Spanish native speakers who were born and raised in the Basque Country and who spoke Basque and English with relatively high and approximately equal proficiency. Crucially, while both Basque and English are not their native languages, the participants had learned the former in an immersion context by virtue of living and working among people who speak it as a native tongue, while they learned English in a largely impersonal and unemotional formal classroom context. The results showed a reduction of self-bias in the non-native foreign (English) language, but not in the non-native local language (Basque), which indicates that the FLe is caused by foreignness, not non-nativeness of a language.
In fact, recent discoveries in neuroscience research indicate that, provided the right context, one may be able to continue building emotional resonances in the L2 well into adulthood.
Sorrells et al. (
2019) found a group of neurons in the paralaminar nuclei (PL) of the human amygdala—the center of emotional processing in the brain—that remain immature late into adulthood. Most of these neurons rapidly mature in adolescence, which can account for the tumultuous development of emotional intelligence in teenagers, but some of them remain immature throughout life, thus possibly allowing the brain to remain flexible as far as emotional processing is concerned (
Sorrells et al. 2019). This suggests that the reason the L2 is typically less emotional does not have to do with some kind of neurological maturational constraint but rather with the absence of emotional stimulation. Most importantly for the purposes of this study, it suggests that our HS speakers, who were exposed to their L2 at an average age of 4.9 and learned it in a naturalistic environment, should have been able to build similar emotional resonances in their L1 and L2.
Thus, since HS speakers’ second language is almost always the more proficient one, and, presuming that they have comparable emotional resonances in both of their languages, the population should be distinct from the FL learners and highly proficient acculturated bilinguals in the recent studies on Moral FLe (see
Table 1). Studying this unique population allows us to tease apart the effects of emotionality and cognitive load.
If the FLe is caused by reduced emotionality in the FL (the reduced emotionality hypothesis), we would not expect to observe a language-associated reduction of decision-making biases in heritage speakers, since they have similar emotional resonances in both of their languages. Alternatively, if the FLe is caused by cognitive enhancement (the cognitive enhancement hypothesis) due to a more deliberate processing caused by the lower proficiency in the L1 (Spanish), the FLe should be present in HS speakers’ less proficient L1, in contrast with the typical L2 populations where it is present in the less proficient L2. Finally, if the cognitive overload hypothesis is right and lower proficiency in a language actually exacerbates rather than reduces decision-making biases, the bias reduction should be present in the HS speakers’ more proficient L2, since higher proficiency in the L2 (English) should lead to more rational results.
In addition to complementing the existing data on the FLe with data from a novel population of HS speakers, we also complement it with data from emotion-laden vs. emotion-neutral tasks. Task type is an important variable when considering the FLe for the following reasons: if the FLe is caused by reduced emotionality in the foreign language, it should only be present in tasks that involve emotionality, but if it is caused by cognitive load, it should be present in any task that involves cognitive processing. Therefore, in order to establish whether the FLe stems from reduced emotionality or cognitive enhancement/cognitive overload it is critical to employ tasks that involve both emotional and emotionally-neutral problems.
Costa et al. (
2014a) and
Vives et al. (
2018) included such problems in their studies of FL learners, but the recent studies that specifically looked at the FLe in highly proficient acculturated bilinguals employed only emotion-laden tasks. For example,
Miozzo et al. (
2020) and
Dylman and Champoux-Larsson (
2020) used the Asian Disease Problem and the Footbridge Dilemma,
Brouwer (
2020) used personal and impersonal dilemmas, and
Čavar and Tytus (
2018) used a number of moral dilemmas from
Bartels (
2008). In other words, as of yet early bilinguals have not been tested on unemotional tasks.
For that reason, in this study we tested our HS speakers on both emotion-laden and emotion-neutral tasks. The former may trigger negative affect (unpleasant emotions such as anxiety, fear, shame, guilt, irritability, etc.) (
Watson et al. 1988), while the latter typically do not produce such an effect on people. Examples of emotion-laden tasks include those involving risk aversion and loss aversion, moral dilemmas, etc., while examples of emotion-neutral tasks include those testing the outcome bias, the conjunction fallacy, disjunction fallacy, base-rate neglect fallacy, the cognitive reflection test, etc. We refer to the former as “cognitive-emotional” problems and to the latter as “purely-cognitive” problems. Recruiting the population of HS speakers and employing these two types of tasks allows us to test the three hypotheses —reduced emotionality, cognitive enhancement, and cognitive overload—and make the following predictions. First, if the reduced emotionality hypothesis is correct, and given the above-mentioned findings showing comparable emotionality in the two languages of early bilinguals, there should neither be language-associated bias reduction in cognitive-emotional nor in purely-cognitive tasks in our HS speakers. Second, if the cognitive enhancement hypothesis is correct, HS speakers should display language-associated bias reduction in their less proficient L1 (Spanish), where their lack of proficiency would make them slow down and give a more deliberate response across the tasks (because all tasks involve some kind of cognitive effort). This is the opposite of the typical FL populations where the bias reduction is present in the FL, because the FL is always less proficient in such populations. Third, if the cognitive overload hypothesis is correct, bias reduction should be found in the more proficient L2 across the tasks, unlike in the typical FL populations where it should be present in the more proficient L1.
Table 2 summarizes these hypotheses, and the present study will allow us to test them with respect to the HS speakers (rightmost column of
Table 2).
Complementing existing research on FL learners and highly-proficient bilinguals, our study of HS speakers will provide additional insights into the origin and scope of the FLe, explore language effects on a language population other than FL learners, and will make it possible to tease apart the effects of higher emotionality and higher proficiency. Therefore, in this paper, we contribute to the discussion of the FLe by extending the decision-making experiments to bilingual heritage speakers of Spanish (HS) who are dominant in English, as well as employing decision-making tasks that both involve an emotional component and those that do not. In addition, ours is the first study after
Costa et al. (
2014a) to explore the FLe on several cognitive biases rather than moral dilemmas.
4. Discussion
This study contributes to the discussion of FLe by providing data from the novel and principally distinct population of Spanish heritage speakers (HS speakers). FLe refers to an observed bias reduction in decision-making tasks administered in a foreign language. As noted earlier, initial studies with FL learners found FLe in decision-making tasks and moral dilemmas; this FLe has been accounted for by a reduced emotionality in the FL. More recent studies have engaged populations of non-FL speakers such as highly proficient acculturated bilinguals to determine whether FLe is (1) generalizable to a different population of language speakers, and (2) caused by reduced emotionality in one of the languages. Our study extends this research program to a unique population of L1 Spanish/L2 English HS speakers using the standard battery of judgment and decision-making tasks. In the next few paragraphs, we discuss results from EPT (Emotional Phrases task) first, followed by the findings on cognitive biases in general, and finally findings on FLe.
We acknowledge that emotionality has different dimensions that may not be immediately captured by behavioral tasks such as the one used in this study. Many studies on emotionality and language supplemented emotionality ratings with psychophysiological measures such as skin conductance tests (
Harris et al. 2003;
Harris 2004;
Eilola and Havelka 2011). In fact, some report a dissociation between the findings from behavioral measures vs. psychophysiological ones. For example,
Harris et al. (
2003) found no differences between the L1 Turkish and L2 English ratings of emotional words, but they did find significantly stronger skin conductance responses in Turkish. In short, EPT may be informative, but it certainly is neither exhaustive nor fully reliable in all cases.
Despite these considerations, our EPT results are consistent with 1) data from a significant number of research studies that show no differences between emotional resonances in the two languages of early bilinguals, and 2) with the general consensus among scholars that there is a comparable emotional intensity of the two languages spoken by an early bilingual. For these reasons, we may be able to conclude that our HS speakers have similar emotional reactions to their two languages in general. Beyond this broad claim, there could be a myriad of variables that affect emotionality. For example, different linguistic stimuli may trigger variable degrees of emotionality in the two languages depending on the language in which the stimuli were first and/or mostly experienced. For example, childhood reprimands would potentially be experienced by HS speakers more emotionally in the L1, whereas romantic endearments would be so in the L2 because these are the languages in which they were most likely to encounter them. This idea is supported by studies showing that autobiographical memories are recalled more easily in the language in which they were experienced (
Marian and Neisser 2000). Overall, emotionality is a highly complex concept and, as
McFarlane and Cipolletti Perez (
2020) argue, one that is hard to measure in the absence of a predictive and generalizable theory of emotion and specific emotion types.
Second, as far as our findings on cognitive biases in judgment and decision-making (JDM) are concerned, they are consistent with previous monolingual literature: HS speakers clearly exhibit cognitive biases across all tasks in both languages. Specifically, in cognitive-emotional tasks they fall victim to framing biases, show risk aversion, and have a preference for local vs. global accounting. In purely cognitive tasks, they tend to provide intuitive rather than rational answers on CRT and provide answers that violate probability laws in the Disjunction fallacy task. Thus, this study is the first one to demonstrate cognitive biases in heritage speakers. While there is no reason to suspect that cognitive biases do not apply to heritage speakers, this was an empirical question that we have addressed in this paper.
Third, as far as FLe is concerned, we found a general tendency toward bias reduction in the L2; however, this tendency was not statistically significant in cognitive-emotional tasks, a finding that differs from some previous studies on FL effects. For purely-cognitive tasks (CRT and Disjunction fallacy) the results were more complex: language effect was not significant for the CRT, but it was significant for the Disjunction fallacy. More specifically, CRT correct responses were more likely to be given in English, with the increase in correct responses corresponding to a decrease in “other” responses, rather than to a decrease in intuitive ones. It is possible that participants are more used to doing mathematical calculations in English, since for most of them, this would have been the language of their schooling beginning at an early age. Crucially, we did find a significant effect of language of presentation on Disjunction fallacy responses—there was a statistically significant reduction of incorrect (intuitive) responses in the L2 compared to the L1. In other words, HS speakers answered the Disjunction fallacy problem more correctly in their second, more proficient language. Moreover, there was a significant effect of proficiency in Spanish on the responses, such that higher proficiency was correlated with more correct responses when the task was presented in Spanish.
We will now discuss these results in light of each of the three hypotheses proposed in this paper. The predictions from
Table 2 are partially reproduced in
Table 18.
Our findings are not fully compatible with the reduced emotionality hypothesis for two reasons. First, if the cause of FLe on JDM were reduced emotionality in the L2, we should essentially find no differences at all in our HS population, given that their two languages are comparable in terms of emotionality. Second, even if our EPT is incorrect and our HS speakers do have unequal emotionality in one of their languages, the FLe should be present only in tasks that involve emotionality—the cognitive-emotional tasks. Although we found a tendency for English to reduce biases, we did not find any significant differences between the two languages as far as JDM in cognitive-emotional tasks was concerned, and we found a significant effect of language of presentation and proficiency in one of our purely-cognitive tasks (Disjunction fallacy). Nonetheless, we are cautious about making strong claims with respect to the reduced emotionality hypothesis, because our HS speakers are presumably equally emotional in both languages, so the null effect could both serve as evidence against the hypothesis as well as a mere indication that we do not have enough participants to find an effect of emotion.
Our findings are also not compatible with the cognitive enhancement hypothesis, since it predicts a bias reduction in the less proficient language, and our results show the opposite effect. Similarly, it predicts that higher proficiency will lead to increased biases, which runs contrary to our findings: lower proficiency in Spanish led to significantly lower scores in the Disjunction fallacy.
Finally, our results are most consistent with the cognitive overload hypothesis. Recall that this hypothesis states that the ability to provide a rational response should be affected in the less proficient FL, and that researchers hypothesized that these reduced rational responses were due to the increased processing load in that language. Since English is the dominant and more proficient language for our heritage speakers, we would expect fewer cognitive-load effects in that language. While we did not find significant differences in most tasks, we did find them in one of the tasks, where the HS speakers responded overall more correctly in their more proficient language (L2 English). Furthermore, in the less proficient language (L1 Spanish), those with higher proficiency responded more correctly than those with lower proficiency.
One might object that if the cognitive overload hypothesis were correct, we would find language of presentation effects in more tasks, not just one of them, and that proficiency would have a larger effect on bias reduction. We acknowledge that finding a stronger effect across more tasks would make a more robust case for the cognitive overload hypothesis, and we suspect that we did not find it for several reasons. First, our HS speakers may not have low enough proficiency to suffer sufficient overload to affect their responses. Studies have shown that FL learners are affected by processing load (
Meuter and Allport 1999), but that early balanced bilinguals are not (
Costa and Santesteban 2004;
Costa et al. 2006). Although our HS speakers are early bilinguals and are more proficient than FL learners, they are still not balanced in both languages; hence, they might represent an intermediate case between FL learners and early balanced bilinguals, where processing load still affects them, but to a lesser extent than FL learners. Moreover, our HS speakers’ proficiency fell on a continuum from very high to very low and it certainly could have introduced variability in degrees of cognitive load. Having proficiency groups with consistently low proficiency (but sufficient to understand the tasks) might have revealed significant effects across more tasks. This is an empirical question that future research can investigate. In addition, as an anonymous reviewer suggests, the foundational methodologies of this study could be strengthened through a rigorous proficiency measure of the L2 (English). Although we assumed that our participants were native speakers—since they were exposed to English in early childhood and all of their schooling was in English, including higher education—and we thus expected them to perform at ceiling on English proficiency tests, future studies of this nature might benefit from incorporating an additional linguistic measure of the heritage speakers’ L2 skills beyond self-ratings.
Another possible interpretation of these results is that language effect does not apply at all to heritage speakers: if it can be shown that this is due to their being equally emotional in both languages, it would provide support for the reduced emotionality hypothesis, and this is the reason why we do not claim to have refuted it. We also acknowledge that the statistical power of this study is lower than some of the previous ones. For a medium size effect, 47 participants per language results in .73 statistical power. Finally, it is possible that reduced emotionality, cognitive enhancement, and cognitive overload operate simultaneously, producing a combined language effect. In any case, we should assume that any language effect should be as unique as the individual bilingual speaker, and should depend on their specific linguistic profile (language proficiency, dominance, emotionality, etc.).
Nevertheless, the contribution of the present study to the FLe research is tangible: first, we have explored language effects on decision-making in a novel bilingual population of heritage speakers; second, we addressed the issue of confounding emotionality and proficiency inevitable in FL learners; and, finally, we showed that if a language effect exists in HS speakers, who have equal emotional resonances in both Spanish and English, it would be caused by cognitive overload in the less proficient language rather than by cognitive enhancement. This is consistent with recent moral decision-making research suggesting that low proficiency in a language is not correlated with heightened utilitarianism (
Hayakawa et al. 2017;
Muda et al. 2018;
Białek et al. 2019).
Lastly, we would like to suggest potential avenues for future research on FLe. One future direction is the direct measurement of cognitive load and emotionality during the process of decision-making. While cognitive load and emotionality have been suggested as key factors causing FLe on decision-making tasks, no studies, to the best of our knowledge, have tested the amount of cognitive load or emotional reaction
during the decision-making tasks. It would be enlightening to measure both the amount of cognitive load as well as the intensity of emotional response induced during the L1 vs. FL presentations of the task: if cognitive load is reliably higher in the FL condition than in the L1 condition, one can conclude that FLe is related to the amount of cognitive load induced by the FL. Some objective measures of cognitive load include pupillometry (eye-tracking), brain activity measures such as MRI and fNIRS, EEG and cardiovascular metrics; while subjective measures include self-reports of stress or mental effort (
Martin 2014). Similarly, it would be highly informative to measure emotionality level during the task. While we did employ a measure of emotionality (Emotional Phrases Task) to get insights into which of the two languages overall is perceived as more emotional by our HS speakers, we did not measure emotionality induced by the task itself. As mentioned before, some studies on language and emotionality used skin-conductance tests (
Harris et al. 2003;
Harris 2004;
Eilola and Havelka 2011), but none of them studied the FLe as it specifically impacts decision-making. FLe research would also benefit from extending it to the other types of heuristics, such as fast and frugal heuristics (
Chen et al. 2015). This is because most research on FLe to date, including ours, is based on Tversky and Kahneman’s program, which considers heuristics to be a liability rather than a tool, but this assumption represents just one of many views on heuristics within the philosophy of judgment and decision-making.