Basque has an apico-alveolar /s̺/, a lamino-alveolar /s̻/ and a prepalatal /ʃ/ sibilant that are represented by the letters <s> as in
hasi ‘begin’, <z> as in
hazi ‘to grow up’ and <x> as in
xake ‘chess’, respectively. The difference between these sibilants is that they have three different places of articulation, and the Academy of the Basque Language recommends speakers to distinguish them [
1]. Nevertheless, there are speakers of Basque who do not keep the distinction between the three sibilants and merge them to some extent. According to Hualde [
2], the three sibilants are maintained in some varieties of Basque, such as the Basque spoken in Goizueta, Navarre, but the apico-alveolar and the lamino-alveolar sibilants have merged in Biscay, some areas of Guipuzcoa, and the Basque-speaking territories of Alava. This merger of the apico-alveolar and lamino-alveolar sibilants is not a new phenomenon, as Michelena Elissalt claims that it was already present centuries ago in Biscay and it was in fact completed by 1961 in Biscay (with the exception of Markina and Bolibar), and in some urban areas of Guipuzcoa [
3]. Michelena Elissalt argues that the merger originated in Bilbao and that, from there, it spread to different territories [
3]. However, Ulibarri Orueta finds that the sibilant merger was present in texts from Vitoria, a city in the southern Basque Country, in the 16th century and gives evidence for a possible southern origin of the merger [
4].
This study aimed to present an acoustic analysis of the sibilants in a Biscayan variety of Basque and to analyze the characteristics of the merger, its current expansion and the factors that may condition it including the language dominance, either Basque or Spanish, of the speakers. More specifically, the study analyzed the merger between the apico-alveolar and the lamino-alveolar sibilants in detail and investigated whether the prepalatal sibilant resists the merger or not. In order to analyze the production of the three sibilants, the study presents results from an acoustic analysis including the center of gravity (COG) of the sibilants. The center of gravity correlates with the different articulatory configurations among fricatives, which result in distinct places of articulation [
5], and the analysis of this acoustic cue allows us to determine whether the sibilants are merged or not. The COG also allows us to establish the place of articulation of the fricative resulting from the merger, i.e., whether it is a more lamino-alveolar, apico-alveolar or prepatal sound.
The geographical focus of the present study was Amorebieta-Etxano, a village in Biscay. The study of the merger in Amorebieta-Etxano is of interest because it can provide information about the expansion of the phenomenon given its location close to the city of Bilbao and the town of Gernika, where the merger seems to be common only among younger speakers [
2]. The present study analyzed the production of the sibilants by ten speakers aged between 20–33 years from Amorebieta-Etxano. The language dominance of these speakers was measured through the Bilingual Language Profile (BLP) Questionnaire [
6], a tool that allows us to measure language dominance in a standardized manner.
Overall, the results of the present study show that the sibilant merger occurs in Basque, as expected, and that, at least among younger speakers, this merger depends on the speakers’ degree of dominance in Basque or Spanish. More precisely, Basque-dominant participants merge two of the three sibilants and Spanish-dominant ones merge the three sibilants into a single one. This is the first acoustic study that explores the effect of language dominance quantitatively on the sibilant merger in Biscay, and it is a novel approach to the question of how the degree of bilingualism affects the merger in Basque. Moreover, it provides a description of the three sibilants with different places of articulation, which may be useful to analyze and describe the phonemic inventories of other languages with rich sibilant inventories.
1.1. Bilingualism and Bilingual Language Profile
Gutiérrez-Clellen defines the term bilingualism as the “knowledge of two languages” [
7] (p. 291). Valdés and Figueroa, on the other hand, define it as “a condition that makes it possible for an individual to function, at some level, in more than one language” [
8] (p. 8). With regard to Basque–Spanish bilingualism, the Basque Institute of Statistics, also known as Eustat, classifies speakers in the Basque Autonomous Community as follows [
9] (p. 32):
Bilingual speakers: People who speak and understand Basque well or reasonably well.
Passive bilingual speakers: people who are able to understand Basque although they cannot speak it or they speak it with difficulty.
Monolingual speakers of Spanish: people who cannot understand or speak any Basque.
It should be noted here that Eustat does not list monolingual speakers of Basque, due to their “virtually complete disappearance” [
10] (p. 11). According to the census data of the year 2011, there were 2,056,136 inhabitants in the Basque Autonomous Community (henceforth BAC) and 749,182 of them were bilingual speakers. That is, they were able to communicate in both Basque and Spanish. Furthermore, 910,032 were monolingual speakers of Spanish, whereas 396,922 were passive bilingual speakers [
9]. The Basque Institute of Statistics also provides reports on the degree of bilingualism in different towns and cities in each province of the BAC. In the last report of 2011, the town of Amorebieta-Etxano, the location of the present study, had a population of 17,581 inhabitants and the majority of them were bilingual (9308), whereas there were 3995 passive bilingual speakers and 4278 monolingual speakers of Spanish [
9].
These census data report information on the overall linguistic competence in the BAC, and in its provinces and villages. Nevertheless, the classification of bilingual speakers provided by the census data has its limitations. Most notably, it does not reflect the gradient nature of the language dominance among bilingual speakers, i.e., the fact that speakers might be more or less dominant in one or the other language. Birdsong et al. consider that it can be difficult to assess language dominance among bilingual speakers and they claim that, for that reason, researchers have relied on interviews, speakers’ language choice, proficiency tests, psycholinguistic tasks and experiential and psychosocial criteria to classify speakers as dominant in one language or the other [
6]. However, Gertken et al. claim that there are some “reliable” and “widely accessible” self-report tools now that enable researchers to measure language dominance in a more gradient, non-binary manner [
11] (p. 213). Some of the newest questionnaires are the Language Experience and Proficiency Questionnaire (LEAP-Q), the Bilingual Dominance Scale (BDS), and the Self-Report Classification Tool (SRCT) [
11]. The LEAP-Q, BDS and SRCT are all self-report questionnaires with questions about language experience, proficiency and in the case of the LEAP-Q, linguistic attitudes. The advantages of these questionnaires are that they allow researchers to collect data on bilingualism quickly (5–25 min) and they provide a description of speakers’ bilingual profile (LEAP-Questionnaire), speakers’ continuous dominance scores (BDS Questionnaire) or “discrete dominance groups” (SRCT Questionnaire) [
11]. Nevertheless, there are some limitations when using these questionnaires [
11]. Firstly, Gertken et al. claim that the LEAP-Q has many items, and that some of them are long and complex. Secondly, the BDS questionnaire has items that are not equally weighted. Finally, the SRCT questionnaire was designed to fit Mandarin–English bilingual speakers living in Singapore and it is difficult to apply to other bilingual speakers. Beyond the previously mentioned disadvantages, the questionnaires contain some ambiguous questions or free response questions, which result in a wide variety of responses that are difficult to measure or quantify [
11].
Birdsong et al. [
6] and Gertken et al. [
11] propose the Bilingual Language Profile or BLP Questionnaire as an alternative tool to overcome the limitations of previous research methodologies. This BLP Questionnaire was created based on the LEAP-Q, BDS and SRCT. The main advantages of the BLP Questionnaire are that it is self-scored, items are equally weighted, it avoids free response questions, and it takes less than 10 min to complete. Furthermore, the BLP Questionnaire shows the gradient nature of language dominance in a systematic manner. Gertken et al. claim that measuring the gradient nature of dominance is essential because “a person is not simply dominant in a given language, but is dominant in that language to a certain measurable degree” [
11] (p. 208).
The BLP Questionnaire has 19 multiple-choice questions and measures the following factors: (I) language history, (II) use, (III) proficiency and (IV) attitudes.
Section 2.1 summarizes the results of these four factors for all the participants in the study. The BLP provides numeric scores of dominance from −218 to +218 that allow researchers to measure dominance in a standardized way and place speakers within that continuum. Positive scores indicate dominance towards one language, whereas negative scores indicate dominance towards the other language. It should be noted here that any of the two languages can be placed at any of the extremes: −218 or +218. That is, Basque can be placed at the −218 extreme or at the +218 extreme, and the same could be done with Spanish. These numeric scores are a system to measure dominance systematically and to allow researchers to compare the BLP scores to the ones in other studies. Furthermore, these scores are based on the detailed evaluation of the four factors mentioned above and they capture the linguistic reality of speakers better than other approaches based on only one or two factors, as they do not rely only on speakers’ linguistic proficiency or language use. Several studies have successfully used the BLP to measure language dominance among bilinguals of Spanish and another language, including Baird [
12], Coetzee et al. [
13], and Amengual and Chamorro [
14].
In the present study, the ten participants received a dominance score based on their BLP Questionnaire responses, which makes it possible to analyze the effects of language dominance on the production of Basque sibilants. This is the first time that this methodology has been used in a study of Basque sibilants to quantify the degree of bilingualism among bilingual speakers of Basque and Spanish, as previous studies on sibilants have relied on speakers’ origin, and the experimenters’ criteria in order to classify speakers as dominant in one language or the other [
15]. The advantage of using the BLP Questionnaire in this study is that it allows for measuring informants’ language history, use, proficiency and attitudes in a more standardized manner, which makes this study replicable.
1.2. Basque Sibilants
According to Ladefoged and Maddieson, fricative sounds are those in which speakers produce a turbulent air stream [
16]. The turbulence of the air stream is determined by two factors: the size of the channel and the volume velocity of the air stream [
17]. More precisely, turbulence is more likely to arise when the constriction of the vocal tract is narrow and when the air velocity is high. Ladefoged and Maddieson classify fricatives depending on whether the turbulence is generated in the constriction or when the airstream is directed against an obstruction like the edge of the teeth. They define sibilants, the sounds that are analyzed in the present study, as fricatives that are generated when the airstream is directed against an obstruction [
16]. Some varieties of Basque, such as Goizueta Basque, have three sibilants. These three sibilants are voiceless, although they undergo regressive voicing assimilation when followed by a voiced consonant, as in the word
esne ‘milk’, which is pronounced as [ˈez.ne] [
18] (p. 24). These sounds have been widely described, but there is no consensus on the terms and International Phonetic Alphabet (IPA) symbols used to describe them. Jurado Noriega describes the sibilants as “alveolopalatal” or “palatalized postalveolar” /ɕ/ (letter <x>),“apico-postalveolar” /ʂ/ (letter <s>), and “predorso-dentoalveolar” /s̻/ (letter <z>) [
15]. Hualde classifies the Basque sibilants as being “dorsum-prepalatal” (<x>), “apico-alveolar or alveolar” (<s>), and “lamino-alveolar” (<z>) [
2] (p. 90). In general, most authors use the feature of apical/ laminal to characterize the distinction between the two front sibilants (<z> vs. <s>) [
15]. This feature is used to describe which part of the tongue—the tip of the tongue or the blade—is used in the production of the sibilants. Following this approach, Basque sibilants are classified as apical if the tip of the tongue is elevated and as laminal if it is not elevated [
15]. Hualde uses this feature and describes the sibilants as being “prepalatal” (<x>), “apico-alveolar” (or “apico-postalveolar”) (<s>), and “lamino-alveolar” (<z>) [
18]. The IPA symbols that Hualde uses to represent the sibilants are the following ones respectively: /ʃ, s̺, s̻/ [
18] (p. 22). Likewise, Egurtzegi also uses this apical/ laminal feature to describe the Basque sibilants in a more recent study [
19]. The present study uses the same laminal/ apical description for the Basque sibilants as Hualde [
18] and Egurtzegi [
19], as can be seen in
Table 1.
1According to Jurado Noriega, the apico-alveolar sibilant is produced when the tip of the tongue is elevated towards the alveolar ridge and the blade of the tongue is also elevated [
15]. Based on Alonso [
21], Jurado Noriega claims that the lamino-alveolar sibilants, on the other hand, are produced when the tip of the tongue and the internal surface of the lower incisors are in contact, and the blade of the tongue is elevated towards the alveolar ridge [
15]. With regard to the prepalatal sibilants, the tip of the tongue touches the gums and the base of the lower teeth, and the form of the tongue is convex [
15].
According to Hualde, the most conservative varieties of Basque maintain the three-way distinction among the sibilants [
2]. In Goizueta Basque, for instance, many speakers keep the three sibilants distinct in terms of their spectral peak, COG, and skewness. Hualde finds a correlation between the spectral peak and the articulation of the sibilant so that a higher spectral peak corresponds with a more fronted articulation [
2]. Therefore, in the conservative varieties of Basque, the lamino-alveolar sibilant has the highest spectral peak of all the sibilants and the prepalatal sibilant has the lowest spectral peak. As for the spectral peak of the apico-alveolar sibilant, it falls between that for the lamino-alveolar and the prepalatal sibilants. Hualde’s results show that the lamino-alveolar sibilant has the highest COG of the three sibilants (6645 Hz) and that the prepalatal sibilant has the lowest COG (3531 Hz) [
2]. The COG values of the apico-alveolar sibilant (4173 Hz) fall between those for the lamino-alveolar and the prepalatal sibilants. It should be noted here that the apico-alveolar and the prepalatal sibilants are more similar in acoustic terms than the lamino-alveolar and the prepalatal sibilants. Even though Hualde’s analysis only shows the results for a single speaker from Goizueta [
2], his results are in agreement with what we would expect based on other studies. Jurado Noriega analyzes the Basque sibilants in the Guipuzcoan territories of Donostialdea and Bidasoa, where the sibilants have been kept distinct, and shows that the COG is higher for the lamino-alveolar sibilant (<z>) than for the apico-alveolar sibilant (<s>) [
15]. Likewise, Iglesias et al. analyze the Basque sibilants as pronounced by a speaker from Beizama, Guipuzcoa [
20], and report similar results to Hualde [
2] for the COG values. Iglesias et al. analyze the production of the nonce-words
aza, axa, asa from a reading task and they find that the lamino-alveolar sibilant has the highest COG of the three sibilants (14,452 Hz) and the prepalatal sibilant has the lowest COG (5966 Hz) [
20]. As in Hualde [
2], the COG values of the apico-alveolar sibilant (8081 Hz) fall between those for the lamino-alveolar and the prepalatal sibilants.
Results from Basque studies are in agreement with cross-linguistic work on fricative differences and COG values. Jongman et al. analyze English fricatives and they claim that the prepalatal sibilant /ʃ/ has lower COG values than the alveolar sibilant /s/ [
23]. Likewise, Gordon et al. analyze the COG values of several fricatives in languages such as Chickasaw (a Muskogean language spoken in Oklahoma), Western Apache (spoken in Arizona), Scottish Gaelic (spoken in Scotland), Hupa (spoken in California), Montana Salish (spoken in Montana), and Toda (spoken in India) and report similar results [
24]. More precisely, they find that the prepalatal sibilant /ʃ/ in Chickasaw, Western Apache, Gaelic, Montana Salish, and Hupa has lower COG values than the alveolar sibilant /s/. The authors also observe that the COG values of the lamino-alveolar sibilant /s̻/ in Toda has significantly higher COG values than the apico-alveolar /s̺/ and the prepalatal sibilants /ʃ/ [
24], which is in agreement with the COG values in Hualde [
2].
1.3. Basque Sibilant Merger
Many varieties of Basque have lost the distinction among the three Basque sibilants and they present only a two-way distinction among these fricatives. In those varieties of Basque, the lamino-alveolar sibilant is merged with the apico-alveolar sibilant and realized as an apico-alveolar sibilant, while the distinction between the apico-alveolar and the prepalatal sibilant is still maintained. This type of merger process between the apico-alveolar and the lamino-alveolar sibilants is widely spread and seems to have concluded in some areas of Biscay and the Basque-speaking areas of Alava [
2]. In other territories, the sibilant merger is adopting a different tendency. In the Guipuzcoan towns of Azpeitia and Azkoitia, for instance, the tendency is to also merge the apico-alveolar and the lamino-alveolar sibilant, but the resulting production is a lamino-alveolar sibilant, and not an apico-alveolar sibilant [
2]. As one can see, the merger of the lamino-alveolar and the apico-alveolar sibilants seems to be spreading in different manners through the Basque territories. Moreover, Hualde claims that, although the prepalatal sibilant (<x>) is kept distinct from the lamino-alveolar and the apico-alveolar sibilants in many territories, younger generations of speakers in certain areas merge the three Basque sibilants into a single one [
2]. Hualde claims that speakers from Gernika, for instance, tend to merge the three sibilants to an apico-alveolar sibilant if they were born after 1980 [
2].
When analyzing the sibilant merger, it is important to take into account that Basque is in contact with the Castilian variety of Spanish, which has four voiceless fricative phonemes: /f θ s x/ [
25] (p. 147). That is, Castilian Spanish has an apico-alveolar sibilant that is represented by the letter <s> [
25].
2 This Castilian sibilant has been acoustically measured in several studies. Medina del Moral and Romera, for instance, analyze the Spanish sibilant as pronounced by 44 informants from Navarre and claim that its mean COG value is 3525 Hz [
26] (p. 43). One may think that the sibilant-merger is taking place because speakers are transferring their Spanish sibilant inventory to Basque and they are reducing the historical three-way contrast in Basque to a single sibilant. It should be noted here that before the interdental fricative /θ/ became part of the system, Spanish used to have a three-sibilant inventory that was very similar to the sibilant inventory found nowadays in some varieties of Basque, such as the variety of Basque spoken in Navarre [
2]. Nevertheless, Hualde (2010) claims that it is difficult to determine whether the sibilant merger started in a variety of Basque first and was then transferred to Spanish, or if the merger started in a variety of Spanish and was then transferred to Basque [
2]. Moreover, another possibility could be that the Basque sibilant system neutralized independently of language contact, as it happens with other complex sibilant systems [
27].
Jurado Noriega analyzes the sibilants in both Basque and Spanish in Donostialdea, a region in Guipuzcoa, and Bidasoa [
15]. The author measures the intensity, the second formant, the COG, the frequency cut-off, and the spectral peaks of the sibilants in production data from 24 participants, who fall in three groups: (G1) monolingual speakers of Spanish from outside the Basque Country (
n = 8), (G2) bilingual speakers whose first language (L1) is Basque (
n = 8), and (G3) bilingual speakers whose L1 is Spanish (
n = 8). Jurado Noriega offers an acoustic analysis of the apico-alveolar (<s>) and the lamino-alveolar (<z>) sibilants in Basque as produced by the bilingual groups G2 and G3, and an acoustic analysis of the Spanish sibilant as produced by all groups (G1–G3) [
15]. Regarding the COG, Jurado Noriega finds that the apico-alveolar sibilant in Basque has the lowest COG values, whereas the lamino-alveolar sibilant in Basque has the highest COG values of the Basque sibilants. Regarding the Spanish sibilant, Jurado Noriega finds that the COG values among informants from G2 (5566 Hz), i.e., bilingual informants whose L1 is Basque, fall between those for the Basque apico-alveolar (5362 Hz) and the lamino-alveolar sibilants (9602 Hz) [
15]. Informants from G3, on the other hand, i.e., bilingual informants whose L1 is Spanish, have very similar COG values for the Spanish sibilant (3945 Hz) and for the Basque apico-alveolar sibilant (4098 Hz). However, Jurado Noriega does not show whether the differences between the COG values of the sibilants are significant, as she does not present a statistical analysis [
15].
Regarding the effect of surrounding vowels on Basque sibilant production, Jurado Noriega finds that the vowel that follows the sibilant (front vowel/back vowel) and the position of the sibilant in the word (word initial/intervocalic) have an effect on the production of the sibilants [
15]. More specifically, the COG values are higher overall when the sibilant is followed by a front vowel than when it is followed by a back vowel, and the COG values are also higher among bilingual speakers when the sibilant is in intervocalic position than when it is in word initial position.
Ensunza Aldamizetxebarria explores whether the sibilant is palatalized after the high front vowel /i/ in the variety of Basque spoken in Gernika [
28]. More specifically, the author analyzes the sibilants produced by 63 speakers in six words that have the letter <z> preceded by the front high vowel /i/ (such as
haize ‘wind’). Ensunza Aldamizetxebarria classifies the sibilants categorically as apico-alveolar or prepalatal and finds that younger speakers pronounce an apico-alveolar sibilant [s̺] more often (86% of the time) than the prepalatal sibilant [ʃ] (14%), whereas older speakers produce the prepalatal sibilant (86%) more frequently than the apico-alveolar (14%) sibilant. The results also show that overall, female speakers produce the apico-alveolar sibilant more often (74%) than the prepalatal sibilant (26%). Likewise, male speakers use the apico-alveolar more frequently (57%) than the prepalatal sibilant (43%) . Ensunza Aldamizetxebarria finds that there are no differences overall in the production of the two sibilants after a front high vowel based on the school where the participants studied or the origin/linguistic background of their parents [
28].
3Despite the contribution of previous studies on Basque sibilants, most of them do not present inferential statistics in order to evaluate whether the reported differences are significant. Only Hualde [
2] and Iglesias Chaves et al. [
20] present inferential statistics, although they only analyze the production data of a single participant. The present study aimed to offer the first quantitative analysis of the COG of speakers between the ages of 20–33 years in their production of the three Basque sibilants in the village of Amorebieta-Etxano, a Biscayan town close to Gernika. More precisely, the present study aimed to analyze the merger of sibilants and to examine whether there is a relationship between the speakers’ language dominance and the merger. The present study also analyzed if the stress, i.e., whether the syllable that contains the sibilant is stressed or unstressed, and the sibilant position in the syllable and in the word have an effect on the production of the sibilant. The study answered the following research questions:
Are the apico-alveolar, the lamino-alveolar and the prepalatal sibilants merged in Amorebieta- Etxano? If they are, what is/are the resulting sibilant or sibilants?
Which factors condition the possible merger among the Basque sibilants in this community? More precisely, does the sibilant position in the syllable and in the word or the stress of the syllable containing the sibilant condition the merger? Does the language dominance of participants, either Basque or Spanish as determined by the BLP Questionnaire, affect the merger?