Next Article in Journal
Understanding Dialectal Variation in Contact Scenarios Through Dialectometry: Insights from Inner Asia Minor Greek
Next Article in Special Issue
The Acoustic Properties of Vowels in Foreigner-Directed Speech: Insights from Speech Directed at Foreign Domestic Helpers
Previous Article in Journal
Accentedness Perception in L2: An Investigation of Thai Monophthong Pronunciation of Chinese Students
Previous Article in Special Issue
An Acoustic–Phonetic Description of Hidatsa Vowels
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Acoustic Study of Romanian Stressed Vowels with Special Reference to Mid Central [ɨ] and [ə]

by
Daniel Recasens
1,* and
Fernando Sánchez-Miret
2,*
1
Departament de Filologia Catalana, Facultat de Filosofia i Lletres, Universitat Autònoma de Barcelona, 08193 Barcelona, Spain
2
Departamento de Lengua Española, Facultad de Filología, Universidad de Salamanca, 37008 Salamanca, Spain
*
Authors to whom correspondence should be addressed.
Languages 2025, 10(1), 12; https://doi.org/10.3390/languages10010012
Submission received: 26 September 2024 / Revised: 4 January 2025 / Accepted: 8 January 2025 / Published: 15 January 2025
(This article belongs to the Special Issue An Acoustic Analysis of Vowels)

Abstract

:
The present study is concerned with some aspects of the production of [ɨ] and [ə] in Romanian, i.e., their position within the vowel space, degree of acoustic variability and acoustic duration. To this end, acoustic data were collected for the Romanian stressed vowels [i e a o u ɨ ə] produced by six speakers in controlled consonantal context conditions and real and nonsense words. The formant frequency data reveal that [ɨ] and [ə] do not overlap along the F1 dimension, which may be ascribed to the need to set in contrast the two central vowels phonologically. Moreover, [ɨ] is clearly more variable in F2, and thus in vowel fronting, than schwa. Regarding segmental duration, [ɨ] is as short as high vowels and shorter than schwa, whose duration is comparable to that of mid vowels. The phonetic characteristics for stressed schwa in Romanian are in contrast with those for the same vowel in other world’s languages, in which it is highly variable and shorter than all or most peripheral vowels. This behaviour may be attributed to the existence of two central vowels, while suggesting that [ə] has a well-defined articulatory target in the Romanian language.

1. Introduction

The present study is an investigation into the acoustic vowel space defined in terms of the first three spectral formants and of the acoustic duration of the seven Romanian stressed vowels [i e a o u ɨ ə]. The Romanian language is endowed with seven stressed vowels, namely, five peripheral vowels and two central vowels: a high vowel [ɨ], which is represented by means of the graphemes î and â in writing, and a mid vowel [ə], which is represented with the grapheme ă. All of these vowels have phonemic status (Chitoran, 2001) with the inclusion of the high and mid central vowels as demonstrated by the following minimal pairs: [pɨr] onomatopeia/[pәr] ‘hair’, [vɨr] ‘I thrust’/[vәr] ‘cousin’, [rɨw] ‘river’/[rәw] ‘bad’ and [tsɨrj] ‘sea mackerels’/[tsərj] ‘lands’. Regarding other Romance languages, this central vowel scenario is similar to that of European Portuguese, in which, however, [ɨ] and [ə] (the latter vowel is often transcribed as [ɐ] in this language variety) are found in unstressed instead of in stressed syllables.
This acoustic study is especially relevant in view of the comparatively little attention that phoneticians have devoted to the only Romance language endowed with two stressed central vowels, and to the fact that Romanian is the only major Romance language that does not yet have an article in the Illustrations of the IPA series in the Journal of the International Phonetic Association. Only a few acoustic studies of Romanian vowels exist, and, as pointed out later in this section, they do not control for phonological context in a stringent manner, and do not report vowel duration values. These previous acoustic studies include that of Teodorescu (1985), which reports unnormalized formant frequency values for these seven vowels as produced in nonsense VCV sequences with the consonant [p] by three males, and Renwick (2012, pp. 142–147; 2014, pp. 114–115), which presents unnormalized formant frequency data for stressed [i e a o u ɨ ə] as produced in real words in various syllable structure and contextual consonant conditions by fourteen female speakers and three males.
Moreover, the present investigation complements with new acoustic data previous research on the phonetic characteristics of stressed schwa in the world’s languages that was carried out by the first author. The two stressed central vowels [ɨ] and [ə] occur in languages and dialects from other language families such as Battambang Khmer, Madurese, Sundanese, Cheju, Thai, Amharic and Northern Welsh. As shown subsequently, these languages and dialects exhibit vowel systems of a similar size and quality to that of Romanian, including the two central vowels, which are considered to be phonemic, or to exhibit a high frequency of occurrence, whenever they are in complementary distribution or their phonemic status is not mentioned in existing monographs: Battambang Khmer [i e a ɨ ə ɔ o u] and Wa Mon Khmer [i e ɛ a ɨ ə ɔ o u] (Huffman, 1967; Pinnow, 1979); Madurese [i ɛ a ɨ ə ɔ ɤ u] (Stevens, 1964); Sundanese [i e a ɨ ə o u] (Van Syoc, 1959); Cheju [i e æ a ɨ ə ʌ o u] (Yang et al., 2020); Thai [i e æ ɨ ə a ɔ o u] (Abramson, 1962); Amharic [i e a ɨ ə o u] (Leslau, 1995); and Northern Welsh [i e a ɨ ə o u] (Jones, 1984). Several phonetic aspects of the central vowels of Romanian and of these other languages and dialects will be subject to comparison in this and the following sections of the present study.
The research goals of the present investigation are as follows.
Firstly, we will focus on the placement of mid and central vowels in the vowel space, and, more specifically, on the degree of vowel height for the mid vowels [e] and [o], and the degree of vowel height and fronting for the central vowels [ɨ] and [ə].
Concerning stressed schwa and as reported for a considerable number of languages exhibiting this vowel in their vowel inventory (Recasens, 2021), it is expected that the first formant or F1 (which is inversely correlated with vowel height and thus is higher for low vs. high vowels) occurs ca. 500 Hz, and that the second formant or F2 (which is positively correlated with vowel fronting and thus is higher for front vs. back vowels) is found ca. 1500 Hz. These formant frequency values, i.e., 500 Hz for F1 and 1500 Hz for F2, correspond to a 17.5 cm long vocal tract with no tongue constriction (Fant, 1960) while, as pointed out to us by a reviewer, the F1/F2 ratio should also be 1 to 3 for other vocal tract lengths. As mentioned more explicitly below, the actual phonetic realization of stressed schwa is, however, quite variable when evaluated across different positional and contextual conditions, which accounts for the fact that this vowel has often been transcribed not only as [ə], but also as a mid back unrounded vowel, whether [ɤ] or [ʌ], in phonetic studies of other languages such as Thai, Wa, Chantyal, Scottish Gaelic and Bulgarian (see Recasens, 2021). Regarding Romanian, stressed schwa has been labelled as mid central [ə] (Chitoran, 2001) and mid back unrounded [ʌ] (Renwick, 2014), the issue being which one of these two IPA symbols is more appropriate in the light of formant frequency data for a representative number of subjects. As shown in Figure 1, the F1 and F2 frequencies for Romanian vowels reported by Teodorescu (1985) and Renwick (2014) reveal the following: the two peripheral mid vowels [e] and [o] are placed about halfway between [i u] and [a]; and [ɨ] and [ə] are lined up with the high and mid vowels, respectively, and placed mid way between the front and back vowels. Based on these formant frequency data, the most suitable IPA symbol for Romanian stressed schwa should probably be [ə], not the symbols [ɤ] or [ʌ], which, as pointed out above, are also used by scholars for transcribing the mid central vowel in the world’s languages. An analogous scenario, i.e., one in which [ɨ] and [ə] are roughly equidistant from the front and back vowels and occur at the high and mid regions of the vowel space, respectively, may be inferred from formant frequency data for most of the non-Romance languages listed above: Battambang Khmer long vowels (Wayland, 1998), Wa Mon Khmer (Svantesson, 1993; Watkins, 2002), Sundanese (Yusuf, 1993), Cheju (Cho et al., 2001), Thai (Abramson, 1962; Apiluck, 1996) and Amharic (Iwatsuki, 2012). Alternatively, the vowels in question lie closer to the back vowels than to the front vowels in Madurese (Cohn & Lockwood, 1994), while the opposite is true for Northern Welsh (Mayr & Davies, 2011).
In the light of the evidence from other languages endowed with stressed schwa that have just been described, and based on the position that this vowel occupies within the vowel space in Romanian, our concern is whether the mid central vowel in question ought to be transcribed as central ([ə]) or as mid back unrounded ([ʌ]). The argument provided in Renwick (2014, p. 13) for transcribing the mid central vowel with the symbol [ʌ] instead of with [ə] appears to be either only phonological or both phonetic and phonological. The phonetic aspect may be inferred by the use for stressed and unstressed schwa of not only the notation /ʌ/ (Renwick, 2012, 2014), but also [ʌ] (Renwick et al., 2016) and both /ʌ/ and [ʌ] (Renwick, 2021). Regarding the phonological aspect, the symbol [ʌ] would be preferable to [ə], since [ə] is usually used for reduced mid vowels (Maddieson, 1984, p. 252; Silverman, 2011), these vowels being often the end product of phonological neutralizations in prosodically weak positions, which is not what happens in Romanian, where the mid central vowel may function synchronically as a full vowel.
A second research issue concerns acoustic variability. A salient characteristic of stressed schwa in the world’s languages is its high degree of spectral variability as a function of segmental context, and therefore its high degree of sensitivity to coarticulatory effects in tongue and lip position exerted by its flanking consonants. In particular, the F2 frequency ranges for this vowel often exceed those for all or most other vowels in the language, as has been reported to be the case, for example, for Enganno, Iaai, Javanese, Khonoma Angami, Albanian, Arrernte and Kabardian, which is why a number of different context-dependent allophones have been identified for stressed schwa in some Salishan languages and also in Sharwa, Cheremis, Amharic and Macedonian (see Recasens, 2021 for details). The reason for this appears to be the lack of a well-defined constriction location for central vowels in general, and for [ə] in particular, which nevertheless appears to be specified for a mid tongue position (i.e., F1 values for this vowel rarely extend into the vowel spaces for high and low vowels), and, at least in the case of English, could even exhibit some postdorsal constriction at the pharynx rather than being fully unconstricted (Browman & Goldstein, 1992; Kondo, 1995; Gick, 2002). Moreover, these positive articulatory properties should hold for stressed schwa rather than for unstressed schwa, which is often the outcome of vowel reduction, and during whose production the tongue is likely to be placed in the breathing position, and thus roughly equidistant from all regions of the vocal tract.
The possibility that stressed central vowels including schwa may be produced with some active tongue configuration, and consequently turn out to be less variable than expected, should hold precisely in languages like Romanian, in which [ɨ] and [ə] have phonemic status. In other words, the need to avoid intelligibility problems could very well cause the two central vowels in question to exhibit different articulatory goals and a moderate degree of articulatory variability, at least in F1 frequency, and thus along the vowel height dimension. An aspect which is closely related to this need is functional load, i.e., phoneme pairs undergoing merging distinguish significantly fewer minimal pairs in the lexicon than those which avoid the merging process (Wedel et al., 2013; Tse, 2024). With regard to this issue, a look at the frequency of occurrence of the two Romanian central vowels reveals that [ɨ] is the least frequent of all vowels of the vowel system, followed by schwa (Renwick, 2014, p. 22), one of the reasons for this being the limited contextual origin of both stressed [ɨ] and [ə] in native words: [ə] comes from Latin /e/, after labials and the alveolar trill ([pəɾ] PILU ‘hair’, [rəw] REU ‘bad’), and [ɨ], probably through intermediate [ə], from /i/ after [r] ([ˈrɨpə] RIPA ‘ravine’, [ˈrɨde] RIDET ‘(s)he laughs’) and from non-back vowels before a nasal consonant ([ˈlɨnə] LANA ‘wool’, [kɨmp] CAMPU ‘field’, [ˈvɨnə] VENA ‘vein’) (Nandris, 1963). The two vowels may also be found in other consonantal contexts in loanwords from Hungarian and Turkish, and also from the Slavic languages, such as [sfɨnt] sfânt “saint, holy” and [ɡɨskə] gâscă “goose”. In spite of their limited contextual occurrence, the two vowels of interest are endowed with a relatively high morphological load (Chitoran, 2001): [ɨ] appears in common function words such as pronouns ([ɨl] ‘him’, [ɨmj] ‘to me, dative’), the preposition [ɨn] ‘in’ and the infinitive ending of fourth conjugation verbs ([uˈrɨ] urî ‘to hate’); and [ə] is the non-definite nominative/accusative ending of many feminine singular nouns ([ˈkasə] CASA ‘a house’), and may be also found in pronouns ([mə] ‘me, accusative’) and in verb paradigms. In view of these historical and synchronic scenarios, it may be hypothesized that the two Romanian vowels [ɨ] and [ə] are kept phonemically distinct, since they are involved in a number of minimal pairs and play a relevant morphological role.
Data on vowel dispersion for 14 female speakers reported by Renwick (2012) are consistent with this assumption. Indeed, they show that, when evaluated across syllable and consonant context conditions, Romanian [ɨ] and [ə] overlap very little along the F1 dimension, and are thus quite distinct from one another regarding vowel height, which appears to be in agreement with their independent phonemic status in the language. Also in Renwick (2012), on the other hand, these two vowels happen to be more variable along the F2 dimension than all five peripheral vowels [i e a o u], with their ranges amounting to ca. 1000–2000 Hz ([ɨ]) and 1200–1800 Hz ([ə]) in the case of female speakers, and ca. 1100–1750 Hz ([ɨ]) and 1100–1650 Hz ([ə]) in the case of male speakers. These figures also reveal a higher degree of F2 variability for [ɨ] than for [ə]. As revealed by vowel acoustic data for schwa from other languages such as English and Catalan, changes in F2 frequency for central vowels typically result from coarticulatory effects in tongue, jaw and lip position exerted by palatal and certain dentoalveolar consonants, which cause the vowel F2 to rise, vs. those exerted by labials, other alveolars (/r/ mostly if realized as a trill, and dark /l/) and possibly velars, which cause F2 to lower (Kondo, 1995; Recasens & Espinosa, 2006). Moreover, also according to Renwick (2012), perceptual identification data show that the two central vowels are not inherently more difficult to identify than the five peripheral vowels, with [ə] being more prone to confusion with [ɨ] than the reverse. All in all, even though they are highly variable in terms of fronting degree, the two central vowels appear to be well integrated in the Romanian vowel system, and perhaps more so [ɨ] than [ə]. Minimal overlap between stressed [ɨ] and [ə] has also been reported consistently in F1 (and thus in oral opening and tongue dorsum height), but not in F2 (and thus in tongue raising and fronting), for Battambang and Wa Mon Khmer, Madurese, Sundanese, Cheju (when adjacent to /t/, but not when flanked by /k/; Cho et al., 2001), Thai, Amharic (in the context of /d/; Iwatsuki, 2012) and Northern Welsh by the studies mentioned earlier in this section. In particular, the cross-language F1 frequency averages for stressed [ɨ] and [ə] provided by these studies amount to 414.7 Hz (sd = 49.6) for the former vowel and to 568.2 Hz (sd = 78.2) for the latter, which follows from [ə] being a lower vowel than [ɨ]; as for the F2 frequency data, the cross-language values are slightly higher for [ɨ] (1490.4 Hz, sd = 210.2) than for [ə] (1410.3 Hz, sd = 164.6).
It must be pointed out that this scenario is less valid for spontaneous speech than for laboratory speech. In contrast with laboratory speech conditions, it has been found that [ɨ] and [ə] may overlap more or less considerably along the F1 dimension in spontaneous speech, in which [ɨ] is often produced with a higher F1 and thus a more schwa-like realization than expected (Vasilescu et al., 2016). In line with previous experimental evidence (Wedel et al., 2018), it may be true that in laboratory speech tasks speakers hyperarticulate words with [ɨ] in order to distinguish them from competitors endowed with schwa and perhaps other peripheral vowels as well. Moreover, it appears that the F1 frequency, and thus the acoustic cue that better sets the two central vowels in contrast, is chosen for this purpose.
Another reason for carrying out an analysis of vowel dispersion and overlap in the present study stems from the fact that in the lexical items subject to analysis in Renwick (2012), the Romanian stressed vowels were often preceded and followed by consonants specified for different places of articulation, and, in addition, were produced in words differing in their number of syllables and endowed with various syllable structures (CV, VC, CVC, CCCVC, CVCC). Thus, for example, only [ɨ] and [ə] occurred in CV words. More importantly, regarding the former variable, contextual /r/ was over-represented in words with the two central vowel nuclei, but not in those with the five peripheral vowels. In sum, based on Renwick’s data, it is still not perfectly clear whether the two Romanian mid vowels, and especially schwa, are highly variable acoustically, which would be in line with other world’s languages, or else whether they exhibit a restricted acoustic variability, at least along the F1 frequency dimension, which would be determined by their phonological status.
A final goal of the present investigation is to compare the duration of the two central vowels [ɨ] and [ə] with those of the five peripheral vowels [i e a o u]. This appears to be a novel contribution insofar as no duration values for each of the seven stressed vowels of Romanian are reported in Teodorescu (1985) and Renwick (2012, 2014). It should be recalled in this respect that vowel duration increases with vowel opening, such that high vowels are shorter than mid vowels, and mid vowels shorter than low vowels (Lehiste, 1970, pp. 18–19). Moreover, vowel duration data for the world’s languages reported in Recasens (2021) reveal that stressed schwa is often the shortest of all stressed vowels, or as short as high vowels, which, as just mentioned, are typically shorter than lower vowels in vowel systems. It thus appears that the lack of a clear-cut constriction location, at least for schwa, may go hand in hand with a short vowel duration, which again is prone to occurring if the vowel is unstressed rather than if it is stressed. A partial exception appears to take place precisely in Romanian, in which, according to Estill (2010), vowel duration varies in the progression [i u] > [e o] > [a], and [ə] and [ɨ] are slightly shorter than [e o], but longer than [i u]. In view of Estill’s data, it may then turn out to be the case that, given their phonological characteristics, and in parallel to the corresponding formant frequency values, the two Romanian central vowels (or just one of them) are longer than expected because they are produced with some sort of articulatory target. Insofar as Estill’s study is merely exploratory (the only piece of information about the experimental setup provided in it is that the data were recorded from two Romanian newsreaders), a more controlled dataset is warranted in order to verify whether this prediction holds or not.
To summarize, by using a more controlled experimental setup than in previous studies, and given the existence and the phonological status of the two central vowels [ɨ] and [ə] in Romanian, the present study seeks to determine whether the following are true: (a) in contrast with other languages in which stressed schwa is the only stressed central vowel, Romanian stressed schwa has a vowel space of its own, exhibits less variation than reduced vowels and shows a similar duration to [e] and [o]; and (b) in parallel to other languages endowed with stressed [ɨ] and [ə], there is minimal overlap between these two central vowels. Moreover, the position occupied by stressed schwa relative to the other vowels in the vowel space should allow for ascertaining which is the most suitable IPA symbol for phonetically transcribing this Romanian vowel.

2. Materials and Methods

In order to explore the research issues identified in the Section 1, acoustic data were recorded and analyzed for the seven Romanian stressed vowels [i e a o u ɨ ə] in the contextual, word type and sentence conditions referred to below. In light of differences in vowel overlap between [ɨ] and [ə] depending on elicitation conditions reported in earlier studies, data for vowels embedded in meaningful and nonsense words were recorded and processed separately.
(a) Recordings were made of meaningful words in which the target vowel was preceded and followed by consonants exhibiting the same place of articulation (see Appendix A). There were four contextual conditions overall, i.e., labial/labiodental ([p b f v]), dental/alveolar ([t s]), palatoalveolar ([ʃ ʒ tʃ dʒ]) and alveolar lateral [l]. A partial exception was the palatoalveolar consonant condition, since due to lexical restrictions, the consonant placed on the other side of the vowel was [s] in the lexical items [ʃes], [tʃas] and [ʒos]; this other contextual setup should not differ considerably from the symmetrical one in view of the articulatory similarity between alveolars and palatoalveolars. On the other hand, the reason why [l] was included as a separate contextual group is because, in spite of being ‘clear’ in Romanian (and thus produced with a similar tongue fronting gesture to other dentals and alveolars), it may involve some tongue dorsum and jaw lowering and perhaps a slight tongue body retraction due to the laterality requirement involved (Lindblad & Lundqvist, 2003). Real words were uttered in meaningful sentences and conformed to the syllable structures CVC([ɾ])V (20 cases) and, to a much lesser extent, CVC(C) (6 cases) and CVCCV(C) (2 cases). The stressed vowel subject to analysis thus occurred in open CV syllables (20 cases) rather than in closed CVC syllables (8 cases). All subjects who took part in the recording procedure were familiar with the chosen regional words, which are available in the real word list reproduced in Appendix A. In order to render these words more familiar to the reader, they were presented in distinctive sentences; thus, for example, băbci appears in the sentence nu ştiu ce sunt băbci “I don’t know the meaning of the word băbci”, and sătă appears in the sentence se zicea şi sătă “sătă was said as well”.
(b) Nonsense CVC[ə] words with identical prevocalic and postvocalic consonants were also recorded in the meaningful sentence frame spunaşa “I say…so” (see Appendix B). There were six contextual conditions in this case, i.e., bilabial ([p]), dental ([t]), palatoalveolar ([ʃ]), velar ([k]), alveolar lateral ([l]) and alveolar rhotic ([r] word-initially, [ɾ] intervocalically). The reason why the alveolar rhotic was included as a separate contextual group is because, due to manner of articulation requirements, and mostly in the case of the trill, it may be articulated with a somewhat lowered tongue predorsum and blade and some tongue body retraction (see for Spanish rhotics, Proctor, 2009). It also needs to be said that Romanian velars are realized as purely velars before low and back vowels, and exhibit a (post)palatal realization before front vowels (Rosetti, 1967, p. 77). Some linguists consider that these fronted velars are allophones of the palatalized phonemes /kʲ/ and /ɡʲ/ (Vasiliu, 1989, p. 5). The subjects were informed that some of the words appearing in the nonsense word list could be equal or similar to existing words. Even though a small number of these words have a meaning in Romanian, they must have been treated as nonsense by the subjects who took part in the acoustic recordings, insofar as they were read together with all the other nonsense words in the sentence spunaşa.
The speech material was read by three males (THE, SEB, RAD) and three females (SIM, NIC, LIV). Subjects SIM and THE were 39 and 45 years old, respectively, and were born and have always lived in Bucharest. Subjects SEB and LIV were 55 and 54 years old, respectively, were born in Piatra Neamţ in the historical Romanian region of Moldova, have lived in Spain since the early 1990s and speak Romanian regularly as a couple and with their friends and family. NIC was 56 years old, was born in Iaşi (also placed in Moldova) and has been living in Spain since 2013. RAD was born in Buzău, which is located in the historical region of Muntenia, was 40 years old and lived in Romania until the age of 28. Both NIC and RAD used the Romanian language at work and with their family and friends. Sentences were recorded 5 times each at a 48 kHz sampling rate using an AKG PERCEPTION 200 microphone and a Marantz PMD660 recorder. Recordings were carried out in a quiet room at the Faculty of Philology of the University of Salamanca.
The four last repetitions of the meaningful and nonsense words were analyzed acoustically using Praat (Boersma & Weenink, 2024). For all vowel tokens, the following landmarks were used for identifying the vowel onset and offset times on 0–5 kHz spectrographic displays: the onset of voicing after the burst of the preceding stop and closure onset for the following stop; the high intensity vowel formant edges bordering the frication period for fricatives; the low intensity formants for [l], and also for [r] if realized as an approximant; and the short closure period whenever contextual [r] was implemented as an alveolar tap or as an alveolar trill. The segmentation criteria for contextual stops and fricatives were used in order to identify the onset and offset times for vowels in the context of affricate consonants. After segmentation, the first author measured the F1, F2 and F3 frequency values of the target vowels at their midpoint by placing the cursor manually in the middle of the vowel. The Praat LPC formant tracker was used to assist in formant measurement in specific cases: whenever formant displays were visually unclear; whenever two formants came close to each other, as in the case of F1 and F2 for [a], and F2 and F3 for [i]; and whenever F1 for high vowels approached the voicing bar. The formant tracking procedure was set up for six formants and a 0.015 ms window length.
The formant frequency data were subjected to the Lobanov vowel normalization method in order to compare vowel formant frequencies across speakers (see Adank et al., 2004 for details). This transformation was carried out because it is known that absolute formant frequency values are influenced by individual differences in a speaker’s vocal anatomy. The Lobanov method has been characterized as vowel-extrinsic, since it uses acoustic information distributed across more than one vowel category of a given speaker, i.e., the overall mean formant frequencies across all vowels for that speaker, as a correction factor for the frequencies of the individual vowels. It is computed separately for each formant and speaker using the formula Fn[V]N = (Fn[V]−X̅n)/σn, where Fn[V]N is the normalized value for the formant n of vowel V, and X̅n and σn are the mean and standard deviation values for formant n for the speaker in question.
Several statistical tests were carried out in order to evaluate whether the two central vowels [ɨ] and [ə] differed significantly in F1 and F2 frequency, and also the extent to which duration differences among the seven Romanian vowels achieved statistical significance. For this purpose, main effects and vowel × consonant interactions for the formant frequency data, and main effects for the duration data, were computed by means of a linear mixed model (LMM) analysis, with vowel and contextual consonant as fixed factors and subject as a random factor, using SAS 9.4 (SAS Institute Inc., Cary, NC, USA). Separate tests were carried out for the meaningful and nonsense word conditions. Tukey-Kramer post-hoc tests were used for determining whether pairwise comparisons between contextual consonants were significant or not. In all statistical tests, the significance level was set at p < 0.05.

3. Results

3.1. Formant Frequency

Figure 2 shows the normalized F1 × F2 frequency values across speakers for Romanian [i e a o u ɨ ə] produced in meaningful words (left graph) and in nonsense words (right graph). The formant frequency values located inside the vowel ellipses correspond to the different consonant context conditions (four for meaningful words, six for nonsense words). In Figure 3 and Figure 4, the unnormalized F1 × F2 values for all vowel tokens are plotted separately for each speaker. Each ellipse includes 16 data values for the meaningful words (four contextual consonants × four repetitions) and 24 data values for the nonsense words (six contextual consonants × four repetitions).
According to Figure 2, the two peripheral mid vowels are generally placed midway between the high and low vowels, and somewhat closer to the former than to the latter in the case of the nonsense word condition. As shown by Figure 3 and Figure 4, this scenario is available for all speakers (nonsense word condition) and for subjects THE, SEB and SIM (meaningful word condition), and consequently appears to be consistent with the use of the symbols [e] and [o], instead of [ɛ] and [ɔ], for the transcription of the two Romanian mid vowels (Dascălu, 1971; Vasiliu, 1989, p. 1; Renwick, 2021). On the other hand, the three figures also show that the central vowels [ɨ] and [ə] are roughly equidistant between the front and back vowels, and placed at the same height level as high vowels in the case of [ɨ], and as mid vowels for [ə]. Moreover, the F1 × F2 data plotted in Figure 2, and for most speakers in Figure 3 and Figure 4, also reveal that these two central vowels, and perhaps schwa more than [ɨ], may lie somewhat closer to the back vowels [u] and [o] than to the front cognates [i] and [e] along the F2 dimension. It is indeed often the case that the ellipses for both [ɨ] and [ə] approach very closely those for [u] and [o], and may intersect with them, but may intersect barely or not at all with the ellipses for [i] and [e]. This turns out to be the case for subjects THE, SIM, RAD and LIV (meaningful words), and for THE, SIM, SEB and RAD (nonsense words). Insofar as the F2 frequency for front and central vowels is positively associated with tongue body fronting, these F2 data could be indicative that [ɨ] and [ə] are produced with a somewhat retracted tongue body configuration. These data differ from those reported by Teodorescu (1985) and Renwick (2012) in that [e] and [o] are somewhat higher, and [ə] somewhat more retracted, in our study than in theirs. Our acoustic data confirm that the stressed mid vowels are mostly mid high in Romanian, while suggesting that the stressed mid central vowel could be transcribed with the IPA symbol [ə], though [ʌ] may not be entirely discarded in view of the fact that, while being essentially central, the vowel in question may approach the mid back vowel in the vowel space.
Vowel dispersion as a function of speaker and word type may be evaluated based on the size and orientation of the ellipses displayed in Figure 2, Figure 3 and Figure 4, and the standard deviation values across vowel tokens plotted separately for F1 and F2 in Figure 5. Regarding the five peripheral vowels, there is a trend, which has also been observed to a greater or lesser extent in other languages (English: Hillenbrand et al., 2001; Swedish: Fant, 1973; Catalan: Recasens & Espinosa, 2006), for back vowels to vary mostly in F2 frequency, for low vowels to vary in F1 rather than in F2, and for front vowels to vary both in F1 and F2. These general trends are often apparent for the cross-speaker data plotted in Figure 2. Also, in accordance with previous studies, the ellipses appearing in this figure show that the front vowels [i] and [e] are less variable than the other vowels [a], [o] and [u], this being the case for the meaningful words but not for the nonsense words. It appears, then, that vowels coarticulate less with the contextual consonants, and consequently approach their articulatory target more closely, in the latter word condition than in the former one. In line with these vowel-dependent patterns of formant frequency variability, the data for the individual speakers displayed in Figure 3 and Figure 4, and for subjects THE, SEB, RAD and LIV (meaningful and nonsense word conditions) in particular, show that overlap between those peripheral vowels which are adjacent in the vowel space is greater for back vowels than for front vowels: while the ellipses for [i] and [e] are kept separate, those for [u] and [o] and for [o] and [a] come closer together, and may exhibit some superposition (see Martinet, 1952, p. 24 regarding the same difference in the margins of security between the mid vowels of the front and back series).
The main consonant-dependent effects for the five peripheral vowels observable in Figure 2 are as follows. Regarding back vowels, the maximal F2 values correspond to palatoalveolar consonants, which cause the tongue body and lips for [u], and to a lesser extent for [o], to front and unround, respectively, and the minimal F2 values correspond to adjacent labial and velar consonants, which do not trigger those coarticulatory effects during back vowels. In particular, the cross-speaker means for the palatoalveolar and labial contexts in the real word condition amount to 1315.13 Hz and 798.42 Hz for [u] and to 1277.29 Hz and 967.33 for [o] (unnnormalized data), and to −0.45 and −1.65 for [u] and to −0.64 and −1.32 for [o] (Lobanov normalized data). As for the low vowel [a], F1 may be lower or higher, respectively, depending on whether the adjacent consonant is articulated by means of a high tongue body and jaw position (palatoalveolars), or a relatively low tongue body and jaw position (the alveolar lateral [l]). The actual cross-speaker means for the real word condition are 849.88 Hz for [a] next to [l], and 756.79 Hz for [a] next to palatoalveolars, with the corresponding normalized values being 2.43 and 1.71. In spite of it being clear in Romanian, the alveolar lateral should be produced with a lower jaw than other consonants, due to the requirement during the production of laterals for the tongue front to be lowered to allow the passage of air through the sides of the mouth (Mooshammer et al., 2007, and see Section 2). As for [i] and [e], F2 lowers and F1 rises when they are flanked by consonants such as [l] and [r], which exhibit a primary or secondary constriction at the back of the vocal tract (see Section 1). This is in agreement with Dascălu (1971), who observes a higher F1 for [e] in contact with a syllable-final rhotic than with other Romanian consonants. Thus, for example, in the meaningful word data from our study, [i] exhibits higher F1 and lower F2 frequencies across speakers when it is next to [l] than to the other consonants: 412.6 Hz and 2323.17 Hz (unnormalized values), and −0.96 and 1.68 (normalized values).
Shifting now to the two central vowels, which are of special concern in the present investigation, the ellipses displayed in Figure 2 indicate a more variable realization for [ɨ] than for [ə] with regard to the F2 frequency and thus vowel fronting degree. According to the F1 × F2 data for the individual speakers plotted in Figure 3 and Figure 4, and, most clearly, to the standard deviation values presented in Figure 5, this turns out to be the case for all speakers and word conditions, with only two exceptions, i.e., speaker SEB (meaningful words) and speaker RAD (nonsense words). Moreover, for those same speakers, F2 for [ɨ] may be as variable as F2 for the highly variable vowel [u], or else more variable than F2 for any of the five peripheral vowels. As for the stressed vowel [ə], formant frequency variability is quite comparable to that of the peripheral vowels. Thus, for example, focusing on the standard deviation values for F2 (see Figure 5, meaningful and nonsense words), it may be seen that [ə] is often less variable than [ɨ] and [u] and as variable as mid and low vowels. Moreover, judging from the orientation of the ellipses observable in Figure 2, Figure 3 and Figure 4, this vowel happens to be more variable in F1 frequency than in F2 frequency, and thus more variable in vowel height than in vowel fronting, for some speakers (THE, SIM and NIC, meaningful words; THE, SIM, NIC and LIV, nonsense words), whereas the opposite is true for other subjects (SEB, RAD and LIV, meaningful words; RAD, nonsense words). In sum, there is a trend for [ɨ] to be more variable than [ə] and the other peripheral vowels in F2 frequency, and for [ə] to exhibit a comparable degree of contextual variability to that of the peripheral mid and low vowels.
Most importantly, the ellipses displayed in Figure 2, Figure 3 and Figure 4 reveal that there is very little overlap in the F1 dimension, if any, between the vowel spaces for [ɨ] and [ə], irrespective of word type and subject, thus suggesting that speakers avoid these two vowels from being confused in production. In particular, the real words show no overlap for THE, SIM, RAD and LIV and practically no overlap for NIC, while regarding nonsense words, there is essentially no overlap in the case of SIM and NIC, practically no overlap for SEB, RAD and LIV, and some more overlap for THE. Consistently with this observation, the F1 frequency values turned out to be significantly higher for [ə] than for [ɨ] for the two word type conditions, i.e., meaningful (F(1, 35) = 325.77, p < 0.001) and nonsense (F(1, 35) = 13.44, p < 0.001), while the corresponding differences in F2 achieved significance for the nonsense words (F(3, 35) = 48.86, p < 0.001) but not for the real words. Regarding superposition degree between the ellipses for the central and peripheral vowels, and as already pointed out above, Figure 2, Figure 3 and Figure 4 reveal that some overlap may occur between [ɨ] and [u], on the one hand, and between [ə] and [o] and [a], on the other hand, and therefore between central and back and low vowels, but essentially not between either of the two central vowels and the two front vowels [i] and [e].
Consonant-dependent formant frequency differences achieved significance for F2 (p < 0.001) but not for F1 in the case of both [ɨ] and [ə] and the two word conditions, thus indicating that acoustic changes as a function of the contexual consonants occur in vowel fronting rather than in vowel height. In particular, significant F2 differences for [ɨ] were found to vary with consonant type in the progression palatoalveolar > dental/alveolar, [l] > labial (meaningful words), and palatoalveolar, dental, [l] > [r] > (palato)velar > labial (nonsense words). Focusing on the meaningful word condition, the cross-speaker F2 frequency values for [ɨ] parallel those for [u], in that F2 is lowest when the vowel occurs next to labial consonants (1252.33 Hz for the unnormalized data, −0.67 for the normalized data) and highest when adjacent to palatoalveolar consonants (1803.21 Hz, 0.56), with the latter contextual trend being associated with the tongue body fronting and unrounding coarticulatory effects exerted by the consonant on the vowel. As for the stressed vowel [ə], consonant-dependent effects on F2 frequency turned out to be significantly lower for labials than for all the other consonants. Descriptive data for the meaningful word condition indicate that, similarly to the vowel [a], F1 is highest in the adjacency of [l] (650.50 Hz, 0.85), and F2 rises most in the adjacency of palatoalveolars (1563.29 Hz, 0.01).
An additional piece of information is provided in Figure 6 showing the normalized F2 × F3 frequency values across speakers for all Romanian vowels produced in meaningful words (left graph) and in nonsense words (right graph). The F2 × F3 data values appearing inside the ellipses correspond to the different consonant context conditions (four for meaningful words, six for nonsense words). The figure shows that, except for [i] and, to a lesser extent, [e], there are no clear-cut F3 differences between vowels, which means that [ɨ] and [ə] may be set in contrast acoustically based on the F1 and F2 frequency values alone.

3.2. Duration

The linear mixed model statistical test run on the vowel duration data yielded a main vowel effect for the two word type conditions (meaningful words: F(6, 153) = 15.62, p < 0.001; nonsense words: F(6, 235) = 33.99, p < 0.001). Regarding the five peripheral vowels, the duration data across speakers, displayed in Figure 7, and the results for the post hoc tests for the nonsense words, shown in Figure 8 (right panel), indicate differences in vowel duration proceeding in the progression [a] > [e o] > [i u]. Therefore, as expected, peripheral vowels shorten as they become higher in the vowel space. Smaller duration differences among vowels hold for the meaningful words, which explains why the left panel of Figure 8 exhibits less significant effects than the right panel. Differences in vowel duration between high, mid high and low vowels may also be seen for the individual speakers in Figure 9.
Regarding the two central vowels, two major findings come to our attention (see Figure 7 and Figure 9): [ə] is longer than [ɨ] for all speakers and word type conditions, with a single exception (speaker LIV, meaningful words); and schwa duration is comparable to the duration of the peripheral mid vowels, and [ɨ] is not longer than high vowels. Therefore, in contrast with other world languages, Romanian schwa turns out to not be shorter than all or most of the other vowels in the language, while [ɨ] may be characterized as a short vowel. The statistical results (see Figure 8) are in accordance with the descriptive data in the case of the nonsense word condition, insofar as [ɨ] differs from [e a o], but not from [i u], while [ə] differs from high and low vowels, but not from mid ones. Also, the durational contrast between [ə] and [ɨ] (for [ə] > [ɨ]) achieved significance for the nonsense words, but not for the meaningful words.

4. Discussion

The acoustic data for the Romanian stressed vowels [i e a o u ɨ ə] produced by six speakers reveal that [e] and [o] are roughly midway between the high and low vowels, though they are closer to [i] and [u] than to [a]. As for the two central vowels, while [ɨ] may be characterized as high central and [ə] as mid central, both [ɨ] and [ə] (and [ə] more than [ɨ]) may occur closer to the back vowels than to the front vowels in the vowel space. The latter finding is in accordance with other world’s languages whose stressed schwa is also central but leaning towards the space occupied by [ʌ]. Regarding phonetic transcription, and based solely on phonetic considerations, [ə] appears to be the most suitable IPA symbol for Romanian stressed schwa, since retracted [ʌ]-like realizations of this vowel appear to occur in specific consonantal contexts only. In any case, the symbol [ʌ] used in Renwick (2012, 2014, 2021) cannot be entirely rejected until data from more speakers are taken into consideration. It should be mentioned, in this respect, that both the symbols [ə] and [ɤ] or [ʌ] have been used in the literature for the phonetic representation of the stressed mid central vowel in phonetic studies of other languages and dialects. The precise nature of the articulatory configuration during the production of the two Romanian central vowels should be ascertained with data on tongue configuration recorded with ultrasound or MRI (magnetic resonance imaging). Apparently, in the spirit of Renwick, another reason for not rejecting the symbol [ʌ], which is of no concern to the present study, is for purely phonological reasons, in view of the phonemic status of this vowel in Romanain (see our comments on this issue in Section 1).
Our data agree with Renwick (2012) in showing that there is often a more variable F2 frequency for [ɨ] than for [ə] and for the peripheral vowels, which, analogously to the back rounded vowels, is mostly associated with the tongue fronting and unrounding coarticulatory effects exerted by palatoalveolar and palatal consonants on the high central vowel. Stressed schwa, on the other hand, turns out to be generally more variable in vowel opening (F1) than in vowel fronting (F2), and may thus be somewhat higher or lower, with, analogously to the low vowel [a], the highest F1 occurring next to [l r], and the highest F2 next to palatals and palatoalveolars. This scenario contrasts with that of other languages with stressed schwa and no [ɨ], showing that this vowel is more variable acoustically than all or most peripheral vowels, while suggesting that it has an articulatory target in Romanian, presumably since it must be distinguished from [ɨ]. Another relevant finding is that mid, back and low vowels were more variable and thus exposed to greater consonant-to-vowel coarticulatory effects, in the case of the meaningful vs. the nonsense word condition.
Of special relevance to the relationship between phonetics and phonology is the absence or minimal presence of F1 overlap between the vowel spaces for [ɨ] and [ə] for those speakers whose acoustic data were subjected to analysis in the present study, which may nevertheless intersect to a greater or lesser extent with the vowel spaces of neighbouring peripheral back vowels. The validity of this finding, which has also been reported to occur for other languages endowed with [ɨ] and [ə], supports the view that overlap in production is avoided because the two vowels have a distinctive phonological status in the Romanian language. As found by Vasilescu et al. (2016), while holding for the laboratory speech condition, this clear-cut acoustic distinction does not necessarily operate in spontaneous speech, where the two mid vowels are not hyperarticulated.
The duration data for the two Romanian central vowels reported in the present study reveal that [ɨ] is shorter than [ə], the two vowel durations being in the range of the high and mid vowels, respectively. This difference in duration, which is in contrast with other languages with stressed schwa, is in accordance with the difference in formant frequency variability reported above, insofar as the vowel showing the greatest variability (i.e., [ɨ]) is also the shorter one of the two, while suggesting, at the same time, that the tongue body configuration for stressed schwa may be subject to active articulatory control by Romanian speakers. Moreover, duration differences between the central and peripheral vowels, and also between [ɨ] and [ə], turned out to be greater in the nonsense vs. the meaningful word condition (see Figure 7).
To summarize, the data for the two Romanian central vowels reveal that [ɨ] is more variable and shorter than schwa, while schwa differs from other languages in being longer and not too variable. A reason for this finding may be the need to set in contrast the two central vowels phonologically. It is important to stress that even though the high central vowel [ɨ] has a marginal occurrence status in Romanian, it appears to be phonetically different from all the other Romanian vowels. As for stressed schwa, its formant frequency and duration characteristics suggest that it is articulated not only with a mid tongue position (and thus is neither too high nor too low), but also with some tongue body approximation towards the back of the vocal tract.
The acoustic characteristics of Romanian vowels differed as a function of word type condition: generally speaking, they were shorter, less contrastive (both regarding vowel quality and duration) and spectrally more disperse, and thus more sensitive to consonant coarticulatory effects, in meaningful vs. nonsense words, presumably because they were more hyperarticulated in the latter word type condition than in the former one. Moreover, it is worth noticing that, while the central vowels [ɨ] and [ə] turned out to be closer to each other in duration and more spectrally variable in meaningful vs. nonsense words, they did not exhibit greater overlap in F1 frequency in the former vs. the latter word type condition. Indeed, the statistical results showed significant F1 differences between [ɨ] and [ə] when produced both in meaningful words and in nonsense words. This piece of evidence may be attributed again to the need to set in contrast the two central vowels on the part of the Romanian speakers.

Author Contributions

Conceptualization, D.R. and F.S.-M.; methodology, D.R. and F.S.-M.; software, F.S.-M.; validation, D.R.; formal analysis, D.R.; investigation, D.R. and F.S.-M.; resources, D.R. and F.S.-M.; data curation, D.R. and F.S.-M.; writing—original draft preparation, D.R.; writing—review and editing, D.R. and F.S.-M.; visualization, D.R. and F.S.-M.; supervision, D.R.; project administration, D.R.; funding acquisition, D.R. and F.S.-M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by project PGC2018-096877-B-I00 of the Spanish Ministry of Science, Innovation and Universities.

Institutional Review Board Statement

Ethical review and approval were waived for this study due to only involving acoustic recording of the speakers’ voices.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data supporting reported results can be obtained on demand from the authors.

Acknowledgments

We would like to acknowledge the Servei d’Estadística of the Universitat Autònoma de Barcelona for help with the statistical analysis.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

The list of meaningful sentences with Romanian orthographic representations and English translations. The words containing the stressed vowels subject to analysis are given in phonetic transcription in the left column.
Labial context
[ˈpipə]tutun pentru pipă‘pipe tobacco’
[ˈfebrə]ai puţină febră‘you have a little fever’
[ˈbabə]a murit o babă‘an old woman died’
[ˈpopa]vorbesc cu popa‘I am talking to the priest’
[ˈbuba]azi s-a spart buba‘today everything was revealed’
[ˈbɨvə]s-a apucat de bâvă‘(s)he insists in denying the fact’
[bəbtʃ] nu ştiu ce sunt băbci‘I don’t know the meaning of the word băbci’
(băbci: a regional word meaning ‘hand anvil’ (pl.))
Dental/alveolar context
[ˈsitə]vede că prin sită‘(s)he has a blurry vision’
[ˈtest]medicii fac un test‘doctors do a test’
[ˈtata]nu vorbesc cu tata‘I am not talking to my dad’
[ˈsotsul]a venit cu soţul‘she came with her husband’
[ˈtuse]o criză de tuse‘a coughing attack’
[ˈtɨta]se zicea şi tâta‘tâta was said as well’ (tâta: a short form of the
indefinite pronoun atâta)
[ˈsətə]se zicea şi sătă‘sătă was said as well’ (sătă: a regional word
meaning ‘sieve’)
Palatoalveolar context
[ˈʒiʒie]a curge jijie‘to have a runny nose’
[ʃes]păştea pe un câmp şes‘it was grazing on a flat field’
[ˈtʃasul]merge cu ceasul‘it works out perfectly’
[ʒos]a lăsa ochii în jos‘to keep his eyes down’
[ˈdʒuʃkə]Constantina Giuşcă‘Constantina Giuşcă’ (person name)
[ʒɨtʃ]pluralul e jâci‘the plural is jâci’ (jâci: a regional word meaning
‘young and strong-willed’ (pl.))
[ʃəʃ]nu ştiu cuvântul şăş‘I don’t know the word şăş’ (şăş: a regional
word meaning ‘support’)
Alveolar lateral context
[ˈlilie]plantă de lilie‘a lily plant’
[ˈlela]am umblat lela‘I have been wandering’
[ˈlalə]eu îi zic lală‘I call her lală’ (lală: a regional word used by young
girls to talk respectfully to or about an older sister)
[ˈlolə]eu îi zic lolă‘I call her lolă’ (lolă: a regional word to designate an
older or an unkempt and poorly dressed woman)
[ˈlulə]tutun de lulă‘pipe tobacco’ (lulă: a regional word meaning ‘pipe’)
[ˈlɨlə]o femeie lâlă‘a lazy woman’
[ˈlələ]spun cuvântul lălă‘I say the word lălă’ (lălă: a regional word
expressing physical or mental pain).

Appendix B

The list of nonsense words in phonetic transcription and with Romanian orthographic representations.
Labial context: [ˈpipə] pipă, [ˈpepə] pepă, [ˈpapə] papă, [ˈpopə] popă, [ˈpupə] pupă, [ˈpɨpə] pâpă, [ˈpəpə] păpă.
Dental context: [ˈtitə] tită, [ˈtetə] tetă, [ˈtatə] tată, [ˈtotə] totă, [ˈtutə] tută, [ˈtɨtə] tâtă, [ˈtətə] tătă.
Palatoalveolar context: [ˈʃiʃə] şişă, [ˈʃeʃə] şeşă, [ˈʃaʃə] şaşă, [ˈʃoʃə] şoşă, [ˈʃuʃə] şuşă, [ˈʃɨʃə] şâşă, [ˈʃəʃə] şăşă.
(Palato)velar context: [ˈkikə] chică, [ˈkekə] checă, [ˈkakə] cacă, [ˈkokə] cocă, [ˈkukə] cucă, [ˈkɨkə] câcă, [ˈkəkə] căcă.
Alveolar lateral context: [ˈlilə] lilă, [ˈlelə] lelă, [ˈlalə] lală, [ˈlolə] lolă, [ˈlulə] lulă, [ˈlɨlə] lâlă, [ˈlələ] lălă.
Alveolar rhotic context: [ˈriɾə] riră, [ˈreɾə] reră, [ˈraɾə] rară, [ˈroɾə] roră, [ˈruɾə] rură, [ˈrɨɾə] râră, [ˈrəɾə] rără.

References

  1. Abramson, A. S. (1962). The vowels and tones of standard Thai. Research Center in Anthropology, Folklore, and Linguistics, Indiana University. [Google Scholar]
  2. Adank, P., Smits, R., & van Hout, R. (2004). A comparison of vowel normalization procedures for language variation research. The Journal of the Acoustical Society of America, 116, 3099–3107. [Google Scholar] [CrossRef] [PubMed]
  3. Apiluck, T. (1996). The mid central vowel [ə] in Thai. Mon-Khmer Studies Journal, 26, 65–77. [Google Scholar]
  4. Boersma, P., & Weenink, D. (2024). Praat: Doing phonetics by computer. (Version 6.4.13). University of Amsterdam. [Google Scholar]
  5. Browman, C. P., & Goldstein, L. (1992). Articulatory phonology: An overview. Phonetica, 49, 222–234. [Google Scholar] [CrossRef]
  6. Chitoran, I. (2001). The phonology of Romanian. Mouton de Gruyter. [Google Scholar]
  7. Cho, T., Jun, S.-A., Jung, S.-C., & Ladefoged, P. (2001). Vowels of Cheju. Korean Journal of Linguistics, 26, 801–819. [Google Scholar]
  8. Cohn, A. C., & Lockwood, K. (1994). A phonetic description of Madurese and its phonological implications. Working Papers Cornell Phonetics Laboratory, 9, 67–92. [Google Scholar]
  9. Dascălu, L. (1971). Cîteva variante ale fonemului /e/ din punct de vedere acustic. Fonetică şi Dialectologie, 7, 87–93. [Google Scholar]
  10. Estill, D. (2010). The enigmatic central vowel in Moksha. How central, how reduced. In S. Werner, & T. Kinnunen (Eds.), XXVI Fonetiikan päivät (pp. 33–37). Itä-Suomen yliopisto. [Google Scholar]
  11. Fant, G. (1960). Acoustic theory of speech production. Mouton. [Google Scholar]
  12. Fant, G. (1973). Acoustic description and classification of phonemic units. In Speech sound and features (pp. 32–83). The MIT Press. [Google Scholar]
  13. Gick, B. (2002). An X-ray investigation of pharyngeal constriction in American English schwa. Phonetica, 59, 38–48. [Google Scholar] [CrossRef] [PubMed]
  14. Hillenbrand, J., Clark, M. J., & Nearey, T. M. (2001). Effects of consonant environment on vowel formant patterns. Journal of the Acoustical Society of America, 109, 748–763. [Google Scholar] [CrossRef] [PubMed]
  15. Huffman, F. (1967). Outline of Cambodian grammar [Ph.D. dissertation, Cornell University]. [Google Scholar]
  16. Iwatsuki, S. (2012). An acoustic analysis of Amharic vowels. Studies in Ethiopian Languages, 1, 19–31. [Google Scholar]
  17. Jones, G. E. (1984). The distinctive vowels and consonants of Welsh. In M. J. Ball, & G. E. Jones (Eds.), Welsh phonology: Selected readings (pp. 40–64). University of Wales Press. [Google Scholar]
  18. Kondo, Y. (1995). Production of schwa by Japanese speakers of English: A crosslinguistic study of coarticulatory strategies [Ph.D. dissertation, University of Edinburgh]. [Google Scholar]
  19. Lehiste, I. (1970). Suprasegmentals. The MIT Press. [Google Scholar]
  20. Leslau, W. (1995). Reference grammar of Amharic. Otto Harrassowitz. [Google Scholar]
  21. Lindblad, P., & Lundqvist, S. (2003). [l] tends to be velarized, apical as opposed to laminal, and produced with a low jaw, and these features are connected. In M. J. Solé, D. Recasens, & J. Romero (Eds.), Proceedings of the 15th International Congress of Phonetic Sciences (pp. 1899–1902). Causal Productions. [Google Scholar]
  22. Maddieson, I. (1984). Patterns of sounds. Cambridge University Press. [Google Scholar]
  23. Martinet, A. (1952). Function, structure, and sound change. Word, 8, 1–32. [Google Scholar] [CrossRef]
  24. Mayr, R., & Davies, H. (2011). A cross-dialectal acoustic study of the monophthongs and diphthongs of Welsh. Journal of the International Phonetic Association, 41, 1–25. [Google Scholar] [CrossRef]
  25. Mooshammer, C., Hoole, P., & Geumann, A. (2007). Jaw and order. Language and Speech, 50, 145–176. [Google Scholar] [CrossRef] [PubMed]
  26. Nandris, O. (1963). Phonétique historique du roumain. Klincksieck. [Google Scholar]
  27. Pinnow, H.-J. (1979). Reflections on the history of the Khmer phonemic system. Mon-Khmer Studies, 8, 103–130. [Google Scholar]
  28. Proctor, M. I. (2009). Gestural characterization of a phonological class: The liquids [Ph.D. dissertation, Yale University]. [Google Scholar]
  29. Recasens, D. (2021). Acoustic characteristics and placement within vowel space of full schwa in the world’s languages: A survey. Journal of the International Phonetic Association, 52, 59–94. [Google Scholar] [CrossRef]
  30. Recasens, D., & Espinosa, A. (2006). Dispersion and variability of Catalan vowels. Speech Communication, 48, 645–666. [Google Scholar] [CrossRef]
  31. Renwick, M. E. L. (2012). Vowels of Romanian: Historical, phonological and phonetic studies [Ph.D. dissertation, Cornell University]. Available online: http://conf.ling.cornell.edu/peggy/Renwick_2012_Vowels-of-Romanian.pdf (accessed on 1 January 2021).
  32. Renwick, M. E. L. (2014). The phonetics and phonology of contrast. The case of the Romanian vowel system. Walter de Gruyter. [Google Scholar]
  33. Renwick, M. E. L. (2021). Romanian. In C. Gabriel, R. Gess, & T. Meisenburg (Eds.), Manual of Romance phonetics and phonology (pp. 531–558). De Gruyter. [Google Scholar]
  34. Renwick, M. E. L., Vasilescu, I., Dutrey, C., Lamel, L., & Vieru, B. (2016). Marginal contrast among Romanian vowels: Evidence from ASR and functional load. In Interspeech 2016 (pp. 2433–2437). ISCA. [Google Scholar]
  35. Rosetti, A. (1967). Introducere în fonetică. Editura Ştiinţifică. [Google Scholar]
  36. Silverman, D. (2011). Schwa. In M. van Oostendorp, C. J. Ewen, E. V. Hume, & K. Rice (Eds.), The Blackwell companion to phonology (pp. 628–642). Blackwell-Wiley. [Google Scholar]
  37. Stevens, A. M. (1964). Madurese phonology and morphology [Ph.D. dissertation, Yale University]. [Google Scholar]
  38. Svantesson, J.-O. (1993). Phonetic correlates of register in Paraok. Reports from Uppsala University, Department of Linguistics, 23, 102–106. [Google Scholar]
  39. Teodorescu, M. (1985). Descrierea acustică a vocalelor din limba română literară (An acoustic description of the vowels of Literary Romanian). Studii şi Cercetări Lingvistice, 36, 463–477. [Google Scholar]
  40. Tse, H. (2024). Functional load and vowel merger in Toronto Heritage Cantonese. In R. Rao (Ed.), The phonetics and phonology of heritage languages (pp. 280–301). Cambridge University Press. [Google Scholar]
  41. Van Syoc, W. B. (1959). The phonoogy and morphology of the Sundanese language [Ph.D. dissertation, University of Michigan]. [Google Scholar]
  42. Vasilescu, I., Renwick, M. E. L., Dutrey, C., Lamel, L., & Vieru, B. (2016). Réalisation phonétique et contraste phonologique marginal: Une étude automatique des voyelles du roumain. In Actes de la conférence conjointe JEP-TALN-RECITAL (Volume 1, pp. 597–606). AFCP-ATALA. [Google Scholar]
  43. Vasiliu, E. (1989). Roumain: Phonétique et phonématique. In G. Holtus, M. Metzeltin, & C. Schmitt (Eds.), Lexikon der Romanistischen linguistik. Band III: Die einzelnen romanischen Sprachen und Sprachgebiete von der Renaissance bis zur Gegenwart. Rumänisch, Dalmatisch/Istroromanisch, Friaulisch, Ladinisch, Bündnerromanisch (pp. 1–7). Niemeyer. [Google Scholar]
  44. Watkins, J. (2002). The phonetics of Wa: Experimental phonetics, phonology, orthography and sociolinguistics. Australian National University. [Google Scholar]
  45. Wayland, R. (1998). An acoustic study of Battambang Khmer vowels. Mon-Khmer Studies, 28, 43–62. [Google Scholar]
  46. Wedel, A., Kaplan, A., & Jackson, S. (2013). High functional load inhibits phonological contrast loss: A corpus study. Cognition, 128, 179–186. [Google Scholar] [CrossRef] [PubMed]
  47. Wedel, A., Nelson, N., & Sharp, R. (2018). The phonetic specificity of contrastive hyperarticulation in natural speech. Journal of Memory and Language, 100, 61–88. [Google Scholar] [CrossRef]
  48. Yang, C., Yang, S., & O’Grady, W. (2020). Jejueo: The language of Korea’s Jeju Island. University of Hawai’i Press. [Google Scholar]
  49. Yusuf, S. (1993). Sundanese vowel sounds: A spectrographic study [Master’s thesis, Ohio University]. [Google Scholar]
Figure 1. Cross-speaker F1 × F2 frequency data for the Romanian vowels [i e a o u ɨ ə], reported by Teodorescu (Teodorescu, 1985; 3 males, left graph) and by Renwick (Renwick, 2012; 14 females, right graph).
Figure 1. Cross-speaker F1 × F2 frequency data for the Romanian vowels [i e a o u ɨ ə], reported by Teodorescu (Teodorescu, 1985; 3 males, left graph) and by Renwick (Renwick, 2012; 14 females, right graph).
Languages 10 00012 g001
Figure 2. Normalized F1 × F2 frequency values across speakers for Romanian vowels [i e a o u ɨ ə] produced in meaningful words (left graph) and in nonsense words (right graph). F1 × F2 data values for each vowel correspond to different consonant context conditions, and are encircled by 95% confidence ellipses.
Figure 2. Normalized F1 × F2 frequency values across speakers for Romanian vowels [i e a o u ɨ ə] produced in meaningful words (left graph) and in nonsense words (right graph). F1 × F2 data values for each vowel correspond to different consonant context conditions, and are encircled by 95% confidence ellipses.
Languages 10 00012 g002
Figure 3. F1 × F2 frequency values for all tokens of the Romanian vowels [i e a o u ɨ ə] produced in meaningful words. They are encircled by 95% confidence ellipses and displayed separately for the six Romanian subjects.
Figure 3. F1 × F2 frequency values for all tokens of the Romanian vowels [i e a o u ɨ ə] produced in meaningful words. They are encircled by 95% confidence ellipses and displayed separately for the six Romanian subjects.
Languages 10 00012 g003
Figure 4. F1 × F2 frequency values for all tokens of the Romanian vowels [i e a o u ɨ ə] produced in nonsense words. They are encircled by 95% confidence ellipses and displayed separately for the six Romanian subjects.
Figure 4. F1 × F2 frequency values for all tokens of the Romanian vowels [i e a o u ɨ ə] produced in nonsense words. They are encircled by 95% confidence ellipses and displayed separately for the six Romanian subjects.
Languages 10 00012 g004
Figure 5. Cross-token standard deviations for the F1 and F2 frequency values for the seven Romanian vowels, displayed as a function of word type and speaker. Vowels are placed from left to right in the order [i e a o u ɨ ə] for each subject.
Figure 5. Cross-token standard deviations for the F1 and F2 frequency values for the seven Romanian vowels, displayed as a function of word type and speaker. Vowels are placed from left to right in the order [i e a o u ɨ ə] for each subject.
Languages 10 00012 g005
Figure 6. Normalized F2 × F3 frequency values across speakers for the Romanian vowels [i e a o u ɨ ə] produced in meaningful words (left graph) and in nonsense words (right graph). The F1 × F2 data values for each vowel correspond to the different consonant context conditions, and are encircled by 95% confidence ellipses.
Figure 6. Normalized F2 × F3 frequency values across speakers for the Romanian vowels [i e a o u ɨ ə] produced in meaningful words (left graph) and in nonsense words (right graph). The F1 × F2 data values for each vowel correspond to the different consonant context conditions, and are encircled by 95% confidence ellipses.
Languages 10 00012 g006
Figure 7. Mean duration and standard deviation values for the Romanian vowels [i e a o u ɨ ə] across tokens and speakers, displayed separately for the meaningful words (left) and the nonsense words (right).
Figure 7. Mean duration and standard deviation values for the Romanian vowels [i e a o u ɨ ə] across tokens and speakers, displayed separately for the meaningful words (left) and the nonsense words (right).
Languages 10 00012 g007
Figure 8. Significant differences for all vowel pairs above the grey color cells, according to results from the post hoc tests for the meaningful word condition (left) and the nonsense word condition (right) (* p < 0.05, ** p < 0.01, *** p < 0.001).
Figure 8. Significant differences for all vowel pairs above the grey color cells, according to results from the post hoc tests for the meaningful word condition (left) and the nonsense word condition (right) (* p < 0.05, ** p < 0.01, *** p < 0.001).
Languages 10 00012 g008
Figure 9. Mean duration and standard deviation values for Romanian [i e a o u ɨ ə] across tokens, displayed for each speaker and word type condition.
Figure 9. Mean duration and standard deviation values for Romanian [i e a o u ɨ ə] across tokens, displayed for each speaker and word type condition.
Languages 10 00012 g009
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Recasens, D.; Sánchez-Miret, F. An Acoustic Study of Romanian Stressed Vowels with Special Reference to Mid Central [ɨ] and [ə]. Languages 2025, 10, 12. https://doi.org/10.3390/languages10010012

AMA Style

Recasens D, Sánchez-Miret F. An Acoustic Study of Romanian Stressed Vowels with Special Reference to Mid Central [ɨ] and [ə]. Languages. 2025; 10(1):12. https://doi.org/10.3390/languages10010012

Chicago/Turabian Style

Recasens, Daniel, and Fernando Sánchez-Miret. 2025. "An Acoustic Study of Romanian Stressed Vowels with Special Reference to Mid Central [ɨ] and [ə]" Languages 10, no. 1: 12. https://doi.org/10.3390/languages10010012

APA Style

Recasens, D., & Sánchez-Miret, F. (2025). An Acoustic Study of Romanian Stressed Vowels with Special Reference to Mid Central [ɨ] and [ə]. Languages, 10(1), 12. https://doi.org/10.3390/languages10010012

Article Metrics

Back to TopTop