Compensation in Time and Space: Prominence Marking in Aging and Disease

Thies, Tabea; Hermes, Anne; Mücke, Doris

doi:10.3390/languages7010021

Open AccessArticle

Compensation in Time and Space: Prominence Marking in Aging and Disease

by

Tabea Thies

^1,2,*

,

Anne Hermes

^2,* and

Doris Mücke

^3,*

¹

Department of Neurology, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50924 Cologne, Germany

²

Laboratoire de Phonétique et Phonologie, UMR 7018, CNRS/Sorbonne Nouvelle, 75005 Paris, France

³

IfL Phonetics, Faculty of Arts and Humanities, University of Cologne, 50931 Cologne, Germany

^*

Authors to whom correspondence should be addressed.

Languages 2022, 7(1), 21; https://doi.org/10.3390/languages7010021

Submission received: 15 July 2021 / Revised: 19 December 2021 / Accepted: 15 January 2022 / Published: 27 January 2022

(This article belongs to the Special Issue Aging- and Disease-related Changes in Speech Production)

Download

Browse Figures

Versions Notes

Abstract

:

This study aims to explore the effects of healthy aging and Parkinson’s disease on speech motor performance. One area of speech production which requires fine speech motor control is prominence marking. Therefore, strategies of prominence marking of three speaker groups with four speakers each were investigated: younger speakers, older speakers, and speakers with Parkinson’s disease (PD). Acoustic and articulatory data were collected. Speech data were analyzed focusing on prominence-related adjustments of vowel production and tongue body movements in the temporal and spatial domain. Longer durations, varying initiation of the tongue movements and smaller vowel sizes in older speakers and in speakers with PD were found compared to younger speakers. The data indicate further that all speaker groups mark prominence by changing relevant parameters in the vowel articulation; however, strategies seem to differ between the groups: (i) in the temporal domain, articulatory movement durations increase with age and are further prolonged in speakers with PD; (ii) in the spatial domain, the articulation space is resized by the older speakers in a non-symmetrical way, while no systematic vocalic modulations were found for speakers with PD. To conclude, the speech system seems to be affected by age and disease, but speakers develop compensatory strategies to counteract influences in the spatial domain.

Keywords:

speech production; articulation; prominence marking; aging; Parkinson’s disease; compensatory strategies

1. Introduction

Prosody plays an essential role in conveying the meaning of an utterance. Speakers use multiple prosodic cues to highlight important information in the phonetic substance (Ladd 2012; Roessig and Mücke 2019). Prosodic highlighting involves categorical and gradient changes in intonation and articulation, while on the intonational level, pitch accent placement and pitch modulation are important factors (Sluijter and van Heuven 1995; Grice et al. 2005). Temporal and spatial changes in the oral vocal tract are related to the segmental level of consonant and vowel production that can increase sonority features and paradigmatic contrasts in prominent syllables (De Jong 1995; Beckman et al. 1992). Prosodic prominence is a complex process that requires fine speech motor control. The ability to mark prominence in terms of fine modulations of the laryngeal and supra-laryngeal system may decrease due to the impact of aging or diseases as physiological and/or mental changes can be involved (Thies et al. 2020; Karlsson and Hartelius 2021).

In an aging society, it is not only the process of aging and its effects which play an important role, but also the prevalence of associated diseases, potentially allowing for the possibility to disentangle the effects of aging from the effects of diseases. Parkinson’s disease (PD) especially, as the second most common neurodegenerative disease (De Lau and Breteler 2006), is a disease that continues to worsen with age (Kalia and Lang 2015). Aging, as well as PD, are known to impact general motor performance, speech production as well as cognitive-linguistic skills (Levelt and Meyer 2000; Ferreira and Swets 2002; Ketcham and Stelmach 2004; Smith and Caplan 2018). As the process of prosodic prominence marking requires fine speech motor control as well as sufficient cognitive skills, this study explores the role of prosodic modifications related to aging on the one hand and PD on the other.

The present study focuses on the relations of temporal and spatial marking of prominence in different speaker groups: younger healthy speakers, older healthy speakers, and speakers with PD. The two latter groups are associated with certain limitations due to the impact of aging and/or disease. Acoustic and articulatory data were collected by using electromagnetic articulography to analyze temporal and spatial strategies linked to prominence marking.

1.1. Prominence Marking and Information Structure

Prominence marking is a strategy to let certain parts of an utterance stand out compared to others. This highlighting process requires changes in intonation and articulation and can take place either between unaccented and accented words (across accentuation: non-prominent vs. prominent, e.g., background vs. broad focus,) or between accented words with different degrees of prominence (within accentuation: e.g., broad focus vs. contrastive focus).

Marking prominence on the intonational level involves the modulation of pitch (Sluijter and van Heuven 1995). According to the autosegmental-metrical model of intonation, changes in pitch movement generate different tones on the perceptual level which are classified into pitch accents. Pitch accent placement (i.e., the choice of a pitch accent type) and pitch modulation influence the degree of prominence (Sluijter and van Heuven 1995; Grice et al. 2005). Rising pitch accents are perceived as more prominent than falling pitch accents (categorical changes; Baumann and Röhr 2015). Furthermore, later and higher F0 peaks are also perceived as more prominent within the same rising pitch accent category (gradient changes, Roessig 2021).

Articulatory modifications are related to two principles: sonority expansion and hyper-articulation. For enhancing sonority, the degree of opening of the oral cavity increases to allow for more radiation of acoustic energy from the mouth in accented syllables (Harrington et al. 2000; Cho 2005, 2006). In addition, longer vowel durations can increase sonority on the perceptual level as well (Beckman et al. 1992). The strategy of hyper-articulation increases the paradigmatic contrasts between syllables by changing vocal tract configurations towards a more distinct articulation (De Jong 1995; Lindblom 1990). Therefore, place features of vowels are enhanced, leading to more peripheral formant frequencies. For example, the front vowel /i/ is articulated with a more fronted tongue position, while the low vowel /a/ is articulated with a more lowered tongue position.

Prominence marking can be captured on the acoustic level (e.g., acoustic durations, vowel formants; Kügler 2008; Baumann et al. 2007) and the articulatory level (e.g., duration and displacement of articulatory movements). A previous study investigated tongue body movements of healthy German speakers and stated that tongue positions and velocities systematically change with prominence across and also within accentuation (Pagel et al. 2021; Roessig and Mücke 2019). Vocalic movements are adjusted not only in the vertical dimension, e.g., greater jaw opening for /a/, but also in the horizontal domain, e.g., retraction of the tongue, to achieve a vocalic target on the periphery. The results are in line with the lip aperture measured during the production of vowels for different focus conditions in German and English, revealing a greater degree of lip opening from unaccented to accented syllables (across accentuation) and from broad to narrow focus and from narrow to contrastive focus (within accentuation; Mücke and Grice 2014; Krivokapić et al. 2017). All these types of adjustments regarding the lip and tongue movements were gradient in nature but systematically encoded different degrees of phrasal prominence during the communication process.

Prosodic prominence indicates information structure, such as focus marking (Lambrecht 1994). In those cases, the modulation of speech parameters changes on a continuum from not prominent to most prominent. For this study, three focus types are relevant: (i) background, (ii) broad focus and (iii) contrastive focus. The realization of focus types is dependent on the communicative context. While given information is classified as less important and thus not highlighted on the surface production (not prominent), new or less accessible information is made prominent. Examples of the three focus types investigated in this study are the following:

(1): Background (girl’s name is already given, not accented):
Q: Hat die Schwester der Mila gewunken? (Has the sister waved to Mila?)
A: Die OMA hat der Mila gewunken. (The grandmother waved to Mila.)
(2): Broad focus (whole answer is of interest, girl’s name is new, accented):
Q: Was ist passiert? (What happened?)
A: Die Oma hat der Mila gewunken. (The grandmother waved to Mila.)
(3): Contrastive focus (girl’s name is new and the name is corrected, accented):
Q: Hat der Opa der Luna gewunken? (Has the grandfather waved to Luna?)
A: Der Opa hat der MILA gewunken. (The grandfather waved to Mila.)

In the given examples, the constituent of interest is always the girl’s name ‘Mila’ (underlined). Words which are expected to be accented (prominent) are highlighted in bold (examples 1–3). Typically, words in background position are realized as unaccented without prosodic adjustments. In the present study, broad focus refers to a unit larger than just a word in which two constituents receive prominence (cf. ‘grandmother’ and ‘Mila’), as the whole answer provides new information to the question raised. Contrastive focus is restricted to prominence on a single word or syllable. As contrastive focus is also known as corrective focus, it is used to correct the previously introduced constituent (cf. ‘Luna’ vs. ‘Mila’).

The distinction between unaccented (background) and accented (broad focus) words is what is considered as across accentuation. In cases where speakers encode prosodic prominence to differentiate between broad focus and contrastive focus, this is considered as marking different degrees of contrast, i.e., within accentuation. In general, prosodic adjustments and therefore the degree of prominence increases from background to broad focus and further to contrastive focus condition (Baumann et al. 2007; Hermes et al. 2008; Roessig and Mücke 2019).

1.2. Aging and Speech

The aging process can involve changes in mental and physical skills. At the physiological level, age-induced changes affect the central nervous system, the (musculo)-skeletal system, and the cardiovascular system, leading to deficits in movement and posture. This is further accompanied by a loss of flexibility and muscular strength and can result in smaller and slowed down movements as well as affected initiation and execution. Therefore, deficits of gross motor control can arise. Previous studies report on prolonged limb movements and a reduction of maximum velocities (Seidler et al. 2002; Ketcham and Stelmach 2004). Furthermore, it has been shown that movement profiles (symmetry of acceleration and deceleration phases) are also affected by age, leading to asymmetrical movement patterns (prolongation of deceleration in movements). While younger individuals perform gross motor control movements with a rather symmetrical pattern of acceleration and deceleration phases to achieve the target of movements, prolonged deceleration phases have been reported for older individuals revealing an asymmetrical pattern (Brown 1996). A recent paper confirms that older adults perform with slower movements (Kornatz et al. 2021); it states further that older adults more often use submovements for goal-directed pointing movements indicating less accurate movement patterns.

However, age-related effects are not restricted to gross motor control, as they also appear on the level of speech motor control. A reduced speech tempo was reported in the literature accompanied by reduced coarticulation (Hermes et al. 2018; Amerman and Parnell 1992; Bourbon and Hermes 2021; D’Alessandro et al. 2020; Mücke et al. 2021). Two articulatory studies with a small sample size reported slower movements of the tongue body in older speakers compared to younger ones, especially during vowel production (Hermes et al. 2018; Mücke et al. 2021). Furthermore, they were able to show a similar pattern in the movement profiles as it has been shown for gross motor control: prolonged deceleration phases for the respective vocalic tongue body movements (asymmetrical pattern). In prominent syllables, the deceleration phase considerably increases in the production patterns of older speakers. This was especially the case for high vowels. The authors assumed that this might be a compensatory strategy for a decrease in sensory feedback in the older speakers. Moreover, regarding prominence marking there is preliminary evidence that older speakers do modulate durational properties across and within accentuation while spatial modulations were less clear (Mücke et al. 2021).

1.3. Parkinson’s Disease and Speech

Patients with PD suffer from a neurodegenerative disorder which affects motor and non-motor functions (Kalia and Lang 2015). Early motor signs are bradykinesia (slowness of movements), rigidity (stiffness of muscles) and a resting tremor. Axial symptoms, such as postural and gait impairment, speech problems (hypokinetic dysarthria) and dysphagia are manifested in later stages of the disease. The speech disorder results in less intelligible and unnatural speech evoked by a reduced modulation of pitch and intensity, slower and unprecise articulation as well as an overall reduced articulation space (Duffy 2019). In a study on speakers with PD, it was shown that the size of the vowel space is related to motor skills, as a higher motor impairment correlated with a reduction and centralization of the vowel space (Thies et al. 2020). Conversely, improved motor function induced by medication leads to an improved speech motor performance (Thies et al. 2021). Therefore, a connection between gross motor skills and speech motor skills cannot be denied.

With regard to prominence marking, previous studies provided evidence that speakers with PD are indeed able to modulate relevant speech parameters, such as pitch, intensity and durational properties (Thies et al. 2020, 2021). Their strategies of prominence marking on the acoustic level did not differ compared to healthy control speakers. However, when looking at the vowel production, a recent study showed that speakers with PD modulate durational and spatial properties of the vocalic tongue body movement only across accentuation (e.g., background vs. broad focus). Modulations distinguishing different degrees of prominence, i.e., within accentuation (e.g., broad focus vs. contrastive focus), were not found (Thies et al. 2021). Due to the overall reduced articulation space, spatial modulations seem to be restricted in amplitude or space in speakers with PD. This causes problems in which speakers with PD are no longer able to make fine-grained distinctions. In a study by Pell et al. (2006), it was reported that contrastive focus produced by patients with PD is less accurately recognized by listeners. Thus, a clear differentiation within accentuation, such as broad focus vs. contrastive focus, is no longer possible. In addition, a previous articulatory study determines heterogeneous pattern on the articulatory level for speakers with PD ranging from hyper- to hypo-articulation within and across speakers (Fivela et al. 2014).

1.4. Compensation Strategies in Speech

Compensation mechanisms can appear in speech production depending on external and/or internal factors. The process of speech production aims to produce intelligible speech output that is understood by the listeners. To achieve this auditory goal, speakers play along a continuum of hyper-articulation on the one side, and hypo-articulation on the other side (Lindblom 1990). With regard to Lindblom’s H&H theory, a speaker wants to be understood in the best possible way, but at the same time wants to make a minimum of effort.

Nevertheless, speakers adapt to new requirements driven by internal or external factors to maintain goal-directed speech production and thus accept investing more articulatory energy if needed. For example, in a noisy environment, speakers speak louder in order to be understood (Folk and Schiel 2011). In a study by Brunner et al. (2006) the palate shape of speakers was changed for two weeks by a prothesis. Speakers adapted to the new palate shape by lowering and retracting the tongue to produce intelligible speech output. As stated above, older speakers produce longer deceleration phases and more submovements in terms of multiple velocity peaks compared to younger speakers. This can be interpreted as a compensatory strategy to counteract age effects on speech planning so that older speakers can reach the articulatory target at the right time (Hermes et al. 2018; Mücke et al. 2021).

There are different stages of adaptation (Brunner et al. 2006) that arise, the longer speakers have to deal with the new requirements; this can be related to speakers who have to adapt to new physical conditions as they get older or develop a disease: they have to learn to deal with possible deviations in order to produce targeted speech movements. Compensation related to underlying articulatory movements does not necessarily mean that the speech output is different. Having the system of motor equivalence in mind, this emphasizes that articulatory goals can be achieved in different ways (Perrier and Fuchs 2015).

In the present study, we compare the flexibility of the prosodic system to mark prominence between younger speakers, older speakers, and speakers with PD. Therefore, we concentrate on temporal and spatial properties of the acoustic and articulatory vowel patterns. In prominent positions, we expect hyperarticulated vowels in terms of longer and larger tongue movement to signal prominence; however, speakers with PD are expected to show a reduced vowel space: Do they compensate for the reduced vowel space by adjusting the temporal domain to increase prominence? The same interplay of temporal and spatial modifications for prosodic modulations is investigated with respect to aging: How do older speakers express the flexibility of the prosodic system when abilities of the speech motor systems are changed?

2. Method

2.1. Participants

For this exploratory study, a small sample size of native German speakers was recorded (12 speakers in total, sex balanced, Table 1). This study analyzes strategies of prominence marking in German across three different groups: 4 younger healthy speakers (25 ± 3 years), 4 older healthy speakers (76 ± 4 years) and 4 speakers with PD (72 ± 5 years). Patients with PD were diagnosed with PD 12 (± 7) years prior to the study and were only treated with medications. None of the patients have had a deep brain stimulation implanted or a pump. The data of the speakers with PD were assessed in medication-OFF condition (after abstaining 12 hours from PD medication) to capture the pure status of the disease without treatment effects. To determine the motor ability, the third part of the Unified Parkinson’s Disease Scale (UPDRS, Goetz et al. 2008) was used. The mean UPDRS III score of 34 (± 6) indicates a moderate motor impairment (Martínez-Martín et al. 2015). All patients with PD showed mild signs of hypokinetic dysarthria. None of the participants had clinically relevant signs of cognitive impairment which were assessed with the Mini-Mental State Examination (Arevalo-Rodriguez et al. 2015).

The study was approved by the local ethics committee. Informed consent was obtained from all subjects involved in the study prior to their participation.

2.2. Speech Material and Recordings

Acoustic and articulatory speech recordings were carried out by using electromagnetic articulography (AG 501, Carstens Medizinelektronik GmbH, Bovenden, Germany). The acoustic signal was captured using a condenser microphone headset. The acoustic signal was recorded at 44.1 kHz/16 bit. To capture kinematic data, sensors were placed on the (1) lower lip, (2) upper lip, (3) tongue body and (4) tongue tip. The tongue sensors were placed approximately 1 cm and 4 cm from the beginning of tongue tip. Reference sensors were placed behind the ears and on the nose ridge for head correction.

Speech material was elicited with a game-like question–answer scenario and presented via a computer screen (Figure 1, Appendix A). The questions were presented auditorily, while the answers were produced by the participants (Example 1–3). Target words were disyllabic girl names with C₁V₁.C₂V₂-structure and with word stress on the first syllable (Table 2), which were embedded in a predefined sentence structure, such as: ‘Die Schwester hat der Mela gewunken.’ (Engl. “The sister has waved to Mela.”). The vowel V1 was one of the five peripheral vowels in German /i:, e:, a:, o:, u:/. Target words were collected in three different focus conditions associated with different degrees of prominence: background (target word in unaccented condition), broad focus, and contrastive focus (target words in accented condition).

In total, 360 tokens went into the analysis (10 target words × 3 focus conditions × 4 speakers × 3 groups). No filler items were used, and no repetitions were made in order to not prolong the duration of the experiment with regard to the condition of the patients. Only utterances that were produced incorrectly were repeated. A test phase was included at the beginning of the experiment. During this phase, all target words were produced in isolation by the participants (plus three practice trials).

2.3. Data Processing and Measurements

The speech data was displayed and annotated using the EMU-webApp (Winkelmann et al. 2017). Adjustments related to prominence marking were investigated based on acoustic and articulatory variables corresponding to vowel production in the target syllable, as the stressed vowel V1 is the main domain of prominence marking. It has been shown that the vowel is strongly affected by prominence modulations, while the role of consonants due to accent marking is less clear (Fougeron and Keating 1997; Cho and McQueen 2005; Mücke 2017; Cho 2006).

Figure 2 provides an annotation scheme for articulatory landmarks (Figure 2: trajectory, top) and acoustic landmarks (Figure 2: segmental string, below). In the acoustic dimension, target words, stressed syllables (C₁V₁) and vowels (V1) were determined according to the speech waveform and the wideband spectrogram by inspection of the higher formant structure. In the articulatory dimension, the following landmarks of the vocalic tongue body movement were identified: start of the movement (onset, onsV1) and target of the movement (targV1). The landmarks were annotated in the vertical plane by using zero-crossings in the respective velocity trace. The maximum velocity (pvelV1) was identified by using zero-crossings in the acceleration trace.

On the acoustic level, the following variables were computed:

Acoustic vowel duration (ms): Temporal interval between the start of the first vowel V1 and the end of the vowel V1 in the stressed C₁V₁ syllable of the target word. Longer vowels are associated with an increase in prominence.

Vowel space area: The mean vowel formants (F1, F2) taken from the vocalic segment V1 (/i:, e:, a:, o:, u:/) were calculated for analyzing the vowel space. Within the central 25 ms of the vocalic segment, a value was taken every 6.25 ms. The average of all four values was calculated for F1 and F2, respectively. An increase of the vowel space in terms of more peripheral vowels is associated with an increase in prominence marking (hyper-articulation).

Vowel Articulation Index (VAI): Based on the formants F1 and F2 of the V1 vowels /i:, a:, u:/, the VAI was calculated using the following formula (Sapir et al. 2010): VAI = (F2_i + F1_a)/(F1_i + F1_u + F2_u + F2_a). Higher values represent an enhancement of the vowel space as expected for prominence marking.

On the articulatory level, the following landmarks were computed to capture the vocalic tongue body movement:

Vocalic movement duration (ms): Temporal interval between the onset of the vocalic movement until the maximum target (onsV1 to targV1). An increase in movement duration is associated with an increase in prominence.

Symmetry ratio: This is the ratio of the deceleration phase divided by the acceleration phase. The temporal interval between the start of the movement to the maximum velocity (onsV1 to pvelV1) corresponds to the acceleration phase of the vocalic movement and the interval from the maximum velocity to the target of the movement (pvelV1 to targV1) corresponds to the deceleration phase of the vocalic movement. An increase in the symmetry ratio indicates a prolongation of the deceleration phase.

Tongue body position in the vertical and horizontal domain: The mean position of the tongue body was calculated using the y-position and the x-position of the movement trajectory. Per domain, all position values during the first half (50%) of the acoustic vowel segment V1 were averaged, as depicted in Figure 2 (Roessig 2021; Pagel et al. 2021). All values were z-transformed for each speaker and vowel. Under prominence, more extreme tongue positions are expected (hyper-articulation) to signal a more distinct vowel articulation corresponding to a more peripheral vowel space.

To capture the relationship between the acoustic speech output and underlying articulatory movements, coordination patterns were calculated for the onset (onsV1) and the target (targV1) of the vocalic tongue body movement with respect to the acoustic syllable properties (cf. Thies et al. 2021). Since it has been reported that prominence affects the syllable-internal coordination, variability of the coordination patterns between the initiation and the target achievement of the vocalic tongue movement and the acoustic syllable boundaries is expected. While the vocalic target is expected to be achieved consistently, variation in terms of movement initiation is suspected (Thies et al. 2021).

Onset V1 to start of acoustic syllable (%): Interval between the onset of the vocalic tongue body movement (onsV1) and the left acoustic syllable boundary (start) divided by the acoustic syllable duration of the stressed CV syllable. Negative values indicate that the articulatory vocalic movement is initiated before the start of the acoustic syllable. The smaller the values, the earlier the movement is initiated before the start of the acoustic syllable. Under prominence, the start of the vocalic movement and the start of the acoustic syllable boundary are expected to be timed more tightly (leading to smaller temporal lags).

Target V1 to start of acoustic syllable (%): Interval between the target of vocalic tongue body movement (targV1) and the left acoustic syllable boundary (start) divided by the acoustic syllable duration of the stressed CV syllable. Positive values indicate an achievement within the acoustic syllable. The higher the values, the later the target is achieved within the acoustic syllable. Under prominence, the target achievement of the vocalic movement with respect to the start of the acoustic syllable boundary is supposed to remain unchanged.

3. Results

3.1. Temporal Domain

The data presentation is divided into a temporal and spatial analysis. The temporal measurements are reported first and presented in Figure 3. These include (a) acoustic vowel duration, (b) vocalic movement duration and (c) symmetry ratio. Comparisons are made between focus conditions and speaker groups. As this study is of exploratory nature with a small sample size, only descriptive statistics are applied.

Acoustic vowel duration (ms): Figure 3a presents the acoustic vowel durations separately for the speaker groups and focus conditions. Comparing values across the groups, the longest vowel durations are produced by the older speakers (background: 130 ± 33 ms, broad focus: 156 ± 47 ms, contrastive focus: 166 ± 57 ms), while the shortest durations are found in the young speaker group (background: 109 ± 24 ms, broad focus: 119 ± 26 ms, contrastive focus: 133 ± 28 ms). Speakers with PD produce shorter durations than older speakers (background: 138 ± 35 ms, broad focus: 152 ± 35 ms, contrastive focus: 155 ± 30 ms). Moreover, it is noticeable that the data of the older speakers show a higher variability compared to the younger speakers. There is a systematic increase of vowel duration under prominence from background to broad focus (across accentuation) and from broad focus to contrastive focus (within accentuation) for both, the younger and the older speaker group. The data of the speakers with PD, indicate a reduced range of prominence modulation. While the acoustic vowel duration increases from unaccentuated to accented condition (14 ms from background to broad focus), there are no clear adjustments within accentuation (3 ms from broad focus to contrastive focus).

Vocalic movement duration (ms): The duration of the tongue body movement for producing the vowel V1 increases with prominence in all groups (Figure 3b). The data indicate that movement durations are shortest and least variable in the younger speaker group (background: 168 ± 25 ms, broad focus: 179 ± 24 ms, contrastive focus: 187 ± 28 ms), while older speakers prolong movement durations and show more variability (background: 187 ± 39 ms, broad focus: 206 ± 52 ms, contrastive focus: 221 ± 66 ms). Speakers with PD present with the longest movement durations and also with a high variability (background: 203 ± 45 ms, broad focus: 226 ± 35 ms, contrastive focus: 215 ± 50 ms). Note, that the speakers with PD increase the duration of the vocalic movement from background to broad focus, but surprisingly decrease the duration from broad focus to contrastive focus. This is not the case for younger and older speakers, who systematically produce longer vocalic movements with increasing prominence.

Symmetry ratio (dec/acc): The symmetry (ratio of deceleration phase to acceleration phase) of the tongue body movement does not change with prominence (Figure 3c). However, group differences can be detected. Younger speakers produce rather symmetrical profiles as the acceleration and the deceleration phases are of the same length (ratio = 1.1). In contrast, older speakers and speakers with PD show more asymmetrical patterns, implying longer deceleration phases (older speakers: ratio = 1.6, speakers with PD: ratio = 1.7). The variability is higher in older speakers and speakers with PD when being compared to younger speakers. The highest variability in the symmetry ratio can be found in background condition for speakers with PD.

Another metric within the temporal domain we investigated is the relationship between the acoustic and articulatory level of syllable production. Therefore, the onset and the target of the vocalic tongue movement with respect to the start of the acoustic syllable as a ratio of syllable duration was calculated (Table 3). The data indicate that the target of the tongue body movement is on average achieved at 66% within the acoustic syllable independent of focus condition and speaker group.

In contrast, the initiation of the vocalic tongue body movement differs across speaker groups and focus conditions. As expected, the vocalic tongue movement is initiated before the acoustic syllable onset in all conditions (negative values). Under prominence, the onset of the vocalic movement systematically shifts to the right, closer to the acoustic start of the stressed syllable. Therefore, the onset of the vocalic tongue body movement and the start of the acoustic target syllable are more tightly timed with respect to each other under prominence. Note that in the articulatory domain, consonants are superimposed by the vowel-to-vowel articulation (Öhman 1966). The onset of the movement for V1 is at the same time the offset of the movement for the preceding vowel. According to this, the target syllable and the preceding syllable are timed closer with respect to each other when the target syllable is accented. These adjustments of coordination patterns are systematic across speaker groups. Comparing the speaker groups, it becomes apparent that speakers with PD initiate the vocalic movement earlier with respect to the acoustic syllable boundary than the healthy younger speakers, thus indicating a looser coordination between the two syllables for the speakers with PD. The opposite is the case for the older speakers; they show an even tighter coordination between the two syllables compared to the younger speakers, i.e., the onset of the vocalic movement is shifted to the right towards the start of the acoustic syllable. To summarize, the onset of the tongue body movement is initiated earliest in speakers with PD, followed by younger speakers and latest initiated in older speakers.

3.2. Spatial Domain

Measures within the spatial domain display (i) the acoustic vowel space area (Figure 4), (ii) the vowel articulation index (Table 4) and (iii) the mean position of the tongue body on the vertical and the horizontal plane (Figure 5 and Figure 6) related to the five vowels under investigation, separately for each speaker group and focus condition.

Acoustic vowel space area: Figure 4 displays the modulation of the acoustic vowel space area on the basis of the formant frequencies F1 and F2 (see Appendix B for values of F1 and F2). The plots show from left to right the three focus conditions (background, broad focus and contrastive focus) for the three speaker groups. While younger speakers perform with the largest vowel space (Figure 4, grey area), speakers with PD have the smallest vowel space across all focus conditions (Figure 4, red area). Furthermore, the overall vowel space appears to be strongly retracted in the older speaker group (Figure 4, blue area) in comparison to the younger speaker group. In addition, some retracted positions for /u:/ and /o:/ can be observed in the productions of speakers with PD compared to younger speakers which are not as strongly retracted as in the older speakers.

Table 4 displays the VAI per speaker group (younger, older, and speakers with PD) and illustrates the different adjustments due to prominence marking per group. The VAI indicates the highest values for younger speakers and the lowest VAI for speakers with PD across all conditions. For prominence marking, the VAI increases in younger speakers the most, as the range of the VAI from background to contrastive focus condition is higher in younger speakers (difference of 0.11) compared to older speakers (difference of 0.06). In contrast to the healthy speaker groups, the VAI of speakers with PD does not change under prominence (difference of 0.03).

Tongue body position: Figure 5 and Figure 6 present the vertical and horizontal tongue body positions taken from the articulatory trajectories for the different vowel types. On the vertical axis, low values indicate a lowering of the vocalic target during the vowel production and high values indicate a raising of the tongue. On the horizontal axis, lower values indicate a retraction of the tongue and higher values indicate a fronting of the tongue. Note that the positional plots of the tongue body in the articulatory dimension resemble the positions of F1 and F2 in acoustic formant charts. Furthermore, the low vowel /a:/ is produced with a central tongue position in the horizontal plane, i.e., /a:/ is neither specified as a front vowel nor as back vowel in Standard German.

Front vowels /i:, e:/: The overall tongue position for the production of the vowel /i:/ and /e:/ is much more fronted and raised in the healthy speaker groups compared to the speakers with PD. Comparing older and younger speakers, the tongue is slightly more raised in older speakers.

Focusing on prominence marking, younger speakers present a more fronted and raised tongue position for /i:/ and /e:/ in contrastive focus condition. Older speakers differentiate between unaccented and accented conditions by moving the tongue more to the front, but note that the conditions partly overlap. Speakers with PD adjust the articulatory target only for /i:/ (across accentuation), whereas the modulation for /e:/ is not systematic.

Low vowel /a:/: For producing the vowel /a:/, a lower tongue position is found in younger speakers compared to older speakers. Tongue positions of speakers with PD are further retracted in comparison to older speakers but at the same tongue height.

Prominence modulations are visible through a systematic lowering of the tongue in younger speakers (within accentuation) and older speakers (across and within accentuation). Speakers with PD do not lower the tongue under prominence, but they aim to retract the tongue for /a:/ in accented syllables; however, there is no systematic mechanism for prominence marking detectable, since the backing of the tongue appears to be stronger in broad focus than in contrastive focus.

Back vowels /u:, o:/: For the corner vowel /u:/, the younger speakers produce the highest tongue positions. The tongue positions for /u:/ are lower in older speakers compared to younger speakers. The lowest and most retracted tongue positions for /u:/ are observable in the productions of the speakers with PD. While the tongue positions for /o:/ are comparable between the younger and older speakers, the speakers with PD produce /o:/ with considerably lowered and retracted tongue positions.

To encode prominence, younger speakers show a slightly fronting of the tongue body for /u:/ to differentiate between accented and non-accented conditions, while the older speakers are systematically retracting the tongue across and within accentuation (note that focus conditions in younger speakers overlap). Speakers with PD do not modulate the vocalic target for the corner vowel /u:/ at all. For /o:/, both the younger and the older speaker group retract the tongue under prominence. While speakers with PD do not encode different degrees of prominence for the corner vowel /u:/, they produce a fronting and lowering for /o:/ under prominence.

To conclude, prominence modulations and group differences are visible in the current data set. Speakers with PD stand out with an overall smaller articulation space in both dimensions, horizontal and vertical, revealing smaller tongue body movements for the corner vowels. Thus, prominence modulation of the tongue body is reduced and less systematic. It is noticeable that speakers with PD mark prominence by a fronting of the tongue during the production of the back vowel /o:/ and a retracted tongue for the low vowel /a:/, which is indeed the opposite direction compared to younger and older speakers. Moreover, the healthy speaker groups differ across each other as the older speakers try to produce more often systematic contrasts across and within accentuation, whereas the younger speakers often differentiate only between unaccented and accented conditions (across accentuation); however, the younger speakers show a more fronted and a larger vowel space in terms of articulatory tongue positions than the older speakers.

4. Discussion

This exploratory study investigates age-related and disease-related changes in prominence marking. A small data set of three groups (younger speakers, older speakers, speakers with PD) with four speakers each were analyzed and compared in order to examine strategies of modulating speech parameters in the temporal and spatial domain. The analysis has been applied to changes in vowel production on the acoustic and the articulatory level.

Possible differences in prominence marking strategies between the speaker groups could be found on the basis of this small data set, leading to the assumption that age and disease affect speech production, and prominence marking in particular. The effects of age and disease on speech point towards the development of certain strategies to compensate speech deficits. These compensatory strategies seem to be quite different comparing older speakers and speakers with PD. We cannot exclude that the observations are due to speaker-specific variabilities since our group sizes are rather small (which is indeed often the case for studies using electromagnetic articulography) but trends are clearly detectable. The group strategies will be discussed in more detail below.

Younger speaker group: We found adjustments in the temporal and spatial domain to signal prominence. In the temporal domain, the younger speakers gradually increase acoustic vowel durations across accentuation (background to broad focus) and within accentuation (broad focus to contrastive focus). This was also reflected in the temporal properties of the underlying vocalic tongue movement, which were gradually increased under prominence. The younger speakers, as expected, show symmetrical profiles for the tongue movements in terms of acceleration and deceleration phases. The symmetry profile of the vocalic tongue movement is not affected by prominence. When investigating the coordination patterns of the vocalic tongue movement with respect to the acoustic syllable boundaries, our data reveal that prominence affects the initiation of the vocalic movement, while the target achievement remains stable. Under prominence, the vocalic tongue movement is initiated closer to the syllable onset, leading to a tighter timing pattern between articulatory vowel initiation and the start of the acoustic syllable for prominent constituents and therefore to a tighter timing between the accented syllable and the preceding one. In the spatial domain, the younger speakers produced a more distinct vowel articulation under prominence. In terms of formant frequencies, F1 and F2, the acoustic vowel space systematically increases. This pattern is also reflected by the VAI and the maximum target positions of the underlying articulation of the tongue. They increase the vowel space across accentuation (background to broad focus) for back vowels /u:, o:/ and within accentuation (broad focus to contrastive focus) for front vowels and the low vowel /a:/. For highlighting important information, front vowels become more fronted and back vowels more lowered to encode contrastive focus. The results can be attributed to hyper-articulation strategies to enhance paradigmatic contrasts between the different vowel types and therefore to increase intelligibility (De Jong 1995; Harrington et al. 2000; Cho 2005; Roessig and Mücke 2019). The higher degree of the opening of the oral cavity during the production of the low vowel /a:/ can further be attributed to sonority expansion, allowing for more radiation of acoustic energy from the mouth during the production of prominent elements in an utterance. Sonority expansion enhances syntagmatic contrasts, i.e., between accented and unaccented syllables. Both hyper-articulation and sonority expansion are related to feature enhancement, triggered by the phonological system of a given language.

Taking all these adjustments together, our data may reveal that the younger speakers are rather flexible in encoding prominence on the phonetic surface. The strongest effects were found in the temporal domain, while spatial modifications in terms of hyperarticulated vowels are less consistent in the productions of the younger speaker group. The most spatial modulations were found in contrastive focus condition, a focus type that reveals the highest degree of prominence. With regard to articulatory effort, we conclude that young speakers articulate with a high degree of efficacy to express pragmatic meaning in terms of phonetic cues, both in the temporal and spatial domain; it seems that they avoid to spent more articulatory effort than necessary to encode prominence in the relevant task—however, the prosodic marking is rather systematically and appears to be balanced well.

Older speaker group: The older speaker group encodes prominence in the temporal and spatial domain systematically. In the temporal domain, the older speakers (as the younger ones) gradually increase acoustic vowel durations across accentuation (background to broad focus) and within accentuation (broad focus to contrastive focus). This was reflected in the temporal properties of the underlying vocalic tongue movement, which were also gradually increased under prominence. However, they produce considerably longer acoustic vowel durations than the younger speaker group. This is also reflected in the articulatory domain by longer vocalic tongue movements. As in the younger speaker group, the symmetry profiles of the older speakers remain unchanged under prominence, i.e., longer deceleration phases are not used to mark prominence in both groups. In contrast to the younger speakers, the deceleration phases of the tongue body movement are longer than the acceleration phases in the older speakers, thus changing the symmetry profile of the articulatory movement. We assume that the longer deceleration phases in the older speakers can be attributed to compensatory strategies to maintain the goal-oriented articulation and to avoid possible problems with deficient sensory feedback (Hermes et al. 2018; Mücke et al. 2021). It might be also one of the reasons why older speakers are reported to show a decrease in coarticulation on the acoustic surface (D’Alessandro et al. 2020). In addition, longer deceleration phases are interpreted as a sign of deviant speech patterns related to speech movement disorders, such as dysarthria (Mücke et al. 2018; Forrest et al. 1989). These changes within the temporal domain refer to a slowing down of the speech system with age. This is in line with inter alia Amerman and Parnell (1992) who also reported on slower speaking rates in older speakers compared to younger ones. Another aspect can be found in the coordination patterns between the onset of the vocalic tongue movement and the acoustic syllable boundaries. The overall coordination patterns are tighter in the older speakers than in the younger ones, leading to a relatively late initiation of the vocalic tongue body movement with respect to the target syllable. As in the younger speaker, the degree of prominence is reflected in these subtle timing patterns, leading to a tighter timing between the accented and the preceding syllable in the productions of the older speakers. In the spatial domain, the data reveal a smaller and more retracted vowel space in older speakers compared to younger ones. The smaller vowel space is in line with the literature showing parallels to gross motor control with slower and smaller movement trajectories of the limbs (Seidler et al. 2002; Ketcham and Stelmach 2004). The older speakers systematically resize the vowel space in all directions to signal prominence by making use of the strategies of sonority expansion and hyper-articulation. Note that they may only try do differentiate across and within accentuation as focus conditions overlap sometimes. According to the VAI, the resizing of the vowel space seems to be lower than with the younger speakers. Moreover, the expansion of the vowel space is stronger and less symmetrical in the front-back dimension compared to the younger speakers, since the retraction is more pronounced than the fronting. In the open-closed dimension, the resizing of the vowel space includes a systematical lowering of the tongue in terms of vowel formants. These observations are reflected in the underlying articulatory tongue positions on the vertical and horizontal axis. In contrast to the younger speakers, we found a more retracted tongue position for /o:/ and /u:/ in prominent positions indicating hyperarticulated vowels. In addition, the lowering of the tongue for /a:/ from background to broad focus and further to contrastive focus seems to be stronger in the older speaker group compared to younger speakers, indicating the use of sonority expansion. Overall, modulations due to prominence seem to be stronger and more systematic across but also within accentuation for the older speakers than for the younger speakers.

We conclude that the older speaker group sufficiently encodes prominence across accentuation and within accentuation in the temporal but also spatial domain. In contrast to younger speakers, strategies of prominence marking in older speakers require higher articulatory effort as they use the temporal and the spatial domain. Therefore, the speech production is less efficient (high-cost behavior) compared to younger speakers.

Speaker group with PD: The speakers with PD do also signal prominence, but they differ in their strategies from both healthy younger and older speakers. In the temporal domain, as the other speaker groups, speakers with PD increase acoustic and articulatory vowel durations across accentuation (background and broad focus) to mark prominence, but adjustments within accentuation (broad focus and contrastive focus) are less clear. When being compared to older speakers, PD speakers produce shorter vowel durations on the acoustic level, but interestingly longer durations of vocalic tongue body trajectories on the articulatory level. This can be explained by looking in more detail at the onset coordination of the vocalic tongue movement relative to the start of the acoustic syllable. While the vocalic target is consistently reached at 66% of the acoustic syllable in all speaker groups, the speakers with PD initiated the vocalic tongue body movement much earlier than the older speakers. This coordination pattern between the target syllable and the preceding syllable is less tight than in the healthy speaker groups and thus leads to longer vocalic tongue durations in speakers with PD in the underlying articulatory patterns. Due to the early vowel initiation and the looser coordination between the target syllable and the adjacent syllable, and especially due to the stable achievement of the target, we do not find longer vowel segments on the acoustic surface. As speech production is guided toward auditory goals, the acoustic output can remain the same, while the way to produce the same speech output may differ. We assume that speakers with PD change their articulatory coordination and speech planning to counteract effects of the disease on their speech. In addition, longer deceleration phases of the vocalic tongue body movements (asymmetric movement profiles) were produced by the speakers with PD, which were comparable to the older speakers (while the younger speakers produced symmetrical movement profiles). As mentioned above, this supports the goal-oriented articulation and leads to the target being reached at the right time. In the spatial domain, the acoustic vowel space of speakers with PD is smaller and more centralized compared to both healthy speaker groups. This is reflected in the VAI but also in the underlying articulatory patterns in terms of the vertical and horizontal tongue positions, being less peripheral than in the healthy speaker groups. A reduced vowel space is already reported in the literature and is a symptom of hypokinetic dysarthria (Duffy 2019; Skodda et al. 2011; Skodda et al. 2012; Thies et al. 2020). This centralization can cause problems as for instance the vowel /o:/ is produced with a much lower tongue position in speakers with PD, which can lead to a merging of vowel categories, a decrease in perceptual salience and less intelligible speech. The more the tongue is lowered, the more likely it is that an /o:/ will sound like an /a:/. As previously shown by Thies et al. (2020), these two vowel categories, the back vowel /o:/ and the low vowel /a:/ were more centralized and partly overlapped (Thies et al. 2020). In addition, it is interesting to note that tongue body positions were strongly adjusted for the vowels /i:, o:/ under prominence. Some prosodic adjustments were different from the healthy speaker groups, as the open vowel /a:/ is retracted (not lowered) and /o:/ is lowered and fronted (and not retracted). The overall vowel space is not further retracted to signal prominence in speakers with PD as it is in older speakers. This suggests that compensation strategies differ between older speakers and speakers with PD. One might expect a transition from younger to older speakers to speakers with PD, so that the vowel space becomes smaller and smaller, but also further and further retracted, but this is not the case in our data.

Aging and disease effects: Whereas younger speakers show a high degree of efficacy in marking prominence in the temporal and spatial domain across accentuation and within accentuation, older speakers and speakers with PD show deviant and less efficient speech patterns. Thus, the speech system seems to be affected by age and disease evoking a slowing down, less flexible articulators and a reduced vowel space. Differences in the use of temporal parameters between the groups are clearly visible. In older speakers and speakers with PD, strong temporal modifications (prolongations) are found with regard to acoustic durations, articulatory movements durations and deceleration phases. It is likely, that the durational adjustments are compensatory strategies that repair for deficits in the spatial domain (smaller vowel size) and/or for coordination deficits. While older speakers produce a tight coordination between the target syllable and the preceding syllable, the coordination is rather loose for speakers with PD. This means that speakers with PD produce long vocalic movements in the articulatory domain (since the vowel is initiated rather early). Moreover, the vowel space decreases with age and decreases even further with PD. Whereas older speakers most often use spatial adjustments systematically across and within accentuation, speakers with PD cannot resize the vowel space in a strategic and sufficient way under prominence. The older speakers systematically retract the vowel space under prominence for back vowels to encode prominence and to compensate for less abilities in adjusting the tongue body for front vowels. However, the adjustments are less efficient for the speakers with PD since they were not able to resize the corner vowel in an appropriate way. The best example is probably the low vowel /a:/, which is not further lowered but retracted under prominence.

We interpret the modifications of the temporal parameters as a developed compensatory strategy to deal with the spatial deficits and coordination problems between syllables. The more the tongue system is affected by age or disease, the stronger the temporal compensations. Strong temporal modifications are a compensatory strategy for the smaller and retracted vowel space. The compensatory strategies developed in the different speaker groups show a complex behavior of the speech system with different solutions in speech motor control patterns. We conclude that there seems to be no gradient transition when comparing the limitations of the lingual system from aging to Parkinson’s disease.

5. Conclusions

All speaker groups mark prominence in the temporal and spatial domain; however, they seem to differ in the way how they modulate prominence to highlight important information on the textual string. This could be interpreted as compensatory strategies developed by older speakers and speakers with PD to overcome the limitations of the oral speech motor system. In the temporal domain, durations increase with age and movements are even longer in speakers with PD. The coordination between adjacent syllables is quite different since there is a tight coordination in older speakers and a rather loose coordination in speakers with PD. In the spatial domain, the articulation space decreases with age and decreases further with PD. While older speakers compensate for the spatial limitations by retracting the complete vowel space, this was not the case for speakers with PD. Speakers with PD aim to adjust single vowels, but not the system as a whole. The more the tongue system is affected by age or disease, the stronger the temporal compensations. Since older speakers seem to differ in their compensatory articulation from the PD group, we conclude that there is no gradient transition from aging to Parkinson’s disease in the facilities of oral speech motor control; however, this study demonstrates, on the one hand, that older speakers and speakers with PD show a reduced flexibility of the vocalic tongue movements, but on the other, that these two groups differ in their ability to compensate for these limitations.

Author Contributions

Conceptualization, D.M. and T.T.; methodology, T.T., A.H. and D.M.; software, T.T.; validation, T.T., A.H. and D.M.; formal analysis, T.T.; investigation, A.H. and D.M.; resources, D.M.; data curation, T.T.; writing—original draft preparation, T.T.; writing—review and editing, A.H. and D.M.; visualization, T.T.; supervision, D.M.; project administration, A.H.; funding acquisition, D.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the French ANR (ANR-10-LABX-0083; Strand 1—Operation PATH) and funded by the German Research Foundation (DFG) as part of the SFB 1252 “Prominence in Language” (Project-ID 281511265) in project A04 “Dynamic Modelling of Prosodic Prominence” at the University of Cologne and by a.r.t.e.s. Graduate School for the Humanities Cologne.

Institutional Review Board Statement

The study was approved by the local ethics committee of the University Hospital of Cologne (protocol code: 18–425; date of approval: 8 February 2019).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Not applicable.

Acknowledgments

We thank Theo Klinker and Lukas Henne for their technical support and Jane Mertens for part of the data collection.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

The question–answer pairings were divided according to word lists and blocks. Each subject completed one Block 1 and one Block 2 which were randomly selected across the word lists. The following overview presents the sentences (answers) which were produced by the participants. Words which are expected to be accented (prominent) in the answer are highlighted in bold. Capitalized words indicate that a production of the contrastive focus is expected.

Word list 1—Block 1

Question	Answer
Hat die Oma der Mila gewunken?	Der OPA hat der Mila gewunken.
Hat die Schwester der Dina gewunken?	Die Schwester hat der LUNA gewunken.
Hat die Schwester der Nora gewunken?	Die Schwester hat der MELA gewunken.
Was ist passiert?	Die Oma hat der Lina gewunken.
Hat der Bruder der Lena gewunken?	Der OPA hat der Lena gewunken.
Hat die Schwester der Gabi gewunken?	Die Schwester hat der MULA gewunken.
Hat der Opa der Luna gewunken?	Der BRUDER hat der Luna gewunken.
Was ist passiert?	Der Opa hat der Mela gewunken.
Hat der Bruder der Mona gewunken?	Der Bruder hat der LINA gewunken.
Was ist passiert?	Die Oma hat der Mula gewunken.
Was ist passiert?	Der Opa hat der Lena gewunken.
Hat die Oma der Mela gewunken?	Die SCHWESTER hat der Mela gewunken.
Hat die Schwester der Kati gewunken?	Die Schwester hat der MILA gewunken.
Hat der Opa der Lina gewunken?	Der BRUDER hat der Lina gewunken.
Was ist passiert?	Die Oma hat der Luna gewunken.
Hat der Bruder der Mula gewunken?	Der OPA hat der Mula gewunken.
Hat die Oma der Mina gewunken?	Die Oma hat der LENA gewunken.
Was ist passiert?	Der Bruder hat der Mila gewunken.

Word list 1—Block 2

Question	Answer
Hat der Bruder die Moli verlassen?	Der OPA hat die Moli verlassen.
Hat der Opa die Tina verlassen?	Der Opa hat die LANI verlasssen.
Hat die Schwester die Sina verlassen?	Die Schwester hat die MALI verlassen.
Was ist passiert?	Die Oma hat die Loni verlassen.
Hat die Oma die Mali verlassen?	Der BRUDER hat die Mali verlassen.
Was ist passiert?	Der Opa hat die Moli verlasssen.
Hat die Schwester die Nena verlassen?	Die Schwester hat die LONI verlassen.
Hat der Opa die Lani verlassen?	Die OMA hat die Lani verlassen.
Was ist passiert?	Der Bruder hat die Mali verlassen.
Hat der Bruder die Loni verlassen?	Der OPA hat die Loni verlassen.
Hat die Oma die Susi verlassen?	Die Oma hat die MOLI verlassen.
Was ist passiert?	Die Oma hat die Lani verlassen.

Word list 2—Block 1

Question	Answer
Hat die Oma der Mula gewunken?	Der OPA hat der Mula gewunken.
Hat der Bruder der Susi gewunken?	Der Bruder hat der LINA gewunken.
Was ist passiert?	Die Oma hat der Mela gewunken.
Hat die Schwester der Nina gewunken?	Die Schwester hat der LUNA gewunken.
Was ist passiert?	Der Opa hat der Mila gewunken.
Hat der Opa der Lena gewunken?	Die OMA hat der Lena gewunken.
Hat der Bruder der Lina gewunken?	Die SCHWESTER hat der Lina gewunken.
Was ist passiert?	Der Opa hat der Mula gewunken.
Was ist passiert?	Die Oma hat der Lena gewunken.
Hat der Bruder der Dina gewunken?	Der Bruder hat der MELA gewunken.
Hat der Opa der Luna gewunken?	Die OMA hat der Luna gewunken.
Hat die Schwester der Mona gewunken?	Die Schwester hat der MILA gewunken.
Was ist passiert?	Der Bruder hat der Lina gewunken.
Hat die Oma der Mela gewunken?	Der OPA hat der Mela gewunken.
Hat der Opa der Tina gewunken?	Der Opa hat der LENA gewunken.
Hat die Schwester der Mila gewunken?	Die OMA hat der Mila gewunken.
Was ist passiert?	Die Schwester hat der Luna gewunken.
Hat der Bruder der Nora gewunken?	Der Bruder hat der MULA gewunken.

Word list 2—Block 2

Question	Answer
Hat die Schwester die Mali verlassen?	Die OMA hat die Mali verlassen.
Hat der Bruder die Nora verlassen?	Der Bruder hat die LANI verlassen.
Hat die Oma die Moli verlassen?	Der OPA hat die Moli verlassen.
Was ist passiert?	Die Schwester hat die Loni verlassen.
Was ist passiert?	Die Oma hat die Lani verlassen.
Hat der Opa die Nena verlassen?	Der Opa hat die MALI verlassen.
Hat der Bruder die Loni verlassen?	Die SCHWESTER hat die Loni verlassen.
Was ist passiert?	Der Bruder hat die Moli verlassen.
Hat die Oma die Lani verlassen?	Der OPA hat die Lani verlassen.
Hat die Oma die Susi verlassen?	Die Oma hat die LONI verlassen.
Hat die Schwester die Tina verlassen?	Der Schwester hat die MOLI verlassen.
Was ist passiert?	Die Oma hat die Mali verlassen.

Appendix B

The following table displays the means and standard deviations of the vowel formant frequencies F1 and F2 per vowel, speaker group, and focus condition.

Table A1. Mean (sd) F1 and F2 formant frequencies in Hz of vowel V1.

Vowel	Speaker Group	Vowel Formants	Background	Broad	Contrastive
/i:/	younger	F1	294 (58)	266 (59)	256 (49)
	younger	F2	2296 (295)	2318 (296)	2371 (196)
	older	F1	256 (44)	268 (47)	270 (49)
	older	F2	2156 (210)	2165 (266)	2207 (182)
	PD	F1	296 (35)	310 (48)	297 (38)
	PD	F2	1997 (419)	2021 (409)	2059 (407)
/e:/	younger	F1	375 (91)	368 (68)	362 (71)
	younger	F2	2138 (298)	2197 (268)	2179 (228)
	older	F1	344 (47)	347 (34)	374 (23)
	older	F2	2043 (197)	2064 (228)	2161 (198)
	PD	F1	375 (29)	390 (37)	382 (50)
	PD	F2	1921 (363)	1943 (366)	1935 (387)
/a:/	younger	F1	691 (210)	716 (187)	752 (184)
	younger	F2	1453 (199)	1425 (169)	1423 (157)
	older	F1	554 (128)	616 (130)	671 (75)
	older	F2	1412 (189)	1351 (171)	1386 (173)
	PD	F1	629 (170)	626 (185)	670 (190)
	PD	F2	1228 (231)	1281 (256)	1267 (244)
/o:/	younger	F1	339 (75)	353 (61)	347 (62)
	younger	F2	974 (125)	1072 (178)	941 (171)
	older	F1	290 (44)	334 (28)	362 (28)
	older	F2	920 (137)	825 (92)	830 (109)
	PD	F1	362 (34)	378 (42)	393 (32)
	PD	F2	944 (157)	962 (145)	951 (93)
/u:/	younger	F1	293 (54)	277 (47)	287 (59)
	younger	F2	1073 (239)	997 (154)	927 (166)
	older	F1	271 (44)	289 (37)	304 (49)
	older	F2	885 (130)	887 (104)	883 (142)
	PD	F1	297 (28)	291 (18)	296 (35)
	PD	F2	950 (89)	915 (93)	974 (122)

References

Amerman, James D., and Martha M. Parnell. 1992. Speech Timing Strategies in Elderly Adults. Journal of Phonetics 20: 65–76. [Google Scholar] [CrossRef]
Arevalo-Rodriguez, Ingrid, Nadja Smailagic, Marta Roqué i Figuls, Agustín Ciapponi, Erick Sanchez-Perez, Antri Giannakou, Olga L. Pedraza, Xavier Bonfill Cosp, and Sarah Cullum. 2015. Mini-Mental State Examination (MMSE) for the Detection of Alzheimer’s Disease and Other Dementias in People with Mild Cognitive Impairment (MCI). Edited by Cochrane Dementia and Cognitive Improvement Group. Cochrane Database of Systematic Reviews 3: CD010783. [Google Scholar] [CrossRef]
Baumann, Stefan, Johannes Becker, Martine Grice, and Doris Mücke. 2007. Tonal and Articulatory Marking of Focus in German. Paper presented at the 16th International Congress of Phonetic Sciences, Saarbrücken, Germany, August 6–10; Edited by Jürgen Trouvain and William J. Barry. Saarbrücken: Pirrot GmbH Dudweiler, pp. 1029–32. [Google Scholar]
Baumann, Stefan, and Christine T. Röhr. 2015. The Perceptual Prominence of Pitch Accent Types in German. Paper presented at the 18th International Congress of Phonetic Sciences, Glasgow, UK, August 10–14; Glasgow: The Scottish Consortium for ICPhS 2015, vol. 298, pp. 1–5. [Google Scholar]
Beckman, Mary E., Jan Edwards, and Janet Fletcher. 1992. Prosodic Structure and Tempo in a Sonority Model of Articulatory Dynamics. In Papers in Laboratory Phonology II: Segment, Gesture, Prosody. Edited by Gerard J. Docherty and D. Robert Ladd. Cambridge: Cambridge University Press, pp. 68–86. [Google Scholar] [CrossRef]
Bourbon, Angélina, and Anne Hermes. 2021. Have a Break: Aging Effects on Sentence Production and Structuring in French. Paper presented at the 12th International Seminar on Speech Production, New Haven, CT, USA, December 14–18; pp. 102–5. [Google Scholar]
Brown, Susan H. 1996. Control of Simple Arm Movements in the Elderly. In Advances in Psychology. North-Holland: Elsevier, vol. 114, pp. 27–52. [Google Scholar] [CrossRef]
Brunner, Jana, Phil Hoole, Pascal Perrier, and Susanne Fuchs. 2006. Temporal Development of Compensation Strategies for Perturbed Palate Shape in German/Sch/-Production. Paper presented at the 7th International Seminar on Speech Production, Ubatuba, Brazil, December 13–15; pp. 247–54. [Google Scholar]
Cho, Taehong. 2005. Prosodic Strengthening and Featural Enhancement: Evidence from Acoustic and Articulatory Realizations of /ɑ,i/ in English. The Journal of the Acoustical Society of America 117: 3867–78. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Cho, Taehong. 2006. Manifestation of Prosodic Structure in Articulatory Variation: Evidence from Lip Kinematics in English. Laboratory Phonology 8: 519–48. [Google Scholar] [CrossRef]
Cho, Taehong, and James M. McQueen. 2005. Prosodic Influences on Consonant Production in Dutch: Effects of Prosodic Boundaries, Phrasal Accent and Lexical Stress. Journal of Phonetics 33: 121–57. [Google Scholar] [CrossRef]
D’Alessandro, Daria, Angélina Bourbon, and Cecile Fougeron. 2020. Effect of Age on Rate and Coarticulation across Different Speech-Tasks. Paper presented at 12th International Seminar on Speech Production, virtual, December 14–18. [Google Scholar]
De Jong, Kenneth J. 1995. The Supraglottal Articulation of Prominence in English: Linguistic Stress as Localized Hyperarticulation. The Journal of the Acoustical Society of America 97: 491–504. [Google Scholar] [CrossRef] [PubMed]
De Lau, Lonneke M. L., and Monique M. B. Breteler. 2006. Epidemiology of Parkinson’s Disease. The Lancet Neurology 5: 525–35. [Google Scholar] [CrossRef]
Duffy, Joseph R. 2019. Motor Speech Disorders: Substrates, Differential Diagnosis, and Management, 4th ed. Edinburgh: Elsevier. [Google Scholar]
Ferreira, Fernanda, and Benjamin Swets. 2002. How Incremental Is Language Production? Evidence from the Production of Utterances Requiring the Computation of Arithmetic Sums. Journal of Memory and Language 46: 57–84. [Google Scholar] [CrossRef] [Green Version]
Fivela, Barbara Gili, Massimiliano Mario Iraci, Vincenzo Sallustio, Mirko Grimaldi, Claudio Zmarich, and Danilo Patrocinio. 2014. Italian Vowel and Consonant (Co)Articulation in Parkinson’s Disease. Paper presented at the 10th International Seminar on Speech Production, Cologne, Germany, May 5–8; pp. 146–9. [Google Scholar]
Folk, Laura, and Florian Schiel. 2011. The Lombard Effect in Spontaneous Dialog Speech. Paper presented at the Interspeech, Florence, Italy, August 27–31; pp. 2701–4. [Google Scholar]
Forrest, Karen, Gary Weismer, and Greg S. Turner. 1989. Kinematic, Acoustic, and Perceptual Analyses of Connected Speech Produced by Parkinsonian and Normal Geriatric Adults. The Journal of the Acoustical Society of America 85: 2608–22. [Google Scholar] [CrossRef]
Fougeron, Cécile, and Patricia A. Keating. 1997. Articulatory Strengthening at Edges of Prosodic Domains. The Journal of the Acoustical Society of America 101: 3728–40. [Google Scholar] [CrossRef] [Green Version]
Goetz, Christopher G., Barbara C. Tilley, Stephanie R. Shaftman, Glenn T. Stebbins, Stanley Fahn, Pablo Martinez-Martin, Werner Poewe, Cristina Sampaio, Matthew B. Stern, Richard Dodel, and et al. 2008. Movement Disorder Society-Sponsored Revision of the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS): Scale Presentation and Clinimetric Testing Results. Movement Disorders: Official Journal of the Movement Disorder Society 23: 2129–70. [Google Scholar] [CrossRef]
Grice, Martine, Stefan Baumann, and Ralf Benzmüller. 2005. German Intonation in Autosegmental-Metrical Phonology. In Prosodic Typology: The Phonology of Intonation and Phrasing. Edited by Sun-Ah Jun. Oxford: Oxford University Press, pp. 55–83. [Google Scholar] [CrossRef]
Harrington, Jonathan, Janet Fletcher, and Mary E. Beckman. 2000. Manner and Place Conflicts in the Articulation of Accent in Australian English. In Papers in Laboratory Phonology. Edited by Michael Broe. Cambridge: Cambridge University Press, vol. 5, pp. 40–55. [Google Scholar]
Hermes, Anne, Johannes Becker, Doris Mücke, Stefan Baumann, and Martine Grice. 2008. Articulatory Gestures and Focus Marking in German. Paper presented at the 4th Conference on Speech Prosody, Campinas, Brazil, May 6–9; pp. 457–60. [Google Scholar]
Hermes, Anne, Jane Mertens, and Doris Mücke. 2018. Age-Related Effects on Sensorimotor Control of Speech Production. Paper presented at the Interspeech 2018, Hyderabad, India, September 2–6; pp. 1526–1233. [Google Scholar] [CrossRef] [Green Version]
Kalia, Lorraine V., and Anthony E. Lang. 2015. Parkinson’s Disease. The Lancet 386: 896–912. [Google Scholar] [CrossRef]
Karlsson, Fredrik, and Lena Hartelius. 2021. On the Primary Influences of Age on Articulation and Phonation in Maximum Performance Tasks. Languages 6: 174. [Google Scholar] [CrossRef]
Ketcham, Caroline J., and George E. Stelmach. 2004. Movement Control in the Older Adult. In Technology for Adaptive Aging. Edited by Richard W. Pew, Susan B. Van Hemel and National Research Council (U.S.). Washington, DC: National Academies Press, pp. 64–92. [Google Scholar]
Kornatz, Kurt W., Brach Poston, and George E. Stelmach. 2021. Age and Not the Preferred Limb Influences the Kinematic Structure of Pointing Movements. Journal of Functional Morphology and Kinesiology 6: 100. [Google Scholar] [CrossRef]
Krivokapić, Jelena, Mark K. Tiede, and Martha E. Tyrone. 2017. A Kinematic Study of Prosodic Structure in Articulatory and Manual Gestures: Results from a Novel Method of Data Collection. Laboratory Phonology 8: 3. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Kügler, Frank. 2008. The Role of Duration as a Phonetic Correlate of Focus. Paper presented at the Speech Prosody 2008 Conference, Campinas, Brazil, May 6–9; pp. 592–94. [Google Scholar]
Ladd, D. Robert. 2012. Intonational Phonology, 2nd ed. Cambridge: Cambridge University Press. [Google Scholar] [CrossRef]
Lambrecht, Knud. 1994. Information Structure and Sentence Form: Topic, Focus, and the Mental Representations of Discourse Referents. Cambridge Studies in Linguistics. Cambridge: Cambridge University Press. [Google Scholar] [CrossRef]
Levelt, Willem J. M., and Antje S. Meyer. 2000. Word for Word: Multiple Lexical Access in Speech Production. European Journal of Cognitive Psychology 12: 433–52. [Google Scholar] [CrossRef] [Green Version]
Lindblom, Björn. 1990. Explaining Phonetic Variation: A Sketch of the H&H Theory. In Speech Production and Speech Modelling. Edited by William J. Hardcastle and Alain Marchal. Dordrecht: Springer, pp. 403–39. [Google Scholar] [CrossRef]
Martínez-Martín, Pablo, Carmen Rodríguez-Blázquez, Mario Alvarez, Tomoko Arakaki, Víctor Campos Arillo, Pedro Chaná, William Fernández, Nélida Garrettoc, Juan Carlos Martínez-Castrillog, Mayela Rodríguez-Violante, and et al. 2015. Parkinson’s Disease Severity Levels and MDS-Unified Parkinson’s Disease Rating Scale. Parkinsonism & Related Disorders 21: 50–4. [Google Scholar] [CrossRef] [Green Version]
Mücke, Doris. 2017. Dynamische Modellierung von Artikulation Und Prosodischer Struktur: Eine Einführung in Die Artikulatorische Phonologie. Studies in Laboratory Phonology 4. Berlin: Language Science Press, https://langsci-press.org/catalog/book/154.
Mücke, Doris, and Martine Grice. 2014. The Effect of Focus Marking on Supralaryngeal Articulation—Is It Mediated by Accentuation? Journal of Phonetics 44: 47–61. [Google Scholar] [CrossRef]
Mücke, Doris, Anne Hermes, Timo B. Roettger, Johannes Becker, Henrik Niemann, Till A. Dembek, Lars Timmermann, Veerle Visser-Vandewalle, Gereon R. Fink, Martine Grice, and et al. 2018. The Effects of Thalamic Deep Brain Stimulation on Speech Dynamics in Patients with Essential Tremor: An Articulographic Study. PLoS ONE 13: e0191359. [Google Scholar] [CrossRef] [Green Version]
Mücke, Doris, Tabea Thies, Jane Mertens, and Anne Hermes. 2021. Age-Related Effects of Prosodic Prominence in Vowel Articulation. Paper presented at the 12th International Seminar on Speech Production, New Haven, CT, USA, December 14–18; pp. 126–9. [Google Scholar]
Öhman, Sven E. G. 1966. Coarticulation in VCV Utterances: Spectrographic Measurements. The Journal of the Acoustical Society of America 39: 151–68. [Google Scholar] [CrossRef] [PubMed]
Pagel, Lena, Simon Roessig, and Doris Mücke. 2021. Modifications of Tongue Body Kinematics as a Focus Marking Strategy in German. Paper presented at the 12th International Seminar on Speech Production, New Haven, CT, USA, December 14–18; pp. 21–4. [Google Scholar]
Pell, Marc D., Henry S. Cheang, and Carol L. Leonard. 2006. The Impact of Parkinson’s Disease on Vocal-Prosodic Communication from the Perspective of Listeners. Brain and Language 97: 123–34. [Google Scholar] [CrossRef]
Perrier, Pascal, and Susanne Fuchs. 2015. Motor Equivalence in Speech Production. In The Handbook of Speech Production. Edited by Melissa A. Redford. Hoboken: John Wiley & Sons, Inc., pp. 223–47. [Google Scholar] [CrossRef]
Roessig, Simon. 2021. Categoriality and Continuity in Prosodic Prominence. Studies in Laboratory Phonology 10. Berlin: Language Science Press. [Google Scholar]
Roessig, Simon, and Doris Mücke. 2019. Modeling Dimensions of Prosodic Prominence. Frontiers in Communication 4: 44. [Google Scholar] [CrossRef]
Sapir, Shimon, Lorraine O. Ramig, Jennifer L. Spielman, and Cynthia Fox. 2010. Formant Centralization Ratio: A Proposal for a New Acoustic Measure of Dysarthric Speech. Journal of Speech, Language, and Hearing Research 53: 114–25. [Google Scholar] [CrossRef]
Seidler, Rachael D., Jay L. Alberts, and George E. Stelmach. 2002. Changes in Multi-Joint Performance with Age. Motor Control 6: 19–31. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Skodda, Sabine, Wenke Visser, and Uwe Schlegel. 2011. Vowel Articulation in Parkinson’s Disease. Journal of Voice 25: 467–72. [Google Scholar] [CrossRef] [PubMed]
Skodda, Sabine, Wenke Grönheit, and Uwe Schlegel. 2012. Impairment of Vowel Articulation as a Possible Marker of Disease Progression in Parkinson’s Disease. PLoS ONE 7: e32132. [Google Scholar] [CrossRef]
Sluijter, Agaath M.C., and Vincent J. van Heuven. 1995. Effects of Focus Distribution, Pitch Accent and Lexical Stress on the Temporal Organization of Syllables in Dutch. Phonetica 52: 71–89. [Google Scholar] [CrossRef] [Green Version]
Smith, Kara M., and David N. Caplan. 2018. Communication Impairment in Parkinson’s Disease: Impact of Motor and Cognitive Symptoms on Speech and Language. Brain and Language 185: 38–46. [Google Scholar] [CrossRef]
Thies, Tabea, Doris Mücke, Anja Lowit, Elke Kalbe, Julia Steffen, and Michael T. Barbe. 2020. Prominence Marking in Parkinsonian Speech and Its Correlation with Motor Performance and Cognitive Abilities. Neuropsychologia 137: 107306. [Google Scholar] [CrossRef] [PubMed]
Thies, Tabea, Doris Mücke, Richard Dano, and Michael T. Barbe. 2021. Levodopa-Based Changes on Vocalic Speech Movements during Prosodic Prominence Marking. Brain Sciences 11: 594. [Google Scholar] [CrossRef]
Winkelmann, Raphael, Jonathan Harrington, and Klaus Jänsch. 2017. EMU-SDMS: Advanced Speech Database Management and Analysis in R. Computer Speech & Language 45: 392–410. [Google Scholar] [CrossRef]

Figure 1. Game-like scenario to elicit target words in different focus conditions.

Figure 2. Schematized tongue body movement with relevant landmarks (top) and acoustic segment durations within the stressed CV syllable (below).

Figure 3. Results of measurements within the temporal domain for (a) acoustic V1 duration in ms, (b) vocalic V1 movement duration in ms and (c) symmetry ratio for vocalic movement.

Figure 4. Acoustic vowel space area dependent on the vowel formants F1 and F2 for /i:, e:, a:, o:, u:/, separately for the focus conditions (background, broad focus, contrastive focus). Each focus condition includes the respective plots for the speaker groups (grey = younger speakers, blue = older speakers, red = speakers with PD).

Figure 5. Vertical and horizontal tongue body position, for the front vowels /i:, e:/ and the low vowel /a:/, separately for each speaker group. Focus condition is indicated by color (grey = background, yellow = broad focus, red = contrastive focus).

Figure 6. Vertical and horizontal tongue body position, for the back vowels /u:, o:/, separately for each speaker group. Focus condition is indicated by color (grey = background, yellow = broad focus, red = contrastive focus).

Table 1. Age of participants and clinical characteristics of patients with PD.

	Younger Speakers	Older Speakers	Speakers with PD
	Age (Years)			Disease Duration (Years)	UPDRS III Score
Female 01	21	70	69	13	41
Female 02	28	78	70	20	27
Male 01	25	76	68	12	36
Male 02	26	79	79	2	31
mean (sd)	25 (±3)	75.75 (±4)	71.50 (±5)	11.75 (±7)	33.75 (±6)

Table 2. Target words of the corpus.

Article Preceding Target Word	Target Words
Article Preceding Target Word	Labial C1	Alveolar C1
<der> deɐ̯	mi:.la	li:.na
	mu:.la	lu:.na
	me:.la	le:.na
<die> di:	ma:.li	la:.ni
<die> di:	mo:.li	lo:.ni

Table 3. Coordination pattern (mean and sd) of the vocalic tongue body movement (onset and target) relative to the acoustic syllable.

Coordination Pattern	Condition	Background	Broad	Contrastive
Onset V1 to start of acoustic syllable (%)	younger	−21 (12)	−19 (12)	−16 (11)
	older	−14 (10)	−11 (8)	−10 (10)
	PD	−24 (15)	−26 (19)	−22 (16)
Target V1 to start of acoustic syllable (%)	younger	66 (7)	69 (8)	67 (9)
	older	67 (10)	67 (12)	66 (10)
	PD	64 (12)	67 (10)	64 (10)

Table 4. Vowel Articulation Index per speaker group and focus condition.

	Vowel Articulation Index
	Background	Broad	Contrastive
younger	0.96 (0.11)	1.05 (0.11)	1.07 (0.08)
older	0.95 (0.08)	0.99 (0.08)	1.01 (0.06)
PD	0.93 (0.12)	0.94 (0.13)	0.96 (0.13)

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Thies, T.; Hermes, A.; Mücke, D. Compensation in Time and Space: Prominence Marking in Aging and Disease. Languages 2022, 7, 21. https://doi.org/10.3390/languages7010021

AMA Style

Thies T, Hermes A, Mücke D. Compensation in Time and Space: Prominence Marking in Aging and Disease. Languages. 2022; 7(1):21. https://doi.org/10.3390/languages7010021

Chicago/Turabian Style

Thies, Tabea, Anne Hermes, and Doris Mücke. 2022. "Compensation in Time and Space: Prominence Marking in Aging and Disease" Languages 7, no. 1: 21. https://doi.org/10.3390/languages7010021

APA Style

Thies, T., Hermes, A., & Mücke, D. (2022). Compensation in Time and Space: Prominence Marking in Aging and Disease. Languages, 7(1), 21. https://doi.org/10.3390/languages7010021

Article Menu

Compensation in Time and Space: Prominence Marking in Aging and Disease

Abstract

1. Introduction

1.1. Prominence Marking and Information Structure

1.2. Aging and Speech

1.3. Parkinson’s Disease and Speech

1.4. Compensation Strategies in Speech

2. Method

2.1. Participants

2.2. Speech Material and Recordings

2.3. Data Processing and Measurements

3. Results

3.1. Temporal Domain

3.2. Spatial Domain

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

Appendix B

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI