Next Article in Journal
Linguistic Diversity in German Youth Media—The Use of English in Professionally Produced Instagram Memes and Reels
Previous Article in Journal
Inter-Generational Language Socialization Practices of German-Speaking Migrants in the North of Finland
Previous Article in Special Issue
Water and Sheep: The Pronunciation and Geographical Distribution of Two Germanic Vowels in the Dialects Around the Former Zuiderzee Area
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Two-Verb Clusters in Mennonite Low German: The Impact of Auxiliary Verb and Clause Type

Department of German Linguistics, University of Freiburg, Platz der Universität 3, 79085 Freiburg, Germany
Languages 2025, 10(5), 95; https://doi.org/10.3390/languages10050095
Submission received: 2 August 2024 / Revised: 2 January 2025 / Accepted: 10 March 2025 / Published: 29 April 2025
(This article belongs to the Special Issue Dialectal Dynamics)

Abstract

:
Although verb clusters in Continental West Germanic varieties are a well-researched topic, their derivation and the possible functions of their variants are still not yet fully understood. Both issues are discussed in the present paper, which is based on the translations of 46 English, Spanish, or Portuguese stimulus sentences by 321 North and South American speakers of Mennonite Low German. In order to analyze the variation in clause-final two-verb clusters, we focus on three structural factors, namely (i) the auxiliary verb, (ii) the structural link between the auxiliary verb and the main verb, and (iii) the type of the subordinate clause in which the cluster occurs. Regarding the first and the second factor, we will employ the cartographic approach in order to explain the impact of different auxiliary verbs. Regarding the third factor, it is somewhat surprising that the potential effect of the subordinate clause on the distribution of different cluster variants has received little attention in the research literature. Clause type will be shown to have such an effect and, therefore, we will assume that the speakers of MLG use different variants deliberately to indicate different degrees of clausal integration.

1. Introduction

An important aim of this paper is to relate the theoretical insights of the cartographic approach to the variation in clause-final two-verb clusters found in real production data. In doing so, we follow a desideratum mentioned by Boeckx (2011, p. 205): “There is indeed very little substantive discussion of the issue of linguistic variation in the context of the Minimalist Program”. In addition to the frequent lack of a substantive discussion, the equally frequent lack of a reliable and sufficiently large empirical base in studies within the generative framework is problematic (cf., e.g., Adli et al., 2015, pp. 12–17 and the literature cited therein). Rizzi (in Quarezemin, 2020, p. 6) mentions the potential risks of this state of affairs: “[I]n the absence of such a constant enrichment and exchange with the empirical dimension, the theoretical work risks to become sterile, and not raise enough interest in the scientific community”.
We will attempt to raise this interest by studying the linguistic behavior of 321 Mennonite informants from six communities in the Americas. Between 1999 and 2002, each of them was asked to translate 46 English, Spanish, or Portuguese stimulus sentences into Mennonite Low German (henceforth MLG). The resulting dataset, which is available online (cf. Kaufmann, 2018), contains approximately 14,500 tokens.1 About 4500 of these tokens contain clause-final two-verb clusters in subordinate clauses. Due to this robust number and due to the comparability of the stimulus sentences, advanced statistical analyses can be applied (cf. Section 4). In most empirical aspects, the present paper draws on Kaufmann’s habilitation thesis (cf. Kaufmann, 2016); however, the application of the cartographic approach is novel, not only in comparison to Kaufmann (2016) but also in comparison to the discussion of verb clusters in general (cf. Kaufmann, 2023, pp. 377–385 for a first attempt). The cartographic approach has been chosen because it combines the analysis of different structural positions within the split IP domain (in our case embodied by the different auxiliary verbs) with different serialization patterns (in our case embodied by the different cluster variants). Wiltschko (2014, p. 11) calls this the “[u]niversality of linearization and categorization”.
The paper is organized as follows: while Section 2.1 provides a historical sketch of the Mennonite communities in the Americas, Section 2.2 presents a brief introduction to the method of data elicitation. Section 3.1 then discusses the role of verb projection raising (henceforth v-p-r) and scrambling in the derivation of clause-final two-verb clusters, whereupon Section 3.2 introduces the three structural factors we consider crucial for the variation between the different cluster variants. In this section, we also discuss the use of the cartographic approach and present four crucial hypotheses. Section 4 then statistically analyzes the relevant tokens in order to verify these hypotheses. Section 5 provides a brief conclusion.

2. Mennonites in the Americas

2.1. A Brief Historical Sketch

The origins of the Mennonites in the Americas can be traced back to Eastern Holland, Friesland, Flanders, and what is now Northwestern Germany. Anabaptist communities formed in these regions during the Reformation. Because of religious persecution, many of these Anabaptists emigrated to West and East Prussia in the 16th century. There, a koiné was formed from the Continental West Germanic varieties the Mennonites had brought with them and the local varieties of Low German. In addition, the Mennonites began to use Standard German (henceforth StG) instead of Dutch for official purposes such as worship and education. When the Prussian government imposed stricter rules in the 18th century, many Mennonites accepted an invitation by Catherine II to settle in the recently conquered territories of “New Russia” (today Ukraine!). The first settlers arrived in 1789 and founded the colony Chortitza. In 1803, a second colony was founded and named Molotchna. Due to the different periods of emigration, the slightly different geographic origins, and the different social backgrounds of the emigrants, two MLG varieties developed, one in Chortitza and one in Molotchna, with the Molotchna variety enjoying more prestige (cf. Siemens, 2012, pp. 30–70 for a detailed discussion).
At the end of the 19th century, Russian officials introduced laws to ensure a certain degree of integration, prompting the more conservative Mennonites from Chortitza in particular to emigrate to Canada around 1870. During and after World War I, the situation of German-speaking immigrants also became difficult in Canada. Once again, the more conservative members did not yield to outside pressure and moved to either Mexico, where most of them settled in the state of Chihuahua (the region of Ciudad Cuauhtémoc; today approximately 40,000 people with a Mennonite background; 103 Mennonites took part in the translation task), or to Paraguay, where they set up the colony of Menno (9000 people; 44 informants). Mennonites from Mexico founded various daughter colonies in Belize and one in Seminole, Texas (4000 people; 73 informants). Mennonites from both Mexico and Menno also founded several daughter colonies in the region of Santa Cruz de la Sierra in Bolivia (50,000 people; eight informants with a Menno background).
The Mennonites who stayed in Russia in 1870 modernized their school system and placed more emphasis on StG. Due to their economic success, these Mennonites faced severe persecution when Stalin gained power in 1927. Because of these unfavorable prospects, many Mennonites, most of them from Molotchna and its daughter colonies, attempted to leave the Soviet Union and, in 1930, some succeeded in emigrating to Canada, Paraguay (colony of Fernheim; 4000 people; 37 informants), and Brazil (Colônia Nova in Rio Grande do Sul; 1000 people; 56 informants).
These different migration paths have resulted in different language repertoires. In addition to MLG and StG, the repertoires normally include the majority language of each community’s home country and, quite often, English as an international language. The heritage language MLG is still the unrivaled in-group language in Mexico, Bolivia, and the Paraguayan colony of Menno. It is weakest in the United States and Brazil, the two communities where competence in the majority language, English and Portuguese, respectively, is strongest. Aside from this, there are some families in the Paraguayan colony of Fernheim that use StG instead of MLG at home. Today, both Paraguayan communities, not only the Molotchna-based one in Fernheim, benefit from their modern school system, in which StG is not only taught as a subject but is also used as a language of instruction. Thus, in Paraguay, and to a lesser extent in Brazil, StG can still be seen as a roofing variety (Dachsprache) that exerts a certain influence on MLG. Admittedly, most Mennonite schools in the Chortitza-based communities in the USA, Mexico, and Bolivia also use and teach StG, but their StG must be considered a mere hagiolect that hardly influences the respective MLG varieties.
Of the 321 informants, 192 claim MLG to be their dominant language (59.8%). Another 29 informants indicate having comparable knowledge of MLG and one of the majority languages (9%; 10 in Brazil; 10 in the USA), and twelve do so with respect to MLG and StG (3.7%; 9 in Paraguay). One Paraguayan informant indicates having comparable knowledge of MLG, StG, and Spanish. In contrast, seventy informants claim one of the majority languages to be their dominant language (21.8%; 39 in the USA; 19 in Brazil); 17 attribute this status to StG (5.4%; 16 in Paraguay). Importantly, the 87 informants with a dominant language other than MLG still speak their heritage language (very) well. On average, they reach 9.9 out of 14 possible points for their competence in MLG.2

2.2. Data Elicitation

With a few exceptions, the sentences were presented in the respective majority language. Only 19 of the 184 Mexican and Paraguayan informants preferred English over Spanish, mostly because they had spent some time in Mennonite communities in Canada and/or the United States. Importantly, the different stimulus languages had no measurable effect on the distribution of the cluster variants. In any case, the three languages do not differ too much with regard to their verb syntax. They are all SVO, and neither of them shows a marked difference between root and subordinate clauses. Moreover, the fact that English, Spanish, or Portuguese were used as stimulus languages rather than StG not only allowed more Mennonites to participate but also greatly reduced possible priming effects. The stimulus sentences were read to the informants one by one, and the informants translated them immediately, without the help of a written version. Crucially, 16 of the stimulus sentences were modeled to elicit clause-final clusters with one finite and one non-finite verb (either a modal verb with a bare infinitive or a temporal auxiliary with a past participle). These sentences represent four sentence compounds with complement clauses, four with causal clauses, four with conditional clauses, and four with relative clauses (cf. Kaufmann, 2016, pp. 19–42 for detailed information on all stimulus sentences). Apart from this, the translations of many stimulus sentences that aimed at eliciting subordinate clauses with one verb actually produced two verbs due to the appearance of either the optional irrealis/aspectual auxiliary dune (‘do’; cf., e.g., (8a) and (19), respectively) or the inchoative auxiliary woare(n) (‘will’), an optional marker for future tense (cf., e.g., (8b)). Just like modal verbs, these auxiliaries govern a bare infinitive.

3. Verb Clusters in Mennonite Low German

3.1. The Role of Verb Projection Raising and Scrambling

Due to constraints of space, we will not be able to offer exhaustive discussions for our basic assumptions (cf. Kaufmann, 2016, pp. 43–63 for this). However, we will illustrate these assumptions by means of the translations in (1a–c).
stimulus <15>Portuguese: Se ele tiver que vender a casa agora, ele vai ficar muito triste.
Spanish: Si tiene que vender la casa ahora, se va a poner muy triste.
English: If he has to sell the house now, he will be very sorry.
(1)a.wanndeidatHüsverköpemut
ifhenow-ADV[thehouse]-ObjDPsell-V2must-V1
[0.5]dannwirddeisehrtrürig3
[0.5]thenturnsheverysad
(Bra-59; f/56/MLG)4
b.wannheimutdatHüsverköpe
ifhenow-ADVmust-V1[thehouse]-ObjDPsell-V2
wirdhei-ihmdatsehrleidsene
willhehimthatverysorrybe
(Men-43; m/27/MLG)
c.wannheidatHüsmutverköpe
ifhenow-Adv[thehouse]-ObjDPmust-V1sell-V2
dannwirdheisehrtrürigsene
thenwillheverysadbe
(Bra-60; f/40/MLG)
The three translations contain the modal verb mute(n) (“must”) and the main verb verköpe(n)5 (“sell”). In (1a), the ObjDP dat Hüs (“the house”) precedes the main verb verköpe, labeled V2, which in turn precedes the finite modal verb mut, labeled V1. Thus, this construction shows a strictly left-branching sequence ObjDP-V2-V1, i.e., V1 governs V2 to the left and V2 governs the ObjDP to the left. We will call this sequence the NR variant (non-raising variant) because we assume that the V2P (dat Hüs verköpe) remains in its base position. The tree in (2), which only displays the relevant phrases in the IP and the VP domain, presents the structure of (1a) (phonetically realized elements appear in bold).
(2) Structural representation of the NR variant in (1a).
Languages 10 00095 i001
The tree demonstrates our conviction that the VP domain and the IP domain of MLG are head-final. The question of whether the modal verb is base-generated in ModPobligation by external merge or has moved there by feature-driven internal merge from a base-generated position in a V1P is irrelevant to our discussion.6 The inchoative phrase with the adverb 7 in its specifier position is a rather low functional phrase, even lower than ModPobligation (cf. the scheme in (7)), so the ObjDP dat Hüs (“the house”) in (1a) is either realized in its base position (as the sister of verköpe “sell”, as illustrated) or it has been scrambled string-vacuously to a very low topic phrase by internal merge (not present in (2); cf. Belletti, 2004, p. 17 for IP internal focus and topic positions). When the ObjDP dat Hüs is scrambled to a higher topic position (visualized in brackets in (2)), it is serialized to the left of the adverb,8 and this sequence is actually more frequent (cf. Table 1).
The cluster variants in (1b+c) take (2) as their base. For (1b), we assume v-p-r of the V2P dat Hüs verköpe (“sell the house”). This rightward movement leads to the partially right-branching sequence V1-ObjDP-V2. The tree in (3) demonstrates this variant, which we will call the VPR variant (verb projection raising variant). This is its traditional name, as, just as we do, many researchers have taken (and some still take) it for granted that the entire V2P is raised (cf., e.g., Vanden Wyngaerd, 1989, p. 436; Broekhuis & den Besten, 1989; Penner, 1990; Wurmbrand, 2001, pp. 120–121).
(3) Structural representation of the VPR variant in (1b).
Languages 10 00095 i002
As before, phonetically realized copies are shown in bold. Unrealized copies are crossed out. In order to obtain the correct superficial sequence, V2P has to be adjoined to a phrase higher up than mut in ModPobligation. We have labeled this phrase XP. Admittedly, by considering adjunction as a derivational option, we create multiple X’ levels, which may be considered problematic in the current generative framework (cf., e.g., Rizzi, 2013, p. 198). However, as v-p-r seems to be, at least partly, a parsing-driven mechanism (cf., e.g., Bach et al., 1987; Haider, 2010, pp. 338–339) which does not affect the semantics of the clause (cf. Wurmbrand, 2017, p. 27), we do not assume it to take place in the narrow syntax but shortly before or after spell-out and, in that sphere, derivational rules may work differently.
Importantly, the preverbal position of the adverb makes it clear that (1b) is not a verb-second clause, i.e., the finite verb is in the IP domain; it has not moved to the CP domain as in an unintroduced root clause. Many other translations, however, superficially serialize the finite auxiliary in the second position, i.e., after the introductory element and the SubjDP (cf., e.g., (6) and (8b)). Although we are sure that this position is only superficially and not structurally verb-second, we will see in Section 4.4 that these apparent verb-second clauses may serve as an indicator of a low degree of clausal integration.
The cluster sequence ObjDP-V1-V2 in (1c) is traditionally called the VR variant (verb raising variant). This label stems from the conviction that on the surface—and in our opinion, only on the surface—the main verb verköpe (“sell”) seems to have moved to the right of mut on its own. However, just like Broekhuis and den Besten, (1989) and Penner, (1990)9, we assume that (1c) is a case of remnant movement, i.e., as in (1b), the entire V2P raises, but unlike in (1b), the ObjDP dat Hüs (“the house”) has been scrambled out of V2P before v-p-r. Wurmbrand’s (2017, p. 78) analysis with respect to the definiteness and the type of internal complement in clauses with the V(P)R variants supports this view:
[…] the final two categories that have to be distinguished in VPR constructions are, on the one hand, indefinite (weak) noun phrases and prepositional phrase [sic], and definite (strong) noun phrases on the other hand. While the phrases of the former type are allowed to occur within a verb cluster in Swiss German, West Flemish, and (at least for some speakers) in Afrikaans and German, the latter are only allowed to separate the verbs of a cluster in Swiss German and West Flemish.
Faced with this distribution, one can conclude that “definite (strong) noun phrases” are scrambled more often than “indefinite (weak) noun phrases” since they can appear within a verb cluster in fewer Continental West Germanic varieties. A similar relationship between the indefiniteness of a phrase and its reluctance to move is found in Broekhuis (2007, p. 121), who states for Dutch that “non-specific, indefinite noun phrases never shift, which is due to the fact that they are necessarily part of the focus of the clause”. In more technical terms, Biskup (2007, p. 122) writes: “In Biskup (2006a, 2006b) I argue that scrambling gives rise to specificity, which can be partitive, epistemic or generic, and that scrambling is driven by a Specificity-feature”. If there are indeed features involved, then scrambling must be considered part of the narrow syntax, a view shared by Roberts (2010, p. 48) with regard to object shift: “If we follow Chomsky (2001, p. 15) in assuming that ‘surface semantic effects are restricted to narrow syntax’, and take the specificity effect associated with clitic movement and object shift to be such a surface semantic effect, then this movement must take place in the narrow syntax”.
In Section 4.4, we will demonstrate that both ObjPPs and the (in)definiteness of the ObjDP/PP have an influence on the raised variants in MLG (cf. also Kaufmann, 2016, pp. 97–100, 202–209 for additional signs of scrambling such as cluster-internal floating quantifiers and cluster-internal stranded prepositions). Having said this, the tree in (4) illustrates the VR variant in (1c), in which the ObjDP dat Hüs has been scrambled out of V2P and is realized in a low topical phrase (TopicP I). This low position causes its serialization to the right of (“now”). Again, the more frequent serialization is dat Hüs nü. To account for this sequence, we have to assume a phrase such as TopicP II (with the ObjDP in brackets) to be located above the inchoative phrase with .
(4) Structural representation of the VR variant in (1c).
Languages 10 00095 i003
In our view, the best way to describe the different cluster variants in MLG is to assume that they are but a superficial epiphenomenon of two unrelated syntactic mechanisms, namely v-p-r and scrambling. Assuming that these phenomena are independent of each other may help explain the speaker’s ability to use them for functional purposes. For example, by applying v-p-r but avoiding scrambling, a speaker can create a subordinate clause that closely resembles a root clause (cf. Section 4.4). In this way, the concentration of the V(P)R variants and, in particular, of the VPR variant in strongly disintegrated subordinate clauses may help answer a question posed by Wurmbrand (2017, p. 93): “But what still appears to be an open question is the question of why the elements of a verb cluster are inverted in certain languages and constructions”.

3.2. Structural Factors in the Variation of Two-Verb Clusters

We will now introduce the structural factors that we believe account for much of the variation in clause-final two-verb clusters. These factors are (i) the auxiliary verb, (ii) the structural link between the auxiliary verb and the main verb, and (iii) the type of subordinate clause in which the cluster occurs.

3.2.1. Auxiliary Verbs and Adverbs in the Cartographic Approach

Many researchers have pointed out that the auxiliary verb is fundamental in explaining the different serialization patterns in verb clusters (cf., e.g., Zwart, 1996, p. 233; Lötscher, 1978, p. 10; Barbiers, 2005, pp. 248–255). The innovative aspect of our approach is that we relate this variation to Rizzi and Cinque’s (2016) functional hierarchy in the split IP domain (cf. also Cinque, 1999, 2004; Shlonsky, 2010). Cinque’s (1999) seminal work analyzed the serialization patterns of different types of adverbs in Italian and other languages and claimed that each adverb is located in the specifier position of a functional phrase and that these phrases are localized in a hierarchical order in the IP domain. Rizzi (2004, p. 4) describes this domain (and the split CP domain) as follows:
If overt morphological richness is a superficial trait of variation and a fundamental assumption of uniformity is followed […] then it is reasonable to expect that clauses should be formed by a constant system of functional heads in all languages, each projecting a subtree occurring in a fixed syntactic hierarchy, irrespective of the actual morphological manifestation of the head (as an affix, as an autonomous function word, or as nothing at all).
While adverbs are supposed to occupy the specifier position of the respective phrase, function words (e.g., auxiliary verbs) occupy their head position. Let us now see if we can find supporting evidence for both these claims in the MLG dataset. With respect to adverbs, we can analyze two preposed conditional clauses. While stimulus sentence <15> with an inchoative (“now”) has already been introduced in (1a-c), stimulus sentence <17> features the evidential adverb wirklich (“really”). Two serialization patterns can be seen in (5a+b):
stimulus <17>Portuguese: Se ele realmente matou o homem, ninguém pode ajudar ele.
English: If he really killed the man, nobody can help him.
(5)a.wannheiwirklichdenMenschtodgemaakthaft
ifhereally-ADV[theman]-ObjDPkilled-V2has-V1
dann[0.7]kaunkeinerihmhelpe
then[0.7]cannobodyhimhelp
(Bra-22; m/37/MLG+Port)
b.wannheidenMenschwirklichumgebrachthaft
ifhe[theman]-ObjDPreally-ADVkilled-V2has-V1
kaunihmkeinerhelpe
canhimnobodyhelp
(Bra-20; f/50/MLG)
Unlike with the different positions of in (1a–c), there may be a scopal difference between the different positions of wirklich. However, the differences between the evidential adverb in (5a) (Zifonun et al., 1997, p. 1126 call wirklich in StG an assertive sentence adverb) and the adverb in (5b), which could be read as a manner adverb, seem to be rather marginal, at least for sentence <17>.10 In any case, (6) shows that there can be little doubt about the informants’ preferred interpretation. This translation represents a total of five tokens.
stimulus <17>Spanish: Si realmente mató al hombre, nadie lo puede ayudar.
English: If he really killed the man, nobody can help him.
(6) wanndatwirklichisdatheihafdenMensch
ifitreally-ADVisthathehas-V1[theman]-ObjDP
todgemeaknad-keinerkaunihnhelpen
killed-V2nonobodycanhimhelp
‘If it is true that he killed the man, nobody can help him.’
(Mex-67; m/16/MLG)
The translation begins with wann dat wirklich is (“if it is true”), which marks the evidential status of the entire sentence compound. This conditional clause is followed by a second subordinate clause containing the proposition of the protasis of the stimulus sentence. It is beyond doubt that wirklich in (6) has scope over this proposition and must be read as an evidential adverb.
Let us now in (7) (next page) visualize the positional difference between and wirklich using the scheme in Shlonsky (2010, pp. 422–423). This scheme demonstrates the position of core functional phrases, illustrating them with prototypical adverbs. The scheme contains mostly mood, mod(al), t(ense), and asp(ect) phrases.11
(7) Illustration of two adverbs and two instances of scrambled ObjDPs in the split IP domain (based on Shlonsky’s (2010, pp. 422–423) reduced scheme).
Languages 10 00095 i004
The difference in height between Moodevidential and Aspinceptive(II),12 which we can assume to be close to an unrepresented phrase for inchoative aspect, is huge. Importantly, (7) only shows the upper and lower parts of Shlonsky’s scheme (cf. the sign […]), i.e., the distance between the two phrases is even bigger than it appears. In any case, visible scrambling of an ObjDP such as dat Hüs (“the house”) in sentence <15> can be easily detected in the case of low (“now”) (cf. ObjDP-b), while the ObjDP den Mensch (“the man”) in sentence <17> needs to scramble to a very high position in order to appear in front of wirklich (“really”) (cf. ObjDP-c). Table 1 now compares conditional clauses with a modal verb and (or vondaag “today”) in sentence <15> with conditional clauses with the temporal auxiliary han (“have”) and wirklich (or ap iernst “in seriousness”) in sentence <17>. Only tokens with the NR variant are considered.
Table 1. Distribution of the sequences adverb–ObjDP and ObjDP–adverb in the conditional clauses of sentences <15> and <17> (only tokens with the NR variant).
Table 1. Distribution of the sequences adverb–ObjDP and ObjDP–adverb in the conditional clauses of sentences <15> and <17> (only tokens with the NR variant).
sentence <15>
(vondaag)
sentence <17>
wirklich (ap iernst)
n (tokens)104138
adverb-ObjDP14123
13.5%89.1%
χ2(1, n = 242) = 138.2; p < 0.001 ***/Phi: 0.76/0 cells with less than 5 expected tokens
ObjDP-adverb9015
86.5%10.9%
The unmarked sequence for sentence <15> is ObjDP–adverb, strongly suggesting that the inchoative adverb (“now”) occupies a low position.13 Unlike this—and this difference is highly significant14—the unmarked sequence in sentence <17> is adverb–ObjDP, as in (5a), strongly suggesting that the evidential adverb wirklich (“really”) occupies a high position.
With regard to the position of the auxiliary verb, sentence <15> offers another illuminative type of variation. Five informants did not translate this sentence with the intended modal verb expressing deontic obligation (208 times mute(n) “must”; 39 times solle(n) “should”15) but with dune (“do”), as in (8a). Here, dune, a multifunctional auxiliary in MLG (cf. Postma, 2019, p. 149 for the comparable multifunctionality of the Pomerano cognate daua), could be said to be a conditional marker, or—in a somewhat loose sense (sentence <15> is not counterfactual)—an irrealis marker.16 Even more informants, 18 to be precise, translated the clause with woare(n) (“will”), as in (8b), an optional future tense marker, whose inchoative nature fits the presence of the inchoative adverb (“now”, cf. Leiss, 1992, pp. 214–215 for the fact that the characterization as inchoative fits the StG cognate werden both as a main and an auxiliary verb).17
stimulus <15>Spanish: Si tiene que vender la casa ahora, se va a poner muy triste.
English: If he has to sell the house now, he will be very sorry.
(8)a.wannheidatHüsvondaagverköpendät
ifhe[thehouse]-ObjDPtoday-ADVsell-V2does-V1
dannisdei trürig
thenbeheØsad
(Mex-107; m/38/MLG)
b.wannheiwirddatHüsfuatsverköpe
ifhewill-V1[thehouse]-ObjDPnow-ADVquicklysell-V2
datwirdihmsehrleidwere
thiswillhimverysorrybecome
(Fern-35; f/57/MLG+StG)
Thanks to this variation, we can verify the existence of a potential relationship between the structural height of different auxiliary verbs and the share of the V(P)R variants in the same stimulus sentence, meaning that we do not have to worry too much about further interfering factors. Once again, we will use Shlonsky’s (2010, pp. 422–423) reduced scheme. This time, we have added the three auxiliary verbs found in sentence <15> and four possible adjunction sites of V2P after v-p-r.
(9) Illustration of three auxiliary verbs and four instances of the raised V2P in the split IP domain (based on Shlonsky’s (2010, pp. 422–423) reduced scheme).
Languages 10 00095 i005
The phrase Moodirrealis (dune) is very high, much higher than Aspinceptive(II), a phrase we have assumed to be close to the unrepresented phrase Aspinchoative headed by woare(n) (“will”; cf. Note 12). The third relevant phrase Modobligation (mute(n)) occupies an intermediate position.18 With regard to the different positions of a raised V2P, (9) shows that in verb clusters with woare(n), the right-branching V(P)R variants can be derived by [(dat Hüs) verköpe(n)]-c/d/e. As for mute(n), visible v-p-r only occurs in cases d/e, while for dune, it only occurs in case e. With regard to case b, v-p-r will apply string-vacuously for all auxiliaries. The fact that we assume that it is possible for v-p-r to end up in so many different positions is not as problematic as it might seem. Remember that we consider v-p-r to be a case of adjunction, a type of movement that enjoys much more freedom than adverbs and auxiliary verbs in the spine of the split IP domain. Precisely to show that the sequence of adverbs cannot be a consequence of adjunction, Shlonsky (2010, p. 421) writes: “Adjunctions to maximal projections typically do not show such rigid ordering constraints […]”. In any case, judging from (9), we would expect that clusters with dune should lead to very few instances of the V(P)R variants, while clusters with woare(n) should induce very many such instances. Table 2 shows the respective shares for the visibly right-branching V(P)R variants and the NR variant. This latter variant will inevitably contain a certain number of tokens in which v-p-r has occurred string-vacuously (cf., e.g., case b in (9)).
There is only one token of the NR variant with the auxiliary woare(n) (5.6%), but four of the five tokens with dune feature the NR variant (80%). As expected, translations with mute(n), which expresses deontic obligation and is located between the phrases headed by dune and woare(n), show an intermediate share of 45.7%. Given the highly significant distribution in Table 2, our first hypothesis regarding the effect of auxiliary verbs on the serialization patterns in two-verb clusters can be formulated as follows:
(10) Hypothesis I:
The higher up in the split IP domain an auxiliary verb is located, the fewer cases of v-p-r will be visible. Thus, a high structural position should correlate with a low share of the raised V(P)R variants.
Importantly, although the tokens in Table 2 come from the same stimulus sentence, this analysis is merely monofactorial, i.e., it does not consider other possible influences (cf. Section 4 for this) such as informant-related factors like age, gender, the language competences, and in particular, the possible impact of the structural link between the auxiliary verb and the main verb, the topic of the next section.

3.2.2. The Structural Link between the Auxiliary Verb and the Main Verb

We assume that the fundamental possibility of v-p-r strongly depends on the structural link between the auxiliary verb and the main verb. Thus, the second hypothesis is as follows:
(11) Hypothesis II:
If the structural link between the auxiliary verb and the main verb is strong, as in the case of the auxiliary dune (“do”), v-p-r is inhibited more frequently than in a verb cluster with a weaker link as, for example, in the case of modal verbs.
One way to evaluate the strength of this link is to see if the two verbs are mutually dependent in order to ensure a correct interpretation. Modal verbs, for example, do not influence the meaning of the main verb they govern, while this is quite different in the case of the temporal auxiliary in the present perfect tense. With respect to this tense in StG, Grewendorf (1995, p. 83) and Musan (2002, p. 40) are convinced that the temporal auxiliary in the present tense and the past participle of the main verb must co-act in order to transmit the notion of anteriority.
The importance of this link can also be seen by comparing sentence <3> Don’t you see that I am turning on the light, in which many informants inserted the progressive auxiliary dune, with sentence <5> Henry doesn’t know that he can leave the country, in which most translations include the dynamic modal verb könne(n) (“can”). Shlonsky’s scheme in (7) and (9) locates the phrase Modpossibility much higher than the phrase Aspgeneric/progressive (note once again the missing phrases indicated by […]). Therefore, if only the location of the auxiliary verb in the split IP domain was decisive, we would expect more instances of the V(P)R variants with low dune than with high könne(n). This, however, is not the case. In the complement clauses with könne(n), we find 53.4% of instances of the V(P)R variants (95 of 178 relevant tokens), while in the complement clauses with dune, only 29.8% of the translations feature the V(P)R variants (28 of 94 relevant tokens). Therefore, contrary to the first hypothesis in (10), dune does not co-occur with more instances of the V(P)R variants, even though it is localized much lower than könne(n) in the split IP domain.
One reason for this unexpected result may be the fact that dune is an all-purpose auxiliary.19 This multifunctionality is probably a consequence of the fact that even the semantics of dune as a main verb is rather unspecific. Dune, therefore, lends itself to all kinds of grammaticalization processes, for example, the shift to an aspectual or irrealis auxiliary or even as a semantically even emptier verbal host for the integration of loan verbs (cf., e.g., Kaufmann, 2023, pp. 353–355). However, such an all-purpose auxiliary comes with a serious disadvantage, namely the fact that the hearer has to parse the whole clause in order to specify its meaning. In particular, its interpretation depends on certain adverbs such as the habitual always, as in the stimulus sentence <34> This is the man who is always staring at my house (cf. (19)) or, when coding the progressive aspect, on the meaning of the main verb, as with the achievement verb to turn on in the just analyzed stimulus sentence <3> Don’t you see that I am turning on the light. This dependency is the reason why we assume that the structural link between dune and the main verb is very strong and why the NR variant with the sequence main verb–dune is very frequent. If finite dune appeared earlier in the clause—and this would be the case in the V(P)R variants—the hearer may have trouble identifying its meaning.
Admittedly, our reasoning so far does not provide a feasible way to accurately measure the strength of the link between the auxiliary and the main verb, and unfortunately, we will not be able to offer a definitive conclusion in this respect. However, another aspect of verb clusters may at least partly address this conundrum, namely the different morphological forms of the main verb as either a past participle (in the case of the temporal auxiliary han “have”) or a bare infinitive (in all other cases of the MLG dataset). One could assume, as Barbiers (2005, p. 262—Note 18) and von Stechow (1990, pp. 149–150) do, that at least modal verbs selecting a bare infinitive and temporal auxiliaries selecting a past participle are constructed identically and coherently, which means that they all embed a bare V2P. However, opinions differ with regard to different morphological expressions. For example, Ramchand and Svenonius (2014, p. 158) write: “The novelty here comes from actually attempting to line up auxiliary ordering with these zones in an explicit way, placing the progressive participle within the VP domain and the perfect participle outside of it”. Moreover, even the same morphological expression does not guarantee structural identity (cf., e.g., Wurmbrand, 2001). A bare infinitive governed by aspectual dune (“do”) may be of a very different nature than a bare infinitive governed by modal verbs. Ramchand and Svenonius (2014, p. 159) confirm this assumption: “Both the modal and the perfect auxiliary introduce topic situations which are anchored to the utterance time, and which are distinguishable in principle from the VP event […]”, while “[i]n the progressive […], the event run time and the tense specification of the progressive auxiliary cannot be so distinguished”. The fact that they group together the modal verbs and the perfect auxiliary and contrast this class with an aspectual marker for which the event time and the tense specification cannot be distinguished may explain the role of dune as the most v-p-r-hostile auxiliary (cf. Table 4). After all, this unique “semantic” congruence can be seen as an indication of a very strong link between this auxiliary and the main verb.
In conclusion, if we assume that there exist different types of links between the auxiliary and the main verbs, we must distinguish different classes of auxiliary verbs. Only after such a distinction has been made should we focus on the relative position of individual members of these classes in the split IP domain. To illustrate this, we propose a potential classification of the nine auxiliary verbs in the MLG dataset in Shlonsky’s now complete scheme in (12).
(12) Five possible classes of auxiliary verbs in the IP domain (based on Shlonsky’s (2010, pp. 422–423) complete scheme).
Languages 10 00095 i006
Let us start out by discussing some general facts about the split IP domain. One way to understand this domain is to think of its higher functional phrases as more speaker-related or more related to the subject of the clause and its lower phrases as more verb-related or simply as less speaker-/subject-related (cf. also Wiltschko’s (2014) organization of universal categories in a universal spine). In (12), the sphere of mood is highest, followed by the sphere of epistemic modals, for which we unfortunately have no relevant data. Both of these spheres are strongly speaker-related. The next sphere is the tense sphere, which contains T(Past) and T(Future). From there, the scheme roughly mirrors the TMA approach (tense–mood–aspect). All root modals are below the tense phrases, with the exception of T(Anterior), and the majority although not all of the aspect phrases are lower than the modal phrases.
What makes the aspectual sphere somewhat problematic, at least on first sight, is the huge range from the very high habitual aspect to the very low frequentative aspect. However, this spread becomes comprehensible once we realize that the higher aspect phrases are more speaker-oriented, while the lower aspect phrases are more verb-oriented (bordering on the category Verb with the truly verbal sphere of Aktionsart or inner aspect; cf., e.g., MacDonald, 2008). For example, with respect to the continuous/progressive aspect, which is closer to the speaker’s end of the IP domain than the inchoative aspect, here represented by Aspinceptive(ii), it is the speaker who decides whether s/he wants to adopt an internal or an external point of view with regard to the state/event described. In contrast, the inchoative meaning of woare(n) (“will”) does not depend at all on the speaker’s point of view. Finding habitual aspect highest in (12) should not surprise us either, since in this case, the speaker is simply emphasizing the habitual occurrence of an event, abstracting from any single event and not immersing herself/himself in the described event, as is the case with the progressive aspect.
As for the modal verbs, the modal possibility often describes a characteristic/capability of the subject of the clause, which may or may not be identical with the speaker. Therefore, it makes sense to situate this phrase in the higher speaker-/subject-related sphere (cf., e.g., stimulus sentence <16>, as in (22a–c)). In contrast, the cause of or source for an obligation is something/someone that is almost always outside the speaker’s/subject’s influence (cf. stimulus sentence <15>, as in (1a–c)). The existence of such an external source justifies a position far away from the upper regions of the IP domain. The intermediate position of wolle(n) (“want to”) can be explained by the fact that it is more subject-oriented than mute(n) (“must”) but also less subject-oriented than könne(n) (“can”). After all, the subject’s mental and physical capabilities are generally more subject-defining than his or her often short-lived desires.
Regarding the temporal auxiliary han (“have”), it is important to note that we did not locate it in the high tense phrase T(Past). There are two reasons for this; first, we must not forget that han occurs with present tense morphology in the present perfect tense, a fact that strongly contradicts its localization in T(Past). Second, as in StG, there still seems to be a certain aspectual touch to the present perfect tense in MLG, which distinguishes it from the past tense and makes its positioning in T(Anteriority) more appropriate. In any case, T(Anteriority) appears in the predominantly aspectual sphere of (12). Regarding its structural link, we have already mentioned in Section 3.2.2 that han can only code anteriority in combination with the past participle of the main verb. However, although the results of Table 4 suggest that this link is strong, it does not seem to be as strong as the link of aspectual dune, as the V(P)R variants are much less frequent with dune than with han.
With regard to the classes of auxiliary verbs in (12), we still have to see whether it makes sense to separate irrealis dune (Mood) from aspectual dune or to separate woare(n) (“will”; Aspectverb) from the modal verbs (cf. the dashed lines in (12) and Section 4.3 for a discussion). At this point, we only claim that both han and dune are more strongly linked to their respective main verbs than modal verbs. Within each of these classes, however, we expect differences in the shares of visible v-p-r according to the position of the individual members in the split IP domain. Since Kaufmann (2023, pp. 377–385) has already shown that the different positions of aspectual phrases do indeed account for part of the variation in the MLG two-verb clusters headed by dune (“do”), this seems to be a viable approach.

3.2.3. Clause Type

The third structural factor concerns the degree of integration of the subordinate clause into the matrix clause. Apart from Kaufmann (2003, 2007, 2016), serious consideration has never been given to the potential influence of this factor on verb clusters. Though De Sutter (2009, p. 234) mentions possible effects of “clause type (root vs. embedded clause), the nature of the conjunction (in case of an embedded clause) and verb tense”, he neither offers a real discussion of these factors nor does he point to any pertinent publication. In any case, v-p-r to a high position in the higher IP domain or the CP domain will almost always cause a right-branching sequence (the V(P)R variants). However, in strongly integrated subordinate clauses, such high positions may simply not be available. Hopp and Putnam (2015, p. 207), who relate clause type to the verbal position in subordinate clauses, write: “In our approach, the subordinate clauses in MSG [Moundridge Schweitzer German] that continue to exhibit verb-final ordering lack illocutionary force (e.g., [-Force]), and as a result do not license an extended CP-layer […]”. Although we see the causality the other way around, with the lack of an extended CP-layer (and/or IP-layer) inhibiting the VPR variants, Hopp and Putnam (2015, pp. 195–196) also induce v-p-r as a possible factor for verb-second clauses. In this respect, it is important to realize that it is especially subordinate clauses with the VPR variant that share some crucial features with unintroduced root clauses: (i) their verbs appear discontinuously in a V1-V2-sequence and (ii) their finite verb often appears, at least superficially, in second position (cf. (6) and (8b)). Strikingly, this second similarity resembles the variation in Danish subordinate clauses with adverbs (cf., e.g., Christensen & Jensen, 2022; Christensen et al., 2020), which appear either with the sequence adverb–verb or with the sequence verb–adverb, the latter resembling the sequence in Danish root clauses, which are, just like in StG, verb-second clauses. Thus, the third hypothesis can be formulated as follows:
(13) Hypothesis III:
More integrated subordinate clauses provide less structural space for adjunction in the IP and the CP domain. Therefore, we expect to see fewer instances of the raised V(P)R variants in more integrated clauses.
Crucially, because of the possible functional side of the different cluster variants, we have to formulate a fourth hypothesis:
(14) Hypothesis IV:
Since the V(P)R variants share some linearization features with unintroduced root clauses (e.g., V1 precedes V2), some Mennonites may have begun to refrain from scrambling the ObjDP out of V2P in order to further strengthen this similarity by producing the VPR variant with the superficial sequence V1-ObjDP-V2. With this, they may have started to indicate structural integration by their choice of particular cluster variants.
With the MLG dataset, we can analyze three types of subordinate clauses, namely complement, conditional, and relative clauses. Causal clauses cannot be included, since many informants, especially in the USA and Mexico, have reanalyzed them as structural verb-second clauses20 (cf. Kaufmann, 2016, pp. 286–306 and Christensen & Jensen, 2022, p. 201 for a comparable development in Danish). All complement clauses analyzed represent the extraposed internal complement of the respective matrix construction (a verb or a predicative), i.e., they follow their matrix clause. All conditional clauses precede their matrix clause. The relative clauses appear adjacent to their reference noun, and all of them can be qualified as restrictive rather than appositive (cf., however, Kaufmann, 2016, pp. 30–31 for some problems in deciding this issue).
The question now is which characteristics can be used to distinguish between different degrees of clausal integration. Here, it is important to note that a scalar concept of clausal integration is not new. Fabricius-Hansen (1992, p. 465; cf. also Frey, 2011, p. 72; Reis, 1997; Auer, 1998; Langacker, 2009, p. 328), for example, writes that “[…] it need not be the case that a clause is either subordinate or not subordinate to another clause. Rather, there may be a stronger or weaker, or a more or less pronounced, relationship of subordination between them [translation G.K.]”. One characteristic that defines the relationship of subordination can be broadly called “semantic”. For this, a comment by Reis (1997, p. 126) is crucial:
Subordinate clauses occupy specific phrasal positions in the projection of the head Vb [matrix verb] of Sb [matrix clause] and have specific relations with it (adverbial clauses with the event variable of the head, complement clauses with its theta grid). Thus they are ‘directly licensed’ from Vb (Haider, 1995, p. 262). The base position of the complement clauses results from the assignment conditions of the theta roles: They must be l-marked by Vb or strictly governed by it. Attributive clauses of Sb (relative clauses, comparative clauses, etc.) are characterized by an antecedent in Sb to which they relate in a specific way (restrictive, explicative, etc.). With regard to Vb, Haider calls them ‘indirectly licensed’ […] [translation G.K.].
“Semantically”, the three types of subordinate clauses analyzed in the present paper can be ranked. Complement and conditional clauses are directly licensed by the matrix verb, whereas relative clauses are only indirectly licensed and thus may be considered to be less integrated. Complement clauses are distinguished from adverbial clauses by the fact that, in their base position, they are l-marked or strictly governed by the matrix verb and not only by its event variable. In terms of their “semantic” integration, complement clauses are more integrated than adverbial clauses (cf. also Christensen & Jensen, 2022, p. 196 for the same claim). The “semantic” ranking is as follows (< means less integrated):
(15) relative clauses < conditional clauses < complement clauses
Two other characteristics of subordinate clauses are their vertical position in the tree structure and their horizontal position in clausal topology. The higher up in the structural tree the subordinate clause is, the further away from the (verbal core of the) matrix clause it is. Haider’s (2010, p. 225) comment with regard to extraposed relative and complement clauses illustrates this (cf. also Frey’s 2011, p. 71 and Haegeman’s 2012, p. 170 analogous comments with regard to adverbial clauses): “As emphasized above […], extraposed relative clauses precede extraposed argument clauses […]. Therefore it is impossible that extraposed argument clauses are adjoined lower than extraposed relative clauses”. If extraposed complement/argument clauses are adjoined in such a high and disintegrated position, we may assume that they possess more structural space in the higher IP domain and the CP domain, which may then allow for more instances of visible v-p-r. Regarding structure, we can then formulate the following sequence (< again means less integrated):
(16) extraposed complement clauses < extraposed relative clauses
With regard to the topological position of the subordinate clause, the subordinate clause must, according to Fabricius-Hansen (1992, p. 467), either occupy the topological prefield, as is the case with the conditional clauses in the MLG dataset, or the topological midfield, or it must be an attributive clause adjacent to its nominal head (the relative clauses in the MLG dataset). Extraposed clauses in the postfield, such as the complement clauses in the MLG dataset, do not belong to Fabricius-Hansen’s category of indisputably subordinate clauses. The potential impact of the topological field in which the subordinate clause occurs is not at all surprising, since several syntactic phenomena demonstrate that the prefield of a German clause is structurally more integrated than its postfield.21 On the one hand, in a declarative clause, the prefield must (almost) always be phonetically realized, regardless of whether this condition is met by an expletive (e.g., es “it” in StG), an adverb, a DP, a PP, or a CP. Conversely, the postfield frequently remains empty. On the other hand, the prefield is usually identified with a strongly integrated specifier position in the CP domain, while the postfield is usually analyzed as a loosely adjoined position (cf., e.g., Sternefeld, 2008, pp. 289–290). With respect to the MLG dataset, we can thus establish the following hierarchy with respect to the topological integration of the subordinate clauses:
(17) complement clauses in the postfield < conditional clauses in the prefield
It is difficult to decide how relative clauses should be treated in this respect. They may sometimes be said to be part of the prefield as in (18), but it is often hard to determine whether they are either part of the midfield or, if they have been extraposed string-vacuously, part of the postfield as in (19).
stimulus <36>English: The doctor who wants to see my foot is very worried.
(18) de:-deDoktorwatwillminFutsehen
thethedoctorwhowants-V1[myfoot]-ObjDPsee-V2
issehrbekümmert
isveryworried
(USA-6; m/20/Engl>MLG-79%)
stimulus <34>Spanish: Este es el hombre que está siempre mirando mi casa.
English: This is the man who is always staring at my house.
(19) detisdeMaunwatimmernominHüs
thisisthemanwhoalways-ADV[tomyhouse]-ObjPP
kiekendät
look-V2does-V1
(Mex-81; m/46/Span>MLG-71%)
One important feature that distinguishes relative clauses from both complement and conditional clauses is the fact that they do not form a constituent themselves but are part of a constituent (cf., e.g., Hilpert, 2013, p. 204 for this). With this in mind, a further hierarchy can be established as follows:
(20) complement clauses/conditional clauses < relative clauses
In three of the four hierarchies in (15), (16), (17), and (20), extraposed complement clauses represent the least integrated clause type. Both relative and conditional clauses obtain this position only once (cf. (15) and (20), respectively). Thus, extraposed complement clauses do indeed seem to be the most disintegrated clause type in the MLG dataset. If the third hypothesis in (13) is correct, we expect more tokens of the V(P)R variants (especially the VPR variant) in this clause type. As for the ranking of conditional and relative clauses, we are faced with a tie. We could assign different weights to the four rankings in order to distinguish the two clause types, but this would only be an ad hoc decision. Therefore, we would prefer to trust the statistical analyses outlined in the next section.

4. Large-Scale Statistical Analyses

4.1. The Dataset

The monofactorial analyses in Section 3.2 have provided preliminary support for the claim that the auxiliary verb affects the frequency of v-p-r (cf. Table 2 in Section 3.2.1 and the discussion of sentences <3> and <5> in Section 3.2.2). However, we have not yet provided an analysis of the link between the auxiliary and the main verb nor of the clause type. To fill this gap, we will now carry out a generalized linear mixed model in SPSS (version 29.0.1.0 (171); target distribution and relationship with the linear model; binary logistic regression). The use of such a model instead of a binary logistic regression analysis is necessary because we will introduce both random and fixed factors. The model comprises 4470 tokens of subordinate clauses with clause-final two-verb clusters (1854 tokens with the V(P)R variants). The model binarily contrasts the V(P)R variants (+v-p-r) with the reference category, the NR variant (-visible v-p-r). The 321 informants enter the model as a random factor. Strictly speaking, these informants were not randomly selected, as their selection was based on the indispensable condition of sufficient competence in the majority language. Importantly, however, when asked to participate, very few people refused to do so. On average, each informant contributed 13.9 tokens (ranging from 4 to 23 translations; SD: 3.2). Apart from missing or unusable translations, this rather wide range is due to informants who produced the preterite in stimulus sentences that aimed at translations with the present perfect tense and informants who did not produce (m)any translations with optional dune (“do”) or woare(n) (“will”). Both preferences resulted in translations without a verb cluster.
The 4470 tokens come from 29 of the 46 stimulus sentences, with the majority of 2984 tokens (66%) coming from the twelve sentences designed to elicit clause-final two-verb clusters with a temporal auxiliary or a modal verb in complement, conditional, and relative clauses. Since these sentences were carefully designed (cf. Section 2.2), they cannot be considered as a random factor nor can they enter the modal as a fixed factor either, since they are strongly correlated with the crucial factors of auxiliary verb and clause type. On the contrary, the research location enters the model as a fixed factor. Again, the six locations were not randomly selected and therefore should not be considered a random factor. Moreover, the number of six levels is rather small for a random factor. In total, the model contains a binary dependent variable, one random factor, and eight fixed factors. In the following, the fixed factors are listed, with categorical variables being presented with their reference category and the exact number of tokens for each level:
  • Categorical variables
  • Research location (six variants; reference category Brazil): Brazil (828 tokens); USA (1123 tokens); Mexico (1336 tokens); Bolivia (113 tokens); Menno, Paraguay (569 tokens); Fernheim, Paraguay (501 tokens)
  • Gender (two variants; reference category men): men (2481 tokens); women (1989 tokens)
  • Clause type (eight variants; reference category ±extraposed relative clause): ±extraposed relative clause (791 tokens); -dislocated relative clause (382 tokens); ±dislocated relative clause with an anaphoric pronoun (221 tokens); conditional clause without dann (735 tokens); conditional clause with dann (946 tokens); disintegrated conditional clause (136 tokens); complement clause without dat (1009 tokens); complement clause with dat (250 tokens)
  • Auxiliary verb22 (nine variants; reference category hananterior): hananterior (1527 tokens); woare(n)inchoative (542 tokens); wolle(n)volition (613 tokens); mute(n)obligation (629 tokens); könne(n)possibility (525 tokens); duneconditional (153 tokens); dunehabitual (150 tokens); dunecontinuous (141 tokens); duneprogressive (190 tokens)
  • Metric variables
  • Age
  • Competence in MLG
  • Competence in the majority language
  • Competence in StG
Before presenting the results of the model, some comments are in order: (i) For 45 of the 321 informants (14%), we do not know the “precise” level of competence in the contact languages (cf. Note 2). However, we do know the dominant language(s) of these informants. In order to not lose their tokens for the model, their language competences were estimated according to the available data of comparable informants, i.e., of informants of the same community and in the same gender and age group. (ii) Six of the 73 informants in the United States and two of the 44 informants in the Paraguayan colony of Menno had lived in the respective research location for less than five years. These informants were pooled with the other members of the respective community. Importantly, excluding these two sets of informants would not change anything substantial with regard to the model’s results. Their inclusion, however, greatly increases its reliability. (iii) We distinguish three types of imperfective aspects (cf. Kaufmann, 2023, pp. 389–393 for a clause-by-clause categorization), namely habitual aspect as in <34> This is the man who is always staring at my house (cf. (19)); continuous aspect as in <4> Can’t you see that I am wearing a new dress?; and progressive aspect as in <32> The stories that he is telling the men are very sad (cf. (23)). Regarding the auxiliary verbs in general, it should not be forgotten that tokens with aspectual/irrealis dune (“do”) are rare in the Paraguayan translations and that the modal verb wolle(n) (“want to”) occurs predominantly in relative clauses (cf. the discussion of sentences <35> and <36> in Section 4.3). These uneven distributions are certainly not ideal for statistical analyses, but most of them cannot be avoided. (iv) So far, we have talked about three clause types, namely complement, conditional, and relative clauses. However, there are some variations in the translations that force us to refine this rather rough division by creating eight instead of three levels for the factor clause type. Regarding complement clauses, many translations include a correlate in the matrix clauses, as in (21b), which unlike (21a) features dat (“it” or “that”).23
stimulus <5>Portuguese: O Enrique não sabe que ele pode sair do país.
English: Henry doesn’t know that he can leave the country.
(21)a.Heinrich[1.7]weitnichdatheidatLand
Henry[1.7]knowsnotthathe[thecountry]-ObjDP
verlotekaun
leave-V2can-V1
(Bra-2; m/55/MLG)
b.Enriqueweitdatnichdatheikaun
Henryknowsthat-CORRELATEnotthathecan-V1
ütdemLandgone
[outthecountry]-ObjPPgo-V2
(Bra-36; f/31/Port>MLG-50%)
Due to the fact that the subordinate clause in (21b) is linked to the nominal correlate dat, many linguists see it as a kind of attributive/relative clause rather than a complement clause (cf., e.g., Lehmann, 1984, pp. 325–329; Zimmermann, 2011; Kaufmann, 2016, pp. 362–373 on the formal and semantic similarities of relative and complement clauses). Taking this into account, the question as to whether such a correlate changes the degree of integration of the subordinate clause arises.
A somewhat comparable variation occurs in the translations of conditional sentence compounds, many of which contain the resumptive element dann (“then”; cf. (22b) instead of (22a)). Aside from this, translations such as (22c) contain disintegrated conditional clauses, i.e., clauses that do not occupy the prefield of the matrix clause (SpecCP in structural terms) but rather the pre-prefield (probably a position adjoined to a phrase of the CP domain).
stimulus <16>Spanish: Si él puede resolver este problema, es muy inteligente.
English: If he can solve this problem, he is very smart.
(22)a.wannheikaunditProblemlösen
ifhecan-V1[thisproblem]-ObjDPsolve-V2
isheisehrklüag
isheverysmart
(Mex-60; f/42/MLG)
b.wanndeidat:[0.7]Problemkaunlösen
ifhe[the[0.7]problem]-ObjDPcan-V1solve-V2
dannisdeisehrklüag
then-RESUMPTIONisheverysmart
(Mex-61; m/31/Span>MLG-64%)
c.wanndeiditTroublekaunlösen
ifhe[thistrouble]-ObjDPcan-V1solve-V2
deiissehrklüag
heisverysmart
(Mex-41; m/37/MLG)
With respect to relative clauses, there is, in addition to their topological position (cf. (18) and (19)), one more type of variation. Here, (23) is relevant.
stimulus <32>Spanish: Las historias que les está contando a los hombres son muy tristes.
English: The stories that he is telling the men are very sad.
(23) dieGeschichtewatdeideMenschevertahledät
thestoriesthathe[thepeople]-ObjDPtell-V1does-V1
diesin   sehrtrürig
they-RESUMPTIONare   verysad
(Bol-8; m/20/MLG)
The translation in (23) features the resumptive anaphoric pronoun die (“they”) that appears after the relative clause. This is a clear case of left dislocation (cf. the discussion in Bousquette et al., 2021), and one may therefore assume that the relative clause and its reference noun are located in the pre-prefield, as was the case with the disintegrated conditional clause in (22c). Bearing this in mind, we have to distinguish three subtypes of relative clauses. Relative clauses as part of the (pre-)prefield, one of which may be dislocated (-dislocated relative clause; ±dislocated relative clause with an anaphoric pronoun), and relative clauses that occur at the end of the sentence compound and may or may not be extraposed into the postfield (±extraposed relative clause).

4.2. Informant-Related Predictors

We are now ready to discuss the results of the model. The predicted distribution of the NR variant and the V(P)R variants versus their actual observed distribution increases from 58.5% to 86.8% when the random effect and the fixed effects are added. The conditional R2 is 0.642, i.e., the random and fixed effects together “explain”24 64.2% of the variation in the dependent variable. The informants (random effect) “explain” 17%.25 With this, the six fixed effects with a significant F-value “explain” 47.2% of the variation (marginal R2 is 0.472). Because of the high number of fixed effects with a significant F-value, we will illustrate the results in two tables, starting with the four informant-related fixed effects in Table 3.
Table 3. Generalized linear mixed model for visible v-p-r in complement, conditional, and relative clauses (part I: informant-related predictors).
Table 3. Generalized linear mixed model for visible v-p-r in complement, conditional, and relative clauses (part I: informant-related predictors).
research locationStGagegender
F: 28.6 ***F: 13.9 ***F: 11.1 ***F: 6.3 *
USA (7.97 ***)
Mexico (6.23 ***)
Bolivia (3.2 (*))
women (1.65 *)
Brazil men
Menno (0.31 **)
Fernheim (0.27 **)
StG (0.864 ***)age (0.977 ***)
The four fixed effects in Table 3 are ranked by their F-values. The larger this value, the more confident we can be that the predictor in question actually improves the overall fit of the model. The metric variables and the reference categories of the categorical variables are shown in bold. The reference categories are placed in the shaded central line. Variants that do not show a significant difference in effect size to the reference category are located together with them (cf. Table 4 for this). Above the shaded line, the metric variables and the variants of the categorical variables that indicate greater odds for the V(P)R variants (odds ratios greater than 1) than the respective reference category are listed with the precise values of their odds ratio. The odds ratios represent the ratio of the odds of the target variant, in Table 3, the V(P)R variants, occurring in the presence of a certain level of a fixed factor (e.g., being a woman) to the odds of the target variant occurring in the presence of the reference category (in this case, being a man). Below this line, the metric variables and the variants of the categorical variables that indicate smaller odds for the V(P)R variants than the reference category are listed (odds ratios smaller than 1).
The research location is the first predictor that we will discuss. In order to understand the differences between the six communities, it is important to recall their different migration paths, which were described in Section 2.1. On the one hand, there are the more conservative communities in Mexico, Bolivia, and the United States, all of which speak the Chortitza variety of MLG. On the other hand, there are more progressive communities in Brazil and Paraguay, all of which speak the Molotchna variety of MLG. Table 3 shows that the odds of a US-American informant producing the V(P)R variants are 7.97 times greater than those of a Brazilian informant, the reference category for this variable. For a Mexican informant, the odds ratio is 6.23. Although the two communities in Paraguay originate from different migration waves, the influence of Fernheim was so strong that today, they behave rather similarly. Compared to a Brazilian informant, their odds of producing the V(P)R variants are 3.23 (Menno; 1:0.31) and 3.71 times smaller (Fernheim; 1:0.27), respectively. Interestingly, the eight Bolivian informants, whose ancestors came from the once conservative community of Menno, turn out to be syntactically more progressive. In comparison to a Brazilian informant, their odds of producing the V(P)R variants are 3.2 times greater, though this difference only represents a statistical tendency (p = 0.064). The intermediate position of the Brazilian Mennonites, who belong to the same migration wave as the progressive Mennonites from Fernheim, may be due to the fact that the anti-German language laws of the so-called Estado Novo (1937–1945) destroyed much of the community school system of German-speaking migrants in Brazil and the once important role of StG along with it.
Crucially, it is not only the research location that shows a significant F-value but also the competence in StG (without any multicollinearity problems26). While the latter effect exclusively concerns the informants’ individual competence in StG, the former effect may also be related to the different roles of StG in the six research communities. Some readers may take issue with the fact that we do not only relate the informants’ competence but also the factor research location to StG. Of course, for this factor, one could also assume that there might be syntactic differences between the two basic MLG varieties (Chortitza and Molotchna). Unfortunately, we have no relevant information about their syntax before the Mennonite migration to the Americas. The idea that the different stimulus languages in the six research locations might have had an influence has already been discarded in Section 2.2. In any case, four more points on the individual competence scale (cf. Note 2) reduce the odds of the V(P)R variants by a factor of 1.79 ((1:0.864)4). Since StG does not allow v-p-r in two-verb clusters, this prestigious variety may thus still be considered an influential roofing variety (Dachsprache) for many speakers of MLG.
Aside from the competence in StG and the research location, both age and gender show significant effect sizes. Ten more years reduce the odds of producing the V(P)R variants by a factor of 1.26 ((1:0.977)10), while the odds of a female informant are 1.65 greater than those of a male informant. One can therefore assume that v-p-r constitutes a linguistic innovation. After all, younger women seem to lead instances of change from below (cf. Labov, 2001, pp. 279–280). Crucially, the level of competence in MLG does not show a significant effect size, strongly suggesting that the odds of the V(P)R variants are not related to language attrition.

4.3. Auxiliary Verbs and their Structural Link to the Main Verb

With regard to the structural factors, both the type of auxiliary verb and the clause type show significant effect sizes. Table 4 summarizes these results.
Table 4. Generalized linear mixed model for visible v-p-r in complement, conditional, and relative clauses (part II: structural predictors).
Table 4. Generalized linear mixed model for visible v-p-r in complement, conditional, and relative clauses (part II: structural predictors).
auxiliary verbclause type
F: 96.4 ***F: 10.5 ***
woare(n) (18.59 ***)
wolle(n) (15.08 ***)
mute(n) (14.48 ***)
könne(n) (4.16 ***)
complement (+dat) (3.29 ***)
disintegrated conditional (2.75 ***)
complement (-dat) (2.51 ***)
conditional (+dann) (1.42 (*))
han±extraposed relative
−dislocated relative (-der)
±dislocated relative (+der)
conditional (-dann)
dunecontinuous (0.481 *)
duneprogressive (0.393 ***)
duneconditional (0.167 ***)
dunehabitual (0.093 ***)
Regarding the auxiliary verb, the odds of the inchoative auxiliary woare(n) (“will”) co-occurring with the V(P)R variants are 18.59 times greater than the co-occurrence of these variants with the temporal auxiliary han (“have”), the reference category. For the habitual auxiliary dune (“do”), these odds are 10.75 smaller (1:0.093). Crucially, all auxiliaries show significant effect sizes. Regarding the raising behavior of the auxiliary verbs, the very high habitual dune (decreasing odds ratio of 10.75), the very low inchoative woare(n) (increasing odds ratio of 18.59), the high modal könne(n) (“can”) (increasing odds ratio of 4.16), and the low modal mute(n) (“must”; increasing odds ratio of 14.48) behave in line with the first hypothesis in (10). The lower their location in the split IP domain (cf. (12)), the higher their odds ratio. Nevertheless, a one-dimensional look at the scheme in (12) cannot explain all of Table 4’s results. For example, judging by the rather low position of the progressive auxiliary dune (odds ratio of 0.393), which is located 15 phrases lower than Modpossibility, one would have expected its odds ratio for visible v-p-r to be much greater than those of the modal verb könne(n) (odds ratio of 4.16; cf. also the discussion of sentences <3> and <5> in Section 3.2.2), but this is definitely not the case. For this reason, the structural position of the auxiliaries in the split IP domain can be regarded as an important but insufficient factor on its own. The crucial factor in this respect is the grouping of the auxiliary verbs according to their structural link to the main verb (cf. (12)).
In Section 3.2.2, we presented some arguments suggesting that the link between modal verbs and their respective main verbs is weak, and according to the second hypothesis in (11), this weak link should allow for rather unrestricted v-p-r. The results of Table 4 support this hypothesis, as all modal verbs show (very) high values for their odds ratios. If we group woare(n) (“will”) with the modal verbs, a move justified by their functional similarity (cf., e.g., the use of the StG cognate werden as an epistemic modal verb or the discussion of MLG modal verbs below), and if we group the irrealis marker dune with the three aspectual forms of dune, we can now assume the following sequence of the auxiliary classes in (12) (< means weaker structural link):
(24) Modal verbs (Modalroot)/woare(n) (Aspectverb) < han (Tense) < dune (Mood/Aspectspeaker/subject)
If this sequence, which is based on Table 4, is correct, we no longer have to worry about the fact that a low functional position like that of progressive dune does not automatically lead to greater odds ratios than the high functional position of modal possibility. We can now explain this difference by pointing to the strong structural link of dune that greatly inhibits v-p-r. What remains to be explained is the internal ordering in the two auxiliary classes with several members. Regarding the category Modal verbs (Modalroot)/woare(n) (Aspectverb), the scheme in (12) suggests the following internal sequence (< means smaller odds ratios for visible v-p-r):
(25) könne(n)possibility < wolle(n)volition < mute(n)obligation < woare(n)inchoative
However, the sequence found in Table 4 is as in (26) (unexpected position in bold):
(26) könne(n)possibility (odds ratio of 4.16) < mute(n)obligation (14.48) < wolle(n)volition (15.08) < woare(n)inchoative (18.59)
The volitional modal verb wolle(n) (“want to”) is the only verb whose position is unexpected, as it displays the second greatest odds ratio (after very low woare(n)). Assuming that the hierarchy in (12) is universally correct, this result is unexpected, since the volitional phrase is located between the phrases for obligation and possibility. The fact that the odds ratio for visible v-p-r is greater with wolle(n) than with könne(n) (“can”) can therefore be regarded as expected, but the fact that the odds ratio is also slightly greater than with mute(n) (“must”) is unexpected. Seggewiß’s (2012, p. 112) description of one possible meaning of English want to may help solve this conundrum:
In the last two examples [134) so I met Jake at the pub and I said oh you want to read this before you go […]; 135) You don’t want to worry about fat], want to does not refer to the volition of the speaker but to that of the hearer and thereby expresses a recommendation or polite command. In 134), the suggestion or recommendation implied by want to can be substituted by should and in 135), don’t need to would be an appropriate alternative to want to.
A single look at the stimulus sentences <35> and <36>, which are responsible for 523 of the 613 tokens with wolle(n) (85.3%), makes it clear that a possible reading of them is relatable to Seggewiß’s description.
(27)a.stimulus <35> Is this the film you want to show to all your friends?
b.stimulus <36> The doctor who wants to see my foot is very worried.
One is probably not far off the mark if one assumes some surprise on the part of the speaker of sentence <35>. This person could also have said “I’m not quite sure if it is a good idea to show this film to all your friends” or “You had better not show this film to all your friends”. It is certainly no coincidence that two informants used the obligatory modal verb mute(n) (“must”) instead of expected wolle(n), thus hypothesizing about an external obligation that forces the hearer to act in this way. Similarly, the relative clause of sentence <36> does not represent a typical case of volition. The worried doctor certainly does not want to see the speaker’s foot as part of his personal desires; it is rather the case that s/he is obligated to see a person’s foot as part of his/her job. In line with this assumption, four informants used the modal verb solle(n) (“should”) instead of wolle(n). A further important clue for the behavior of wolle(n) comes from Seggewiß’s (2012, p. 112) summary with regard to want to:
Want to is apparently gaining ground as a modal marker, displaying new modal meanings. It is not only used for the expression of volition but is also beginning to be used as a future marker and as a marker of polite obligations.
The fact that want to can also be used as a future marker suggests that this English surrogate modal verb may be following the same path as its precursor will. The possibility of a similar path for the MLG modal verb wolle(n) may also help to explain the fact that its behavior with respect to visible v-p-r is rather close to that of the future marker woare(n) (“will”). Moreover, 32 informants used woare(n) instead of wolle(n) in the translations of stimulus sentence <35> (cf. (27a)). We can therefore conclude that the similarity between an inchoative auxiliary verb, which can indicate future time reference, and a volitional modal verb, which at least sometimes implies a future sub-meaning, leads to comparable behavior with regard to v-p-r. The (implicational) marking of future time reference by both wolle(n) and woare(n) is supported by Bybee et al. (1994, pp. 253–266), who demonstrate that in the world’s languages, markers of desire (and obligation) have produced more future markers than markers of possibility. Bybee et al. (1994, pp. 254) write that “[a] common agent-oriented pathway to future begins with desire”. The cross-linguistic rarity of markers of possibility as a source for future markers and the fact that the future tense in MLG is not obligatorily marked by a genuine tense marker high up in the IP domain may help to explain why MLG könne(n) (“can”) is the modal verb that behaves most differently from woare(n).
Regarding the four forms of dune in the category dune (Mood/Aspectspeaker/subject) in (24), one would expect the following sequence (< again means smaller odds ratios for visible v-p-r):
(28) duneconditional < dunehabitual < dunecontinuous < duneprogressive
Again, the actual sequence in Table 4, which is illustrated in (29), deviates (unexpected positions in bold):
(29) dunehabitual (odds ratio of 0.093) < duneconditional (0.167) < duneprogressive (0.393) < dunecontinuous (0.481)
There are several possible explanations for the two deviations. First, it is noteworthy that in (12), there are only two functional phrases between duneconditional and dunehabitual and only three functional phrases between dunecontinuous and duneprogressive. In contrast, there are nine phrases between these two pairs, i.e., between dunehabitual and dunecontinuous. In light of this, it is important to realize that the unexpected positions in (29) occur only within the two neighboring pairs, not between them. Second, one must not forget that duneconditional is not an aspectual marker, so its link with the main verb may actually be stronger or weaker than the link of aspectual dune. Nevertheless, the fact that duneconditional is homonymous with the three forms of aspectual dune also suggests that despite Wiltschko’s (2014, p. 93) function diagnostic, form-related mutual influences might be at play. Wiltschko writes: “The absolute position of a given UoL [Unit of Language] can be diagnosed by identifying its function, which is independent of its content”. Crucially, Table 4 demonstrates that dune as a habitual marker has a decreasing odds ratio for the V(P)R variants of 10.75 (1:0.093), while dune as a continuous marker has a much weaker decreasing odds ratio of 2.08 (1:0.481). With this, it is clear that the speakers of MLG discriminate different functions of the auxiliary dune, and thus we can assume that the MLG hearers are also able to discriminate probabilistically the different functions of dune, at least in the admittedly few cases where it precedes the main verb.
Apart from this, the aspectual categorization of the stimulus sentence <32> The stories that he is telling the men are very sad (cf. (23)), which accounts for 84 of the 481 tokens with aspectual dune (17.5%), is somewhat complicated. There can be little doubt that the unmarked reading of this clause is progressive, since its speaker is most likely witnessing a specific and probably unique episode of storytelling. However, the clause could also be read habitually. A clear indication for this is the fact that two informants inserted the adverb immer (“always”) in the matrix clause, as in (30), and this insertion forces a habitual reading.
stimulus <32>Spanish: Las historias que les está contando a los hombres son muy tristes.
English: The stories that he is telling the men are very sad.
(30) dieGeschichtewatdeideMenschevertahlt
thestoriesthathe[thepeople]-ObjDPtells-V1
sindimmersehrtrürig
arealways-ADVverysad
(Fern-11; m/44/StG>MLG-79%)
These two habitual tokens do not contain the habitual marker dune. Nevertheless, some informants may have understood this sentence as habitual even without a habitual adverb. If this occurred several times in translations with aspectual dune, at least one of the two deviations in (29) could simply be an artifact of some unavoidable misclassifications.
In any case, the odds of all forms of dune co-occurring with the V(P)R variants are (much) smaller than the odds of the temporal auxiliary han (“have”). Since two of the functional phrases headed by dune are located above the functional phrase of han in (12) and two are located below it, we are led to conclude that the structural link between han and the main verb is weaker than the structural link between dune and the main verb.

4.4. Clause Type

Regarding the clause type, the third hypothesis in (13) stated that the more disintegrated a subordinate clause is, the more illocutionary force it has and therefore the more topological space it should have in the upper region of the IP domain and the CP domain. This surplus of topological space could then lead to more possible adjunction sites for a raised V2P. Table 4 now shows that the odds for the V(P)R variants are 3.29 times greater in a complement clause with a correlate than the odds for the V(P)R variants in ±extraposed relative clauses, the reference category. The odds ratio for a complement clause without such a correlate is 2.51. For a disintegrated conditional clause, it is 2.75. Conditional clauses with the resumptive element dann (“then”) also obtain an odds ratio greater than one, but the value is more moderate with 1.42, and it only represents a statistical tendency (p = 0.057). Neither the conditional clauses without such an element nor the other two types of relative clauses behave differently from the reference category. As far as relative clauses are concerned, this is a clear indication that it is their close relation to the reference noun that matters, not the topological field in which they are located. The slightly greater odds ratios in complement clauses with a correlate rather than without one and in conditional clauses with a resumptive element rather than without one may support the assumption that the nominal correlate dat (“it/this”) and the resumptive element dann (“then”) lead to a lower degree of integration of the subordinate clause. However, in Section 3.2.3, we considered the idea that these specific clause types might also share some characteristics with relative clauses, namely their close relationship to a(n) (nominal) element of the matrix clause—and this would instead suggest that their behavior is comparable to the reference category. However, Table 4 does not support this assumption.
In any case, if the V(P)R variants are supposed to reflect or perhaps even indicate a higher degree of disintegration, we would expect the two types of extraposed complement clauses and the disintegrated conditional clauses to show higher odds ratios—and this is indeed the case. Moreover, the distribution of Table 4 resembles a comparable type of variation in Danish. Christensen et al. (2020, p. 248) write: “Previous studies have found that Verb > Adverb word order occurs most frequently in complement clauses […] and fordi ‘because’ clauses […], but may also be found in (other) adverbial clauses […] and relative clauses […], although fairly infrequently […]”. Importantly, the sequence Verb > Adverb leads to subordinate clauses with the finite verb in second position, just like in a Danish root clause. Quite strikingly, Hopp and Putnam (2015) even report a comparable development for a language they call “a moribund heritage variety of German”. Hopp and Putnam (2015, p. 196) write about Moundridge Schweitzer German: “In subordinate clauses, there is considerably more variation in that the standard verb-final order is apparent in relative clauses and clauses beginning with temporal complementizers. In contrast, weil-clauses and clauses introduced by the complementizer dass predominantly take verb-second orders”. So again, it is complement and causal clauses that often do not serialize finite verbs in the clause-final position.
To analyze this issue further, we will offer a second generalized linear mixed model (target distribution and relationship with the linear model; binary logistic regression). In Section 3.2.3, we argued that in comparison to the VR variant, the VPR variant would be an even better indicator for a high degree of clausal disintegration. After all, this variant shares two crucial characteristics with root clauses, namely that (i) the two verbs are discontinuous and (ii) the finite verb frequently surfaces in the second position (cf. (6) and (8b)). If this reasoning is correct, disintegrated clause types such as extraposed complement clauses should not only show a greater odds ratio for visible v-p-r, i.e., for the V(P)R variants but, in particular, they should show a greater odds ratio for the VPR variant. Since the second model does not contain any translations with the NR variant, the number of analyzable tokens decreases to 1841 (898 tokens with the VPR variant). For two reasons, this number is slightly smaller than the number of translations with the V(P)R variants analyzed in Table 3 and Table 4 (1854 tokens). On the one hand, there are some translations without an ObjDP for which we can distinguish between the verbal sequences V2-V1 (no visible v-p-r) and V1-V2 (with v-p-r) but not between the VR variant (with scrambling of the ObjDP) and the VPR variant (without scrambling of the ObjDP). On the other hand, the definiteness of some ObjDPs could not be clearly determined, and this factor is one of the ten independent variables used in the current model. The model’s dependent reference category is the VR variant; the informants again enter as a random factor. The list of the fixed factors is as follows:
  • Categorical variables
  • Research location (six variants; reference category Brazil): Brazil (236 tokens); USA (681 tokens); Mexico (757 tokens); Bolivia (44 tokens); Menno, Paraguay (77 tokens); Fernheim, Paraguay (46 tokens)
  • Gender (two variants; reference category men): men (963 tokens); women (878 tokens)
  • Clause type (eight variants; reference category ±extraposed relative clause): ±extraposed relative clause (266 tokens); -dislocated relative clause (124); ±dislocated relative clause with an anaphoric pronoun (95); conditional clause without dann (256 tokens); conditional clause with dann (468 tokens); disintegrated conditional clause (91 tokens); complement clause without dat (394 tokens); complement clause with dat (147 tokens)
  • Auxiliary verb (six variants; reference category han): hananterior (377 tokens); woare(n)inchoative (378 tokens); wolle(n)volition (353 tokens); mute(n)obligation (384 tokens); könne(n)possibility (250 tokens); dune (99 tokens)
  • Definiteness of internal complement (three variants; reference category definite ObjDP/PP): definite ObjDP/PP (1583 tokens); indefinite ObjDP/PP (203 tokens); demonstrative pronoun (55 tokens)
  • Type of internal complement (two variants; reference category ObjDP): ObjDP (1753 tokens) ObjPP (88 tokens)
  • Metric variables
  • Age
  • Competence in MLG
  • Competence in the majority language
  • Competence in StG
Eight of the ten fixed factors have already been used in the first model. The only caveat in this regard is the fact that due to the small number of translations with the V(P)R variants in clauses with the irrealis/aspectual auxiliary dune (“do”), these four categories had to be lumped together. The two factors added, the definiteness and the type of the internal complement, are frequently mentioned in the context of scrambling (cf. Wurmbrand’s (2017, p. 78) quote cited in Section 3.1). As for the type of the internal complement, the distinctive marker is simply the presence or absence of a prepositional head (cf. (19) and (21a+b)). The issue of definiteness is somewhat trickier. In order to illustrate this, a look at the translations in (31a+b) is helpful.
stimulus <11>Spanish: Si él firma ese contrato, va a perder mucho dinero.
English: If he signs this contract, he will lose a lot of money.
(31)a.wanndeiwirddiesenKontraktunterschriewen
ifhewill-V1[this-DEMproblem]-ObjDPsign-V2
dannwirddeiendollGeldverspielen
thenwillhealotmoneylose
(Mex-92; f/42/Span>MLG-61%)
b.wannheiwirddatunterschriewen
ifhewill-V1this-ObjDPsign-V2
dannwirdheivielGeldverspielen
thenwillhemuchmoneylose
(Mex-76; m/24/MLG+Span)
These translations represent tokens with the resumptive element dann (“then”) in the matrix clause and both feature the VPR variant with the finite verb superficially in the second position. However, the internal complements differ. Mex-92 translates it in the expected way, i.e., with a demonstrative determiner and a noun. These tokens are considered to have a definite ObjDP and, despite their deictic nature, are lumped together with the tokens with a definite article in the ObjDP. Mex-76 prefers a demonstrative pronoun (in the current context, dat represents English this, not it; cf. Note 23). This preference is understandable, since the interviewer has just read the Spanish stimulus sentence verbally pointing to the contract in question, so both participants already have the necessary information. There are 55 tokens of this kind. Importantly, the cluster variant in (31b) is an exception. Only seven of the 55 relevant tokens (12.7%) contain the VPR variant. Moving the demonstrative pronoun out of V2P is thereby definitely the default case; Hinterhölzl (2018, p. 229) even considers the movement of pronouns to be obligatory. Lenerz (1993, pp. 139–144) also claims that pronouns move and that this movement is a case of scrambling. Another case of variation concerns (32a+b).
stimulus <6>Spanish: ¿No sabes que él debería aprender inglés?
English: Don’t you know that he should learn English?
(32)a.weitsnichdat deisollEnglischlehren
knowyounotthatheshould-V1English-ObjDPlearn-V2
(Mex-5; m/16/MLG)
b.heiweißdat heisoll[äh]datEnglischlehren
heknowsthatheshould-V1[eh][theEnglish]-ObjDPlearn-V2
(Mex-25; f/14/MLG)
Although language names can be considered definite per se and although the vast majority of informants did not translate stimulus sentence <6> with a definite article as in (32b), we consider the ObjDP in this latter translation to be more definite than in (32a). This line of reasoning is consistent with Evans and Wimmer (1990, p. 268), who write: “It is also perhaps worth noting that in German colloquial styles the definite article series is often used with otherwise un-premodified proper nouns, putting them in much closer structural proximity to ‘definite descriptions’ […]”. Importantly, taking these tokens out of the model would not change anything substantial in the expression of the variable definiteness. It would, however, reduce the reliability of the model, since translations such as (33b) with an indefinite article instead of the expected definite article (cf. (33a)) occur rather rarely in the stimulus sentences of the MLG dataset.
stimulus <17>Spanish: Si realmente mató al hombre, nadie lo puede ayudar.
English: If he really killed the man, nobody can help him.
(33)a.wanndeiwirklichdenMaunhaftodgemeak
ifhereally-ADV[the.DEFman]-ObjDPhas-V1killed-V2
dannkaun   ihnkeinerhelpen
thencan   himnobodyhelp
(USA-69; m/29/Engl>MLG-71%)
b.wannheiwirklichhafenMaun[0.4][äh]
ifhereally-ADVhas-V1[a.INDEFman]-ObjDP[0.4][eh]
ge-[0.5]todgemeakdannkaunheideMaunnichhelpen
ki[0.5]killed-V2thencanhethemannothelp
‘If he really killed a man, then he cannot help the man.’
(Mex-37; f/18/MLG)
As already mentioned, the reference category of the binary dependent variable is the VR variant, i.e., we are interested in learning more about the factors that show significant effect sizes in the relationship with the VPR variant. The correct classification of the predicted distribution of the VR variant and the VPR variant versus their actual observed distribution increases from 51.2% to 80.3% when the random effect and the fixed effects are added. The conditional R2 is 0.443, i.e., the random and fixed effects together “explain” 44.3% of the variation in the dependent variable. The informants (random effect) “explain” 20%, a slightly larger share than in the first model.27 On average, each of the 266 informants who produced at least one token with the V(P)R variants contributed 6.9 tokens (ranging from 1 to 19 translations; SD: 4). A total of 55 informants, mostly from the two Paraguayan communities (45 informants), did not produce a single relevant token. The four fixed factors with a significant F-value “explain” 24.3% of the variation (marginal R2 is 0.243), a much smaller share than in the model of Table 3 and Table 4. Table 5 (next page) represents the four significant predictors.
In comparison to Table 3 and Table 4, the research location, the competence in StG, age, and gender do not show significant effect sizes. Therefore, the avoidance of scrambling cannot be qualified as a linguistic innovation. The only informant-related significant factor is the competence in MLG. The odds ratio of informants with, for example, 14 points in comparison to informants with ten points is reduced by a factor of 1.52 ((1:0.9)4). It is difficult to say whether this means that some tokens with the VPR variant must be qualified as instances of language attrition, especially because the relevant F-value of 3.1 only represents a weak statistical tendency (p = 0.08). However, one could argue that some informants with a lower level of competence in MLG produce the VPR variant in order to minimize the difference between unintroduced root and introduced subordinate clauses. This assumption would fit the comparable variation in the moribund heritage variety Moundridge Schweitzer German studied by Hopp and Putnam (2015), who write that “the extension of main clause word order into embedded clauses may be conceived of as a strategy to minimize processing effort”.
With regard to the structural factors, both the type and the definiteness of the internal complement show significant effect sizes, and both are consistent with Wurmbrand’s (2017, p. 78) quote cited in Section 3.1. The odds of ObjPPs and indefinite (or less definite) ObjDPs/PPs co-occurring with the unscrambled VPR variant are much greater than those of this variable co-occurring with ObjDPs and definite ObjDPs/PPs, the respective reference categories (factors of 3.82 and 10.45, respectively). Moreover, and also as expected, the odds ratio of demonstrative pronouns co-occurring with the unscrambled VPR variant is very low. The decreasing factor is 6.76 (1:0.148). In conclusion, if we assume that demonstrative pronouns almost always scramble and that ObjPPs and indefinite ObjDPs/PPs rarely scramble, their respective odds ratios regarding the VPR variant in the MLG dataset strongly suggest that the difference between the two raised variants is indeed the scrambling of the internal complement out of V2P.
The third structural predictor in Table 5 is the clause type. Both types of extraposed complement clauses do not only show high odds ratios with regard to visible v-p-r (cf. Table 4) but also between the two raised cluster variants, i.e., they show significant effect sizes with regard to the VPR variant. The respective odds ratios are 4.27 and 3.81. Since the difference between complement sentence compounds with and without the correlate dat (“it/this”) is small, one may assume that the disintegration of the postfield is so overwhelming that any possible effect of the correlate is outweighed. Unlike in Table 4, conditional clauses with the resumptive element dann (“then”) and disintegrated conditional clauses do not differ from ±extraposed relative clauses; just like in Table 4, the other two types of relative clauses do not differ either. So, all types of relative and conditional clauses behave in a comparable way with regard to the VPR variant.
A rather striking fact in Table 5 is the non-appearance of the auxiliary verb. The lack of a significant effect size means that the functional height of the auxiliary in the split IP domain is decisive for the visibility of v-p-r in general (cf. Table 4), whereas the scrambling of the internal argument is only helped or hindered by the availability of topological space. Strongly integrated subordinate clauses thus do not only impede visible v-p-r but also impede the raising of a large amount of phonetically realized material, which frequently seems to enforce the scrambling of the internal complement out of V2P. However, since scrambling is optional in most cases, some informants may refrain from it deliberately in raised variants in order to indicate a high degree of clausal disintegration. The resulting superficial similarity to root clauses may even have led to the reanalysis of causal clauses into structural verb-second clauses in the North American communities (cf. Kaufmann, 2016, pp. 286–306 and the brief discussion in Section 3.2.3).
In order to strengthen the assumption of such a functional turn, Table 6 groups the 167 informants with at least four translations with the V(P)R variants in conditional and relative clauses according to their preference for either the VR variant or the VPR variant (cf. the first line VPR variant). Importantly, these are precisely the clauses that showed no differential behavior in Table 5.
The first striking fact about Table 6 is that the share of the VPR variant in relative and conditional clauses ranges between 7.9% and 89.7%, while this range is much smaller in complement clauses (between 62.1% and 93.2%). Moreover and as expected, each group of informants shows a larger share of the VPR variant in complement clauses. However, the ratio between the two shares increases steadily (cf. the line ratio (compl/rel + cond)). While the ratio is negligible for the group that hardly ever produces the VR variant in relative and conditional clauses (cf. the column [>75%[), it is only 1.39 and 1.62, respectively, for the following two groups. For the two groups with a clear preference for the VR variant in relative and conditional clauses, things change (dramatically). They produce 2.27 and 7.86 times more tokens with the VPR variant in complement clauses, respectively. The last group (cf. the column ]<25%]) almost completely ignores its normal derivational path, which almost always involves the scrambling of the internal complement. Their share of the VR variant drops from 92.1% in conditional/relative clauses to 37.9% in complement clauses. This can be seen as a deliberate choice, which we can assume to be based on a functional intention, namely the marking of a high degree of clausal disintegration by means of the VPR variant.

4.5. The Size of the Raising Domain

There is one last issue we would like to briefly discuss. Regarding the sequence of ObjDPs and adverbs in the NR variant, Table 1 has shown that the conditional clauses with the inchoative adverb (“now”) behave rather differently from the conditional clauses with the evidential adverb wirklich (“really”). This, however, is not the only difference. While all variants that occur in sentence <17> also occur in sentence <15>, three variants, namely (34a–c), are exclusively found in sentence <15>.
stimulus <15>Spanish: Si tiene que vender la casa ahora, se va a poner muy triste.
Portuguese: Se ele tiver que vender a casa agora, ele vai ficar muito triste.
English: If he has to sell the house now, he will be very sorry.
(34)a.wannmutsvondaagdinHüsverköpe
ifyoumust-V1today-ADV[yourhouse]-ObjDPsell-V2
dann[0.4]wirsch trürigsene
then[0.4]willyouøsadbe
(Bol-9; m/43/MLG)
b.wanndeimutdieHüsverköpen
ifhemust-V1[thehouse]-ObjDPnow-ADVsell-V2
deiwird:[1.2]sehrsorrysene
hewill[1.2]verysorrybe
(USA-17; f/14/Engl>MLG-71%)
c.wannheidatHüsmutnuverköpe
ifhe[thehouse]-ObjDPmust-V1now-Advsell-V2
dannwirdheisehrtrürig
thenturnsheverysad
(Bra-6; f/23/MLG)
In these translations, the adverb (“now”) appears within the verb cluster. Twice, it is accompanied by the ObjDP, after the adverb in (34a) (two tokens) and in front of it in (34b) (13 tokens). In (34c) (nine tokens), the ObjDP dat Hüs (“the house”) has been scrambled to a rather high topic position before v-p-r. Despite the fact that there is some non-verbal material within the right-branching verb cluster, we consider (34c) to be an instance of the VR variant, since the non-verbal material is not the internal complement of the main verb. In any case, the three translations show that it is not only V2P that can be raised but that parts of the lower IP domain including at least AspPinchoative can also be raised. For (34b), we even have to assume that a topic phrase above AspPinchoative has been raised. The derivational tree for (34c) is illustrated in (35).
(35) Structural representation of the VR variant in (34c).
Languages 10 00095 i007
Crucially, not a single translation of sentence <17> with wirklich (“really”) presents a comparable serialization, i.e., there seems to be an upper limit for v-p-r (but cf. (36) for a possible counterexample). In this respect, it is indeed a telling fact that Wurmbrand (2017, p. 73—Table 8) mentions low adverbs and only low adverbs as possible interveners in a V1-V2-constellation, as in (34a-c).

5. Conclusions

There are three crucial aspects to this paper. At least with respect to the traditional generative framework, the first aspect is the fact that our analyses are based on a large dataset of real production data (in this case, translations). The second aspect is the use of the cartographic approach. It is the combination of this approach and the assumption that the structural link between the auxiliary verb and the main verb is of paramount importance that has allowed us to “explain” a substantial part of the variation in clause-final two-verb clusters in MLG. Table 4 makes it clear that the stronger this link is, the more marginal v-p-r is. Only within the resulting classes of auxiliary verbs do the actual positions of modal verbs or different forms of the auxiliary dune (“do”) in the split IP domain become decisive. The higher the position of the phrase headed by one of these auxiliary verbs, the smaller the odds ratios for visible v-p-r and thus for the V(P)R variants. These findings confirm the hypotheses in (10) and (11). The third crucial aspect concerns the different clause types. Here, we were able to provide empirical support for the hypotheses in (13) and (14). Regarding (13), the assumption that less integrated clause types allow for more visible v-p-r (higher odds ratios in Table 4), presumably because of the availability of more structural space in the higher IP domain and the CP domain, could be confirmed. Moreover, Table 6 supports the idea that the speakers of MLG deliberately use this structural space in order to signal clausal disintegration through the unscrambled VPR variant, the variant that most closely resembles the serialization found in an unintroduced root clause.
In the future, it will be important to compare the informants’ behavior with respect to two-verb clusters with their behavior with respect to clauses with three-verb clusters and also with respect to clauses with only one verb. The latter issue has already been analyzed by Kaufmann (2015; cf. also Kaufmann, 2016, pp. 234–258), where it was shown that, with respect to the type and definiteness of the internal complement and to the type of subordinate clause, some MLG tokens with the strongly marked sequence verb-ObjDP/PP behave exactly like the VPR variant in Table 6. For three-verb clusters, analyses were also carried out (cf. Kaufmann, 2016, pp. 219–227; Kaufmann, 2007, pp. 185–193). What remains to be conducted is the discussion of three-verb clusters within the cartographic approach. A crucial question, for example, is whether we should locate both auxiliary verbs in the same IP domain, as Cinque (2004, pp. 139–140) does, or whether we should assume two separate IP domains. That this will not be an easy task can be seen in translations where the evidential adverb wirklich (“really”) appears unexpectedly after the finite verb, i.e., apparently within the three-verb cluster, a constellation that did not occur even once in clauses with two-verb clusters (cf. Section 4.5). In a token like (36), either the raised domain in the subordinate clause is much larger than in two-verb clusters, or the finite verb hat (“had”) moves higher up, perhaps even to a head position in the CP domain.
stimulus <19>Spanish: Si realmente hubiera querido escribir esta carta habría encontrado tiempo.
English: If he really had wanted to write this letter, he would have found the time.
(36) wannheihatwirklichdiesenBriefschriewen
ifhehad-V1really-ADV[thisletter]-ObjDPwrite-V3
[0.5]wollt[0.3]hatheiTiedgefungen
[0.5]wanted-V2[0.3]hadhetimefound
(Mex-43; m/31/MLG)
Most importantly though, our findings should be compared with other Continental West Germanic varieties. Do we find the same differences with regard to Dutch, Flemish, or Swiss German auxiliary verbs? And what about the translations into Pomerano (cf. Notes 1 and 16)? Although visible v-p-r is not as common in this Brazilian variety of Low German as it is in MLG, it does occur, hardly ever with daua, the Pomerano cognate of dune (“do”), but quite often in two-verb clusters with modal verbs—and here again the odds ratios for the V(P)R variants are significantly greater with the deontic modal verb expressing obligation than with the dynamic modal verb expressing possibility.

Funding

This research received no external funding.

Informed Consent Statement

Informant consent was waived because the research is based only on translations of given stimulus sentences.

Data Availability Statement

The Mennonite Low German dataset is publicly available under http://agd.ids-mannheim.de/MEND_extern.shtml (cf. Kaufmann 2018).

Acknowledgments

I would like to thank three anonymous reviewers for their very helpful comments, and Sandra Hansen and Felix Bildhauer for their statistical expertise.

Conflicts of Interest

The author declares no conflicts of interest.

Notes

1
Aside from this, 375 speakers of Pomerano from Brazil (Rio Grande do Sul, Santa Catarina, Espírito Santo, and Rondônia) and 15 speakers of Pomeranian from the USA (Wisconsin) were asked between 2017 and 2023 to translate 61 stimulus sentences, among them the 46 original stimuli (cf. Kaufmann, 2022, 2023 for first analyses). With this second dataset, we will be able to see whether the factors that influence the variation in Mennonite Low German (MLG) verb clusters are specific to this variety or whether they also influence the variation in another Low German variety spoken in the Americas (cf. Note 16 for a first comparison).
2
In order to gauge the informants’ subjective competence in MLG, Standard German (StG), and the majority languages, two questions were asked, namely (i) how well do you speak MLG/StG/the majority language? and (ii) which is the language you speak best, second-best, third-best? The answers to these questions were pooled, resulting in a scale from 0 (no knowledge at all) to 14 points (full competence of a native speaker; cf. Kaufmann, 1997, pp. 135–138 for more detailed explanations). For all informants, the average value for their competence in MLG is 12.4 points (between 13.8 points in Bolivia and 11.7 points in the USA). The average value for StG is 7.9 points (between 11.7 points in Fernheim, Paraguay, and 5.4 points in the USA). For the majority languages, the average value for all informants is 9.2 points (11.4 points for English in the USA; 11.1 points for Portuguese in Brazil; for Spanish, the competence ranges from 7 points in Fernheim, Paraguay to 8.1 points in Mexico).
3
The stimulus sentences are provided in both the actually used stimulus language (Portuguese and Spanish in (1a–c)) and, in all cases, in English. The transliteration of the translations does not claim phonetic accuracy. Many differences to StG are not visualized in order to make recognition easier for readers with knowledge of StG. (Un)filled pauses are indicated in brackets (e.g., [0.5] or [äh]; in the glosses [eh]). Break-offs and repairs are marked with a hyphen. Underlined elements in the glosses represent semantic deviations from the stimulus sentence; a ø represents a missing element. Elements that the informants produced even though they do not appear in the stimulus are crossed out. This rule does not apply to added auxiliaries such as dune (“do”) and woare(n) (“will”).
4
Aside from the informants’ origin (Mex = Mexico; USA = USA; Bra = Brazil; Men = Menno, Paraguay; Fern = Fernheim, Paraguay; Bol = Bolivia) and coding number, their gender (m = male; f = female), their age in years, and their language competence(s) are indicated. If the informant considers MLG as his/her dominant language, the label MLG is used as in (1a–c). Ambilingual informants are represented by, for example, MLG+Port as in (5a). If StG or the respective majority language is the dominant language, the label used is StG>MLG, Engl>MLG, etc. In these cases, the actual level of knowledge in MLG is further specified. For example, informant USA-6 of (18) evaluates his competence in MLG with 11 out of 14 points. This leads to the label Engl>MLG-79%.
5
While the infinitival suffix -en is typical for the Chortitza variety, the suffix -e is typical for the Molotschna variety (cf. Siemens, 2012, pp. 47–52). When both varieties are discussed, n will be put in brackets.
6
In general, this difference is obviously not irrelevant. Aboh (2015, p. 308), for example, mentions the possibility that feature-driven internal merge may be derivationally more costly and thus more complex than base generation via external merge, a position promoted by Cinque (2004) with respect to cartography. However, since we are only interested in the (final) functional height of the auxiliary verb, we will not discuss this issue any further. After all, it is this height that may influence the visibility of v-p-r, a type of movement that we do not assume to take place in the narrow syntax (cf. the discussion of (3)).
7
Boulin (2017b, p. 36; cf. also Zifonun et al., 1997, p. 358) characterizes a central reading of now in the following way: “We conclude that the deictic adverb now can be considered to have an aspectual function: it is an inchoative marker coding contrast between two situations”. The two situations in the stimulus sentence <15> are the status quo of not being sorry and a possible status quo postea of being sorry. The game changer in this case would be the need to sell the house.
8
Our analysis implies—contrary to assumptions mentioned by Grewendorf and Sternefeld (1990, p. 13)—that adverbs, unlike ObjDPs, do not move. Together with Pollock (1989) and Cinque (1999), we assume that different adverbs are base generated in different structural positions. In our view, the surface sequence ObjDP–adverb–V2 is the visible consequence of scrambling the ObjDP over the adverb.
9
Broekhuis and den Besten (1989; translated in Haegeman 1994, p. 512) argue that “[…] VR is interpreted as the limiting case of VPR, an instantiation of VPR where all nonverbal material has been scrambled out of the adjoined VP”. Penner (1990, p. 170) writes: “Returning now to VPR, the analysis states that the upper verb (e.g., modal) moves to AGR20 and then to TNS0 to pick up inflectional features. In order for the RV [raising verb] to be properly procliticized, the whole V2P is raised to the right side of TNSP via successive adjunction to each maximal projection. Simultaneously, NP1, NP2 and, in principle, any non-verbal constituent in V2P may be scrambled out, i.e., adjoined to the left of the higher A’-position, if the result is compatible with what the information structure requires”.
10
Compare also Cinque (1999, p. 34) and Ernst (2002, p. 16) for different readings of really.
11
It is important to note that the schemes in (7), (9), and (12) all lean to the left. This is due to the fact that the functional phrases in the IP domain of the respective reference languages are head-initial. As mentioned in Section 3.1, this is different in MLG. Nevertheless, we will use these representations for illustrative purposes.
12
We have located in Aspinceptive(II) and not in Aspinceptive, which is above Modobligation. The reason for this is that Shlonsky represents Aspinceptive with the adverb suddenly, which does not fit at all the meaning of now in stimulus sentence <15>. Likewise, suddenly does not fit the meaning of the inchoative auxiliary woare(n) (“will”; cf. (8b)), which we also locate in Aspinceptive(II). Due to its clearly inchoative semantics and the fact that woare(n) is far from obligatory in future tense contexts, we cannot locate it in the hightense phrase T(Future).
13
This fact contradicts Ernst (2002, p. 10), who assumes that now is a sentential adjunct located in the highest zone of the clause, most probably in the CP domain.
14
One asterisk * means that SPSS calculates the probability for a Type I-error between 1% and 5% (0.01 ≤ p < 0.05), two asterisks ** indicate that the probability is smaller than 1% (p < 0.01), and three asterisks *** show that it is close to 0% (p < 0.001). One asterisk in brackets (*) indicates a statistical tendency with an error margin of 5% to 10% (0.05 ≤ p < 0.1).
15
Crucially, when comparing the translations with mute(n) (“must”) and solle(n) (“should”), there is no measurable difference in the frequency of v-p-r.
16
One aspect of this irrealis reading is that dune (“do”), just like its German cognate tun in Europe (cf. Schwitalla, 2006, pp. 143–145), may contain a certain future meaning. Daua, the Pomerano cognate of dune, is also an example for this future-marking ability (cf. Postma, 2019, p. 149). Importantly, however, the Pomerano translations clearly demonstrate that the future meaning of daua is stronger than that of dune in MLG. In total, 21 of 301 Pomerano translations of sentence <15> feature daue (7%), while this share is just 1.9% for MLG dune (5 of 270 tokens; cf. Table 2). Crucially, waara, the Pomerano cognate of MLG woare(n) (“will”), does not appear a single time in the translations of sentence <15>, while MLG woare(n) is rather frequent (18 tokens, i.e., 6.7% of the 270 tokens).
17
Quite fittingly, Boulin (2017a, p. 13) writes the following with regard to Chinese translations of English sentences: “Inchoative verbs mark an opposition between two situations, and their use in the translations of sentences with now suggests that now possesses similar semantic properties, i.e., it expresses change”.
18
The modal verb in sentence <15> does not represent Modnecessity, which is located high up in (9), because that phrase refers to alethic (logical) necessity and not to deontic necessity (the subject’s necessity to do something).
19
Granted, the fact that dune in MLG codes different types of imperfective aspects is not that surprising. Cinque (2017, p. 542), for example, writes: “So for example the Progressive, the Habitual and the Generic aspects (which apparently occupy different positions in the functional sequence of the clause—cf. Cinque, 1999), plausibly have a common core meaning”.
20
This is not the case in the South American communities. But even there, causal clauses are by far the context most compatible with visible v-p-r and especially with the VPR variant (cf. Kaufmann, 2016, pp. 292–296).
21
Crucially, some types of strongly disintegrated clauses can only appear in the postfield, for example, continuative relative clauses or consecutive clauses. The fact that it is impossible for these clauses to appear in the highly integrated prefield highlights their disintegratedness.
22
With regard to the categories obligation (including non-alethic necessity), possibility (including both ability and, in some few cases, permission), and volition, it is important to keep in mind that each category represents two modal verbs. Mute(n) actually stands for 352 tokens with solle(n) and just 277 tokens with mute(n); könne(n) stands for 506 tokens with könne(n) and 19 tokens with dürfe(n) (“may”). Finally, wolle(n) stands for 601 tokens with wolle(n) and twelve tokens with möge(n) (“like”). The syntactic behavior of these pairs was thoroughly checked, and no difference whatsoever could be discovered.
23
MLG dat is not only the deictic counterpart of MLG et (“it”, cf. the StG cognates das and es), but it has effectively replaced it. Thiessen (2003, p. 393), for example, translates English it as daut (a spelling variant).
24
In generalized linear mixed models, we obtain so-called pseudo-R-squared values. Because it is more difficult to interpret them than R2 values in linear regression analyses, we will always put the word “explain” in quotation marks.
25
The estimate for the random effect is 1.957 (confidence interval [1.554; 2.465]); its Z-value is 8.497 ***.
26
Tests for multicollinearity were always performed. In all cases, the Variance Inflation Factor (VIF) never reached the critical value of 3.
27
The estimate for the random effect is 1.44 (confidence interval [1.04; 1.933]); its Z-value is 6.021 ***.

References

  1. Aboh, E. O. (2015). The emergence of hybrid grammars: Language contact and change. Cambridge University Press. [Google Scholar]
  2. Adli, A., García García, M., & Kaufmann, G. (2015). System and usage: (Never) mind the gap. In A. Adli, M. García García, & G. Kaufmann (Eds.), Variation in language: System- and usage-based approaches (pp. 1–25). de Gruyter. [Google Scholar]
  3. Auer, P. (1998). Zwischen Parataxe und Hypotaxe: Abhängige Hauptsätze im gesprochenen und geschriebenen Deutsch. Zeitschrift für Germanistische Linguistik, 26, 284–307. [Google Scholar]
  4. Bach, E., Brown, C., & Marslen-Wilson, W. (1987). Gekreuzte und geschachtelte Abhängigkeiten im Deutschen und Niederländischen. In J. Bayer (Ed.), Grammatik und Kognition (pp. 7–23). Westdeutscher Verlag. [Google Scholar]
  5. Barbiers, S. (2005). Word order variation in three-verb clusters and the division of labour between generative linguistics and sociolinguistics. In L. Cornips, & K. P. Corrigan (Eds.), Syntax and variation: Reconciling the biological and the social (pp. 233–264). John Benjamins. [Google Scholar]
  6. Belletti, A. (2004). Aspects of the low IP area. In L. Rizzi (Ed.), The structure of CP and IP: The cartography of syntactic structures (volume 2, pp. 16–51). Oxford University Press. [Google Scholar]
  7. Biskup, P. (2006a, April 9–10). Scrambling in Czech: Syntax, semantics and information structure. NWLC 21 (UBC Working Papers in Linguistics) (pp. 1–15), Vancouver, BC, Canada. [Google Scholar]
  8. Biskup, P. ((2006b,, May). Scrambling in Czech. CamLing 2005, Cambridge, UK. [Google Scholar]
  9. Biskup, P. (2007). Phase featuring-driven EPP-features and EPP-feature-driven subjacency in Czech. In P. Kosta, & L. Schürcks (Eds.), Linguistic investigations into formal description of Slavic languages (pp. 117–125). Peter Lang. [Google Scholar]
  10. Boeckx, C. (2011). Approaching parameters from below. In A. M. di Sciullo, & C. Boeckx (Eds.), The Biolinguistic enterprise: New perspectives on the evolution and nature of the human language faculty (pp. 205–221). Oxford University Press. [Google Scholar]
  11. Boulin, M. (2017a). Now and xianzai: A contrastive study of two deictic adverbs. Languages in Contrast, 17(1), 1–17. [Google Scholar] [CrossRef]
  12. Boulin, M. (2017b). Temporal Deixis and Anaphora in English and Mandarin Chinese: A contrastive study of now and then and their Mandarin equivalents [Unpublished Ph.D. dissertation, Paris Diderot University (Paris 7)]. Available online: https://theses.hal.science/tel-01599127/file/THESE%20REVUE%20MYRIAM%20BOULIN.pdf (accessed on 13 March 2025).
  13. Bousquette, J., Eide, K. M., Hjelde, A., & Putnam, M. T. (2021, October 10–12). Competition at the left edge: Left-dislocation vs. topicalization in Heritage Germanic. 10th Workshop on Immigrant Languages in the Americas (pp. 11–21), Somerville, MA, USA. [Google Scholar]
  14. Broekhuis, H. (2007). Object shift and subject shift. The Journal of Comparative Germanic Linguistics, 10, 109–141. [Google Scholar]
  15. Broekhuis, H., & den Besten, H. (1989). Woordvolgorde in de werkwoordelijke eindreeks. GLOT, 12, 79–137. [Google Scholar]
  16. Bybee, J. L., Perkins, R. D., & Pagliuca, W. (1994). The evolution of grammar: Tense, aspect, and modality in the languages of the world. The University of Chicago Press. [Google Scholar]
  17. Chomsky, N. (2001). Derivation by phase. In M. Kenstowicz (Ed.), Ken Hale: A Life in Language (pp. 1–52). MIT Press. [Google Scholar]
  18. Christensen, M. H., Christensen, T. K., & Jensen, T. J. (2020). Foregrounding of subordinate clauses by word order: Psycholinguistic evidence of the function of V>Adv (V2) word order in Danish. Linguistics, 58(1), 245–273. [Google Scholar] [CrossRef]
  19. Christensen, T. K., & Jensen, T. J. (2022). When variants lack semantic equivalence: Adverbial subclause word order. In T. K. Christensen, & T. J. Jensen (Eds.), Explanations in sociosyntactic variation (pp. 171–206). Cambridge University Press. [Google Scholar]
  20. Cinque, G. (1999). Adverbs and functional heads: A cross-linguistic perspective. Oxford University Press. [Google Scholar]
  21. Cinque, G. (2004). “Restructuring” and functional structure. In A. Belletti (Ed.), Structures and beyond: The Cartography of syntactic structures (volume 3, pp. 132–191). Oxford University Press. [Google Scholar]
  22. Cinque, G. (2017). On the status of functional categories (heads and phrases). Language and Linguistics, 18(4), 521–576. [Google Scholar] [CrossRef]
  23. De Sutter, G. (2009). Towards a multivariate model of grammar: The case of word order variation in Dutch clause final verb clusters. In A. Dufter, J. Fleischer, & G Seiler (Eds.), Describing and modeling variation in grammar (pp. 225–254). de Gruyter. [Google Scholar]
  24. Ernst, T. (2002). The Syntax of Adjuncts. Cambridge University Press. [Google Scholar]
  25. Evans, M. J., & Wimmer, R. (1990). Searle’s theory of proper names, from a linguistic point of view. In A. Burkhardt (Ed.), Speech acts, meaning and intentions: Critical approaches to the philosophy of John R. Searle (pp. 259–278). de Gruyter. [Google Scholar]
  26. Fabricius-Hansen, C. (1992). Subordination. In L. Hoffmann (Ed.), Deutsche Syntax: Ansichten und Aussichten (pp. 458–483). de Gruyter. [Google Scholar]
  27. Frey, W. (2011). Peripheral adverbial clauses, their licensing and the prefield in German. In E. Breindl, G. Ferraresi, & A. Volodina (Eds.), Satzverknüpfungen: Zur Interaktion von Form, Bedeutung und Diskursfunktion (pp. 41–77). de Gruyter. [Google Scholar]
  28. Grewendorf, G. (1995). Präsens und Perfekt im Deutschen. Zeitschrift für Sprachwissenschaft, 14, 72–90. [Google Scholar] [CrossRef]
  29. Grewendorf, G., & Sternefeld, W. (1990). Scrambling theories. In G. Grewendorf, & W. Sternefeld (Eds.), Scrambling and barriers (pp. 3–37). John Benjamins. [Google Scholar]
  30. Haegeman, L. (1994). Verb raising as verb projection raising: Some empirical problems. Linguistic Inquiry, 25(3), 509–522. [Google Scholar]
  31. Haegeman, L. (2012). Adverbial clauses, main clause phenomena, and the composition of the left periphery. Oxford University Press. [Google Scholar]
  32. Haider, H. (1995). Downright down to the right. In U. Lutz, & J. Pafel (Eds.), On Extraction and Extraposition in German (pp. 245–272). John Benjamins. [Google Scholar]
  33. Haider, H. (2010). The syntax of German. Cambridge University Press. [Google Scholar]
  34. Hilpert, M. (2013). Satz und Text. In P. Auer (Ed.), Sprachwissenschaft: Grammatik–Interaktion–Kognition (pp. 187–216). Metzler. [Google Scholar]
  35. Hinterhölzl, R. (2018). Focus und Antifocus: Zum Dauerbrenner Scrambling in der Syntax des Deutschen. In E. Leiss, & S. Zeman (Eds.), Die Zukunft von Grammatik—Die Grammatik der Zukunft: Festschrift für Werner Abraham (pp. 217–234). Stauffenburg. [Google Scholar]
  36. Hopp, H., & Putnam, M. T. (2015). Syntactic restructuring in heritage grammars: Word order variation in Moundridge Schweitzer German. Linguistic Approaches to Bilingualism, 5(2), 180–214. [Google Scholar] [CrossRef]
  37. Kaufmann, G. (1997). Varietätendynamik in Sprachkontaktsituationen: Attitüden und Sprachverhalten rußlanddeutscher Mennoniten in Mexiko und den USA. Peter Lang. [Google Scholar]
  38. Kaufmann, G. (2003). The verb cluster in Mennonite Low German. In K. J. Mattheier, & W. Keel (Eds.), German language varieties worldwide: Internal and external perspectives (pp. 177–198). Peter Lang. [Google Scholar]
  39. Kaufmann, G. (2007). The verb cluster in Mennonite Low German: A new approach to an old topic. Linguistische Berichte, 210, 147–207. [Google Scholar]
  40. Kaufmann, G. (2015). Rare phenomena revealing basic syntactic mechanisms: The case of unexpected verb–object sequences in Mennonite Low German. In A. Adli, M. García García, & G. Kaufmann (Eds.), Variation in language: System- and usage-based approaches (pp. 113–146). de Gruyter. [Google Scholar]
  41. Kaufmann, G. (2016). The world beyond verb clusters: Aspects of the syntax of Mennonite Low German. Unpublished Habilitation Treatise. Available online: http://paul.igl.uni-freiburg.de/kaufmann/userfiles/downloads/Formatiert%20(Englisch).pdf (accessed on 13 March 2025).
  42. Kaufmann, G. (2018). Mennonitenplautdietsch in Nord- und Südamerika (MEND). In Archiv für Gesprochenes Deutsch. Institut für Deutsche Sprache. Available online: http://agd.ids-mannheim.de/MEND_extern.shtml (accessed on 13 March 2025).
  43. Kaufmann, G. (2022). In the thick of it: Scope rivalry in past counterfactuals of Pomerano. The Journal of Comparative Germanic Linguistics, 25(3), 333–384. [Google Scholar] [CrossRef] [PubMed]
  44. Kaufmann, G. (2023). Aspectual marking in three (Low) German varieties from Southern Brazil. In B. Hans-Bianchi, & B. Vogt (Eds.), Deutsch im Kontakt: Neue empirische Studien zu Kontaktphänomenen und -szenarien in der Gegenwart (pp. 335–393). Olms. [Google Scholar]
  45. Labov, W. (2001). Principles of linguistic change: Social factors. Blackwell. [Google Scholar]
  46. Langacker, R. W. (2009). Investigations in cognitive grammar. de Gruyter. [Google Scholar]
  47. Lehmann, C. (1984). Der Relativsatz: Typologie seiner Strukturen, Theorie seiner Funktionen, Kompendium seiner Grammatik. Narr. [Google Scholar]
  48. Leiss, E. (1992). Die Verbalkategorien des Deutschen: Ein Beitrag zur Theorie der sprachlichen Kategorisierung. de Gruyter. [Google Scholar]
  49. Lenerz, J. (1993). Zu Syntax und Semantik deutscher Personalpronomina. In M. Reis (Ed.), Wortstellung und Informationsstruktur (pp. 117–153). Niemeyer. [Google Scholar]
  50. Lötscher, A. (1978). Zur Verbstellung im Zürichdeutschen und in anderen Varianten des Deutschen. Zeitschrift für Dialektologie und Linguistik, 45(1), 1–29. [Google Scholar]
  51. MacDonald, J. E. (2008). The syntactic nature of inner aspect: A minimalist perspective. John Benjamins. [Google Scholar]
  52. Musan, R. (2002). The German perfect: Its semantic composition and its interactions with temporal adverbials. Kluwer. [Google Scholar]
  53. Penner, Z. (1990). On the acquisition of verb placement and verb projection raising in Bernese Swiss German. In M. Rothweiler (Ed.), Spracherwerb und Grammatik: Linguistische Untersuchungen zum Erwerb von Syntax und Morphologie (pp. 166–189). Westdeutscher Verlag. [Google Scholar]
  54. Pollock, J.-Y. (1989). Verb movement, Universal Grammar, and the structure of IP. Linguistic Inquiry, 20(3), 365–424. [Google Scholar]
  55. Postma, G. (2019). A contrastive grammar of Brazilian Pomeranian. John Benjamins. [Google Scholar]
  56. Quarezemin, S. (2020). Cartography, left periphery and criterial positions: An interview with Luigi Rizzi. Documentação de Estudos em Lingüística Teórica e Aplicada, 36, 1–19. [Google Scholar] [CrossRef]
  57. Ramchand, G., & Svenonius, P. (2014). Deriving the functional hierarchy. Language Sciences, 46, 152–174. [Google Scholar] [CrossRef]
  58. Reis, M. (1997). Zum syntaktischen Status unselbständiger Verbzweit-Sätze. In C. Dürscheid, K. H. Ramers, & M. Schwarz (Eds.), Sprache im Fokus: Festschrift für Heinz Vater zum 65. Geburtstag (pp. 121–144). Niemeyer. [Google Scholar]
  59. Rizzi, L. (2004). On the cartography of syntactic structures. In L. Rizzi (Ed.), The structure of CP and IP: The cartography of syntactic structures (volume 2, pp. 3–15). Oxford University Press. [Google Scholar]
  60. Rizzi, L. (2013). Notes on cartography and further explanation. Probus, 25(1), 197–226. [Google Scholar]
  61. Rizzi, L., & Cinque, G. (2016). Functional categories and syntactic theory. Annual Review of Linguistics, 2, 139–163. [Google Scholar] [CrossRef]
  62. Roberts, I. (2010). Agreement and head movement: Clitics, incorporation, and defective goals. MIT Press. [Google Scholar]
  63. Schwitalla, J. (2006). Kommunikative Funktionen von tun als Hilfsverb. In S. Günthner, & W. Imo (Eds.), Konstruktionen in der Interaktion (pp. 127–151). de Gruyter. [Google Scholar]
  64. Seggewiß, F. (2012). Current changes in English modals: A corpus-based analysis of present-day spoken English [Unpublished Ph. D. Dissertation, University of Freiburg]. Available online: https://freidok.uni-freiburg.de/fedora/objects/freidok:9654/datastreams/FILE1/content (accessed on 13 March 2025).
  65. Shlonsky, U. (2010). The cartographic enterprise in syntax. Language and Linguistics Compass, 4(6), 417–429. [Google Scholar] [CrossRef]
  66. Siemens, H. (2012). Plautdietsch: Grammatik, Geschichte, Perspektiven. Tweeback. [Google Scholar]
  67. Sternefeld, W. (2008). Syntax: Eine morphologisch motivierte generative Beschreibung des Deutschen. Stauffenburg. [Google Scholar]
  68. Thiessen, J. (2003). Mennonite Low German dictionary—Mennonitisch-plattdeutsches Wörterbuch. Max Kade Institute. [Google Scholar]
  69. Vanden Wyngaerd, G. (1989). Verb projection raising and the status of infinitival complements. In D. Jaspers, W. Klooster, Y. Putseys, & P. Seuren (Eds.), Sentential complementation and the lexicon: Studies in honour of Wim de Geest (pp. 423–438). Foris. [Google Scholar]
  70. von Stechow, A. (1990). Status government and coherence in German. In G. Grewendorf, & W. Sternefeld (Eds.), Scrambling and barriers (pp. 143–198). John Benjamins. [Google Scholar]
  71. Wiltschko, M. (2014). The universal structure of categories: Towards a formal typology. Cambridge University Press. [Google Scholar]
  72. Wurmbrand, S. (2001). Infinitives: Restructuring and clause structure. de Gruyter. [Google Scholar]
  73. Wurmbrand, S. (2017). Verb clusters, verb raising, and restructuring. In M. Everaert, & H. van Riemsdijk (Eds.), The Blackwell companion to syntax (pp. 1–109). Wiley-Blackwell. [Google Scholar] [CrossRef]
  74. Zifonun, G., Hoffmann, L., & Strecker, B. (1997). Grammatik der deutschen Sprache (3 volumes). de Gruyter. [Google Scholar]
  75. Zimmermann, M. (2011). On the functional architecture of DP and the feature content of pronominal quantifiers in Low German. Journal of Comparative German Linguistics, 14, 203–240. [Google Scholar] [CrossRef]
  76. Zwart, J.-W. (1996). Verb clusters in Continental West Germanic dialects. In J. R. Black, & V. Motapanyane (Eds.), Microparametric syntax and dialect variation (pp. 229–258). John Benjamins. [Google Scholar]
Table 2. Distribution of the NR variant and the V(P)R variants in sentence <15> depending on different auxiliary verbs.
Table 2. Distribution of the NR variant and the V(P)R variants in sentence <15> depending on different auxiliary verbs.
mute(n)
(‘must’)
woare(n)
(‘will’)
dune
(‘do’)
n (tokens)247185
ObjDP-V2-V111314
45.7%5.6%80%
χ2(2, n = 270) = 13.7; p < 0.001 ***/Cramer’s V: 0.23/2 cells (33.3%) with less than 5 expected tokens
Fisher’s Exact Test(n = 270) = 15; p < 0.001 ***
(ObjDP)-V1-(ObjDP)-V2134171
54.3%94.4%%20%
Table 5. Generalized linear mixed model for scrambling of the internal complement in complement, conditional, and relative clauses (only tokens with the V(P)R variants).
Table 5. Generalized linear mixed model for scrambling of the internal complement in complement, conditional, and relative clauses (only tokens with the V(P)R variants).
definitenessobject typeclause typeMLG
F: 34 ***F: 17.8 ***F: 10.2 ***F: 3.1 (*)
indefinite ObjDP/PP (10.45 ***)ObjPP (3.82 ***)complement (+dat) (4.27 ***)
complement (-dat) (3.81 ***)
definite ObjDP/PPObjDP±extraposed relative
−dislocated relative (-der)
±dislocated relative (+der)
conditional (-dann)
conditional (+dann)
disintegrated conditional
demonstrative pronoun (0.148 ***) MLG (0.9 (*))
Table 6. Scrambling behavior of 167 informants in relative/conditional clauses and in complement clauses (only tokens with the V(P)R variants).
Table 6. Scrambling behavior of 167 informants in relative/conditional clauses and in complement clauses (only tokens with the V(P)R variants).
share of VPR variant[>75%[[60%]]<60%; >40%[[40%]]<25%]
n (informants)1722243965
n (relative + conditional)116170175278407
VPR variant104114868832
89.7%67.1%49.1%31.7%7.9%
ratio (compl/rel + cond)1.031.391.622.277.86
n (complement)547358100132
VPR variant5068467282
92.6%93.2%79.3%72%62.1%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kaufmann, G. Two-Verb Clusters in Mennonite Low German: The Impact of Auxiliary Verb and Clause Type. Languages 2025, 10, 95. https://doi.org/10.3390/languages10050095

AMA Style

Kaufmann G. Two-Verb Clusters in Mennonite Low German: The Impact of Auxiliary Verb and Clause Type. Languages. 2025; 10(5):95. https://doi.org/10.3390/languages10050095

Chicago/Turabian Style

Kaufmann, Göz. 2025. "Two-Verb Clusters in Mennonite Low German: The Impact of Auxiliary Verb and Clause Type" Languages 10, no. 5: 95. https://doi.org/10.3390/languages10050095

APA Style

Kaufmann, G. (2025). Two-Verb Clusters in Mennonite Low German: The Impact of Auxiliary Verb and Clause Type. Languages, 10(5), 95. https://doi.org/10.3390/languages10050095

Article Metrics

Back to TopTop