3. Searching the Corpora
We searched for ICC imperatives using NoSketch engine at
clarin.si (accessed on 1 August 2024) in two written corpora: Gigafida 2.0, a reference corpus of written Standard Slovene,
7 and Janes 1.0, a corpus of social media texts (
Erjavec et al., 2018).
8 The procedure involved two steps. We first constructed and deployed the CQL search string shown in (14). This yielded 1414 hits, with 410 in Gigafida and 1004 in Janes. The hits contained many false positives, however; narrowing down the result set further automatically turned out to be unfeasible. We proceeded with manual filtering, which led to 98 sentences with imperative ICC clauses, 37 in Gigafida and 61 in Janes. In what follows, we comment on the construction of the search string, before implementing manual filtering. The subsequent categorization of the final results into three distinct discourse environments is presented in
Section 4.
The overall composition of the search string in (14) is as follows: We search for a string of tokens beginning with a sentence-initial clitic (lines 1–2) and ending with an imperative verb (line 4). The clitic and the verb may be adjacent, or separated by any number of non-verbal tokens except punctuation (line 3). The matched tokens should be found in a single sentence (line 5).
(14) | 1 ( <s> [word="(?i)me|mi|ga|mu|ji|jo|ju|jima|jih|jim|se|si"] |
| 2 | [word="Me|Mi|Ga|Mu|Ji|Jo|Ju|Jima|Jih|Jim|Se|Si"] ) |
| 3 [!(tag="G.*"|tag="U")]{0,} |
| 4 [tag="Gg.v.*"] |
| 5 within ( <s/> !containing [word="\?|:\)|;\)|:-D|:\(|\[.*"] ) |
While we relied on corpus tags to recognize imperative verbs (line 4), it turned out to be more practical to recognize clitics by listing them explicitly (lines 1–2). Note that the list does not include verbal auxiliary clitics, which do not occur in imperatives. We also decided against searching for second person singular clitic personal pronouns ‘ti’
2sg.dat and ‘te’
2sg.acc, as they have non-clitic homographs ‘ti’ “you”
1sg.nom and ‘te’ “this”
fem.sg.nom/acc, which leads to too many false positives. However, the exclusion is innocuous: non-nominative, second person pronouns are not expected to occur in imperative clauses, as that would lead to the Principle B violation.
9 In non-subject positions, the addressee is referred to using reflexive pronouns ‘si’
2sg.dat.refl and ‘se’
2sg.acc.refl, which were included in the list.
As corpus annotations of sentence boundaries are not perfect, we searched for ICCs using not only the <s> tag immediately preceding a case-insensitive regular expression catching the clitics (line 1), but also by searching for title-cased clitics (line 2). We have decided against searching only for the latter as there is no guarantee that the authors of the non-proof-read social media texts would use capitalization as required by the norm. The non-perfect annotation of sentence boundaries is also why we explicitly excluded matches containing emoticons (line 5). These are often used as final punctuation in social media texts, which is not reflected in corpus annotations.
As mentioned above, most of the hits yielded by the automatic search were false positives and needed to be filtered out manually. The most common reason for false positives was that the clitic cluster was not actually sentence-initial. In some examples, the sentence-initial phrase occurs in the left context, so the false positive presumably occurs due to imperfections of the sentence segmentation algorithm. Most often, however, the corpus does not contain the initial word of a post at all, i.e., the initial word is missing and the corpus sentence starts with a lowercased word. Most commonly, the missing word is ‘Bog’ “God” (15a), but there are also examples with proper names (15b) and other words, mainly nouns (15c).
(15) | a. | Bog ji pomagaj! |
| | God her help |
| | “God help her!” |
| b. | Kučan si lasti zdej tudi osamosvojitev. |
| | Kučan refl owns now also independence. |
| | “Kučan now wants to own the independence, as well.” |
| c. | Lupčka mi dejte sm reku! |
| | kiss to me give aux said |
| | “Give me a kiss, I said!” |
The missing-proper-name situation typically occurred in social media posts (especially Twitter/X), presumably because the corpus parser misinterpreted the combination of a user handle (@username) and the real (and capitalized) initial word of the sentence as irrelevant, perhaps as a part of the general strategy for avoiding the inclusion of proper names in the corpus (although normally, these are replaced by [per], for “personal name”). Whereever possible, we verified that the sentence-initial word was indeed missing by locating the original sources. As virtually all of the problematic examples from the social media corpus we could access were confirmed to exhibit the missing-initial-word problem, we decided to disregard the problematic lowercase-initial social media posts whose sources could not be found. (On Twitter/X, the posts may be unavailable either because of the author’s privacy settings, or because the author has unsubscribed from the service.)
The second most common reason for false positives was that the intended ICC contained the non-clitic pronoun ‘mi’ “we”
1pl.nom, homographic with the target clitic pronoun ‘mi’ “I”
1sg.dat, as illustrated in (16). Note that strategy of removing ‘mi’ from the list of allowed clitics, deployed for ‘ti’ and ‘te’, could not have been applied here, as the clitic ‘mi’ can and does occur in imperative sentences.
(16) | Mi | pa | se | podajmo | nekoliko | južneje | … |
| We | pa | refl | go | somewhat | more.south | … |
| “As for us, let’s go somewhat more south …” |
We encountered several other common sources of easily recognizable false positives. Some intended ICCs contained ‘še’ spelled as ‘se’ (without the hacek, 17). Sometimes, a word was misannotated as an imperative verb due to homonymy—most commonly ‘spomnite’ “remember”, homonymous between imperative and indicative 2nd person plural (18), and ‘recimo’ “say”, an archaic first person plural imperative which functions as a particle used for (i.a.) introducing hypotheticals and exemplification (19). Several sentences exhibited poetic, metrum-oriented word order (20). We also found examples of foreign words and proper names homographic to Slovene clitics, and occasionally, the target clitic pronoun was capitalized as the beginning of a multi-word proper name, such as a book title or a product name.
(17) | se→še | enkrat | preber-i | kaj | si | napisu |
| refl→additionally | once | read-imp | what | aux.2sg | write.ptcp.m.sg |
| “Read what you wrote once again.” |
(18) | Se | še | spomn-i-te | naših | bendov? |
| refl | still | remember-tv.prs-2pl | our | bands |
| “Do you still remember our bands?” |
(19) | Si | recimo | predstavlj-a-te, | … ? |
| refl | for.example | imagine-tv.prs-2pl | |
| “Can you for example imagine that … ?” |
(20) | Me | groba | noč | in | strah | obda-j | … |
| | rough | night | and | fear | surround-imp | |
| “Let the crude night and fear surround me …” |
Filtering out the easily recognizable false positives left us with 129 candidate ICC imperative sentences. We could assign most of these to one of the three categories, which are described in
Section 4. There was, however, a residue of 31 sentences that fit into none of the developed categories, and which we at the same time judged to be, at most, marginally acceptable. Given that the categorization of the acceptable ICC imperatives turned out to be based on discourse, we investigated the linguistic context of these marginally acceptable examples in more detail.
Nine of the problematic sentences were preceded by ‘–’, indicating that they occur as an item in a list (21). Typically, other sentences beginning with ‘–’ occured in their vicinity in the corpus, reaffirming this conclusion. An inspection of the original source, when it could be located, showed that we are most often dealing with a list of advices in a magazine. Such lists typically begin with a list header that introduces the items, such as ‘vedno’ “always”, followed by several instructions. For a grammatical reading of the instructions, one needs to prefix the header to each instruction, which puts the CC in the usual second position behind the list header. (The inspection of the context of the list item candidates also filtered out several instances of indicative sentences containing verbs with homographic indicative and imperative forms, e.g., ‘reš-i-te’ “save-
ind/imp-2pl”. These are not counted among the 31 problematic examples.)
(21) | Vedno “Always”: |
| – … |
| – Se pred vožnjo in pred težavnimi ali nevarnimi opravili prepričajte, da zdravila ne povzročajo zaspanosti ali kako drugače ne prizadenejo vaših sposobnosti. |
| “Before driving and difficult or dangerous tasks, make sure that the medication does not cause sleepiness or harm your abilities in some other way.” |
| – … |
The detailed investigation of the list items reinforced the lesson taught by the acceptable examples (which will be presented in
Section 4): the linguistic context—sometimes even large chunks thereof—is of utmost importance for establishing the grammaticality of ICC imperatives. We have therefore decided to disregard 10 of the problematic cases for which neither the relevant context could be determined from the corpus itself, nor could the original source be located.
10 Among these, there were five sentences beginning with ellipsis points, indicating that in the author’s mind, the clitic pronoun might not be clause-initial after all, but we could not find the preceding context which the ellipsis points refer to.
(22) | ?… si | ogle-j-te | posnetek! | ☞ [URL] [URL] |
| … | look.at-imp-2pl | recording! | |
| “Look at the recording!” |
We were left with six sentences, among them (23), for which we could determine the relevant context (and which we, remember, judge to be at most marginally acceptable). One possibility is that their authors are speakers of one of the western, Romance-adjacent Slovene dialects. In these dialects, the position of clitics is less restricted than in the rest of Slovene, and ICCs appear more freely in general (
Zuljan Kumar, 2022).
11 However, as all six examples were found in social media posts, which are not proof-read, another possibility is that their authors would ultimately find them unacceptable as well. This hypothesis is corroborated by comparing the frequency of the problematic clitic-first order to the frequency of the clitic-second order, which reveals that the latter is always radically higher. For example, Janes contains 5 examples of a clitic-first ‘Se ne sekiraj’ “
not worry” and 460 examples of a clitic-second ‘Ne sekiraj se’ “not worry
”, and there are 2 hits for ‘Si preberite’ “
read”, as opposed to 1061 hits for ‘Preberite si’ “read
”.
(23) | ? A: Danes se nisem spomnil še nič kaj neumnega, kaj šele pametnega, vredno objave. Se oproščam :-) |
| B: Se | ne | sekira-j | –za neumnosti skrbita vlada in malomarna. |
| | neg | worry-imp.2sg | |
|
“A: Today I haven’t yet thought of anything stupid, let alone smart,
worthy of posting. I’m sorry :-) |
| B: Don’t worry –the government and the [careless one](?) take care of
stupidities.” |
4. Results
The manual filtering of the corpus search results ultimately led to 98 imperative sentences containing an ICC. It turned out that they can be grouped into three discourse-based categories (note that all three categories are represented in both corpora):
Imperative sentences containing certain discourse particles;
Modally subordinated imperative sentences;
Imperative sentences appearing within instructions.
In this section, we present a representative sample of the result set by category. The full dataset is available as
Živanović and Štarkl (
2025). It also contains the 31 examples of non-obvious (potential) false positives discussed at the end of the previous section. A quantitative overview of the dataset is presented in
Table 1.
4.1. Discourse Particles
Most of the ICC sentences found in the corpora contained one of a small set of discourse particles. The most common of these particles, already noticed by
Dvořák (
2005), is ‘pa’ (24–25). Although highly multifunctional, ‘pa’ is most commonly an adversative particle: in (24b), it might be unexpected that we might have to stop planting before the end of the autumn, given that the preceding advice was to plant the entire autumn. (We discuss the difference between the two batches of ‘pa’-examples in
Section 5.2). Additionally, ICCs are licensed by the particle ‘pač’, at least when it expresses resignation to the given facts or circumstances (26): in (26a), the high headwind speed is undesired, but as there is nothing we can do about it, we can at least keep the peace of mind by imagining that we’re cycling uphill. (Note that in absence of the respective particle, the examples below are ungrammatical.)
(24) | a. | Da za tovrstne ceremonije nimate časa, boste rekli? |
| | Si | ga | pa | vzem-i-te! |
| | | | pa | take-imp-2pl |
| | “That you don’t have time for such ceremonies, you say? Well, take the time!” |
| b. | Sadite lahko celo jesen, če so temperature dovolj visoke. |
| | Se | pa | vseeno | ravna-j-te | po | tem, |
| | | pa | nevertheless | rule-imp-2pl | after | this, |
| | da prenehate saditi vsaj šest tednov pred prvo zmrzaljo, … |
| | “You may plant the entire autumn, if the temperatures are high
enough. Do however pay attention to stop planting at least
six weeks before the first frost, …” |
| c. | Tako naj vas trenutne izgube ne odvrnejo od vlaganja, ampak se še
naprej držite svojega varčevalnega načrta ali v teh časih postopno
namenite za borzo več denarja. Tako ne boste čez nekaj let znova
objokovali zamujene priložnosti. |
| | Se | pa | v | teh | negotovih | časih | izogiba-j-te | posojil | in | izvedenih |
| | | pa | in | these | uncertain | times | avoid-imp-2pl | loans | and | derived |
| | finančnih instrumentov z visokimi vzvodi, … |
| | financial instruments with high leverages |
| | “So don’t let the current losses make you refrain from investing,
but stick to your investment plan or gradually put even more money into
the stock market. This way, you will not cry over a missed opportunity
in a couple of years. However, in these uncertain times, do
avoid loans and derived financial instruments with high leverage, …” |
(25) | a. | Na peto mesto smo tokrat postavili juho, ki je na pogled najbolj simpatična: […]. |
| | Pripomba, ki je odnesla boljšo uvrstitev:[…] |
| | Si | pa | ta | recept | zapomn-i-te; | […] |
| | refl.dat | pa | this | recipe | remember-imp-2pl | |
| | “The fifth place goes to the soup which looks the best […] The remark, which prevented it to be placed higher: […] Nevertheless, do remember this recipe; […]” |
| b. | To za zdaj še ni zame. | Me | pa vpraša-j-te | čez | 40 let, | … |
| | | I.acc | pa ask-imp-2pl | after | 40 years.gen | |
| | “This is not for me at the moment. However, do ask me in 40 years, …” |
(26) | a. | Za kolesarjenje je moteče že, če piha (nasproti) tam od 20 do 30
kilometrov na uro, ampak to ne pomeni, da se ne da poganjati pedalov. |
| | Si | pač predstavlja-j-te, | predstavlja-j-te, |
| | | pač | imagine-imp-2pl |
| | da od Murske Sobote do Beltinec grizete v klanec. |
| | “For cycling, the (head)wind speeds of 20 to 30 kilometers per
hour can already be disturbing, but this does not mean that one cannot
pedal. Just imagine that you’re riding uphill from Murska Sobota
to Beltinci.” |
| b. | Starši se moramo nadzorovati in si vzeti dovolj časa, da nismo
nervozni in na primer zaradi pomanjkanja časa ne posegamo v samostojnost otrok. |
| | Si | pač | načrtuj-mo | čas | drugače. |
| | | pač | plan.imp-1pl | time | differently |
| | “As parents, we should control ourselves and take enough time, so that
we are not nervous and, for example, due to lack of time, intrude into
the independence of the children. We should simply plan our time
differently.” |
Two other candidates for particle licensing the ICC in an imperative clause are the encouraging particle ‘kar’ (27a), and the particle ‘raje’ that introduces an alternative that the speaker judges to be better (27b). However, we judge these sentences to be less acceptable than sentences involving ‘pa’ and ‘pač’, which is congruent with the small number of occurrences in the corpus. Additionally, a closer inspection of the context revealed that the example we found the most acceptable (it involves ‘kar’) actually belongs to the category of modally subordinated imperatives, which we discuss in the following section. Finally, searching for sentences with ‘kar’ and ‘raje’, and the CC in the usual second position revealed that those are much more common. We found 1718 examples of ‘kar’ + CC and 1620 examples of ‘raje’ + CC. They are illustrated in (28); note that ‘kar’ must be adjacent to the verb, so the sentence-initial position in contains ‘kar’ + verb. These numbers are in stark contrast to the same experiment with ‘pač’, where searching for ‘pač’ + CC results in merely two hits. Searching for ‘pa’ + CC yields 2325 results, but note that the ratio of CC + ‘pa’ vs. ‘pa’ + CC is still much higher than the ratio of CC + ‘kar/raje’ vs. ‘kar/raje’ + CC, and that the multifunctional ‘pa’ in the initial position has more functions than when it follows the ICC, among them a very common coordinative function. In conclusion, the status of ‘kar’ and ‘raje’ as ICC licencers is dubious, although not rejectable out of hand, and requires further research.
(27) | a. | ? Janše ne damo! | Ga | kar | ime-j-te, | ga ne rabimo. |
| | | | kar | have-imp-2pl | |
| | “We’re not giving away Janša! Go ahead, have him, we don’t need him.” |
| b. | ?
Ne bodi mulc! | Se | raje | fajn | v | postelji | povalja-j. |
| | | | rather | well | in | bed | roll-imp.2sg |
| | “Don’t be such a teenager! Better roll well in the bed.” |
(28) | a. | Šolska pravila so javna. | Kar | preber-i | si | jih! |
| | | kar | read-imp.2sg | refl | them.acc |
| | “The school rules are public. Go ahead, read them!” |
| b. | Dobro vem, kako je z njim, nikar mi zdaj ne pripoveduj o tem. |
| | Raje | mi | pove-j, | kaj se s tabo dogaja. | |
| | rather | I.dat | tell-imp.2sg, | | |
| |
“I know very well how he is doing, no need to tell me about this now. |
| | Rather tell me what’s up with you.” |
4.2. Modally Subordinated Imperatives
The set of imperative sentences with an ICC revealed by our search contains several examples that bear all the hallmarks of modal subordination (see
Section 2.1).
12 The crucial observation regarding the examples in (29) is that the addressee should not follow the given directive unconditionally, but only if some contextually determined condition is obtained. In (29a), the addressee does not need to read the linked web page if they already know about the child-killer whale; in (29b), A directs B to move away only if everybody is working; and so on.
(29) | a. | Če še niste slišali o kitu morilcu otrok … |
| | Si | to | nujno | preber-i-te. | [URL] | | | |
| | | this | necessarily | read-imp-2pl | | | | |
| | “If you haven’t heard about the child-killer whale yet … |
| | Be sure to read this. [URL]” |
| b. | A: Če vsi jočejo, joči z njimi. Če se vsi smejijo, se smej z njimi. |
| | B: Kaj pa, če vsi delajo? |
| | A: Se | umakn-i, | da | jih | ne | moti-š. | | |
| | | move.away-imp.2sg,
| that | | neg | disturb.prs-2sg | | |
| | “A: If everybody is crying, cry with them. If everybody is laughing, laugh with them. |
| | B: And what if everybody is working? |
| | A: Move away, so that you don’t disturb them.” |
| c. | A: Uf, da me tko pričaka … [a photo of a woman in an appealing posture] |
| | B: … | jo | pol | še | k | meni | pošlj-i | … ;) |
| | | | then | also | to | me | send-imp.2sg | |
| | “A: Uff, to have her wait for me like that … |
| | B: … send her to me as well … ;)” |
| d. | Kaj potrebujete za lastno solarno elektrarno? |
| | Si | preber-i-te | v | članku: | [URL] | | | |
| | refl | read-imp-2pl | in | article | | | | |
| |
“What do you need for your own solar power plant? |
| | Read about it in the following paper: [URL]” |
The other important characteristic property of modal subordination, licensing of additional anaphoric relations, is demonstrated by (29b): ‘jih’ “them” is anaphoric to ‘vsi’ “everybody” from the third question. Admittedly, (29b) is the only example which exhibits this property, but this is not very surprising, as the number of modally subordinated imperatives in our dataset is small. However, our examples also exhibit other signs indicative of modal subordination. For example, (29c) contains an overt ‘pol’ (in Standard Slovene, ‘potem’) “then”, which is usually analyzed as anaphoric to the reference time in the preceding sentence, which is also a feasible analysis of (29c).
Finally, four out of the seven corpus examples we have categorized as involving a modally subordinated imperative consist of a conditional and an imperative clause connected by ellipsis points, as seen from (29a) above. This might imply that these examples do not consist of two syntactically independent sentences and thus do not count as instances of modal subordination. However, in all these examples, the imperative clause starts with a capital letter, indicating that might be an independent sentence after all. Furthermore, even if these turn out to be mono-sentential (in the age of auto-correct, the initial capital might not be a very realiable indicator of a start of a sentence), the remaining three examples cannot be analyzed as a single sentence. For example, the condition in (29b) is embedded, and in (29c) the condition-to-be is not introduced by ‘če’ “if” but by complementizer ‘da’ “that” in the function of expressing a wish. In the latter example, the bi-sentential nature of the example is also indicated by the fact that the interlocutors switch roles between the clauses. We conclude that even if not very numerous (in written discourse), modally subordinated imperatives are a class of imperative sentences allowing for initial clitic clusters.
4.3. Instructions
The final discourse environment in which we found imperative clauses with an ICC is what we dub
instructions (30). A common example of such texts are cooking recipes (our dataset includes one such example):
(30) | a. | Stvari na mizo. Pisala v roke. Slovarje pred nos. Razgrnite
tekst. Poiščite slovnične strukture. |
| | Jih | podčrta-j-te | in | izpiš-i-te. | | | | |
| | | underline-imp-2pl | and | write out-imp-2pl | | | | |
| | “Things on the desk. Pens in hand. The dictionaries under your noses.
Open the text. Find grammatical structures. Underline them, and write
them out.” |
| b. | Meditacija s spominjanjem: vzemite si nekaj trenutkov zase. Zamižite
in v mislih zajadrajte v najljubši kraj s potovanja. |
| | Si | priklič-i-te | tamkajšnje | vonjave, | zvok, | barve | in | občutja. |
| | | recall-imp-2pl | thereby | smells | sound | colors | and | feelings. |
| | “Meditation with remembering: take a couple of moments for yourself.
Close your eyes and in your mind sail into your favourite place from
the trip. Recall the smells, sound, colors and feelings you had there.” |
We argue that instructions belong to a wider category of narratives, i.e., we take instructions to be narratives composed of sentences in the imperative mood. Instructions clearly fulfill the criteria for narration (see
Section 2.2); in fact, they are used as examples of narrative discourse in
Hobbs (
1978). In particular, note that the textual order matches the temporal order. Furthermore, if instructions are indeed a subspecies of narratives, we expect them to license telescoping (see
Section 2.2). This indeed turns out to be the case. However, as the small set of ICC-containing instructions we found in the corpora contains no examples of universal quantification, we can only provide a constructed example:
(31) | Negovanje konjev ni zelo pomembno le za njihov izgled, temveč pripomore tudi h gradnji čustvene vezi med konjem in jezdecem. Zato se posvetite vsakemu konju posebej. Očistite mu kopita. Preglejte njegovo dlako.
Ga skrtač-i-te. … |
| brush-imp-2pl |
| “Grooming horses is not only important for their appearance, but also contributes toward building an emotional bond between the horse and the rider. Therefore, take the time for each and every horse. Pick its hooves. Inspect its coat. Brush it. …” |
Returning to the corpus examples, note that none of the ICC sentences we found in the corpus was the first step of the instruction. Indeed, we judge that beginning an instruction with an ICC sentence is ungrammatical. For example, one cannot rework (30b) into (32).
(32) | Meditacija s spominjanjem: | *si | vzem-i-te | nekaj | trenutkov | zase. |
| | | take-imp-2.pl | several | moments | for yourself |
| “Meditation with remembering: take a couple of moments for yourself.” |
Finally, note that the temporal sequence of instruction steps can be interrupted by an elaboration of a step. (Elaboration is one of the coherence relations recognized by Coherence Theory). Our corpus data contains one example of an ICC elaboration of the previous imperative sentence. In (33), the directive to magnify the photo elaborates on the preceding imperative by telling the addressee how they could observe it better.
(33) | Poglej bolje. |
| Si | slik-ico | poveča-j. |
| | photo-dim | magnify-imp.2sg |
| “Observe better. Magnify the photo.” |
Summing up, our corpus study shows that ICCs are not ungrammatical in Slovene imperative sentences. Additionally, the categories of ICC sentences established in this section—discourse particles, modal subordination and instructions—indicate that, at least in imperatives, the distribution of ICCs is governed by discourse-related factors.
5. Discussion
ICCs in Slovene imperatives have been discussed by
Milojević Sheppard and Golden (
2000,
2002) and
Rus (
2005). However, these analyses were based on the erroneous assumption that ICCs in (matrix) imperatives are always ungrammatical. It is therefore not surprising that
Milojević Sheppard and Golden (
2002), after attempting to apply the theories of
Rivero and Terzi (
1995) and
Han (
1998) to Slovene, conclude by stating that “the question of what makes the sentence-initial Clitic—Imperative Verb order unacceptable remains unsolved and merits further research.”
13 More generally, given the conclusion from
Section 4 that the distribution of ICCs in Slovene imperatives is governed by discourse-related factors, any analysis that attempts to explain the placement of the clitic clusters based only on morphosyntactic properties of the verbal system is destined to fail.
In broad strokes, the main idea behind our analysis is that while ICCs seem to be licensed in a heterogeneous set of discourse environments, these environments have a common denominator: they all require reference to some preceding discourse material. We assume that this discourse material, or perhaps an anaphoric reference to this material, is covertly integrated into the LF of an ICC sentence as a specifier of the topmost functional head—typically CP or, assuming a split CP, some high, discourse-related functional projection. Under the standard assumption that the clitic cluster is adjoined to C° (
Golden & Sheppard, 2000) and obligatorily preceded by a specifier, the clitic cluster, which occupies the sentence-second position in the usual situation of the overt specifier of CP, will then occupy the sentence-initial position when the specifier is covert. Note, however, that the analysis of instructions will suggest that the topmost head is not always C°, i.e., that instructions may be composed of impoverished clauses and that the clitic cluster appears in the highest head of the clause, whether this is C° or some other head.
We emphasize that we do not aim to explain why ICCs may arise in Slovene, as opposed to almost all other Wackernagel languages. For concreteness, we subscribe to the standard view that this is so because “Slovene clitics have the option of being proclitic” (
Golden, 2003, p. 214).
14 The idea is that proclitics, which lean on the following phonological material, impose no requirements on the preceding phonological context, thereby allowing the preceding syntactic constituent to be covert.
15 However, nothing in our analysis hinges on this assumption. Our analysis is compatible with any mechanism, syntactic or phonological, which allows the clitic cluster, standardly assumed to be adjoined to C° (
Golden & Sheppard, 2000), to overtly occur sentence-initially. Our goal is more modest and limited to accounting for the corpus data presented in
Section 4. We only attempt to explain the distribution of ICCs in Slovene imperatives rather than what makes Slovene have ICCs at all.
That said, our analysis does have a goal above and beyond accounting for the corpus data. We wish to emphasize that it supports the view on the syntax–semantics interface held by
Ludlow and Živanović (
2022) and
Živanović and Ludlow (
2023), presented in
Section 2.3. Zero Interface and its extension into the realm of discourse predict an even tighter correlation between syntactic and semantic/pragmatic phenomena as is usually envisioned. Crucially for our discussion, in mainstream semantic theories discourse, information is accessed by semantics through the model-theoretic representation of the context, carrying no repercussions for syntax. In Zero Interface, however, any discourse material contributing to the meaning of a linguistic expression must be integrated into the LF. Of course, whether the integration of certain discourse material into the LF carries visible consequences for overt syntax is a matter resolved by a conspiracy of various phenomena. In this paper, we claim that ICCs in Slovene imperatives indeed arise via such a conspiracy. On the one hand, Slovene allows the clitic cluster to occur clause-initially (for reasons that are not investigated in this paper, though we have hinted that the culprit might be the proclitic nature of Slovene Wackernagel clitics). On the other hand, certain discursive situations compatible with the directive force emanating from imperative sentences require reference to the preceding discourse material. In case this reference is covert, it results in a clitic cluster following a covert initial phrase, yielding a clause-initial clitic cluster in overt syntax.
We now turn to individual ICC categories, with the aim of showing that each of them, in fact, requires reference to some preceding discourse material. We start with modally subordinated imperatives in
Section 5.1, because the discussion of discourse particles in
Section 5.2 relies on the analysis of modally subordinated imperatives, and conclude with
Section 5.3 on instructions.
5.1. Modally Subordinated Imperatives
We arrive at the LF of modally subordinated ICC imperatives by using the basic structure of modal subordination exemplified by (12) in
Section 2.3 and adding the standard assumption that the clitic (cluster) is located in C° (
Golden & Sheppard, 2000). C° immediately follows the covert conditional, which gives rise to the overt clause-initial position of the cluster.
(34) | ![Languages 10 00217 i004 Languages 10 00217 i004]() |
In (34), we used a corpus example where the ICC and preceding sentence were linked by ellipsis points. In such a case, the accommodated restrictor is most likely a verbatim copy of the preceding sentence. However, as mentioned in
Section 2.1, the restrictor may be arrived at by various kinds of inferential reasoning—indeed, this is one of the hallmarks of modal subordination. The exact form of the restrictor is thus uncertain, so (35) shows merely one possibility for each of the modally subordinated imperatives from (29); note especially (35d), which requires the most reasoning. (We only provide the English translations, as the exact form of the original is irrelevant here.)
(35) | a. | If you haven’t heard about the child-killer whale yet, sure to read this. |
| b. | If everybody is working, move away, so that you don’t disturb them. |
| c. | If she waits for you like that, send her to me as well. |
| d. | If you want to learn what you need for your own solar power plant,
read about it in the following paper. |
5.2. Discourse Particles
Our analysis of ICC sentences involving a discourse particle is essentially the same as the analysis of modally subordinated imperatives: covert discourse material in the specifier of CP gives rise to the overtly clause-initial clitic cluster. Actually, most of the ICC sentences involving a discourse particle found in the corpus are modally subordinated. Remember that the crucial evidence for the modal subordination is that the given directives should not be taken as unconditional, but pertaining to a specific restriction on the domain of possible worlds. This turns out to be the case for the sentences with ‘pa’ illustrated in (24) and for all sentences containing particle ‘pač’. In (36) and (37), we show possible covert conditionals in the function of the accomodated modal restrictor for the corpus examples in (24) and (26) containing ‘pa’ and ‘pač’, respectively (as with the example at the end of the previous section, the final subexample of each batch seems to require the most reasoning for arriving at the restrictor). For example, in (36b), we are not directed to stop planting six weeks before the first frost, generally, but only if we followed the advice to plant in autumn. In fact, if the directive was general, the discourse would not make logical sense due to a failed presupposition: we cannot stop planting if we are not planting. (Notice the similarity of this exceptionally licensed presupposition to the exceptional discourse anaphoric relations, the hallmark of modal subordination.)
(36) | a. | That you don’t have time for such ceremonies, you say? If you don’t have time for such ceremonies,
well, take the time! |
| b. | You may plant the entire autumn, if the temperatures are high enough. If you will plant in the autumn, you should however pay
attention to stop planting at least six weeks before the first frost, … |
| c. | So don’t let the current losses
make you refrain from investing, but stick to your investment plan or
gradually put even more money into the stock market. This way, you will
not cry over a missed opportunity in a couple of years. If you will invest / If you follow this advice,
however, in these uncertain times, you should avoid loans
and derived financial instruments with high leverage, … |
(37) | a. | For cycling, the (head)wind speeds of 20 to 30 kilometers per
hour can already be disturbing, but this does not mean that one cannot
pedal. If there is headwind with the speed between 20 and 30 km/h,
just imagine that you’re riding uphill from Murska Sobota to Beltinci. |
| b. | As parents, we should control ourselves and take enough time, so that we
are not nervous and, for example, due to lack of time, intrude into the
independence of the children. If the time is severly limited,
we should simply plan our time differently. |
Unlike the examples with ‘pa’ discussed above, ICC sentences containing the particle ‘pa’ shown in (25) do not involve modal subordination. The directives expressed in (25) are not conditional, but merely adverse to some proposition expressed (or implied) by the preceding linguistic context. These sentences thus cannot receive the same semantic analysis. However, the syntactic analysis, and thus the reason for the appearance of an ICC, remains essentially the same: we assume that these sentences contain a covert embedded clause (38). That this is a sensible analysis is confirmed by the fact that the sentences can be paraphrased using an overt embedded clause introduced by connective ‘čeprav’ “although”. Note that the content of ‘čeprav’-clause might need to be inferred, same as with the restrictor in the modally subordinated imperatives above. Another interesting detail (which we leave to further research) is that when ‘čeprav’-clause is overt, the sentences are only grammatical without ‘pa’.
(38) | a. | Čeprav ta recept ni bil uvrščen višje, kot je bil,
si pa ta recept zapomnite. |
| | “Although this recipe was not placed higher than it was,
remember this recipe.” |
| b. | Čeprav to za zdaj še ni zame, me pa vprašajte čez 40 let. |
| | “Although this is not for me at the moment, ask me in 40 years.” |
Summing up, our (syntactic) analysis of ICCs involving discourse particles is the same as the analysis of modally subordinated ICC sentences: the overtly initial clitic cluster is, in fact, preceded by a silent embedded clause. Finally, note that our analysis is not unique in assuming that sentences may contain covert discourse material. For example,
Giorgi (
2016) in her analysis of Italian surprise questions introduces a (dis)course head containing ‘ma’ “but” connecting two parts of the discourse. In particular, the specifier of this head hosts “the expected, but silent, portion of the construction” (
Giorgi, 2016, p. 8).
16 Giorgi (
2023) furthermore argues for a distributed pragmatics, demonstrating “that the syntactic explanation of complex phenomena can now start to include units beyond a single sentence” (
Giorgi, 2023, p. 124)—precisely what we do in our analysis to account for the distribution of ICCs in Slovene imperatives.
5.3. Instructions
The core idea behind our analysis of ICC sentences appearing in instructions is the same as for modally subordinated imperatives and for imperatives containing discourse particles: the overtly clause-initial clitic cluster is preceded by covert discourse material. The details, however, are different. In
Section 4.3, we have shown that instructions belong into a wider class of narratives. We therefore apply the account of narratives presented in
Section 2.3 to instructions.
Below, we provide a discourse LF for the sequence of instruction steps in (39); the sentences belong to the corpus example (30a), but for simplicity, we are pretending that they are starting the instruction. Individually, the two sentences receive the LFs shown in (40). Although the sentences seem rather similar at first sight, their LFs differ in important respects. First, only (40a) carries a marker of imperative force.
17 This is so because we assume that the directive force applies to the instruction as a whole, not to separate imperative sentences—it is not practical or logical to only follow some selected steps of an instruction. Second, only (40b) contains the Narration head (and so would all further imperative sentences in the instruction); the first sentence does not contain it as it is starting the instruction and therefore is not a narrative continuation of the preceding discourse. The two LFs are eventually integrated (post-syntactically, by mechanisms used for inferential reasoning) into the discourse LF shown in (41) (cf. 13), but for our analysis, the crucial aspect of the LFs shown below is the location of NarrationP.
(39) | Poiščite slovnične strukture. Jih podčrtajte in izpišite. |
| “Find grammatical structures. Underline them, and write them out.” |
(40) | a. | ![Languages 10 00217 i005 Languages 10 00217 i005]() | b. | ![Languages 10 00217 i006 Languages 10 00217 i006]() |
(41) | ![Languages 10 00217 i007 Languages 10 00217 i007]() |
As shown by (40b), Narration heads the entire LF of a non-initial instruction clause. The covert discourse material (or a some marker triggering the integration of this material) is located in the specifier of Narration in (40b) (shown as an empty branch), which leaves the clitics residing in the Narration head in overtly clause-initial position. Given this location of the clitic cluster, we have to of course relax our initial assumption that the clitic cluster is located in C° and replace it with the assumption that it resides in whatever syntactic head turns out to be the highest; such an analysis supports the idea that clitics move as far as they can (
Bošković, 2001;
Franks, 2010;
Marušič, 2008), rather than having a fixed (displaced) location in C°. Finally, note that as we place the ICC in the Narration head, and this head only occurs in the LF of the non-first sentences, we correctly predict the observation from
Section 4.3 that only non-first sentences of an instruction may contain an ICC.
6. Conclusions
In this paper, we have presented corpus data showing that, contrary to what is claimed in most of the previous literature, initial clitic clusters (ICCs) in Slovene imperatives are not ungrammatical. Our data thus speaks against a syntactic uniqueness of Slovene imperatives, contrary to what is claimed by
Milojević Sheppard and Golden (
2002). We have categorized ICC imperatives into three classes: modally subordinated imperatives, imperatives containing the adversative or the concessive particle, and imperatives occuring as a step in an instruction. The categorization shows that the distribution of ICCs in Slovene imperatives is ultimately governed by discourse. We speculate that ICCs in imperatives are uncommon in comparison to other sentence types because less discourse situations allowing for ICCs are compatible with imperatives than with assertions and questions.
At first sight, the positioning of the clitic cluster seems to be a syntactic affair, perhaps with phonology playing a minor role as well, but given the classification of ICC imperatives presented in the paper, the conclusion that there is a discourse component to the ICC phenomenon is inescapable. Interestingly, however, despite being licensed by a heterogeneous set of discourse situations, it seems that the ultimate source of ICCs (at least in imperatives) is one and only one: ICCs arise when the second-position clitic cluster is preceded by the covert discourse material required by coherence relations—a conclusion that arguably vindicates the view that all discursive information must be represented in syntax, even if covertly.
Our paper raises additional questions about the status of clitics in Slovene and Slavic in general. One question is whether the analysis can be broadened to all Slovene sentence types. Specifically, do all of the discourse situations our categorization relies on allow for ICCs in assertions and questions as well? Are there any additional discourse situations that allow them in other moods? Moreover, the tentative idea that the availability of ICCs is due to the proclitic nature of Slovene Wackernagel clitics should be further explored. According to
Lenertová (
2004), informal spoken Czech allows for ICCs. Further research can look into whether our analysis can be extended to Czech as well. Lastly, we note that our analysis is viable for the Slovene dialects that exhibit stable Wackernagel clitic position. This excludes the western Romance-adjacent dialects that allow first-position clitics more freely (
Zuljan Kumar, 2022). Additional syntactic discussions of Slovene dialects are desirable to detect possible fine-grained differences between the dialects.
The future research will undoubtedly reshape our analysis of Slovene ICC imperatives. However, the novel data will remain: initial clitic clusters may occur in Slovene imperatives, and their distribution is crucially governed by discourse.