1. Introduction
In English, the term ‘Chinese’ is commonly used to refer to the standard language of the People’s Republic of China, as well as one of the official languages of Singapore, i.e., Standard Mandarin Chinese (henceforth: SMC), also known in China as 普通话
Pǔtōnghuà (lit. ‘common language’). However, ‘Chinese’ can also be used to refer, more generally, to any variety belonging to the Sinitic branch of the Sino-Tibetan language family (see, e.g., [
1,
2]). In the Chinese tradition, as well as in much of the international literature on China and its language situation, it is customary to refer to Sinitic languages other than SMC as ‘dialects’ (方言
fāngyán). This is mostly due to the fact that these varieties are not officially recognized as languages, at least in Mainland China, nor did they undergo a process of standardization (with the partial exception of Cantonese; see, e.g., [
3]). Also, in the Chinese context, the idea of having only one ‘language’ is seen as a symbol of national unity, and as a marker of ethnic identity [
4]. Thus, the labels ‘Sinitic languages’, ‘Chinese languages’, and (somewhat paradoxically) ‘(the) Chinese language’ are all used to refer to the same group of varieties.
The idea of ‘Chinese’ as a unitary language with regional variation, somewhat similarly to English, was famously captured in Yuen Ren Chao’s idea of a ‘universal Chinese grammar’ [
5] (p. 13):
“[…] It is in matters of grammar that the greatest degree of uniformity is found among all the dialects of the Chinese language. Apart from some minor divergencies, […] one can say that there is practically one universal Chinese grammar”.
Chao can be considered “the founder of modern Chinese Linguistics”, and his views of Chinese/Sinitic have indeed been very influential in the field [
6] (p. 217). However, Chao’s idea of a fundamentally shared grammar among all Sinitic languages (or, as he puts it, “dialects of the Chinese language”) has now been conclusively proven to be highly misleading. A rich and growing body of research has indeed shown that the so-called Chinese ‘dialects’ are at least as diverse as, e.g., Romance or Germanic languages, or even more, and that their differences are not limited to the phonological and lexical components, but also involve, to a very significant extent, their grammar [
6,
7,
8]. Besides the above-mentioned factors related to language ideology, another reason for this common misconception concerning Sinitic grammar is that, very often, in Chinese dialects, it is possible to create ‘hybrid’ structures, making use of SMC grammar, but dialect phonology and even lexicon (a phenomenon known as ‘ditaxia’ [
9]). See the following Cantonese examples (adapted from [
9], p. 1277; in this article, the glosses follow the general guidelines of the
Leipzig Glossing Rules):
| (1) | a. | 我 | 比 | 佢 | 高 |
| | | ngo5 | bei2 | keoi5 | gou1 |
| | | 1sg | compare |
3sg | tall |
| | b. | 我 | 高過 | 佢 | |
| | | ngo5 | gou1-gwo3 | keoi5 | |
| | | 1sg | tall-surpass |
3sg | |
| | | ‘I am taller than her/him’ |
In (1), we see two possible comparative constructions of Cantonese. (1a), based on the marker 比
bei2 ‘compare’, follows the ‘marker–standard–adjective’ order. (1b), on the other hand, makes use of the marker 過
gwo3 ‘surpass’, and shows the ‘adjective–marker–standard’ order. The latter is the ‘authentic’, colloquial comparative construction for Cantonese, while (1a) belongs to the formal register, and it is more often used by educated speakers [
9]. What is crucial here is that (1a), the formal variant, closely corresponds to the ‘standard’ SMC comparative construction:
| (2) | 我 | 比 | 他 | 高 |
| | wǒ | bǐ | tā | gāo |
| | 1sg | compare |
3sg | tall |
| | ‘I am taller than him’ |
If we look at the formal register of Cantonese, one might be led to believe that the same constructions of SMC grammar can indeed be used in other Sinitic varieties, with differences mostly limited to the phonological and lexical level; however, this is obviously not true if we look at the informal, colloquial register of dialects [
6].
Thus, as pointed out by Norman [
1] (p. 72), for the purposes of cross-linguistic comparison, “Chinese is a vast dialectal complex containing hundreds of mutually unintelligible local varieties, each of which can be viewed as a distinct object […]”. In this article, we use ‘Sinitic language(s)’ and ‘Chinese dialect(s)’ interchangeably, as they normally describe the same entities.
The most recent, generally accepted genealogical classification of Sinitic languages includes ten groups [
10], which are sometimes erroneously labeled as ‘languages’ [
1]: namely, Mandarin, Jin, Wu, Gan, Xiang, Hui, Hakka, Yue, Min, and Pinghua, plus some unclassified languages. An earlier partition, which still enjoys some degree of acceptance, includes only seven groups, to the exclusion of Jin, Hui, and Pinghua [
11]. Each group is then further divided into several levels of subgrouping, down to the individual varieties (see [
4] for details). Since SMC is based on Mandarin dialects, from a historical point-of-view, other Chinese dialects should be seen as sister languages of SMC [
11]. This widely applied classification, however, is almost exclusively based on phonological criteria. Chiefly, the evolution of Middle Chinese voiced obstruents [
12], as the phonological system of all modern dialects (except the Min group) can be understood as the evolution of that of Middle Chinese, at least in the traditional approach to Chinese historical linguistics (but see [
13]). This type of ‘family tree’ approach to dialect classification can represent only groupings justified on the basis of vertical transmission, while the evolution of Sinitic languages has been strongly shaped by contact, both within the family and with non-Sinitic languages [
14,
15] (we shall get back to this below); also, a phonology-based genealogical classification may not reflect differences in grammatical aspects [
12,
14].
Indeed, if we look at the typological features of modern Sinitic languages through the lens of their genealogical classification, we may notice that they appear as very different from the so-called ‘Tibeto-Burman’ languages (i.e., the non-Sinitic Sino-Tibetan languages; on the structure of the Sino-Tibetan family, see [
16]). Sinitic languages are overwhelmingly tonal, verb-medial, and isolating/analytic, while Tibeto-Burman languages are not necessarily tonal, they are dominantly verb-final, and exhibit more varied (and elaborated) morphological profiles, including isolating languages like Karen, languages with transparent and regular agglutinative morphology (Lolo-Burmese, Tibetic, and Boro-Garo), and also paradigmatically complex languages, with elaborate argument indexation and transitivity management systems (Rgyalrongic, Kiranti) [
17]. The divergence between Sinitic and the rest of the Sino-Tibetan family has been attributed, at least partly, to contact. At least since the Qin period (221–207 BCE), speakers of Sinitic migrated repeatedly from the Central Plains region to (what is now) Southern China, leading to contact with the so-called 百越
Bǎi Yuè (lit. ‘Hundred Yue’) ethnic groups inhabiting the region, who were likely speakers of Hmong-Mien, Tai-Kadai, and Austroasiatic languages [
15,
18]. This arguably led to the convergence of Sinitic and the Mainland Southeast Asian language type, characterized, e.g., by the use of lexical tone, isolating/analytic morphology, lack of agreement, verb medial basic constituent order, among other features [
19,
20] (we will get back to this in
Section 2). However, Mainland Southeast Asian traits are actually more common in the Sinitic languages of Southern China. This is because Northern China was a site for contact with speakers of ‘Altaic’-type languages, belonging to the Mongolic, Turkic, and Tungusic languages [
15,
20], leading to the development of some typological features shared with languages of the Northern Asian region [
21]. This is commonly referred to as the ‘Altaicization’ of Northern Sinitic, as opposed to the ‘Taiization’ of Southern Sinitic [
22]. In addition, as mentioned above, internal migration of Chinese-speaking people led to contact, admixture, patterns of areal diffusion and convergence among Sinitic varieties, cross-cutting genealogical subdivisions (see below,
Section 5).
This article is organized as follows. In
Section 2, we provide a general overview of the main typological characteristics of Sinitic. In
Section 3 and
Section 4, we discuss, respectively, morphological and syntactic aspects, while in
Section 5, we focus on the areal patterns of distribution of typological features. In
Section 6, we offer our concluding remarks, as well as some hints on the future prospects of this field of inquiry.
2. Overview
From a typological point of view, Sinitic languages are a perfect example of
concordia in varietate; they do share a considerable number of key typological traits, but they also differ in many aspects. These differences are often based on areal patterns, rather than following genealogical groupings, as mentioned in
Section 1. Also, again as mentioned earlier, many (if not most) of the features which define modern Sinitic languages are consistent with the Mainland Southeast Asian typological profile, a fact which led to the reinterpretation of this areal type as “Mainland East and Southeast Asian” (henceforth: MESEA) [
23]. The MESEA area includes languages both of Mainland Southeast Asia (Vietnam, Laos, Cambodia, Thailand, Peninsular Malaysia, and Myanmar) and of China, belonging to the Sinitic, Tai-Kadai, Austroasiatic, Hmong-Mien, and Austronesian families [
24]. The core features of the MESEA areal type are [
19,
20,
21,
25,
26]:
A tendency towards monosyllabism (sesquisillabism for some languages)
Isolating/analytic morphology
Use of lexical (and grammatical) tone
Use of lexical morphemes with grammatical functions
Use of classifiers
Lack of agreement for number, gender, case, etc.
Lack of obligatory arguments (zero anaphora)
Topic-prominent syntax
Verb-medial, head-modifier order, use of prepositions
Use of serial verb constructions
Use of (modal) sentence-final particles
Prominence of aspect over tense
The above-mentioned features are not all entirely independent from each of the others, since some of them ‘conspire’ to shape the typological profile of Chinese languages. Most importantly, the combination of the lack of agreement, the lack of obligatory arguments, and topic-prominent syntax leads to the so-called ‘indeterminateness’ of Chinese (and, more generally, of MESEA languages) [
27]. As pointed out by Enfield [
19] (p. 188), languages in the MESEA area combine “widespread noun phrase ellipsis (of definite arguments) with noun phrase movement (into clause-external positions like topic), resulting in great indeterminacy of surface sequences”; grammatical categories as, e.g., number, tense, aspect, and reference can also be omitted [
27]. See the following SMC example (adapted from [
27] (p. 112)):
| (3) | 她 | 买 | 报纸 |
| | tā | mǎi | bàozhǐ |
| | 3sg.f | buy | newspaper |
| | ‘She bought/buys/is buying/will buy a newspaper/ the newspaper/ the newspapers/ newspapers’ |
Without a context, a sentence such as (3) is open to many different interpretations in terms of tense, aspect, number, and definiteness status, as shown by the suggested possible translations. In actual communication, ambiguity is normally resolved with contextual and non-contextual cues, such as, e.g., verb semantics, topic continuity, and pragmatic expectations, among others [
19].
Within this general profile, diversity in Sinitic manifests itself on many levels. As for morphology, while lexical tone is nearly universal in Chinese languages, the use of tone to express grammatical meaning (including derivation) varies considerably; also, the tendency towards having monosyllabic morphemes and stable morpheme boundaries [
28] is not as strong in some dialects, especially in Northern China, in which we may find strong reduction in grammatical morphemes, fusion, and cumulative exponence [
29]. It is, however, in the domain of syntax where we see the highest degree of diversity among Sinitic languages, both in terms of the actual construction patterns and of the items which are grammaticalized with a certain function. Some of the best-studied aspects of syntactic variation are the comparative, passive, pretransitive, and ditransitive constructions, negation, and aspect. Often, the variation we see in the domains of morphology and syntax is areally skewed, and can sometimes be understood as a product of areal diffusion and convergence, as hinted at above. We shall discuss this in the following sections.
3. Morphology and the Lexicon
As mentioned above, Chinese is widely regarded as a typical instance of isolating/analytic typology. SMC, and Sinitic in general, is said to have little or no inflectional morphology, few affixes (often with a transparent origin [
30]), very stable morpheme boundaries [
28], and little or no cumulative exponence, allomorphy, or suppletion. Thus, the general picture for Sinitic is that morphemes mostly have only one phonological form, and even grammaticalized items such as affixes remain clearly distinct from the root they combine with [
31]. In a diachronic perspective, this translates into a proposed feature of MESEA languages, namely having grammaticalization “without coevolution of form and meaning” [
27]; in other words, primary grammaticalization (the development of lexical categories into functional/grammatical categories) without secondary grammaticalization (morphological bonding, phonetic erosion, fusion, etc.). This is, in turn, connected with one of the typological traits mentioned in
Section 2, namely the use of lexical morphemes with grammatical functions. If lexical items that grammaticalize into functional elements do not change their shape, from a synchronic point-of-view, they will look like polyfunctional lexical/grammatical items. An example for this is SMC 在
zài, a verb meaning ‘to be at/in’, but also a preposition, ‘at/in’, and a marker of progressive aspect; the evolution of 在
zài did not cause (significant) formal changes in its shape.
Moreover, the above-mentioned lack of inflection and agreement entails that nouns, verbs, and adjectives are not expected to vary morphologically to mark number, gender, case, or TAM, as shown also in (3): a noun as SMC 演员 yǎnyuán can be translated as ‘actor’, ‘actors’, ‘actress’, ‘actresses’; an adjective like 聪明 cōngming means ‘clever’ for all gender and number values. Hence, the phrase 聪明的演员 cōngming de yǎnyuán can mean ‘clever actor’, ‘clever actors’, ‘clever actress’, ‘clever actresses’, in the absence of other cues. Besides lexical cues, some grammatical categories, such as, e.g., aspect, may be expressed with analytical markers, like the above-mentioned SMC 在 zài ‘prog’, or Cantonese -咗 -zo2 ‘pfv’.
Another characteristic of Chinese morphology, which is in line with the general trend of the MESEA area, is the strong tendency towards a 1:1 correspondence between syllables and morphemes. Most morphemes are monosyllabic, although only a subset of them are syntactically free (i.e., able to occur independently in a sentence). Thus, for instance, both SMC 书
shū ‘book’ and 椅
yǐ ‘chair’ are (lexical) morphemes, but only the former is free, and thus corresponds to a (syntactic) word. 椅
yǐ ‘chair’ can be used only in combination with other morphemes to form a word, such as, e.g., 长椅
cháng-yǐ ‘long-chair, bench’. Note that there are no formal differences between the two types of morphemes, i.e., there is nothing in the shape of a morpheme that might suggest whether it is free or bound. Morphemes made of more than one syllable, generally speaking, are either loanwords (e.g., Cantonese 多士
do1si6 ‘toast’), onomatopoeias (SMC 噼里啪啦
pīlipālā ‘crackling sound’), or words which derive from Old Chinese alliterative and rhyming compounds (SMC 徘徊
páihuái ‘pace back and forth; waver’). Interestingly, the tendency towards a 1:1 relationship between syllables and morphemes sometimes leads to the reanalysis of an individual syllable within a polysyllabic morpheme as a morpheme on its own [
11], e.g., Cantonese 多士
do1si6 ‘toast’ > 奶油多
naai5jau4-do1 ‘butter-toast, toast with butter and condensed milk’. Again, such a strong overlap between the basic unit of speech (the syllable) and the basic unit of meaning (the morpheme), combined with the stability of morpheme boundaries, should prevent the reduction in morphemic syllables, fusion, and cumulative exponence [
32].
However, as hinted at in
Section 2, phenomena such as strong reduction in grammatical morphemes, and even erosion of morpheme boundaries, leading to fusion, noncatenative exponence, and cumulative exponence, are all, in fact, attested in a subset of Sinitic. This happens mostly (but not exclusively) in Northern China [
29]. For instance, based on a sample of 26 Mandarin and Jin dialects, Arcodia proposes the following cline of grammaticalization for markers of perfective aspect deriving from the verb 了
liǎo ‘to finish’ (Late Middle Chinese
liawˊ, Early Mandarin
ljɛwˇ [
33];
Figure 1 adapted from [
34] (p. 154)):
Indeed, even in the ‘typological stereotype’ of Sinitic isolating morphology, namely SMC, we do see phonological and prosodic reduction in the perfective aspect marker, whose current shape is 了
le; the triphthong has been reduced to a schwa, and the marker bears no independent tone (and stress). In a number of Northern Sinitic languages spread over an area including Shaanxi, Shanxi, Henan, Hebei, and Shandong provinces, the same meaning may be expressed by suffixes made of a single vowel, which can also undergo fusion with the verb root, being expressed cumulatively in the form or rhotacizazion (i.e., addition of a rhotic coda), rhyme change (ablaut), tone change, (and/)or vowel lengthening [
29,
34], and even combinations of the above. See the examples in
Table 1 (Chinese characters omitted for ease of presentation):
While direct historical evidence to reconstruct the genesis of processes of strong reduction and fusion of grammatical exponents presented above is lacking, even just through the comparison of synchronic data from different dialects, it can be safely argued that these phenomena are the product of growing integration of a concatenative exponent in a lexical root, at least for most cases [
41].
Note that reduction leading to fusional, nonconcatenative, and cumulative exponce is found also in other areas and groups of Sinitic, and may be used to express many different types of grammatical meaning (see, e.g., the overview in [
29,
42,
43]), also including nominal derivation. An often-quoted example is the (loosely defined) ‘diminutive’ tone change (known in Cantonese as 變音
bin3yam1 ‘sound change’, or ‘lexical suprafixing’ [
3,
44]) as, e.g., Cantonese 臺
toi4 ‘stage, terrace’ > 臺
toi2 ‘table’ [
45] (p. 191), found in many Yue dialects also in different forms, including the addition of a nasal coda, or even just a nasal feature on the root, plus tone change [
44]. Another use of tone change found in several Yue varieties, as well as in some Hakka and Gan dialects, is marking perfective aspect, with dropping of the perfective aspect marker, as, e.g., Cantonese 落咗堂
lok6-zo2 tong4 ‘finish-
pfv class’ > 落堂
lok2 tong4 ‘finish.
pfv class’, two ways of saying ‘having finished class’ [
43] (p. 22). However, at present, there seems to be no consensus as to whether these marking patterns for perfective aspect are due to fusion of root and segmental exponent, or simply due to tone sandhi followed by omission of the aspect marker [
43].
As for word formation, the dominant process in Sinitic is compounding, understood as the combination of (either free or bound) lexical morphemes to form a word, as SMC 电脑
diàn-nǎo ‘electric-brain, computer’, Cantonese 甩肺
lat1-fai3 ‘drop-lung, exhausted, wrecked’, or Taiwanese (Southern Min) 頭家
thâu-ke ‘head-family, boss, husband’. Compounding, especially disyllabic/bimorphemic compounding, has become the default word formation strategy for SMC and many other Sinitic languages. This fact has been connected to a preference for disyllabic units, due to emerging prosodic constraints in the historical development of Chinese [
32]. Interestingly, this preference for disyllabic units in word formation leads to a tendency towards creating (mostly, but not exclusively) disyllabic abbreviations for longer words and phrases [
46], as, e.g., 北京大学
Běijīng Dàxué ‘Peking University’ > 北大
Běidà. However, the relative weight of disyllabic words in the lexicon may vary considerably in Sinitic. For instance, Cantonese is known to have a higher proportion of monosyllabic words, compared to SMC, at least in the native layer of its lexicon [
47], as, e.g., 眼
ngaan5 vs. SMC 眼睛
yǎnjīng ‘eye’, or 凳
dang1 vs. 凳子
dèngzi. This has sometimes been interpreted as an areally conditioned difference, since Southern Sinitic should be more closely aligned with MESEA typology, while ‘Altaic’ typology is characterized by more polysyllabic words (see below,
Section 5).
Headedness in Chinese compounds may vary depending on the type of compound, and on its part of speech. As a general rule, endocentric non-coordinate compounds are right-headed (e.g., SMC 面包
miàn-bāo ‘flour-bun, bread’, 心算
xīn-suàn ‘mind-count, do sums in one’s head’), but subordinate verb compounds are left-headed (SMC 开办
kāi-bān ‘open-class, offer a course). In Central and Southern Sinitic, exceptions may be found in the form of left-headed attributive nominal compounds, as, e.g., (Jieyang) Teochew
kiõ33-bo53 ‘ginger-mother, old ginger’,
hou35-21-mui55 ‘rain-minute, drizzle’ [
48] (pp. 65–66). These ‘anomalous’ patterns alternate with others which follow the general trend for headedness described above (e.g., Teochew
t’au55-11-mo55 ‘head-hair, hair’ [
48] (p. 65)). A class of items that are systematically found to the right of the noun they modify in a number of Central and Southern Sinitic varieties is gender markers for nouns denoting animals; see, e.g., Tunxi
ʨie11-21-
kan11 ‘chick-male, rooster’ [
49] (p. 86) and Taiwanese 鷄母
ke-bó ‘chick-female, chicken’. These head-initial structures have sometimes been attributed to contact with non-Sinitic MESEA-type languages of Southern China, which tend towards head-modifier order in word formation [
50].
The status of derivation in Sinitic languages is still debated. Processes which make use of nonconcatenative exponents, or any other means of reduced segmental exponents, as the above-mentioned ‘lexical suprafixing’ of Yue dialects, can be safely described as ‘derivation’, since new words are created by combining lexical morphemes with exponents which, at least synchronically, are definitely non-lexical. This applies also to derivation by means of rhyme change, tone change, and/or vowel lengthening, as, e.g., Boshan 夹
ʨia33 ‘press (from both sides)’ > 夹
ʨia511 ‘clip’ ([
51] p. 316). There are, however, processes of word formation which involve the use of concatenative, syllabic morphemes, which are sometimes treated as compounding, and sometimes as derivation, with no consensus on the borderline between these two processes (for an overview, see [
52]). In Sinitic languages, there are several cases in which a constituent in a word is found in a specific position with a stable meaning, similar to an affix, but with no formal difference from the corresponding lexeme. For instance, in SMC 族
zú ‘clan’ is found as the right-hand constituent in many words indicating a group of people who have something in common, as, e.g., 低头族
dī-tóu-zú ‘lower-head-clan, smartphone addicts’, or 啃老族
kěn-lǎo-zú ‘nibble-old-clan, grownups who live off their parents. Here, we see a semantic evolution of the formative in this construction, if compared to the original meaning of the morpheme, from ‘clan’, ‘ethnic group’ to “a category of people with common characteristics or behaviour” ([
53] p. 264). This type of formatives, which have some of the characteristics of affixes (fixed position, specialized meaning), but are homonymous to existing lexical morphemes, have also been termed as ‘affixoids’ (类词缀
lèicízhuì) or ‘quasi-affixes’ (准词缀
zhǔncízhuì), to distinguish them from ‘proper’ affixes; however, the criteria employed to identify them, and the related definitions, vary considerably in the literature [
12].
Lastly, another distinctive feature of the Chinese lexicon is the obligatory use of noun classifiers whenever an entity is individuated, as in the following SMC example:
| (4) | 三 | *(张) | 桌子 |
| | sān | *(zhāng) | zhuōzi |
| | three | clf | table |
| | ‘three tables’ |
As shown in (3), in the context of counting, a noun in Chinese must be accompanied by a classifier. The omission of 张
zhāng makes the phrase ungrammatical. SMC, and Sinitic varieties in general, are thus often referred to as ‘classifier languages’ [
54,
55]: while in a language like English, a unit of counting is required only with mass nouns (e.g.,
a bucket of sand), in Sinitic, this applies to any noun, be it mass or count. Also, classifier languages make use not only of units of measure, or ‘mensural classifiers’, like
bucket, but also of ‘sortal classifiers’, like 张
zhāng. Mensural classifiers are grammatical items that divide “the inventory of count nouns into semantic classes” [
56]. For instance, the above-mentioned 张
zhāng is generally used for flat objects, while 条
tiáo is used for long and thin objects, as well as some animals. Note that ‘individuation’ does not necessarily involve counting [
57]; indeed, noun classifiers in Sinitic are also used, e.g., with demonstratives.
While Sinitic languages share the basic defining features of classifier languages, their variety and features may vary. Generally speaking, Northern varieties tend to have a smaller inventory of classifiers, compared to Southern varieties [
50], and there are even Sinitic languages that make use of one (sortal) classifier only for all nouns (e.g., Dungan [
58]), thus actually performing the function of individuation without classification. Also, some varieties allow bare classifier phrases, with a broad range of diversity in terms of syntactic distribution and referential properties. For instance, in Cantonese, preverbal bare classifier phrases have definite reference, while postverbal ones may be both definite and indefinite (examples adapted from [
59] p. 118):
| (5) | a. | 本 | 書 | 好睇 | | |
| | | bun2 | syu1 | hou2-tai2 | |
| | | clf | book | good-look | |
| | | ‘The book is interesting’ | |
| | b. | 畀 | 杯 | 茶 | 我 | 飲 |
| | | bei3 | bui1 | caa4 | ngo5 | jam2 |
| | | give | clf | tea |
1sg | drink |
| | | ‘Give me the/a cup of tea to drink’ |
According to Wang [
59], Sinitic languages may be divided into seven types, according to whether they allow preverbal and/or postverbal bare classifier phrases, and according to whether these are definite or indefinite. Generally speaking, preverbal bare classifier phrases are less common than postverbal ones, and the preferential associations are preverbal-definite and postverbal-indefinite. The latter trend is related to the general tendency of Sinitic to have definite noun phrases in the preverbal position and indefinite noun phrases in the postverbal position [
5,
59].
4. Syntax
While most aspects of the syntax of SMC have been studied extensively (see, e.g., the seminal work by Li and Thompson [
60]), Sinitic syntax has received relatively less attention in typology-oriented research. Only in recent years has a relatively significant number of comprehensive ‘grammars’ of Sinitic languages been published, among which the series
Sinitic Languages of China: Typological Descriptions (De Gruyter Brill, edited by Hilary Chappell) constitutes the most ambitious endeavor, offering extensive grammatical descriptions of lesser-known Sinitic varieties, analyzed in a typological perspective.
We mentioned earlier that, as MESEA languages, Sinitic varieties are verb-medial with a canonical (S)VO word order, meaning that, in a ‘canonical’ declarative sentence, the objects, including direct and indirect objects, are commonly placed after the main verb. The following shows two declarative sentences in SMC with a VO word order. In Example (6), the object 饭
fàn ‘rice’ is placed after the transitive verb 吃
chī ‘eat’. In Example (7), both the direct object 一百块钱
yī-bǎi kuài qián ‘one hundred Yuan’ and the indirect object 我
wǒ ‘1
sg’ follow the ditransitive verb 给
gěi ‘give’.
| (6) | 他们 | 吃过 | 饭 | 了 |
| | tā-men | chī-guo | fàn | le |
| | 3m-pl | eat-exp | rice | sfp |
| | ‘They had their meal’ | |
| (7) | 妈妈 | 给了 | 我 | 一百 | 块 | 钱 |
| | māma | gěi-le | wǒ | yī-bǎi | kuài | qián |
| | mom | give-pfv | 1sg | one-hundred | clf | money |
| | ‘Mom gave me one hundred Yuan’ |
From the perspective of word order implicational universals, Sinitic varieties are typologically ‘anomalous’ VO languages, due, e.g., to the relative order of prepositional phrases and verb phrases, or that of relative clauses and their heads. Typologically, VO languages tend to have post-verbal prepositional phrases, yet (adjunct) prepositional phrases regularly precede verb phrases in Sinitic. Preverbal relative clauses in Sinitic also attracted much scholarly attention as a typological ‘puzzle’, since all Sinitic languages possess prenominal relative clauses, contrary to what VO languages overwhelmingly have: indeed, only five out of 879 varieties in Dryer’s [
61] sample have both VO order and postnominal relatives, three of which are Sinitic (SMC, Cantonese, and Hakka). While there are different strategies for relativization within Sinitic, the general rule is that the relative clause virtually always precedes the modified head noun, as in the following Cantonese (adapted from [
62] p. 327) and Shaowu (a transitional Min/Gan dialect; [
63]: 132) examples:
| (8) | 識 | 廣東話 | 嗰 | 的 | 学生 | 考 | 得 | 好 | 啲 |
| | sik1 | gwong2dung1waa2 | go2 | di1 | hok6saang1 | haau2 | dak1 | hou2 | di1 |
| | know | Cantonese | those | cl | student | examine | adv | well | a-bit |
| | ‘The students who know Cantonese did better (on the exam).’ |
Shaowu Transitioanal Gan/Min
| (9) | □ㅤ | 处 | 北京 | 学 | 书 | 个 | 囝儿 |
| | xaŋ35 | thu55-35 | pə53kin21 | xɔ35 | ɕy21 | kəi213 | kin53nə0 |
| | 1sg | loc | Beijing | study | book | rel | son |
| | ‘My son who studies in Beijing…’ |
| | 明朝 | 归 | 来 |
| | maŋ22tɕiau21 | kuei21 | li22 |
| | ‘…will come back tomorrow.’ |
Nevertheless, for some other constructions, variation in word order is attested across Sinitic, often seen as a manifestation of the general typological distinction between Northern and Southern China. As mentioned earlier (
Section 1), different patterns of language contact led Northern Sinitic to develop more OV features, under the influence of the Northern Asian languages, while Southern Sinitic shows more VO features associated with MSEA languages ([
22,
64],
inter alia). These include the comparative construction, the pretransitive object-marking disposal construction, and the ditransitive construction, among others. We will discuss them further, from the perspective of areal typology, in
Section 5.
Moreover, while (S)VO is generally considered to be the basic word order for the vast majority of Sinitic languages, many other options are available. This is often connected with one prominent characteristic of Chinese syntax, namely, topic prominence. Sinitic, like, generally speaking, MESEA languages, are very often classified as ‘topic-comment’ languages (see [
5,
60,
62,
65], among others). The ‘topic’ in topic-comment structures is the element that “sets the spatial, temporal, or other framework in which the predication holds” [
66] (p. 50). Indeed, Morbiato [
67] goes as far as to contend that grammatical relations, especially subject, do not play a significant role in determining word order in SMC; rather, Chinese sentences are best understood as comprising a ‘locatable’ frame-setting topic, and a focal element typically following the verb (see also [
65]). There is indeed a huge variety of constituents that may appear in the topic position in Sinitic, and other possible ‘alterations’ of canonical (S)VO order. Direct objects may be topicalized, as in the following SMC example:
| (10) | 饭 | 他们 | 吃过 | 了 |
| | fàn | tāmen | chī-guo | le |
| | rice | 3pl.m | eat-exp | sfp |
| | ‘As for (their) meal, they had it.’ | |
Also, objects may be fronted without placing them in the sentence-initial topic position:
| (11) | 他们 | 饭 | 吃过 | 了 |
| | tāmen | fàn | chī-guo | le |
| | 3pl.m | rice | eat-exp | sfp |
| | ‘Regarding them, they have had their meal.’ | |
Among the wide variety of sentence constituents that are allowed in the topic position for Sinitic, we also find so-called ‘hanging topics’, having no argumental relationship with the predicate (12) (Cantonese, adapted from [
62] p. 86), and repeated topics with verb fronting (13) (Tunxi, adapted from [
49] p. 277).
| (12) | 而家 | 嘅 | 天氣 | 最 | 易 | 傷風 |
| | ji4gaa1 | ge3 | tin1hei3 | zeoi3 | ji6 | soeng1-fung1 |
| | now | gen | weather | most | easy | catch-cold |
| | ‘It’s easy to catch a cold in this weather.’ |
| (13) | 讲 | 是 | 介式 | 讲 | □ | 不 | 识得 | |
| | kau31 | ɕi24 | ʨiɛ31-ɕin24 | kau31 | uɛ, | pu11 | ɕi5-11-tiʔ5 | |
| | talk | cop | this-way | talk | sfp | neg | know-res | |
| | ‘As for saying, although I have said this, (I’m) not sure...’ |
| | 渠 | 实际 | 考 | 不 | 考 | 得 | 上 | 大学 |
| | kʰə44 | ɕi11ʨɿ31 | kʰə31 | pu11 | kʰə31 | tiʔ5 | ɕiau24 | (tʰo11-21xɔ11). |
| | 3sg | actually | examine | neg | examine | part | res | university |
| | ‘…whether s/he can make it (to the university).’ |
In the Cantonese example (12), the topicalised 而家嘅天氣 ji4gaa1 ge3 tin1hei3 ‘the recent weather’ has no grammatical relationship with the predicate 最易傷風 zeoi3 ji6 soeng1-fung1 ‘(is) easy to catch a cold’, but rather sets the frame for discussing further a condition for getting sick. In the Tunxi example (13), the topicalized predicate 讲 kau31 ‘talk’ is fronted to the beginning of the sentences before being repeated, which then establishes the context for the following discussion (‘As for saying, although I have said this […]’).
Two more key typological features of Sinitic (and, more generally, of MESEA languages; see
Section 2) we shall discuss in this Section are the prominence of aspect over tense, and having a rich inventory of sentence-final particles which carry modal/aspectual meaning. Generally speaking, Sinitic languages lack a grammaticalized category of ‘tense’, i.e., overt, obligatory morphological marking of tense, but rather rely on a variety of factors for interpreting temporal relationships, including, mainly: (grammatical and lexical) aspect, modal verbs, sentence-final particles, time expressions, adverbs, and context ([
62,
68,
69,
70]). As shown in the SMC example below (14), there is no grammatical exponent for past tense on the verb 去
qù ‘go’ itself; rather, past tense reference is established, first and foremost, by the temporal expression 昨天
zuótiān ‘yesterday’. The perfective aspectual marker -了 -
le further suggests it is an accomplished event.
| (14) | 我们 | 昨天 | 去了 | 豫园 |
| | wǒmen | zuótiān | qù-le | Yù-yuán |
| | 1pl | yesterday | go-pfv | yu-graden |
| | ‘We went to Yu Garden yesterday.’ |
Nevertheless, the debate on the ‘tenselessness’ of Sinitic has not settled yet, especially for Northern Sinitic, such as Mandarin dialects of the Central Plains subgroup [
71] and of the Qinghai-Gansu
Sprachbund (Tangwang, Wutun, Gan’gou, Linxia; we will get back to this area in
Section 5), as well as Jin dialects [
72,
73,
74]. In many of those varieties, there are sentence-final particles that have been claimed to convey tense. Typically, the
lai-type (likely cognates to SMC 来
lái) for past tense, and either the
ya/ye-type (for Central Plains Mandarin and Jin, e.g., Shangzhou 呀 [ia] [
75]) or the
li-type (for Qinghai-Gansu varieties, e.g., Minhe Gan’gou 哩 [li] [
76]) for the future tense [
77]. In Yangquan, for instance, propositions are proposed to demonstrate distinctive grammatical marking for the past, present, and future tenses [
71] (p. 67; since no transcription is provided in the source, we added one based on SMC pronunciation):
| (15) | a. | 你 | 干 | 啥 | 来 |
| | | nǐ | gàn | shá | lái |
| | | 2sg | do | what | pst |
| | | ‘What did you do?’ |
| | | | | | |
| | b. | 你 | 干 | 啥 | 嘞? |
| | | nǐ | gàn | shá | lei |
| | | 2sg | do | what | pres |
| | | ‘What are you doing?’ |
| | | | | | |
| | c. | 你 | 干 | 啥 | 呀? |
| | | nǐ | gàn | shá | ya |
| | | 2sg | do | what | fut |
| | | ‘What will you do?’ |
While different proposals remain, according to which these tense markers might have developed as a result of language contact with Altaic languages such as Mongolic or Turkic (especially, the markers for futurity; [
77,
78,
79]), there seems to be no agreement yet as to the origin of these tense markers, which warrants further studies.
Unlike tense, aspect focuses on the “internal temporal constituency of one situation” [
80] (p. 5). Aspect marking is generally considered to be a salient feature across Sinitic, although it appears to be more so in Southern Sinitic than in Northern Sinitic. Among various aspectual categories, the fundamental distinction between perfective and imperfective yields the best researched aspect markers of Sinitic, namely SMC: -了-
le for perfective (16), -过 -
guo for the so-called ‘experiential aspect’ (17), -着 -
zhe for continuous (18), and 在
zài for the progressive (19; for a comprehensive analysis of aspect in SMC, please refer to [
60,
81], among others).
| (16) | 小明 | 早上 | 吃了 | 两 | 个 | 鸡蛋 |
| | Xiǎomíng | zǎoshàng | chī-le | liǎng | gè | jīdàn |
| | Xiaoming | morning | eat-pfv | two | clf | egg |
| | ‘Xiaoming ate two eggs (this) morning.’ | | |
| (17) | 他 | 结过 | 婚 |
| | tā | jié-guò | hūn |
| | 3sg.m | tie-exp | marriage |
| | ‘He once got married.’ |
| (18) | 门 | 关着 |
| | mén | guān-zhe |
| | door | close-cont |
| | ‘The door is closed.’ |
| (19) | 外婆 | 在 | 打 | 麻将 |
| | wàipó | zài | dǎ | májiàng |
| | maternal.grandma | prog | hit | mahjong |
| | ‘Grandma is playing mahjong.’ |
Southern Sinitic varieties, as hinted at above, are reported to possess more complicated aspectual systems, both in terms of forms and of ranges of functions for aspectual markers. For descriptions of aspectual markers/systems of specific Southern Sinitic varieties, see, e.g., [
82,
83,
84,
85]; comparative studies of Sinitic aspect systems include [
86,
87,
88,
89], among others. For instance, Cantonese possesses an habitual aspectual marker -開 -
hoi1 to denote a recurrent habitual activity, as shown in (20) ([
62] p. 241); this category of habitual aspect is not found in SMC, nor in many other Sinitic languages.
| (20) | 佢 | 睇開 | 中醫 | 嘅 |
| | keoi5 | tai2-hoi1 | zung1-ji1 | ge3 |
| | 3sg | see-hab | Chinese-doctor | sfp |
| | ‘S/he usually goes to a Chinese doctor.’ |
Another peculiar phenomenon is found in some Jianghuai and Southwestern Mandarin dialects, in which progressive aspect markers seemingly sharing the same etymon as SMC 在
zài are actually found after the verb phrase, unlike the ‘default’ pre-VP order for this type of exponent seen in SMC (19) and most other Sinitic languages. See, for instance, the following Huoshan example (22) [
90] (p. 10; since no transcription is provided in the source, we added one based on SMC pronunciation):
| (21) | 他 | 打 | 麻将 | 在 |
| | tā | dǎ | májiàng | zài |
| | 3sg.m | hit | mahjong | prog |
| | ‘He is playing mahjong.’ |
Also, interestingly, in some varieties, double marking (i.e., pre-VP and post-VP 在
zài) is also attested, as in Baokang (20) [
86] (p. 56; since no transcription is provided in the source, we added one based on SMC pronunciation), showing ‘hybridization’ between a dialectal construction and the pattern of the standard language (i.e., SMC):
| (22) | 现在 | 还 | 有 | 谁 | 在 | 看 | 电视 | 在? |
| | xiànzài | hái | yǒu | shuí | zài | kàn | diàn-shì | zài? |
| | now | still | have | who | prog | watch | electrical-vision | prog |
| | ‘Who is still watching TV now?’ |
Due to space restrictions, we cannot offer here a comprehensive discussion of the similarities and differences among the aspectual systems of Sinitic languages; therefore, in
Table 2, we limit ourselves to proposing a simple comparison between the aspectual systems of one representative variety each for Northern Sinitic (SMC), Central Sinitic (Tunxi), and Southern Sinitic (Cantonese).
Another prominent feature of Sinitic aspectual markers is that they are generally considered to have grammaticalized from lexical verbs, a process in which “lexical items and constructions come in certain linguistic contexts to serve grammatical functions, and, once grammaticalized, continue to develop new grammatical functions” [
91] (p. 18), although the degree to which such aspectual markers have grammaticalized may vary across different varieties. While consensus is reached on the source lexical verbs of most SMC aspect markers, e.g., the continuous aspect marker -着 -
zhe ‘
cont’ grammaticalized from the stative verb 着
zhuó ‘to adhere to’ [
92], and the perfective marker -了 -
le ‘
pfv’ grammaticalized from the action verb 了
liǎo ‘to finish’ [
93], and sometimes also of other major Sinitic varieties, such as the habitual aspectual marker -開 -
hoi1 ‘
hab’ in Cantonese, grammaticalized from the action verb 開
hoi1 ‘open’ [
94], the etyma of aspectual markers in the lesser-studied Sinitic varieties remain a topic for further studies.
Yet another aspect related to aspect marking in Sinitic is sentence-final particles (SFPs). As mentioned above, sentence-final particles constitute an evident areal feature of languages spoken in the MESEA region. The SFPs in MESEA, or what Panov [
95] (p. 22) refers to as “Asian style” FPs, differ from those of the languages of Europe in that they function to express the speakers’ attitudes towards the propositions per se, as well as towards the addressees, instead of reflecting how a proposition is related pragmatically to the previous proposition [
96]. The functions of SFPs in Sinitic, as in many other Asian languages, comprise expressing illocutionary force, sentence moods, epistemic modality, evidentiality, and allocutivity, as well as the lesser-discussed aspect marking function. In terms of morphological form and word order, SFPs in Sinitic mostly carry neutral tones and appear precisely as the last morpheme of an utterance. For SMC, Li and Thompson [
60] (p. 238) identify six major SFPs, among which four of them express information on the speaker–hearer relationship in a given context, ranging from response to expectation (呢
ne), seeking agreement (吧
ba) to ‘friendly reminder’ (喔
ou) and softening the tone (啊/呀
a/
ya). Besides allocutivity, SFPs are also employed to mark sentence mood (e.g., 吗
ma for questions), and a
mélange of pragmatic functions highlighting the current relevance of the proposition (了
le). See the following examples showing the contrast between 吧
ba and 呢
ne (adapted from [
60] (pp. 304, 310)):
| (23) | 她 | 很 | 好看 | 吧? |
| | tā | hěn | hǎo-kàn | ba |
| | 3sg.f | very | good-look | sfp |
| | ‘She is pretty (don’t you agree?)’ |
| (24) | 他 | 很 | 开心 | 呢! |
| | tā | hěn | kāixīn | ne |
| | 3sg.f | very | happy | sfp |
| | ‘He is happy (hence no need to worry).’ |
Besides expressing types of speech acts, a less-studied function of SFPs is aspect marking, which is reportedly attested in some Sinitic varieties. For instance, in Wuhan (23) [
86] (p. 57; since no transcription is provided in the source, we added one based on SMC pronunciation), the first clause ends with the SFP 在
zài marking the progressive aspect, similarly to the Huoshan and Baokang examples seen above (21–22)
. Notably, besides marking progressive aspect, this Wuhan sentence-final particle clearly implies the speaker’s discontent towards the addressee, indirectly expressing the wish that s/he could stop what s/he is currently doing.
| (25) | 我 | 在 | 听 | 老师 | 讲课 | 在, |
| | wǒ | zài | tīng | lǎoshī | jiǎng-kè | zài |
| |
1sg | prog | listen | teacher | teach-class | sfp.prog |
| | | | | | | |
| | 你 | 莫闹 | | | |
| | nǐ | mò-nào! | | | |
| |
2sg | neg.imp-make.noise | | | |
| | ‘I am listening to the teacher, don’t make noise!’ |
The dual function of the sentence-final particle 在 zài ‘sfp.prog’ as both an aspect marker and a sentence-final discourse marker arguably reflects a ‘bridging context’ between the aspectual function and illocutionary force. This aspect of the evolution of SFPs merits further study, concerning the interplay between tense, aspect, and mood.
5. Areal Typology
The internal classification of Sinitic languages has attracted lots of scholarly discussion, which revolves around two interconnected parameters—phylogenetic classification and areal typology. The traditional ‘dialectal’ classification of Chinese started with Li Fang-Kuei [
97], when he categorized Chinese into eight ‘dialect’ groups, primarily based on phonological criteria, as pointed out in
Section 1. His classification was later modified by Yuan et al. [
98] to seven groups, paving the way for modern “dialectological” research, such as the first and second editions of the Language Atlas of Chinese Dialects [
99,
100], which acknowledges the ten major groups of Sinitic introduced in
Section 1. Needless to say, such a traditional approach to Sinitic languages has its own merits, but, as pointed out earlier, phonological traits alone may not suffice to highlight distinctions among a language family with such a remarkable internal diversity, calling for an alternative perspective in accounting for the universality and diversity among Sinitic: namely, an areal-typological approach.
As mentioned in
Section 1, the areal-typological approach to the analysis of Sinitic began with Hashimoto’s [
22,
101] north–south division among Sinitic languages along the geographic Qinling–Huaihe Line. His ‘Altaicization–Taicization’ hypothesis proposed that Northern Sinitic languages converge more towards ‘Altaic’ typology (i.e., Tungusic, Mongolic, and Turkic), while Southern Sinitic languages affiliate with Mainland Southeast Asian languages to the south. The key parameters for this Northern–Southern distinction are presented in
Table 3.
Following Hashimoto’s proposal, Norman ([
1,
103]) added a third zone to his north–south divide, namely “Central” Sinitic, which exhibits hybrid features of the northern and southern varieties. Norman [
103] listed ten (mainly) phonological and lexical diagnostic traits, which he later expanded to 15 ([
1]), to be employed for an ‘intuitive’ classification between Northern (here, essentially Mandarin), Central, and Southern Sinitic: languages with all or nearly all of the 15 traits belong to the Northern group; those which have none or nearly none of the traits belong to the Southern group; those which possess only a part of those 15 traits, and are thus ‘hybrid’ between the Northern and Southern group, represent the Central zone. In
Table 4, we give the list of the features proposed by Norman for his model of areal classification.
Leveraging on the feature-based methods of these two classic works, Chappell [
102] identified three grammatical constructions, namely pretransitive differential object marking, passives, and comparatives of inequality, through which she categorizes Sinitic languages into four convergence zones: Northern, Central Transitional, Southeastern, Southwestern, and Far Southern areas. On the other hand, Iwata [
104,
105,
106] adopted a different approach to the areal classification of Sinitic, investigating the isoglosses of 49 and 46 lexical items in his Interpretative Maps of Chinese Dialects (vol. 1 [
105] in 2009, vol. 2 [
106] in 2012). He proposes a ‘Yangtze-type’ cluster, a much neglected topic in the areal studies of Sinitic, according to which the Yangtze Plains represent a source of regional innovation, providing impetus for linguistic diffusion both to the north and the south.
In recent years, areal-typological studies of Sinitic have witnessed new developments, both in terms of methodological breakthrough and scope of study. Methodology-wise, scholars resort to quantitative methods in areal-typological studies of Sinitic, such as the NeighborNet algorithm [
107] built in the computational phylogenetic program SplitsTree4 [
108]. For example, Szeto and Yurayong [
50] identify four areal groups in Sinitic, namely Northern, Transitional, Central Southeastern, and Far Southern, based on a sample of 30 typological features, applied to a sample of more than 360 languages. Yang et al. [
109] identified a north–south continuum of lexical differences, through phylogenetic analysis and admixture inference on 1018 lexical traits. Huang et al. [
110] concluded with a six-cluster analysis, based on a large-scale quantitative analysis with a coverage of 930 data sites and 510 typological features extracted from the Linguistic Atlas of Chinese Dialects [
111]. In addition to methodological innovation, linguistic areas shaped by prolonged areal convergence between Sinitic and non-Sinitic languages came under the spotlight too, such as the Gansu-Qinghai (or Amdo) Sprachbund ([
112,
113,
114]) and the Western Lingnan Sprachbund ([
115]).
To conclude, areal-typological studies of Sinitic languages (or, also, of languages of China in the broad sense) have begun to shed light on the synchronic distribution of linguistics features across Sinitic, as well as on their diachronic development and directions of areal diffusion, as a result of the complicated interplay between geoecological constraints, historical evolution of languages, and demic diffusion of population. However, more research is necessary to reconcile the existing classifications and analyses, which do not completely overlap, and to highlight further aspects of the history of Sinitic languages and of their speakers.