1. Introduction
In the long-standing research on numeral classifiers, it is widely acknowledged that the most common and fundamental functions of numeral classifiers are individuation and classification. The former means classifiers are used to make count nouns enumerable by individuating (
Greenberg 1972) or atomizing (
Chierchia 1998) them, while the latter refers to the fact that classifiers form a semantic system to classify nouns according to their semantic properties like animacy, shape, consistency, size, etc. (
Aikhenvald 2000;
Allan 1977;
Denny 1976). As demonstrated in (1), in Mandarin Chinese, the classifier
fēng ‘seal’ is used in the context of counting letters, given that letters used to be sealed at the time when that classifier was introduced.
(1) | Mandarin [NUM CL N]: |
| a. | sān | fēng | xìn |
| | three | CL: sealed.item | letter |
| b. | *sān | xìn |
| | three | letter |
| | ‘three letters’ |
It is also well-known that numeral classifiers can go beyond the functions of individuation and classification and obtain additional functions in many languages of East and mainland Southeast Asia (EMSEA). According to
Bisang (
1999, p. 115), “classification can be employed to compare one particular sensory perception and its properties to the properties of other sensory perceptions in order to identify that particular perception by subsuming it under a certain concept”. This operation is called identification, and identification forms the point of departure for CLs to take on the function of referentialization. Reference, according to
Searle’s (
1969) definition, is an act of identifying some entity that the speaker intends to talk about. The existing literature shows that numeral classifiers can contribute to the process of referentialization in various ways.
In studies of Sinitic, the referential function of CLs is typically concerned with the marking of (in)definiteness in the bare classifier construction [CL N], which comprises only the CLs and the head noun that follows (for a discussion of the referential functions of CLs in some non-Sinitic languages, cf.
Section 4.3). The syntactic position and its semantic interpretation are language-specific, with considerable influence exerted by pragmatic factors and word order. For instance, in Mandarin Chinese (2a), the bare classifier construction [CL N] can exclusively occur in the postverbal position, where it only conveys indefinite interpretation. In contrast, in Wu (2b), the bare classifier construction can be employed on both sides of the verb, with the pre- and postverbal [CL N] exclusively conveying definite and indefinite interpretation, respectively. Cantonese (2c) further complicates the picture as the construction can manifest on both sides of the verb but with different interpretations: the preverbal [CL N] only denotes definiteness, while the postverbal [CL N] can be either definite or indefinite, depending on the context.
(2) | Some Sinitic languages: [CL N] |
| a. | (*ge) | laoban | mai | le | liang | che. | (Mandarin) |
| | CL | boss | buy | PFV | CL: vehicle | car | |
| | ‘The boss bought a car.’ |
| b. | kɣ | lɔpan | ma | lə | bu | tshotshɿ. | (Wu) |
| | CL | boss | buy | PFV | CL: vehicle | car | |
| | ‘The boss bought a car.’ |
| c. | go | louban | maai | zo | ga | ce. | (Cantonese) |
| | CL | boss | buy | PFV | CL: vehicle | car | |
| | ‘The boss bought a/the car.’ |
| (Li and Bisang 2012, p. 2) |
Wang’s (
2015) typology of CL systems based on an analysis of 120 Sinitic languages reveals the existence of seven possible types. These types are established with reference to the specific combination of syntactic position relative to the verb and the semantic interpretation of the bare classifier construction in terms of [±definite] (cf.
Table 1). According to Wang’s categorization, Mandarin is a Type VII language, the Wu dialect of Fuyang aligns with Type IV, and Cantonese belongs to Type III.
Remarkably, the majority of Min languages, including Xiamen Southern Min (XSM)
1 as the focus of the present study, are classified as Type VI languages, which prohibit the use of a bare classifier construction, regardless of the position relative to the verb. These languages use [DEM CL N] and [‘one’ CL N] to denote definiteness (3a) and indefiniteness (3b), respectively. In both constructions, if the referent is identifiable from the context (e.g., through a clear pointing gesture or previous mentions), the DEM and the CL can both be omitted, i.e., a bare noun can be used. Like most Sinitic languages, XSM has a two-term demonstrative system with the proximal demonstrative
zīt ‘this’ and the distal demonstrative
hīt ‘that’, which generally identify items according to their distance from the speaker (
Lien and Chiu 2014).
(3) | Xiamen Southern Min: |
| a. | [DEM CL N] |
| | *(hīt) | ziāh | ziăo-ă | bē | kì | lo. |
| | DEM | CL: small.animal | bird-SUF | fly | go | PRT |
| | ‘The bird flew away.’ |
| b. | [‘one’ CL N] |
| | yī | gâng | gguă | gǒng | *(zít) | hâng | dâizì. |
| | 3SG | with | 1SG | tell | one | CL: matter | thing |
| | ‘He told me something.’ |
| (Wang 2015, p. 122)2 |
However, this does not mean that referentiality does not manifest itself in the use of CLs in XSM, because the selection of specific vs. general CLs also contributes to referentiality. Specific CLs are selected according to semantic properties of nouns and are limited to a specific and often small set of nouns, while general CLs can basically co-occur with any count noun and are the only possible CL for many nouns. In XSM, the general CL is
é. It is often written with the Chinese character 個, even though its lexical source is still controversial (
cf. Chappell 2018;
Li 2007). An example of the substitution of a specific CL by the general CL is given in (4a) and (4b).
(4) | Xiamen Southern Min: |
| a. | zít | kiā | cēhbāo |
| | one | CL: bag | schoolbag |
| b. | zít | é | cēhbāo |
| | one | CL: gen | schoolbag |
| | ‘one schoolbag’ |
| (Zhou et al. 2006, p. 236; glossed by us) |
According to the fieldwork data of one of the authors, older speakers (over 69) of XSM are actually rather rigid in the assignment of specific CLs and do not allow the substitution of a specific CL by the general CL with some particular nouns. However, young speakers (under 30) of XSM tend to accept the substitution of a specific CL with the general CL, although this is not absolute. Since the data collected in this experiment show that all of the referents are mentioned with the general CL by many of the speakers, we believe that it is justified to claim that at least among young speakers of XSM, the substitution of a specific CL with the general CL is grammatical.
As for the conditions under which that substitution happens, there has been some research by
Erbaugh (
2002),
Erbaugh and Yang (
2006), and
Erbaugh (
2013), indicating that the selection between specific and general CLs is associated with information structure, syntactic function, and number. To be more specific, specific CLs are assumed to be used with the information focus in the postverbal object position, while higher numbers typically appear with the general CL.
While the association of the postverbal position with the information focus follows a universal tendency (
Kiss 1998),
Erbaugh’s (
2002,
2013) generalization that specific CLs tend to be used at the first mention of an object may be due to the fact that her analysis is based on endophoric (text-internal) reference, as it is found in her narrative texts based on the Pear Story (
Chafe 1980). Even if her results are statistically significant, it will be seen in our study that this is not replicated in the case of exophoric (text-external) reference, which is characteristic of dialogic texts. Since the exact use of specific CLs vs. the general CL in exophoric reference is largely unknown, this study will investigate the referential function of CLs in an exophoric context. Exophoric or dialogic contexts are generally characterized by the interaction of the speaker and hearer and, in the context of information structure, the assessment of the identifiability or accessibility of a concept in a concrete discourse situation (e.g.,
Lambrecht 1994). Thus, an utterance made by the speaker includes the hearer in the sense that the speaker tries to assess the degree to which a concept is referentially activated/accessible to the hearer. Given that dialogue is going on through time and is characterized by role change between the speaker and hearer (for a more detailed explanation, cf.
Levinson 2016), the assessment of identifiability/accessibility requires permanent updates (cf.
Section 3.3.1).
Based on these explanations, we summarize our research questions as follows:
The factors that may affect the selection of the options given in question 2 in the experimental setting for identifying objects in space are as follows:
- (a)
Distance: the distance of the referent from the speaker.
- (b)
Visibility: the visibility of the referent from the speaker’s point of view, i.e., whether the speaker’s view of the referent is obstructed by another item, so the speaker cannot see the referent directly from where she/he is.
- (c)
Uniqueness: the uniqueness of the referent within the speaker’s visual scope, i.e., is there only one object associated with the same noun [+unique] or is there more than one adjacent object of the same kind in the speaker’s scope [−unique]?
The choice of factors (a) distance and (b) visibility is based on
Dixon’s (
2003) major parameters of reference for demonstratives, as they are common categories encoded in forms which can have a deictic function. As this study will show, such forms include not only DEM, but also CLs. The choice of factor (c) uniqueness is based on the fact that unlike demonstratives, XSM CLs have additional functions of individuation and the marking of singular
3. Given that CL use is obligatory with demonstratives, we believe that the different semantics of general vs. specific CLs may affect classifier selection depending on [±uniqueness].
Moreover, to understand to what extent the presence of DEM would influence the result of our investigation on CLs, we conducted a binary logistic regression analysis of the potential influence of distance, visibility, and uniqueness on the distribution of the proximal and distal DEM (see
Section 3.1 for more information on this type of test). The results show that DEM is sensitive to distance (
p < 0.001) in XSM, but to neither visibility (
p = 0.659) nor uniqueness (
p = 0.469), which further corroborates
Lien and Chiu’s (
2014) observation.
The present study is based on a cognitive experimental approach and involves both quantitative and qualitative analyses. The results will provide significant evidence for the higher probability to use the [DEM CL N] construction if the referent to be identified is visible. As for the probability to use a specific or general CL, it is associated with distance and uniqueness in an intertwining way.
The remainder of this paper is structured as follows:
Section 2 presents the research method used for the experiment and is followed by the results and the analysis in
Section 3. The discussion is found in
Section 4. It presents the cognitive motivations for using the [DEM CL N] construction and the use of a specific CL vs. the general CL in contexts of contrastive focus. In addition, it discusses three other languages in which CLs are used in contrastive contexts (Thai, Vietnamese, and Ponapean).
Section 5 will conclude this paper.
5. Conclusions
This study started out from the question of whether CLs in Xiamen Southern Min can be associated with reference if they have no bare classifier construction [CL N] associated with definiteness/indefiniteness. To test this, we looked at the difference between specific CLs and general CLs and their referential potential in an exophoric context provided by the “Hidden color-chips” task in an environment characterized by the three factors of distance, visibility, and uniqueness. This method contrasts with previous studies based on endophoric contexts as they are found in narratives (
Erbaugh 2002,
2013;
Erbaugh and Yang 2006). Our quantitative and qualitative analyses showed that the use of specific vs. general CLs is associated with reference in the following specific pragmatic contexts:
Compared with using a bare noun, speakers are clearly more likely to use the DEM construction when the object is clearly visible to the speaker.
In cases with [+unique], speakers are significantly more likely to use a specific CL when the object has [−distant] than when it has [+distant].
In cases with [+distant], speakers are significantly more likely to use a specific CL if the referents have [−unique], and a general CL if they have [+unique].
Additionally, the qualitative analysis shows that if the object has [−unique] and the first reference with a general CL is not sufficient to identify it, a specific CL is much more likely to be used in later mentions (Effect IV).
Effects I and II can be explained by the fact that referents which are cognitively more accessible (i.e., close and visible) tend to be further specified. Effect III and the qualitative analysis reveal that specific CLs can deictically mark contrast in the cases of [−unique] and [+distant]. The functional extension from specific CL to contrastive marker is arguably motivated by the insufficiency of referential clues in the demonstrative construction in the specific context with [+distant] and [−unique].
Moreover, this paper briefly introduced another environment [ADJ CL N] where specific CLs are used for contrast in XSM and illustrated the contrastive use of CLs in three other languages, i.e., Thai, Vietnamese, and Ponapean. This illustration shows that both specific and general CLs are potential sources of contrastive markers, but their grammaticalization follows two mutually exclusive pathways.