4.1. Verbal Pragmatic Markers in Feedback Sequences
Out of the 431 feedback sequences identified in the data, 231 (53.6%) include one or several verbal pragmatic markers. As a reminder, this figure also includes cases where the verbal markers co-occur with a visual-gestural form. The most frequent verbal markers are
ouais ‘yeah’ (76 occurrences),
mh (18),
ah oui ‘ah yes’,
mhm (11),
d’accord ‘alright’, and
OK ‘okay’ (10). In addition to these expressions that typically express agreement or alignment, pragmatic markers in feedback sequences also include more standard forms like
mais ‘but’,
bah ‘well’,
alors ‘so/well’, or
parce que ‘because’, but to a much smaller extent. Only
mais stands out with 12 occurrences, which could be due to its affinity with the expression of (opposing) opinions (see qualitative analysis in
Section 4.3 below).
If we break down these figures by feedback type, we can see that verbal markers are present in 58.62% of all sequences expressing alignment, and 43.26% of those expressing affiliation. These proportions go up to 75% and 54%, respectively, if we restrict to the 339 audible and multimodal sequences (that is, excluding visual-only feedback sequences). The difference between alignment and affiliation is statistically significant according to a z-score test for two population proportions (
z = 4.116,
p < 0.001). It shows that verbal pragmatic markers are very frequent in all types of feedback, but particularly prevalent in expressing alignment. Listeners communicate that they are understanding and following by using markers such as
okay or
mhm. The relatively smaller proportion in affiliation can be explained by the role of laughter (coded as audible or multimodal) in this type of feedback. Listeners often laugh to react to a joke or a funny anecdote, thereby affiliating with the speaker’s stance (“I agree that this is funny”), which restricts the opportunity for verbal pragmatic markers to occur. A second explanation relates to (
Kosmala & Crible, in press) finding that affiliation resorts to specific (rather than generic) expressions more than alignment in order to evaluate or elaborate on some stance or opinion. Pragmatic markers, being generic by definition, will thus occur relatively less frequently in these specific sequences of affiliation.
Zooming in on the particular pragmatic markers that occur in the two sequence types, it appears that the same forms are often used for alignment or affiliation.
Ouais is the most frequent marker overall and in both types, either in isolation or combined with other verbal markers (e.g.,
ah ouais,
bah ouais,
ouais voilà,
ouais ouais OK, etc.). In total, it appears 82 times in alignment and 31 times in affiliation. Examples (1) and (2) illustrate its use in both sequence types.
(1) | F2 j’ai vu le premier épisode |
| F1 ouais |
| F2 avec Cécile de France |
| F2 I saw the first episode |
| F1 yeah |
| F2 with Cécile de France |
(2) | C1 une seconde femme un peu parce que |
| C2 tu trouves ? ((frown)) |
| C1 bah elle est très maternelle avec lui |
| C2 ouais ((frown)) c’est vrai qu’elle le gronde en plus au début |
| C1 a second wife a bit because |
| C2 you think so? ((frown)) |
| C1 well she’s very maternal with him |
| C2 yeah ((frown)) it’s true that she scolds him also at the beginning |
Ouais is an informal variant of oui ‘yes’, which is less frequent and appears a total of 42 times in alignment and 19 in affiliation. The difference between the two sequence types is thus smaller for oui than for ouais, with relatively more oui in affiliation, which suggests that the former might be closer to its original affirmative semantics while the latter has developed more pragmatic uses, such as expressing mutual understanding.
The majority of the verbal pragmatic markers occur in both alignment and affiliation and tend to be more frequent in the former because alignment is more frequent overall. Exceptions are
bah ‘well’ alone or with other verbal markers (
bah ouais,
bah oui,
oui bah ouais du coup), which is much more frequent in affiliation (13 vs. 3 occurrences),
mais ‘but’ which has 6 occurrences in both, and rare forms such as
exactement ‘exactly’ or
si ‘yes’ (after a negation) which are only used once, in affiliation. The affiliative preference of
bah reflects its strong connection with
ouais and
oui, where it reinforces the original affirmative semantics of agreement.
Bah seems to add an evidential dimension to agreement (something like “obviously yes”) and thus expresses the speaker’s stance in a more assertive way, which is particularly visible in Example (3) where it is also combined with the expression of strong agreement
exactement ‘exactly’.
(3) | F1 du coup y’a plein de références comme ça euh (0.299) ‘fin du coup j’ai l’impression d’être euh tu sais décalée dans le temps euh |
| F2 bah oui exactement j’avais la même impression quand je regardais en différé |
| F1 so there are a lot of references like that uh (0.200) I mean so I feel like I’m not in sync in time euh |
| F2 well yes exactly I had the same feeling when I watched recorded shows |
On the other hand, some (combinations of) markers of two or more instances occur exclusively in alignment sequences. This is primarily the case of
d’accord ‘alright’, which, despite what its original semantics of
accord ‘agreement’ suggests, has 13 occurrences in alignment and none in affiliation. This shows its high degree of grammaticalization and specialization in the expression of mutual understanding. Other markers exclusive to alignment are the reduplicated
ouais ouais (4 occ.) and
oui oui (5 occ.), with one notable exception where
oui is repeated four times in the same sequence to express affiliation:
(4) | C1 bon après sur scène je pense que c’est plus abordable que |
| C2 oui vachement plus oui oui oui |
| C1 well but on stage I think it’s more accessible than |
| C2 yes much more yes yes yes |
Apart from this exception, reduplicated or repeated ouais and oui express alignment in the data, which is also somewhat counter-intuitive, as was the result for d’accord. One possible explanation is that listeners will rush to signal their understanding with multiple generic markers so that the conversation can continue as soon as possible (i.e., “no need to keep speaking about this, I understand”), whereas affiliation tends to resort to more specific forms of feedback or to combinations of one generic form (e.g., oui) followed by more specific elaborations of stance (cf. “oui vachement plus” in Example 4). These tentative interpretations are only based on our small dataset and should be further substantiated.
All in all, verbal pragmatic markers are highly present in feedback sequences, and relatively more so in alignment than in affiliation, with a few notable exceptions. They mainly consist of markers of agreement that are either used in their original meaning or more abstractly to signal understanding rather than agreement. It appears from this first analysis of the verbal modality that these pragmatic markers very often tend to combine with each other, and that they are very versatile in their meaning-in-context—observations that will also apply to our multimodal analysis in
Section 4.3.
4.2. VisualGestural Forms in Feedback Sequences
We now turn to the visual-gestural forms that are used in feedback sequences, following a similar objective of identifying the most frequent forms and those that are favored to express alignment or affiliation. Again, as a reminder, these data also include forms that co-occur with a verbal-vocal element. In our data, two-thirds (283) of the 431 sequences involve visible feedback markers, mostly non-manual signals. This strikingly high rate is similar within alignment (68.25%) and affiliation (60.28%) (not significant: z = 1.6394, p = 0.101).
Table 1 shows all visual markers and combinations thereof across sequence types (in both visible-only and multimodal sequences).
In aligning sequences, head nods take up over half of all visual forms, while all others are much less frequent. Nodding is a highly conventionalized form to signal alignment without interrupting the speaker, thus maximizing conversational efficiency (
Stivers, 2008). It is also often found in affiliative sequences, where it is the third most frequent device. Nodding often combines with smiling, which is another silent, non-interrupting feedback device.
In affiliative sequences, there is a more balanced use of four main non-manual forms, namely raised eyebrows (28 occurrences in total), smile (18), nod (17), and head tilt (9). This greater variety reflects the wider panel of stance and emotions that can be communicated in feedback, agreement and disagreement, but also surprise, humor, disapproval, concern, etc. These would be expressed through different types of eye or head movements, while alignment is more binary (“I understand” or “I don’t”) and primarily positive, which explains the prevalence of nods in these sequences. Nevertheless, all visual-gestural forms are polyvalent between alignment and affiliation, with at least one occurrence in each type, except for single-occurrence combinations. The variety of visual-gestural forms of feedback, especially in affiliation sequences, attests to the richness of the visual modality and all the embodied resources it offers to express meaning.
Because of this formal variety in our small sample, it is difficult to identify visual/gestural forms that might qualify as conventionalized pragmatic markers. Two exceptions stand out. The first is nodding, which we already described as highly frequent and specialized in the expression of alignment. The combination of high frequency and stable meaning (also observed by
Cerrato, 2005) indeed supports its classification as a pragmatic marker, along with its verbal equivalents
mhm or
ouais.
The second pragmatic marker candidate in the visualgestural modality is raised eyebrows. This is the second most frequent form in feedback sequences and, although it is evenly spread across alignment and affiliation, it seems to display a constant core meaning of (inter-)subjective distance, which can then take on the particular values of surprise or sarcasm.
Debras (
2025) recently reached a similar conclusion and considered raised eyebrows as a visual pragmatic marker on the basis of its intersubjective core function of expressing a “differential in expectations”. This is illustrated in (5) and (6):
(5) | C2 par exemple enfin tu te dis que même si le titre c’est les fourberies de Scapin donc on parle de Scapin majoritairement |
| C1 | ((nods)) |
| C2 mais en fait il apparaît pas énormément qu’à chaque fois il faut une tierce personne |
| C1 | ((head tilt+ raised eyebrows)) |
| C2 for instance well you realize that even if the title is “Scapin’s tricks” so we talk mostly about Scapin |
| C1 | ((nods)) |
| C2but actually he doesn’t appear so much every time there has to be another person |
| | ((head tilt+raised eyebrows)) |
(6) | C2 c’est mes rappels de CM2 |
| C1 ((laughter)) ((raised eyebrows)) ah tu l’as vu en CM2? |
| C2 ouais |
| C2 it’s my memories from 5th grade |
| C1 ((laughter)) ((raised eyebrows)) ah you saw it in 5th grade? |
| C2 yeah |
In (5), C1 expresses that she slightly disagrees with C2, or at least is not entirely convinced yet by his argument that Scapin never appears on his own in the play, through the co-occurrence of a head tilt and raised eyebrows. She, thus, visually displays some distance from the other speaker’s stance. Alignment and understanding are not at play here since both speakers are quite familiar with the topic discussed. Similarly, in (6), C1 expresses her surprise that C2 studied the play they are talking about at such a young age, again based on her knowledge of the difficulty of this play. The core meaning of (inter-) subjective distancing is thus shared by both examples, which argues in favor of a treatment of raised eyebrows as a visual pragmatic marker.
The combined criteria of high frequency and stable pragmatic meaning thus suggest that nods and raised eyebrows might be potential visual pragmatic markers, on a par with verbal markers such as
OK or
mhm. However, from a mono-modal perspective alone, visual forms still exhibit a wide formal and functional diversity that can hinder the comparison with verbal pragmatic markers. Let us not forget that our study only targets occurrences within feedback sequences, and it is highly likely that nods or raised eyebrows would display many more configurations in other contexts (cf.
Cerrato, 2005 on the other values of nodding). This issue of categorization is, therefore, best tackled from a multimodal perspective, as in the next section, where the combination with verbal-vocal markers further narrows the selection of potential multimodal gestalts to more restricted—and therefore more consistent—contexts of use.
4.3. Multimodal Gestalts
We now focus on the multimodal sequences, and particularly those that combine a visual-gestural element with a verbal form classified as a pragmatic marker. This concerns 153 sequences (66.2%) out of the 231 sequences that contain one or more verbal pragmatic markers, a high figure that already attests to the prevalence of these combinations. Our goal here is to explore whether some patterns are conventionalized enough to be considered as multimodal pragmatic markers or multimodal gestalts.
One criterion to determine that an expression or, in our case, a multimodal pattern, is a pragmatic marker is its high degree of grammaticalization, which requires widespread use in the linguistic community and, therefore, a high frequency. In our limited dataset, following this quantitative criterion alone, only combinations of [ouais + nod] would qualify as multimodal gestalts with at least 19 occurrences, more if we include combinations with additional markers such as [ah ouais + nod] or [ouais + nod + smile]. In the previous section, we observed that nodding is pervasive and a prime device to silently communicate alignment. These multimodal sequences also attest to a non-silent version of nodding where it is combined with the most frequent agreement marker ouais. Preference for the visual or multimodal configuration might be idiosyncratic or depend on factors beyond our scope here. In either case, the two components have a similar semantic charge of agreement.
Given the small size of our data and the variety of forms used in the feedback sequences, it seems relevant to take into account not only frequency but also non-literal meaning as a cue that a particular form is used as a pragmatic marker. As we noted repeatedly above, many verbal and visual-gestural forms are polyvalent and not restricted to their encoded semantics, even displaying an opposite meaning at times. This polysemy and distance from the original literal meaning is another feature of grammaticalization that characterizes many pragmatic markers (e.g.,
well,
actually, etc.). In feedback sequences, non-literal meaning can often be observed with the marker
oui ‘yes’, which, combined with some visual-gestural marker, obtains a negative value of disaffiliation. There are two interesting examples of this phenomenon in our data (7)–(8).
(7) | C1 c’est elle qui a eu l’idée de d’ouvrir les yeux de d’Argan sur euh Béline en faisant tu sais le mort |
| C2 ah oui oui avec ce jeu-là oui c’est vrai c’est vrai |
| C1 voilà mais euh |
| C2 ((head tilt)) oui (0.150) mais mm ((wince + gaze away)) mais ça c’est plus pour ((brings her hand to her chin)) |
| C1 it’s her who had the idea to open Argan’s eyes about Béline by you know playing dead |
| C2 ah yes yes with this game yes it’s true it’s true |
| C1 right but uh |
| C2 ((head tilt)) yes (0.150) but mm ((wince + gazes away)) but that is more to ((brings her hand to her chin)) |
In (7), speaker C2 reluctantly or hesitantly agrees with C1 about the role of a character in one of Molière’s plays. The negative value of the affiliation is conveyed by the head tilt, which slightly precedes the verbal marker. The sequence is immediately followed by the marker of opposition
mais ‘but’ (twice) with other visual/gestural forms that indicate interpersonal distance and disapproval, as C2 complements her answer and provides a counter-argument. The visual modality thus clearly indicates from the start of the turn that this is at best a partial, temporary agreement, thereby transforming the meaning of
oui. This somewhat differs from the sequence in (8), where another attitudinal or evidential dimension is at play.
(8) | D1 le type de Les Choristes Jean-Baptiste Meunier au *restaurant* |
| D2 | ((nod)) | oui oui oui |
D2 | ((eyes wide open)) ah il était au *restaurant*? |
| D1 ((head move forward)) mais oui mais mais t’étais là quand non t’étais pas là ? |
| D2 | non | |
| D2 non j’étais pas là | | |
| D1 the guy from Les Choristes Jean-Baptiste Meunier at the *restaurant* |
| D2 | ((nod)) | yes yes yes |
| D2 ((eyes wide open)) ah he was at the *restaurant* ? |
| D1 ((head move forward)) well yes but but you were there when no you weren’t there ? |
| D2 | | no |
| D2 no I wasn’t there | | |
In (8), D2 is telling an anecdote that happened at a restaurant (the name has been anonymized), where she saw an actor. D2 believes that D1 was there that time. She first helps D2 identify that actor, which is successful (cf. «oui oui oui»). However, D2 does not believe that she was there during that anecdote and further asks for confirmation that this actor was indeed seen at that restaurant. D1 then replies with mais oui (literally ‘but yes’), a very common answer in conversational French which can take multiple argumentative values. In this case, the head moving forward indicates that mais oui reacts to the relevance of D2’s question. It should be obvious to D2 that the actor was there because, according to D1, D2 was also there that time and saw him, too (a claim that D2 denies in her next turn). The supposedly obvious character of the answer, as well as some attitude of judgment or disbelief (‘you should know this, how come you don’t’) is conveyed by the visual modality. It is thus the combination of mais and the head move that gives oui its negative value, while the visual-gestural marker alone is responsible for this additional attitude of interpersonal distance.
Both (7) and (8) thus illustrate the pragmatic versatility of
oui in conversation, with clear negative values conveyed by head movements. In fact, these examples are two of the only three instances of
oui which were coded as negative in our data, and all of them are multimodal (the third one is combined with a pout), which confirms that it is the visual modality that enables this shift of semantic value. Therefore, we would argue that multimodal combinations such as (7) or (8) are full-fledged multimodal pragmatic markers, considering that the meaning of the combination is not the same as the meaning of its individual components. This is the criterion that
Cuenca and Crible (
2019) used to define “compound discourse markers”. Originally proposed for verbal markers only, this notion covers discourse (or pragmatic) markers “that can occur independently but, when combined, they jointly act as a single marker and their individual meaning cannot be disentangled” (p. 172). The authors identify
and then,
but I mean, or
now then as examples of compound discourse markers. We would, therefore, propose that [
oui + head movement] is a multimodal compound pragmatic marker since the visual modality alters the meaning of the verbal marker.
Cuenca and Crible (
2019) further argue that some co-occurring markers, primarily
and then, can either act as a compound discourse marker or simply as single markers that happen to co-occur while retaining their individual meaning, depending on context. This could also be a possibility for multimodal sequences, although in our data, the three [
oui + head movement] sequences are all coded as negative and do not display this contextual variation. By contrast, it should be noted that the multimodal gestalt [
oui + nod] discussed above does not qualify as a compound pragmatic marker, since the individual meanings are retained. In these sequences, both the verbal and the visual marker express a similar meaning of positive alignment or affiliation, one modality reinforcing the other. They would thus qualify as two co-occurring (but relatively independent) pragmatic markers, one from each modality.
The same can be said of another recurrent combination of semantically aligned forms, namely [
voilà + pointing gesture]. Manual gestures are strikingly rare in feedback sequences, with only 11 cases; 5 of them correspond to pointing, of which 3 co-occur with the pragmatic marker
voilà ‘that’s it’. Both the verbal and gestural forms share an indexical semantic meaning, which is put to use in feedback sequences to express alignment. The three examples from our dataset are given below. In (9), the two participants are talking about their assignment and realize that they study a similar topic but about different genders. In (10), A1 is describing a board game that involves archeologists, but cannot remember the word for it, while in (11), the two participants are talking about a place where D1 saw celebrities but also cannot remember their names.
(9) | C2 j’ai le même sujet mais moi c’e:est suivantes et maîtresses |
| C1 ((raised eyebrows)) a:ah d’accord ah oui |
| C2 ouais c’e[:est le personnage féminin] |
| ((points back and forth towards herself and C2)) |
| C1 | [c’est c’est le:e oui voilà Figure 2] alors que moi c’est le personnage masculin |
| | ((points towards C1 with index finger)) |
| C2 I have the same topic but for me it’s maids and mistresses |
| C1 ((raised eyebrows)) a:ah okay ah yes |
| C2 yeah it’s the[:e female character |
| ((points back and forth towards herself and C2)) |
| C1 | [it’s it’s the yes that’s it whereas for me it’s the male character |
| | ((points towards C1 with index finger)) |
(10) | A1 comment ça s’appelle euh qui font des fouilles là? |
| A2 (0.561) un archéologue |
| A1 un archéologue voilà |
((smiles and points towards A2 with palm oriented side ways, Figure 3)) |
| A2 ((nods)) |
| A1 how do you call those who do excavations? |
| A2 an archeologist |
| A1 an archeologist that’s it |
((smiles and points towards A2 with palm oriented side ways, Figure 3)) |
Figure 3.
Example 10, interactive pointing gesture (side palm) co-occurring with voilà.
Figure 3.
Example 10, interactive pointing gesture (side palm) co-occurring with voilà.
(11) | D1 c’est qui les jumeaux ? ((frowns)) |
| D2 tu sais les jumeaux horribles là pleins de chirurgie esthétique ((brings her hands to her face)) |
| D1 les frères Bogdanoff? |
| D2 oui voilà et eux ils étaient au *restaurant* |
((points towards D1, Figure 4)) |
| ((nods and points towards D2 with her little finger)) |
| D1 who are the twins? ((frowns)) |
| D2 you know the horrible twins full of plastic surgery ((brings her hands to her face)) |
| D1 the Bogdanoff brothers? |
| D2 yes that’s it and they were at the *restaurant* |
((points towards D1, Figure 4)) |
Figure 4.
Example 11, interactive pointing gesture (little finger) co-occurring with voilà.
Figure 4.
Example 11, interactive pointing gesture (little finger) co-occurring with voilà.
In each example, an interactive hand gesture used for pointing (in various forms, using the index finger, the little finger, or the palm) is used in the context of feedback and co-occurs with the marker
voilà. All three examples share a similar context, with the multimodal combination signaling the achievement of common ground after a brief period of uncertainty or misalignment. As for [
oui + nod], the semantic charge of the two modalities in [
voilà + pointing gesture] patterns is similar and the two forms reinforce each other, thus qualifying as a multimodal gestalt (functioning together with a stable pragmatic meaning) but not as a compound pragmatic marker in the sense of
Cuenca and Crible (
2019), since their individual meaning is retained. The difference between these two multimodal combinations is that the latter is infrequent and thus does not meet either criterion (high frequency and/or non-literal meaning) for pragmatic marker status. We would therefore safely refer to [
voilà + pointing gesture] as a multimodal gestalt until more data can confirm whether this pattern is frequent and widespread enough.
In sum, not all multimodal gestalts are the same. Some can be semantically co-oriented and either highly frequent [oui + nod] or infrequent but consistent [voilà + pointing gesture], while others (less frequent) have opposed individual meanings that are blended into a single new function in context [oui + head movement]. As already noted above, an important caveat to this conclusion is that we only analyzed these forms in the context of feedback sequences, which means that patterns that express a stable function in our limited dataset may very well display a more varied spectrum of uses in other contexts or conversation settings. It is, therefore, an obvious avenue of research to further explore these multimodal combinations outside feedback sequences and in different genres in order to draw more robust conclusions about potential multimodal pragmatic markers.