As is evident from the outline of spoken and written language production above, there are many similarities between the two modalities: both require that speakers/writers plan the content and form of the utterance, execute this as a linguistic expression, and evaluate the result against the plan. However, due to inherent differences between the conditions for the modalities, the signs of increased cognitive load will be manifested differently.
The Results section illustrates various incidents deemed suitable for exploring the distribution of cognitive load throughout narrative accounts and outlines methodological issues concerning the definitions and choices of measures used to capture cognitive load in previous language production research. The presentation of results is structured as follows: first, an examination of meaningful measurements for text length; second, an exploration of the definition and analysis of pauses; third, an investigation into aspects of revisions; and fourth, a proposition suggesting that so-called fluency measures, which integrate all these aspects, may effectively discern segments of increased cognitive load during language production. What is discussed here are thus data from spoken real-time discourse, captured by detailed transcripts, and linear written texts that illustrate the step-by-step actions that writers engage in during text production. However, this article is not concerned with measures for exploring static, final texts. All measures are exemplified and related through excerpts from Alfa and Bravo’s accounts.
3.1. Measuring Text Length in Spoken and Written Text Production
Previous studies of truthful and deceitful accounts have highlighted the importance of considering the text length (
Knapp et al. 1974;
Newman et al. 2003;
Derrick et al. 2013), and linguistic comparison of text length in different speakers/writers further emphasizes that both individual differences (for example, age, education, and linguistic proficiency), genre differences, and spoken and written differences influence the text length (cf.
Biber 1988;
Johansson 2009). Following this, a suitable starting point in describing cognitive load during text production, independent of modality, is to estimate the amount of language production, i.e., the quantity produced by speakers/writers. There are several reasons for this. One assumption posits that the ability to produce longer texts may reflect a less cognitively demanding production process, indicating reduced cognitive load. Another assumption suggests that longer texts may indicate more changes, additions, and explanations, potentially enhancing the credibility of a fabricated narrative (cf.
Undeutsch 1989). Measuring text length also serves as a foundational baseline metric for other relevant measures (such as pauses and text changes; see below). Thus, measuring text length will generate an overview of how easy it may have been to re-tell the events of the narrative and will also perhaps render a rough estimation of how elaborate the story is.
The most commonly proposed unit for measuring text length in studies including both spoken and written data has been the
word, which has also been one of the most important units for measuring text length in deception studies (e.g., see
Vrij et al. 2008;
Colwell et al. 2007;
Derrick et al. 2013). This includes studies illustrating linguistic development (e.g.,
Berman and Verhoeven 2002) or broad approaches to genre differences (
Biber 1988). Importantly, these studies have compared final written texts to transcripts of speaking, i.e., a product to a process (although the transcriptions in these studies can vary according to the degree to which they account for e.g., pauses and repetitions). The reason for this choice is that writing processes have rarely been captured, making studies with process-level descriptions of spoken and written discourse sparse.
Text length in spoken and written discourse has also been compared based on syntax (that is, clauses, sentences, and sentence-like structures). In such comparisons, one major challenge has been that spoken language often lacks the written-like type of grammatical sentences. Therefore, a measure called the T-unit (Terminal Unit;
Hunt 1970), which is defined as “[o]ne main clause plus any subordinate clause or non-clausal structure that is attached to or embedded in it,” was introduced. It is a syntactic entity, and it is roughly equivalent to a “sentence” in written language. A T-unit is not only defined regarding its syntactic information, but it is also possible to use clues from intonation and discourse/thematic content. As such, it has proven useful for describing and understanding syntactic structure and grammatical development in speech and then comparing it to writing (see
Berman and Verhoeven 2002;
Johansson 2009;
Scott 1988). An addition to the T-unit is the C-unit (
Communication Unit), proposed by
Loban (
1976), which allows for utterances without clausal structure to be organized syntactically (see
Johansson 2009, p. 93 for an expanded discussion). However, while these measures have been used to compare spoken and written language, the comparison is based on
dynamic speech and the final products of writing—thus, these measures share the same problems as using the word as a measure. However, there have been attempts to apply T-units to the dynamic linear files from keystroke logging. One example is found in
Bowen (
2019), where some actions of revision were removed from the keystroke logging files to better fit the T-unit analysis. This may illustrate the difficulties in applying T-units to linear files. Finally, while the T-unit/C-unit has much in common with “the sentence”, it is, in many cases, not equivalent to a graphical sentence that starts with a capital letter and ends with a full stop. The reason for this is that a written, graphical sentence may consist of several main clauses. Thus, investigations using T-units require manual coding of the data. Although it currently seems challenging to apply the notion of T-unit or C-unit to the real-time written data collected by keystroke logging due to the manual work that is needed and the difficulty in identifying T-units in linear text files, it may, for some purposes, be fruitful to explore this measure in the future if one is looking for a rewarding way to compare spoken and written real-time data on a syntactic level. To summarize, while comparisons of text length from a syntactic perspective may prove rewarding, especially through the use of T-units, this is mainly a measure that is suitable for exploring final written texts.
From the discussion above, we can conclude that, in studying real-time data of spoken and written processes, it is complicated to use measures that are adapted for examining final texts. Let us, however, explore the options to use “the word” as a unit a bit further. The traditional definition of a word is a string of letters surrounded by spaces in printing, corresponding to a distinct unit with meaning; however, this definition is difficult to apply to multi-word units, such as “in spite of”. For example, in
Table 3, the sample of the spoken truthful account of Alfa in the left column includes several occasions of filled pauses (
&-eh). This raises the question of whether a filled pause should be included in a word count. Another question concerns the instances when Alfa rephrases the information about a bag that is turned over, as in “och välter ner (.) &-eh den hära tjejen som satt på en stol hennes väska &-eh välter omkull den” (
turns over (.) &-eh this girl who sat on a chair her bag &-eh turns it over). Should both versions of how the bag was overturned be included in the word count? The important point is that there is no correct answer here; instead, the choice of inclusion or exclusion depends on the purpose of the study.
Regarding the sample of a written truthful account from Bravo, new challenges arise (see the bolded fragments in
Table 3). Corrections of misspellings here result in word fragments. For instance, the word “välde” (
turnet, a misspelled version of
turned) involves three presses on the backspace to delete one space and the two last letters. The correct letters “te” are then immediately added to create “välte” (
turned). How should we calculate words in this context? Should “välde” be considered one word and “te” another, should we treat “välde” and “välte” as two separate words, or should all be counted as one final, correct word, “välte”? From the perspective of studying deceitful narratives, one could argue that the correction of misspelled words is uninteresting, as such corrections merely contribute to the surface level of the final text. However, it may influence the cognitive load in a way that writers focusing on low-level orthographic processes have fewer cognitive resources available for other activities. With this argumentation, it is essential to establish a method for including word fragments and alternative spelling varieties as they contribute to the understanding of how the writers’ resources were distributed during their writing.
In summary, spoken texts include word repetitions, and a decision to count each repetition of the same word (or phrase, or part of the phrase) leads to the risk of obscuring parts of the process only once. Issues also arise regarding whether “filled pauses”, (e.g., um, eh) should be counted as words or not. Yet another issue is word fragments, which occasionally occur in speaking, that is, the instantiation of words where only a few speech sounds are included and when it is sometimes difficult to guess which word was intended.
In writing, on the other hand, fragments are frequent, and just as in speaking, the mapping of the fragments into words can pose challenges. The fragments may comprise just one or two letters, where it is impossible to comprehend whether they are signs of a false start or mistyping. However, often the fragments present alternatively spelled versions of the same word, and it is common for only a portion of the word to be deleted and rewritten. Should these instances be calculated as one word or two words (or more)? Furthermore, it is typical for writing to encounter letter combinations that arise accidentally or erroneously by pressing the wrong key. Finally, in writing, it is not uncommon for entire clauses, sentences, or paragraphs to be deleted, rewritten, or pasted and moved around within the text. Just as in spoken language, a decision must be made regarding whether to count the words and phrases once or multiple times.
When comparing the text length of written texts, for instance, across genres or between or within subjects, a proposition is to use the number of written characters (i.e., letters, numbers, punctuations, and spaces in writing) in the linear text (see example in
Johansson 2009). Such an approach would account for fragments, words, phrases, and other parts of the texts, which may have been deleted and are thus invisible in the final text but which are nevertheless part of the work that writers have put into producing a text.
Spoken texts do not offer the same possibility to easily capture all phonemes. However, with a carefully conducted transcription of the spoken process, the result will not only be a written, linear reflection of the spoken process, but it can also serve to capture text length as the number of written characters in the transcriptions. Although this is not the same as a phonetic transcription that encompasses all phonemes, this approach will serve the purpose of estimating text length in relation to cognitive load and, importantly, enable a rough estimation of how truthful and deceitful accounts compare within and across modalities.
Table 4 illustrates the outcome of the different ways of measuring text length: number of words and number of characters. In this example, the computations are conducted using the excerpts in
Table 3. Here, it is evident that variations exist in the word count of the written linear texts that are contingent upon the inclusion or exclusion of word fragments.
Hence, our optimal recommendation for comprehensively understanding the text length of spoken and written accounts while concurrently capturing the expression, repetition, or reformulation of small units is by measuring the number of characters in writing and the number of characters in the transcription of spoken accounts. In doing so, we propose that, for some purposes, it may be suitable to include filled pauses in speech to account for the fact the speaker is uttering something, in contrast to being silent. In our methodological approach, a filled pause denoted as &-eh in the transcription would be computed as two characters (eh). Silent pauses in speaking would be excluded from the calculation, just like the pauses in writing.
Table 5 provides an overview of the number of characters, with filled pauses in speaking being annotated and counted as two characters (i.e.,
eh; see
Section 2.2 for an elaboration of the transcription decisions). See the
Supplementary Materials for the complete accounts for both Alfa and Bravo.
3.2. Defining and Measuring Pauses in Spoken and Written Text Production
Regarding the spoken account in
Table 6, numerous examples of silent (.) and filled (
&-eh) pauses are discernable in the transcription. Similarly, in the written account, examples of pauses (<2.809>) are evident in the linear representation of the writing process. As detailed earlier, pauses during language production are closely linked to moments of increased cognitive load. Consequently, the identification of pauses and their location and duration are highly relevant for our purposes.
So, what is a pause? Pausing during writing (on a keyboard) is often defined as inactivity between two keypresses. However, it is debatable how long the duration of the inactivity should be for it to count as a pause. Technically, pauses can be defined as short as the hardware accounts for, that is, there is generally a latency between the pressing of a key until it is registered. In writing theories (e.g.,
Flower and Hayes 1981), longer pauses are highly associated with high-level processes such as planning processes (see
Torrance 2016;
Torrance et al. 2016), while shorter pauses, to a greater extent, have been seen as indicative of low-level processes related to transcription, orthography, and spelling (
Wengelin 2006). The literature does not propose strict cut-off points when a pause is long or short, and instead, this must be seen as relative given the particular task or circumstance. However, in general, writing researchers adopt ad hoc pause criteria based on the purpose of the study. Often pauses of 2 s and longer have been proposed as indicating high-level processes (
Wengelin 2006), while shorter pauses have been associated with the low-level processes.
The variation of pauses between subjects has been acknowledged in many studies (see
Spelman Miller 2006b;
Lindgren et al. 2019) and has been explained through background factors (such as writing in first or second language, education, and practice in writing (including handwriting and/or typing skills), age, linguistic development, and reading and writing difficulties due to dyslexia or aphasia), and contextual factors (topic and genre knowledge, audience awareness, or occasional disturbance in the surroundings). Further attempts have been made to propose methods for establishing individual pause criteria, which would allow for a more reliable comparison between subjects. Proposals include correlating the individual pausing behavior to writing speed (
Chenu et al. 2014), or relating to the dynamic variation of keypresses across a writing session (
Olive 2014). Thus, writing studies examining pauses must establish their own ad hoc criteria for how to define a pause and how to discriminate between long and short pauses if that is relevant given the research questions, and they must use an experimental design that controls for within- and between-subject factors. To facilitate the analysis of pauses, we have used the ad hoc criteria of 1 s. This will allow us to capture pauses on a relative micro-level but avoid having to address pauses that may be primarily related to transcription skills (see
Wengelin 2006).
Regarding speaking, the definition of a pause is equally tricky, not the least the issue of individual variation that applies to this modality as well. When (silent) pauses are investigated in speech, there has been a general acknowledgement that the definition of a pause must be related to individual speaking rates. However, the speaking rate may vary between sessions and within the same session. A common solution in the CA transcriptions is to include so-called perceived pauses (
Sacks et al. 1974), which is how we operationalize it (while only including pauses with a minimal length of 200 ms).
In addition, a decision on whether to treat filled and silent pauses equally or not must be made. Do filled pauses (
eh,
um, etc.) serve the same purpose as silent pauses? Studies on conversation show that filled pauses are communicative and can help the speaker keep their turn, whereas silent pauses are not necessarily so (
Clark 1996). However, while our data contain spoken
monologues where “keeping the floor” should not be an issue for the speakers, there are still ample examples of filled pauses in the data. For instance, in
Table 4, it is shown that, of a total of 13 pauses in the spoken sample, 8 are filled. Occurrences of filled pauses in monologues should perhaps be interpreted along the lines that speakers have incorporated filled pauses as one of several planning strategies during speaking and that it is difficult to abandon this habit when invited to speak uninterrupted.
To sum up, pauses are seen as important indicators of speakers’ and writers’ increased cognitive load during language production. However, it can be difficult to define a pause; previous research has established a rough standard for the respective modality, and we have employed these standards in our analysis. Since there may be different preferences for using filled or silent pauses, which may vary between and within different accounts of speakers, we measured the number of silent pauses as well as the number of filled pauses. In addition, when appropriate, we advocate a measure where both types are included—for instance, to illustrate the overall number of pauses.
Once we have established the definition of pauses in each modality, we turn to measuring
the number of pauses. One can assume (based on, e.g., findings from
Goldman Eisler 1968;
Heldner and Edlund 2010) that frequent pausing would be an indication of instances where the cognitive load is increased and where the speakers/writers need extra time to think about the linguistic expression. With our definition of pauses, it is relatively easy to calculate the number of pauses in a written or spoken account from the linear files in writing or the transcription of the spoken accounts. Since the text length and/or the amount of time dedicated to the accounts will differ, the number of pauses must be calculated relative to text length and/or writing/speaking time.
Further,
pause duration has been proposed as an important indicator of cognitive load, and longer pauses are often found preceding more linguistically complex constructions, e.g., subordinated clauses or complicated noun phrases in both speaking (
Goldman Eisler 1968) and writing (
Nottbusch 2010). While there are tools that can identify silences, these will be rendered useless if there is any kind of background noise in the audio file and filled pauses will also not be captured with these tools. Thus, in speaking, the calculation of pause length will need manual attention and consequently be very time-consuming. Written data, collected through keystroke logging, will have various options for calculating pause length readily and will be automatically accessible (
Leijten and Van Waes 2013).
Pause location is a final component that is likely to be relevant for addressing cognitive load during language production. Pause location in connection with specific syntactic constructions, or semantic information may reflect, on the one hand, difficulties in structuring the message, or, on the other hand, difficulties with finding lexical expressions that reflect what one needs to say (
Matsuhashi 1981;
Spelman Miller 2006a). These types of investigations may be rewarding in establishing which segments are particularly challenging for speakers/writers from a forensic perspective. Pause location can, to a certain degree, be annotated in the transcriptions with the use of speech technology tools, such as linguistic parsers that indicate parts of speech (there are a few parsers trained for Swedish, e.g.,
Qi et al. (
2020), which could aid in this). However, one must expect that substantial manual handling is needed, not the least since transcriptions with pauses and repetitions will make automatic analyses difficult.
3.3. Revisions and Reformulations in Spoken and Written Language Production
This section addresses, on the one hand, revisions in writing and how they may be expressed and studied, and, on the other hand, how reformulations and repetitions can be studied in speaking. We treat all these aspects as manifestations of changes in the linguistic message. According to the models of writing and speaking, such changes would occur by monitoring what has been previously produced and will happen if the speakers or writers after such an evaluation conclude that the previous text needs to be modified.
We start by outlining how the concept of revision has been described in writing. Its complexity is discussed in a seminal article by
Faigley and Witte (
1981), where they make the point that revision should not only be viewed as tidying up the text after the first draft. Instead, there is substantial evidence of it being a complex process that writers engage in, concurrent with planning new content and generating text. Therefore, reading or monitoring the text written so far is an important component (see
Johansson et al. 2010; see
Wengelin et al. 2023). Changes in the written text can be made at any point during the composition of text: before the text has been transcribed, at the point of inscription (i.e., at the end, or
the leading edge, of the text being produced), or at a previous point in the text (cf.
Fitzgerald 1987;
Lindgren et al. 2019).
There are undoubtedly revision processes of different kinds, and from a processing point of view, there can be so-called
internal revisions, also referred to as pre-linguistic and pre-textual revisions (see
Murray 1978), which will occur mentally and never be manifested or overtly expressed.
External revisions, on the other hand, can be made at the point of inscription or in the previous text. Revisions can further be classified as
surface revisions, i.e., language revisions (associated with formal changes), or
deep revisions, i.e., content revisions (associated with semantic information) (
Chanquoy 2009;
Stevenson et al. 2006). The concept of internal revision can further be compared to the idea of text generation as part of the planning process in the model of
Flower and Hayes (
1981). It is important for the purpose of our exploration that some revision processes may not be overtly visible in the written data, but instead, to a certain extent, incorporated in pauses where the writer is planning the linguistic expression and trying out and rejecting possible solutions before settling on one decision. Existing literature provides many examples of different taxonomies for categorizing the types of revisions occurring in writing, where adding, deleting, and substituting content are the most agreed upon (for some examples, see
Johansson et al. 2023).
Just like other linguistic processes, the acts of revision will fluctuate depending on the context, the task at hand, and the background of the writer. Here, age, education, linguistic proficiency (writing in the first or second language and grammatical and lexical knowledge), writing proficiency, knowledge of the topic and genre, and writing mode (for example, typing and handwriting) will influence how, what, and when revisions occur (for overviews, see
Chanquoy 2009;
Lindgren 2005).
Table 6 provides examples of Bravo’s revisions in the deceitful written account, mostly consisting of surface revisions at the leading edge, where typos (e.g., errors occurring due to the pressing of the wrong key and not because of ignorance of orthography) are corrected. The erroneous ‘d’ at the end of
mod is changed to
mot (‘towards’); the initial letter combination
std is immediately corrected and the word
studenterna (‘the students’) is written; the misspelled word
tröa is at once corrected to
tröja (‘sweater’). One change can be categorized as a content revision, where
flick (orna) (‘young girls’) is changed mid-word to the (near) synonym
tjejerna (‘girls’). Similar surface revisions at the leading edge are found in
Table 3, in the linear text of Bravo’s truthful written account. Here, we see no examples that can be categorized as content revisions. The examples of revision in these excerpts thus show how the writers immediately tend to surface revisions (note that there are no pauses between the deleted written text and the use of backspace and the added written text), which suggests a constant monitoring of what is being written. We have also seen examples of content or semantic revision, where another lexical choice for “the girls” was made.
The linear files of the writing sessions further allow for the study of other types of revision behavior: using the arrow keys or mouse to move around in the text. Such movements may or may not be followed by a backspace (for deleting text) or the addition of text to previously written parts. Writers can also highlight parts of the text by using the mouse and click and drag functions, or by using combinations of keys (shift + alt and arrow keys). Once highlighted, the text can be deleted, moved, copied and pasted, or overwritten if writers type over the highlighted text with new text. Consequently, a lot of text can be deleted or moved with very few keypresses or mouse movements. Therefore, it can be relevant to account for the number of
editing operations that take place independent of how much text is being removed or added in each operation. These types of editing operations are unique for keyboard writing, but the same concept can be adapted for speaking if reformulations or self-repairs are included in calculations. In
Table 5, the total number of editing operations for the complete sample files (found among the
Supplementary Materials) is included. The numbers in the table further illustrate how common editing operations are in writing, compared to reformulations in speaking.
The full writing session of the truthful account by Bravo can provide an illustration of what it can look like when a revision is made away from the leading edge, in the previously written text. By the end of the final text of Bravo’s truthful account (see the
Supplementary Materials), she uses the mouse to move the cursor to a spot preceding the last written sentence. There, she adds a sentence.
Table 1 shows the linear file of this sequence, where the indication of <MOUSECLICK> is seen on the last line, followed by a pause of 2.931 s, and then, the sentence fragment that was added (in boldface in
Table 1): “Efter det gick ut därifrån och” (
After that went out and). Note that this sentence lacks a subject, possibly the pronoun “she”, and that the last word “och” was immediately deleted. An illustration of what it looks like is found in
Figure 1, where we see two screenshots from the real-time replay of the writing session: the first one just before the mouse click and the second one immediately after the first word of the new sentence has been written (“Efter”,
After). In the figure, the red circles show the placement of the cursor in the two examples.
As mentioned above, revisions can occur far from the inscription point, that is, when the writer uses the mouse or the arrow keys to move the cursor away from the inscription point to change something that has already been written (
Lindgren et al. 2019). This means that writers can add, delete, or change the previously written text at any point during the writing session anywhere in the text. For example, a writer may add an initial paragraph of the text as the last part of the writing process, change a description of a protagonist, or delete a chain of events. In the final text, there will be no trace of this (see
Wengelin et al. 2023 for examples of this execution in advanced writers). However, examining when writers decide to make changes in their previous written texts offers new perspectives for the understanding of how the message is constructed and can give insights into how deception is built. One example of what revisions may look like when they occur away from the leading edge is shown in
Figure 1, where the writer Bravo has finished a sentence (“Den yngre tjejen såg detta och gick snabbt iväg från cafét”
The younger girl saw this and quickly departed from the café) and then, she moves the cursor to before this sentence to add a new sentence (“Efter det gick [de] ut därifrån”
After that they left.) These kinds of revisions do not have an obvious equivalence in speaking, which probably can be attributed to changes in speech due to the necessity for immediate changes—using the terminology from revision in writing, one can say that changes during speaking will occur in a linear fashion and always at the leading edge. It is undoubtedly an option for speakers to address something that was said further back in the spoken message, and draw the attention to what they need to change or add information to what was previously stated. However, speakers can never “move away” from the leading edge of their spoken account. The phenomena of “revision” in speech are typically referred to as
disfluencies in the literature (see
Clark and Wasow 1998). This term covers filled and silent pauses, prolongations, repetitions of words and utterances, as well as reformulations. The psycholinguistic view on disfluencies is expressed in
Goldman Eisler’s (
1968) seminal work that connects increased number and duration of pauses and other signs of disfluencies with increased linguistic complexity (especially regarding syntactic complexity at the clause or phrase level). Similar views are shared by
Clark (
1996) and
Levelt (
1989) (see also
Eklund 2004 for an overview, with a phonetic focus on disfluencies in speech). Here, we will mainly be concerned with repetitions and reformulations since they, just like revisions in writing, serve the purpose of being overt changes to the linguistic message.
A common example of disfluencies in speaking is to repeat one or more words occurring at the start of a clause verbatim, a strategy that is often associated with planning (
Clark and Wasow 1998).
Table 2 shows an illustration of this from Alfa’s spoken truthful account (verbatim repetition in boldface). She says ”när mannen är och &-eh plockar på ett annat bord så går hon fram å häller ner peppar <
i hans> [/] &-eh
i hans mugg” (
when the guy is and &-eh picking at another table then she walks up and pours pepper <in his> [/] &-eh in his mug). Here, the repetition (
in his) occurs at the end of the clause, where it precedes the noun “mug”. Note that in connection with the repetition, we also find a filled pause (
&-eh).
Table 6 illustrates parts of the spoken deceitful account from Alfa, which contains numerous examples of reformulations. She says “personen till höger” (
the person on the right), which is followed by a silent pause, a filled pause (
&-eh), and another silent pause. She then says “den här personen till höger vill” (
this person on the right wants), and then, the last verb (“vill”) is changed to “försöker” (
tries). Thus, taken together, this is a sequence of self-repair consisting of a series of reformulations of what is pretty much the same content. Just a little bit further on, Alfa has another sequence of reformulations: “tanken är att” (
the idea is to), which is followed by a pause and the fragment “eller hon” (
or she), which, again, is abandoned for the clause “man ser att hon försöker” (
one sees that she tries). First, these kinds of sequences of reformulations are particularly interesting to study because they highlight a circumstance or event that the participant finds difficult to express in words. Second, they constitute a noteworthy example of how the strategy of “talking around” a subject allows more time to think while at the same time ensuring no interruptions from listeners—a purpose often attributed to filled pauses. Finally, this example illustrates a sequence of (extensive) consecutive revisions. For our purposes, such sequences are intriguing in both modalities as they have the potential to reveal particularly challenging portions of the narrative accounts.
These instances of verbatim repetition and consecutive reformulations during speech demonstrate that speakers often make repeated attempts to find the right expression with the rephrasing frequently involving multiple words. Notably, the observed changes in our examples appear to be more closely associated with linguistic content, specifically lexical choices, rather than linguistic form.
For our objectives, it is pertinent to explore methods of quantifying revisions, repetitions, and reformulations as a cumulative display of such occurrences may indicate disturbances in the planning processes due to heightened cognitive load. One approach, used in writing studies employing keystroke logging technology, involves subtracting the final text’s character count from the character count in the linear files (see example in
Gärdenfors and Johansson 2023). This will result in a proportion of the text that was deleted (see
Table 5 for an example from our data). Another option is to calculate how many
editing operations there are (i.e., the number of occasions something was deleted, independent of how much text was deleted each time, cf.
Johansson 2000). In both cases, this will demonstrate a quantitative approach to capturing how frequent revision occurs.
In spoken language, the concept of “deleted text” becomes irrelevant as all utterances, whether rephrased or not, are overtly expressed. However, quantifying and accounting for the number of repetitions and reformulations provides an overview of how frequently speakers rephrase themselves.
An additional potentially valuable approach to investigating changes would be to annotate their location or context and/or categorize the nature of the revision. This could shed light on the causes of increased cognitive load, following the insights of
Goldman Eisler (
1968), and reveal whether the linguistic expression leading up to deceitful information is more prone to revision or if the deceitful information itself is the focus. However, it is important to note that such annotations necessitate manual execution, making it a time-consuming task.
3.4. Fluency and Disfluency in Spoken and Written Language Production
We have touched upon that accumulative signs of cognitive load may be of relevance for our purposes—that is, where pauses and/or changes occur together or within a short time frame. In addressing this issue, we turn to the concept of
fluency–disfluency. For speaking, the concept of fluency has been an important concept for estimating how easily speakers carry out different oral tasks. There are many examples that comprise proficiency in second-language learning (e.g.,
Jong 2016) or fluency in regard to disturbances during speaking, for example, stuttering (e.g.,
Alm 2011). In the study of spontaneous speech (whether from a cognitive approach or CA perspective), it is also contrasted against the notion of disfluency, which would be viewed as unwanted disturbances during speaking (
Clark and Wasow 1998;
Eklund 2004;
Norrby 2014).
In writing, fluency was brought to the forefront by
Chenoweth and Hayes (
2001) as a way to shed light on linguistic proficiency (often from an L2 perspective, see examples in
Manchón and Roca de Larios 2023) and writing competence. Fluency during writing will typically be captured by dividing the number of linguistic units (words or written characters) per time unit (seconds, minutes, or the whole time on task/total writing time) (see
Kaufer et al. 1986 for early examples). From a processing perspective, fluency is often measured through “bursts”, that is, the number of words or the number of typed characters between pauses (
P-bursts) or between revisions (
R-bursts) (see
Alves and Limpo 2015 for a comprehensive overview of bursts in writing). Increased fluency will occur when writers have few and/or very short pauses and few revisions. Keystroke logging software, especially the widely used Inputlog (
Leijten and Van Waes 2013), can provide automatic output with a variety of different bursts and applied pause criteria. Such output can show the mean length of P-burst in a text, or, in other words, the average number of characters that are written between each pause. According to the hypothesis, the P-bursts will be longer if writers produce new text with ease.
Here, we can refer to
Table 4, where the number of pauses in speaking and writing are presented across modalities. The number of pauses is divided by the number of characters. This would be an example of the mean length of a P-burst. For the spoken account, we have included several comparable measures: one measure where characters have been divided by the number of silent pauses (47.4), one that divides them with the number of filled pauses (29.63), and one that includes all pauses (18.23). The different results illustrate that the definition of pauses is important for the outcome. For the written account, we only included one measure: number of characters per pauses longer than 1 s. Note that, with a different pause criterion, the number of characters per pause would also change. Determining which pause criteria to choose and whether or not to include or exclude filled pauses will depend on the research questions that are posed, but it may also be valuable to explore various options before deciding on the definition in a particular study.
The fluency approach thus requires measuring the total time on task in the written and spoken task. At the overall text level, this would mean that, for the written task, the keystroke logging software offers automatic output, while the spoken task requires some, but limited, manual attention. The number of written characters or the number of characters in the transcripts of the spoken accounts can then be divided by the time on task. We advocate using the linear text files for this type of calculation. In examining the results of such calculations, a few effects can occur: if speakers/writers have long and/or many pauses, there will be, on average, fewer characters written per second. However, if speakers/writers engage in many changes (revisions, repetitions, and reformulations), the effect may be more characters written/spoken per second. It may be the case that writers who have longer pauses also revise more, but not necessarily so. Given the previous studies we have repeatedly referred to above, it is evident to expect a fluctuation regarding where pauses and changes occur during the unfolding of both spoken and written language production.
We have already concluded that it is difficult to automatically identify and isolate pauses in speaking due to potential background noise in the recording and the existence of filled pauses; consequently, we have ruled out measuring pause duration in speaking as a cost-effective way to approach our goals. However, to account for the fluctuation in fluency during language production, we suggest another approach, that is, dividing the spoken and written texts into different segments. Given our experimental design, where we have identified which portions of the events in the elicitation films should be altered by the participants in their deceitful accounts, we propose a threefold division: before the lying event, during the lying event, and after the lying event. This will be a way to operationalize the variation in fluency during different sequences of the narrative accounts and serve as an initial but potentially rewarding attempt that can serve our purposes.
Similar approaches were applied to written data in a study by
Johansson (
2009), but in that case, the writing time was divided into five equally long segments and then, the proportion pause time was measured in each segment. The results demonstrated different pause time distributions throughout the writing of narrative and expository genres. For our data, this would be a more time-consuming but perhaps fruitful way to divide the speaking and writing into fixed time segments or 20% divisions of the time on task and then explore the proportion of pause time and/or changes in each segment.
Finally, an initial quantitative and cost-effective approach to identifying sequences with accumulated signs of cognitive load can later be combined with more qualitative inspections and annotations. For our purposes, measuring fluency may thus provide an approach that combines text length, pauses, and changes—all outlined above. The advantage of looking at fluency is that it is a proportional measure and thus more suitable for comparing accounts across participants with different text lengths, and consequently, across modalities and deceitful–truthful conditions.