Introduction
Multisemiotic written texts, composed of multiple semiotic systems (e.g., words, graphs, images, diagrams), are widely employed in academic and professional contexts and especially in cross-disciplinary genres (
Parodi, 2010,
2015). Graph/word texts, for example, have been identified as fundamental written discourse tools in the construction and transmission of specialized knowledge (
Kosslyn, 1994;
Schnotz & Baadte, 2015;
Danielson & Sinatra, 2016;
Parodi & Julio, 2016,
2017). Similarly, economics reports are written using a wide range of diverse semiotic systems, such as verbal and diagrammatic (Parodi, Julio, & Vásquez-Rocca, 2015). Both systems may express causation, which has been defined as the most significant relation in the world (
Mackie, 1974), not only through descriptive textual analyses (
García Izquierdo, 1998; Martín Zorraquino & Portolés, 1999;
Domínguez, 2007; Briz, Pons, & Portolés, 2017;
Loureda & Acín, 2010), but also through psycholinguistic cognitive processing research (Sanders, Spooren, & Noordman, 1992, 1993;
van den Broek, 1990;
Graesser & Bertus, 1998; Otero, León, & Graesser, 2002; Acartürk, Habel, Cagitay, & Alacam, 2007;
Parodi, 2005; Canestrelli, Mak, & Sanders, 2013). Nevertheless, research on causal relations in multisemiotic texts constituted by words and graphs is scarce with only a few exceptions (
Green, 2005;
Habel & Acartürk, 2011). Even more limited is the use of eye tracking measures in the study of reading processing with texts written in Spanish and a varying degree of specialization (
Parodi & Julio, 2016; Julio, Parodi, & Loureda, 2018). As is well known, eye tracking technology allows for the determination of the exact location of the point of gaze of a subject’s eye, which in turn allows researchers to register the intermodal connections between words and graphs in the construction of a mental representation while reading on line in a moment-to-moment processing of written discourse. The current state of knowledge, therefore, leaves important questions unanswered.
In light of these shortcomings, the objective of this study is twofold. Firstly, we aim to observe the main effects of the causal discourse marker (DM) por tanto (therefore) (i.e., presence/absence), and the causal statistical graph (G) (i.e., presence/absence), as well as the interaction effect of both variables on different eye tracking measures (e.g., Total Fixation Time, First and Second Pass Reading Time), when students read a set of causallyrelated texts in Spanish. We expect to observe that the presence of both the DM and the G diminishes reading times, separately. In addition, we expect that the interaction (combined effect) of both variables further decreases reading times. These hypotheses were tested for the verbal system (VS), the causal discourse segment (S1), and the consequence discourse segment (S2).
The second goal of this study is to observe the effects of the DM (i.e., presence/absence) on the same eye tracking measures for the graph system (GS) area of interest (AOI). We expect to observe that the presence of the DM decreases the reading times on the GS AOI.
The article is structured as follows. First, in the theoretical background, we review some relevant distinctions of causal semantic discourse relations when considering multisemiotic texts, discourse markers and graphs. The second section describes the methodology, focusing on the the participants, experimental design, the material and procedures. Section three presents the results. In the final section, we summarize the results and discuss the findings, highlighting some of the problems encountered, as well as possible solutions and avenues for further research.
Discussion
We had expected to observe that the presence of the DM and the G would diminish reading times, separately. Moreover, we had expected that the interaction of both of them would also decrease reading times. These hypotheses were tested for the VS, the S1, and the S2. An additional expectation was that the presence of the DM would decrease reading times on the GS. In general terms, results regarding the first objective showed main effects of DM and of G. However, no interaction effects (i.e., DM x G) were observed. The analyses for the second objective revealed that no significant effects for DM on the GS were observed.
According to the results obtained, no main effects of the DM were observed on the complete verbal system (S1+S2); however, the presence of the connective showed varying influences on the two independent discourse segments constituting the cause-consequence semantic relation. Considering the S1, results show that the presence of the DM slows-down Second Pass Reading Time on the causal segment. More interestingly, the DM showed important effects on the consequence discourse segment (S2) for all the three measures involved in this study. The results revealed that the DM reduces Fixation Time, and First and Second Pass Reading Time on the S2 segment. By contrast, no main effects of the DM were observed on the GS. These findings partially support one of the hypotheses of this study: the DM reduces cognitive efforts while reading causally-related texts by speeding up the time needed to process the consequence segment.
As mentioned, for the causal discourse segment (S1) an effect of the DM in Second Pass was found in the inverse direction than for S2. In fact, the S1 is processed more slowly in presence of the connective
por tanto. Its procedural instruction leads readers to return to the S1 more often than in the implicit condition in order to build a situation model Explaining this result deserves reference to how both S1 and S2 are processed during First Pass reading. No significant effects of the DM were found on S1 in First Pass, which may find a straightforward explanation based on the ‘principle of continuity’ (
Murray, 1997): reading starts at S1, and it is not until readers finish reading it that they start to have expectations about the type of discourse relation that holds between both segments. Also, the significantly lower First Pass Reading Time of S2 in the explicit condition shows that the presence of
por tanto allows readers to acknowledge, from the beginning of the processing, that the S2 following the DM is a consequence that must be reasoned out from the content of S1. Such facilitating effect of the DM seems to be suppressed during the Second Pass of S1, which is longer in the explicit condition. However, the facilitating effect of the DM endures if global processing is considered, as reflects in the fact that no significant differences were observed in the total reading time (Fixation Time) for S1. In other words, the participants of the study are compelled to link S1 and S2 causally, and this produces a slow-down effect of the Second Pass on S1 (cancelled out in Fixation Time), probably due to the readers’ need to re-check the semantic status of S1 as the cause of the discourse relation in order to build the situation model.
For the consequence discourse segment, the global facilitating effect of the DM throughout reading (in First and Second Pass as well as in Fixation Time) may be due to the nature of connection in general and to the structure of forward consecutive relations in particular. Instances of connection need at least two discourse segments for the relation to arise. In the case of forward consecutive relations (a cause is followed by a consequence), the first segment argumentatively points to the second, either based on its lexical content and/or by means of the presence of a forward consecutive connective. In other words, the event expressed in S1 moves readers towards S2, where they will carry out further inferential processes until a complete mental representation is constructed. A procedural instruction like the one coded in por tanto, as a generator of readers’ expectations about ‘what comes next’, facilitates the integration of the argument contained in S2, which, as the findings revealed, is read faster when the DM is provided.
In general terms, in this study
por tanto contributes to clarify the underlying causal relation between the discourse segments by means of its procedural instruction: “process the second discourse segment as a consequence deduced from the events depicted in the first segment” (Martín Zorraquino & Portolés, 1999, p. 4100). Even if
por tanto is not itself the source of the discourse relation – given it does not create it – the DM appears to force the connecting of the segments, “to add some meaning to the reading of the overall fragments” (
Degand, 1998, p. 34). Thus, it makes the relation explicit and conveys a more constrained semantic status to each of the discourse segments that constitute the text, particularly to the S2.
In short, in spite of the fact that in general readers expect two juxtaposed verbal segments to be arranged linearly and to be causally related, in this study, the implicit condition demands, in general, longer processing times compared to the explicit condition. This finding supports the importance of explicit linguistic markers in order to establish discourse relations; particularly, in relation to the facilitating role of causal connectives to construct causal relations, especially when the access to a stored assumption in which to integrate the processed text does not seem to be sufficiently constrained by the propositional content of the discourse segments (McNamara, Kintsch, Songer, & Kintsch, 1996;
Sanders, 2005;
Taboada, 2006,
2009; van Silfhout, Evers-Vermeul & Sanders, 2014, Zunino, Abusamra, & Raiter, 2012;
Zunino, 2017; see also Loureda et al., 2016, and
Nadal & Recio, in press, on Spanish por tanto for different results when participants are confronted with causal relations reflecting assumptions clearly stored in their long-term memory as everyday world knowledge).
In relation to the G, in global terms, that is, considering Fixation Time, the G showed a slow-down effect on the VS. For S1 AOI, no main effects of the G were observed, neither in Fixation Time, nor in First or Second Pass Reading. Regarding the S2 AOI, even if the G reduced First Pass reading, it increased Second Pass, which cancelled out the earlier effect, as can be seen in the significantly longer Fixation Time of S2. The fact that a G effect in the same direction of the VS (slow-down) was found for S2 during Second Pass, allows us to maintain that all the detrimental impacts of the G concentrate particularly on S2.
Contrary to our hypothesis, the results revealed that reading the VS (S1+S2) of the texts including the G required more cognitive efforts and led to a more demanding task. Thus, the presence of a graph does not facilitate the processing of causally-related information; on the contrary, it seems to delay moment-to-moment reading processes, increasing one reading measure (Fixation Time). While the G showed an effect on the VS, the DM did not reveal any main effects.
The discussion so far suggests that constructing a mental coherent representation of multisemiotic texts (+DM+G) is not favored when the text presentation includes, at the same time, synonymous information from two different sources (‘Redundancy Effect’;
Chandler & Sweller, 1991,
1996;
Sweller, 2005; Sweller, Ayres, & Kalyuga, 2011). The finding that the presence of the DM has positive main effects supports, therefore, only partially the general hypothesis of this study: only the procedural meaning of the DM – and not the pictorial visualization of the causal relation by means of a statistical graph– seems to speed up reading times of causally-related texts.
Processing multisemiotic causal texts constituted by a verbal system and a graph system may produce a cognitive overload in working memory and, consequently, may delay processing in semantic memory. This may occur preferably when the information is presented in spatial and temporal contiguity (Mayer, 2014). In this line,
Schnotz’s (
2014) Integrated Model of Text and Picture Comprehension predicts that the combination of text and pictures could also have detrimental effects. This negative effect (‘Redundancy effect’) may occur when information is presented in multiple additional forms or is unnecessarily elaborated. For texts written in Spanish,
Parodi and Julio (
2017) observed better results among university students in writing summaries based on comprehension of economics texts when readers were given a single semiotic system version (only verbal or only graphic) of the Monetary Policy Report genre. When texts present this kind of information, it is possible that readers be affected by the so called ‘Split-Attention Principle’ (
Mayer & Moreno, 1998;
Schnotz, 2005;
Mayer, 2009); therefore, their overloaded attentional resources focus on only one system at a time, and then on the other, not being able to connect them immediately. Using eye tracking techniques,
Parodi and Julio (
2016) also reported findings that support the idea that when presented with verbal+graphic information in economics texts, university students preferred reading in first and for longer period of time the verbal system rather than the graphic. This could imply that integrating information from verbal and graph systems, in some contexts and for some readers, demands more cognitive effort, so readers tend to select and to concentrate preferably on the verbal system due to an overload on the visual working memory (‘Split-Attention Principle’).
However, on the other hand, the dual-coding theory (
Sadoski & Paivio, 2001) assumes that adding pictures to texts always leads to better learning (two codes in memory are better than one). Similarly, Schnotz (2014) also points out that numerous studies have shown that—depending on specific conditions- students usually learn better from words and pictures than from words alone (
Mayer, 1997: ‘Multimedia Effect’). In the same line, Holsanova, Holmqvist and Holmberg (2009) in a naturalistic newspaper eye tracking study, found that an integrated format with spatial contiguity between text and graphics facilitates integration and prolongs reading.
Complementarily, we may advance another possible explanation for the findings. Readers of our experiment have been educated to pay more attention to the verbal system than to any other semiotic system (‘Logocentric Principle’,
Parodi & Julio, 2016). In most cases, they certainly read and comprehend multisemiotic texts (
Parodi & Julio, 2017), but they have been formally cultured to believe that the most important component of a written text is the words. Moreover,
Parodi and Julio (
2016) asked university students in economics and in language studies three questions about perception and preference of relevance of the verbal or graphic information in written texts. More than 70% of the students in each discipline agreed that germane information was found in words rather than in graphs. No statistically differences were observed between disciplinary origin of the university students. For
Radford (
2010), the eyes have been domesticated, that is, culturally educated to read a semiotic system with a specific emphasis. This seems a promising hypothesis that may deserve further research.
Nevertheless, considering the findings of the present study, it is not only the case that readers did not pay attention to the graph at all; in fact, they did read it (see
Table 6). The specific findings regarding the G showed that the VS and, particularly, the consequence segment (S2) were read for a longer time in the presence of the G. This indicates that when reading a text in which different semiotic systems are present, the reader should read them both and then integrate the propositional contents of each system into one coherent mental representation (
Schnotz, 2014;
Mayer, 2005,
2009; Mason, Pluchino, & Tornatora, 2015). This additional process should increase second reading times and the integrative transitions between semiotic systems. In order to find out whether the increase on the reading times on the VS and especially on the S2 was due to an effect of the transitions between S1 or S2 and GS, we conducted further preliminary analyses. To this end, transitions (from and back) were obtained, and a Wilcoxon Signed-Rank test for related samples was conducted to test the hypothesis. Our interim results showed that transitions between the S2 and the GS were higher in number than the transitions between the S1 and the GS (Z = −7.366;
p = .000) (see
Table A1).
These results may suggest that the integration processes of the consequence segment (S2) may occur differently from those of the cause segment (S1), which would offer a preliminary explanation for the longer reading times on the consequence. Furthermore, the provisional data opens the possibility that the integration of both semiotic systems, the graphic and the verbal, could take place with focus on the consequence segment. More analyses would be needed to specify the precise region in the graph from which the information is contrasted with the verbal information. We are certain that these analyses were beyond the scope of the current study, but this preliminary data may inspire hypotheses for future research.
Conclusions
After decades of intensive research primarily focused on the verbal system of written texts, processing multisemiotic discourse is now receiving increasing interest, aimed in particular at the comprehension of texts that include graphs. In this article, we have argued that words and graphs working together may help readers construct a coherent mental representation of the text. Thus, when constructing a complex mental model, readers would benefit from a more informationally dense text containing a DM and a G, expressing an inverse causal objective, content-related semantic relation (
van Dijk & Kintsch, 1983;
Kintsch, 2013;
Sadoski & Paivio, 2001;
Mayer, 2009;
Schnotz & Baadte, 2015). In light of these arguments, our initial hypothesis was that a higher informative text would lead to faster processing of the information expressed verbally and graphically, and that subsequent reading times would then slow down. As discussed above, however, the joint presence of the DM and the G as an integral unity that facilitates discourse processing was supported only partially. Although we found no evidence for interaction effects between the presence/absence of the cause-consequence discourse marker
por tanto and the statistical causal-consequence graph, we observed main effects on the DM and the G, separately.
The effects of the DM are particularly clear and reveal that its instructional value endures until the construction of a coherent mental representation has been completed. The presence of the G, however, slows down the processing of the verbal system, particularly in the consequence segment. This is due to the fact that the G requires to be integrated and this is reflected on the VS.
Complementarily, additional preliminary analyses on integrative transitions between the S1, the S2 and the GS revealed that the cause and the consequence discourse segments were proceeded differently in their interplay with the GS. More interactions were observed between the consequence discourse segment and the graph.
Considering methodological implications, we are aware that the argument over which measure is best to use as an index of cognitive processing partly depends on what is the determined focus of the examination at hand. The currently available measures do not always help the researcher capture or reflect the whole reality involved in cognitive processing, particularly at discourse level. In this vein, the preliminary evidence advanced here, analyzing discourse segments with particular distinctive functional and semantic properties by resorting to different eye tracking measures, helped reveal that the cause segment was processed differently. This suggests that the analysis of global processing indicators is well complemented by precise parameters, which in this study helped disclose effects of combining different semiotic systems in texts, an effect particularly visible on S2.
As well known, research with eye tracking technology for a long time focused mainly on the processing of syllables, words, and isolated short sentences; thus, exploring and defining new related fine-grained measures are challenges for researchers, particularly when studying processing, for example, at multisemiotic discourse level or on global or macro dimension (e.g., Liversedge, Patterson & Pickering, 1998;
Hyönä et al., 2003; Mikkilä-Erdmann, Penttinen, Anto, & Olkinuora, 2007;
Holmqvist et al., 2011).
As for the limitations of this paper, in future research, we should move, on the one hand, beyond synonymic relations between words and graph. A more demanding and probably diverse data could be explored studying multiple relations between systems, such as antonym or more specifically what we could call ‘complementarity’: a multisemiotic relation in which a graph would add crucial exclusive information to the words forcing the reader to critically integrate both representational systems in order to construct a unified coherent mental representation of the whole text. On the other hand, this study focuses in cause-consequence content relations marked by the Spanish connective por tanto in which the related segments present an inverse negative relation. The effects of other types of causal relations and of other connectives on multimodal processing should be investigated in the future, as well as the effects of other types of causal relations and of other connectives, also those that require a different order of the cause and consequence segments.
Despite these limitations, the current research has scientific significance as it suggests the potential of a novel approach to combine words and graphs from a discourseoriented perspective; also, it can be taken as a starting point to further examine, for instance, the theoretical implications of the various formats of multisemiotic text representations and their integration processes. As of our knowledge, even though there exist some advances, there is no up-to-date systematic available description of the words-graphs intermodal connections across disciplinary discourse genres and their respective levels in comprehension, as well as in learning. A more in-depth analysis should take into account the present findings.
After a decade of research on multimodal words/graph text processing and comprehension, we still adhere to
Acartürk et al.’s (
2007:10) words “…contra to the models for eye movement control in reading, there is no model for eye movement control in text-graphics documents, despite their potential for formal descriptions”.
Finally, although our study only conducted preliminary analyses on integrative transitions, it reveals a promising future for this line of research. Investigating on the reading routes of multisemiotic texts and the processes involved in comprehension and discourse integration in more natural settings is a challenging niche. In particular, some questions for future multimodal research are:
Which are the reading routes that lead to integration of specialized disciplinary words/graph texts?
At which exact moments do readers look at the graph?
From which verbal system region do readers move to the graph system?
How often do readers pay attention to the graph compared to the verbal system?
How often do readers switch between the verbal system and the graph?