Introduction
Current theories of text comprehension assume that readers try to form a coherent memory representation of the text they read (e.g.,
Kintsch, 1998). Forming a coherent memory representation of the text requires that the reader is capable of connecting the ideas presented in the text to each other as well as of integrating text information with his or her background knowledge (e.g., Kintsch & van Dijk, 1978; van Dijk, Kintsch & van Dijk, 1983). According to the Lansdscape model (see van den Broek, Young, Tzeng, & Linderholm, 1999; van den Broek, 2010), the memory representation of text reflects the landscape of activations of different concepts during the course of reading: concepts may be activated automatically by the information presented in text (
Myers & O’Brien, 1998) or they could be retrieved from the episodic text representation or long-term memory (
van den Broek et al., 1999;
van den Broek, 2010). Concepts that are simultaneously activated are likely to be connected with each other and to be encoded to memory.
The goal or the task the reader has in mind plays an important role in how a reader processes and learns information presented in texts (Britt, Rouet & Durik, 2017). Goals are formed on the basis of reader’s personal interests and intentions, or they could be induced by instructions given to the reader (Britt et al., 2017; McCrudden & Schraw, 2007; McCrudden, Magliano & Schraw, 2010). The task the reader is trying to accomplish has a big impact on how much effort reader invests in building a coherent memory representation of text (e.g., Lorch & Lorch, 1986; Lorch, Lorch & Mogan, 1987; Lorch & Lorch, 1996; Narvaez, van den Broek & Ruiz, 1999; van den Broek, Lorch, Linderholm & Gustafson, 2001; Lorch, Lorch, Ritchey, McGovern & Coleman, 2001).
The task the reader has in mind defines what information is considered important in text: when readers are given specific instructions for reading, they rate text paragraphs containing task-relevant information as more important than paragraphs containing irrelevant information (e.g.,
Pichert & Anderson, 1977). Text information is then processed in order to meet the reading goal, and relevant information is given priority over information that is not relevant. For example, when readers are presented with a question prior to reading, they will devote more attention to question-relevant than question-irrelevant text contents, and question-relevant information is more likely to be encoded to memory and remembered (
Graesser & Lehman, 2011). Previous eye tracking studies show that readers invest more processing time on relevant than on irrelevant text information, and these effects can be seen during the first-pass reading as well as in later look-backs to relevant text segments (e.g.,
Kaakinen & Hyönä, 2005,
2007,
2008,
2011; Kaakinen, Hyönä & Keenan, 2003). After reading, readers show better recall of task-relevant than irrelevant information (
Baillet & Keenan, 1986; Kaakinen, Hyönä, & Keenan, 2001, 2002). In summary, when readers have a specific reading task in mind, they consider task-relevant text segments as important, and direct processing resources selectively to relevant text information. The resulting memory representation of text reflects the selective attention paid to the relevant text segments: recall is better for task-relevant than irrelevant information.
Prereading questions are an example of a specific reading task that efficiently guides readers’ attention to certain information in text and improves memory for it (e.g., León, Moreno, Escudero et al., 2019; Lewis & Mensink, 2012; Linderholm, Therriault, & Kwon, 2014; Moreno, León, Martín-Arnal & Botella, 2018; Rapp & Mensink, 2011; Rouet & Coutelet, 2008; Vidal-Abarca et al., 2011).
Lewis and Mensink (
2012) conducted two experiments in order to test the benefits of prereading questions for increasing attention to and recall of relevant sentences in texts. Using eye tracking, they found that participants directed additional attention, as reflected in increased first-pass and look-back times, to the relevant sentences in the texts. Participants also produced more information related to the relevant sentences in their recalls if they were exposed to the prereading questions.
In previous studies on task effects, memory for text information has mainly been measured with a recall task, in which participants have been asked to write down as many main points presented in text as they can after reading (e.g.,
Kaakinen & Hyönä, 2005,
2007;
Kaakinen et al., 2003). While free recall reflects the type of information the reader has stored in memory, it is not necessarily the optimal way to measure comprehension (e.g.,
Kintsch et al., 2000). Another, perhaps better way would be to ask readers to summarize the contents of the text (see e.g., Armbruster, Anderson, & Ostertag, 1987; Cordero-Ponce, 2000; Jorge & Kreis 2003; Kirkland & Saunders, 1991; León, Olmos, Escudero, Cañas & Salmerón, 2006; Nelson & Smith, 1992; Taylor, 1983; Thomas & Bridge, 1980; Vadlapudi & Katragadda, 2010; Zipitria, Arruarte, Elorriaga, & Díaz de Llarraza, 2007). When readers summarize a passage, they have to identify and express the core concepts that represent the general themes of the text in a coherent way. A summary task encourages deep understanding of the text because it requires active construction of the meaning as opposed to merely choosing one response from several alternatives (as in multiple-choice questions), answering isolated questions (as in open-ended questions), or simply reproducing information read in text (as in free recall). Synthesis and coherence are two key aspects of a good summary. In the present study, we used an oral summary task in order to measure how readers synthesize and form a coherent representation of the text they read. An oral summary is a concise production about the most important information of the text, and it is a more natural and spontaneous output than a written summary. Written summaries require increased attention to grammatical correctness and writing style, so typically more time is needed to plan and produce a written response. Thereby, producing summaries orally minimizes possible interference from writing-specific requirements (e.g., planning activities, attention to grammatical correctness, writing style), and thus an oral summary can be considered as a purer indicator of the quality of the memory representation than a written summary.
But what makes a good summary? Several models to evaluate the quality of summaries have been proposed. Some authors, such as
Kirkland and Saunders (
1991), suggest that summaries for expository texts should be evaluated on the basis of several criteria: (a) the summary provides a general overview of the text and emphasizes the relations existing between the main ideas, (b) the information given is clarified by secondary ideas, (c) the summarizer makes clear when original text is used, and (d) the summarizer uses his or her own words. In the summary analysis model of
Jorge and Kreis (
2003), the authors identify several parameters to measure the quality of summaries: cohesion and coherence, inclusion of the main ideas contained in the source text, conciseness, information about the source text, and absence of personal opinion.
We propose that the quality of summaries for expository texts can be described with two main dimensions: content and coherence (
León & Escudero, 2015). Content criteria concern the extent to which the summary reflects the essential content of the text, such as whether the summary includes the most task-relevant ideas presented in the text or not. Coherence criteria refer to the connections built between idea units presented in text as well as with reader’s knowledge base, including causal relations between the relevant ideas. These dimensions reflect the extent to which the reader has adopted task-relevant core content of the text, integrated it with his or her prior knowledge, as well as constructed the causal relations between the relevant ideas of the text, including reasons and consequences. All of these aspects should be clear and explicit in a good summary. These criteria have been proved to be useful for assessing reading comprehension (see e.g.,
Armbruster et al., 1987;
Cordero-Ponce, 2000;
Nelson & Smith, 1992;
Thomas & Bridge, 1980). The reliability and validity of a summary test, which is based on scoring the content and coherence of summaries, has been shown to be good (e.g.,
León et al., 2006; León, Escudero, Olmos, Sanz, Dávalos & García, 2009; León, Escudero & Olmos, 2012; León, Olmos, Perry, Jorge-Botana & Escudero, 2013;
León & Escudero, 2015; León, Moreno, Arnal, Escudero & Olmos, 2015). For example,
León et al. (
2015) demonstrated that a summary test (RESUMeV), in which the quality of the summaries is scored for content and coherence, has high reliability (interrater
r’s .69-.97, for four independent raters). León, Escudero, Olmos, Moreno and Martín (2017) showed that the summary task scores correlate highly (
r = .82) with multiple choice test performance, indicating that the summary task is a valid measure of reading comprehension. Additionally, León et al. (2013) analyzed the causal network in a narrative text and compared this to the causal networks generated by the students in their written summaries of the original text. The results showed that more competent readers produced more causal nodes in their summaries and thus got higher scores for causal coherence than less competent readers, which demonstrates the criterion validity of the summary task.
The crucial question is how readers who produce coherent summaries containing relevant information actually process the text information? In order to answer this question, we need to examine the moment-to-moment processes as they occur during reading. Eye tracking is a fruitful methodology for examining the cognitive processes occurring during reading (e.g., Hyönä, Lorch & Rinck, 2003;
Rayner, 1998,
2009), and it can be used to identify reading strategies that underlie successful expository text comprehension (Hyönä, Lorch & Kaakinen, 2002). For example, Hyönä and colleagues (2002) asked college-age students to read two expository texts for comprehension. Readers’ eye movements were recorded during the course of reading, and after reading participants wrote a free recall of the texts they read. The analysis of the eye movement data revealed four different reader groups, who also differed with respect to the comprehension of the text as reflected in the quality of their recall protocols. ‘Topic structure processors’, who were sensitive to the topic structure of the text and made eye fixations back to the subheadings and topic sentences especially from the end of the paragraphs, showed good recall of the text contents. Also ‘Fast linear readers’, who progressed in text relatively quickly and did not make look-back fixations, gained good memory of the text. These results show that the pattern of eye movements is informative about the comprehension processes: topic structure processors invest extra effort in building links between text elements, whereas fast linear readers encode text information to memory relatively effortlessly.
In the present study, we used eye tracking to examine individual differences in how readers manage the demands of a specific reading task – answering a question presented in the beginning of the text – and specifically, whether readers who produce high quality summaries utilize different processing strategies than readers whose summaries are not as comprehensive. We were particularly interested in how readers who provided comprehensive summaries inspected question-relevant and –irrelevant text information.
Individual differences in general cognitive abilities, such as working memory capacity, play a crucial role in how readers process and comprehend text (
Just & Carpenter, 1992). Previous eye movement studies show that there are individual differences in how adult readers inspect and recall task-relevant and irrelevant text information (e.g.,
Kaakinen et al., 2001,
2003). However, these prior studies have not examined differences in the quality of the memory performance, such as both the content and coherence of the recalls. Moreover, prior studies have used written recalls, which may not be optimal for measuring comprehension of text, as discussed above. It is important to understand how different readers perform when given a specific reading task, as it is a way of understanding the performance differences between participants in modern reading assessments such as the OECD-PISA and PIAAC studies (
OECD, 2010,
2016). What the present study adds to the current literature is thus important knowledge about what kind of processing strategies are successful and result in good text comprehension, as measured by an offline summary task.
The texts used in the present study were short expository texts consisting of three paragraphs: an introduction, which also presented a question related to the text’s topic, a paragraph providing information relevant for answering the question presented in the introduction, and a paragraph containing information related to the topic of the text but not relevant to answering the question. For example, one of the texts introduced history related to the pollution of river Thames. In the introduction, general overview about history was given, and in the end of the introduction, a question was presented (“But how did the river become so contaminated?”). In the question-relevant paragraph, reasons for why river Thames got polluted were discussed. In the question-irrelevant paragraph, consequences of the river’s pollution were given. This information, even though it was highly related to the overall topic of the text, was not relevant for answering the question presented in the introduction.
After reading the texts, participants provided oral summaries. Summaries were rated for the content (i.e., whether the summary contained information relevant to answering the question) and coherence, according to the scheme proposed by
León and Escudero (
2015). There was individual variability in the quality of the summaries, and participants were divided into three groups on the basis of the quality of their summaries: low, medium, and high quality summary groups. We then compared the eye movement patterns of the reader groups. Following Hyönä, Lorch & Rinck (2003), we computed measures that reflect the initial processing of the text paragraphs (
first-pass reading time) and the
number of returns and the
duration of look-backs made to the paragraphs from subsequent parts of text, in addition to the total time spent reading the paragraph.
On the basis of previous eye-tracking studies on expository text reading (
Hyönä et al., 2002;
Kaakinen et al., 2002,
2003;
León et al., 2019), we expected that there are consistent individual differences between readers in the processing strategies they use while they are reading. Previous research suggests that some readers who are more sensitive to topic relevant information tend to use a selective reading strategy, in which they devote additional resources to this information and make frequent rereading and looking back to the relevant sections of the text in order to integrate information in memory. On the other hand, some readers are less sensitive to topic relevant information and present a non-selective reading strategy, in which they do not dedicate additional attention to topic relevant information. We expected that readers who produce good quality summaries would present a selective reading strategy, which would be reflected in eye movement records as longer fixation times to question-relevant than to question-irrelevant text segments, already during initial reading of the paragraph. Moreover, we expected that these readers would demonstrate more looking back to the introductory paragraph from the relevant paragraph, as integrating the information presented later in the text with the paragraphs that contains a question should benefit the construction of a coherent memory representation.
Discussion
The present study examined how readers who produce good quality summaries process text information during the course of reading. The results showed that readers who demonstrate good summarization skills show strategic reading behavior (
Hyönä et al., 2002) and utilize a
selective processing strategy (
Kaakinen et al., 2002,
2003; León, Moreno, Escudero, Olmos, Ruiz & Lorch, 2019): they spend more time reading question-relevant than question-irrelevant paragraphs, and this preference is seen already during the first-pass reading of the relevant paragraphs. Readers who produce high quality summaries also do longer look-backs to the relevant paragraphs. Importantly, they also make more frequent and longer lookbacks to the introductory paragraph from the paragraph introducing relevant information than other readers, implying that they engage in building links between the paragraph that includes the question and question-relevant text information. This selective attention to relevant paragraphs and increased looking back to the introductory paragraph is not related to increased recall of text information in general – rather, it is related to increased recall of question-relevant text materials. These results indicate that a successful summary strategy is characterized by increased attention to relevant paragraphs and by coherence-building rereading of the introductory paragraph containing the question.
It should be noted that the differences between the summary groups were observed in the processing of question-relevant text information, not in how readers directed processing time to irrelevant paragraphs. Moreover, readers who produced poorer summaries demonstrated weaker relevance effects overall, and the group producing lowest quality summaries failed to demonstrate sensitivity to relevance during first-pass reading. These findings indicate that readers who demonstrate good summary skills are better in recognizing relevant information, and are aware of and utilize processing strategies that increase its encoding to memory.
In addition, the rating of the quality of the summaries on the basis of their content and coherence (
León & Escudero, 2015;
León et al., 2015) proved to be valid in the sense that it differentiated readers who demonstrated different processing strategies. Readers who produced more elaborated and detailed summaries that included most of the relevant points and the causal connectors between them, also showed sensitivity to relevance and more coherence-building strategies during the course of reading.
Finally, the present study demonstrates the utility of combining online measures of processing, such as eye tracking, with offline measures of comprehension. This kind of methodological triangulation is crucial to understand how the reading processes reflected in eye movements are linked to the memory representation constructed of the text. The present results show that good comprehension of text, as measured by the quality of the oral summary produced after reading, is related to a selective processing strategy during reading. Thus, the present study adds to the current literature important knowledge about what kind of processing strategies are successful in that they result in good text comprehension.
Critical evaluation of the present study
The present results are in line with previous studies showing that task-relevance increases processing effort and subsequent recall of text information (e.g.,
Kaakinen et al., 2003;
León et al., 2019;
Lewis & Mensink, 2012;
McCrudden & Schraw, 2007;
McCrudden et al., 2010;
Graesser & Lehman, 2011), and expand these earlier findings by showing that there are individual differences in how readers make use of this selective attention strategy. Unfortunately, on the basis of the current data, it is impossible to say what the underlying factor for these differences is. In some previous studies readers with high working memory capacity showed greater relevance effects on text recall (
Kaakinen et al., 2001) and in processing (
Kaakinen et al., 2003), suggesting that working memory capacity might be a crucial factor in how efficiently readers can make use of the selective attention strategy. However, these previous studies did not directly examine how the quality of the recall was related to processing, as was done in the present study. Future studies should examine in more detail the factors that influence a reader’s ability to selectively attend to and to build a coherent memory representation of relevant text information.
The eye tracking methodology used in the present study did not allow us to examine processing on a word or sentence level – only paragraph-level measures and transitions between paragraphs could be analyzed. While paragraph level analysis proved to be good enough for understanding what kind of strategies different readers used during the course of reading, more accurate measurements of eye movements would provide valuable information on whether the individual differences emerge already at the levels of word processing or sentence reading. These types of analyses might be helpful in understanding the underlying factors of the individual differences observed in summarization skills.
Conclusions
In line with current theories of text comprehension (
Britt et al., 2017;
van den Broek et al., 1999;
van den Broek, 2010), the present study shows that when readers are given a task, they try to form a coherent memory representation of the text in which task-relevant information is prioritized. This is done by utilizing a selective attention strategy: extra processing effort is directed to task-relevant text information. Readers who demonstrate good summarization skills show increased sensitivity to task-relevance by spending extra time on relevant text information already during first-pass reading, and they also tend to look back from the question-relevant paragraph to the paragraph containing the question, implying that they try to build links between the question and question-relevant text information. We suggest that this coherence-building activity is reflected in the high-quality summaries of these readers.
The present results demonstrate that presenting a question in the beginning of a text helps some readers to build a coherent summary of the text information. However, not all readers are capable of utilizing a selective attention strategy efficiently. These individual differences may be related to general cognitive constraints, such as working memory capacity (
Kaakinen et al., 2001,
2003), or knowledge about efficient processing strategies (
Hyönä et al., 2002; Hyönä & Nurminen, 2003). Future studies should examine in more detail the conditions in which presenting questions in the beginning of text support comprehension, as this would be valuable information for developing efficient educational practices.
Finally, the present results suggest that a summarization task is a useful pedagogical tool, as it is likely to increase elaborative processing and coherence-building activities during the course of reading (e.g., Léon & Escudero, 2015). Moreover, the results show that the quality of the summary reflects the degree to which the reader was engaged with these types of activities. Thus, the summarization task serves as a comprehension-enhancing intervention and the summary itself provides some information about the processes that occurred during the course of reading. In the context of learning, this information could be used to give feedback to the learners about the efficiency of their reading strategies.