Introduction
The reception and appreciation of art crucially depends on the expertise of the recipient (
Kozbelt, 2020). Experts possess knowledge and skills that allow them to approach works of art in ways that are unavailable to novices. Prior research in the domain of verbal art has confirmed that poets – experts in the art of poetry – possess superior relevant skills and pursue different strategic approaches to poetry than novices (
Lea et al., 2021;
Peskin, 1998;
Stumberg, 1928).
However, it remains unclear in how far the process of poetry reading is affected by readers’ level of expertise. Here, we report results of an expert–novice study of poetry reading in Russian, a language that is underrepresented in research into poetry reception. Word-level analyses of selected eye-movement measures that tap into distinct processing stages during reading confirm the hypothesis that expert knowledge affects poetry reading.
Prior research has established that reading experience and exposure to print material improves reading skills and reading efficiency in adults (
Gordon et al., 2020;
Stanovich & West, 1989), affecting not only high-level comprehension processes like inferencing (
Long et al., 1997) but also low-level processes like word recognition (
Ashby et al., 2005;
Chateau & Jared, 2000). For instance, skilled readers with a high degree of prior print exposure have shorter gaze durations, skip words more frequently, and are less susceptible to the influence of low-level word characteristics like word length and -frequency (
Chateau & Jared, 2000;
Eekhof et al., 2021;
Faber et al., 2020;
Gordon et al., 2020). Readers with high exposure to specific text genres (e.g., academic prose, lyrical poetry) are likely to acquire genre-specific expert knowledge beyond the general benefits of print exposure for text reading and comprehension. Sufficient experience with literary texts, for instance, leads to the emergence of distinct processing- and comprehension strategies for literary vs. non-literary texts (
Blohm et al., 2017;
Hanauer, 1998;
Zwaan, 1994) and for different literary genres (
Blohm et al., 2022; Peskin, 2007). Literary text reading is usually slower than non-literary reading, and it results in improved verbatim memory (
Hanauer, 1998) but less accurate memory for situational information (
Zwaan, 1994). Prior results further indicate that poetry-appropriate processing strategies modulate readers’ attentional state even prior to reading (
Blohm et al., 2021), affect readers’ processing routines and interpretive operations during reading (
Blohm et al., 2022;
Fechino et al., 2020, Peskin, 2007), and result in improved verbatim memory after reading (
Hanauer, 1998;
Lea et al., 2021). Poetry reading appears to be more careful than prose reading, since it is characterized by reduced reading speed, shorter progressive saccades and less frequent word skipping (
Blohm et al., 2022), as well as increased total word reading times and a greater tendency to re-read earlier sections of the text (
Fechino et al., 2020). Since such poetry-appropriate strategies depend on prior experience and practice, it seems reasonable to assume that they are particularly pronounced, or nuanced, in poets. Here, we examine whether inspection-time measures of reading behavior are indicative of such expertise-dependent reading strategies.
The expert knowledge of poets has been examined in prior research. Early findings revealed poets’ special abilities with respect to formal (rhyming), lexical, and semantic (verbal imagery and metaphor) aspects of poetry, as well as good verbatim memory for poetic material (
Stumberg, 1928). More recent investigations have refined our knowledge about the psychology of poetic expertise, e.g., poets’ implicit knowledge of rhyming (
Cross & Fujioka, 2019), or expertise-dependent memory effects (
Lea et al., 2021). Corpus-based research has revealed expertise- and skill-dependent differences in the poetic practices of professional vs. amateur poets, e.g., in terms of lexical choice and the treatment of poetic form (
Kao & Jurafsky, 2012). Whether and how poets’ expertise affects the process of reading poetry remains unclear, though. The present study addresses this question by comparing eye movements recorded while poets (experts) and novices silently read selected poetry; its focus on inter-individual differences and on Russian poetry broadens the scope of contemporary eye-tracking research into poetry reading (
Blohm et al., 2022;
Fechino et al., 2020;
Geyer et al., 2020;
Jacobs, 2015;
Koops van ‘t Jagt et al., 2014;
Lüdtke et al., 2014;
Magyari et al., 2020;
Menninghaus & Wallot, 2021;
Papp-Zipernovszky et al., 2021).
The primary aim of the current study was to assess whether and how expert knowledge affects poetry reading. To this end, we recorded eye movements while two age-matched groups of professional poets (i.e., experts) and novices read selected Russian poetry. Expecting that expertise-dependent reading- and comprehension strategies would lead to systematic between-group differences in reading behavior, we conducted word-level analyses of the reading process. First-fixation duration was selected as an index of early stages of word processing during reading, e.g., word recognition. We expected that this measure would be modulated by lexical properties of the words, e.g., word length and -frequency. Finding this measure sensitive to readers’ level of expertise (poets vs. novices) would support the hypothesis that poets’ reading strategy for poetry affects early stages of the reading process. Gaze duration was selected as an index of first-pass reading, which additionally reflects later and more controlled stages of comprehension, e.g., the semantic integration of a word into the previous discourse. We expected this measure to be sensitive to the level of expertise, reflecting that poets’ reading strategy for poetry emphasizes different aspects of the text, e.g., its rhythmic structure or its semantic polyvalence. Total reading time was selected as the most general index of the reading process that additionally reflects the re-reading(s) of a word, either because readers regressed locally within the text or because they re-read the entire poem. While we expected to observe expertise-dependent differences in this measure, we were cautious to base strong conclusions on potential differences in total reading times, since this measure may partly reflect strategic re-reading due to task requirements.
Methods
Participants
We recruited a sample (n = 11) of professional poets (laureates, nominees or winners of poetry awards) by personal appeal (five females, Mage = 38.0, SDage = 12.1, age range: 25–60 years). All experts engaged in literary activities professionally and reported to read poetry daily. Data from one expert had to be discarded due to a technical error during eye-movement recording so that data from ten experts entered the analysis; one expert contributed only three trials since one trial was excluded due to signal loss during recording.
We further recruited an age-matched (unpaired t-test: t(18) = 0.59, p = .563) control group of novice readers who reported to read poetry less than once a month. Data from two novices had to be discarded due to poor calibration and a technical error during eye-movement recording; data from ten novices entered the analysis (seven females, Mage = 34.7, SDage = 12.9, age range: 19–62 years).
All participants were native speakers of Russian, had normal or corrected-to-normal vision, and were naïve to the purpose of the research. All participants gave written informed consent prior to the experiment.
Stimuli and Design
We selected four Russian poems from the late 20th century (see
Table 1). This is a particularly influential era in recent Russian poetics in which poets built on the achievements of earlier centuries and introduced novel trends into poetry (
Korchagin & Larionov, 2019).
All four texts were presented to each participant. We expected that expertise-induced differences in reading behavior should become apparent as between-group differences.
Procedure
Prior to the experiment, participants gave written informed consent to volunteer as participant in the study. In a brief questionnaire, they supplied demographic data (age, gender), indicated whether they engaged with poetry professionally, and reported how frequently they read poetry.
The main experiment took about 30-45 minutes and was conducted in a well-lit and sound-attenuated room. Participants were instructed that they would read poems written by Russian authors after World War II, and that they would be asked to respond to some questions about these texts after reading; instructions asked participants to read the poems "attentively in convenient tempo".
The experiment began with a practice trial to familiarize participants with the reading situation and the subsequent tasks; for practice we presented the first stanza of Mikhail Aizenberg’s poem “The soot is white no matter how blackened…” («Сажа бела, скoлькo б не oчерняли…» (21 words, four sentences,
https://znamlit.ru/publication.php?id=5797). Following practice, participants read all four poems in randomized order while their eye movements were recorded.
Each trial began with a standard 9-point calibration and validation procedure to ensure a spatial resolution error of less than 0.5° of visual angle. Text presentation was triggered when participants fixated a black dot (16 points) conveniently located to the left of where the first word of the text would appear. Texts were left-aligned and displayed in a 25-point Times New Roman font with 1.5-line spacing.
Participants were free to read the poems at their own pace and to go back and forth within as often as they wanted without a time limit. After reading a poem, participants pressed the spacebar to proceed to three oral tasks: a free-association task, in which they were required to name any associations they had after reading the text, a keyword-task, in which they were required to name the words of the poem they considered most significant for its interpretation, and a cloze task, in which they were presented with the poem again and required to fill in gaps. i.e., individual words that had been left out. Data from the oral naming tasks are not reported here; the interested reader is referred to (
Fokin, 2021); results of an unpaired
t-test of mean accuracy rates indicated that both groups performed equally well in the cloze task (
t(18) = 0.79,
p = .438).
Recording
Participants’ eye movements were sampled at 1000 Hz with a desktop mount EyeLink 1000 Plus eye tracker (SR Research Ltd., Mississauga, Ontario, Canada). Stimulus presentation was controlled by SR Research Experiment Builder (version 2.3.38). Stimuli were presented on a 19-inch LCD monitor with a refresh rate of 60 Hz and a resolution of 1600x1024 pixels. Distance from participants’ eyes to the stimulus monitor was approximately 80 cm. A head-and-chin rest was used to minimize participants’ head movements. Viewing was binocular but only the left eye was recorded. Participants’ responses to oral experiments were recorded with App-Dictaphone (Appliqato Software, Nicosia, Cyprus), using Huawei AMN LX-9 (2019).
Data analysis
Raw data were checked manually before we applied an automatic cleaning procedure, accepting fixations between 50 ms and 800 ms. Then we extracted interest area reports, using each word as an individual interest area; we analyzed first fixation durations, gaze durations, and total reading times; skipped words were treated as missing observations. The underlying data are provided in the supplementary material (
https://osf.io/bzcra/).
We removed outlying values exceeding participant-specific cutoffs (mean + 3
SDs) before we analyzed word-level data using linear mixed-effects regression with random effects for participants. Outlier removal resulted in data loss of less than 2% in all cases; remaining observations were distributed evenly across groups although one trial from the poet group had to be discarded (Chi-squared tests for given probabilities: all
χ2(1) < 2.8; all
ps > .095). We then aimed to fit parsimonious models using a three-step selection procedure that involved both forward- and backward-fitting. Analyses were carried out in JASP (Version 0.14); the analysis file is available at
https://osf.io/nqxhe/.
1. First we fitted a base model that contained fixed main and two-way interaction effects of lexical- and text-related variables, which allowed us to control for differences between words and to approximate how readers navigated through the poems. Lexical variables included
word length (i.e., the number of letters per word),
log-frequency (i.e., the log-transformed number of occurrences in the Frequency Dictionary of Modern Russian;
Lyashevskaya & Sharov, 2009) as well as
orthographic-neighborhood size (i.e., the number of transposition- and substitution neighbors retrieved from a lexical database of modern Russian;
Alexeeva et al., 2007), all of which have been identified as relevant lexical variables in prior studies of eye movements during poetry reading (
Xue et al., 2019,
2020); expectably, all lexical variables showed moderately strong correlations (0.3 <
r < 0.7): length – frequency (
r = -0.69), length – orth. neighb. (
r = -0.56), frequency – orth. neighb. (
r = 0.45). Text-related variables included the serial
text position of each word (i.e., 1
st word, 2
nd word, etc.) as well as its
line position (final vs. non-final), which has been shown to influence reading times in poetry comprehension (
Fechino et al., 2020). Including main and interaction effects of text- and line position allowed us to approximate how participants navigated through the poems. Although this is, admittedly, a crude approximation that reduces texts to a linear sequence of words and disregards most of the text structure, e.g., the division of poems into lines and stanzas (
Beck & Konieczny, 2021;
Fechino et al., 2020;
Menninghaus & Wallot, 2021), we refrained from including further structure-related variables, since the available data did not support overly complex models; higher-order interactions were excluded for the same reason. We then eliminated non-significant predictors from the initial base model in a stepwise fashion, using a liberal alpha level of
p < .1.
2. In a second step, we added the main variables of interest to the (back-fitted) base model, i.e., fixed main and two-way interaction effects of group (experts vs. novices). Subsequently, we reduced this extended model in a stepwise fashion again, now using the stricter conventional alpha level p < .05.
3. In the final step, we forward-fitted the random-effect structure of the (back-fitted) extended models, testing whether random slopes for main effects improved the fit of the models, as indexed by the AIC; random slopes were tested in the order in which we included fixed-effects terms, i.e., lexical variables > text-related variables > reader variables. Since including random slopes affects the coefficient estimates of the respective fixed-effects terms, we checked whether the resulting models included nonsignificant (p > .05) predictors and removed them if appropriate; non-significant main effects were kept if they were part of higher-order interactions.
Results
We conducted linear mixed-effects regression analyses of first-fixation durations, gaze durations, and total reading times per word. We report the ANOVA summaries of the final statistical models determined in the model selection procedure. Our primary interest was in main and interaction effects of group, which we assume to reflect expertise-dependent adjustments of reading behavior; these effects are described in detail here. By contrast, main and interaction effects of lexical- and text variables are reported only briefly; we refer the interested reader to the Appendix for post-hoc analyses of the interaction effects.
First-fixation duration
The final regression model included only the lexical variables word length, word frequency and orthographic neighborhood size as well as their interaction as predictors (see
Table 2); there were no effects of group on first-fixation duration.
The main effect of word length (
p = .019) indicated that long words required longer first fixations than short words. The main effect of lexical frequency (
p < .001) indicated that, in line with previous results (
Ashby et al., 2005;
Kliegl et al., 2004;
Rayner, 1998), first fixations on high-frequency words were shorter than those on low-frequency words. The main effect of orthographic-neighborhood size (
p < .001) indicated that first fixations were longer for words with many competing neighbors than for words with no or only a few neighbors. Both the advantage for high-frequency words and the penalty for long words were moderated by the number of orthographic neighbors. The interaction of word length and orthographic-neighborhood size (
p < .001) reflected that the detrimental effect of word length increased with the number of orthographic neighbors; words without orthographic neighbors showed no word-length effect at all (see
Table A1 in the Appendix). The interaction of lexical frequency and orthographic-neighborhood size (
p = .014) indicated that the facilitation effect for high-frequency words was strongest for words with few orthographic neighbors (~1 word) and decreased with increasing numbers of orthographic competitors; words with large orthographic neighborhoods (~14 words) showed no frequency effect at all (see
Table A2 in the Appendix).
First-pass gaze duration
The final regression model included random effects for participants, and fixed effects of all lexical variables (word length, word frequency, orthographic neighborhood), text-related variables (text- and line position) and group as well as several interactions (see
Table 3).
We observed main effects of word length, orthographic-neighborhood size and word frequency (all ps < .001), which indicated that first-pass reading was faster for short words than for long ones, faster for words with small neighborhood sizes than for those with many orthographic competitors, and faster for high-frequency vs. low-frequency words. Main effects of serial text position (p = .016) and of line position (p = .002) reflected that gaze durations were shorter for words occurring later in the poem than for those at the beginning, and for words in line-final position than for words occurring earlier in the line. These main effects of word- and text variables were qualified by several interactions.
The interaction of word length and orthographic neighborhood (
p = .018) reflected that greater numbers of orthographic neighbors increased the detrimental effect of word length (see
Table A3 in the Appendix), whereas the interaction of orthographic neighborhood and log-frequency (
p < .001) indicated that greater numbers of orthographic neighbors reduced the facilitative effect of high frequency (see
Table A4 in the Appendix). The interaction of word length and text position (
p < .001) reflected that the word-length effect was strongest for words that occurred early in the text and decreased as readers progressed through the poems (see
Table A5 in the Appendix).
Crucially, first-pass gaze durations revealed distinct patterns in the two groups of readers. There was no main effect of group (
p = .678), but we observed interactions of group and word length (
p < .001), group and text position (
p = .041), as well as group and line position (
p = .002). While gaze durations of both groups were affected by word length (longer words took longer to read), this effect was more pronounced in novice readers (
B = 17 (±2),
CI95% = [14, 20],
p < .001) than in poets (
B = 9 (±2),
CI95% = [6, 12],
p < .001), i.e., poets were less sensitive to this low-level lexical variable. Moreover, gaze durations of novice readers became faster as they progressed through the poems (
B = -0.5 (±0.1),
CI95% = [-0.8, -0.2],
p = .001) but first-pass reading of poets showed no such tendency (
B = -0.1 (±0.1),
CI95%= [-0.4, 0.2],
p = .548); see
Figure 1.
Similarly, gaze durations of novice readers were sensitive to the position of a word within the verse line (non-final vs. final) such that line-final words were read faster than non-final ones (
Mfinal = 293,
Mnon-final = 323,
z = 4.34,
p < .001), whereas poets’ gaze durations were unaffected by the line position (
Mfinal = 269,
Mnon-final = 269,
z = 0.05,
p = .959); see
Figure 2.
Total reading times
The final model included fixed effects of all lexical variables (word length, word frequency, orthographic neighborhood) and text-related variables (text- and line position) as well as several interactions. Contrary to our expectations, there were no effects of group on total reading times (see
Table 4).
Main effects of word length (p < .001), orthographic neighborhood (p < .001), and lexical frequency (p < .001) indicated that greater word length and greater numbers of orthographic neighbors increased total reading times whereas greater word frequency reduced reading times. Additionally, total reading times decreased as readers progressed through the poems (main effect of text position, p = .017). These main effects were qualified by several interactions.
The interaction of word length and orthographic neighborhood (
p < .001) reflected that greater numbers of orthographic neighbors increased the detrimental effect of word length (see
Table A6 in the Appendix). The interaction of word frequency and orthographic neighborhood (
p = .012) indicated that greater numbers of orthographic neighbors reduced the facilitation effect observed for high-frequency words (see
Table A7 in the Appendix). The interaction of word length and text position (
p < .001) reflected that the word-length effect was strongest for words that occurred early in the text and decreased as readers progressed through the poems (see
Table A8 in the Appendix). Finally, the interaction of word length and line position (
p = .020) revealed that readers spent less time on line-final words than on non-final ones (see
Table A9 in the Appendix).
Discussion
We examined whether expert knowledge affects poetry reading. Selected Russian poems were presented to two groups of native speakers, professional poets (experts) and an age-matched sample of readers who rarely read poetry (novices). Assuming that frequent poetry reading and expert knowledge lead to the emergence of pronounced genre-appropriate reading- and comprehension strategies, we expected to observe distinct eye-movement patterns in expert- and novice readers. We examined indices of early word processing (first-fixation durations), of first-pass reading (gaze durations), and of the entire reading process, including re-reading (total reading times). We employed linear mixed-effects regression to analyze reading times per word, controlling for major lexical variables (word length, word frequency and orthographic neighborhood) as well as for the serial text position of words and their position within the verse line (non-final vs. final).
First-fixation durations were sensitive to lexical variables but unaffected by text- or line position. Crucially, we observed no systematic differences between poets and novices and thus failed to obtain evidence that early word processing during poetry reading is subject to the top-down control of poets’ genre-specific reading strategy. The lexical effects are largely consistent with prior evidence, replicating the well-established effect of word frequency (
Rayner, 1998); However, the word-length penalty we observed is at odds with prior results (
Fechino et al., 2020). The main and interaction effects of orthographic-neighborhood size can be accounted for in terms of lexical competition between similar words during reading.
Confirming the hypothesis of expertise-dependent reading behavior, first-pass gaze durations were not only modulated by lexical variables but further exhibited distinct patterns in poets and novices. For one, gaze durations of novice readers became shorter while progressing through the poems, whereas professional poets showed no such trend, i.e., first-pass reading in novice readers got faster but poets retained the same pace throughout. Since the pattern observed for novices is consistent with prior evidence that word reading times become faster as text reading progresses (e.g.,
Wallot et al., 2013), it is the steady pace observed in poets which is unusual and which presumably forms part of their poetry-appropriate reading strategy. Similarly, poets’ gaze durations were insensitive to the line position of words, but novice readers read line-final words faster than non-final ones. Thus, the reading strategy of poets seems to assign equal importance to all words, both within the entire text and the individual verse line. Notably, the finding that novice readers read line-final words faster than non-final ones is at odds with the results reported by Fechino and colleagues (2020), who observed longer reading times for line-final words across inspection-time measures of early and late processing during poetry reading. However, it seems to align with earlier reports of rhyme-induced facilitation during poetry reading (e.g.,
Hoorn, 1996;
Menninghaus et al., 2014;
Obermeier et al. 2016). The source of this discrepancy is not clear at present but it might reflect differences in the end-rhyme schemes of the stimulus texts, which affect the predictability of line-final words.
We further observed an interaction effect of group and word length such that the effect of word length on first-pass gaze durations (longer reading times for longer words) was less pronounced in professional poets than in novice readers. We note that this result resembles similar findings obtained in prior research into literary reading: Analyzing gaze durations of readers reading Dutch short stories, Eekhof and colleagues (2021) observed that greater levels of previous print exposure and greater degrees of absorption during reading were associated with decreased sensitivity to word length (and other lexical features). However, it is unclear whether the effect observed in the present study also reflects poets’ immersive reading mode due to their extensive experience with poetry, or whether it merely reflects their general reading efficiency due to a generally high level of print exposure (cf. Chateau, 2000;
Gordon et al., 2020). Future investigations aiming to re-assess the relation between readers’ genre-specific expertise and their sensitivity to lexical variables during reading should control for participants’ level of prior print exposure, e.g., by means of an author recognition test (
Stanovich & West, 1989), and preferably match groups in terms of this reader variable.
Total reading times showed effects of lexical- and text variables which are consistent with prior evidence and with the effects we observed on indices of earlier (word) processing; the main and interaction effects of orthographic-neighborhood size can be accounted for in terms of lexical competition during reading. The observed effect of text position on total reading times (readers become faster as they progressed through the poems) replicates earlier evidence from poetry reading in German (
Beck & Konieczny, 2021;
Menninghaus & Wallot, 2021). However, we failed to obtain evidence that total reading times during poetry reading differ between experts and novices.
Taken together, our results confirm that expert knowledge affects poetry reading. While we failed to obtain evidence that these reading strategies modulate early word processing during reading (as indexed by first-fixation durations) or late processing (indexed by total reading times), our results identify gaze durations as indices of expertise-dependent reading behavior. We observed the typical text-reading pattern in novices, whose gaze durations became faster as they progressed through the poems. Poets, by contrast, retained a steady pace throughout the poems and within verse lines. These reading pattern map onto proposed reading stances for literary texts (e.g.,
Jacobs, 2015;
Rosenblatt, 1988), i.e., aesthetic reading (poets) vs. efferent/immersive reading characteristic of prose reading (novices), and it might reflect that poets aim to read without bias and expect that all words might be of significance. This idea of unbiased reading in experts would account both for poets’ steady pace throughout the texts and for their insensitivity to the distinction between line-final words and non-final ones. The latter, however, might also reflect differences in the sensitivity to rhyme between novices, who might have more traditional conceptions of poetry and stronger rhyme expectations, and poets, who are presumably more familiar with and more inclined towards modern unrhymed poetry.
We note that the depth of our analyses was constrained by the amount of available data, which, in turn, was partly determined by the limited availability of professional poets as participants. Hence, the data at hand did not support complex statistical models including, for instance, more text-structural variables that have been shown to affect poetry reading (
Beck & Konieczny, 2021;
Menninghaus & Wallot, 2021). To better assess whether poetic structure (e.g., stanza form or systematic rhyme) differentially affects poetry reading in expert- and novice readers, care should be taken in future investigations that each reader is presented with a sufficient number of texts. Still, the present results provide initial evidence that experts and novices approach poetry differently, and thus identify readers’ level of expertise as a relevant variable whose influence on the reception of verbal art deserves further investigation. While many models of literary comprehension assume distinct modes of processing and comprehending literary texts (
Jacobs, 2015;
Rosenblatt, 1988;
Zwaan, 1996), our results indicate that – in line with widespread assumptions about art reception in other aesthetic domains (
Kozbelt, 2020;
Pelowski et al., 2016) – such processing strategies for verbal art are co-determined by recipients’ level of expertise.