1.1. Effects of Word Length, Frequency, and Predictability on Reading Behavior
Reading is a complex cognitive process that involves visual word recognition, lexical access, syntactic parsing, and semantic integration, all of which contribute to the goal of text comprehension. The pre-lexical process of word recognition, including visual perception and decoding, is followed by lexical access, which involves selecting a suitable candidate in the mental lexicon [
1], while the post-lexical processes of syntactic parsing and semantic integration entail building the sentence structure [
2] and integrating different sentence meanings into a coherent understanding of the text [
3].
The eye-tracking method is used to investigate cognitive processes underlying reading. Eye movement measures can be divided into two groups according to the processing stages they correspond to: early and late. Early measures include the initial saccade landing position, single and first fixation duration, gaze duration (the sum of all fixation durations on a word during the first pass), the probability of skipping a word, and the probability of fixating a word only once [
4]. Late measurements include regression probability, regression path duration (the total duration of all fixations from the first fixation on the word until moving forward), rereading time (the sum of all fixation durations after a regression to the word), and total reading time (the sum of all fixation durations on the word) [
4,
5]. Early eye movement measures are considered primarily to reflect processing up to the stage of lexical access, while late measures are considered to reflect the processes of syntactic, semantic, and discourse integration [
4,
5,
6].
Decades of research have demonstrated robust effects of word length and word frequency—intrinsic word properties—as well as word predictability—an extrinsic property determined by the sentential context—on eye movement measures [
7,
8,
9]. Together, these properties are termed “the big three”. These effects have been demonstrated across typologically diverse languages, and therefore are supposed to be universal.
Word length primarily influences early reading processes, and it is reflected in early eye movement measures such as first-fixation and gaze durations, as well as the late measure of total viewing time [
9,
10,
11]. In general, longer words tend to result in longer and a greater amount of fixations compared to shorter words [
12,
13,
14]. These findings have been demonstrated in alphabetic languages with diverse orthographic systems, such as English [
15,
16,
17,
18,
19,
20], German [
21], French [
22], and Russian [
23], as well as in non-alphabetic languages, including abjad Hebrew [
24], Arabic [
25], logographic Japanese [
26], and Chinese [
27].
The second universal factor that influences word recognition during reading is frequency. Frequent exposure to a word in different contexts determines the rate at which its orthographic, phonological, and semantic representations are activated. Many studies have consistently demonstrated the word frequency effect: when word length is controlled, readers look longer at low-frequency words than at high-frequency words [
28,
29,
30,
31]. The frequency effect is robust and has been replicated in both alphabetic and non-alphabetic languages: English [
32], Spanish [
10], German [
9], Dutch [
33], Russian [
23], Korean [
34], and Chinese [
35]. Frequency primarily influences earlier language processing, such as lexical access, but also impacts later stages like semantic integration, and it has been examined in various measures [
9].
During reading, we anticipate upcoming words. The easier a word is to predict from the preceding context, the faster it is processed [
2,
36,
37]. Predictability has been shown to affect not only early pre-lexical and lexical processing [
2,
9,
38] but also late integration processes [
39]. Each new unit of the information must be incorporated into the discourse; when readers accurately predict words, it facilitates smoother and faster semantic integration, allowing them to connect ideas and maintain comprehension. The predictability effect has also been demonstrated for different languages, including French [
40], Chinese [
38], and Russian [
23]. However, data on the timing of predictability effect are controversial [
41]. Some studies show that the effect is revealed in later stages of linguistic processing, reflected in late eye movement measures like second-pass reading time, total reading time, and regression rates [
10,
42]. In contrast, others also report its influence on early measures, such as first fixation duration [
43,
44,
45] or skipping probability [
46].
Beyond universal parameters, language-specific factors, such as the type of writing system and grammatical structure, also play a significant role in word processing. Recently, language-specific oculomotor reading patterns were extensively covered in the Multilingual Eye movement Corpus (MECO) study, which initially included 13 languages, with a growing number continually being added [
47]. The findings reveal that certain languages stand out, such as Norwegian, Estonian, and Korean. Norwegian readers make shorter and fewer fixations and have a higher skipping rate. Conversely, Estonian language readers make a large number of fixations on the words with relatively long fixations and a high rereading rate. In Korean, where words are short in length (as in Chinese), reading time is short and the skipping rate is high [
47]. Furthermore, in the MECO study, the cross-linguistic variability in reading performance was explored. The skipping rate had the strongest systematic variability, with 46% of the variance explained by language differences. Language also accounted for 24% of the variance in the number of fixations during the first run. It accounted for a smaller portion of the variance in duration metrics: first fixation duration (5%), gaze duration (16%), and total fixation time (13%) [
47]. This pattern suggests that the majority of cross-linguistic variations in oculomotor behavior at the word level demonstrate the spatial distribution of fixations across words, specifically regarding which words capture attention and which do not. The authors attribute the skipping rate to one of the key predictors of reading: word length. A strong negative correlation was found between a language’s average word length and its estimated skipping rate. For example, Finnish, with a mean word length of 7.82 characters (SD = 3.90), has an estimated skipping rate of 6%, while Korean, with a mean word length of 2.92 characters (SD = 1.27), has a skipping rate of 29% [
47].
To sum up, the effects of word length, frequency, and predictability appear to be universal across different languages; at the same time, they are related to language-specific factors. For instance, a language’s average word length is often influenced by its grammatical structure: agglutinative languages tend to have longer words, as they combine multiple morphemes into a single word form, while analytic languages tend to have shorter words, relying on word order and auxiliary words to convey grammatical relationships. Frequency rates highly depend on the degree of inflection in a language (in morphologically rich languages we distinguish between lemma frequencies and wordform frequencies). Predictability, in turn, is influenced by factors such as homonymy rates and word order flexibility (e.g., fixed or free). Another crucial language-specific factor affecting oculomotor reading behavior is the writing system, including the degree of grapheme visual density and orthographic transparency. Therefore, although overall eye movement patterns across languages may appear quite similar and could be claimed as universal, language-specific reading strategies also play a significant role. This underscores the considerable value of cross-linguistic studies.
1.2. Reading Behavior in L2
Bilingual research mainly focuses on the mechanism of competition and coordination during the processing of an L1 and an L2. Among the factors that influence reading patterns are individual differences in language proficiency and language usage. L2 reading studies either use a within-subject design, comparing L1 and L2 reading patterns of one and the same samples [
33,
48] or a between-subject design, comparing the reading patterns of L2 readers with those of L1 readers [
49], or a combination of these two approaches [
50]. A scoping review by Quiñonez-Beltran and colleagues [
51] underlines that L1 characteristics are to be taken into account when planning L2 reading studies design. For instance, a comparative study [
52] with Chinese and Arabic readers of English demonstrates that L1 background affects L2 oculomotor reading behavior, namely fixation duration.
It should be noted that most corpora of eye movements, which allow researchers to explore large datasets and a variety of factors affecting oculomotor behavior, include only L1 data, with few exceptions like GECO [
21], MECO [
50], and CELER [
49].
The GECO [
21,
33] investigates differences between L1 and L2 reading using eye movement measures. The researchers examined unbalanced L1 Dutch–L2 English readers and discovered that these individuals exhibited longer total reading times, increased average fixation durations, a higher number of fixations, and a reduced likelihood of skipping words when reading in their non-dominant language. This oculomotor behavior parallels that of young monolingual children who are just learning to read [
53,
54] or low-literate monolingual adults [
55,
56].
The MECO-L2 [
50] comprises eye movements during English L2 reading by native speakers of 13 typologically different languages. Participants were tested on component skills such as vocabulary size, spelling, decoding efficiency, and print exposure. The study focuses on the role of L1 in L2 English reading. The contribution of different component skills is further explored in the CELER corpus [
57], which includes eye movement data from English L2 readers from diverse linguistic backgrounds (Arabic, Chinese, Japanese, Spanish, and Portuguese), alongside a control group of L1 English readers. CELER also includes scores from standardized L2 proficiency tests. Direct effects of L2 proficiency on oculomotor reading behavior were demonstrated by Berzak, Katz, and Levy [
58]. Their findings revealed that first fixation duration and total reading time not only correlated with scores on standardized English proficiency tests but also effectively predicted test outcomes. Eye movements in low-proficient L2 readers were characterized by longer fixation durations, a lower skipping rate, a higher probability of regressive saccades, and more pronounced frequency effects.
It was shown that L2 readers exhibit many qualitative effects similar to those of L1 readers. Notably, however, they demonstrate a so-called proficiency-sensitive lexicon–context tradeoff [
21,
49]: the eye movements of the most proficient L2 readers resemble those of L1 readers, whereas as L2 proficiency decreases, the readers become less responsive to a word’s predictability based on context and more influenced by word frequency, which remains constant across contexts. This tradeoff reinforces an experience-based explanation for how context-driven expectations are utilized in L2 language processing: diminished exposure and practice result in weaker connections between wordforms and their mental representations. Consequently, accessing vocabulary in the non-dominant language becomes less efficient, as evidenced by greater frequency effects and longer reading times. The less language experience a reader has, the more they rely on context-independent information (e.g., word frequency). As their experience increases, their predictions become more contextualized.
Whitford and Titone [
59,
60,
61], who studied late L2 bilinguals of various ages and different language dominance (English or French), found the same influence of L2 exposure on the strength of the frequency effect in both early and late eye-tracking measures. However, the strength of the predictability effect was not shown to differ in L1 and L2 reading. Mor and Prior [
48] reported that the predictability effect was more pronounced in L2 reading while Berzak and colleagues [
49] showed that predictability effects were, in L2 reading, larger for gaze duration and total reading time, although evidence for a lexicon–context tradeoff was demonstrated in the following: the word predictability effect on total reading time in L1 was larger than word frequency effect, while in L2 it was vice versa. Fernandez & Allen (2025), comparing the L1 English reading and L2 English reading of German learners, demonstrate a predictability effect on early measures—gaze duration and skipping rates [
62]. The effect is found in both L1 and L2 groups, with L2 readers being less efficient. This quantitative, but not qualitative, difference is explained by the reduced capacity for parafoveal processing in L2 readers. The study of Xiao and colleagues with Tibetan–Chinese bilinguals [
63] also demonstrates that the preview benefit is more prominent for L1 than for L2 readers and emphasizes that semantic information in L2 reading is accessed at a later stage.
Regarding word length, Cop and colleagues [
21] demonstrated that a greater number of symbols in a word leads to an increased fixation count, an effect that is more pronounced even in proficient (B2–C1) L2 readers in comparison with native speakers. Thus, the big three effects in L2 reading depend on the proficiency level; length and frequency effects are more pronounced in L2 reading than in L1 reading, whereas the data on the predictability effect are more controversial.
The vast majority of L2 reading studies focus on reading in English, so eye movement patterns remain understudied cross-linguistically. Daniels and Share [
64] emphasize that existing reading theories mostly rely on English reading data and features of English orthography; therefore, taking other writing systems into account can broaden the scope of reading studies and shed some light on universal and language-specific reading mechanisms.
Our study aims to address this gap by presenting data from typologically distinct L1/L2 pairs: native Russian L1 readers and Chinese L2 readers of Russian. These two languages are particularly interesting to compare because they exhibit important differences on two basic levels: writing system and grammar.
1.3. Reading in Russian and Reading in Chinese
Modern written Chinese consists of horizontally arrayed strings of characters, going from left to right. Each character occupies a rectangular region of the same size and can be further decomposed into component radicals or even further into a series of individual strokes. Chinese characters typically map onto one-syllable morphemes. Chinese words can be either monomorphemic (one character) or polymorphemic (composed of two or more characters). According to the Chinese Word Corpus of Academia Sinica Taiwan (1998), over 76% of the words (type frequency) consist of two or three characters [
65]. When token frequency is considered, words contain one and two characters for 54% and 42%, respectively [
61]. Chinese lacks spaces between words, and many cases involve word boundary ambiguity. This makes word segmentation—the extraction of words from a character string—more difficult than in alphabetic languages.
The body of work on eye movement studies on Chinese has been growing in the past two decades. For characters, the effects of character orthography, frequency, and complexity (i.e., the number of strokes) on fixation durations have been obtained [
35,
66,
67,
68,
69]. For words, the effects of word frequency and word space availability have been demonstrated [
38,
70,
71,
72,
73,
74]. For sentential constraints, the interaction of word ambiguity and context in the lexical ambiguity resolution of Chinese homographs was shown in [
75]. These results demonstrate that Chinese readers utilize both word-level and contextual information during reading.
Pan and colleagues [
27] developed the Beijing Sentence Corpus (BSC) and presented the eye-tracking data from Chinese readers, including large-scale predictability norms and their effect on saccade targeting. The results showed that low frequency, low contextual predictability, and high visual complexity may lead to difficulties in parafoveal word segmentation, resulting in fixation locations shifting from the center of the word toward its beginning.
Chinese is visually dense in contrast to alphabetical languages, which are typically more horizontally expansive. This difference affects eye movement patterns [
76]: Chinese readers make fewer but longer fixations that are positioned closely together, whereas Finnish readers make more numerous but shorter fixations spaced further apart [
76]. Also, average fixation durations are longer in Chinese compared with English [
76]. Other findings indicate that rightward saccade length is also affected by language. Chinese readers make the shortest forward saccades, while English readers make the longest—even longer than Finnish readers [
77]. The average forward saccade length for Chinese readers was half that of readers of the two alphabetic languages [
77]. Chinese readers also make more regressions. One of the reasons may be because this is a compensation for frequent word skipping during first-pass reading (more than half of the words are left unfixed). In other words, Chinese readers may need to go back to long sentences to confirm their exact meaning [
77].
Russian, together with Chinese, is among the five most widely spoken languages in the world and the most widely spoken language using the Cyrillic alphabet. Characteristics of Russian, such as its writing system which is in the middle of the continuum between shallow and deep orthographies with quite complex, but sufficiently regular and predictable phoneme–grapheme correspondences [
78], and its rich inflectional and derivational morphology, are of considerable interest for comparative reading research.
Descriptive statistics for basic eye movement variables that are considered as fundamental measures of reading fluency (skipping, first fixation duration, gaze duration, total fixation duration, number of fixations on the word, regression-in, and rereading) for Russian along with 12 other languages is provided in the MECO [
47] and MECO Wave 2 [
79].
The study by Laurinavichyute and colleagues [
23] presents the Russian Sentence Corpus and establishes basic eye movement benchmarks for reading in Russian. It provides descriptive corpus statistics for reading Russian in the form of the average saccade length, landing site, fixation duration measures, and probabilities of skipping and fixating words, as well as proportions of regressions during the reading of natural sentences. It was shown that Russian does not differ from other alphabetic and logographic languages regarding the “big three” variables (word length, frequency, predictability), nor in mean fixation duration and mean saccade amplitude. However, some discrepancies related to the low-level oculomotor characteristics were identified. For example, in Russian, but not in German, the probability of having one fixation on a word increases with the increase in word length and predictability (in Russian, if a word is not fixated once, it is more likely to be skipped than to be fixated more than once, while in German the opposite is true). Given that a prominent characteristic of Russian is its complex morphology, in addition to the “big three” effects, Laurinavichyute and colleagues also account for morphological predictors, namely the part-of-speech category, morphosyntactic ambiguity, and morphological word form (base vs. nonbase). They reported that verb processing requires more effort than noun processing, as reflected in both early (gaze duration) and late (total reading time) measures. Similarly, nonbase wordforms took longer to read than baseword forms; this was also found in both early (first fixation duration) and late (total reading time) measures.
Regarding word length, Alexeeva and Slioussar [
80] compared the effect of longer, same-length, and identical parafoveal previews using the gaze-contingent boundary paradigm. They demonstrated that readers of Russian obtain information about word length parafoveally and use it not only to plan subsequent saccades but also for word recognition: first fixation duration and gaze duration were longer in the longer-preview condition than in the same-length condition. Also, the study by Staroverova and colleagues [
81] demonstrated that Russian readers rely on orthographic, but not phonological, information extracted from the parafovea.
To sum up, the Chinese language uses the logographic script without apparent word boundaries or apparent distinctions between roots and affixes. In contrast, Russian uses the Cyrillic alphabet, exhibits complex but regular phoneme–grapheme correspondences, and features a rich system of morphosyntactic rules. The differences between the two languages may result in specific processing difficulties for Chinese learners of Russian as well as Russian learners of Chinese [
82].
In this study, we aim to investigate the role of universal effects in L2 oculomotor reading behavior, focusing on the typological differences between the readers’ L1 and L2, namely their script and grammar systems. Assuming that reading strategies of L1 affect those of the L2 in non-advanced readers, we hypothesize that native readers of Chinese, which is characterized by visually dense logographic script and monomorphemic word structure, will demonstrate an enhanced word length effect when reading L2 Russian, which has an alphabetic script and a polymorphemic word structure. Furthermore, higher rates of homonymy and homography in monosyllabic tonal Chinese may enhance anticipatory processes during reading. This, in turn, could modulate the word predictability effect not only in L1 Chinese reading (which is beyond the scope of this study) but also in L2 Russian reading. We are particularly interested in the role of these universal effects at different processing stages, as reflected in early and late eye movement measures. Finally, we will investigate the role of language exposure (the duration of studying L2) on the magnitude of these universal effects in L2 reading.