Comparing Eye-Tracking and Verbal Reports in L2 Reading Process Research: Three Qualitative Studies
Abstract
1. Introduction
1.1. Pros and Cons of Verbal Reports and Eye-Tracking
1.2. Studies Comparing Think-Alouds and Eye-Tracking
2. Materials and Methods
3. Results
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| RVRs | Retrospective verbal reports |
| L2 | Second Language |
Appendix A. The Two Gap-Filling Tasks
Appendix A.1. The “Tower” Task
| (A) cheaper | (I) eventually |
| (B) cleaner | (J) height |
| (C) collection | (K) necessarily |
| (D) competed | (L) production |
| (E) constructed | (M) range |
| (F) consulted | (N) scale |
| (G) dimension | (O) undertaken |
| (H) discovered |
Appendix A.2. The “Sara” Task
| (A) campaign | (I) proclamation |
| (B) career | (J) rectified |
| (C) characters | (K) reputed |
| (D) features | (L) rest |
| (E) fierce | (M) supposed |
| (F) inspired | (N) traditional |
| (G) latter | (O) versatile |
| (H) obsessed |
Appendix B. Codes and Examples
Appendix B.1. Reactivity of Eye Tracking
Appendix B.2. Reactivity of Think-Alouds and RVRs
- (1)
- 不影响, 只是很脑海里想的, 我口述表达不出来.
- (2)
- 不影响, 口述 (Theme 3, Code 1) 甚至让我脑海更清楚, 让我对自己的做题过程更加深刻 (Theme 1, Code 1) (Theme 4, Code 1).
Appendix C. P7’s and P6’s Answers Regarding the Influence of Eye-Tracking


References
- Ericsson, K.A.; Simon, H.A. Protocol Analysis: Verbal Reports as Data; MIT Press: Cambridge, MA, USA, 1993. [Google Scholar]
- Afflerbach, P.; Johnston, P. On the use of verbal reports in reading research. J. Read. Behav. 1984, 16, 307–322. [Google Scholar] [CrossRef]
- Anderson, N.J. Individual differences in strategy use in second language reading and testing. Mod. Lang. J. 1991, 75, 460–472. [Google Scholar] [CrossRef]
- Tan, X. Stories behind the scenes: L2 students’ cognitive processes of multimodal composing and traditional writing. J. Second Lang. Writ. 2023, 59, 100958. [Google Scholar] [CrossRef]
- Zhong, Y. Asperger writers’ attention and awareness in written production. Lang. Aware. 2025, 1–26. [Google Scholar] [CrossRef]
- Conklin, K.; Pellicer-Sánchez, A.; Carrol, G. Eye-Tracking: A Guide for Applied Linguistics Research; Cambridge University Press: Cambridge, MA, USA, 2018. [Google Scholar]
- Just, M.A.; Carpenter, P.A. A theory of reading: From eye fixations to comprehension. Psychol. Rev. 1980, 87, 326–354. [Google Scholar] [CrossRef]
- Chung, E.S. The effect of text presentation format on L2 processing of machine translation errors. Comput. Assist. Lang. Learn. 2025, 1–21. [Google Scholar] [CrossRef]
- Godfroid, A. Eye Tracking in Second Language Acquisition and Bilingualism: A Research Synthesis and Methodological Guide; Routledge: New York, NY, USA, 2019. [Google Scholar]
- Winke, P. The effects of input enhancement on grammar learning and comprehension. Stud. Second Lang. Acquis. 2013, 35, 323–352. [Google Scholar] [CrossRef]
- Xie, Y.; Wang, T.; Fan, F.; Yu, M. The impact of input mode on incidental vocabulary learning for CSL learners: Evidence from offline tests and eye-tracking studies. System 2025, 135, 103853. [Google Scholar] [CrossRef]
- Godfroid, A.; Finch, B.; Koh, J. Reporting eye-tracking research in second language acquisition and bilingualism: A synthesis and field- specific guidelines. Lang. Learn. 2025, 75, 250–294. [Google Scholar] [CrossRef]
- Holmqvist, K.; Nyström, M.; Andersson, R.; Dewhurst, R.; Halszka, J.; van de Weijer, J. Eye Tracking: A Comprehensive Guide to Methods and Measures; Oxford University Press: Oxford, UK, 2011. [Google Scholar]
- Ericsson, K.A.; Simon, H.A. Verbal reports as data. Psychol. Rev. 1980, 87, 215–251. [Google Scholar] [CrossRef]
- Russo, J.E.; Johnson, E.J.; Stephens, D.L. The validity of verbal protocols. Mem. Cogn. 1989, 17, 759–769. [Google Scholar] [CrossRef]
- Bowles, M. The Think-Aloud Controversy in Second Language Research; Routledge: New York, NY, USA, 2010. [Google Scholar]
- Godfroid, A.; Spino, L. Reconceptualizing reactivity of think-alouds and eye-tracking: Absence of evidence is not evidence of absence. Lang. Learn. 2015, 65, 896–928. [Google Scholar] [CrossRef]
- Yang, C.S.; Hu, G.W.; Zhang, L.J. Reactivity of concurrent verbal reporting in second language writing. J. Second Lang. Writ. 2014, 24, 51–70. [Google Scholar] [CrossRef]
- Wilson, T.D. The proper protocol: Validity and completeness of verbal reports. Psychol. Sci. 1994, 5, 249–252. [Google Scholar] [CrossRef]
- Barkaoui, K. Think-aloud protocols in research on essay rating: An empirical study of their veridicality and reactivity. Lang. Test. 2011, 28, 51–75. [Google Scholar] [CrossRef]
- Yang, C.S. The veridicality of think-aloud protocols and the complementary roles of retrospective verbal reports: A study from EFL writing. Asia-Pac. Educ. Res. 2019, 28, 531–541. [Google Scholar] [CrossRef]
- Lee, S.; Winke, P. Young learners’ response processes when taking computerized tasks for speaking assessment. Lang. Test. 2018, 35, 239–269. [Google Scholar] [CrossRef]
- Starr, M.S.; Rayner, K. Eye movements during reading: Some current controversies. Trends Cogn. Sci. 2001, 5, 156–163. [Google Scholar] [CrossRef]
- Williams, A.M.; Davids, K. Assessing cue usage in performance contexts: A comparison between eye-movement and concurrent verbal report methods. Behav. Res. Methods Instrum. Comput. 1997, 29, 364–375. [Google Scholar] [CrossRef]
- Pritchard, R. The effects of cultural schemata on reading processing strategies. Read. Res. Q. 1990, 25, 273–295. [Google Scholar] [CrossRef]
- Nevo, N. Test-taking strategies on a multiple-choice test of reading comprehension. Lang. Test. 1989, 6, 199–215. [Google Scholar] [CrossRef]
- Thomas, D.R. A general inductive approach for analyzing qualitative evaluation data. Am. J. Eval. 2006, 27, 237–246. [Google Scholar] [CrossRef]
- Bachman, L.F. Performance on cloze tests with fixed-ratio and rational deletions. TESOL Q. 1985, 19, 535–556. [Google Scholar] [CrossRef]
- Stratman, J.F.; Hamp-Lyons, L. Reactivity in concurrent think-aloud protocols: Issues for research. In Speaking About Writing: Reflections on Research Methodology; Smagorinsky, P., Ed.; Sage: London, UK, 1994; pp. 89–112. [Google Scholar]
- Jourdenais, R. Cognition, instruction, and protocol analysis. In Cognition and Second Language Instruction; Robinson, P., Ed.; Cambridge University Press: Cambridge, UK, 2001; pp. 354–375. [Google Scholar]
- Yang, C.S.; Zhang, L.J.; Parr, J. The reactivity of think-alouds in writing research: Quantitative and qualitative evidence from writing in English as a foreign language. Read. Writ. 2020, 33, 451–483. [Google Scholar] [CrossRef]
- Tabari, M.A.; Sato, M.; Wang, Y. Engagement with written corrective feedback: Examination of feedback types and think-aloud protocol as pedagogical interventions. Lang. Teach. Res. 2023, 1–34. [Google Scholar] [CrossRef]

| Condition | n | Age | Gender | Gaokao English Score | Gap-Filling Scores | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| M | SD | M | F | M | SD | Min. | Max. | M | SD | Min. | Max. | ||
| Eye-tracking | 17 | 18.80 | 0.69 | 8 | 9 | 129.24 | 21.90 | 85 | 148 | 51.47% | 0.28 | 15% | 95% |
| Think-alouds | 9 | 18.74 | 0.42 | 6 | 3 | 135.56 | 16.41 | 105 | 146 | 48.89% | 0.21 | 20% | 75% |
| RVRs | 10 | 18.46 | 0.32 | 8 | 2 | 127 | 22.78 | 85 | 146 | 51.5% | 0.25 | 10% | 80% |
| II Support strategies | “skims reading material for a general understanding” |
| “scans reading material for a specific word or phrase” | |
| IV Strategies for stablishing coherence | “rereads” |
| “reads ahead” | |
| V Test-taking strategies | “looks for the answers in chronological order in the passage” |
| “matches the stem and/or alternatives to a previous portion of the text” | |
| “reads the questions and options after reading the passage” | |
| “reads the questions and options before reading the passage” | |
| “changes an answer after having marked one” | |
| “stops reading the options when they reach the answer” | |
| “skips a question and returns to it later” | |
| “skips a question that is not understood and leaves the response blank” | |
| “marks answers without reading in order to fill the space” |
| (1) Heatmap: The heatmap reveals that Blank 7 (circled) was not intensely processed, compared with other blanks such as Blank 10, the last one, which attracted a much greater accumulation of fixations, as indicated by several prominent yellow clouds. |
![]() |
| (2) Choice-making graph: The choice-making graph demonstrates a straightforward selection for Blank 7 (the circled “N” choice, completed at 04:35), which was made in a single attempt without revision. In contrast, the final selection of “L” for Blank 10 was preceded by four tentative choices and considerable hesitation. The graph also shows that the completion of Blank 7 followed the first closure of Blank 6 and was followed by the completion of Blank 9, skipping over Blank 8. |
![]() |
(3) Measures such as fixation duration, fixation counts, and visits:
|
| (4) Gaze plot: |
![]() |
| The gaze plot shows precisely what P6 read and her reading sequence during the completion of Blank 7. The plot uses red dots to represent fixations, with numerical labels indicating their sequence and dot size corresponding to fixation duration. As shown, P6 read the text before and after the blank, as well as a wide array of options within the bank. Fine-grained analyses reveal that P6 first read the text preceding the blank, starting from the word “have”. Upon reading the word “the” to the left of the blank, she began to read the options. Her scan began at the top with “campaign” and proceeded downward through the options to “versatile.” She then returned to the word “festivals” following the blank, which was the final word of the sentence (after a brief fixation on “Lincoln”). At this point, she may have realized that an adjective, rather than a noun, was required. She then returned directly to fixate on the option “traditional” before finalizing her choice in the blank. (5) Other relevant information that may be gained via replay of screen recording:
|
| Think-alouds (from P19, who achieved 10% accuracy on the task) | Online immediate RVRs (from P27 to P36, who achieved 0 to 80% accuracy on the task) |
![]() | P32 (80% accuracy): “while celebrating the (.) I guess it should be festivals, the traditional festivals.” P33 (80% accuracy): “would bring, that is, get the family together to celebrate this traditional festival.” P27 (70% accuracy): “Celebrating the what festivals, must be celebrating the traditional festivals, that is, celebrating this traditional festival, would bring families together, that is, make families cohesive.” P28 (70% accuracy): “Next, I should choose a word to collocate with this festivals. Based on my experience, and the reference to this foregoing Thanksgiving, traditional should be chosen.” P30 (70% accuracy): “The 7th blank, traditional festival should be chosen.” P31 (50% accuracy): “Because generally it is celebrating some traditional festivals.” P34 (40% accuracy): “It is traditional festivals, the previous choice was wrong.” P36 (20% accuracy): “It also is a traditional.” P29 (0% accuracy): “This blank is celebrating what kind of festival. Then, down below many words I don’t know. I will then choose L” (rest). P35 (60% accuracy): Recording missing. Their answer papers were also available. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Yang, C.; Hu, G.; Que, K.; Fan, N. Comparing Eye-Tracking and Verbal Reports in L2 Reading Process Research: Three Qualitative Studies. J. Eye Mov. Res. 2026, 19, 2. https://doi.org/10.3390/jemr19010002
Yang C, Hu G, Que K, Fan N. Comparing Eye-Tracking and Verbal Reports in L2 Reading Process Research: Three Qualitative Studies. Journal of Eye Movement Research. 2026; 19(1):2. https://doi.org/10.3390/jemr19010002
Chicago/Turabian StyleYang, Chengsong, Guangwei Hu, Keyu Que, and Na Fan. 2026. "Comparing Eye-Tracking and Verbal Reports in L2 Reading Process Research: Three Qualitative Studies" Journal of Eye Movement Research 19, no. 1: 2. https://doi.org/10.3390/jemr19010002
APA StyleYang, C., Hu, G., Que, K., & Fan, N. (2026). Comparing Eye-Tracking and Verbal Reports in L2 Reading Process Research: Three Qualitative Studies. Journal of Eye Movement Research, 19(1), 2. https://doi.org/10.3390/jemr19010002





