1. Introduction
Recent years have witnessed a rapid increase in the use of online video lectures aimed both at young and adult learners [
1]. In fact, in the United States, more than 25% of students enrolled in institutions of higher learning register for online video lecture courses [
2]. In addition, 84.3% of Korean high school students have experienced learning through the Education Broadcasting System online video lectures [
3]. The proliferation of online video learning may be attributed to its numerous advantages over traditional classroom lectures.
First, it is not affected by the barriers of time, place, and circumstances [
4]. It is also more cost-effective than traditional classroom-based learning [
5] and reduces private education expenses [
3]. However, the lack of face-to-face interaction with a teacher is a major problem pertaining to video learning. In video-based instruction, the teacher cannot observe a student’s attentional lapse or extend attention-aware tutoring [
1]. Attention is important for successful learning because students who cannot sustain their attention are more likely to make frequent mistakes and tend to become more engaged in off-task behaviors [
6]. Such abstractions negatively affect learning. For instance, a recent study by [
7] found that the academic achievement of students who reported being in a state of attentional lapse during video learning fell below those who reported that they were on-task and task related thought (TRT). This consequence is natural because students who cannot hold their attention are likely to miss the essence of the intended learning and such fundamental conceptual clarity is closely linked to academic achievement [
8,
9].
Scholars are focusing on mind-wandering (MW), a kind of attentional lapse. Recent studies have demonstrated that MW is very prevalent when students watch videos [
10]. Additionally, people’s minds wandered frequently, no matter what they did. Wandering of the mind occurred in 46.9% of the samples in research [
11]. Accordingly, some researchers have contended that detecting and preventing MW would be helpful to learning [
12] and have tried to understand the phenomenon and to investigate the timing of its occurrence. However, while the observable actions of attentional lapse (yawning, chatting with friends in adjacent seats, staring elsewhere) were easily grasped, it was very difficult to determine MW through observation. MW denotes an attentional shift from TRT to internal task-unrelated thinking [
13] and includes the state of reduced awareness [
14]. It is for this reason that it is very difficult to judge MW solely through external observation.
Previously, scholars would have to depend on retrospective reports from participants. However, attention is deeply related to eye movements (fixation and saccade). Recent developments in eye-tracking technology have accorded researchers the opportunity to detect and to identify MW purely as eye movement data, which is also labeled the “eye-mind” [
15].
Studies based on the “eye-mind” have been conducted to identify MW using eye-tracking. However, most of these investigations have been accomplished in very limited experimental environments (sustained attention related task vigilance task, idea projection). Of course, trials in laboratory environments can eliminate elements that are intrusive to the purpose of the experiment. However, MW represents an attention shift to internal thought and occurs naturally in our daily lives [
13]. The detection of MW data in a limited experimental environment may differ from the determination of MW as it occurs during the viewing of a video lecture. The frequency of mind-wandering in our real-world was considerably higher than is typically seen in laboratory experiments [
11]. Therefore, this study purposed to examine the eye-tracking data element (sampling rate: 300 Hz) to detect MW using videos that recorded the lectures delivered by a teacher in a real class as is commonly utilized for video-based learning.
The self-caught method was used to approximate the MW spot during the watching of video lectures. Subsequently, more accurate MW segments were determined by comparing fixation duration with the saccade count. Finally, the oculomotor data and eye movements were investigated to ascertain which could be used as a marker indicating a learner’s MW. Additionally, we interviewed participants about their MW experience after watching video lecture. This was due to our goal to determine at which temporal point (past, present, or future) they focused on during MW and their feelings after the MW experience.
3. Method
3.1. Participants
The present study’s participants comprised 24 pre-service teachers (16 females and 8 males) who volunteered to participate in the study. The average age of the participants was 23.5.
This study measured human eye movement and oculomotor data with the approval of the Bioethics Review Board of the Korean National University of Education (project identification code: KNUE-2019-H-00225, date of approval: August 29, 2019). We fully explained the purpose of the study, the principles of measuring oculomotor data and eye movement, and the potential benefits and risks of participating in the study to the participants and confirmed whether they were willing to participate in the experiment voluntarily. All participants expressed their willingness to participate in the study voluntarily.
3.2. Stimuli and Apparatus
The study used a 30 min videotape of a live classroom lecture on Gauss’s law course on physics. This address was selected because it was believed to relate to real-world video lectures most commonly used by learners, unlike the slide or simulation-type Guru tutor. Oculomotor data and eye movements were recorded through an A Tobii Pro Spectrum (sampling rate: 300 Hz). A Tobii Pro Spectrum is more flexible in detecting head motion than any other eye trackers. At the same time, it can measure saccate data very accurately. In [
23], Eye Tribe (commercial eye tracker) could not measure saccade data accurately. This function better elucidates the details of human behavioral observation and awareness. Additionally, fast movements such as saccades can be measured without controlling the heads of the subjects.
Eye-tracking data was analyzed through an A Tobii Pro Lab which is the software designed for performing experimental study with Tobii Pro Spectrum.
3.3. Procedure
First, participants were trained by researchers to distinguish attention from MW and from attention such as TRT (task related thought) and on-task attention. Afterwards, they practiced the self-caught method of reporting MW immediately after recognizing their experience with MW. The participants’ distance from the screen and the height of their chairs were adjusted through pre-testing so that their eye movements could easily be tracked as they took positions that would be comfortable for a 30-min duration. The measurement was conducted in a slide room so that participants could watch the video lectures alone and without interruption. Precautions were provided for 20 s after the eye tracker was adjusted through 9-point calibrations.
Subsequently, the participants were asked to stare at the “ㅁ” in a picture that captured the first video lecture scene to measure the baseline of their pupil sizes. According to [
24], it is important to consider the baseline when measuring the size of the pupil because the baseline is a basic assessment of the individual’s physical and mental condition of the physiological data collected before the subject is exposed to experimental stimuli. A green screen was shown for 30 s before and after the baseline screen for accurate measurement. Finally, the video was played and the students were asked to place their finger on the upper left corner of the mouse throughout the watching of the video to prepare for self-capture. The oculomotor data and eye movements of the participants were recorded by the eye tracker during the test. This test process is illustrated in
Figure 3 and the MW self-report process is illustrated in
Figure 4.
After watching a video lecture, participants were immediately interviewed about their MW experience, while watching an eye-tracking video and the time point of MW reporting through Tobii Pro Lab software.
3.4. Data Analysis
The data were assessed through two major stages: first, the correct MW segments were identified and second, the differences between the oculomotor data and the eye movement data were distinguished vis-à-vis the found MW segments and the attention span.
3.4.1. Detect the Correct MW Segment
First, 60 TOI (time of interest) sections were analyzed in 1 s increments for 1 min prior to the MW time point reported by the participants to determine the correct MW interval. TOI can analyze the saccade metrics within the part by setting it in a specific segment that the researcher wants to analyze in the participant’s eye movement timeline (see
Figure 5).
Then, the correct MW interval was detected by comparing the analyzed saccade metrics data and the fixation duration prior to the MW reporting spot on the raw data. The fixation duration on raw data was compared to the saccade metrics because the number of saccades and the fixation duration evince a negative correlation. For example, when MW occurs, the number, velocity, and amplitude of the saccade are rapidly reduced or are not recorded, whereas the fixation duration is large, relative to the sustained attention. Therefore, MW segments can be detected more accurately when they are compared.
3.4.2. Oculomotor Data and Eye Movement Data
Second, two oculomotor data (blink count, pupil size) and 9 eye movements (average peak velocity of saccades; maximum peak velocity of saccades; standard deviation of peak velocity of saccades; average amplitude of saccades; maximum amplitude of saccades; total amplitude of saccades; saccade count/s; fixation duration; fixation dispersion) were analyzed in the MW segment and the attention span of the participants detected in the first stage. Of the total 11 sets of data, the blink count, saccade count, pupil size, fixation duration, and fixation dispersion were analyzed through raw data. The rest of the data were analyzed through saccade metrics. Then, finally, we performed planned t-tests to compare data between MW and attention.
6. Conclusions
The result was that the blink count could not be used as a marker for mind-wandering during learning video lectures among them (oculomotor data and eye movements), unlike previous literatures. On the other hand, the other elements could be used as a mind-wandering marker. Pupil size, fixation duration was bigger during MW segment than attention span. But fixation dispersion, saccade count, and 6 types of saccade metrics (average peak velocity of saccades; maximum peak velocity of saccades; standard deviation of peak velocity of saccades; average amplitude of saccades; maximum amplitude of saccades; total amplitude of saccades) values were bigger during the attention span than MW segments. Based on the results of this study, we identified elements that can be used as mind-wandering markers while learning from video lectures that are similar to real classes, among the oculomotor data and eye movement mentioned in previous literatures. In addition, many students focused on unpleasant memories of the past during the MW and this experience affected mood after the MW experience.