Next Article in Journal
Pun Processing in Advertising Posters: Evidence from Eye Tracking
Previous Article in Journal
Eye Tracking in Optometry: A Systematic Review
 
 
Journal of Eye Movement Research is published by MDPI from Volume 18 Issue 1 (2025). Previous articles were published by another publisher in Open Access under a CC-BY (or CC-BY-NC-ND) licence, and they are hosted by MDPI on mdpi.com as a courtesy and upon agreement with Bern Open Publishing (BOP).
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Behind the Scenes: Impact of Virtual Backgrounds in Educational Videos on Visual Processing and Learning Outcomes

by
Leen Catrysse
*,
Andrienne Kerckhoffs
and
Halszka Jarodzka
Open Universiteit, Department of Online Learning and Instruction, Heerlen, The Netherlands
*
Author to whom correspondence should be addressed.
J. Eye Mov. Res. 2023, 16(3), 1-16; https://doi.org/10.16910/jemr.16.3.4
Submission received: 26 April 2023 / Published: 19 October 2023

Abstract

:
The increasing use of instructional videos in educational settings has emphasized the need for a deeper understanding of their design requirements. This study investigates the impact of virtual backgrounds in educational videos on students' visual information processing and learning outcomes. Participants aged 14-17 (N=47) were randomly assigned to one of three conditions: a video with a neutral, authentic, or off-topic background. Their prior knowledge and working memory capacity (WMC) were measured before watching the video, and eye tracking data was collected during the viewing. Learning outcomes and student experiences were assessed after viewing. The eye tracking data revealed that a neutral background was the least distracting, allowing students to pay better attention to relevant parts of the video. Students found the off-topic background most distracting, but the negative effect on learning outcomes was not statistically significant. In contrast to expectations, no positive effect was observed for the authentic background. Furthermore, WMC had a significant impact on visual information processing and learning outcomes. These findings suggest that educators should consider using neutral backgrounds in educational videos, particularly for learners with lower WMC. Consequently, this research underscores the significance of careful design considerations in the creation of instructional videos.

Introduction

Even though instructional videos are being used since the early 1900s as a means to deliver instruction, research on learning from instructional videos only really started to thrive in the past few years (de Koning et al., 2018; Fiorella, 2021). This relatively recent sense of a need for knowledge concerning video design, can be explained by the substantial increased use of instructional videos in formal as well as non-formal learning environments (Bétrancourt & Benetos, 2018; de Koning et al., 2018; Merkt et al., 2020). This development has taken an even further flight during the Covid-19 pandemic when schools were closed and educators all over the world had to turn to online education, including recording their own instructional videos. However, guidelines on how to design these instructional videos are limited (de Koning et al., 2018; Fiorella, 2021). Since instructional design based on intuition alone, endangers an efficient and effective learning experience (Neelen & Kirschner, 2020), it is important to gain more insight into questions related to the design of instructional videos.
As a research area, instructional video design, offers a broad scale of research topics to explore like, how to present the information, the social and emotional effects of videos, or the added value of videos over static images, to name a few (for an overview, see e.g., Bétrancourt & Benetos, 2018; de Koning et al., 2018). Findings of studies concerning some of these topics have already led to research-based principles for the design of instructional videos, such as the pacing principle or the signaling principle (Fiorella, 2021), but there are still a lot more questions that can be asked and answers that need to be found (de Koning et al., 2018).
One of those questions concerns the effect of a video’s background on learning outcomes. Online software, such as Microsoft Teams or Zoom offers the possibility to choose a virtual background. By doing so, one prevents the other interlocutors from seeing the actual setting from which one is working or presenting. There can be several reasons why one would choose to change the background image, for instance privacy, a lack of a suitable work area, or personal preferences. Instructors, like teachers who work from home, might also wish to use this option for their instructional videos or online courses. It would be helpful for them if there were guidelines concerning background options, so they would know whether it matters which background they use, and if so, which sort of background would be best to choose. The chosen background should, at least, not hamper learning, and preferably, even optimize learning.
To our knowledge, the role of an instructional video’s setting or background, has only been explored by Merkt et al. (2020). They compared learning outcomes from watching a video that was filmed in front of a white wall, with learning outcomes from watching a video filmed in an authentic setting, which was a greenhouse in their case. They expected to find that the video’s setting would affect the learning outcomes, but it turned out that there were no differences between both conditions (Merkt et al., 2020). So far, instructional video research has neglected the potential importance of virtual backgrounds in home-made instructional videos and how it affects both a learner’s visual information processing and learning outcomes. Therefore, the aim of this study is to investigate whether a virtual background in an instructional video affects visual processing and learning outcomes.

The role of backgrounds in educational videos

The Cogntive Theory of Multimedia Learning (CTML) explains how people learn from multimedia presentations such as instructional videos (Mayer, 2021). The theory is based on three assumptions. The dual-channel assumption (Baddeley, 1999; Paivio, 1986), which considers that people process incoming textual and pictorial information through two separate channels. The limited-capacity assumption (Baddeley, 1999; Sweller et al., 2011), which suggests a limit to the amount of information each channel can process at a time. And the active processing assumption (Mayer, 2009; Wittrock, 1989) which states that people should actively attend to useful information in order to transfer this information into the working memory. There, the information should be organized into mental models so it can be connected to prior knowledge already present in the long-term memory (Mayer, 2021).
The limited-capacity assumption is also reflected in the Congitive Load Theory (Sweller et al., 1998, 2019): To be able to learn, the learner must have enough capacity available in working memory for relevant information processing, while unnecessary processes must be avoided (Sweller et al., 2019). According to the CLT, there are three types of cogntive load that a learner deals with during learning: intrinsic, extraneaous and germane load (Sweller et al., 1998, 2019). The amount of intrinsic cognitive load is determined by the complexity of the subject matter itself and the learner’s prior knowledge. This load has nothing to do with the way how the learning material is presented, and it cannot be changed. The amount of extraneous cognitive load on the other hand, is related to the instructional procedures or design. This can be changed, for instance by altering the material’s design. Germane load refers to working memory resources dealing with intrinsic rather than extraneous cognitive load.
Both theories have led to numerous design guidelines (Mayer & Fiorella, 2021; Sweller et al., 2019). The focus of the current study is on the guideline concerning seductive details (Fiorella & Mayer, 2021). Seductive details are interesting but irrelevant details, that are not part of the subject matter. They are only added to the learning material to make the subject and material more appealing to the learner, and thus create interest (Mayer, 2019). Seductive details might increase extraneous cognitive load and might hamper learning since non-relevant information might take up much needed working-memory capacity. The overall conclusion of an extensive meta-analysis by Rey (2012) was that seductive details hamper learning. This could be seen in retention as well as transfer performance (Rey, 2012). According to Merkt et al.’s (2020) distraction hypothesis, a video’s background might be distracting to the learner. Adding a background to an instructional video could be considered as seductive details, since it is not integrated in the subject matter that must be learned (Merkt et al., 2020).
Merkt et al. (2020) assumed a competing hypothesis when studying the effects of an authentic versus a neutral video background. Their expertise hypothesis assumes that learners are more likely to focus on a teacher who they consider to be an expert, than on a teacher they consider to be a novice (Cheng et al., 2013). Indeed, several studies found that instruction from an expert, or even a perceived expert, led to better learning outcomes (Boekhout et al., 2010; Hoogerheide et al., 2016; Lachner & Nückles, 2015). In certain situations, learners have no clue about the teachers’ level of expertise. For example, a learner might see an instructional video about a topic the learner has little or no prior knowledge about explained by a teacher that the learner does not know. In this specific scenario, learners rely on observable factors to form an opinion (Schwarz, 2007), such as clothing styles (Morris et al., 1996), gender (Gurney et al., 2017), or age (Hoogerheide et al., 2016). Next to these personal features, people are also influenced by the context in which they see another person (Schwarz, 2007). Wittenbrink et al. (2001), for instance, showed that the same Black person was perceived differently, if the background picture was a church interior instead of a street corner. The street corner picture led to a larger amount of negative automatic responses (Wittenbrink et al., 2001). Therefore, in line with Merkt et al. (2020), we assume that also an instructional video’s background could serve as a sign of expertise. The background might serve as sign of expertise if it represents the usual work environment of the presenter (Merkt et al., 2020). Seeing the instructor in their natural habitat so to speak, could have a positive effect on the perceived expertise, which in turn might have a positive effect on the learning outcomes.

Working memory capacity

As already stated, leading memory researchers, agree that the capacity of the working memory (WMC) is limited (e.g., Baddeley & Logie, 1999; Wiley et al., 2014), but it also varies across individuals (Engle, 2002). In this view, the WMC is not only determined by the limited number of elements one can remember, but rather on the ability to control one’s own attentional resources (Engle, 2002, 2018; Shipstead et al., 2015). In this approach, WMC indicates how capable a person is in avoiding distraction, while staying focused on information that should be noticed and retained to be further processed (Engle & Kane, 2003; Shipstead et al., 2015).
Research already demonstrated that seductive details are more harmful to learners with low attention control and thus low WMC (Rey 2014; Sanchez & Wiley, 2006). When videos contain backgrounds including seductive details, they might be more harmful for learners with low WMC since they have difficulties to control their attentional resources.

Visual information processing

Over the years, eye tracking has proven to be a useful method to gain insight into the visual and cognitive processing of information while learning (Alemdag & Cagiltay, 2018; Coskun & Cagiltay, 2022; Jarodzka et al., 2017; Lai et al., 2013). For instance, Tsai et al. (2018) studied the effects of seductive illustrations in a PowerPoint presentation. Their eye tracking data showed that seductive details drew learners’ attention away from the relevant pictures in the presentation. Sanchez and Wiley (2006) found that participants with high WMC paid less attention – as indicated by eye tracking data – to seductive images that were added to a Web page.
In instructional videos, several areas of interest (AOIs) can be considered such as the teacher, the PowerPoint slides and the background. The background is considered as a less relevant area compared to the others. A bigger focus of attention on the background, instead of the other areas, could indicate that the background is a distraction comparable to a seductive detail (Sanchez & Wiley, 2006) or that a learner pays more attention to the background to form an opinion on the instructor’s expertise.

Research questions

Like Merkt et al. (2020), this study investigates the effect of a video’s background on learning outcomes. But instead of comparing videos filmed in different environments, we compared home-made instructional videos with different virtual backgrounds. Merkt et al. (2020) defined an expertise hypothesis and a distraction hypothesis. Their expertise hypothesis states that a learner pays more attention to an expert. Therefore, a virtual background that reflects an authentic work environment of the teacher, might contribute to the perception of this teacher as being an expert, and therefore, could positively affect learning outcomes. Their distraction hypothesis on the other hand, states that an authentic background might cause more distraction as the learner would pay less attention to the relevant subject matter, and thus, hamper learning (Merkt et al., 2020). In the current study we compare such authentic to neutral virtual backgrounds and add a third condition with an off-topic background, which is often used in online meeting software. Additionally, we investigate to which extent a learner’s ability to control their attention (Rey, 2014; Sanchez & Wiley, 2006) might interfere with the overall effect of the backgrounds. To do so, we take learners’ working memory capacity (WMC) into account. Finally, we use eye tracking to explore whether and if so, how, a learner’s visual information processing is affected by these different backgrounds (Mayer, 2019).
The first research question examined the effect of background and WMC on visual processing (RQ1). We expect that learners allocate more attention on the backgrounds including (seductive) details (authentic and off-topic background) than the neutral background (RQ1a). Additionally, we expect that individuals with a higher WMC will be less affected by distracting backgrounds (authentic and off-topic) than individuals with a lower WMC and will allocate less attention to distracting backgrounds (RQ1b).
Our second research question examined the effect of the background and WMC on learning outcomes (RQ2). If the expertise hypothesis (Merkt et al., 2020) was correct, then the authentic virtual background would lead to better learning outcomes than the neutral and off-topic condition, since it functioned as a sign of expertise. If the distraction hypothesis was correct, then the neutral virtual background would lead to better learning outcomes, because it contained no seductive details that might distract the viewer. The off-topic condition would make clear if the background mattered at all, since this background contained potentially distracting features that were unrelated to the learning material and it could not provide a sign of expertise (RQ2a). Regarding the influence of WMC on learning, we expect that individuals with a higher WMC will be less affected by distracting backgrounds (authentic and off-topic) than individuals with a lower WMC and their learning outcomes will be unaffected by virtual background whereas individuals with a lower WMC will have weaker learning outcomes in distracting backgrounds (RQ2b).

Methods

Design

A between-subjects experimental design was used. Participants were randomly assigned to one of the three conditions: neutral, authentic, or off-topic. The background of the video was the only difference between the conditions, and thus the independent variable. Participants’ prior knowledge as well as their WMC were measured as potential control or moderator variables. The eye tracking and learning outcome measures were the dependent variables.

Participants

The participants (N=54) were students from secondary education and were between 14 and 17 years old (M=15.57, SD=1.02). Due to common problems with calibration or data quality, we maintained data of 47 participants (12 male and 35 female) for the analysis. All participants spoke and understood Dutch on a native or near-native level, and had normal or otherwise corrected-to-normal vision. Participation was on a voluntary basis. There was no advantage for students who participated, nor was there a disadvantage for students who did not participate.

Materials

The topic of the instructional video was glaciers and lasted 8 min and 55 s. The instructor showed some examples of glaciers and of landscapes formed by them, she explained what a glacier was, discussed five typical aspects of glaciers, and finally talked about the uncertain future of glaciers. While explaining, she showed fifteen different slides containing either pictures, keywords or schematic images. Some of the slides had an animation, such as arrows that appeared to clarify the position of a typical formation. The slides were not visible during the entire video and there were scenes in which only the instructor was present without slides.
The instructional videos were made by recording a presentation in Zoom. The backgrounds of the videos could be added later, so that there would be three videos which were identical except for the background. The teacher in the three conditions was thus identical and other cues that could be used to derive expertise form (such as age, clothing, and gender) were kept constant. The usual work environment of a geography teacher would be a geography classroom. Therefore, we used a photograph of the geography classroom of the students’ participating in this study as the virtual background for our authentic background (Figure 1, left). For the neutral background, no background was added and a grey background was chosen (Figure 1, middle). For the off-topic background, a background of a beach club was chosen since it has nothing to do with the topic of the video and therefore will also not act as a sign of expertise (Figure 1, right).
Working memory capacity as an individual difference variable, was measured through the Letter-Number Sequencing test. This is a subtest for the measurement of working memory and attention (Crowe, 2000) adapted from the Wechsler Adult Intelligence Scale IV. The participant listened to a pre-recorded set of letters and numbers (e.g., K-4-C-2-S) and then repeated these, but not in the same sequence as they were presented. The letters and numbers had to be placed in numerical and alphabetical order (e.g., 2-4-C-K-S). There were ten levels, each level contained three sequences. The amount of numbers and letters in a sequence gradually increased. Level 1 and 2 contain two elements per sequence, Levels 3 to 5 contain three elements per sequence, Level 6 contains four items per sequence, Level 7 contains five items per sequence, Level 8 contains six items per sequence, Level 9 contains seven items per sequence and Level 10 contains eight items per sequence. Participants obtained one point for each correct sequence. If a participant missed on all sequences within a certain level, the test was stopped. The points were summed up to a total score with a minimum of 0 and a maximum of 30 points. We created three WMC groups; high, medium, and low. Participants who made it to Level 7 were placed in the medium group (N = 25), those who made it to Level 6 or lower in the low group (N = 10), and those who made it to Level 8 or higher in the high group (N = 12).
The eye tracking data was recorded by a Tobii Pro Nano eye tracker. The video was presented on a HP ProBook 650 G2 (15,6” monitor, display resolution of 1366 x 768, set 60 to 80 cm in front of the participant) via Tobii Pro Lab (version 1.162.32461 x64) software. The audio was transferred through Tecknet headphones. We used the Tobii I-VT fixation algorithm for fixation identification (Olsen, 2012). The maximum time between fixations was set at 60 milliseconds and the maximum angle between fixations at .5 degrees. The eye tracking data used in this research, was based on the viewer’s interaction with the background, the teacher, and the PowerPoint slides. Therefore, we created three areas of interest (AOIs). The background and PowerPoint AOIs were static, they both did not change in shape or position. The AOI background was activated during the entire video, the AOI PowerPoint was only active if a PowerPoint slide was visible. The teacher AOI was a dynamic AOI which means that its shape an position were adapted to the teachers’ movements. This AOI was also active during the entire video. We calculated the total time spent within each AOI. This was the sum of the total duration in milliseconds of all fixations a viewer made within the AOI.
This study measured prior knowledge as a covariate, since differences in prior knowledge might strongly affect the posttest results (e.g., Ding et al., 2021; Li, 2019; Pi et al., 2023). The prior knowledge was assessed through a paper-and-pencil test in Dutch. It consisted of nine glacier-related terms such as Lambert Fisher or firn. These same terms appeared again in the instructional video. Participants were asked to indicate whether they were familiar with the term and to briefly write down what they knew about it. There was a maximum score of 27 points that could be obtained. One point per familiar item and one or two extra points for the explanation. The rating was based on a rubric containing keywords that should be mentioned in the explanation of the item. For instance, if a participant indicated that she or he was familiar with the term Lambert Fisher, this person obtained one point. If she or he mentioned that it is the biggest glacier in the world and that it is situated in Antarctic, this person obtained two extra points. If the participant only mentioned one of those aspects, then only one extra point was attributed. The first twenty tests were rated by two independent persons. Since the ICC score of .97 showed a good reliability (Koo & Li, 2016), the remaining tests were rated by one person. The score of the rater assessing all tests was used for the analysis.
The learning outcomes were also assessed through a paper and pencil test. It consisted of 13 open-ended questions since these might be more sensitive to the differences between the instructional conditions (Mayer, 2010). There were 3 transfer questions and 10 retention questions. This research developed a grading rubric containing either the exact answers, or the keywords that needed to appear in the answers in order to obtain points. Participants could obtain a total of 22 points for the test. Fourteen points for the retention questions and 8 points for the transfer questions. An example of a retention question is: Indicate the location of the Lambert Fisher glacier and the Kutiah Lungma glacier. The participant obtained half a point for each, if the answer was correct. An example of a transfer question is: a glacier in Italy has been covered with white cloth, explain why this might have been done. To obtain the maximum of three points the answer should contain the following words, or words similar to: lack of snow, reflection, warming up. Again, the first twenty tests were rated by two independent raters. The ICC score for the total test, as well as for the retention questions, was .98. The ICC score for transfer questions was .97. Because of these high scores, the remaining tests were rated by one person. The score of the rater assessing all tests was used for the analysis.
To shed further light on the outcomes of this research, participants were asked to answer six statements concerning their learning experience. For this they could answer on a scale form 1 (not at all) to 7 (very much). The statements were based on the questions asked in the study of Merkt et al. (2020). They concern distraction (“I found the background of the video to be very distracting”), difficulty (“The topic of the video was very difficult”), comprehension (“After watching the video I had a good understanding of different glacier formations”), quality of instruction (“The explanations given in the video were very clear”), and expertise of the instructor (“The teacher was very competent”). The last question is a manipulation check (“There was a good fit between the video and the background”).

Procedure

Participants were tested in individual sessions, which took about 50 minutes. First, there was a brief welcome during which the procedure was explained. The experimenter verified if the informed consent was signed by the participant, and a parent if applicable. After this the participant answered a few demographic questions concerning age, sex, study year, and level. This was followed by the paper-and-pencil prior knowledge test. Once this was finished, the WMC was assessed. Participants put on the headphones and, after a sound check, the actual test started. The experimenter kept the score. Next, the participant received a brief explanation about the eye tracking procedure, if necessary, participants were asked to remove their eye makeup. The experimenter made sure the distance between the screen and the participant was about 60-70 cm. Then the five-point calibration was started. Once this was validated the participant was friendly reminded not to move too much and was invited to start watching the instructional video. After watching the video, the participant took the pencil-and-paper posttest to assess the learning outcomes. Finally, the participant answered a few subjective questions about the learning experience. After this there was a debriefing, however participants were only informed about the exact research topic once all the data had been collected, since they all attended the same school.

Data Analysis

Initially data was collected of 54 participants. After screening the eye tracking data, seven participants were excluded from the analyses: Five because of a gaze sample (i.e., ratio of eye movements tracked by the eye tracking device) below 90% and two due to outlier analyses.
The datasets and scripts used for the analysis can be found on OSF (https://osf.io/mgsdh/?view_only=d6d3525248d34ca786d3c9ad8bbcdf20). We used R version 4.3.0 (R Core Team 2023) and the following R packages: car v. 3.1.2 (Fox & Weisberg, 2019), emmeans v. 1.8.6 (Lenth, 2022), ggpubr v. 0.6.0 (Kassambara, 2022), here v. 1.0.1 (Müller, 2020), lsr v. 0.5.2 (Navarro, 2015), openxlsx v. 4.2.5.2 (Schauberger & Walker, 2022), pgirmess v. 2.0.2 (Giraudoux, 2022), rmarkdown v. 2.22 (Allaire et al., 2022; Xie et al, 2018; 2020;), tidyverse v. 2.0.0 (Wickham et al., 2019) and ordinal v.2022.11.16 (Christensen, 2022). In a first step, descriptive statistics (means and standard deviations) are calculated for prior knowledge, eye tracking measures and posttest measures. In a second step, normality was checked for all variables by inspecting a qqplot and performing a Shapiro-Wilk test for normality. Prior knowledge was not normally distributed and therefore a Kruskal-Wallis test is carried out to examine the relation between background and these measures. Selfreport measures were analyzed with ordinal logistic regression. The total fixation duration on the background was also not normally distributed and was logarithmically transformed for further analysis.
Regarding the first research question examining the effect of background and WMC on visual processing, three separate regression analyses were carried out for the three dependent variables: total fixation duration on background, total fixation duration on PowerPoint and total fixation duration on teacher. The independent variables were background (categorical with three factors) and WMC (categorical with three factors) and their interaction effect. For the second research question, examining the effect of background and WMC on learning outcomes, we took the same approach. Three separate regression analyses were carried out for the dependent variables posttest total, retention and transfer. The independent variables are background (categorical with three factors) and WMC (categorical with three factors) and their interaction effect. For both, research question one and two, multiple comparisons of means for levels of WMC within background were carried out.

Results

Table 1 provides the descriptive statistics of the numeric variables and Table 2 of the ordinal variables. Due to a violation of normal distribution, the prior knowledge measure was tested through a Kruskal-Wallis test. The overall score on prior knowledge was very low compared to the maximum score of 27 that could be obtained, furthermore the Kruskal-Wallis test indicated no significant difference between groups; H(2) = 2.02, p = .363, therefore prior knowledge was not included as a covariate in our analyses.
Ordinal regressions demonstrated that there was a statistically significant difference in the perceived distraction caused by the background across the three different background version groups. Pairwise comparisons demonstrated that the off-topic background group reported a significantly higher perceived distraction than the neutral group (p = .019). There was also a statistically significant difference in the perceived fittingness of the background across the three groups. Pairwise comparisons showed that the off-topic background group reported a significantly lower background fittingness than the authentic (p < .001) and neutral group (p < .001). The authentic background group also reported a significantly lower background fittingness than the neutral background group (p < .029). There were no statistically significant differences across the three groups regarding the other measures; difficulty of the topic, understanding of the topic, clear explanations and competency of the teacher.
To examine the effect of the video’s background and WMC on visual processing (RQ1), three separate regression analyses were carried out for the total fixation duration on the background, the total fixation duration on the PowerPoint and the total fixation duration on the teacher (Table 3). For the background, the adjusted R2 is negative and shows a neglectable effect (Cohen, 1988) of background and WMC on total fixation duration on the background. No main effect of background and WMC nor an interaction effect was found. Similar results are found for total fixation duration on the PowerPoint, 1.82 % of the variance in total fixation duration can be explained by background and WMC, which is a small effect (Cohen, 1988). No main effect nor interaction effect of background and WMC can be found. For total fixation duration on the teacher in the video, 12.77 % of the variance is explained by background and WMC, which is a medium effect (Cohen, 1988). An interaction effect was found between background and WMC (Figure 2). More specifically, students with a high WMC spend more attention on the teacher than students with a low WMC for the video with an off-topic background (p = .009, Table 4).
To investigate the effect of the video’s background and WMC on the learning outcomes (RQ2), a regression analysis was conducted for the total posttest score, followed by a regression analysis for the scores on retention and transfer separately. Results of these analyses are displayed in Table 5 and Table 6. Regarding the total posttest score, results demonstrate that 13.76% of the variance in posttest score can be explained by the background and WMC, which is a medium effect (Cohen, 1988). Results demonstrate that the effect of background is depending on WMC. For an off-topic background, students with a high WMC gain a better score than students with a low WMC on the posttest (p = .016, Table 6). For the other backgrounds, no significant differences were found (Figure 3).
For the retention questions, similar results were found. 15.09 % of the variance in retention can be explained by the background of the video and WMC, which is a large effect (Cohen, 1988). Again an interaction effect was found between WMC and background. For the off-topic background results show that students with a low WMC score lower than students with a high WMC on retention (p = .037, Table 6). For the transfer questions, 8.42 % of the variance in transfer can be explained by background and WMC, which is a medium effect (Cohen, 1988). Again an interaction effect was found: students with a low WMC gained a lower score on the transfer questions than students with a high WMC for the off-topic background (p = .038, Table 6).

Discussion

The primary goal of this study was to investigate the effects of a video’s background on visual processing and learning outcomes. We compared three conditions; an authentic background, which could have a positive impact on learning since it might be a sign of expertise, a neutral background, which could lead to better learning results because of the lack of potentially distracting seductive details, and an off-topic background which contained seductive details and lacked any sign of expertise, therefore being the background that would negatively impact learning the most, provided that the background had any impact.
Regarding the first research question, we examined the effect of background and WMC on visual processing. Our results demonstrated an interaction whereby in the video with an off-topic background students with a high WMC focused more on the teacher than students with a low WMC. The off-topic background can be considered as a seductive detail. Our results are in line with similar research on seductive images in webpages (Sanchez & Wiley, 2006). They found that participants with a high WMC paid less attention to seductive images. Hence, in our study, students with a high WMC are better able at focusing their attention on the teacher, providing important cues through mimics, and hereby neglecting seductive information compared to students with a low WMC.
Concerning the second research question, our results showed that for the off-topic background students with a high WMC performed better at the posttest, for both retention and transfer questions, compared to students with a low WMC. It demonstrates that students with a high WMC are less affected by the potential negative effect of a distracting background. This is in line with several studies on learning from illustrated text in which it was demonstrated that a higher WMC positively affected learning outcomes when seductive details were added (Banas & Sanchez, 2012; Rey, 2014; Sanchez & Wiley, 2006).

Limitations of This Study and Suggestions for Future Research

A first limitation of this study is that the entire PowerPoint slides were labeled as areas of interest. For future research, it would be interesting to identify specific relevant areas within the PowerPoint in order to examine whether students did look at the specific relevant parts of the PowerPoint slides and how quickly their attention was focused on relevant parts. In addition, it would be interesting to investigate how attention allocation on the background varies over time. In line with the study of van Wermeskerken et al. (2018), the video could be split into shorter segments to examine whether differences are only present at the start or persist over time.
A second limitation of this study is that the teacher in the instructional video was not making use of explicit gaze and gesture cues. The teacher did use mimics, and we know that natural, non-verbal behavior of a teacher, including mimics, is relevant to understand and learn from him or her (e.g., Sime, 2006; Zeki, 2019). It is known that faces attract attention and face perception is a highly developed skill (Haxby et al., 2020). It might even be linked to cultural backgrounds because looking at the face is linked to showing interest in the speaker (Haxby et al., 2020). Prior eye tracking research suggests that people from Western cultures tend to look at objects in the foreground, while people from Eastern cultures tend to look more at the background (e.g., Chua et al., 2005). It would be, hence, very interesting to see whether such an effect amplifies our findings of the various backgrounds on students’ visual processing and learning outcomes. For future research, we suggest to examine videos in which teachers make use of gaze guidance, gesture guidance or a combination of both (e.g., Ouwehand et al., 2015). Research already demonstrated that these cues might guide the attention away from the speaker.
Another suggestion for future research is to investigate the role of expertise of the teacher more explicitly, since, as previously mentioned, the way the viewer experiences the teacher’s expertise, has an impact on learning outcomes (Boekhout et al., 2010; Hoogerheide et al., 2016; Lachner & Nückles, 2015). Therefore, we suggest to manipulate other observable factors linked to expertise such as clothing styles, gender or age. In addition, it would be interesting to include students with prior knowledge on the topic as well, since this might also impact the way they judge the expertise of the teacher in the video.
Regarding students’ characteristics, this study measured WMC through the Letter-Number Sequencing test, which is based on auditory input, but this could also be measured through other instruments, such as a visual arrays task (Martin et al., 2021) which is based on visual input. Also, unlike Merkt et al. (2020), we did not measure other individual differences such as the participants’ interest in the topic, or their motivation, even though these can impact learning (e.g., Krapp, 1999). Another point is that we measured students’ ratings of the video after they finished the knowledge test, which might have impacted the way they rated the videos.
Concerning the learning environment, Choi et al. (2014) suggest that the physical environment in which learning takes place could have an effect on the experienced cognitive load. This research was conducted in a small office. The only two persons present were the researcher and the participant. There were no distractions. In reality learners are likely to watch instructional video’s in a potentially more distraction environment such as a classroom, or their own living rooms or bedrooms. Since this might increase the experienced cognitive load, further research would be needed to investigate whether this amplifies the effects of a video’s background and WMC.
The last limitation and suggestion concerns the choice of the backgrounds. It would be interesting to investigate whether the off-topic background as a whole was distracting, or only certain features in the image. For instance, research has shown that peoples gaze is drawn towards faces (e.g., Langton et al., 2008; Thoma & Lavie, 2013) and there was a women present in the picture used as off-topic background. Our findings might have been different if there was no human present in the off-topic background.

Conclusions

Even though further research is still needed, this study tends to confirm the distraction hypothesis especially for students with a low WMC. Educators should be aware that their choice for a background in an instructional video, and possibly also during an online course, has an effect on their students’ information processing and learning. It is important that educators are aware of the fact that a distracting background affects students with a low WMC more than those with a high WMC. Because of their visual information processing, students with high WMC are not as easily overloaded compared to persons with a low WMC, giving them a double advantage, the in itself higher capacity and the lower load that needs to be processed. A distracting background could therefore amplify existing differences between students.

Ethics and Conflict of Interest

The authors declare that the contents of the article are in agreement with the ethics described in http://biblio.unibe.ch/portale/elibrary/BOP/jemr/ethics.html and that there is no conflict of interest regarding the publication of this paper. This research was ethically approved by The Research Ethics Committee of the Open University (U202103165).

References

  1. Allaire, J. J., Y. Xie, J. McPherson, J. Luraschi, K. Ushey, A. Atkins, H. Wickham, J. Cheng, W Chang, and R. Iannone. 2022. rmarkdown: Dynamic Documents for r. Available online: https://github.com/rstudio/rmarkdown.
  2. Alemdag, E., and K. Cagiltay. 2018. A systematic review of eye tracking research on multimedia learning. Computers & Education 125: 413–428. [Google Scholar] [CrossRef]
  3. Baddeley, A. D., and R. H. Logie. 1999. Edited by A. Miyake and P. Shah. Working memory: The multiple-component model. In Models of working memory, 1st ed. Cambridge University Press: pp. 28–61. [Google Scholar] [CrossRef]
  4. Banas, S., and C. A. Sanchez. 2012. Working memory capacity and learning underlying conceptual relationships across multiple documents: Working memory and underlying relationships. Applied Cognitive Psychology 26, 4: 594–600. [Google Scholar] [CrossRef]
  5. Bétrancourt, M., and K. Benetos. 2018. Why and when does instructional video facilitate learning? A commentary to the special issue developments and trends in learning with instructional video. Computers in Human Behavior 89: 471–475. [Google Scholar] [CrossRef]
  6. Boekhout, P., T. van Gog, M. W. J. Wiel, D. Gerards-Last, and J. Geraets. 2010. Example-based learning: Effects of model expertise in relation to student expertise. British Journal of Educational Psychology 80, 4: 557–566. [Google Scholar] [CrossRef] [PubMed]
  7. Cheng, J. T., J. L. Tracy, T. Foulsham, A. Kingstone, and J. Henrich. 2013. Two ways to the top: Evidence that dominance and prestige are distinct yet viable avenues to social rank and influence. Journal of Personality and Social Psychology 104, 1: 103–125. [Google Scholar] [CrossRef] [PubMed]
  8. Christensen, R. H. B. 2022. Ordinal—Regression Models for Ordinal Data. [Google Scholar]
  9. Choi, H.-H., J. J. G. Van Merriënboer, and F. Paas. 2014. Effects of the physical environment on cognitive load and learning: Towards a new model of cognitive load. Educational Psychology Review 26, 2: 225–244. [Google Scholar] [CrossRef]
  10. Chua, H. F., J. E. Boland, and R. E. Nisbett. 2005. Cultural variation in eye movements during scene perception. PNAS 102, 35: 12629–12633. [Google Scholar] [CrossRef]
  11. Cohen, J. 1988. Statistical power analysis for the behavioral sciences. Lawrence Erlbaum Associates. [Google Scholar]
  12. Coskun, A., and K. Cagiltay. 2022. A systematic review of eye-tracking-based research on animated multimedia learning. Journal of Computer Assisted Learning 38, 2: 581–598. [Google Scholar] [CrossRef]
  13. Crowe, S. F. 2000. Does the letter number sequencing task measure anything more than digit span? Assessment 7, 2: 113–117. [Google Scholar] [CrossRef]
  14. de Koning, B. B., V. Hoogerheide, and J.-M. Boucheix. 2018. Developments and trends in learning with instructional video. Computers in Human Behavior 89: 395–398. [Google Scholar] [CrossRef]
  15. Ding, L., K. Cooper, M. Stephens, M. Chi, and S. Brownell. 2021. Learning from error episodes in dialogue-videos: The influence of prior knowledge. Australasian Journal of Educational Technology, 20–32. [Google Scholar] [CrossRef]
  16. Engle, R. W. 2002. Working memory capacity as executive attention. Current Directions in Psychological Science 11, 1: 19–23. [Google Scholar] [CrossRef]
  17. Engle, R. W. 2018. Working memory and executive attention: A revisit. Perspectives on Psychological Science 13, 2: 190–193. [Google Scholar] [CrossRef]
  18. Engle, R. W., and M. J. Kane. 2003. Edited by B. H. Ross. Executive attention, working memory capacity, and a twofactor theory of cognitive control. In The psychology of learning and motivation: Advances in research and theory. Elsevier: pp. 145–199. [Google Scholar]
  19. Fiorella, L. 2021. Edited by R. E. Mayer and L. Fiorella. Multimedia learning with instructional video. In The cambridge handbook of multimedia learning. Cambridge University Press: pp. 487–497. [Google Scholar] [CrossRef]
  20. Fiorella, L., and R. E. Mayer. 2021. Edited by R. E. Mayer and L. Fiorella. Principles for reducing extraneous processing in multimedia learning: Coherence, signaling, redundancy, spatial contiguity, and temporal contiguity principles. In The cambridge handbook of multimedia learning. Cambridge University Press: pp. 185–198. [Google Scholar] [CrossRef]
  21. Fox, J., and S. Weisberg. 2019. An R Companion to applied regression. Sage. [Google Scholar]
  22. Giraudoux, P. 2022. pgirmess: Spatial analysis and data mining for field Ecologists. Available online: https://CRAN.R-project.org/package=pgirmess.
  23. Gurney, D. J., N. Howlett, K. Pine, M. Tracey, and R. Moggridge. 2017. Dressing up posture: The interactive effects of posture and clothing on competency judgements. British Journal of Psychology 108, 2: 436–451. [Google Scholar] [CrossRef] [PubMed]
  24. Haxby, J. V., E. A. Hoffman, and M. I. Gobbini. 2000. The distributed human neural system for face perception. Trends in Cognitive Sciences 4: 223–233. [Google Scholar] [CrossRef]
  25. Hoogerheide, V., M. van Wermeskerken, S. M. M. Loyens, and T. van Gog. 2016. Learning from video modeling examples: Content kept equal, adults are more effective models than peers. Learning and Instruction 44: 22–30. [Google Scholar] [CrossRef]
  26. Jarodzka, H., K. Holmqvist, and H. Gruber. 2017. Eye tracking in educational science: Theoretical frameworks and research agendas. Journal of Eye Movement Research 10, 1: 1–18. [Google Scholar] [CrossRef]
  27. Kassambara, A. 2022. ggpubr: “ggplot2” based publication ready plots. Available online: https://CRAN.R-project.org/package=ggpubr.
  28. Koo, T. K., and M. Y. Li. 2016. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. Journal of Chiropractic Medicine 15, 2: 155–163. [Google Scholar] [CrossRef]
  29. Krapp, A. 1999. Interest, motivation and learning: An educational-psychological perspective. European Journal of Psychology of Education 14, 1: 23–40. [Google Scholar] [CrossRef]
  30. Lachner, A., and M. Nückles. 2015. Bothered by abstractness or engaged by cohesion? Experts’ explanations enhance novices’ deep-learning. Journal of Experimental Psychology: Applied 21, 1: 101–115. [Google Scholar] [CrossRef]
  31. Lai, M.-L., M.-J. Tsai, F.-Y. Yang, C.-Y. Hsu, T.-C. Liu, S. W.-Y. Lee, M.-H. Lee, G.-L. Chiou, J.-C. Liang, and C.-C. Tsai. 2013. A review of using eye-tracking technology in exploring learning from 2000 to 2012. Educational Research Review 10: 90–115. [Google Scholar] [CrossRef]
  32. Langton, S. R. H., A. S. Law, A. M. Burton, and S. R. Schweinberger. 2008. Attention capture by faces. Cognition 107, 1: 330–342. [Google Scholar] [CrossRef] [PubMed]
  33. Lenth, R. V. 2022. emmeans: Estimated marginal means, aka least-squares means. Available online: https://CRAN.R-project.org/package=emmeans.
  34. Li, L.-Y. 2019. Effect of prior knowledge on attitudes, behavior, and learning performance in video lecture viewing. International Journal of Human–Computer Interaction 35, 4–5: 415–426. [Google Scholar] [CrossRef]
  35. Martin, J. D., J. S. Tsukahara, C. Draheim, Z. Shipstead, C. A. Mashburn, E. K. Vogel, and R. W. Engle. 2021. The visual arrays task: Visual storage capacity or attention control? Journal of Experimental Psychology: General 150, 12: 2525–2551. [Google Scholar] [CrossRef]
  36. Mayer, R. E. 2010. Unique contributions of eyetracking research to the study of learning with graphics. Learning and Instruction 20, 2: 167–171. [Google Scholar] [CrossRef]
  37. Mayer, R. E. 2019. Taking a new look at seductive details. Applied Cognitive Psychology 33, 1: 139–141. [Google Scholar] [CrossRef]
  38. Mayer, R. E. 2021. Edited by R. E. Mayer and L. Fiorella. Cognitive theory of multimedia learning. In The cambridge handbook of multimedia learning. Cambridge University Press: pp. 57–72. [Google Scholar] [CrossRef]
  39. Mayer, R. E., and L. Fiorella, eds. 2021. The cambridge handbook of multimedia learning, 3rd ed. Cambridge University Press. [Google Scholar] [CrossRef]
  40. Merkt, M., S. Lux, V. Hoogerheide, T. van Gog, and S. Schwan. 2020. A change of scenery: Does the setting of an instructional video affect learning? Journal of Educational Psychology 112, 6: 1273–1283. [Google Scholar] [CrossRef]
  41. Morris, T. L., J. Gorham, S. H. Cohen, and D. Huffman. 1996. Fashion in the classroom: Effects of attire on student perceptions of instructors in college classes. Communication Education 45, 2: 135–148. [Google Scholar] [CrossRef]
  42. Müller, K. 2020. here: A simpler way to find your files. Available online: https://CRAN.R-project.org/package=here.
  43. Navarro, D. 2015. Learning statistics with r: A tutorial for psychology Students and other beginners (Version 0.6). Available online: https://learningstatisticswithr.com.
  44. Neelen, M., and P. A. Kirschner. 2020. Evidence-informed learning design: Creating training to improve performance. KoganPage. [Google Scholar]
  45. Olsen, A. 2012. The Tobii I-VT fixation filter. Tobii Pro. [Google Scholar]
  46. Ouwehand, K., T. van Gog, and F. Paas. 2015. Designing effective video-based modeling examples using gaze and gesture cues. Educational Technology and Society 18, 4: 78–88, soci.18.4.78. Available online: http://www.jstor.org/stable/jeductech-.
  47. Pi, Z., X. Zhang, X. Zhang, M. Gao, and X. Li. 2023. Students’ prior knowledge moderates the effects of group motivation compositions on learning from video lectures. British Journal of Educational Technology bjet.13316. [Google Scholar] [CrossRef]
  48. R Core Team. 2023. R: A language and environment for statistical computing. R Foundation for Statistical Computing: Available online: https://www.R-project.org/.
  49. Rey, G. D. 2014. Seductive details and attention distraction–An eye tracker experiment. Computers in Human Behavior 32: 133–144. [Google Scholar] [CrossRef]
  50. Sanchez, C. A., and J. Wiley. 2006. An examination of the seductive details effect in terms of working memory capacity. Memory & Cognition 34, 2: 344–355. [Google Scholar] [CrossRef]
  51. Sime, D. 2006. What do learners make of teachers' gestures in the language classroom? International Review of Applied Linguistics in Language Teaching 44, 2: 211–230. [Google Scholar] [CrossRef]
  52. Schauberger, P., and A. Walker. 2022. openxlsx: Read, write and edit xlsx files. Available online: https://CRAN.R-project.org/package=openxlsx.
  53. Schwarz, N. 2007. Attitude construction: Evaluation in context. Social Cognition 25, 5: 638–656. [Google Scholar] [CrossRef]
  54. Shipstead, Z., T. L. Harrison, and R. W. Engle. 2015. Working memory capacity and the scope and control of attention. Attention, Perception, & Psychophysics 77, 6: 1863–1880. [Google Scholar] [CrossRef]
  55. Sweller, J., J. J. G. Van Merriënboer, and F. Paas. 2019. Cognitive architecture and instructional design: 20 years later. Educational Psychology Review 31, 2: 261–292. [Google Scholar] [CrossRef]
  56. Thoma, V., and N. Lavie. 2013. Perceptual load effects on processing distractor faces indicate facespecific capacity limits. Visual Cognition 21, 8: 1053–1076. [Google Scholar] [CrossRef]
  57. Tsai, M., A. Wu, and Y. Chen. 2018. Static and dynamic seductive illustration effects on text-and-graphic learning processes, perceptions and outcomes: Evidence from eye-tracking. Applied Cognitive Psychology, acp.3514. [Google Scholar] [CrossRef]
  58. Wickham, H., M. Averick, J. Bryan, W. Chang, L. D’Agostino McGowan, R. François, and G. Grolemund. 2019. Welcome to the tidyverse. Journal of Open Source Software 4, 43. [Google Scholar] [CrossRef]
  59. Wiley, J., C. A. Sanchez, and A. J. Jaeger. 2014. Edited by R. E. Mayer. The individual differences in working memory capacity principle in multimedia learning. In The cambridge handbook of multimedia learning. Cambridge University Press: pp. 598–620. [Google Scholar] [CrossRef]
  60. Wittenbrink, B., C. M. Judd, and B. Park. 2001. Spontaneous prejudice in context: Variability in automatically activated attitudes. Journal of Personality and Social Psychology 81, 5: 815–827. [Google Scholar] [CrossRef]
  61. Xie, Y., J. J. Allaire, and G. Grolemund. 2018. R markdown: The definitive guide. Available online: https://bookdown.org/yihui/rmarkdown.
  62. Xie, Y., C. Dervieux, and E. Riederer. 2020. R markdown cookbook. Available online: https://bookdown.org/yihui/rmarkdown-cookbook.
  63. Zeki, C. P. 2009. The importance of non-verbal communication in classroom management. Social and Behavioral Sciences 1, 1: 1443–1449. [Google Scholar] [CrossRef]
Figure 1. Screenshots of the instructional videos with different virtual backgrounds.
Figure 1. Screenshots of the instructional videos with different virtual backgrounds.
Jemr 16 00016 g001
Figure 2. The interaction effect of background and WMC on total fixation duration on the teacher.
Figure 2. The interaction effect of background and WMC on total fixation duration on the teacher.
Jemr 16 00016 g002
Figure 3. The interaction effect of background and WMC on the total posttest score.
Figure 3. The interaction effect of background and WMC on the total posttest score.
Jemr 16 00016 g003
Table 1. Means (M), Standard Deviations (SD), Minimum and Maximum of the Numeric Variables per Condition.
Table 1. Means (M), Standard Deviations (SD), Minimum and Maximum of the Numeric Variables per Condition.
Jemr 16 00016 i001
Table 2. Medians of the Ordinal Variables per Condition.
Table 2. Medians of the Ordinal Variables per Condition.
Jemr 16 00016 i002
Table 3. Parameter Estimates of the Regression Analysis for Total Fixation Duration on Background, PPT and Teacher.
Table 3. Parameter Estimates of the Regression Analysis for Total Fixation Duration on Background, PPT and Teacher.
Jemr 16 00016 i003
Table 4. Parameter Estimates of the Multiple Comparisons of Means for Levels of WMC within Background for Attention Allocation (Post-Hoc Tukey Contrast).
Table 4. Parameter Estimates of the Multiple Comparisons of Means for Levels of WMC within Background for Attention Allocation (Post-Hoc Tukey Contrast).
Jemr 16 00016 i004
Table 5. Parameter Estimates of the Regression Analysis for Posttest total, Posttest Retention and Posttest Transfer.
Table 5. Parameter Estimates of the Regression Analysis for Posttest total, Posttest Retention and Posttest Transfer.
Jemr 16 00016 i005
Table 6. Parameter Estimates of the Multiple Comparisons of Means for Levels of WMC within Background (Post-Hoc Tukey Contrast).
Table 6. Parameter Estimates of the Multiple Comparisons of Means for Levels of WMC within Background (Post-Hoc Tukey Contrast).
Jemr 16 00016 i006

Share and Cite

MDPI and ACS Style

Catrysse, L.; Kerckhoffs, A.; Jarodzka, H. Behind the Scenes: Impact of Virtual Backgrounds in Educational Videos on Visual Processing and Learning Outcomes. J. Eye Mov. Res. 2023, 16, 1-16. https://doi.org/10.16910/jemr.16.3.4

AMA Style

Catrysse L, Kerckhoffs A, Jarodzka H. Behind the Scenes: Impact of Virtual Backgrounds in Educational Videos on Visual Processing and Learning Outcomes. Journal of Eye Movement Research. 2023; 16(3):1-16. https://doi.org/10.16910/jemr.16.3.4

Chicago/Turabian Style

Catrysse, Leen, Andrienne Kerckhoffs, and Halszka Jarodzka. 2023. "Behind the Scenes: Impact of Virtual Backgrounds in Educational Videos on Visual Processing and Learning Outcomes" Journal of Eye Movement Research 16, no. 3: 1-16. https://doi.org/10.16910/jemr.16.3.4

APA Style

Catrysse, L., Kerckhoffs, A., & Jarodzka, H. (2023). Behind the Scenes: Impact of Virtual Backgrounds in Educational Videos on Visual Processing and Learning Outcomes. Journal of Eye Movement Research, 16(3), 1-16. https://doi.org/10.16910/jemr.16.3.4

Article Metrics

Back to TopTop