1. Introduction
Learning Chinese characters poses significant challenges for non-native beginners, especially those from alphabetic language backgrounds (
Hu, 2010;
Shen, 2004), because Chinese characters use a logographic system where each character conveys meaning through visual elements rather than phonetics (
Shen, 2013). Thus, strong visual-spatial skills are vital for learning Chinese characters, as the ability to distinguish subtle visual differences is essential due to the large number of similar-looking radicals and characters in Chinese orthography (e.g.,
Chen et al., 1996;
McBride-Chang et al., 2005, etc.). In this study, we aim to explore to what extent initial L2 Character learning can be enhanced by manipulating perceptual salience and constructed salience (e.g.,
Gass et al., 2018) to direct learners’ attention to critical visual features of characters, to reduce cognitive load or enhance memory retention. Here, we focus on three visual factors that can be manipulated in learning: color coding, stroke structure, and presentation style.
In the field of second language acquisition (SLA), salience is a critical factor in learning, referring to a property that makes a linguistic feature more perceptually prominent. As
Gass et al. (
2018) delineate, salience can be either an inherent characteristic of the input (perceptual salience) or intentionally induced through instruction (constructed salience). Perceptual salience arises from the innate features of the language itself, such as stress or syllabicity, which naturally attract learners’ attention. In contrast, constructed salience occurs when external agents, such as instructors or materials, enhance the prominence of a feature through deliberate means. This is frequently achieved in instructed learning settings via interactional feedback or techniques. The influence of both perceptual and constructed salience extends across various domains of SLA.
A substantial body of research has examined their role in vocabulary acquisition (e.g.,
Ko, 2012;
Elgort & Warren, 2014). Here, in the framework of salience in SLA, we investigate the role of both language-specific properties (perceptual salience) and learning materials (constructed salience) in learning Chinese characters. We leverage the vast number of Chinese characters that share visual similarity (i.e., Chinese orthographic neighbors), as they could be hard to differentiate among themselves and hard to learn (e.g.,
Ellis, 2016). Regarding the external agents to assist learning, we leverage color coding and presentation style, which are typically adopted in modifying learning materials and instructions (e.g.,
Tortorelli et al., 2021;
Vlach et al., 2022). In the following sections, we first introduce the basic properties of Chinese characters, followed by a thorough review of the three visual factors that can contribute to the ‘salience’ in learning.
1.1. Basic Properties of Chinese Characters
The orthographic structure of Chinese characters can be analyzed hierarchically to its components: radicals and strokes. Radicals often serve as semantic or phonetic indicators. For example, the semantic radical 氵 (three dots/strokes to indicate
water) is found in characters related to water, like 河 (river) and 海 (sea). Likewise, the phonetic radical 青 (pronounced as qing 1) in 清 (qing1), 情 (qing2), 请 (qing3), etc., produces a similar pronunciation. They are essential for organizing and understanding Chinese writing, as they help group characters with similar meanings or pronunciations. Strokes are the basic components of Chinese radicals and characters. A stroke in Chinese writing is created with a single, continuous motion, often involving changes in direction (
Anderson et al., 2013).
Shu et al. (
2003) analyzed 2570 characters taught in Chinese elementary schools and identified eight types of strokes that vary in length and spatial orientations. Strokes can take various forms—vertical, horizontal, or diagonal; straight or slightly curved; and some may include a “hook” (
Taylor & Taylor, 1995). For instance, the simple character 木 (wood) contains four types of strokes: horizontal (一), vertical (丨), left-slanting (丿), and right-slanting (㇏). Additionally, shorter stroke segments, such as dots, can vary in direction, as seen in the character 兴 (prosperous), as compared to the character 河 (river).
Characters are categorized into two types based on their structures: integral and compound characters. Integral characters can be formed as simply as one stroke (一) or as complex as multiple strokes (束). One dimension of the visual complexity of Chinese characters is the relative positions of different strokes. For example, the same two strokes can create different characters by varying the stroke positions. For instance, both 人 (person) and 入 (enter) are composed of two strokes: 丿 and ㇏, but their differing arrangements result in distinct characters.
Similarly, in compound characters, the position of sub-lexical orthographic units (radicals) can change to form entirely different characters (
J. Zhang, 1992, pp. 31–32). For example, 杏 (apricot) and 呆 (dumbness) both include木 (wood) and 口 (mouth) arranged in a top-down pattern. When the positions of these radicals are reversed, a completely different character is formed. In another example, 晾 (air/sundry) and 景 (scenery) both contain the radical 日 (sun) and京 (capital). However, the structural arrangement differs: 晾 follows a left-right pattern, while 景 adopts a top-down structure.
The phenomenon where Chinese characters share the same strokes but differ in stroke structure is intriguing, especially when considering how L2 beginners of alphabetic language backgrounds learn and identify these visual complexities to eventually map to the meanings. Previous studies on visual complexity typically focus on stroke counts, assuming that more strokes equate to greater visual complexity (e.g.,
Shen, 2013;
Yu et al., 2018) when considering L2 learning of Chinese characters. Yet, learning characters not only involves learning the individual strokes but also identifying characters that look similar but differ in stroke structures. The latter is probably harder to learn, in particular, at the early stage of learning, given the subtlety. To our knowledge, no study has yet explored whether characters of the same stroke counts but presenting different stroke structures pose additional difficulties for Chinese L2 learners. Thus, the aim of the study is to examine how visual similarity (i.e., stroke structure) influences the early stage of L2 Chinese character learning. We focused on integral characters because, as compared to compound characters, they contain fewer strokes and simpler stroke patterns, normally defined as “spatially contiguous patterns of strokes” (
Anderson et al., 2013, p. 45).
1.2. Literature Review of the Three Visual Factors
The concept of salience in character learning manifests in two forms. Perceptual salience denotes the inherent visual cues that signal the complex similarities among Chinese characters (
Section 1.2.1). Constructed salience, on the other hand, is developed through instructional aids such as color coding (
Section 1.2.2) or presentation styles (
Section 1.2.3).
1.2.1. Visual Similarity in Chinese Characters
A basic examination of the Chinese writing system quickly highlights its stronger emphasis on visual-spatial processing compared to alphabetic orthographies that bear certain phonological relationships with words. That is, the alphabet usually indicates how words sound to a large or small extent, depending on the orthographic depth of the writing system. This is usually referred to as the Grapheme-Phoneme-Correspondence (GPC) rule. Thus, unlike alphabetic systems, which rely on a segmental and “atomistic” structure (
Ho & Bryant, 1999), Chinese lacks this rule. Although many Chinese characters (e.g., phono-semantic compounds) contain phonetic radicals that offer clues to pronunciation, the relationship between orthography and phonology in Chinese is generally less consistent and transparent than in alphabetic writing systems. According to
Jin (
2006), learning Chinese characters is a more visual than auditory process. The best way to remember them is by focusing on their overall shape and radicals, not by memorizing stroke order or pronunciation. This is due to the unique nature of Chinese characters. Unlike the linear sequence of letters in alphabetic languages, each character is a dense, square-shaped image filled with intricate details—similar to a complex icon or a piece of fine print. This “high spatial frequency” means that learning Chinese is like building a visual library, requiring strong spatial memory to distinguish between characters with many small, detailed strokes (e.g.,
M. Wang et al., 2003). The development of sensitivity to the internal structure of Chinese characters is a gradual process.
Gao and Meng (
2000) found that beginning learners were significantly less accurate than advanced learners at distinguishing between visually similar characters (e.g., 会 vs. 公, 休 vs. 体) in a reading context. Beginners often struggle to notice subtle differences between similarly shaped characters and are not yet familiar with perceiving the details of stroke-based writing.
The literature contains various measures of visual complexity for Chinese characters, which aim to accurately reflect their complexity. For example, these measures include stroke frequency (
Majaj et al., 2002;
J. Y. Zhang et al., 2007) and the “ink” measure (
Pelli et al., 2006), defined as the ratio of black pixels to total pixels from a binary image of the character. Among these, visual complexity is typically measured by the number of strokes (
Liversedge et al., 2014;
Yu et al., 2018). However, some Chinese characters, such as 犬 (dog) and 太 (too), share the same three strokes. Their stroke patterns and spatial arrangements vary significantly. This complexity extends beyond stroke count to include differences in radicals, spatial arrangements, and even subtle variations in individual strokes. In a very similar way, characters like 大 (big) and 犬 (dog), where the latter is formed by adding a single stroke to the former, are normally considered as visually similar characters. These conditions highlight the intricate nature of Chinese orthography and, thus, raise an intriguing question: when comparing characters like 大 (big) vs. 犬 (dog), and 犬 (dog) vs. 太 (too), where the former pair differs by the addition of a stroke to the first character and the latter pair consists of two characters with the identical number of strokes but different patterns, which pair can be considered more visually complex or more difficult to learn?
Research on Chinese visual word recognition, particularly through the stroke number effect, remains limited.
Jiang et al. (
2020) conducted a study comparing native Chinese speakers (NS) and Chinese as a second language (CSL) learners using a lexical decision task involving disyllabic compound words (each containing two characters) and an equal number of nonwords. The results revealed that while both groups were influenced by word frequency and familiarity, only the CSL group demonstrated a stroke number effect. In a subsequent study,
Jiang and Feng (
2022) used single Chinese characters as stimuli and found that CSL learners again exhibited a stroke number effect in a lexical decision task. They argued that adult L2 Chinese learners process Chinese characters analytically, much like beginning readers, focusing on individual strokes or radicals rather than recognizing the overall character structure (
Yeh et al., 2003).
If the number of strokes plays a key role in L2 character learning, we hypothesize that visually similar character pairs with an additional stroke will be more challenging to learn. On the other hand, character pairs that differ by an additional stroke are easier to differentiate due to the visual detectability of the added mark. However, perceiving the subtler structural patterns that distinguish characters with the same number of strokes is a unique challenge in Chinese character identification. Difficulties arise when a feature exists in the L2 but is absent in learners’ L1 (e.g.,
Franceschina, 2005;
Hawkins & Liszka, 2003). Therefore, learners may struggle to differentiate character pairs with identical strokes. However, there is currently no clear evidence indicating whether integral characters with the same number of strokes are equally difficult or harder to process compared to those with an additional stroke.
1.2.2. Color Coding in Character Learning
Color coding has been shown to facilitate word learning by enhancing morphological awareness and supporting the decoding of linguistic structures (e.g.,
Tortorelli et al., 2021;
Songsangkaew et al., 2023). For example,
Tortorelli et al. (
2021) reviewed instructional approaches to teaching code-related skills in English reading and highlighted the effectiveness of strategies such as separating words into phonemes, syllables, and morphemes, as well as teaching affixes explicitly. These methods of integrating colors into linguistic structures are instrumental in helping learners navigate the complexities of an alphabetic language. Supporting this,
Songsangkaew et al. (
2023) demonstrated that color-coding derivational and inflectional morphemes significantly improved morphological awareness, lexical inferencing ability, and reading comprehension among non-English major undergraduates. By visually distinguishing morphemes, color coding may provide learners with a clearer understanding of character structures, aiding the acquisition of both vocabulary and reading skills.
If visual support, like color coding, can benefit learning alphabetical languages, as evidenced in various studies, this strategy might also be helpful in learning Chinese characters, which rely more on visual processing given their complex stroke structures, though relevant research is limited and far from being conclusive. To start, some researchers consider lower-level structures of characters as a unit to learn, i.e., radicals, which facilitate L2 character learning by providing integrated structural features, unlike the complexity and variability of individual strokes (e.g.,
Huang & Chen, 1988). Radicals help to organize stroke-level information (
Taft & Chung, 1999) and draw learners’ attention to the internal structure of characters to some extent (
Cao et al., 2013). For example,
Taft and Chung (
1999) conducted a study in which participants learned 24 Chinese characters composed of two radicals paired with meaning (e.g., 咀-CHEW) under four different radical presentation conditions to see whether understanding the internal structure of Chinese characters aids novice learners in memorization. Participants were exposed to the character-meaning pairs three times and then presented with each character alone to recall its associated meaning. In a between-subject design, four groups received different types of radical instructions: Radicals Before (radicals pre-learned before exposure), Radicals Early (radicals informed during the initial exposure), Radicals Late (radicals identified during the third exposure), and No Radical (no instruction on radicals). The Radicals Early group performed best, suggesting that introducing radicals at the first exposure enhances learning.
If radical learning serves as the foundation of character learning, as indicated by previous studies (
Taft & Chung, 1999), strategies like color-coding to mark the boundaries of radicals within a character might be facilitatory in L2 character learning as well. However, existing empirical evidence does not seem to support this rationale. Among the limited studies investigating whether marking radicals with different colors—a common teaching technique—supports L2 orthographic and semantic learning,
Hou and Jiang (
2022) explored its impact on Chinese L2 learners of alphabetic language backgrounds. In a Latin square design, participants were required to learn characters presented across four distinct conditions: (a) radical markings with stroke animations, (b) no radical markings with stroke animations, (c) radical markings without stroke animations, and (d) neither radical markings nor stroke animations. They were shown 40 high-frequency fillers and 120 low-frequency target characters, with each character displayed for 1000 milliseconds. In the study, compound characters were divided into two radicals, either left and right or top and bottom, with the radicals marked in red and blue, respectively. For instance, in the character “苛,” the radical “艹” on top was marked in red, while the radical “可” on the bottom was marked in blue. After learning, participants completed character recognition and meaning-matching tests. The results showed that marking radicals increased participants’ reaction times and reduced their accuracy in character recognition tests. Similarly, stroke-order animations also negatively affected recognition accuracy. These findings suggest that providing radical and stroke information may interfere, rather than facilitate character learning, as excessive visual information can increase cognitive load for L2 learners. Although strokes and radicals are key sub-character units in native Chinese speakers’ character processing (e.g.,
Peng & Wang, 1997;
Li et al., 2005), their utility for L2 learners in developing Chinese orthographic representations and orthography-semantics connections remains controversial. It is empirical to test whether color-coding a single critical stroke in a character would change the learning outcome.
In the present study, instead of applying color to the entire character (radical), we focused on highlighting one or two specific strokes that distinguish paired characters from one another.
Su and Samuels (
2010) suggest that “beginning Chinese readers process characters in an analytic way”. To emphasize the critical differences between paired characters, we applied color coding to the distinguishing strokes. For example, in character pairs with the same number of strokes, such as 犬 and 太, the differentiating dot was highlighted in color to provide critical input. Similarly, for characters with a different number of strokes, such as 日 and 旦, the distinguishing stroke 一 was color-coded.
1.2.3. Presentation Style
Given the pivotal role of generalization in cognition, there has been substantial research exploring the factors that facilitate it. Among them, the timing of instance presentation is particularly important. Research findings reveal a paradox: both simultaneous presentation (presenting instances at the same time), which allows for direct comparison (e.g.,
Gentner et al., 2009;
Oakes & Ribar, 2005), and spaced presentation (presenting instances apart in time), which separates instances over time (e.g.,
Cepeda et al., 2006), have been shown to enhance generalization.
According to structure mapping theory (
Gentner, 1983), presenting exemplars at the same time draws attention to shared features, encouraging mental comparison and the identification of commonalities within a category. Researchers suggest that simultaneous presentations help learners focus on the similarities between exemplars while reducing the short-term memory load required to compare them. As all exemplars are visible during this process, learners do not need to rely on memory to recall previously seen examples, which reduces cognitive demands. In addition, identifying these shared features helps learners generalize to new exemplars within the category. In short, simultaneous presentations support generalization by directing visual attention to key similarities and easing memory requirements.
On the other hand, other researchers suggest that both simultaneous and spaced learning facilitate generalization. For example,
Vlach et al. (
2012) investigated the effects of simultaneous, massed, and spaced presentations on 2-year-old children’s performance in a novel noun generalization task. In this task, four new objects were introduced and labeled (e.g., “fep”) using simultaneous, massed, or spaced presentation methods in the learning phase. In the simultaneous condition, all exemplars were presented at once, allowing children to visually compare them. In the massed condition, exemplars were presented one at a time in immediate succession, with less than 1 s between presentations. In a spaced condition, 30 s elapsed between each presentation. The massed and spaced presentation belonged to a sequential presentation. In the test phase, four objects were shown, and children were asked to identify the target object (e.g., “Can you hand me the fep?”). For children in the immediate testing condition, the test was conducted right after the learning phase. For those in the delayed testing condition, the test took place 15 min later. Results showed that children in the simultaneous condition outperformed those in the other two conditions on immediate tests, while children in the spaced condition performed best after a delay (15 min). Overall, research suggests that visually comparing multiple exemplars presented simultaneously promotes better skills of abstraction, retention, and generalization compared to sequential (massed) presentations.
In their ongoing research program,
Vlach et al. (
2022) investigated how preschool-aged children learn science category exemplars under three schedules: simultaneous, massed, and spaced presentations. While no differences were observed between conditions during the immediate test, their first experiment showed that children in the spaced condition demonstrated the strongest generalization performance at a delayed post-test. However, further experiments revealed that spaced learning led to reduced visual attention and increased forgetting during the learning process.
Previous studies have investigated the effects of simultaneous, massed, and spaced learning on students’ performance in science education. The presentation style of literacy development has not been extensively studied. While spaced presentation has been found to reduce visual attention and increase forgetting (
Vlach et al., 2022), in the present study, we aim to adopt simultaneous and massed presentation methods to explore their impact on character learning. This study aims to address this gap by investigating how visual stimuli, specifically presenting characters in sequence or in pairs, impact learners’ ability to acquire and retain Chinese characters.
In summary, the acquisition of Chinese characters presents a unique challenge to perceptual salience due to their intricate visual complexity. To overcome this, constructed salience can be introduced through instructional interventions such as color coding and manipulated presentation styles. This study aims to address the following questions:
Do beginning Chinese learners find characters with identical strokes easier to learn than those differing by one stroke, or the other way around?
Does the use of color coding to highlight the critical strokes that differentiate visually similar characters facilitate L2 character learning?
Does simultaneous presentation (2 characters presented together) enhance L2 character learning more effectively than massed presentation (1 character presented individually)?
To address the above questions, we designed a study of four different experimental conditions, namely, “One Character, Color-Coded”, “One Character, No Color”, “Two Characters, Color-Coded”, “Two Characters, No Color”, such that we can test how L2 learners learn Chinese characters in different contexts. L2 learners were expected to learn both the stroke structures and the meanings of characters. To investigate this, we employed a 2 × 2 × 2 experimental design of three dependent variables: (1) Critical strokes cued with vs. without color, (2) Presentation with 1 vs. 2 characters, and (3) Characters with identical vs. different stroke numbers. Color coding and Presentation Style were between-subjects factors, while Stroke Number was a within-subjects factor, as all learners were exposed to characters with both the same and different stroke numbers. Participants received the training first, followed by an immediate test to see how well they could recognize the meaning of the characters. The next day, a delayed test was conducted to measure how well they retain the memory of the characters.
4. Discussion
This study aims to determine whether visual complexity of characters with identical stroke number or different stroke number poses the same level of challenge to L2 Chinese learners and to understand an optimal way to present input (i.e., Chinese characters) by empirically testing two strategies: color coding and presentation style. Specifically, we aim to understand whether and how these strategies can be applied to visually similar Chinese characters during the early stage of character learning. We employed an online training and testing approach to examine how L2 Chinese beginning learners retain visually similar characters under different training conditions: with or without color coding and presented either simultaneously or in a massed format. The current study also compared the retention of visually similar character pairs of identical strokes to those of different strokes. Results from both Immediate tests and Delayed tests revealed a consistent pattern: character pairs of different strokes were better retained than character pairs of identical strokes, as evidenced by both accuracy and RT analyses. The presentation style showed no main effect either in the Immediate tests or the Delayed tests analysis. However, accuracy analysis showed that in the massed presentation conditions, learners did better in different-stroke number characters than identical-stroke characters, while this pattern disappeared in the simultaneous presentation conditions, such that learners performed similarly across both types of characters. This suggests that simultaneous presentation was more effective for learning identical-stroke characters than different-stroke ones. Consequently, it reduced the difference in learning outcomes between the two character types. This is because simultaneous presentation encourages learners to compare the stroke structures of identical-stroke characters, which is consistent with the structure mapping theory (
Gentner, 1983;
Namy & Gentner, 2002). Finally, color coding showed no main effects in the accuracy analysis in either immediate tests or delayed tests. However, the RT analysis showed that the use of color cues slowed down word recognition in both immediate tests and delayed tests.
To address the first research question, we examined which type of visually similar word pairs—identical-stroke or different-stroke—is easier to learn. By logic, the addition of a stroke increases the overall stroke count, which, according to the conventional measure of visual complexity, should make these characters more difficult to learn and retain. However, the results contradicted this expectation, revealing that characters with identical stroke numbers were harder to remember. According to
Ellis (
2016), salience is a stimulus property that causes it to be easily noticed and learned. Characters with additional strokes are often more visually salient than those with identical strokes, which might explain this phenomenon.
The current study further explores how well beginners can identify the stroke counts and internal structures of characters, through exposure and practice, such that they can visualize and familiarize themselves with basic character radicals. As
Reder et al. (
2016) have noted, novice learners can accurately decode characters with minimal training, especially in controlled laboratory settings. This ability may enhance their capacity to differentiate characters with different stroke counts, as these characters share most of their stroke structure except for one stroke. In contrast, for characters with identical strokes, the stroke structures are different even though they share the same number of strokes. Our results indicated that word pairs with identical strokes, but different stroke orientations, were harder to learn than those with additional-stroke differences.
This difficulty might stem from the more subtle differences in identical-stroke characters, which require higher visual processing skills. For learners whose L1 lacks such visual patterns (displayed in identical stroke characters), the requisite features for processing them are unavailable. Furthermore, ingrained L1 processing strategies can inhibit the ability to notice correct orthographic cues in L2. This aligns with
Tomlin and Villa’s (
1994) cognitive theory of detection, which posits that detection—a function of alertness and orientation—is a critical first step for linguistic features to become available for processing and acquisition.
As
M. Wang et al. (
2003) have highlighted, the complex, square-shaped structure and high spatial frequency of Chinese characters place significant demands on spatial memory. This may impose an additional processing burden when learning and retaining identical-stroke characters compared to different-stroke characters.
McBride-Chang et al. (
2005) found a positive correlation between visual-spatial abilities and the acquisition of Chinese characters. Thus, visual working memory is crucial for learning Chinese characters, as it allows for the temporary storage of approximately three to four visual objects (e.g.,
Luck, 2008;
Luck & Vogel, 2013). This implies that the average capacity for remembering items or features is quite limited. In our study, the characters selected averaged 3–4 strokes, which falls within the capacity of visual working memory. According to
Zimmer and Fischer (
2020), even native Chinese speakers rely on visual working memory rather than semantic information when memorizing trained characters and pseudo-characters whose one radical was altered from real characters.
Zimmer and Fischer (
2020) found in Experiment 4 that highly conceptually similar distractors did not increase memory errors, while visually similar distractors weakened memory performance. It can be inferred that beginners would primarily use visual-spatial memory to memorize Chinese characters. The current study is the first to show that characters of the same number of strokes, but different spatial orientations, are more challenging to learn compared to visually similar characters of different stroke numbers.
Second, presentation style showed no main effects. However, the interaction effect indicated that simultaneous presentation facilitated learning identical stroke characters more than different stroke characters. The highest accuracy rate was observed in the massed presentation of characters with different strokes, whereas the lowest accuracy rate was in the simultaneous presentation with identical strokes. To elaborate, when characters were presented one by one, learning different-stroke pairs resulted in higher accuracy compared to identical-stroke pairs. Conversely, in the simultaneous presentation condition, no significant difference was found between identical-stroke and different-stroke pairs. This suggests that the simultaneous presentation format helped reduce the difficulty associated with learning characters of identical strokes and, thus, reduced the difference between identical-stroke characters and different-stroke characters. As previously noted, simultaneous presentation allows learners to directly compare the stroke patterns of two characters, enhancing learning efficiency. In contrast, massed presentation may increase cognitive load by requiring learners to process two elements presented in random order and integrate them later, which can strain working memory (
Chandler & Sweller, 1992,
1996;
Kalyuga et al., 2000).
Shen (
2005, p. 56) identified learners’ common strategies such as “paying attention to graphic structures” and “visualizing the graphic structure of the character”. This study provides strong evidence that simultaneous presentation significantly improves beginners’ ability to learn the identical-stroke characters by comparing the graphic structures. Next, in the massed presentation condition, the RT analysis revealed that colored characters led to slower responses compared to the no-color condition. In contrast, during simultaneous presentation, there were no significant differences in RT between the colored and no-color conditions, irrespective of whether the tests were immediate or delayed. The analysis also suggested that while color exerted a slight influence on response times, simultaneous presentation seemed to mitigate the effects of color on response times. Additionally, the results of our study were consistent with a large number of studies that have shown that viewing multiple instances simultaneously enhances generalization (e.g.,
Gentner et al., 2009;
Oakes & Ribar, 2005;
Vlach et al., 2012). For example,
Vlach et al. (
2012) found that 2-year-old children who were exposed to stimuli simultaneously outperformed their peers who were in spaced or massed conditions on a novel noun generalization task when tested immediately afterward. The researchers suggested that the simultaneous presentation condition likely reduced forgetting and memory load, allowing learners to engage more readily in comparative mental processes during learning. Since all instances were visible throughout the learning phase, children in the simultaneous condition did not need to mentally retrieve previous instances they had observed. In summary, simultaneous presentation might help beginners retain the memory of visually similar characters, but it is inconclusive to draw this conclusion because the main effect was not significant.
Finally, we applied a color cue to highlight the stroke, distinguishing the paired-characters, and found that the color cue increased recognition reaction times in both the immediate tests and delayed tests. The design of the color cues aimed to guide learners’ visual focus towards the differential stroke structure within characters. Nonetheless, the use of color might inadvertently introduce a negative effect by acting as an additional distraction. The intricate visual information could potentially compel participants to engage in an extra process of matching radicals with colors, resulting in a divided attention effect. As a result, this could adversely affect learners’ character acquisition process. These findings align with
Hou and Jiang (
2022), who reported that radical marking using color or animation increased RT and reduced recognition accuracy in character recognition tasks. They suggested that providing radical and stroke information might interfere, rather than facilitate character learning. Our study supports this view, indicating that excessive visual information introduced during the learning process may increase cognitive load for L2 learners, thereby reducing learning effectiveness (
Baddeley, 1992). When learners process multiple elements of information simultaneously, they must divide their attention across all elements. This simultaneous presentation of information can result in the split attention effect (
Chandler & Sweller, 1991,
1992;
Owens & Sweller, 2008), further straining working memory capacity. Thus, the color cue did not facilitate learning characters in our study, which may have important pedagogical implications for L2 instruction.
Constructed salience, a concept derived from input enhancement (IE;
Sharwood Smith, 1991), is, therefore, a key pedagogical tool for directing learners’ attention to specific linguistic features, such as the structural components of Chinese characters. In this study, it took the form of specific presentation styles and color coding. While the simultaneous presentation of characters proved effective for retention, color coding did not function as intended. The results align with
Sharwood Smith’s (
1991) central caution that externally induced salience “may not necessarily be registered by the learner and even when it is registered, it may not affect the learning mechanisms per se” (p. 118). This outcome demonstrates that while IE can draw a learner’s attention, its subsequent processing into the language system depends on the type of linguistic evidence provided (
Sharwood Smith, 1991, pp. 122–125). In this study, simultaneous presentation was an effective approach for teaching visually similar words to beginning learners, as it successfully facilitates the intake of orthographic information; in contrast, color coding was ineffective, possibly because highlighting a single stroke in red failed to provide adequate linguistic evidence.
Based on our results, we recommend the simultaneous presentation of visually similar characters to raise learner awareness. This approach should be integrated into classroom instruction (e.g., on the same slide) and textbook design (e.g., within the same exercise or page). Conversely, color marking, whether applied to a single stroke (this study) or a radical (
Hou & Jiang, 2022), appears ineffective for character learning and is not recommended.