5.1. Type Effect of V-Diao
The mean scores of the four types of
V-diao of the child groups and adult group were calculated and compared, respectively, using a one-way ANOVA, as shown in
Table 5. The scores from the WISE and WISC tasks were summed
8 and then averaged
9. The performances of the children showed a significant difference (
p < 0.001), while no significant effect was found in the adult group (
p > 0.05).
In addition, a two-way ANOVA with type effect (
V-diao1,
V-diao2,
V-diao3,
V-diao4) and age (G1, G2, G3, the adult group) as factors was conducted, which yielded a significant effect of type (
F(3, 304) = 27.17,
p < 0.001), a significant effect of age (
F(3, 304) = 74.21,
p < 0.001), and a significant effect of the interaction between these two factors (
F(9, 304) = 4.57,
p < 0.001), as shown in
Table 6.
Furthermore, the overall tendency of the participants’ performances on the meanings of
V-diao is presented in
Table 7.
The analysis revealed significant differences in the acquisition of the four types of V-diao among the child participants. As hypothesized, the core meaning of V-diao1 ‘to fall’ was the easiest for the children to grasp, while more abstract meanings involving metaphorical extensions posed greater challenges. The results confirmed a hierarchical difficulty in the types of V-diao, supporting the following proposed grammaticalization order: V-diao1 > V-diao2 > V-diao3 > V-diao4.
As shown in
Table 7, there were significant differences in the meanings of
V-diao in terms of the performances of G1 (
p < 0.001), G2 (
p < 0.001), and G3 (
p < 0.001), but not in those of the adults (
p > 0.05). This suggests that some types of
V-diao were easier for children to acquire, while some were more challenging. For adults, the difficulty of the four types of
V-diao was almost the same. Moreover, all the groups performed best on
V-diao1 (G1: mean = 0.88; G2: mean: 0.88; G3: mean = 0.91; Adult: mean = 1.00), while
V-diao4 was the most challenging (G1: mean = 0.54; G2: mean: 0.65; G3: mean = 0.74; Adult: mean = 0.99), which is consistent with the hierarchical difficulty proposed in
Section 2. G1 performed significantly better on
V-diao3 than on
V-diao4 (
p < 0.05), indicating that
V-diao4 was much more challenging for the seven-year-olds (G1). However, the nine-year-olds (G2), the eleven-year-olds (G3), and the adults displayed a similar tendency in interpreting
V-diao3 and
V-diao4, as no significant differences between them were found, which generally matched the hierarchy assumed at first.
Regarding between-group differences, the adults performed significantly better on V-diao1 than G1, G2, and G3 (p < 0.001, p < 0.001, p < 0.001, respectively). However, there were no significant differences among the three child groups. Children at the ages of 7, 9, and 11 performed similarly well on V-diao1. G3 performed significantly better on V-diao2 than G1 (p < 0.001), but not better than G2 (p > 0.05). However, the adults outperformed G1, G2, and G3 (p < 0.001, p < 0.001, p < 0.001, respectively). A similar tendency was observed for the results of V-diao3 and V-diao4. The adults outperformed G1, G2, and G3 on V-diao3 (p < 0.001, p < 0.001, p < 0.001, respectively) and on V-diao4 (p < 0.001, p < 0.001, p < 0.001, respectively). Additionally, G3 performed significantly better than G1 on V-diao3 (p < 0.05) and V-diao4 (p < 0.001).
Generally speaking, significant differences were found in the children’s performances (G1, G2, and G3) on the different meanings of
V-diao. According to
Liu (
2007),
V-diao1 contains the core meaning “to fall,” while
V-diao2 “to eliminate something” undergoes the process of metonymization, and
V-diao3 “to complete an event” and
V-diao4 “to change a state” undergo both metonymization and metaphorization. In this case, the hierarchical difficulty is as follows:
V-diao1 >
V-diao2 >
V-diao3 >
V-diao4.
Previous empirical studies, such as
Hsu and Chen (
2016), have shown that words with a literal core meaning or those that are frequently used are the easiest for children to acquire. For instance, the literal meaning of
lao, with its core definition, was found to be easier for child participants to acquire compared to other types with non-literal meanings. Furthermore, concreteness plays a crucial role in understanding and developing word meanings (
Schwanenflugel, 2013). Words with concrete meanings are easier to acquire than those with abstract meanings, as abstract concepts are more challenging to process. In the present research, the semantic meaning of
V-diao1 is concrete and contains the core meaning, which shows that it is the easiest meaning to acquire among the four. This was also supported by the results of the present study, which found that the scores of
V-diao1 for G1 and G2 were significantly higher than those of
V-diao2 (G1: < 0.001; G2: < 0.001; G3: > 0.05; Adult: > 0.05). Thus,
V-diao1 was the easiest type for G1 and G2. In addition, no significant differences in
V-diao1 were found among all the child groups, indicating that G1, G2, and G3 all performed similarly well on this type.
However,
V-diao2 ‘to eliminate something’ is grammaticalized and undergoes metonymization, which involves semantic shifts and is not conventionally related to the lexical item (
Paradis, 2004). Hence, it is supposed to be slightly more challenging to acquire when compared with
V-diao1. As mentioned earlier, the scores on
V-diao2 for G1 and G2 were significantly lower than those on
V-diao1. Hence,
V-diao2 was indeed more challenging than
V-diao1. Furthermore, since no significant differences were found between G1 and G2 and between G2 and G3, this shows that these three age groups had the ability to interpret
V-diao2.
Next, due to the processes of metonymization and metaphorization, the semantic meanings of V-diao3 and V-diao4 gradually become abstract and more grammaticalized than those of V-diao1 and V-diao2.
According to
Heine et al. (
1991), metaphors can be classified into different categories, as shown in (9), which displays a scale for the degree of metaphorical abstraction.
(9) PERSON > OBJECT > ACTIVITY > SPACE > TIME > QUALITY |
V-diao3 is used to describe the completion of an event in a time domain (ACTIVITY -> TIME), while V-diao4 indicates the change in a state (ACTIVITY -> QUALITY). Based on the proximity of TIME and QUALITY, V-diao3 and V-diao4 presumably pose the same degree of difficulty for the children to acquire, since the results showed no significant differences between G1 and G2 and between G2 and G3. However, it was found that only G1, but not G2, G3, and the adult group, performed significantly differently on V-diao3 and V-diao4 among all the groups, indicating that V-diao3 was obviously easier for the children at 7 to acquire than V-diao4.
This phenomenon may be explained by looking at the test items used in the present study. All the test items related to
V-diao3 were paired with a quantifier, namely
quan bu and
dou, which mean ‘all’. The way that children interpreted the interaction between the quantifier and negation was different from that of adults, and this situation was rule-governed (
Musolino, 1998). In addition, young children tend to map quantified noun phrases and their semantic meanings isomorphically (
Musolino, 1998). For example, one of the test items in the WISC task stated that Doraemon had washed “all” the clothes and asked whether there were still some dirty clothes left, as the picture showed. Since there were still some dirty clothes left on the ground in the picture, it contradicted children’s isomorphic resolution of the quantified noun phrases, which indicated there were supposed to be no dirty clothes left due to the quantifier “all”. Hence, they tended to answer “no” and correctly answered the question, leading to a higher accuracy rate with
V-diao3 than with
V-diao4. However, this situation diminished with age, as found in
Musolino (
1998), who also showed that young children’s interpretation gradually moved on to the adult system.
Overall,
V-diao1 and
V-diao2 were found to be easier to acquire than
V-diao3 and
V-diao4, which posed greater challenges due to their higher levels of abstraction and grammaticalization. Furthermore,
V-diao3 and
V-diao4 were equally challenging for the participants, as both required more advanced cognitive processing. The revised hierarchical difficulty of
V-diao is visually represented in
Figure 1, illustrating the progressive complexity of these constructions.
The findings of this study highlight a hierarchical pattern in children’s acquisition of V-diao constructions, closely tied to the semantic complexity and degree of grammaticalization of each type. As shown in the results, V-diao1 and V-diao2—which are more concrete and semantically transparent—are more readily acquired by younger children, as their meanings are closely associated with observable physical experiences. In contrast, V-diao3 and V-diao4, which involve greater metaphorical abstraction and more advanced grammaticalization, pose greater cognitive challenges. These constructions require not only more abstract reasoning but also a heightened sensitivity to contextual and linguistic cues.
The influence of contextual information is especially evident in the interpretation of
V-diao3, where quantifiers and surrounding cues play a crucial role in meaning resolution. This underscores the importance of linguistic and environmental scaffolds for younger children who are still developing the cognitive capacity for abstract and non-literal language processing (
Musolino, 1998;
Paradis, 2004). These observations align with prior research emphasizing the role of concreteness, developmental readiness, and contextual support in language acquisition (
Gentner, 1982;
Nippold & Rudzinski, 1993). In line with this,
Gibbs and Colston (
2012) argue that multimodal and experience-based exposure significantly enhances children’s ability to comprehend figurative and abstract meanings—an effect particularly relevant for constructions like
V-diao3 and
V-diao4.
This study advances
Tsao’s (
2017) structural classification of
V-diao constructions by providing empirical evidence on how children interpret and acquire them. While Tsao distinguishes the four types based on their degrees of grammaticalization, our research adds a developmental perspective, examining how semantic features—such as directionality, endpoint, and evaluative meaning—interact with children’s cognitive maturity and sensitivity to context. By linking these features to actual performance patterns, we move beyond a descriptive framework to offer insights into how children process different degrees of abstraction in language.
In conclusion, the acquisition of V-diao follows a developmental trajectory that corresponds to the semantic and cognitive complexity of each type. More concrete and physically grounded forms, like V-diao1, are acquired earlier, while more abstract and context-dependent types, such as V-diao4, emerge later. This pattern not only supports Tsao’s grammaticalization hierarchy but also underscores the importance of cognitive development and contextual support in figurative language acquisition.
5.2. Literal and Non-Literal Meanings of V-Diao
The second research question addressed in the present study is whether L1 Chinese children can distinguish between the literal and non-literal meanings of V-diao. In this study, literal meanings are assumed to be easier for children to acquire than non-literal meanings.
Table 8 presents the mean scores across different age groups, showing that the ability to interpret non-literal meanings becomes more refined with age but does not reach adult-level proficiency until later in adolescence.
Table 9 shows that a two-way ANOVA with metaphoricality (literal and non-literal) and age (G1, G2, G3, the adult group) as factors was conducted to examine the effects. The analysis revealed a significant effect of meaning type (
F(1, 312) = 36.40,
p < 0.001), a significant effect of age (
F(3, 312) = 35.5,
p < 0.001), and a significant interaction between the two factors (
F(3, 312) = 16.48,
p < 0.001).
Table 10 presents the acquisition tendencies of literal and non-literal meanings, as well as the mean scores and
p-values for G1, G2, and G3, enabling the examination of the acquisition tendencies of all the groups.
According to
Table 10, the mean scores indicate that all groups (G1, G2, G3, and Adults) found it easier to understand the literal meanings of
V-diao (G1: 0.80; G2: 0.85; G3: 0.88; Adults: 0.98) than its non-literal meanings (G1: 0.61; G2: 0.62; G3: 0.77; Adults: 0.97). A one-way ANOVA analysis revealed significant differences in the performance of G1 (
p < 0.05), G2 (
p < 0.001), and G3 (
p < 0.05), showing that the acquisition of non-literal meanings of
V-diao was significantly more challenging for the children than that of literal meanings. By contrast, the performance of the adults on the literal and non-literal meanings was not significantly different (
p > 0.05), suggesting that non-literal meanings were not as difficult for them to understand.
Regarding the differences between the age groups, the following comparisons of their acquisition of literal and non-literal meanings of V-diao are presented. On literal meanings, G3 did not perform significantly better than G2 (p > 0.05) or G1 (p > 0.05), and neither did G2 perform significantly better than G1 (p > 0.05). However, the adult group outperformed all the child groups significantly (p < 0.001 for all). On non-literal meanings, G3 performed significantly better than G2 (p < 0.05) and G1 (p < 0.001), and G2 did not perform significantly better than G1 (p > 0.05). Again, the adult group outperformed all the child groups significantly (p < 0.001 for all). With increasing age, it appears that children’s abilities to acquire and comprehend both literal and non-literal meanings improve. However, the mean scores for non-literal meanings for G1 and G2 were similar and not significantly different, indicating that they both had a good understanding of non-literal meanings. In conclusion, G1, G2, and G3 were at the same stage in acquiring literal meanings of V-diao. As for non-literal meanings, G1 and G2 were at the same stage, which was different from G3, indicating that older children have an easier time interpreting non-literal meanings.
The difficulty children experienced in interpreting the non-literal meanings of
V-diao constructions can be better understood through the lens of metaphor processing, particularly as outlined in the Generalized Elaboration Model (GEM) proposed by
Levorato and Cacciari (
2002). The GEM offers a cognitive-developmental account of how metaphor comprehension evolves across five progressive phases.
In Phase 1 (birth to age 7), children primarily interpret language literally due to limited inferential abilities, semantic flexibility, and contextual integration skills. As a result, metaphors—especially those that are novel or culturally embedded—are often misunderstood or ignored. In Phase 2 (ages 8–9), children begin to utilize contextual cues to infer figurative meanings. However, their comprehension remains fragile and highly dependent on the transparency of the metaphor. Metaphors that involve cross-domain mappings or abstract associations, such as many
V-diao constructions, continue to pose challenges. By Phase 3 (ages 10–12), children exhibit a significant shift in metaphor processing. They begin to treat metaphors not merely as decorative language but as structured conceptual tools that map one experiential domain onto another. This reflects their growing capacity to integrate linguistic form with conceptual structure, especially when metaphors are supported by rich context. In adulthood (Phase 5), metaphor processing becomes fluent, creative, and flexible. Adults can generate, comprehend, and reinterpret metaphors across varying degrees of conventionality, relying on well-established metaphorical schemas and entrenched cross-domain mappings. This is consistent with Conceptual Metaphor Theory (
Lakoff & Johnson, 1980), which posits that metaphors are foundational to human thought.
Based on the GEM, participants in G1 and G2 should align with Phases 1 and 2, respectively. However, our findings showed no significant difference in their comprehension of V-diao metaphors, suggesting both groups may be operating within a similar developmental phase—contrary to GEM’s predicted progression. In contrast, G3 performed significantly better than G1 and G2, aligning more closely with Phase 3, yet still falling short of adult proficiency. This partially deviates from prior research indicating that children around age 9 typically outperform younger peers in figurative tasks and that 11-year-olds begin to approach adult-like competence.
Overall, our findings broadly support GEM’s developmental trajectory: younger children default to literal interpretations, older children begin to interpret metaphors with contextual support, and adults exhibit mature metaphorical competence. These developmental differences highlight that metaphor comprehension is not merely linguistic but deeply cognitive, involving the gradual integration of pragmatic reasoning, analogical mapping, and semantic abstraction—skills that emerge and refine over the course of development.
The difficulty in interpreting
V-diao in this study may be attributed to the inherent complexity of the expression.
Hsieh (
2008) investigated various
si-related constructions and found that the V+
si construction (resultative verb compound, or RVC) was particularly challenging for young children due to its “cause and state change” feature. This complexity requires advanced cognitive processing, which may explain younger children’s struggles. In the current study, only a few V+
si expressions were included as test items, reducing the overall task difficulty. Consequently, the children in Hsieh’s study, who were exposed to a broader and more complex range of
si-related expressions, may have developed metaphorical abilities earlier than those in the present study.
The findings of this study indicate that all three child groups (G1, G2, and G3) encountered greater difficulty in interpreting the non-literal meanings of V-diao constructions compared to the adult group. Adults, in contrast, showed no significant difference between their comprehension of literal and non-literal meanings, demonstrating an advanced ability to flexibly interpret both forms. This developmental progression aligns with the principles of usage-based models of language acquisition, which emphasize that linguistic knowledge emerges from a speaker’s experience with language in use rather than from an innate universal grammar.
From a usage-based perspective (
Tomasello, 2003), language learning is input-driven and probabilistic. Learners gradually build up linguistic competence by attending to the frequency and salience of their forms in communicative interactions. In this study, the difficulty experienced by children—especially with non-literal
V-diao constructions—suggests that they had not yet encountered these forms frequently enough in varied and meaningful contexts to have abstracted their figurative meanings. These constructions, being semantically opaque or metaphorical, are less likely to be encountered with high frequency in child-directed speech, thus limiting opportunities for entrenchment and schematic generalization. Furthermore, the adult group’s uniform performance across literal and non-literal meanings supports the idea that repeated exposure and rich contextual experience result in greater entrenchment and more robust mental representations of both literal and figurative constructions. Adults likely had more varied input over time and had internalized not just item-specific patterns but also higher-level constructional schemas—an outcome predicted by usage-based models.
The differential performance across the age groups also reinforces the importance of contextual learning. Children appeared to rely more heavily on literal interpretations, which are often more concrete and directly tied to perceptual experience. As they accumulate linguistic input, they begin to abstract over specific instances and infer more complex figurative meanings, as predicted by item-based learning (
Tomasello, 2000).
Overall, this study provides evidence for the gradual abstraction and schematization of constructions through experience. This supports the usage-based claim that grammar is not a separate module but a product of general cognitive processes acting on language input over time. The trajectory observed in the comprehension of V-diao forms—from literal to figurative—exemplifies how usage frequency, contextual richness, and age-related cognitive development interact in language learning.
Recent studies shed light on the importance of task complexity and multimodal learning.
Olivero (
2024) noted that task complexity influences how children’s developmental phases are assessed, suggesting that simpler tasks may mask their true abilities. In this context, multimodal learning approaches—such as pairing verbal explanations with gestures or visual aids—could help bridge the gap between Phases 2 and 3.
Healey et al. (
2018) found that combining verbal and non-verbal cues enhances one’s understanding of figurative language, especially for children transitioning to Phase 3.
Additionally, cultural and linguistic exposure also plays a crucial role. As
Dobrovol’skij and Piirainen (
2021) observed, children exposed to diverse idiomatic and metaphorical constructions in their linguistic environment tend to progress through GEM phases more quickly. This suggests that the children in the present study may benefit from increased exposure to figurative constructions like
V-diao in diverse and enriched linguistic contexts.
These findings support the phase-based development of figurative competence described by GEM but highlight several factors that influence progression, including the complexity of expressions like V-diao, the design of tasks, and the role of contextual and multimodal learning. Future studies could build on this by incorporating more complex tasks and multimodal teaching methods to better understand and enhance children’s abilities to comprehend non-literal language.
5.3. Contextual Effect
Table 11 presents the general performance of all the children and adults on the WISE task (without context) and the WISC task (with context). A one-way ANOVA found a significant difference among all child groups (
p < 0.001), but no significant difference was found for the adult group (
p > 0.05).
Table 12 shows the results of a two-way ANOVA with contextual effect (WISE and WISC) and age (G1, G2, G3, the adult group) as factors. The analysis revealed a significant effect of context (
F(1, 312) = 18.90,
p < 0.001), a significant effect of age (
F(3, 312) = 33.15,
p < 0.001), and a significant interaction between the two factors (
F(3, 312) = 4.14,
p < 0.05).
Table 13 compares the performances on the WISE and WISC tasks, showing that context only benefited older children and adults, who likely possess the necessary cognitive maturity to integrate multiple cues.
As shown in
Table 13, all child groups performed better on the WISE task than on the WISC task in terms of their mean scores (G1: WISE 0.75 > WISC 0.68; G2: WISE 0.84 > WISC 0.69; G3: WISE 0.92 > WISC 0.73), while the adults performed similarly on the two tasks (Adults: WISE 0.99 = WISC 0.99). In other words, most participants performed better when no context was provided to them. With age, the performance on both tasks improved for all participants.
Regarding within-group performance, a one-way ANOVA showed no significant differences for G1 (p > 0.05) and the adult group (p > 0.05). However, G2 and G3 performed significantly better on the WISE task than on the WISC task (G2: p < 0.001; G3: p < 0.001). This finding suggests that the tasks without or with context presented a similar degree of difficulty for G1 and the adult group in comprehending V-diao, but not for G2 and G3.
In terms of the between-group differences on the WISE task, the adult group outperformed G1 (p < 0.001), G2 (p < 0.001), and G3 (p < 0.001). G3 performed significantly better than G1 (p < 0.001) and G2 (p < 0.05), and there was also a significant difference between G1 and G2 (p < 0.001). However, on the WISC task, only the adults outperformed G1 (p < 0.001), G2 (p < 0.001), and G3 (p < 0.001). No significant differences were found between G3 and G2 (p > 0.05), between G3 and G1 (p > 0.05), or between G2 and G1 (p > 0.05). Overall, when no context was provided, the participants performed significantly better with increasing age. However, when context was provided, it did not significantly improve the children’s comprehension of V-diao.
Previous studies (
Gibbs & Gerrig, 1989;
Hsieh, 2008;
Hsu & Chen, 2016;
Ortony, 1993) have consistently argued that context enhances one’s comprehension of non-literal meanings. These studies suggest that children find it easier to grasp metaphorical meanings when presented within a contextual framework. For example,
Hsieh (
2008) demonstrated that contextual cues could aid children in interpreting idiomatic expressions by providing inferential scaffolding. Similarly,
Ortony (
1993) emphasized that rich, relatable contexts could serve as cognitive bridges, enabling younger participants to decode abstract meanings.
However, other research (
Y.-C. E. Cheng & Chen, 2023;
Schatz & Baldwin, 1986;
Shinjo, 1986) challenges the idea that context always facilitates comprehension. Consistent with these findings, the present study revealed that G1, G2, and G3 generally performed better on tasks when no context was provided.
Shinjo (
1986), for example, explored the role of semantic priming in metaphor comprehension and found that while priming may activate related concepts, its effects on literal and metaphorical meanings were comparable. Furthermore, if the priming context was unrelated to the metaphor, its influence was negligible. Similarly,
Schatz and Baldwin (
1986) conducted a series of experiments examining the impact of context on inferring word meanings, concluding that the benefits of context were overstated. Their findings suggested that participants could not derive significant advantages from contextual information, particularly when tasked with inferring novel meanings.
Y.-C. E. Cheng and Chen (
2023) extended these findings by examining Mandarin frequency adverbs in both context-free and context-provided tasks. Their results highlighted that older children and adults did not benefit from contextual support, likely due to the increased complexity of processing such information. Notably, they adopted a methodological approach similar to the present study, presenting test items as sentences rather than isolated phrases, which provided a more robust comparison.
The Global Elaboration Hypothesis (
Levorato et al., 2004) posits that children employ specific strategies to interpret idiomatic expressions and noun phrases in context, such as hypothesizing meanings at the sentence level and filtering irrelevant interpretations. According to this hypothesis, presenting metaphorical expressions in sentences with context should provide additional cues to facilitate comprehension. However,
Y.-C. E. Cheng and Chen (
2023) found that their participants struggled with the context-provided task due to its complexity, which required them to process multiple sentences and story plots within a limited time. This finding aligns with the Limited Cognitive Capacity Hypothesis (
Skehan & Foster, 2001), which asserts that excessive cognitive demands can hinder performance. Therefore, Cheng’s participants performed better on context-free tasks, a result mirrored in the present study.
Contrary to earlier research emphasizing the facilitative role of context in language comprehension, our findings revealed that contextual enrichment did not consistently support children’s understanding of non-literal meanings in
V-diao constructions. When analyzed through the lens of Construction Grammar (
Goldberg, 1995), these results highlight the developmental limitations in children’s ability to process and integrate form–meaning pairings within dynamic discourse contexts.
In Construction Grammar, constructions are learned and represented as holistic, experience-based pairings of form and meaning, ranging from fully lexical items to abstract syntactic templates. The V-diao construction, particularly in its non-literal usage, functions as a partially schematic construction—that is, one where the meaning cannot be fully predicted by the individual lexical items or syntactic form alone. Understanding such constructions requires recognizing a constructional schema that maps specific morphosyntactic patterns onto idiomatic or metaphorical meanings. This mapping is cognitively demanding and typically acquired through frequent, context-rich exposure over time.
In our study, younger children (G1, G2, and G3) performed better on tasks without contextual support, suggesting that the addition of story-based and visual context imposed a cognitive load that exceeded their processing capacity. From a constructional perspective, this indicates that the children may have not yet entrenched the V-diao construction in its metaphorical sense. When multiple semiotic inputs—text, image, and construction—competed for attention, the children struggled to establish the intended form–meaning link. Their reliance on literal interpretations, especially when primed by visual cues, further implies that non-literal constructions are still “marked” and cognitively effortful at this stage of development.
This integration difficulty was particularly evident when context was delivered through illustrated narratives. Children frequently fixated on the literal truth conditions presented in the image, rather than evaluating whether the visual narrative supported or contradicted the constructional meaning encoded in the sentence. In Construction Grammar terms, this suggests a failure to align multimodal cues with the abstracted constructional meaning. The pictures may have reinforced default (literal) readings of the construction rather than promoting the recognition of its idiomatic or metaphorical sense. Adults, by contrast, performed equally well with or without contextual support, reflecting their highly entrenched and flexible representation of both literal and non-literal V-diao constructions. For them, context served more as a confirmatory aid than a source of interpretation. Their ability to draw on a network of related constructions allowed them to bypass the surface form and directly access the intended figurative meaning, a hallmark of constructional fluency. Interestingly, while G3 outperformed G1 and G2, their continued struggle with integrating contextual and linguistic cues indicates that they are in transition toward developing abstract, context-sensitive constructional knowledge. They may have begun to recognize the V-diao construction as a multi-functional form but lack the schematic abstraction and processing efficiency seen in adults.
Ultimately, these findings suggest that contextual information does not uniformly facilitate comprehension; rather, its usefulness depends on the learner’s ability to map it onto the relevant constructional schema. Until children internalize these mappings through repeated exposure and pattern abstraction, additional context may function less as a scaffold and more as a distraction—especially in the case of non-transparent or metaphorical constructions.
Glenberg and Langston (
1992) observed that while texts and images are typically processed separately in the mind, carefully designed visuals can enhance comprehension when paired with text. However, this effect was not observed in the child participants of the current study. Building on this,
L. Cheng et al. (
2024) examined the development of metaphor comprehension in Chinese children aged 5–8. They identified three developmental stages: the “perception period” (age 5), when children began recognizing metaphorical language; the “development period” (ages 6–7), marked by a gradual improvement in comprehension; and the “rational decision period” (age 8), characterized by faster and more accurate understanding. This research emphasizes the important roles of age and salience in shaping metaphor comprehension within a Chinese cultural context. Similarly,
Inhoff et al. (
1984) investigated how context affects cognitive load during metaphor comprehension. They found that metaphors were processed more efficiently when preceded by metaphorical contexts, while literal contexts worked best for literal targets. These findings reinforce the importance of simplicity in designing effective learning environments for young children. However, while context is often viewed as a helpful aid for understanding metaphors and idioms, this study and others (e.g.,
Y.-C. E. Cheng & Chen, 2023;
Schatz & Baldwin, 1986;
Shinjo, 1986) suggest that its effects are not always positive. For younger learners, overly complex contexts may hinder rather than enhance comprehension.
In conclusion, the relationship between context and comprehension depends heavily on the cognitive development of the learner. Future research should aim to identify the ideal level of contextual support for different age groups to optimize comprehension, especially in metaphorical and idiomatic language learning.