1. Introduction
A core claim of Embodied Cognition Theory is that knowledge is grounded in sensorimotor mechanisms (
Norman & Peleg, 2022). Accordingly, language comprehension involves unconsciously simulating the sensory–motor experiences described by the language (
Barsalou, 1999). Rather than being a purely abstract, amodal process, language processing is believed to recruit the same neural systems used for perception, action and emotion.
Strong behavioral evidence for this simulation account comes from the Action–Sentence Compatibility Effect (ACE) (
Glenberg & Kaschak, 2002), which has become a paradigmatic method for investigating sensorimotor simulation in language comprehension (e.g.,
Winter et al., 2022;
Zwaan, 2021). The ACE is found when the direction of action described in the sentence read by the participants is consistent with the direction of the arm movement, the arm movement response is promoted and the reaction time is shortened; otherwise, responses are slower due to motor interference (
Cai et al., 2025). In a typical paradigm, participants judge the semantic plausibility of sentences describing transfer actions. To make the task engaging and concrete, the sentences often depict scenarios involving the sharing or withholding of objects (e.g., “You offer the last piece of cake to your friend” vs. “You keep the last piece of cake for yourself”) or the transfer of personal items (e.g., “The student handed the essay to the teacher”). Participants indicate their judgment by making a physical response toward or away from their body. The robust finding, known as ACE, is that responses are significantly faster when the response direction matches the action direction described in the sentence. For instance, responding “away” is faster for the sentence about offering the cake, as if the reader is mentally simulating the act of giving. This interaction between sentence meaning and motor response provides compelling evidence that language comprehension involves mentally simulating the described events.
This paradigm raises a crucial question about the linguistic variables that influence the strength of mental simulation. The grammatical aspect has emerged as a pivotal modulator. Distinct from tense, which locates an event in time (e.g., past, present), the grammatical aspect characterizes the internal temporal contour of an event (
Comrie, 1976;
Dowty, 1977). The progressive aspect (e.g., is walking, is reading a book) portrays an event from an internal viewpoint, highlighting its ongoing, dynamic progression without regard to the endpoint. The perfective aspect (e.g., has walked, has read a book), instead, offers an external viewpoint, presenting the event as a complete, bounded unit with a focus on its culmination (
Becker et al., 2013;
Madden & Zwaan, 2003).
By shaping how we conceptualize the temporal flow of an event, aspect is thought to directly guide mental simulation. The progressive, by focusing on the action-in-progress, should potentiate a richer sensorimotor simulation of the action itself, whereas the perfective, by focusing on the end-state, should attenuate such online simulation (
Matlock et al., 2007;
Anderson et al., 2010;
Ferretti et al., 2007). Compelling support for this view comes not only from reaction-time paradigms like the ACE but also from studies on co-speech gestures. For instance, the durative meaning of the progressive aspect is mapped onto metaphoric container gestures, and gestures accompanying progressive utterances are typically longer lasting and more complex than those with non-progressive forms (
Matsumoto & Dobs, 2017;
Parrill et al., 2013). In fact, research on native English speakers has revealed a robust ACE for progressives but a weakened or absent ACE for perfectives (
Bergen & Wheeler, 2010), suggesting that grammatical aspect guides the construal of events during simulation.
While the modulation of ACE by grammatical aspect is well-established in native (L1) processing, it is unclear how this is manifested in second-language (L2) learners. The theoretical link between grammatical aspect and the ACE lies in how aspect modulates the content and timing of mental simulation. The ACE is predicated on the real-time engagement of the motor system during language comprehension. Grammatical aspect directly governs this engagement by shifting the listener’s mental perspective on an action. The progressive aspect, by directing attention to the ongoing action, fosters a dynamic and kinematically detailed motor simulation that is highly susceptible to interference or facilitation from congruent physical movements. This results in a strong ACE. In contrast, the perfective aspect, by directing attention to the completion of an event, promotes a simulation of the end-state or resultant goal, rather than the specific motor processes to achieve it. This decouples the comprehension process from the fine-grained motor system, thereby attenuating or eliminating the ACE.
Given the substantial cross-linguistic differences between English and Mandarin Chinese, this gap is particularly important. Chinese primarily use lexical particles to indicate grammatical aspect, notably “在”[zài] for the progressive and “了”[le] for the perfective. These particles have distinct grammaticalization pathways. Critically, it is hypothesized that the progressive marker “在”[zài] grammaticalized from a spatial meaning (i.e., “to be at”;
Ruan & Wang, 2015;
W. Wang, 2013). For Chinese learners of English, this basic divergence in the conceptual underpinnings of aspect—especially for the progressive—may pose a major learning challenge, making L2 proficiency an important consideration.
Notably, it has been demonstrated that embodied semantic representation is significantly influenced by L2 proficiency (
Ni, 2023;
Ahlberg et al., 2018;
Birba et al., 2020). However, the evidence is inconclusive regarding the modulation of L2 proficiency on the embodied representations in L2 processing. Several studies have proposed that L2 proficiency modulates L2 embodiment effects (e.g.,
Q. Wang, 2016;
Birba et al., 2020;
Gu et al., 2021;
Tang et al., 2023). For example,
Birba et al. (
2020) investigated how embodied semantic systems interacted with L2 proficiency for Spanish–English bilinguals. Participants read action and neutral texts naturally in both languages with their neural activities being tracked by EEG. Results showed higher L2 proficiency correlated with enhanced EEG functional motor-related connectivity during L2 action text comprehension. Similarly, L2 proficiency constrains the ability to represent vertical spatial metaphors related to power words such as “up” and “down” in a second language (
Q. Wang, 2016). Nevertheless, according to studies by
Chen et al. (
2020),
Kogan et al. (
2020),
X. Zhang et al. (
2020), L2 proficiency appears to have little effect on the embodied representations in L2 processing. In particular,
X. Zhang et al. (
2020) examined embodied semantics in L2 processing in an fMRI study. Researchers found that Chinese–English bilinguals, regardless of proficiency level, primarily engaged the semantic systems during the judging of the semantic relatedness of English words, unlike native speakers whose sensorimotor systems were connected with semantic system.
Given these conflicting findings, a critical question arises concerning the predictive power of a major theoretical framework: does the widely accepted neural convergence hypothesis (
Green, 2003) hold true for the domain of embodied cognition? This hypothesis states that L2 processing becomes more native-like with increasing proficiency. It follows that high-proficiency learners should develop a different representation from low-proficiency ones, potentially including embodied simulations, due to prolonged exposure and practice. Critical evidence for this progression comes from neurocognitive studies using Event-Related Potentials (ERPs). Specifically, research has leveraged the P600 component as an index of processing efficiency. P600 component is a robust electrophysiological marker of syntactic processing difficulty, including structural reanalysis, integration costs, and the violation of syntactic rules (e.g.,
Osterhout & Holcomb, 1994). Findings of an attenuated P600 in advanced learners in processing English tense-aspect morphology are thus interpreted as direct evidence of more automatic processing of English tense-aspect morphology (
Geng & Yang, 2016;
H. Zhang et al., 2017). However, while this compelling evidence for neural convergence is confined to the syntactic domain, it remains unknown whether this proficiency-driven convergence holds true for the embodied simulation of grammatical aspect concepts—that is, for online sensorimotor modulation.
The current work uses the ACE paradigm to fill this gap and elucidate the proficiency-dependent cognitive mechanisms underlying aspectual simulation. However, a clear research gap remains. First, at the behavioral level, it is unknown whether and how L2 learners demonstrate the ACE modulated by grammatical aspect. Second, at a theoretical level, it is unclear whether the Neural Convergence Hypothesis extends from syntactic processing to the domain of embodied semantic simulation. Finally, given the fundamental differences in how aspect is marked in English and Mandarin, it is critical to test whether this theoretical framework holds for this specific language pair. The present study, therefore, has a clear twofold purpose: (1) to empirically investigate how L2 proficiency modulates the embodied simulation of grammatical aspect in Chinese learners of English, using the ACE paradigm; and (2) to test the boundaries of the Neural Convergence Hypothesis in the domain of embodied cognition. Theoretically, this research addresses a critical gap by determining whether proficiency-driven convergence extends from syntactic processing to embodied simulation. Practically, elucidating how learners internalize grammatical concepts like aspect can inform language pedagogy, highlighting the need for instruction that fosters not only structural knowledge but also the corresponding conceptual and embodied representations.
Our predictions are grounded in two key theoretical perspectives. First, the Neural Convergence Hypothesis (
Green, 2003) leads us to anticipate that higher-proficiency learners will exhibit embodied simulation patterns that more closely resemble those of native speakers. Second, drawing on theories of L1 conceptual transfer (
Jarvis & Paavlenko, 2008) and the critical role of developing strong form–meaning mappings (
DeKeyser, 2007), we predict that lower-proficiency learners will show substantial influence from their L1 system. This perspective is strongly supported by research on event categorization.
Vanek (
2019) demonstrated that even advanced Chinese learners of L2 English retained a persistent L1-tuned, result-oriented bias when categorizing events. Critically, this L1 bias could be rapidly overridden when learners explicitly engaged their L2 through verbalization, which strengthened attentional biases toward the action-oriented categories of English. This evidence highlights that the influence of L1 conceptual patterns and their modulation by L2 engagement are central to understanding L2 semantic processing, directly informing our predictions regarding grammatical aspect.
Our predictions lead to the following specific hypotheses: (1) A significant interaction will exist between grammatical aspect (progressive vs. perfective) and L2 proficiency on the magnitude of the ACE. (2) A specific cross-over interaction is anticipated: advanced learners would show a selective simulation mechanism, with embodied simulation being more robust for progressive than perfective aspect due to strengthened L2 form-concept mappings, as demonstrated by an ACE for progressive but not perfective sentences. This pattern is predicted by the Neural Convergence Hypothesis, as it would mirror the native-like, aspect-sensitive simulation observed in L1 speakers (
Bergen & Wheeler, 2010). In contrast, lower-proficiency learners would demonstrate a different pattern, exhibiting a decreased or reversed modulation of simulation by grammatical aspect, potentially showing no ACE or a reversed effect, due to weaker form–meaning associations and interference from L1 conceptual transfer. This is consistent with models of second language acquisition which posit that at early stages, learners rely more heavily on L1 conceptual representations (
Jarvis & Paavlenko, 2008), as robustly demonstrated in the domain of event categorization by
Vanek (
2019), and have less automatic access to L2-specific grammatical distinctions (e.g.,
Jiang, 2000). By examining this interplay, this study will offer critical evidence on how proficiency shapes the embodied mechanisms of grammatical aspect processing in L2, highlighting the development of form–meaning connections in bilingual cognition.
In summary, this study is designed to uncover the proficiency-dependent cognitive mechanisms through which L2 learners embody grammatical meaning. To test these hypotheses, the following experiment employed the ACE paradigm with Chinese learners of English at varying proficiency levels, as detailed in the
Section 2.
3. Discussion
This study investigated the embodied simulation of grammatical aspect in L2 learners by examining two key questions: (1) whether grammatical aspect modulates embodied simulation during L2 sentence comprehension, and if any, in what manner; and (2) whether L2 proficiency modulates the way grammatical aspect influences embodied simulation. Our results provide clear answers: grammatical aspect does indeed modulate embodied simulation (as measured by the Action–Sentence Compatibility Effect), but this modulation is profoundly and qualitatively shaped by language proficiency, as evidenced by a significant three-way interaction. This proficiency-dependent pattern reveals unique simulation processes at different learning stages and calls into question oversimplifies conceptions of L2 embodiment.
3.1. The Modulation of Embodied Simulation by Grammatical Aspect in Advanced Learners
For advanced learners, the key finding was that grammatical aspect powerfully influenced embodied simulation, as measured by the presence or absence of the Action–Sentence Compatibility Effect (ACE). Embodied simulation was robustly engaged for the progressive aspect, as shown by significantly faster responses in consistent trials compared to inconsistent trials (p = 0.022). The perfective sentences, on the other hand, did not cause any such simulation of the directional action: there was no significant difference in response times between consistent and inconsistent conditions (p = 0.61).
This aspect-specific pattern aligns with simulation-based theories of language comprehension (
Bergen & Wheeler, 2010), which posit that grammatical aspect provides high-level constraints (cf.
Feldman & Narayanan, 2004) that define the type of the mental simulation. The progressive aspect is thought to trigger a mental simulation of the ongoing activity (e.g., “was walking”), which shares neural resources with the actual motor action required by the task. Under congruent conditions, this overlap facilitates processing, but it causes interference under inconsistent conditions, manifesting as the observed ACE. Conversely, the perfective aspect (e.g., “had walked”) shifts the focus to the endpoint or completion of an event. This type of simulation may not activate the specific motor programs engaged by our ACE paradigm, explaining the absence of a compatibility effect.
These results demonstrate that advanced L2 learners, much like native speakers, engage in intricate sensorimotor simulation that is highly sensitive to grammatical cues. This finding contributes to the growing body of research on embodiment effects in L2 processing (e.g.,
Q. Wang, 2016;
Y. Zhang & Zhang, 2015), and crucially extends it from the lexical level to the domain of grammar. As
Bergen and Wheeler (
2010) suggest, the influence of grammatical aspect implies that it functions as a “simulation specification”—a higher-order constraint that parameterizes how learners activate perceptual and motor systems to interpret sentences.
3.2. The Reversal of Simulation Patterns in Low-Proficiency Learners
The pattern of embodied simulation for low-proficiency learners differed qualitatively from that of advanced learners, confirming that proficiency fundamentally modulates the way grammatical aspect affects simulation. Simple effects analyses revealed a striking reversal: perfective sentences showed a significant ACE, with faster responses in consistent condition than in inconsistent conditions (p = 0.003), suggesting that the perfective aspect successfully triggered simulation. In contrast, no ACE was found for progressive sentences (p > 0.05), indicating that action simulation was not engaged for this aspect. Furthermore, responses to progressive sentences were significantly faster than those to perfective ones in the inconsistent condition, which poses higher cognitive demands (p = 0.01).
3.2.1. A Shallow Processing Account for the Progressive Aspect
We suggest that this reversal pattern stems from a qualitative difference in how low-proficiency learners process each aspect. As evidenced by
Geng and Yang (
2016), the progressive aspect is associated with a lower syntactic processing cost (smaller P600 amplitude) than the perfective aspect, a pattern which suggests that learners process the “Be + V-ing” construction as a holistic formal unit. However, this formal recognition is not accompanied by a robust, automatic retrieval of the form–meaning connection to its aspectual semantics of ongoing activity. As a result, it fails to trigger the action simulation necessary to produce an ACE. This notion of shallow, conflict-free processing is further supported by the faster reactions to progressive sentences in the demanding inconsistent condition.
3.2.2. Lexically Driven Simulation in the Perfective Aspect
In contrast, learners have trouble integrating morphosyntactic elements for the perfective aspect (“have + V-en”), as indicated by a higher P600 (
Geng & Yang, 2016). As a result, learners might be forced to concentrate on the main verb and use a discrete, lexical-processing approach. The concrete, directional semantics of the verb (e.g., “push”, “pull”) can then directly trigger an action simulation, thereby yielding the observed ACE. This account explains the key findings: the ACE emerges for the perfective aspect due to verb-driven simulation, but is absent for the progressive aspect due to lack of simulation. It also clarifies why response times were faster for progressive sentences under inconsistent condition: shallow formal analysis proceeds efficiently, whereas deep simulation encounters costly conflict. This is in line with the recognized developmental sequence in second-language learning, where bare verbs are acquired first, followed by V-ing forms, with complex periphrastic compounds like “have + V-en” acquired last (
Hawkins, 2001;
Yu, 2008). The later acquisition of the perfective aspect may explain why low-proficiency learners struggled with its morphosyntactic integration and resort to lexical-driven processing.
3.2.3. The Role of L1 Transfer and Typological Differences
Beyond these developmental processes, cross-linguistic transfer from the native language could be an extra source of interference. The Chinese and English aspectual systems differ fundamentally in typology. The progressive aspect of English is realized through a morphosyntactic construction requiring both an inflected form of the auxiliary verb BE and the morphological inflection of the main verb into the -ing form. This highly grammaticalized structure encodes primarily temporal information, emphasizing the ongoing and incomplete nature of an action. However, Chinese progressive aspect markers like “在 (zài)” and “着 (zhe)” are not inflectional and, being grammaticalized from originally spatial verbs, often retain semantic connotations of spatial continuity (
Ruan & Wang, 2015). Therefore, Chinese tends to express aspectual meaning through lexical and pragmatic ways within primarily spatial–conceptual frameworks (
Zhao & Wang, 2017). This typological disparity may lead Chinese learners of English to adopt L1-based processing strategies. This is supported by
Ma and Du’s (
2020) findings that low-proficiency Chinese EFL learners did not use native-like ambiguity resolution mechanisms that rely on aspectual cues, revealing that they were less sensitive to grammatical aspects when processing sentences with progressive morphology (e.g., -ing forms) online. The absence of an ACE for the progressive aspect can be explained by this lack of sensitivity, which renders the compatibility between sentence and response direction irrelevant. Conversely, the observed ACE for the perfective aspect may result from learners relying more on parsing discrete morphological cues and lexical verb semantics, which can directly trigger concrete action simulations.
3.2.4. Complementary Evidence from Response Accuracy
Crucially, the analysis of response accuracy provides complementary evidence that further elucidates the nature of this lexically driven simulation. For low-proficiency learners, the consistency between sentence direction and response direction not only influenced the speed of their responses to perfective sentences (as seen in the RT-based ACE) but also significantly affected the accuracy of their semantic judgment (β = 0.52, SE = 0.16, z = 3.26, p = 0.001). This pattern—where consistent trials yield higher accuracy than inconsistent trials—is a classic signature of high cognitive load. It indicates that when simulation is engaged via the lexical verb in the perfective aspect, the conflict in inconsistent conditions is so demanding that it compromises the final judgment, leading to more errors. This finding directly supports our interpretation that low-proficiency learners employ an effortful and strategic processing route for the perfective aspect, where simulation is not automatic but comes at a cost to both processing efficiency and outcome reliability.
In stark contrast, for the progressive aspect, where no ACE was observed in RTs, accuracy was also unaffected by consistency (p = 0.35). This double dissociation—no effect on either speed or accuracy—robustly confirms the “shallow processing” account. It suggests that the progressive aspect fails to trigger a deep simulation that could interact with the motor system, thereby rendering the compatibility of the sentence direction and arm movement direction irrelevant to the task, both in terms of processing speed and final judgment.
In summary, we contend that the proficiency-dependent reversal of the ACE is caused by a fundamental difference in processing strategies: a shallow, form-based processing of the progressive aspect that fails to trigger action simulation, versus a lexically driven, analytical processing of the perfective aspect that successfully engages the sensorimotor system.
3.3. Theoretical Integration and Implications
This study goes beyond merely documenting the presence or absence of embodied effects in L2 processing by demonstrating the proficiency-dependent reversal of the ACE. Rather, it reveals a deeper principle: the pathway and efficiency of embodied simulation are dynamically modulated by language proficiency. Our findings challenge the assumption of a universal, direct link between grammar and simulation. Therefore, we propose a dual-pathway model of L2 embodiment, in which the engagement of either pathway is conditioned by developmental readiness and L1-conceptual filters.
This reversal pattern compels a crucial theoretical refinement. The seminal monolingual framework (
Bergen & Wheeler, 2010) posits a direct mapping wherein grammatical aspects automatically specify distinct simulation formats (process vs. endpoint). Our data suggest that for L2 learners, this direct grammar → simulation link is not a starting point but rather a developmental accomplishment. For low-proficiency learners, the process is asymmetrical and indirect: grammar → analytical strategy → simulation. Through an alternative, lexically driven route, the perfective aspect, affords a pathway to simulation via its lexical verb. In contrast, the progressive aspect, with its complex morphosyntactic demands, leads to shallow processing that fails to trigger simulation.
The dissociation between reaction time and accuracy patterns across proficiency levels offers a clear behavioral marker of this transition. For low-proficiency learners, the consistency effect manifested in both RT and accuracy for the perfective aspect, underscoring the costliness and strategic nature of their simulation. For advanced learners, however, the effect was isolated to RT (specifically for the progressive aspect) without compromising accuracy. This pattern is indicative of emerging automaticity: the simulation process, while not yet entirely cost-free in terms of speed, has become robust enough to ensure error-free final outcomes. Thus, the progression towards the “direct-link paradigm” is not only reflected in what is simulated (as shown by the reversal of the ACE) but also in how efficiently that simulation is executed, with accuracy being the first to stabilize at native-like levels.
Our data provide behavioral support for
Ni’s (
2023) integrated model, which positions proficiency as a central moderator governing the interaction between linguistic representations and the simulation system. Specifically, low L2 proficiency impedes direct simulation and increases dependence on L1-mediated analytical processing. The reversal of the ACE is a stark behavioral indicator of this underlying cognitive configuration. Only at advanced proficiency levels does the system reconfigure to allow for a more direct grammar–simulation pathway, minimizing the mediating influence of the L1.
Ultimately, this study argues for a theory of L2 embodied cognition that is fundamentally probabilistic and developmental. Whether and how a grammatical structure will engage the sensorimotor system is not a fixed property of the structure itself, but a function of the learner’s proficiency and the resultant processing strategies it affords. The ACE does not simply “diminish” with lower proficiency; it can fundamentally reconfigure based on the learners’ available linguistic and conceptual resources.
While our behavioral study does not prescribe specific classroom activities, it provides a crucial theoretical foundation for rethinking L2 grammar instruction and outlines clear directions for future applied research. The central implication is that advanced L2 proficiency entails the development of a native-like “embodiment regime”—the automatic capacity to simulate meaning cued by grammar, which goes beyond the mastery of formal rules. This reconceptualization of proficiency from a purely linguistic to an embodied-cognitive level suggests that the ultimate goal of instruction should be to foster these direct form–meaning–simulation mappings.
Our findings raise critical, empirically testable questions for pedagogy: Could instructional techniques that explicitly engage the sensorimotor system (e.g., having learners perform or observe actions while using target grammatical structures) facilitate the development of this direct grammar–simulation link more effectively than traditional, abstract methods? Does the proficiency-dependent shift from lexical to grammatical simulation which we observed predict the ability to use language fluently and intuitively in real-time communication? Future research that translates these cognitive findings into classroom interventions is needed to bridge the gap between the laboratory and the language classroom. Therefore, the primary contribution of this work to pedagogy at present is not a set of teaching rules, but a new theoretical lens and a robust experimental paradigm for rigorously evaluating how embodied learning principles can optimize the acquisition of grammatical meaning.
In conclusion, we propose that the bilingual mind does not merely contain two linguistic systems but potentially two different embodiment regimes. The low-proficiency regime is characterized by effortful, strategically mediated simulation that is highly susceptible to L1 transfer. The high-proficiency regime approximates the monolingual direct-link paradigm. The transition from strategic to automatic embodiment represents a crucial, previously unidentified milestone in advanced L2 acquisition. Our findings, which document this qualitative shift, call for an extension of embodied cognition theory to account for the dynamic and experience-dependent nature of simulation processes in bilinguals.
4. Conclusions
This study investigated the embodied simulation of grammatical aspect in L2 sentence comprehension by addressing two central questions: (1) whether grammatical aspect modulates embodied simulation in L2 learners and if so, how, and (2) whether L2 proficiency modulates the way grammatical aspect influences such simulation. Using the ACE as a behavioral index of simulation, mixed-effects models revealed a significant three-way interaction, demonstrating that proficiency qualitatively shapes how grammatical aspect constrains simulation.
The main finding is a proficiency-dependent reversal in simulation patterns. Advanced learners, guided by grammar, showed an ACE for the progressive aspects, which fosters simulation of ongoing action, but not for the perfective aspects, which emphasize event completion. In contrast, low-proficiency learners displayed a reversed pattern: a significant ACE for the perfective aspects, suggesting verb-driven simulation due to difficulties with morphological integration, and no ACE for progressive aspects, indicating shallow, form-based processing that fails to trigger semantic simulation.
Theoretically, these findings support a dual-pathway, developmental model of L2 embodiment wherein simulation mechanisms are dynamically shaped by proficiency. The grammar–simulation link in L2 learners is not inherently direct. Instead, this link is strengthened through the development of robust form–meaning mappings and reduced L1 transfer at higher proficiency levels.
Pedagogically, our findings underscore the importance of building robust grammatical form–meaning connections through enriched input and practice. Technologies like virtual reality may provide multimodal, immersive environments to support embodied learning and facilitate the shift from effortful, lexically mediated simulation to automatic, grammatical guided simulation.
This study has several limitations that should be acknowledged. First, addressing the methodological limitations of the present study, future work should explore a wider range of verbs, employ longitudinal designs, and utilize more naturalistic tasks to enhance the generalizability and ecological validity of our findings.
More profoundly, our “two embodiment regimes” model points to a fundamental question in bilingual cognition: the potential for bidirectional interaction between L1 and L2 embodied systems. While our model traces the development of L2 processing, it raises the compelling possibility that attaining advanced L2 simulation capabilities may come at the cost of, or even restructure, native-like L1 embodied processing (
Vanek, 2019;
Baker, 2024;
Zingaretti et al., 2025). Directly testing for such L1 restructuring in highly proficient bilinguals constitutes a critical next step. Investigating this interplay is essential for moving beyond static models and toward a truly dynamic understanding of how embodied cognition is shaped by the bilingual experience.
In conclusion, this study demonstrates that the embodied simulation of grammatical aspect in L2 learners is a unitary phenomenon but is fundamentally shaped by proficiency. Higher proficiency enables a more direct, target-like interaction between language and simulation, whereas lower proficiency leads to asymmetrical, compensatory simulation routes. The findings suggest that the bilingual mind may function under different embodiment regimes depending on the stage of learning, with advanced proficiency enabling more direct grammar–simulation links.