Next Article in Journal
Exploring Optical Flow Methods for Automated Fall Detection System
Next Article in Special Issue
Security and Safety Education from the Polish Context to Reinforce Social Education at a Time of Global Uncertainty
Previous Article in Journal
A Deep Hybrid Recommendation Method for Multimodal Information Integrating Content Generated by Large Language Models
Previous Article in Special Issue
Dual-View Sign Language Recognition via Front-View Guided Feature Fusion for Automatic Sign Language Training
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Fluency Illusion: A Review on Influence of ChatGPT in Classroom Settings

1
Zaven P. and Sonia Akian College of Science and Engineering, American University of Armenia, Yerevan 0019, Armenia
2
Faculty of Culture and Arts, Makhambet Utemisov West Kazakhstan University, Uralsk 090000, Kazakhstan
*
Author to whom correspondence should be addressed.
Information 2026, 17(3), 299; https://doi.org/10.3390/info17030299
Submission received: 13 February 2026 / Revised: 12 March 2026 / Accepted: 18 March 2026 / Published: 19 March 2026

Abstract

The rapid adoption of generative artificial intelligence tools such as ChatGPT in educational settings has generated both enthusiasm and concern regarding their influence on student learning. While several studies report improvements in efficiency, confidence, and perceived understanding, evidence for durable conceptual learning and knowledge transfer remains mixed. This article examines these tensions through the concept of fluency illusion, a cognitive phenomenon in which information that is easy to process is mistakenly judged as being well understood. Using a narrative conceptual review approach, this study synthesizes findings from 41 publications identified through searches of Google Scholar, Scopus, Web of Science, and ERIC covering the period from 2022 to early 2026. The reviewed literature includes 28 empirical studies, nine conceptual or theoretical analyses, and four review articles addressing the use of ChatGPT in educational contexts. Across domains such as writing and language learning, STEM problem solving, feedback and tutoring, and assessment, the literature shows a recurring pattern in which fluent AI-generated responses increase learners’ confidence without consistently improving deeper conceptual understanding. Drawing on research from cognitive psychology and metacognition, this review proposes an integrative conceptual account of how fluent AI output may shape learners’ judgments of understanding and influence their engagement with learning tasks. The paper concludes by discussing implications for instructional design, assessment practices, and metacognitive scaffolding, and outlines directions for future research aimed at empirically examining the proposed framework and identifying strategies to reduce fluency-driven misjudgments while preserving the potential benefits of generative AI in education.

1. Introduction

The rapid emergence of large language models has introduced a new class of tools into educational contexts, with ChatGPT among the most widely adopted examples [1]. Capable of generating coherent, contextually appropriate, and stylistically polished responses across a wide range of academic tasks, ChatGPT is increasingly used by students and educators for writing support, problem explanation, idea generation, and feedback [2]. Its accessibility and apparent versatility have accelerated its integration into classroom settings, often preceding the development of well-established pedagogical frameworks or shared understandings of its influence on learning processes.
While early discussions have highlighted the potential benefits of ChatGPT for personalized learning and instructional support, growing attention has begun to focus on a more subtle challenge [3]. The fluency of AI-generated text frequently creates an impression of correctness, depth, and understanding that may not correspond to genuine learning [4]. Well-structured and linguistically polished responses can appear authoritative even when they contain inaccuracies or simplified reasoning [5]. This divergence between surface quality and underlying understanding raises important questions about how learners interpret and engage with AI-generated content in educational environments [6].
From a cognitive perspective, this phenomenon aligns closely with what has been described as fluency illusion [7]. Research in cognitive psychology has long demonstrated that information that is easy to process is often judged as more accurate, more credible, and better understood than information that requires greater cognitive effort [8]. In classroom contexts, such fluency-based judgments can lead learners to overestimate their own comprehension, reduce critical evaluation, and prematurely conclude that learning has been achieved. When fluency is externally generated by an AI system rather than arising from the learner’s own cognitive processing, the risk of misattributing understanding becomes particularly pronounced.
The rapid adoption of ChatGPT in education has, in many cases, outpaced systematic pedagogical reflection on these cognitive dynamics [9]. Existing studies often focus on discrete outcomes such as writing quality, task efficiency, or student attitudes, and frequently report mixed or context-dependent findings. However, these investigations are rarely connected through a shared conceptual framework capable of explaining why fluent AI assistance may simultaneously enhance performance while undermining deeper learning. As a result, the literature remains fragmented, with limited integration between empirical observations and established theories of learning and metacognition.
At the same time, the empirical evidence regarding learning outcomes remains uneven. Several experimental and classroom-based studies report improvements in task completion, perceived understanding, or writing quality when students use generative AI tools. However, evidence concerning deeper conceptual learning and knowledge transfer is more limited and sometimes inconsistent. Some studies indicate that while AI-assisted explanations can support immediate performance, learners may still struggle to reproduce or apply the underlying reasoning independently. These findings are consistent with research on self-regulated learning and metacognitive calibration, which shows that learners often rely on subjective cues such as the ease with which information is processed when judging their own understanding. In AI-supported learning environments, fluent responses generated by systems such as ChatGPT may therefore increase learners’ confidence even when underlying understanding remains uncertain.
Recent empirical studies also illustrate this tension more directly. Experimental and classroom-based investigations have reported that the use of ChatGPT can improve task completion, writing fluency, and students’ perceived understanding. For example, studies examining AI-assisted writing tasks report that students often produce more fluent and well-structured text, while improvements in argument quality or conceptual reasoning remain limited [10,11,12]. Similarly, research in STEM learning contexts suggests that students may successfully follow AI-generated explanations yet struggle to reproduce or adapt the same reasoning independently in new problems [13,14]. These findings indicate that fluent AI support may increase confidence and short-term performance while leaving deeper learning outcomes less certain.
Viewing ChatGPT through the lens of fluency illusion offers a unifying perspective for understanding these tensions [15]. Rather than framing AI use as inherently beneficial or inherently problematic, this perspective emphasizes how the perceptual qualities of AI-generated content shape learners’ judgments, behaviors, and learning strategies. It also highlights the difference between fluent academic performance and genuine conceptual understanding, a distinction that is central to education but often blurred when students use AI tools [16].
Against this background, the present study adopts a narrative and conceptually oriented review approach to examining the influence of ChatGPT in classroom settings, with particular attention to how fluent AI-generated responses may shape learners’ perceptions of understanding. It brings together empirical findings, theoretical discussions, and emerging pedagogical perspectives to clarify how fluency illusion operates in AI-supported learning. Specifically, the review seeks to synthesize existing work on the use of ChatGPT in classrooms, to advance a conceptualization of fluency illusion in AI-mediated learning, and to outline implications for pedagogy and assessment. By situating current research within a coherent theoretical framework, the review aims to contribute to a more nuanced understanding of how generative AI reshapes not only what students produce, but also how they perceive and evaluate their own learning.
The objectives of this review are threefold. First, it integrates research on the adoption of ChatGPT across diverse classroom contexts. Second, it links insights from cognitive psychology with emerging evidence on AI-mediated learning to elaborate the concept of fluency illusion. Third, it examines how this phenomenon challenges prevailing assumptions about student understanding, instructional design, and assessment practices. Throughout, the review prioritizes theoretical coherence and explanatory insight over exhaustive coverage, with the goal of illuminating recurring mechanisms that shape learners’ experiences with fluent AI systems.

2. Methodology

This study adopts a narrative review approach to examine how ChatGPT (OpenAI) and related generative AI tools are being used in educational settings, with particular attention to patterns of perceived fluency, learning confidence, and evidence of actual learning outcomes. The review was designed to prioritize conceptual coherence and pedagogical relevance rather than exhaustive coverage of all published studies.

2.1. Search Strategy

A targeted literature search was conducted using Google Scholar, Scopus, Web of Science, and ERIC. The searches were carried out between January 2022 and January 2026. Core search terms included combinations of “ChatGPT”, “generative AI”, “large language models”, “education”, “learning”, “writing”, “assessment”, “feedback”, “student confidence”, and “learning outcomes”. Searches were performed primarily in titles, abstracts, and keywords. The review focused on studies published between 2022 and early 2026, reflecting the period during which ChatGPT and similar tools entered widespread educational use.

2.2. Inclusion and Exclusion Criteria

Studies were included if they examined the use of ChatGPT or closely related generative AI systems in formal educational contexts, including secondary and higher education. Eligible publications included empirical studies reporting classroom implementation, student perceptions, learning outcomes, or assessment-related effects, as well as conceptual or theoretical papers that provided analytical discussion of the pedagogical or cognitive implications of generative AI in education.
Empirical studies were included when they reported original data collected through experimental, survey-based, classroom-based, or mixed-methods designs. Conceptual and theoretical contributions were included when they offered substantive analysis relevant to AI-mediated learning, metacognition, or instructional design. Studies that did not address educational contexts or that focused exclusively on technical model development were excluded.
Studies were excluded if they focused purely on technical model development without educational application, discussed generative AI only in non-educational contexts, or consisted solely of opinion pieces without analytical or empirical grounding. Only English-language publications were considered.

2.3. Screening and Selection Process

An initial set of articles was identified through database searches and reference list screening. Duplicates were removed, after which titles and abstracts were reviewed for relevance. Full texts were then examined to confirm alignment with the study’s focus on learning processes, perceived fluency, and educational impact.
During the full-text screening stage, attention was also given to basic indicators of methodological quality. Although the review did not apply a formal scoring system, studies were evaluated with reference to commonly used appraisal principles in educational research, such as clarity of research design, transparency in data collection procedures, and adequacy of the reported analysis. These criteria were informed by widely used appraisal frameworks, including the Critical Appraisal Skills Programme (CASP) and the Mixed Methods Appraisal Tool (MMAT).
Publications lacking sufficient methodological description or analytical grounding were excluded during the eligibility assessment stage. This approach allowed the review to maintain methodological transparency while remaining consistent with the interpretive and conceptually oriented nature of a narrative synthesis.

2.4. Data Extraction and Synthesis

For each included study, key information was extracted, including educational setting, participant group, task type, reported benefits, reported concerns, and the type of evidence used to evaluate learning. Particular attention was paid to whether studies relied on self-reported confidence, perceived ease of use, or measurable performance outcomes.
The synthesis followed a thematic narrative approach. Studies were grouped by instructional context such as writing, language learning, STEM education, feedback, and assessment. Recurring patterns across these contexts were used to develop the conceptual framework presented in the figures, linking fluency cues to confidence, reduced cognitive friction, and potential mismatches between perceived and actual learning.

2.5. Scope and Limitations

This review does not claim to be exhaustive or fully systematic. Its aim is to clarify emerging patterns and conceptual tensions in the literature rather than to quantify effect sizes or rank interventions. While this approach allows for integrative interpretation across diverse study designs, it also means that conclusions should be read as analytically grounded rather than statistically definitive.

2.6. Study Selection Overview

To improve transparency in the selection process, a structured screening procedure was applied during the literature search. The initial search across the selected databases produced a broad set of publications related to generative AI and education. After removing duplicate records, titles and abstracts were screened to identify studies that examined the use of ChatGPT or closely related generative AI systems in educational contexts.
Articles focusing exclusively on technical model development or non-educational applications were excluded at this stage. The remaining publications were then evaluated through full-text review to determine their relevance to learning processes, student perceptions, or instructional outcomes associated with generative AI use.
The overall study selection process is summarized in Figure 1. The final review corpus consisted of 41 publications that met the inclusion criteria and directly addressed classroom applications of ChatGPT or comparable generative AI tools. Of these, 28 were empirical studies reporting original classroom-based investigations or survey-based analyses of ChatGPT use in educational settings. Nine publications were conceptual or theoretical papers that examined the pedagogical, cognitive, or metacognitive implications of generative AI in learning environments. In addition, four review articles were included because they provided broader syntheses of emerging research on generative AI in education and helped contextualize the empirical findings discussed in this study.

3. Theoretical Foundations of Fluency Illusion

3.1. Processing Fluency and Judgments of Understanding

Fluency illusion is grounded in a well-established body of cognitive and educational research showing that learners’ judgments of understanding are frequently shaped by subjective experiences rather than by objective indicators of learning [17]. A central construct in this literature is processing fluency, which refers to the perceived ease with which information is read, interpreted, or mentally manipulated (see Figure 2 for the proposed AI-mediated pathway).
The model presented in Figure 2 should be interpreted as a conceptual framework designed to organize insights from cognitive psychology and emerging research on AI-supported learning. It does not represent an empirically validated causal model. Rather, it offers a theoretical proposal that highlights potential relationships between processing fluency, metacognitive judgments, and learning outcomes in AI-mediated learning environments.
Information that is processed fluently is consistently judged as more accurate, more comprehensible, and better learned, even when such judgments are misaligned with performance on transfer or application tasks [18]. In instructional contexts, this tendency can lead learners to equate clarity of presentation with conceptual mastery.
Research on illusions of competence further elucidates how fluency can distort self-evaluations of learning [19]. When instructional materials are well structured, linguistically polished, or immediately responsive, learners often report high confidence despite limited ability to independently reproduce or apply the content. This pattern reflects long-standing findings in educational psychology showing that exposure and recognition are frequently mistaken for genuine learning. From a metacognitive perspective, such misjudgments arise because learners rely on salient and cognitively economical cues when evaluating their own understanding. Judgments of learning are often informed by how easily information is processed in the moment rather than by diagnostic evidence of learning, such as the ability to retrieve, explain, or transfer knowledge [20].
These dynamics can also be considered in relation to research on retrieval practice and cognitive offloading. A large body of work on retrieval practice shows that actively recalling information strengthens long-term learning more effectively than passive exposure to explanations or summaries. When learners rely heavily on AI-generated explanations, opportunities for retrieval-based learning may diminish, especially if students consult the system before attempting to articulate their own understanding. At the same time, generative AI tools can function as a form of cognitive offloading by allowing learners to delegate portions of explanation or reasoning to an external system. While such support may improve efficiency and accessibility, it may also reduce the cognitive effort required for durable learning when it replaces rather than complements active engagement with the material.

3.2. Metacognitive Monitoring and Judgments of Learning

Insights from research on metacognitive monitoring further clarify why fluency-driven misjudgments occur in learning contexts [21]. Cue-utilization theories of judgments of learning suggest that learners often rely on heuristic signals when evaluating their own understanding, particularly when more diagnostic evidence is not immediately available. Koriat’s cue-based framework [22] proposes that judgments of learning are influenced by both intrinsic cues related to the material itself and experiential cues such as processing fluency or familiarity. Because these cues are only imperfect indicators of actual knowledge, learners may develop inflated confidence when information is easy to process. Related work on metacognitive calibration by Dunlosky and Metcalfe [23] similarly demonstrates that learners frequently overestimate their level of understanding when evaluation relies on subjective feelings of comprehension rather than retrieval-based evidence. When fluent explanations are generated by AI systems, these metacognitive cues may become even more salient, increasing the likelihood that processing ease will be interpreted as genuine learning.
A related phenomenon discussed in the human–automation interaction literature is automation bias, which refers to the tendency to accept recommendations generated by automated systems without sufficient verification. Although automation bias originates in research on decision support systems rather than metacognitive judgment, it may interact with fluency illusion in AI-mediated learning environments. Fluent and confident system responses can encourage both uncritical acceptance of the output and inflated perceptions of understanding.

3.3. Distinguishing Fluency Illusion from Related Metacognitive Illusions

Several related constructs in the metacognition literature describe situations in which learners misjudge their own understanding. Although these concepts are closely related, they originate from different theoretical traditions and describe distinct mechanisms of miscalibrated learning judgments. The illusion of competence refers broadly to situations in which learners overestimate their level of mastery despite limited ability to recall or apply knowledge independently. Similarly, the illusion of knowing arises when familiarity or recognition is mistaken for genuine understanding.
A related but conceptually distinct phenomenon is the illusion of explanatory depth, which describes individuals’ tendency to believe that they understand complex systems or concepts more deeply than they actually do. When learners are asked to produce detailed explanations of such systems, this perceived understanding often collapses [24]. This effect has been widely observed in domains involving causal reasoning and conceptual explanation.
The fluency illusion, as used in this review, refers more specifically to misjudgments that arise from the subjective ease with which information is processed. When explanations are clear, coherent, and easy to follow, learners may interpret this processing ease as evidence of genuine understanding. In AI-mediated learning environments, fluent explanations generated by systems such as ChatGPT may amplify this effect. In particular, in STEM learning contexts where students frequently request step-by-step conceptual explanations, fluency illusion may interact with the illusion of explanatory depth by encouraging learners to equate the ability to follow an explanation with the ability to construct or apply that explanation independently.

3.4. Surface Coherence and Deep Learning

The distinction between surface coherence and deep learning provides an important lens for interpreting these effects [25]. Surface coherence refers to the internal consistency, readability, and rhetorical smoothness of information, whereas deep learning involves conceptual integration, inferential reasoning, and flexible application. Highly coherent explanations can therefore create a sense of understanding without engaging the cognitive processes required for durable learning. This distinction helps explain why learners may feel confident while remaining unable to apply or extend knowledge beyond familiar contexts.
These dynamics can also be interpreted through the lens of Cognitive Load Theory, which highlights the role of cognitive effort in effective learning [26,27]. While reducing unnecessary cognitive load can improve learning efficiency, highly fluent instructional materials may also reduce the depth of learners’ engagement with the underlying concepts. In such situations, AI-generated explanations that minimize perceived difficulty may inadvertently reduce the level of productive cognitive effort invested by learners. A related perspective is offered by the Desirable Difficulties framework [28], which suggests that learning conditions involving manageable challenges, such as retrieval practice, problem generation, or effortful explanation, often lead to stronger long-term retention and transfer. From this viewpoint, the fluency of AI-generated responses may create learning conditions that feel cognitively efficient in the short term but limit opportunities for the effortful processing that supports durable learning.
Generative AI systems introduce a qualitatively distinct form of fluency that is externally generated and dynamically adaptive [29]. Unlike traditional instructional resources, these systems produce context-sensitive responses that closely align with user prompts, creating interactions that feel personalized and authoritative [30]. As a result, the source of fluency may shift from the learner’s own cognitive activity to the system itself. In learning situations where such output is readily available, learners may experience clarity and resolution that reflect the fluency of the response rather than their own evaluative competence.
The conversational nature of large language models further amplifies this effect by fostering a sense of dialogue and understanding [31]. The coherence and plausibility of generated responses can create strong impressions of preparedness and comprehension, even when learners have not engaged in the reasoning processes required to articulate, defend, or adapt those ideas independently. This highlights how fluency illusion can extend beyond private study to influence participation, confidence, and performance in learning environments.
By integrating insights from processing fluency and metacognition, this review conceptualizes fluency illusion as a key explanatory mechanism shaping learners’ experiences with fluent external supports [32]. Rather than positioning such systems as inherently beneficial or detrimental, this perspective emphasizes how fluent outputs interact with established cognitive biases, potentially undermining accurate self-assessment. Recognizing this mechanism provides a theoretical foundation for interpreting empirical findings on AI-supported learning and for designing pedagogical and assessment practices that promote active engagement, metacognitive awareness, and genuine understanding.
The strength of fluency-driven misjudgments is unlikely to be uniform across learners. Factors such as prior domain knowledge, metacognitive monitoring ability, epistemic motivation, and instructional context may moderate the relationship between processing fluency and perceived understanding. Learners with stronger prior knowledge or more developed metacognitive regulation may be better able to recognize the limitations of fluent AI-generated explanations and therefore less likely to equate processing ease with genuine understanding [33].

4. ChatGPT in Classroom Settings: What the Literature Shows

The growing body of literature on ChatGPT in education reflects a wide range of classroom applications, methodological approaches, and disciplinary perspectives [34]. Rather than converging on a single narrative of benefit or risk, existing studies reveal patterned tensions between perceived usefulness and concerns about learning depth, learner autonomy, and assessment validity. An integrative synthesis of this work highlights how ChatGPT’s pedagogical affordances are frequently accompanied by cognitive effects linked to fluency, even when these effects are not explicitly named or theorized by the original authors. Table 1 summarizes recurring applications, benefits, concerns, and associated fluency-related patterns observed across classroom contexts.
Figure 3 illustrates the conceptual distinction between surface performance and deep learning in AI-mediated environments. In many AI-supported tasks, fluent explanations can help learners complete problems quickly and produce well-structured responses. However, this apparent success may reflect surface-level performance rather than deeper conceptual integration. The figure highlights how learners may successfully follow or reproduce AI-generated reasoning while still lacking the capacity to independently construct or adapt similar explanations. This distinction helps clarify why students may perform adequately on routine tasks yet struggle with problems requiring conceptual transfer or independent reasoning.
Figure 4 summarizes the different contexts in which fluency-driven misinterpretations of learning may emerge. Across writing, STEM problem solving, feedback interactions, and assessment contexts, the common mechanism involves the tendency to interpret coherent and well-structured AI output as evidence of understanding. While the specific instructional situations differ, the underlying cognitive dynamic remains similar: learners rely on the perceived clarity of external explanations rather than on demonstrable reasoning or knowledge generation. The figure therefore emphasizes that fluency illusion is not limited to a single domain but represents a cross-contextual challenge for AI-mediated learning environments.
The final review corpus consisted of 41 studies that met the inclusion criteria described in Section 2. The studies were distributed across four major instructional contexts discussed in this review. Seventeen studies examined the use of ChatGPT in writing and language learning settings, eleven focused on STEM-related problem solving and conceptual explanation, seven addressed the use of generative AI for feedback or tutoring purposes, and six investigated issues related to assessment and academic integrity. This distribution reflects the current concentration of research activity in writing-related applications while also highlighting the growing attention to AI-supported learning across other instructional domains.

4.1. ChatGPT in Writing and Language Learning

Research on writing and language learning constitutes one of the most active areas of inquiry into ChatGPT’s classroom use [10]. Studies consistently report that students perceive AI-generated text as helpful for brainstorming, drafting, paraphrasing, and improving linguistic accuracy [11]. In second language contexts, ChatGPT is frequently described as a source of accessible input, vocabulary expansion, and grammatical support, particularly for learners with limited confidence or exposure to proficient language models [35]. From an instructional perspective, these uses are commonly framed as scaffolding mechanisms that reduce cognitive load and support engagement.
Although a growing number of studies have examined the use of generative AI tools in writing instruction, the research varies considerably in its methodological design and evaluation approaches.
At the same time, the literature raises concerns about students’ reliance on fluent AI-generated prose and the implications for writing-related learning outcomes [36]. Several studies report that while students express increased confidence after using ChatGPT, this confidence does not consistently translate into improvements in argumentation quality, rhetorical control, or independent writing performance [12,36,37,38]. Linguistic polish can obscure weaknesses in organization or reasoning, leading students to overestimate their writing competence. Even when studies focus primarily on authorship or originality, fluency-related effects are evident in the tendency to treat well-formed output as a proxy for quality and learning.
Although these studies generally report improvements in perceived writing quality or argument organization, the methods used to evaluate such outcomes vary considerably across the literature. In some cases, argumentative quality is assessed through rubric-based evaluations focusing on elements such as claim clarity, evidence integration, coherence, and rhetorical structure. Other studies rely on instructor grading, automated text analysis tools, or self-reported perceptions of writing improvement. Differences in participant populations, task types, and outcome measures further contribute to methodological heterogeneity. For example, some investigations examine short argumentative essays written under controlled classroom conditions, while others analyze revisions produced during longer writing assignments or AI-assisted drafting processes. These variations make direct comparison across studies difficult and highlight the need for more standardized evaluation frameworks when assessing the impact of generative AI on writing development.
An additional distinction emerging in the literature concerns the different ways in which students use generative AI during writing tasks. In some cases, ChatGPT functions as a partial support tool for brainstorming, editing, or improving linguistic clarity. In other situations, students rely on the system to generate substantial portions of the written content. These different forms of use can lead to substantially different learning outcomes. Improvements in linguistic fluency or grammatical accuracy do not necessarily translate into stronger argumentative structure or deeper conceptual reasoning. Distinguishing between linguistic support and authorship substitution is therefore important when interpreting reported improvements in writing performance.
To clarify these distinctions, Table 2 summarizes common forms of ChatGPT use in writing tasks and their potential learning implications.
This comparison shows that not all uses of ChatGPT have the same learning implications. When the tool is used for limited support such as grammar correction or idea generation, it can function as a scaffold that helps students improve their work. However, when students rely on the system to generate large portions of text, the cognitive work involved in organizing arguments and developing ideas may be reduced.

4.2. Problem-Solving and STEM Support

In STEM-oriented classroom settings, ChatGPT is commonly examined as a tool for explaining concepts, outlining solution procedures, or supporting debugging and code comprehension [13]. Students frequently report that AI-generated explanations help clarify abstract ideas, translate formal notation into natural language, and provide timely assistance outside scheduled instructional time. In introductory courses, such support is often associated with reduced frustration and increased persistence.
The interpretation of AI-supported problem solving in STEM contexts can also be informed by broader instructional research on cognitive load and mathematical problem-solving strategies. Ngu and Phan [40] examine how instructional guidance and cognitive load influence the development of mathematical proficiency. Although their study does not directly investigate the use of ChatGPT or other generative AI tools, it offers relevant theoretical insights into how external instructional supports may shape learners’ engagement with complex problem-solving tasks.
Several studies show that students who rely heavily on AI assistance perform adequately on routine or near-transfer tasks but struggle with novel or conceptually demanding problems. These findings suggest that recognizing coherent reasoning is not equivalent to developing the ability to generate or adapt solutions independently. While such outcomes are often discussed in terms of dependency or surface learning, they align closely with fluency-related misjudgments observed across instructional contexts.
These findings are also consistent with a long-standing distinction in learning research between studying worked examples and generating solutions independently. Studies in cognitive load theory have shown that worked examples can support initial comprehension by reducing problem-solving complexity, particularly for novice learners. However, deeper learning often requires opportunities for learners to generate explanations or solutions themselves. AI-generated step-by-step explanations may function similarly to worked examples by supporting immediate task completion while reducing the need for learners to actively construct the reasoning process. As a result, students may perform adequately during guided tasks but encounter difficulties when attempting to transfer the same reasoning to new problems.
Recent studies and systematic reviews examining the use of generative AI in education report similar patterns. Reviews of ChatGPT use in STEM higher education indicate that students often rely on AI-generated explanations to understand procedures or verify intermediate steps in problem solving [13]. While this support can improve immediate task completion and reduce frustration, it may not always strengthen independent reasoning skills. Related work on over-reliance on AI dialogue systems also suggests that frequent dependence on AI-generated solutions can reduce opportunities for learners to engage in the active reasoning processes that support durable conceptual learning [14]. These findings reinforce the distinction between recognizing correct explanations and developing the ability to generate or adapt problem-solving strategies independently.
Recent classroom studies in programming and quantitative subjects show similar patterns. Students who use ChatGPT often complete routine exercises more quickly and report that explanations appear clearer. However, some studies also report that students experience difficulties when they are asked to solve similar problems independently or explain the reasoning without AI assistance. These findings suggest that recognizing a correct explanation is not always the same as developing the ability to construct the solution independently [13,14,40].

4.3. Feedback and Tutoring Functions

Another prominent theme in the literature concerns the use of ChatGPT as a feedback provider or tutoring agent [41]. Studies highlight the perceived value of immediate, detailed, and nonjudgmental feedback, particularly in large classes where instructor feedback is limited. Students often describe AI-generated feedback as clearer and more actionable than peer feedback, and instructors note potential efficiency gains when AI is used for formative purposes.
At the same time, concerns arise regarding the epistemic authority attributed to fluent AI feedback. Because responses are typically confident, well structured, and tailored to user input, students may accept them uncritically, even when feedback is incomplete, generic, or subtly inaccurate. The conversational format can further reinforce a sense of personalized understanding, encouraging reliance on AI validation rather than reflective evaluation. Although few studies explicitly frame this phenomenon as a metacognitive issue, the literature suggests that fluency-driven acceptance may occur when students interact with highly polished AI-generated feedback.
Empirical studies on automated feedback uptake provide related evidence. Cavalcanti et al. [42] report similar patterns in research on automated feedback systems, showing that students frequently incorporate feedback suggestions into their revisions without critically evaluating the underlying reasoning, particularly when the feedback is presented in a clear and authoritative form.
These dynamics are also closely related to research on trust in automated systems and human–computer interaction. Research on trust calibration in human–AI interaction suggests that users may place excessive trust in automated systems when responses appear confident, coherent, or authoritative, which may reduce critical evaluation of AI-generated feedback [8]. In educational settings, this tendency may influence how students interpret and use AI-generated feedback. The effects may also vary depending on the type of feedback provided. Corrective feedback typically focuses on identifying specific errors or inaccuracies in a learner’s response and can support targeted revision when students actively engage with it. In contrast, elaborative feedback provides extended explanations intended to deepen conceptual understanding.
In AI-mediated learning environments, highly fluent elaborative feedback may increase the risk of passive acceptance if learners rely on the explanation without critically evaluating the reasoning behind it. Factors such as prior knowledge, digital literacy, and experience with AI tools may therefore moderate how students interpret and use AI-generated feedback. This issue can also be understood through research on trust in human–AI interaction. Studies in this field show that users may place excessive trust in automated systems when responses appear confident, clear, or authoritative. In educational settings, the fluent style of AI-generated feedback may therefore influence how students evaluate its quality. Students may accept feedback more readily because it appears well structured, even when they do not fully examine the reasoning behind it.

4.4. Assessment and Academic Integrity

Assessment and academic integrity are recurring concerns in the literature, often shaped by institutional anxieties surrounding plagiarism, authorship, and fairness [43]. Reported instructional responses include redesigned assessments, greater emphasis on in-class work, and the adoption of reflective or process-oriented tasks. Some studies indicate that when assessments emphasize reasoning, explanation, or oral defense, the direct influence of ChatGPT-assisted output on performance is reduced.
Beyond integrity concerns, the literature also highlights challenges related to assessment validity. Fluent AI-generated responses can make it difficult to distinguish between independent understanding and tool-assisted performance, particularly in written or take-home assessments. Students may also interpret successful completion of AI-assisted tasks as evidence of learning, even when assessments fail to capture transferable competence. These dynamics suggest that fluency-related effects operate not only at the level of individual learning but also at the level of instructional evaluation.
These concerns are increasingly addressed through emerging institutional frameworks that seek to clarify appropriate uses of generative AI in assessment contexts. Universities and professional bodies have begun to articulate policies that distinguish between acceptable forms of AI-assisted learning and practices that undermine independent demonstration of knowledge [44]. At the same time, approaches to AI governance vary considerably across institutions, disciplinary traditions, and national educational systems, reflecting differences in assessment culture and regulatory environments [45]. An important pedagogical distinction also arises between formative and summative assessment contexts. In formative settings, AI-generated explanations and feedback may support practice, reflection, and skill development when used transparently. In contrast, in summative assessments where students are expected to demonstrate independent understanding, fluency-driven misjudgments may be more problematic because highly coherent AI-generated responses can mask gaps in reasoning or conceptual mastery.
These concerns have also encouraged renewed interest in forms of authentic assessment and assessment for learning. Authentic assessment tasks that require explanation, application, or contextual reasoning may reduce the likelihood that fluent AI-generated responses can substitute for genuine understanding. Examples include oral defenses of written work, iterative project submissions that document the development of ideas over time, or reflective commentaries in which students explain how AI tools were used during the learning process. Such approaches shift the emphasis from the final product to the reasoning and learning processes that produced it.

4.5. Integrative Perspective

Across these thematic areas, a consistent pattern becomes visible. ChatGPT is widely perceived as useful and efficient, yet these benefits are frequently accompanied by confidence gains that are not consistently matched by evidence of deep or transferable learning [46]. When viewed through the lens of fluency illusion, these findings suggest that the central challenge posed by ChatGPT in classroom settings lies in the misalignment between fluency-driven perceptions and demonstrable understanding.
By synthesizing research across writing, STEM, feedback, and assessment contexts, this review highlights fluency illusion as a unifying explanatory framework underlying diverse concerns in AI-mediated education. Recognizing this mechanism supports a more coherent interpretation of existing findings and provides a foundation for pedagogical and assessment strategies that address not only what students produce with AI, but how they evaluate and regulate their own learning in its presence.

5. Conceptualizing Fluency Illusion in AI-Mediated Learning

Despite the growing volume of research on generative AI in education, a coherent conceptual explanation of how fluent AI output shapes learners’ judgments of understanding remains underdeveloped [47]. This section advances the concept of fluency illusion in AI-mediated learning as a theoretically grounded lens for interpreting recurring patterns across classroom-based studies. By clarifying its meaning, unpacking its underlying mechanism, and outlining its broader explanatory value, the section positions fluency illusion as a central construct for understanding the educational impact of ChatGPT.
At its core, fluency illusion in AI-mediated learning refers to the misattribution of the clarity, coherence, or apparent correctness of AI-generated content to one’s own understanding or competence. What distinguishes this phenomenon from earlier accounts of learning illusions is the source of fluency itself. Rather than emerging from the learner’s cognitive processing or from instructional design alone, fluency is produced externally by an adaptive system that responds directly to the learner’s inputs, often with remarkable linguistic and conceptual polish.
This externally generated fluency initiates a subtle but powerful shift in metacognitive judgment [17]. When learners encounter responses that are easy to follow and rhetorically convincing, the subjective experience of understanding intensifies. Processing ease is interpreted as mastery, leading to elevated confidence and judgments of learning that may not reflect actual conceptual integration. As perceived understanding increases, learners may reduce cognitive effort, rely more heavily on the tool, or disengage from strategies that would otherwise promote deeper learning.
Importantly, this mechanism unfolds in ways that distinguish AI-mediated learning from more familiar instructional contexts. Traditional sources of fluency, such as well-designed textbooks or clear lectures, are relatively stable and limited in their responsiveness. By contrast, generative AI systems dynamically tailor their output to individual prompts, reinforcing fluency through personalization and conversational continuity. This responsiveness strengthens the learner’s inference that understanding has been achieved through interaction, even when key reasoning processes remain externalized.
For analytical clarity, the relationships illustrated in Figure 4 can also be interpreted in terms of a simplified causal structure. In this framework, the external fluency of AI-generated explanations functions as the primary independent variable. This fluency shapes learners’ judgments of understanding through mediating processes such as increased confidence, reduced perceived cognitive effort, and reliance on surface indicators of comprehension. The strength of these effects may vary across learners and instructional situations. Factors such as prior knowledge, metacognitive monitoring ability, epistemic motivation, and instructional context may therefore act as moderating variables that influence how strongly processing fluency affects perceived understanding and subsequent learning behavior. The model should therefore be interpreted as a theoretical representation of potential relationships rather than as an empirically validated causal mechanism.
Viewed as a process rather than a momentary bias, fluency illusion can be conceptualized as a recursive cycle [48]. Initial interaction with fluent AI output elevates perceived understanding, which in turn shapes learners’ regulatory decisions about effort, strategy use, and task engagement. Reduced cognitive investment limits opportunities for retrieval, elaboration, and transfer, resulting in learning outcomes that are shallow yet subjectively satisfying. Subsequent interactions with AI reinforce this cycle by repeatedly providing fluent resolutions, thereby stabilizing the illusion across learning episodes.
This conceptualization also underscores that fluency illusion is not solely an individual cognitive tendency but an interactional outcome shaped by the affordances of AI systems. Features such as authoritative tone, structured reasoning, and conversational turn-taking contribute to a sense of epistemic reliability that is difficult for learners to interrogate. In classroom environments characterized by time pressure and performance demands, these affordances may further encourage reliance on fluency as a heuristic for learning success.
Framing fluency illusion in this way helps reconcile apparent inconsistencies in the existing literature [49]. Reports of heightened confidence, satisfaction, and perceived learning can coexist with evidence of limited transfer, fragile understanding, or dependence on AI support. Rather than reflecting contradictory findings, these patterns are consistent with a systematic misalignment between subjective experience and learning outcomes that fluent AI output is particularly well positioned to amplify.
By articulating the role of fluency illusion in AI-mediated learning, this review offers a unifying theoretical perspective that connects cognitive psychology with emerging classroom research on generative AI. This framing shifts attention away from binary evaluations of AI use and toward the cognitive processes that shape how learners experience and interpret their own learning. In doing so, it lays the groundwork for pedagogical and assessment approaches that seek not only to leverage AI’s capabilities, but also to counteract fluency-driven misjudgments and support meaningful, self-regulated learning in AI-rich educational contexts.
Conceptualizing fluency illusion as a central mechanism in AI-mediated learning has direct implications for how ChatGPT is integrated into classroom practice and how learning is evaluated [50]. If fluent AI output systematically shapes learners’ perceptions of understanding, then pedagogical and assessment decisions cannot rely solely on observable performance or polished products as indicators of learning. Instead, instructional design must account for the ways in which AI-generated fluency can obscure gaps in understanding and distort metacognitive judgment. This perspective shifts the focus from whether generative AI is used to how its use is structured, monitored, and assessed within educational settings. The following section builds on this conceptual framework to examine how teaching practices and assessment strategies can be adapted to mitigate fluency-driven misjudgments while preserving the potential benefits of AI-supported learning.

6. Pedagogical and Assessment Implications

Recognizing fluency illusion as an important explanatory factor in AI-mediated learning requires reconsidering how generative AI tools such as ChatGPT are integrated into instructional and assessment practices [51]. Rather than framing AI use as inherently beneficial or inherently problematic, a fluency-based perspective highlights the conditions under which fluent AI output supports learning and the conditions under which it may distort learners’ judgments of understanding. The pedagogical implications discussed in this section therefore focus on instructional and assessment strategies that mitigate fluency-driven misjudgments while preserving the learning benefits of AI-supported activities. Accordingly, Figure 5 summarizes the instructional, assessment, and metacognitive design strategies discussed in this section for reducing fluency-driven overconfidence while maintaining the advantages of AI-supported learning.

6.1. Managing the Risk of Over-Reliance

One of the most immediate pedagogical concerns associated with fluency illusion is the risk of over-reliance on AI-generated content [14]. When students experience repeated success through interaction with fluent AI responses, they may increasingly defer cognitive responsibility to the tool. This shift can reduce engagement in effortful processes such as problem formulation, hypothesis testing, and self-explanation. Over time, learners may come to associate understanding with access to fluent output rather than with their own capacity to reason or apply knowledge.
Instructional design can address this risk by deliberately structuring AI use as a support rather than a substitute for cognitive effort [52]. For example, activities that require students to generate initial responses before consulting ChatGPT, or to compare their own reasoning with AI-generated explanations, can help maintain the learner’s role as the primary agent of understanding. Such approaches do not restrict access to AI but instead reposition it within a learning sequence that prioritizes active engagement.
Early classroom experiences suggest that such strategies can often be implemented without major changes to existing course structures. For example, some instructors ask students to submit an initial attempt at solving a problem or outlining an argument before consulting ChatGPT. Others incorporate brief comparison activities in which students examine differences between their own reasoning and AI-generated explanations. Although systematic evaluations of these practices are still limited, preliminary classroom reports suggest that they can encourage more active engagement with the material while preserving the practical advantages of AI-supported assistance.
In practice, several simple strategies can help instructors integrate AI tools while maintaining active learning. For example, students may first attempt a problem or write a draft before consulting ChatGPT. Instructors can also ask students to explain how they used AI assistance or to compare their own reasoning with the AI-generated explanation. These approaches help keep students actively engaged while still allowing them to benefit from AI-supported guidance [53,54].

6.2. Rethinking Feedback Practices

ChatGPT’s ability to provide immediate, detailed, and linguistically polished feedback has clear appeal in classroom contexts, particularly where instructor feedback is limited [55]. However, from the perspective of fluency illusion, the clarity and confidence of AI-generated feedback can be misleading. Students may interpret fluent feedback as authoritative and complete, reducing their inclination to question, revise, or seek alternative perspectives.
To counter this tendency, feedback practices should emphasize interpretation rather than acceptance. Instructors can encourage students to treat AI feedback as a starting point for reflection by requiring them to explain how they used the feedback, which aspects they accepted or rejected, and why. Embedding reflective prompts or requiring justification of revisions can shift attention from the fluency of feedback to its alignment with learning objectives and disciplinary standards.
Emerging classroom experiences provide initial examples of how such strategies can be implemented in practice. In some courses, instructors ask students to submit an initial solution or written draft before consulting ChatGPT, followed by a short reflection comparing their own reasoning with AI-generated responses. Other implementations include brief in-class explanations, oral follow-up questions, or reflective prompts that require students to justify how they used AI assistance. These approaches can encourage active engagement with the material while still allowing students to benefit from AI-supported guidance. However, such practices also raise practical considerations related to instructor workload and institutional policies.
Implementing these practices also requires attention to practical constraints faced by instructors. In large classes, closely reviewing AI-assisted revisions may increase instructor workload if additional monitoring or reflection tasks are introduced. Some instructors have therefore experimented with short reflection prompts in which students explain how they interpreted and applied AI-generated feedback. These brief reflections can provide insight into students’ reasoning without substantially increasing grading demands. At an institutional level, clearer guidance on appropriate uses of AI-generated feedback may also help instructors design assignments that encourage critical engagement rather than passive acceptance.

6.3. Designing Assessments to Counter Fluency Illusion

Assessment design plays a critical role in either amplifying or mitigating fluency illusion [56]. Traditional assessments that emphasize polished written responses or procedural correctness may inadvertently reward fluency-driven performance, particularly when AI assistance is available. In such cases, high-quality output may mask limited understanding, making it difficult to distinguish between independent competence and tool-supported performance.
Assessments that foreground explanation, reasoning, and transfer are better positioned to counter fluency illusion [57]. Tasks that require students to articulate their thinking, apply concepts in novel contexts, or respond to follow-up questions can reveal the extent to which understanding has been internalized. Oral assessments, reflective components, and process-oriented submissions can further support this goal by shifting emphasis from final products to learning trajectories. Importantly, these approaches do not require the exclusion of AI but rather make learners’ engagement with content and tools more transparent.
At the same time, implementing such assessment approaches requires consideration of practical constraints. Large-enrollment courses, standardized assessment formats, and institutional policies may limit the use of extensive oral examinations or complex project-based evaluations. In such cases, hybrid approaches may offer a workable alternative. For example, instructors may combine AI-supported practice activities with shorter in-class explanations, brief oral follow-up questions, or reflective summaries in which students explain the reasoning behind their responses. These formats can maintain logistical feasibility while still encouraging students to demonstrate independent understanding.

6.4. Supporting Metacognitive Scaffolding

Given that fluency illusion operates primarily through distorted metacognitive judgment, explicit metacognitive scaffolding is essential [58]. Students cannot be expected to recognize fluency-driven misjudgments without guidance, particularly in AI-rich learning environments where fluent output is the norm. Instructional interventions that make the distinction between perceived understanding and demonstrated understanding explicit can help learners develop more accurate self-monitoring skills.
Metacognitive scaffolding can take the form of structured self-assessment prompts, prediction and reflection activities, or guided comparisons between AI-assisted and unassisted performance [59]. For instance, asking students to predict their performance before assessment or to reflect on discrepancies between confidence and outcomes can surface fluency effects that would otherwise remain implicit. Over time, such practices can foster more calibrated judgments of learning and reduce dependence on fluency as a heuristic.
An important conceptual tension arises when generative AI tools themselves are used to deliver metacognitive scaffolding. While reflective prompts and self-assessment activities are intended to help students monitor and evaluate their own understanding, AI-generated reflection templates may introduce a secondary layer of fluency. When prompts appear pedagogically sophisticated and easy to complete, students may experience a sense of metacognitive engagement without necessarily performing the underlying evaluative processes. In such cases, learners may follow the structure of AI-generated reflection rather than actively interrogating their own reasoning. This possibility suggests that metacognitive scaffolding supported by AI should be designed in ways that require explanation, justification, or independent articulation rather than passive completion of reflection prompts.
Assessing the effectiveness of these interventions also requires appropriate ways of measuring metacognitive calibration. Future studies could combine traditional learning assessments with measures that capture the relationship between confidence and performance. For instance, students may be asked to estimate how well they expect to perform before completing a task and later compare those estimates with their actual results. Such approaches can help identify situations in which fluent AI-generated explanations increase perceived understanding without corresponding improvements in learning outcomes. Experimental studies that compare AI-assisted and non-assisted learning conditions could also provide useful evidence on how different forms of scaffolding influence students’ judgments of understanding.

6.5. Implications for Instructional Culture

Beyond individual practices, addressing fluency illusion requires a broader shift in instructional culture [60]. When classrooms prioritize efficiency, coverage, or output quality without attending to how understanding is constructed, fluency-driven misjudgments are more likely to persist. By contrast, environments that value explanation, questioning, and intellectual struggle create space for learners to recognize the limits of fluent assistance.
In this sense, the pedagogical challenge posed by ChatGPT is not unique but rather an amplification of long-standing tensions between performance and learning [61]. Fluency illusion provides a lens through which educators can critically examine these tensions and design learning environments that leverage AI’s capabilities while safeguarding the development of genuine understanding.

6.6. Implementation Considerations Across Instructional Contexts

The feasibility of these strategies may vary across instructional contexts and institutional settings. Although the empirical evidence base remains limited, emerging studies on AI-supported learning environments provide initial indications of how such approaches can be implemented in practice. In large-enrollment courses, structured comparison tasks and reflective prompts have been used to encourage students to critically evaluate AI-generated explanations without requiring extensive additional grading by instructors. In resource-constrained settings, simple interventions such as requiring students to explain or justify AI-assisted answers have been shown to promote deeper engagement with the material while preserving the efficiency benefits of AI support.
In educational systems with high-stakes assessment cultures, instructors have increasingly experimented with hybrid approaches that combine AI-supported practice activities with supervised or oral assessments designed to verify independent understanding. Although systematic evaluation of these interventions remains limited, early classroom implementations suggest that pedagogical designs emphasizing explanation, comparison, and reflection can mitigate some of the fluency-related risks associated with generative AI use.

7. Gaps and Future Research Directions

Future research on AI-supported learning should focus on several key priorities. One important direction is examining whether fluent AI explanations increase the gap between students’ perceived understanding and their actual performance. Another area concerns identifying the conditions under which AI support contributes to deeper learning rather than simply enabling faster task completion. In addition, research is needed to evaluate instructional approaches that allow students to benefit from AI tools while maintaining critical thinking and independent reasoning.
Although research on generative AI in education has expanded rapidly, the empirical evidence base remains relatively limited and uneven across methodological approaches [9,34]. In the short term, an important priority for future research is to develop controlled experimental and classroom-based studies that directly examine how AI-generated fluency influences learners’ judgments of understanding, confidence, and learning outcomes. Over a longer time horizon, research should move toward broader investigations that explore how these cognitive effects interact with instructional design, disciplinary contexts, and students’ developing metacognitive skills. Establishing such a progression of research priorities may help clarify which questions can be addressed immediately and which require more sustained empirical investigation.
While the existing literature offers valuable insights into how ChatGPT is being adopted in classroom settings, it remains fragmented in ways that limit cumulative understanding of its cognitive and pedagogical effects [34]. In particular, the phenomenon of fluency illusion, though implicit in many findings, has yet to be systematically examined as a central explanatory construct. Addressing this gap requires a research agenda that moves beyond short-term evaluations and surface-level outcomes toward designs that capture learning processes, metacognitive dynamics, and classroom realities over time. Table 1 illustrates the key gaps and future research directions in AI-mediated learning.

7.1. Moving Beyond Short-Term and Cross-Sectional Designs

A prominent limitation of current research lies in its reliance on short-term and cross-sectional studies. Many investigations examine students’ perceptions, performance, or satisfaction following brief exposure to ChatGPT, often within a single task or instructional unit [46,62]. While such studies are useful for identifying immediate affordances and concerns, they provide limited insight into how fluency-driven misjudgments develop, stabilize, or attenuate over time. Longitudinal research is needed to examine whether repeated interaction with fluent AI output leads to enduring changes in learners’ metacognitive calibration, study strategies, or dependence on external support.
Longer-term designs would also allow researchers to explore how instructional interventions influence the trajectory of fluency illusion. For example, studies could examine whether early exposure to metacognitive scaffolding alters how students interpret fluent AI output across a semester or academic year. Such work would provide a more robust empirical foundation for pedagogical recommendations.

7.2. Reducing Overreliance on Self-Reported Learning

Another recurring feature of the literature is the heavy reliance on self-reported measures of learning, confidence, and perceived usefulness [63]. While these measures are informative, they are particularly susceptible to fluency effects, as learners’ judgments may be directly shaped by the clarity and coherence of AI-generated responses. As a result, positive self-reports may reflect perceived understanding rather than demonstrable learning.
Future research would benefit from triangulating self-reports with performance-based and process-oriented measures. Designs that compare perceived understanding with independent task performance, delayed transfer, or error detection can help disentangle subjective experience from learning outcomes. Such approaches are especially important for studying fluency illusion, as the construct itself concerns the divergence between perception and competence.

7.3. Examining Metacognition as a Central Variable

Although metacognition is often invoked in discussions of AI-supported learning, it is rarely treated as a primary object of empirical investigation [53]. Few studies directly measure how students monitor, evaluate, and regulate their learning when interacting with ChatGPT. This represents a significant gap, given that fluency illusion operates through metacognitive judgment rather than through content acquisition alone.
Future work should more explicitly examine how learners form judgments of understanding in AI-mediated contexts and how these judgments influence subsequent learning behavior. Experimental and mixed-methods studies that capture metacognitive processes, such as confidence calibration, prediction accuracy, and reflective reasoning, would deepen understanding of when and for whom fluency illusion is most likely to occur. Attention to individual differences, such as prior knowledge or epistemic beliefs, may further refine this line of inquiry.

7.4. Advancing Classroom-Level and Ecologically Valid Research

Much of the existing research on ChatGPT in education is conducted under controlled or simulated conditions, often outside of authentic classroom settings [54]. While such designs offer methodological clarity, they may overlook interactional and contextual factors that shape how fluency illusion manifests in practice. Classroom-level observational research is needed to capture how students actually integrate AI tools into their learning routines and how instructors respond to emerging patterns of use.
Naturalistic studies that combine observation, artifact analysis, and instructor interviews can shed light on how fluency-driven perceptions influence participation, feedback uptake, and assessment performance. Such work would also help clarify how institutional norms, assessment practices, and disciplinary cultures interact with AI-generated fluency to shape learning experiences.

7.5. Toward Theory-Driven and Cumulative Research

Finally, future research would benefit from more explicit theoretical integration. Many studies implicitly reference concepts related to fluency, confidence, or overreliance, yet few situate their findings within a coherent cognitive or educational framework. Treating fluency illusion as an organizing construct can facilitate cumulative knowledge building by providing shared terminology and explanatory mechanisms.
By aligning empirical designs with theory-driven questions about perception, regulation, and learning, future research can move beyond isolated findings toward a more coherent understanding of ChatGPT’s role in education. Such an approach not only advances scholarship but also supports the development of pedagogical practices that are responsive to the cognitive realities of AI-mediated learning.
Future research would also benefit from more systematic experimental designs that isolate the cognitive mechanisms underlying fluency illusion in AI-supported learning environments. For example, studies could compare learning outcomes across conditions in which students receive fluent AI-generated explanations, less polished explanations, or no external assistance. Such comparisons could examine how differences in fluency influence confidence judgments, perceived understanding, and performance on transfer tasks. Longitudinal studies may also help clarify whether repeated reliance on fluent AI output affects the development of self-regulated learning strategies over time. Framing these investigations around clearly defined research questions will help move the field beyond descriptive observations toward a more cumulative understanding of AI-mediated learning.
Building on the research gaps identified in this review, several research directions emerge that may guide future investigation. Future research should examine the extent to which exposure to fluent AI-generated explanations increases the gap between perceived understanding and actual learning performance. It should also investigate the instructional conditions under which AI assistance supports conceptual transfer rather than only immediate task completion. Another important direction is to explore how learner characteristics, such as prior knowledge, digital literacy, and metacognitive monitoring ability, influence susceptibility to fluency-driven misjudgments. Finally, future studies should evaluate which instructional and assessment strategies are most effective in reducing overconfidence while preserving the potential learning benefits of generative AI tools.

8. Descriptive Bibliometric Overview

The purpose of this bibliometric overview is not to provide a comprehensive scientometric analysis but rather to offer a descriptive snapshot of the rapidly emerging literature on ChatGPT in educational contexts. The analysis aims to illustrate publication trends and the disciplinary spread of research on generative AI in education, thereby situating the present conceptual review within the broader scholarly landscape.
Because the objective of this section is to provide a descriptive overview rather than a full bibliometric mapping, advanced bibliometric network analyses such as co-citation analysis, bibliographic coupling, or keyword co-occurrence mapping were not conducted.
The Dimensions database [64] was used to retrieve publication metadata using a keyword-based search applied to titles and abstracts. The search query used in the database was: (“ChatGPT” OR “generative AI” OR “large language model”) AND (education OR learning OR classroom OR teaching). The search focused primarily on peer-reviewed journal articles published between 2022 and 2025, corresponding to the period following the public release of ChatGPT and the early development of related educational research. The database query was conducted in January 2026, which resulted in the retrieval of a small number of records indexed in early 2026. Only articles indexed under education- and learning-related Fields of Research were included in order to maintain relevance and consistency across the dataset. The initial search retrieved 112 publications. After screening titles and abstracts according to the inclusion and exclusion criteria described in Section 2, duplicates and non-relevant records were removed, resulting in a final review corpus of 41 studies. Details of the data collection settings, search criteria, and applied filters are summarized in Table 3.
Although the Dimensions database provides broad coverage of scholarly publications, reliance on a single database introduces certain limitations and may exclude relevant publications indexed in other repositories. Some journals, particularly those published in non-English-speaking regions or those not fully indexed in major repositories, may be underrepresented. As a result, the disciplinary distribution presented in the bibliometric analysis should be interpreted as indicative rather than exhaustive.
Figure 6 presents the annual number of publications and citations related to ChatGPT in educational research. The figure shows a rapid growth of academic interest in this topic following an initial emergence phase that aligns with the public release of ChatGPT. The increase in publication volume is accompanied by a steady rise in citation counts, indicating that the topic has not only expanded quickly but has also attracted meaningful scholarly attention within a short period of time. This citation trend suggests that the educational research community is actively engaging with questions related to the pedagogical implications, opportunities, and challenges of large language models such as ChatGPT.
Figure 7 shows the distribution of publications across Fields of Research based on the ANZSRC classification. As expected, the majority of studies fall within education-related fields, reflecting the strong pedagogical focus of this research area. At the same time, notable contributions are observed in Language and Communication Studies, Information and Computing Sciences, and Psychology. This distribution highlights the multidisciplinary nature of research on ChatGPT in education and reflects growing collaboration across disciplinary boundaries.
Overall, the bibliometric analysis reveals clear and consistent patterns indicating that research on ChatGPT in education is rapidly growing. The sharp increase in publications and citations reflects strong academic interest, while the disciplinary distribution shows that this topic extends beyond education into language, computing, and psychological research. These findings support the relevance and timeliness of the present study and underline its contribution to an active and expanding research field.
The bibliometric trends observed in this analysis also provide useful context for interpreting the conceptual focus of the present review. The rapid increase in publications following the introduction of ChatGPT reflects strong interdisciplinary interest in generative AI across education, computer science, and related fields. However, the diversity of disciplinary perspectives also suggests that the theoretical mechanisms underlying AI-supported learning remain fragmented. Many studies focus primarily on practical applications or instructional outcomes, while fewer explicitly engage with cognitive or metacognitive processes that shape learners’ interpretations of fluent AI-generated output. In this sense, the growing body of literature highlights the need for clearer conceptual frameworks, such as the fluency illusion perspective developed in this review, to explain how highly coherent AI responses may influence learners’ judgments of understanding.

9. Conclusions

The growing presence of generative artificial intelligence tools such as ChatGPT in classroom settings has introduced new possibilities for learning support while also raising important questions about how students interpret and evaluate their own understanding. This review has argued that many of the mixed and sometimes contradictory findings in the literature can be more coherently explained through the concept of fluency illusion. When AI-generated output is linguistically smooth, well structured, and immediately responsive, learners may experience a strong sense of clarity that is mistakenly attributed to their own understanding rather than to the fluency of the system itself.
By integrating research from cognitive psychology, metacognition, and empirical studies on ChatGPT use in education, this article highlights the role of fluency illusion in shaping learners’ judgments, engagement, and self-regulation in AI-mediated learning environments. Across writing, problem solving, feedback, and assessment contexts, the literature consistently suggests that increased confidence and perceived learning are not always accompanied by deep or transferable understanding. These patterns are not best understood as simple misuse of technology but as predictable cognitive responses to fluent external support.
From a pedagogical perspective, the findings highlight the need to move beyond evaluating learning solely on the basis of polished outputs or efficient task completion. Instructional and assessment practices should be designed to make students’ reasoning, explanation, and transfer of knowledge visible, particularly in contexts where fluent AI assistance is available. Explicit metacognitive scaffolding can help learners distinguish between perceived understanding and demonstrated competence, reducing overreliance on fluency as a cue for learning success.
Several limitations of this review should also be acknowledged. The study adopts a narrative and conceptual approach rather than a systematic meta-analysis, and the framework proposed in this article should therefore be interpreted as a theoretical synthesis rather than an empirically validated model. In addition, the empirical literature on generative AI in education remains relatively recent and methodologically diverse. While a growing number of classroom-based studies provide valuable insights into how students interact with AI systems, many of the cognitive mechanisms discussed in this review, including the role of fluency illusion in shaping learners’ judgments of understanding, require further empirical investigation.
Future research should examine fluency illusion in AI-mediated learning through longitudinal and classroom-based designs that capture how learners’ judgments and strategies evolve over time. Greater emphasis on performance-based and process-oriented measures, alongside direct assessments of metacognitive calibration, will be essential for understanding when and for whom fluency-driven misjudgments are most likely to occur. By adopting fluency illusion as an organizing framework, future work can support the development of educational practices that leverage the benefits of generative artificial intelligence while safeguarding meaningful and durable learning.

Author Contributions

S.K.: Conceptualization, methodology, formal analysis, investigation, visualization, writing—original draft, writing—review & editing, Supervision. A.M.: Investigation, literature review, visualization, writing—original draft. O.V.: Investigation, validation, writing—review & editing. All authors have read and agreed to the published version of the manuscript.

Funding

The article received no external funding.

Institutional Review Board Statement

Not Applicable.

Informed Consent Statement

Not Applicable.

Data Availability Statement

Data sharing is not applicable to this article as no data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Alqahtani, T.; Badreldin, H.A.; Alrashed, M.; Alshaya, A.I.; Alghamdi, S.S.; Saleh, K.B.; Alowais, S.A.; Alshaya, O.A.; Rahman, I.; Al Yami, M.S.; et al. The emergent role of artificial intelligence, natural learning processing, and large language models in higher education and research. Res. Soc. Adm. Pharm. 2023, 19, 1236–1242. [Google Scholar] [CrossRef]
  2. Emara, I.F. A linguistic comparison between ChatGPT-generated and nonnative student-generated short story adaptations: A stylometric approach. Smart Learn. Environ. 2025, 12, 36. [Google Scholar] [CrossRef]
  3. Meyer, J.G.; Urbanowicz, R.J.; Martin, P.C.N.; O’Connor, K.; Li, R.; Peng, P.-C.; Bright, T.J.; Tatonetti, N.; Won, K.J.; Gonzalez-Hernandez, G.; et al. ChatGPT and large language models in academia: Opportunities and challenges. BioData Min. 2023, 16, 20. [Google Scholar] [CrossRef]
  4. Mekheimer, M. Generative AI-assisted feedback and EFL writing: A study on proficiency, revision frequency and writing quality. Discov. Educ. 2025, 4, 170. [Google Scholar] [CrossRef]
  5. Anthonio, T. Linguistically-Informed Modelling of Potentials for Misunderstanding. Ph.D. Thesis, Universität Stuttgart, Stuttgart, Germany, 2024. [Google Scholar]
  6. Lee, S.; Song, K.-S. Teachers’ and students’ perceptions of AI-generated concept explanations: Implications for integrating generative AI in computer science education. Comput. Educ. Artif. Intell. 2024, 7, 100283. [Google Scholar] [CrossRef]
  7. Heersmink, R.; de Rooij, B.; Vázquez, M.J.C.; Colombo, M. A phenomenology and epistemology of large language models: Transparency, trust, and trustworthiness. Ethics Inf. Technol. 2024, 26, 41. [Google Scholar] [CrossRef]
  8. Metzger, M.J.; Flanagin, A.J. Credibility and trust of information in online environments: The use of cognitive heuristics. J. Pragmat. 2013, 59, 210–220. [Google Scholar] [CrossRef]
  9. Lee, H.-Y.; Huang, Y.-M.; Wu, T.-T. ChatGPT in Education: A Systematic Review of Current Landscape, Limitations and Future Directions Through General System Theory Lens. Eur. J. Educ. 2025, 60, e70262. [Google Scholar] [CrossRef]
  10. Punar Özçelik, N.; Ekşi, G.Y. Cultivating writing skills: The role of ChatGPT as a learning assistant—A case study. Smart Learn. Environ. 2024, 11, 10. [Google Scholar] [CrossRef]
  11. Wang, C. Exploring students’ generative AI-assisted writing processes: Perceptions and experiences from native and nonnative English speakers. Technol. Knowl. Learn. 2025, 30, 1825–1846. [Google Scholar] [CrossRef]
  12. Khampusaen, D. The Impact of ChatGPT on Academic Writing Skills and Knowledge: An Investigation of Its Use in Argumentative Essays. Learn. J. Lang. Educ. Acquis. Res. Netw. 2025, 18, 963–988. [Google Scholar]
  13. Zhang, M.; Osman, K. Exploring the Implementation and Challenges of the ChatGPT in STEM Higher Education: A Systematic Literature Review. In 2024 Yearbook Emerging Technologies in Learning; Springer: Berlin/Heidelberg, Germany, 2025; pp. 3–22. [Google Scholar]
  14. Zhai, C.; Wibowo, S.; Li, L.D. The effects of over-reliance on AI dialogue systems on students’ cognitive abilities: A systematic review. Smart Learn. Environ. 2024, 11, 28. [Google Scholar] [CrossRef]
  15. Polozov, A. The deep illusion: A critical analysis of DeepSeek and the limits of large language models. AI Ethics 2025, 5, 5587–5596. [Google Scholar] [CrossRef]
  16. Ou, A.W.; Stöhr, C.; Malmström, H. Academic communication with AI-powered language tools in higher education: From a post-humanist perspective. System 2024, 121, 103225. [Google Scholar] [CrossRef]
  17. Reber, R.; Greifeneder, R. Processing fluency in education: How metacognitive feelings shape learning, belief formation, and affect. Educ. Psychol. 2017, 52, 84–103. [Google Scholar] [CrossRef]
  18. Unkelbach, C.; Greifeneder, R. A general model of fluency effects in judgment and decision making. In The Experience of Thinking; Psychology Press: Hove, UK, 2013; pp. 11–32. [Google Scholar]
  19. Finn, B.; Tauber, S.K. When confidence is not a signal of knowing: How students’ experiences and beliefs about processing fluency can lead to miscalibrated confidence. Educ. Psychol. Rev. 2015, 27, 567–586. [Google Scholar] [CrossRef]
  20. Sobel, D.M.; Kushnir, T. Knowledge matters: How children evaluate the reliability of testimony as a process of rational inference. Psychol. Rev. 2013, 120, 779. [Google Scholar] [CrossRef] [PubMed]
  21. Hacker, D.J.; Dunlosky, J.; Graesser, A.C. (Eds.) Handbook of Metacognition in Education; Routledge: New York, NY, USA, 2009. [Google Scholar]
  22. Koriat, A. Monitoring one’s own knowledge during study: A cue-utilization approach to judgments of learning. J. Exp. Psychol. Gen. 1997, 126, 349. [Google Scholar] [CrossRef]
  23. Dunlosky, J.; Metcalfe, J. Metacognition; Sage Publications: Thousand Oaks, CA, USA, 2008. [Google Scholar]
  24. Rozenblit, L.; Keil, F. The misunderstood limits of folk science: An illusion of explanatory depth. Cogn. Sci. 2002, 26, 521–562. [Google Scholar] [CrossRef] [PubMed]
  25. Dinsmore, D.L.; Alexander, P.A. A critical discussion of deep and surface processing: What it means, how it is measured, the role of context, and model specification. Educ. Psychol. Rev. 2012, 24, 499–567. [Google Scholar] [CrossRef]
  26. Sweller, J. Cognitive load during problem solving: Effects on learning. Cogn. Sci. 1988, 12, 257–285. [Google Scholar] [CrossRef]
  27. Sweller, J.; Van Merriënboer, J.J.; Paas, F. Cognitive architecture and instructional design: 20 years later. Educ. Psychol. Rev. 2019, 31, 261–292. [Google Scholar] [CrossRef]
  28. Bjork, E.L.; Bjork, R.A. Making things hard on yourself, but in a good way: Creating desirable difficulties to enhance learning. In Psychology and the Real World: Essays Illustrating Fundamental Contributions to Society; Worth Publishers: New York, NY, USA, 2011; Volume 2, pp. 56–64. [Google Scholar]
  29. Wagner, M. Adaptive Educational Strategies for Creative Fields in the Era of Generative AI: A Framework for Navigating Technological Uncertainty. In ICERI2024 Proceedings; IATED: Valencia, Spain, 2024; pp. 3068–3075. [Google Scholar]
  30. Kolhatin, A.O. From automation to augmentation: A human-centered framework for generative AI in adaptive educational content creation. In CEUR Workshop Proceedings; CEUR-WS.org: Aachen, Germany, 2025; Volume 4060, pp. 143–195. [Google Scholar]
  31. Grifoni, B.E.; Ferrario, R. The bewitching AI: The Illusion of Communication with Large Language Models. Philos. Technol. 2025, 38, 61. [Google Scholar] [CrossRef]
  32. Abdelhalim, S.M.; Alsehibany, R. Integrating ChatGPT for vocabulary learning and retention: A classroom-based study of Saudi EFL learners. Lang. Learn. Technol. 2025, 29, 1–24. [Google Scholar] [CrossRef]
  33. Murayama, K.; Blake, A.B.; Kerr, T.; Castel, A.D. When enough is not enough: Information overload and metacognitive decisions to stop studying information. J. Exp. Psychol. Learn. Mem. Cogn. 2016, 42, 914. [Google Scholar] [CrossRef]
  34. Albadarin, Y.; Saqr, M.; Pope, N.; Tukiainen, M. A systematic literature review of empirical research on ChatGPT in education. Discov. Educ. 2024, 3, 60. [Google Scholar] [CrossRef]
  35. Li, Y. Usability of ChatGPT in Second Language Acquisition: Capabilities, Effectiveness, Applications, Challenges, and Solutions. Stud. Appl. Linguist. TESOL 2024, 24, 24–37. [Google Scholar] [CrossRef]
  36. Reza, M.; Thomas-Mitchell, J.; Dushniku, P.; Laundry, N.; Williams, J.J.; Kuzminykh, A. Co-writing with ai, on human terms: Aligning research with user demands across the writing process. Proc. ACM Hum.-Comput. Interact. 2025, 9, CSCW385. [Google Scholar] [CrossRef]
  37. Baron, N.S. Who Wrote This?: How AI and the Lure of Efficiency Threaten Human Writing; Stanford University Press: Redwood City, CA, USA, 2023. [Google Scholar]
  38. Jiang, F.K.; Hyland, K. Metadiscursive nouns in academic argument: ChatGPT vs student practices. J. Engl. Acad. Purp. 2025, 75, 101514. [Google Scholar] [CrossRef]
  39. Ismail, H.Y.S. Cohesion and coherence in essays generated by chatgpt: A comparative analysis to university students’ writing. CDELT Occas. Pap. Dev. Engl. Educ. 2023, 83, 143–165. [Google Scholar] [CrossRef]
  40. Ngu, B.H.; Phan, H.P. Instructional approach and acquisition of mathematical proficiency: Theoretical insights from learning by comparison and cognitive load theory. Asian J. Math. Educ. 2024, 3, 357–379. [Google Scholar] [CrossRef]
  41. Graefen, B.; Fazal, N. From Chat bots to Virtual Tutors: An Overview of Chat GPT’s Role in the Future of Education. Arch. Pharm. Pract. 2024, 15, 43–52. [Google Scholar] [CrossRef]
  42. Cavalcanti, A.P.; Barbosa, A.; Carvalho, R.; Freitas, F.; Tsai, Y.S.; Gašević, D.; Mello, R.F. Automatic feedback in online learning environments: A systematic literature review. Comput. Educ. Artif. Intell. 2021, 2, 100027. [Google Scholar] [CrossRef]
  43. Mulenga, R.; Shilongo, H. Academic integrity in higher education: Understanding and addressing plagiarism. Acta Pedagog. Asiana 2024, 3, 30–43. [Google Scholar] [CrossRef]
  44. Kasneci, E.; Seßler, K.; Küchemann, S.; Bannert, M.; Dementieva, D.; Fischer, F.; Gasser, U.; Groh, G.; Günnemann, S.; Hüllermeier, E.; et al. ChatGPT for good? On opportunities and challenges of large language models for education. Learn. Individ. Differ. 2023, 103, 102274. [Google Scholar] [CrossRef]
  45. Holmes, W.; Miao, F. Guidance for Generative AI in Education and Research; Unesco Publishing: Paris, France, 2023. [Google Scholar]
  46. Jo, H. From concerns to benefits: A comprehensive study of ChatGPT usage in education. Int. J. Educ. Technol. High. Educ. 2024, 21, 35. [Google Scholar] [CrossRef]
  47. Blankenship, R. Pedagogical models and generative AI fluency: A three-tiered empirical framework approach. In Exploring New Horizons: Generative Artificial Intelligence and Teacher Education; Association for the Advancement of Computing in Education: Waynesville, NC, USA, 2024; p. 39. [Google Scholar]
  48. Turo, S.; Collin, F.; Brouillet, D. When the sense of fluency triggers an attentional bias. Q. J. Exp. Psychol. 2023, 76, 350–360. [Google Scholar] [CrossRef] [PubMed]
  49. Parks, C.M.; Toth, J.P. Fluency, familiarity, aging, and the illusion of truth. Aging Neuropsychol. Cogn. 2006, 13, 225–253. [Google Scholar] [CrossRef] [PubMed]
  50. Namaziandost, E.; Rezai, A. Artificial intelligence in open and distributed learning: Does it facilitate or hinder teaching and learning? Int. Rev. Res. Open Distrib. Learn. 2024, 25, i–vii. [Google Scholar] [CrossRef]
  51. Cianferoni, F. AI in Higher Education: From Literature to a Course-Anchored Chatbot. Ph.D. Thesis, Politecnico di Torino, Torino, Italy, 2025. [Google Scholar]
  52. Gkintoni, E.; Antonopoulou, H.; Sortwell, A.; Halkiopoulos, C. Challenging cognitive load theory: The role of educational neuroscience and artificial intelligence in redefining learning efficacy. Brain Sci. 2025, 15, 203. [Google Scholar] [CrossRef]
  53. Levin, I.; Marom, M.; Kojukhov, A. Rethinking AI in Education: Highlighting the Metacognitive Challenge. BRAIN Broad Res. Artif. Intell. Neurosci. 2025, 16, 250–263. [Google Scholar] [CrossRef]
  54. Alshahrani, K.; Qureshi, R.J. Review the prospects and obstacles of AI-enhanced learning environments: The role of ChatGPT in education. Int. J. Mod. Educ. Comput. Sci. 2024, 16, 71–86. [Google Scholar] [CrossRef]
  55. Lundberg, S. ChatGPT vs. Teacher Feedback Provision: An Investigation on the Efficacy of ChatGPT Feedback Provision on Written Production Across Proficiency Levels; Stockholm University: Stockholm, Sweden, 2024. [Google Scholar]
  56. Parsons, P. Promoting representational fluency for cognitive bias mitigation in information visualization. In Cognitive Biases in Visualizations; Springer International Publishing: Cham, Switzerland, 2018; pp. 137–147. [Google Scholar]
  57. Topolinski, S. The sources of fluency: Identifying the underlying mechanisms of fluency effects. In The Experience of Thinking; Psychology Press: Hove, UK, 2013; pp. 33–49. [Google Scholar]
  58. Schwartz, B.L.; Jemstedt, A. The role of fluency and dysfluency in metacognitive experiences. In Trends and Prospects in Metacognition Research Across the Life Span: A Tribute to Anastasia Efklides; Springer International Publishing: Cham, Switzerland, 2021; pp. 25–40. [Google Scholar]
  59. Jang, H.; Kim, S.; So, H.-J. Technology-based scaffolding for ill-structured problem-solving: A systematic review and emerging directions with AI. Learn. Res. Pract. 2025, 1–24. [Google Scholar] [CrossRef]
  60. Watts, N.; Galvin, C. Aligning Professional Identity with Institutional Culture: The Role of Educators’ Digital Fluency in Harnessing the Potential of Online and Technology Enhanced Learning. In Proceedings of the 2019 ICDE World Conference on Online Learning; Brown, M., Nic Giolla Mhichíl, M., Beirne, E., Costello, E., Eds.; Dublin City University: Dublin, Ireland, 2020; Volume 1. [Google Scholar]
  61. Baskara, F.X.R. ChatGPT and Critical Digital Pedagogy: Examining the Potential and Challenges for Educational Practice. Proceeding Int. Conf. Inov. Sci. Technol. Educ. Child. Health 2024, 4, 57–73. [Google Scholar]
  62. Almulla, M.A. Investigating influencing factors of learning satisfaction in AI ChatGPT for research: University students perspective. Heliyon 2024, 10, e32220. [Google Scholar] [CrossRef]
  63. Christensen, R.; Knezek, G. Self-report measures and findings for information technology attitudes and competencies. In International Handbook of Information Technology in Primary and Secondary Education; Springer: Boston, MA, USA, 2008; pp. 349–365. [Google Scholar]
  64. Dimensions. Bibliometric Analysis Portal. Available online: https://app.dimensions.ai/discover/publication (accessed on 15 January 2026).
Figure 1. PRISMA-style flow diagram illustrating the study selection process for the literature review, including identification, screening, eligibility assessment, and final inclusion of studies examining the use of ChatGPT and related generative AI tools in educational contexts.
Figure 1. PRISMA-style flow diagram illustrating the study selection process for the literature review, including identification, screening, eligibility assessment, and final inclusion of studies examining the use of ChatGPT and related generative AI tools in educational contexts.
Information 17 00299 g001
Figure 2. Conceptual model illustrating the fluency illusion in AI-mediated learning environments.
Figure 2. Conceptual model illustrating the fluency illusion in AI-mediated learning environments.
Information 17 00299 g002
Figure 3. Conceptual distinction between surface performance and deep learning in AI-mediated learning environments.
Figure 3. Conceptual distinction between surface performance and deep learning in AI-mediated learning environments.
Information 17 00299 g003
Figure 4. Fluency-related risks of ChatGPT use across classroom contexts.
Figure 4. Fluency-related risks of ChatGPT use across classroom contexts.
Information 17 00299 g004
Figure 5. Design strategies for reducing fluency-driven misinterpretations of learning when using ChatGPT.
Figure 5. Design strategies for reducing fluency-driven misinterpretations of learning when using ChatGPT.
Information 17 00299 g005
Figure 6. Annual publications and citations related to ChatGPT in educational contexts. The analysis primarily covers the period 2022–2025; limited records indexed in early 2026 are included because the database search was conducted in January 2026 [64].
Figure 6. Annual publications and citations related to ChatGPT in educational contexts. The analysis primarily covers the period 2022–2025; limited records indexed in early 2026 are included because the database search was conducted in January 2026 [64].
Information 17 00299 g006
Figure 7. Distribution of ChatGPT-related publications across ANZSRC Fields of Research. Field codes are shown on the horizontal axis, with corresponding field names listed in Table 3.
Figure 7. Distribution of ChatGPT-related publications across ANZSRC Fields of Research. Field codes are shown on the horizontal axis, with corresponding field names listed in Table 3.
Information 17 00299 g007
Table 1. Key Gaps and Future Research Directions in AI-Mediated Learning.
Table 1. Key Gaps and Future Research Directions in AI-Mediated Learning.
Research GapRationale/ImplicationsSuggested Research Approaches
Lack of longitudinal studiesShort-term studies cannot capture the development of fluency illusion or long-term learning outcomesLongitudinal designs tracking metacognitive calibration, cognitive strategies, and dependence on AI across semesters
Overreliance on self-reported learningSelf-reports may reflect perceived understanding shaped by fluent AI output rather than actual learningTriangulate with performance-based assessments, error detection tasks, and delayed transfer measures
Limited attention to metacognitionFew studies measure how students monitor and regulate their learning with AIIncorporate explicit metacognitive measures such as confidence calibration, reflective prompts, and think-aloud protocols
Need for classroom-level observational researchControlled studies may not capture interactional and contextual factors affecting AI useMixed-methods approaches combining observation, artifact analysis, and instructor interviews in authentic classroom settings
Theory-driven integrationMany studies lack a unifying framework to interpret findingsUse fluency illusion as an organizing construct to connect cognitive, metacognitive, and educational research
Table 2. Common forms of ChatGPT use in writing tasks reported in the literature and their potential learning implications.
Table 2. Common forms of ChatGPT use in writing tasks reported in the literature and their potential learning implications.
Type of AI UseTypical Student ActivityImmediate BenefitPotential Learning Risk
Linguistic supportGrammar correction, paraphrasing, vocabulary suggestionsImproved language fluency and readabilitySurface-level improvement may be mistaken for stronger writing competence [10,39]
Partial assistanceBrainstorming ideas, outlining arguments, revising draftsFaster idea generation and clearer structureStudents may rely on AI for organization without strengthening reasoning [11,36]
Substantial drafting supportAI generates paragraphs or sections based on promptsFaster completion and more polished textReduced engagement in constructing arguments independently [11,12]
Full authorship substitutionAI generates most or all of the final textHigh-quality surface output with minimal effortLimited development of writing, reasoning, and conceptual understanding [12,37]
Table 3. Bibliometric data collection settings, search criteria, and Fields of Research (ANZSRC 2020) included in the analysis.
Table 3. Bibliometric data collection settings, search criteria, and Fields of Research (ANZSRC 2020) included in the analysis.
CategoryDescription
DatabaseDimensions [64]
Search fieldsTitle and abstract
Core keywords(“ChatGPT” OR “generative AI” OR “large language model”) AND (education OR learning OR classroom)
Publication years2022–2025
Publication typeArticles
LanguageEnglish
Fields of Research39 Education (3901 Curriculum and Pedagogy; 3902 Education Policy, Sociology and Philosophy;
3903 Education Systems; 3904 Specialist Studies in Education);
47 Language, Communication and Culture (4701 Communication and Media Studies;
4703 Language Studies; 4704 Linguistics; 4705 Literary Studies);
36 Creative Arts and Writing (3602 Creative and Professional Writing);
52 Psychology (5201 Applied and Developmental Psychology;
5204 Cognitive and Computational Psychology);
46 Information and Computing Sciences (4601 Applied Computing;
4608 Human-Centred Computing; 4609 Information Systems)
Date of data retrieval15 January 2026
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kumar, S.; Mikayelyan, A.; Vorfolomeyeva, O. Fluency Illusion: A Review on Influence of ChatGPT in Classroom Settings. Information 2026, 17, 299. https://doi.org/10.3390/info17030299

AMA Style

Kumar S, Mikayelyan A, Vorfolomeyeva O. Fluency Illusion: A Review on Influence of ChatGPT in Classroom Settings. Information. 2026; 17(3):299. https://doi.org/10.3390/info17030299

Chicago/Turabian Style

Kumar, Sachin, Anna Mikayelyan, and Olga Vorfolomeyeva. 2026. "Fluency Illusion: A Review on Influence of ChatGPT in Classroom Settings" Information 17, no. 3: 299. https://doi.org/10.3390/info17030299

APA Style

Kumar, S., Mikayelyan, A., & Vorfolomeyeva, O. (2026). Fluency Illusion: A Review on Influence of ChatGPT in Classroom Settings. Information, 17(3), 299. https://doi.org/10.3390/info17030299

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop