Dialogical Learning Support in RAG-Based E-Learning
Abstract
1. Introduction
- A dialogically oriented learning support architecture that models interaction as a continuous learning process rather than as isolated question–answer exchanges.
- A bounded context-aware dialogue mechanism that preserves recent interaction history and supports incremental knowledge construction across multiple turns.
- A retrieval-constrained generation approach grounded in curated and verified educational materials.
- A modular and model-agnostic system design allowing flexible integration of different large language models.
- Demonstration of effective retrieval-grounded operation in a lower-resource language setting (Bulgarian).
- A hybrid retrieval strategy combining semantic similarity and lexical matching within a two-stage retrieval pipeline.
- An empirical evaluation framework including retrieval accuracy, generative response quality, groundedness, hallucination analysis, and usefulness assessment.
- A system design aligned with Compliance-by-Design principles in relation to the EU AI Act, supporting transparency and controlled educational use of AI.
2. Related Work
3. Architecture and Design
3.1. Overall Architecture and Design Principles
- knowledge management layer
- response management layer
- dialogical interaction layer
3.2. Knowledge Base Construction and Content Management
3.3. Retrieval and Answer Generation Workflow
3.4. Pedagogical Design and Context Management
3.5. Regulatory and Ethical Constraints in Educational Use of Language Models
4. Modeling and Implementation
4.1. Calculus of Context-Aware Ambients (CCA) Modeling of Learning Scenario
4.2. Implementation and Prototype
4.3. Empirical Retrieval Evaluation and Sensitivity Analysis
4.3.1. Chunk Granularity Sensitivity Analysis
4.3.2. Retrieval Strategy Comparison
- semantic retrieval based on embedding similarity (E5 model with “query:”/“passage:” formulation);
- hybrid retrieval combining semantic similarity and keyword-based ranking.
4.3.3. Comparative Validation of Retrieval Input Size
4.3.4. Failure Analysis and Dialogue Context
4.3.5. Final Configuration
4.4. Generative Response Evaluation
- In several cases, correct conceptual answers were generated even when the retrieved passages did not contain an exact lexical match, suggesting that the system can preserve semantic relevance beyond superficial term overlap.
- Most answers were correct; however, occasional conceptual simplifications were observed in cases requiring fine-grained distinctions between related UML control elements.
- When retrieval failed, the model sometimes produced plausible domain-relevant answers based on prior knowledge. Although the prompt explicitly instructed the model to rely only on the retrieved context, prompt-level constraints did not fully suppress the influence of pretrained parametric knowledge. Such answers occasionally contained oversimplified or partially contradictory examples, highlighting the importance of grounding and evidence traceability in educational settings.
- In several cases, the responses provided concise summaries or explanatory reformulations that remained useful for learning.
5. Discussion and Future Directions
5.1. Architectural and Pedagogical Implications
5.2. Technical Performance and Scalability
5.3. Limitations and Challenges
5.4. Future Research Directions
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Hajjioui, Y.; Zine, O.; Benslimane, M.; Ibriz, A. Intelligent Tutoring Systems: A Review. In Big Data and Internet of Things (BDIoT 2024); Mahboub, O., Haddouch, K., Omara, H., Hefnawi, M., Eds.; Lecture Notes in Networks and Systems; Springer: Cham, Switzerland, 2025; Volume 887, pp. 663–676. [Google Scholar] [CrossRef]
- Guan, R.; Raković, M.; Chen, G. How educational chatbots support self-regulated learning? A systematic review of the literature. Educ. Inf. Technol. 2025, 30, 4493–4518. [Google Scholar] [CrossRef]
- Shi, Y.; Yu, K.; Dong, Y.; Chen, F. Large language models in education: A systematic review of empirical applications, benefits, and challenges. Comput. Educ. Artif. Intell. 2026, 10, 100529. [Google Scholar] [CrossRef]
- Wang, S.; Xu, T.; Li, H.; Zhang, C.; Liang, J.; Tang, J.; Yu, P.; Wen, Q. Large Language Models for Education: A survey and outlook. IEEE Signal Process. Mag. 2025, 42, 51–63. [Google Scholar] [CrossRef]
- Xing, W.; Nixon, N.; Crossley, S.; Denny, P.; Lan, A.; Stamper, J.; Yu, Z. The Use of Large Language Models in Education. Int. J. Artif. Intell. Educ. 2025, 35, 439–443. [Google Scholar] [CrossRef]
- Khong, T.; Saito, E.; Hardy, I.; Gillies, R. Teacher learning through dialogue with colleagues, self and students. Educ. Res. 2023, 65, 170–188. [Google Scholar] [CrossRef]
- Hannigan, T.; McCarthy, I.; Spicer, A. Beware of botshit: How to manage the epistemic risks of generative chatbots. Bus. Horiz. 2024, 67, 471–486. [Google Scholar] [CrossRef]
- Michel-Villarreal, R.; Vilalta-Perdomo, E.; Salinas-Navarro, D.; Thierry-Aguilera, R.; Gerardou, F. Challenges and Opportunities of Generative AI for Higher Education as Explained by ChatGPT. Educ. Sci. 2023, 13, 856. [Google Scholar] [CrossRef]
- Gao, Y.; Xiong, Y.; Gao, X.; Jia, K.; Pan, J.; Bi, Y.; Dai, Y.; Sun, J.; Wang, M.; Wang, H. Retrieval-Augmented Generation for Large Language Models: A Survey. arXiv 2024, arXiv:2312.10997. [Google Scholar] [CrossRef]
- Barnett, S.; Kurniawan, S.; Thudumu, S.; Brannelly, Z.; Abdelrazek, M. Seven Failure Points When Engineering a Retrieval Augmented Generation System. In Proceedings of the IEEE/ACM 3rd International Conference on AI Engineering; Association for Computing Machinery: New York, NY, USA, 2024; pp. 194–199. [Google Scholar] [CrossRef]
- Zhai, C.; Wibowo, S.; Li, L. The effects of over-reliance on AI dialogue systems on students’ cognitive abilities: A systematic review. Smart Learn. Environ. 2024, 11, 28. [Google Scholar] [CrossRef]
- Lewis, P.; Perez, E.; Piktus, A.; Petroni, F.; Karpukhin, V.; Goyal, N.K.H.; Lewis, M.; Yih, W.; Rocktäschel, T.; Riedel, S.; et al. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. arXiv 2020, arXiv:2005.11401. [Google Scholar] [CrossRef]
- Chu, Z.; Wang, S.; Xie, J.; Zhu, T.; Yan, Y.; Ye, J. LLM Agents for Education: Advances and Applications. In Findings of the Association for Computational Linguistics: EMNLP 2025; Association for Computational Linguistics: Stroudsburg, PA, USA, 2025; pp. 13782–13810. [Google Scholar] [CrossRef]
- Oprea, S.-V.; Bâra, A. Transforming Education with Large Language Models. IEEE Access 2025, 13, 87292–87312. [Google Scholar] [CrossRef]
- Hu, B.; Zhu, J.; Pei, Y.; Gu, X. Exploring the potential of LLM to enhance teaching plans through teaching simulation. npj Sci. Learn. 2025, 10, 7. [Google Scholar] [CrossRef]
- Tan, K.; Yao, J.; Pang, T.; Fan, C.; Song, Y. ELF: Educational LLM Framework of Improving and Evaluating AI-generated Content for Classroom Teaching. ACM J. Data Inf. Qual. 2025, 17, 14. [Google Scholar] [CrossRef]
- Sunil, K.; Thakkar, A. SocraticAI: Transforming LLMs into Guided CS Tutors Through Scaffolded Interaction. arXiv 2025, arXiv:2512.03501. [Google Scholar] [CrossRef]
- Sonkar, S.; Liu, N.; Mallick, D.; Baraniuk, R. CLASS: A Design Framework for Building Intelligent Tutoring Systems Based on Learning Science Principles. arXiv 2023, arXiv:2305.13272. [Google Scholar] [CrossRef]
- Lefton, L.; Rong, K.; Dankhara, C.; Ghemri, L.; Kausar, F.; Hamdallahi, A. A Socratic RAG Approach to Connect Natural Language Queries on Research Topics with Knowledge Organization Systems. arXiv 2025, arXiv:2502.15005. [Google Scholar] [CrossRef]
- Du, M.; Xu, B.; Zhu, C.; Wang, S.; Wang, P.; Wang, X.; Mao, Z. A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. arXiv 2026, arXiv:2602.03442. [Google Scholar] [CrossRef]
- Wang, P.; Xu, B.; Zhang, L.; Wang, S.; Du, M.; Zhu, C.; Mao, Z. WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora. arXiv 2026, arXiv:2602.02053. [Google Scholar] [CrossRef]
- Li, S.; Stenzel, S.; Eickhoff, C.; Bahrainian, S. Enhancing Retrieval-Augmented Generation: A Study of Best Practices. In Proceedings of the 31st International Conference on Computational Linguistics, Abu Dhabi, United Arab Emirates; Association for Computational Linguistics: Stroudsburg, PA, USA, 2025; pp. 6705–6717. [Google Scholar]
- Arslan, M.; Ghanem, H.; Munawar, S.; Cruz, C. A Survey on RAG with LLMs. Procedia Comput. Sci. 2024, 246, 3781–3790. [Google Scholar] [CrossRef]
- Chen, Z.; Zhou, K.; Zhang, B.; Gong, Z.; Zhao, W.; Wen, J.-R. ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models. arXiv 2023, arXiv:2305.14323. [Google Scholar] [CrossRef]
- Khattab, O.; Singhvi, A.; Maheshwari, P.; Zhang, Z.; Santhanam, K.; Vardhamanan, S.; Haq, S.; Sharma, A.; Joshi, T.; Moazam, H.; et al. DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines. arXiv 2023, arXiv:2310.03714. [Google Scholar] [CrossRef]
- Zhai, C.; Wibowo, S. A systematic review on artificial intelligence dialogue systems for enhancing English as foreign language students’ interactional competence in the university. Comput. Educ. Artif. Intell. 2023, 4, 100134. [Google Scholar] [CrossRef]
- Klimova, B.; Pikhart, M. Exploring the effects of artificial intelligence on student and academic well-being in higher education: A mini-review. Front. Psychol. 2025, 16, 1498132. [Google Scholar] [CrossRef]
- Radford, A.; Kim, J.; Hallacy, C.; Goh, R.A.G.; Agarwal, S.; Sastry, G.; Askell, A.; Mishkin, P.; Clark, J.; Krueger, G.; et al. Learning Transferable Visual Models from Natural Language Supervision (CLIP). arXiv 2021, arXiv:2103.00020. [Google Scholar] [CrossRef]
- Li, J.; Li, D.; Xiong, C.H.S. BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation. arXiv 2022, arXiv:2201.12086. [Google Scholar] [CrossRef]
- Li, J.; Li, D.; Savarese, S.; Hoi, S. BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models. arXiv 2023, arXiv:2301.12597. [Google Scholar] [CrossRef]
- Li, C.; Gan, Z.; Yang, Z.; Yang, J.; Li, L.; Wang, L.; Gao, J. Multimodal Foundation Models: From Specialists to General-Purpose Assistants. arXiv 2023, arXiv:2309.10020. [Google Scholar] [CrossRef]
- Mei, L.; Mo, S.; Yang, Z.; Chen, C. A Survey of Multimodal Retrieval-Augmented Generation. arXiv 2025, arXiv:2504.08748. [Google Scholar] [CrossRef]
- European Union. Regulation (EU) 2024/1689 of the European Parliament and of the Council of 13 June 2024 Laying Down Harmonised Rules on Artificial Intelligence (Artificial Intelligence Act); (EU) 2024/1689; Publications Office of the European Union: Luxembourg, 2024. [Google Scholar]
- European Commission. Ethical Guidelines on the Use of Artificial Intelligence and Data in Teaching and Learning for Educators; Publications Office of the European Union: Luxembourg, 2022. [Google Scholar] [CrossRef]
- Riely, J.; Hennessy, M. A typed language for distributed mobile processes. In Proceedings of the 25th Annual ACM Symposium on Principles of Programming Languages; Association for Computing Machinery: New York, NY, USA, 1998; pp. 378–390. [Google Scholar] [CrossRef]
- Siewe, F.; Zedan, H.; Cau, A. The calculus of context-aware ambients. J. Comput. Syst. Sci. 2011, 77, 597–620. [Google Scholar] [CrossRef]
- Glushkova, T.; Rusev, K.; Stoyanov, D. Multi-Context Modeling of Processes and Services in Cyber-Physical Educational Space. In Proceedings of the International Conference on Big Data, Knowledge and Control Systems Engineering (BdKCSE); IEEE: New York, NY, USA, 2023; pp. 1–8. [Google Scholar] [CrossRef]




| Notation | Description |
|---|---|
| PA | Personal Assistant of Student |
| RT | Retriever Component |
| SC | Session Context Module |
| LLMG | LLM Generator |
| KM | Knowledge Manager |
| VDB | Vector Database, child of KM ambient |
| UDB | User Database, child of KM ambient |
| MEDB | Multimodal Educational DB, child of KM ambient |
| Chunking | Chunks | TOP-1 | TOP-3 | TOP-5 | TOP-6 |
|---|---|---|---|---|---|
| 300/100 | 233 | 47.14% | 58.57% | 65.71% | 67.14% |
| 300/150 | 353 | 48.57% | 57.14% | 60.00% | 67.14% |
| 500/150 | 149 | 52.86% | 74.29% | 77.14% | 78.57% |
| 800/250 | 119 | 54.29% | 74.29% | 81.43% | 84.29% |
| Method | TOP-1 | TOP-3 | TOP-5 | TOP-6 |
|---|---|---|---|---|
| Semantic | 70.00% | 87.14% | 88.57% | 91.43% |
| Hybrid | 78.57% | 85.71% | 90.00% | 91.43% |
| Retrieval Input | TOP-1 | TOP-3 | TOP-5 | TOP-6 |
|---|---|---|---|---|
| 500/150 | 78.57% | 85.71% | 90.00% | 91.43% |
| 800/250 | 64.00% | 81.43% | 87.14% | 87.14% |
| Metric | Score |
|---|---|
| Correctness | 88.10% |
| Groundedness | 76.19% |
| Hallucination-Free | 76.19% |
| Usefulness | 95.24% |
| Metric | Value |
|---|---|
| Number of queries | 70 |
| First-query latency (initialization) | 3.74 s |
| Median latency (all queries) | 0.0167 s |
| Mean latency (all queries) | 0.0702 s |
| Minimum latency | 0.0141 s |
| Maximum latency | 3.7417 s |
| Typical steady-state range | 0.014–0.021 s |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Toskova, A.; Georgiev, K.; Glushkova, T. Dialogical Learning Support in RAG-Based E-Learning. Information 2026, 17, 418. https://doi.org/10.3390/info17050418
Toskova A, Georgiev K, Glushkova T. Dialogical Learning Support in RAG-Based E-Learning. Information. 2026; 17(5):418. https://doi.org/10.3390/info17050418
Chicago/Turabian StyleToskova, Asya, Kosta Georgiev, and Todorka Glushkova. 2026. "Dialogical Learning Support in RAG-Based E-Learning" Information 17, no. 5: 418. https://doi.org/10.3390/info17050418
APA StyleToskova, A., Georgiev, K., & Glushkova, T. (2026). Dialogical Learning Support in RAG-Based E-Learning. Information, 17(5), 418. https://doi.org/10.3390/info17050418

