Explainable Artificial Intelligence Approaches in Primary Education: A Review
Abstract
:1. Introduction
2. Materials and Methods
- (1)
- In which main categories may the XAI approaches in primary education be discerned?
- (2)
- What are the main trends of the XAI approaches in primary education?
- (3)
- What are the main XAI tools or methods used in the XAI approaches in primary education?
- (4)
- In which primary education learning subjects are the XAI approaches used?
- (5)
- For which AI methods are XAI tools or methods used to provide explanations in XAI approaches in primary education?
- (6)
- Are single XAI tools and methods or a combination of them more preferred in XAI approaches in primary education?
- (7)
- Taking into consideration the XAI tools or methods most used in XAI approaches in primary education, which of the main functionalities offered are exploited?
- (a)
- They involved papers published in journals that were retracted;
- (b)
- They involved work that was authored in a language other than English;
- (c)
- They were not accessible in general or through our institution;
- (d)
- They involved reviews, position papers, overviews, or editorials, and not research studies;
- (e)
- They did not involve education but some other field;
- (f)
- They did not involve primary education;
- (g)
- They did not involve the use of XAI;
- (h)
- They involved theses or technical reports.
3. Main Aspects of the Reviewed Papers
4. Results
4.1. Approaches Using XAI to Assist in Teaching and Learning
4.1.1. Main Trends in the Reviewed Studies Reviewed Studies Using XAI to Assist in Teaching and Learning
4.1.2. Brief Description of Reviewed Studies Using XAI to Assist in Teaching and Learning
4.2. Approaches Using XAI in the Context of AI as a Learning Subject
4.3. Approaches Using XAI in Policymaking, Decision Support, and Administrative Tasks
4.3.1. Main Trends in the Reviewed Studies Using XAI in Policymaking, Decision Support, and Administrative Tasks
4.3.2. Brief Description of Reviewed Studies Using XAI in Policymaking, Decision Support, and Administrative Tasks
5. Methodological Quality of the Reviewed Papers
6. Discussion
6.1. Answers to the Research Questions
- (a)
- Calculation of SHAP values. This is performed in all studies using SHAP.
- (b)
- Generation of various types of plots (Table 9).
- (c)
- Depiction of feature contribution in producing the outputs. This is done in all studies using SHAP. SHAP values and the generated plots were used for this purpose.
- (d)
- Feature selection in order to improve the performance of the tested AI methods. This was done in the work of Gao et al. [23].
- (e)
- Provision of local and global explanations. Global explanations are provided with the global bar, beeswarm summary, collective force plot, and scatter plots. Local explanations are provided with waterfall and force (and individual) plots, and to a certain degree with the beeswarm summary plots. Local explanations are also considered to be the online feedback to teachers and students provided in the study of Kim et al. [26].
6.2. Further Issues
7. Conclusions and Future Directions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
AI | Artificial Intelligence |
BERT | Bidirectional Encoder Representations from Transformer |
CNN | Convolutional Neural Network |
COCO | Common Objects in Context |
CONFEMEN | Conference of Ministers of Education of French-Speaking States and Governments |
CP | Conference Proceeding |
FAMeX | FEature iMportance-Based eXplanable AI algorithm |
FURIA | Fuzzy Unordered Rule Induction Algorithm |
Grad-CAM | Gradient-Weighted Class Activation Map |
GVTSC | Global Variational Transformer Speaker Clustering |
JP | Journal Publication |
KEDI | Korean Educational Development Institute |
KNN | K-Nearest Neighbors |
KOFAC | Korean Foundation for the Advancement of Science and Creativity |
LLM | Large Language Model |
LightGBM | Light Gradient-Boosting Machine |
LIME | Local Interpretable Model-Agnostic Explanations |
LSTM | Long Short-Term Memory |
MOOC | Massive Open Online Course |
NLP | Natural Language Processing |
NSO | National Statistical Office |
PASEC | Program for the Analysis of CONFEMEN Education System |
PIRLS | Progress in International Reading Literacy Study |
PISA | Program for International Student Assessment |
SHAP | SHapley Additive exPlanations |
SVM | Support Vector Machine |
TIMSS | Trends in International Mathematics and Science Study |
UAE | United Arab Emirates |
UNESCO | United Nations Educational, Scientific, and Cultural Organization |
USA | United States of America |
VIRTSI | Variability and Impact of Reciprocal Trust States towards Intelligent Systems |
Weka | Waikato Environment for Knowledge Analysis |
XAI | Explainable Artificial Intelligence |
XGBoost | eXtreme Gradient Boosting |
References
- Chen, L.; Chen, P.; Lin, Z. Artificial intelligence in education: A review. IEEE Access 2020, 8, 75264–75278. [Google Scholar] [CrossRef]
- Bozkurt, A.; Karadeniz, A.; Baneres, D.; Guerrero-Roldán, A.E.; Rodríguez, M.E. Artificial intelligence and reflections from educational landscape: A review of AI Studies in half a century. Sustainability 2021, 13, 800. [Google Scholar] [CrossRef]
- Prentzas, J. Artificial Intelligence methods in early childhood education. In Artificial Intelligence, Evolutionary Computing and Metaheuristics. In The Footsteps of Alan Turing; Studies in Computational, Intelligence; Yang, X.S., Ed.; Springer: Berlin/Heidelberg, Germany, 2013; Volume 427, pp. 169–199. ISBN 978-364-229-693-2. [Google Scholar] [CrossRef]
- Silva, M.; Ferreira, A.; Alves, K.; Valenca, G.; Brito, K. A new perspective for longitudinal measurement and analysis of public education in Brazil based on open data and machine learning. In Proceedings of the Seventeenth International Conference on Theory and Practice of Electronic Governance, Pretoria, South Africa, 1–4 October 2024; Chun, S., Karuri-Sebina, G., Przeybilovicz, E., Barbosa, F., Braga, C., Eds.; ACM: New York, NY, USA, 2024; pp. 130–138. [Google Scholar] [CrossRef]
- Nadaf, A.; Monroe, S.; Chandran, S.; Miao, X. Learning factors for TIMSS math performance evidenced through machine learning in the UAE. In Artificial Intelligence in Education Technologies: New Development and Innovative Practices, Proceedings of the Third International Conference on Artificial Intelligence in Education Technology, Wuhan, China, 1–3 July 2022; Lecture Notes on Data Engineering and Communications Technologies; Cheng, E.C.K., Wang, T., Schlippe, T., Beligiannis, G.N., Eds.; Springer: Berlin/Heidelberg, Germany, 2022; Volume 154, pp. 47–66. [Google Scholar] [CrossRef]
- Sanfo, J.B. Application of Explainable Artificial Intelligence approach to predict student learning outcomes. J. Comput. Soc. Sci. 2025, 8, 9. [Google Scholar] [CrossRef]
- Lee, Y. Applying Explainable Artificial Intelligence to develop a model for predicting the supply and demand of teachers by region. J. Educ. e-Learn. Res. 2021, 8, 198–205. [Google Scholar] [CrossRef]
- Ozmen, M.; Sahin, H. Real-time optimization of school bus routing problem in smart cities using genetic algorithm. In Proceedings of the 6th International Conference on Inventive Computation Technologies, Coimbatore, India, 20–22 January 2021; IEEE: New York, NY, USA, 2021; pp. 1152–1158. [Google Scholar] [CrossRef]
- Perikos, I.; Grivokostopoulou, F.; Hatzilygeroudis, I. Assistance and feedback mechanism in an Intelligent Tutoring System for teaching conversion of Natural Language into Logic. Int. J. Artif. Intell. Educ. 2017, 27, 475–514. [Google Scholar] [CrossRef]
- Chrysafiadi, K.; Virvou, M. Evaluating the integration of fuzzy logic into the student model of a web-based learning environment. Expert Syst. Appl. 2012, 39, 13127–13134. [Google Scholar] [CrossRef]
- Markos, A.; Prentzas, J.; Sidiropoulou, M. Pre-Service teachers’ assessment of ChatGPT’s utility in higher education: SWOT and content analysis. Electronics 2024, 13, 1985. [Google Scholar] [CrossRef]
- Hoffait, A.S.; Schyns, M. Early detection of university students with potential difficulties. Decis. Support Syst. 2017, 101, 1–11. [Google Scholar] [CrossRef]
- UNESCO Institute for Statistics. International Standard Classification of Education (ISCED) 2011. Available online: https://uis.unesco.org/sites/default/files/documents/international-standard-classification-of-education-isced-2011-en.pdf (accessed on 10 May 2025).
- Hatzilygeroudis, I.; Prentzas, J. HYMES: A HYbrid Modular Expert System with efficient inference and explanation. In Proceedings of the 8th Panhellenic Conference on Informatics, Nicosia, Cyprus, 8–10 November 2001; Manolopoulos, Y., Evripidou, S., Eds.; Livanis Publications: Athens, Greece, 2001; Volume 1, pp. 422–431. [Google Scholar]
- Hatzilygeroudis, I.; Prentzas, J. Symbolic-neural rule based reasoning and explanation. Expert Syst. Appl. 2015, 42, 4595–4609. [Google Scholar] [CrossRef]
- Saeed, W.; Omlin, C. Explainable AI (XAI): A systematic meta-survey of current challenges and future opportunities. Knowl.-Based Syst. 2023, 263, 110273. [Google Scholar] [CrossRef]
- Hassija, V.; Chamola, V.; Mahapatra, A.; Singal, A.; Goel, D.; Huang, K.; Scardapane, S.; Spinelli, I.; Mahmud, M.; Hussain, A. Interpreting black-box models: A review on Explainable Artificial Intelligence. Cogn. Comput. 2024, 16, 45–74. [Google Scholar] [CrossRef]
- Chiu, T.K. A holistic approach to the design of Artificial Intelligence (AI) education for K-12 schools. TechTrends 2021, 65, 796–807. [Google Scholar] [CrossRef]
- Khosravi, H.; Shum, S.B.; Chen, G.; Conati, C.; Tsai, Y.S.; Kay, J.; Knight, S.; Martinez-Maldonado, R.; Sadiq, S.; Gašević, D. Explainable artificial intelligence in education. Comput. Educ. Artif. Intell. 2022, 3, 100074. [Google Scholar] [CrossRef]
- Olsen, J.K.; Rummel, N.; Aleven, V. Designing for the co-orchestration of social transitions between individual, small-group and whole-class learning in the classroom. Int. J. Artif. Intell. Educ. 2021, 31, 24–56. [Google Scholar] [CrossRef]
- Melsión, G.I.; Torre, I.; Vidal, E.; Leite, I. Using explainability to help children understand gender bias in AI. In Proceedings of the Twentieth Annual ACM Interaction Design and Children Conference, Athens, Greece, 24–30 June 2021; ACM: New York, NY, USA, 2021; pp. 87–99. [Google Scholar] [CrossRef]
- Alonso, J.M. Explainable Artificial Intelligence for kids. In Proceedings of the 11th Conference of the European Society for Fuzzy Logic and Technology, Prague, Czech Republic, 9–13 September 2019; Atlantis Press: Dordrecht, The Netherlands, 2019; pp. 134–141. [Google Scholar] [CrossRef]
- Gao, H.; Hasenbein, L.; Bozkir, E.; Göllner, R.; Kasneci, E. Exploring gender differences in computational thinking learning in a VR classroom: Developing machine learning models using eye-tracking data and explaining the models. Int. J. Artif. Intell. Educ. 2023, 33, 929–954. [Google Scholar] [CrossRef]
- Kammüller, F.; Satija, D. Explanation of student attendance AI prediction with the Isabelle Infrastructure Framework. Information 2023, 14, 453. [Google Scholar] [CrossRef]
- Kar, S.P.; Das, A.K.; Chatterjee, R.; Mandal, J.K. Assessment of learning parameters for students’ adaptability in online education using machine learning and explainable AI. Educ. Inf. Technol. 2024, 29, 7553–7568. [Google Scholar] [CrossRef]
- Kim, S.; Kim, W.; Jang, Y.; Choi, S.; Jung, H.; Kim, H. Student knowledge prediction for teacher-student interaction. In Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2–9 February 2021; AAAI Press: Palo Alto, CA, USA, 2021; Volume 35, No. 17. pp. 15560–15568. [Google Scholar] [CrossRef]
- Kobusingye, B.M.; Dorothy, A.; Nakatumba-Nabende, J.; Marvin, G. Explainable machine translation for intelligent e-learning of social studies. In Proceedings of the Seventh International Conference on Trends in Electronics and Informatics, Tirunelveli, India, 11–13 April 2023; IEEE: New York, NY, USA, 2023; pp. 1066–1072. [Google Scholar] [CrossRef]
- McIntyre, N.A. Access to online learning: Machine learning analysis from a social justice perspective. Educ. Inf. Technol. 2023, 28, 3787–3832. [Google Scholar] [CrossRef]
- Miao, X.; Mishra, P.K.; Nadaf, A. Evidence and promises of AI predictions to understand student approaches to math learning in Abu Dhabi K12 public schools. Gulf Educ. Soc. Policy Rev. 2021, 1, 109–134. [Google Scholar] [CrossRef]
- Mohan, A.; Mathew, M.; Malavika, K.; Gutjahr, G.; Menon, R.; Nedungadi, P. An Explainable AI model for student engagement in tablet learning. In Proceedings of the Sixth IEEE International Conference on Recent Advances in Intelligent Computational Systems, Kothamangalam, Kerala, India, 16–18 May 2024; IEEE: New York, NY, USA, 2024. [Google Scholar] [CrossRef]
- Onishi, S.; Yasumori, T.; Shiina, H. Visualization of the impact of classroom utterances using generative dialogue models. IIAI Lett. Inform. Interdiscip. Res. 2023, 4, LIIR181. [Google Scholar] [CrossRef]
- Onishi, S.; Kojima, S.; Shiina, H.; Yasumori, T. Estimating the impact of classroom speech using a Large Language Model. In Proceedings of the Sixteenth IIAI International Congress on Advanced Applied Informatics, Takamatsu, Japan, 6–12 July 2024; IEEE: New York, NY, USA, 2024; pp. 409–414. [Google Scholar] [CrossRef]
- Pelánek, R.; Effenberger, T.; Kukučka, A. Towards design-loop adaptivity: Identifying items for revision. J. Educ. Data Min. 2022, 14, 1–25. [Google Scholar] [CrossRef]
- Ribeiro-Flucht, L.; Chen, X.; Meurers, D. Explainable AI in language learning: Linking empirical evidence and theoretical concepts in proficiency and readability modeling of Portuguese. In Proceedings of the Nineteenth Workshop on Innovative Use of NLP for Building Educational Applications, Mexico City, Mexico, 20 June 2024; Kochmar, E., Bexte, M., Burstein, J., Horbach, A., Laarmann-Quante, R., Tack, A., Yaneva, V., Yuan, Z., Eds.; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 199–209. [Google Scholar]
- Ruan, S.; Nie, A.; Steenbergen, W.; He, J.; Zhang, J.Q.; Guo, M.; Liu, Y.; Nguyen, D.K.; Wang, C.Y.; Ying, R.; et al. Reinforcement learning tutor better supported lower performers in a math task. Mach. Learn. 2024, 113, 3023–3048. [Google Scholar] [CrossRef]
- Tsiakas, K.; Barakova, E.; Khan, J.V.; Markopoulos, P. BrainHood: Towards an explainable recommendation system for self-regulated cognitive training in children. In Proceedings of the 13th ACM International Conference on PErvasive Technologies related to Assistive Environments, Corfu, Greece, 30 June–3 July 2020; ACM: New York, NY, USA, 2020. [Google Scholar] [CrossRef]
- Tsiakas, K.; Barakova, E.; Khan, J.V.; Markopoulos, P. BrainHood: Designing a cognitive training system that supports self-regulated learning skills in children. Technol. Disabil. 2020, 32, 219–228. [Google Scholar] [CrossRef]
- Zhen, Y.; Luo, J.D.; Chen, H. Prediction of academic performance of students in online live classroom interactions—An analysis using natural language processing and deep learning methods. J. Soc. Comput. 2023, 4, 12–29. [Google Scholar] [CrossRef]
- Lundberg, S.; Lee, S.-I. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems, Proceedings of the Thirty-First International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; Guyon, I., von Luxburg, U., Bengio, S., Wallach, H.M., Fergus, R., Vishwanathan, S.V.N., Garnett, R., Eds.; Curran Associates: Red Hook, NY, USA, 2017; Volume 30, pp. 4765–4774. [Google Scholar]
- Lundberg, S. SHAP Documentation. Available online: https://shap.readthedocs.io/en/latest/index.html (accessed on 15 January 2025).
- Sundararajan, M.; Taly, A.; Yan, Q. Axiomatic attribution for deep networks. In Proceedings of Machine Learning Research, Proceedings of the International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; Precup, D., Teh, Y.W., Eds.; MLR.org; 2017; Volume 70, pp. 3319–3328. [Google Scholar]
- Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual explanations From Deep Networks via Gradient-Based Localization. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; IEEE: New York, NY, USA, 2017; pp. 618–626. [Google Scholar] [CrossRef]
- Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
- Vig, J. BertViz: A tool for visualizing multihead self-attention in the BERT model. In Proceedings of the International Conference on Learning Representations Workshop: Debugging Machine Learning Models, New Orleans, LA, USA, 6 May 2019; Volume 3. [Google Scholar]
- Alonso, J.M.; Bugarín, A. ExpliClas: Automatic generation of explanations in natural language for Weka classifiers. In Proceedings of the IEEE International Conference on Fuzzy Systems, New Orleans, LA, USA, 23–26 June 2019; IEEE: New York, NY, USA, 2019. [Google Scholar] [CrossRef]
- Hühn, J.; Hüllermeier, E. FURIA: An algorithm for unordered fuzzy rule induction. Data Min. Knowl. Disc. 2009, 19, 293–319. [Google Scholar] [CrossRef]
- Ribeiro, M.T.; Singh, S.; Guestrin, C. “Why should i trust you?” Explaining the predictions of any classifier. In Proceedings of the Twenty-Second ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: New York, NY, USA, 2016; pp. 1135–1144. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: New York, NY, USA, 2016; pp. 785–794. [Google Scholar] [CrossRef]
- Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.Y. Lightgbm: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems, Proceedings of the Thirty-First International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; Guyon, I., Von Luxburg, U., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates: Red Hook, NY, USA, 2017; Volume 30, pp. 3146–3154. [Google Scholar]
- Piech, C.; Bassen, J.; Huang, J.; Ganguli, S.; Sahami, M.; Guibas, L.J.; Sohl-Dickstein, J. Deep knowledge tracing. In Advances in Neural Information Processing Systems, Proceedings of the Twenty-Ninth International Conference on Neural Information Processing Systems, Montreal, Canada, 7–12 December 2015; Cortes, C., Lawrence, N.D., Lee, D.D., Sugiyama, M., Garnett, R., Eds.; MIT Press: Cambridge, MA, USA, 2015; Volume 28, pp. 505–513. [Google Scholar]
- Junczys-Dowmunt, M.; Grundkiewicz, R.; Dwojak, T.; Hoang, H.; Heafield, K.; Neckermann, T.; Seide, F.; Germann, U.; Aji, A.F.; Bogoychev, N.; et al. Marian: Fast Neural Machine Translation in C++. arXiv 2018, arXiv:1804.00344. [Google Scholar]
- Keller, J.M. Development and use of the ARCS model of instructional design. J. Instr. Dev. 1987, 10, 2–10. [Google Scholar] [CrossRef]
- Liu, F.T.; Ting, K.M.; Zhou, Z.-H. Isolation-based anomaly detection. ACM Trans. Knowl. Discov. Data 2012, 6, 1–39. [Google Scholar] [CrossRef]
- Gupta, N.; Eswaran, D.; Shah, N.; Akoglu, L.; Faloutsos, C. Beyond outlier detection: LOOKOUT for pictorial explanation. In Machine Learning and Knowledge Discovery in Databases, Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, Dublin, Ireland, 10–14 September 2018; Lecture Notes in Computer Science; Berlingerio, M., Bonchi, F., Gärtner, T., Hurley, N., Ifrim, G., Eds.; Springer: Cham, Switzerland, 2019; Volume 11051, pp. 122–138. [Google Scholar] [CrossRef]
- Schulman, J.; Wolski, F.; Dhariwal, P.; Radford, A.; Klimov, O. Proximal policy optimization algorithms. arXiv 2017, arXiv:1707.06347. [Google Scholar]
- Bull, S.; Kay, J. Open Learner Models. In Advances in Intelligent Tutoring Systems; Studies in Computational Intelligence; Nkambou, R., Bourdeau, J., Mizoguchi, R., Eds.; Springer: Berlin/Heidelberg, Germany, 2010; Volume 308, pp. 301–322. [Google Scholar] [CrossRef]
- Clarke, E.M.; Emerson, E.A. Design and synthesis of synchronization skeletons using branching time temporal logic. In Logic of Programs; Lecture Notes in Computer Science; Kozen, D., Ed.; Springer: Berlin/Heidelberg, Germany, 1982; Volume 131, pp. 52–71. [Google Scholar] [CrossRef]
- Clarke, E.M. The birth of model checking. In 25 Years of Model Checking; Lecture Notes in Computer Science; Grumberg, O., Veith, H., Eds.; Springer: Berlin/Heidelberg, Germany, 2008; Volume 5000, pp. 1–26. [Google Scholar] [CrossRef]
- Prokhorenkova, L.O.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. CatBoost: Unbiased boosting with categorical features. In Advances in Neural Information Processing Systems, Proceedings of the Thirty-Second International Conference of the Neural Information Processing Systems, Montréal Canada, 3–8 December 2018; Bengio, S., Wallach, H.M., Larochelle, H., Grauman, K., Cesa-Bianchi, N., Garnett, R., Eds.; Curran Associates: Red Hook, NY, USA, 2018; Volume 31, pp. 6639–6649. [Google Scholar]
- Hatzilygeroudis, I.; Prentzas, J. Integrated rule-based learning and inference. IEEE Trans. Knowl. Data Eng. 2010, 22, 1549–1562. [Google Scholar] [CrossRef]
- Prentzas, J.; Hatzilygeroudis, I. Assessment of life insurance applications: An approach integrating neuro-symbolic rule-based with case-based reasoning. Expert Syst. 2016, 33, 145–160. [Google Scholar] [CrossRef]
- Virvou, M.; Tsihrintzis, G.A.; Tsichrintzi, E.-A. VIRTSI: A novel trust dynamics model enhancing Artificial Intelligence collaboration with human users—Insights from a ChatGPT evaluation study. Inform. Sciences 2024, 675, 120759. [Google Scholar] [CrossRef]
- Prentzas, J.; Sidiropoulou, M. Integrating OpenAI Chat-GPT in a University Department of Education: Main types of use and preliminary assessment results. In Extended Selected Papers of the 14th International Conference on Information, Intelligence, Systems, and Applications; Lecture Notes in Networks and Systems; Bourbakis, N., Tsihrintzis, G.A., Virvou, M., Jain, L.C., Eds.; Springer: Cham, Switzerland, 2024; Volume 1093, pp. 306–326. ISBN 978-3-031-67425-9. [Google Scholar] [CrossRef]
- Papadimitriou, S.; Virvou, M. Artificial Intelligence-Based Games as Novel Holistic Educational Environments to Teach 21st Century Skills, Intelligent Systems Reference Library; Springer: Cham, Switzerland, 2025; Volume 93, ISBN 978-3-031-77463-8. [Google Scholar]
Type of Items | Number |
---|---|
Number of total items initially retrieved with both search tools (without omitting duplicate items) | 1372 |
Number of duplicate items | 140 |
Number of papers published in journals that were retracted | 1 |
Number of items not authored in English | 47 |
Number of items that were not accessible | 69 |
Number of items that were excluded by reading their title and/or abstract Number of items that did not involve education but some other field: 397 Number of items that did not involve primary education: 194 Number of items that did not involve the use of XAI: 39 Number of items that involved theses or technical reports: 40 Number of items that involved surveys, overviews, or reviews: 231 | 901 |
Number of items that were excluded by reading their full content Number of items that did not involve education but some other field: 56 Number of items that did not involve primary education: 83 Number of items that did not involve the use of XAI: 52 | 191 |
Number of items that were included in the list of reviewed items | 23 |
Category | Indicative Tasks Concerning XAI |
---|---|
A | Educational applications, presentations, and demonstrations using XAI Lab activities using XAI E-learning tool mechanisms (e.g., selection of learning activities, assessment, feedback to teachers, feedback to students, collaboration) using XAI Classroom co-orchestration using XAI Processing of data collected in the specific learning environment(s) using XAI Processing of data previously accumulated from other studies using XAI Preparation of new teaching content and management of existing teaching content using XAI Lesson planning using XAI |
B | Educational applications, presentations, and demonstrations about XAI Lab activities about XAI E-learning tools about XAI Lesson plans about XAI Design of AI curriculum including XAI |
C | Analysis of educational data, perhaps in combination with other types of data (family-related, demographics, financial, government data, etc.) using XAI Analysis of large-scale educational assessments using XAI Assessment of educational unit(s) using XAI Administrative tasks of teachers, educational personnel, executives, and policymakers using XAI |
ID | Citation, Pub. Type | Category | Country (Countries) | AI Method(s) Explained | XAI Tool(s) or Method(s) |
---|---|---|---|---|---|
1 | [22], CP | B | Spain | Voting scheme of J48, RepTree, RandomTree, and FURIA | ExpliClas |
2 | [23], JP | A | Germany | LightGBM | SHAP |
3 | [24], JP | C | UK | A black-box AI model | Logical rules |
4 | [25], JP | C | India | Random Forest | LIME, SHAP, FAMeX, |
5 | [26], CP | A | South Korea | XGBoost | SHAP |
6 | [27], CP | A | Uganda | Transformer models | SHAP, BertViz |
7 | [7], JP | C | South Korea | XGBoost | Feature importance, partial dependence plots, SHAP |
8 | [28], CP | A | UK | XGBoost | SHAP |
9 | [21], CP | B | Sweden, Spain | CNN with LSTM | Grad-CAM |
10 | [29], JP | C | UAE | XGBoost | SHAP |
11 | [30], CP | A | India | XGBoost | SHAP |
12 | [5], JP | C | UAE | CatBoost | SHAP |
13 | [20], JP | A | Switzerland, USA, Germany | Not specifically mentioned | Not specifically mentioned |
14 | [31], JP | A | Japan | Transformer model | Visualization of attention weights |
15 | [32], CP | A | Japan | LLM and non-LLM transformer models | Visualization methods |
16 | [33], JP | A | Czech Republic | Isolation Forest | Explainable outlier detection |
17 | [34], CP | A | Germany | Transformer models | SHAP |
18 | [35], JP | A | USA | Reinforcement learning | Integrated gradient analysis |
19 | [6], JP | C | Japan | KNN, SVM, Random Forest | SHAP |
20 | [4], CP | C | Brazil | Random Forest | Feature importance |
21 | [36], CP | A | Netherlands | Open Learner Model, probability models | Text-based explanations |
22 | [37], JP | A | Netherlands | Open Learner Model, probability models | Text-based explanations |
23 | [38], JP | A | China | CNN | SHAP |
Citation | (X)AI Tasks | Learning Subject | Type |
---|---|---|---|
[23] | Gender prediction using eye movement (SVM, Random Forest, logistic regression, XGBoost, LightGBM) Feature selection for all tested AI methods (SHAP) Contribution of features in output (SHAP) | Computational thinking | (ii) |
[26] | Student’s knowledge state prediction (deep knowledge tracing model) Prediction of student’s response to a question in the next step (deep knowledge tracing model) Prediction of final test score (XGBoost) Explanation to teachers about final test score prediction (SHAP) Advice to student about proposed activities to carry out (SHAP) | Mathematics | (ii) |
[27] | Machine translation of learning material (transformer models) Visualization of the contribution of features in model output (SHAP) Visualization of the attention mechanism of transformer models (BertViz) | Social studies, English as a second language | (ii) |
[28] | Access inequalities in online learning (XGBoost) Contribution of features in model output (SHAP) | Mathematics | (ii) |
[30] | Prediction of students’ motivation in learning (XGBoost) Contribution of features in prediction (SHAP) | English as a second language in learning after school | (ii) |
[20] | Classroom co-orchestration (teacher and AI system) Explanations about the system decisions | (i) | |
[31] | Classroom dialogue analysis (transformer model) Visualization of attention weights | Mathematics | (ii) |
[32] | Classroom dialogue analysis (LLM and non-LLM models) Visualization methods | Mathematics | (ii) |
[33] | Identification of educational items for revision Outlier detection (Isolation Forest), Listing of item properties with extreme values for each outlier | Mathematics, programming, English as a second language | (ii) |
[34] | Proficiency and readability modeling of Portuguese (transformer models) Contribution of features in output (SHAP) | Portuguese as a second language | (ii) |
[35] | Adaptive selection of a pedagogical strategy (reinforcement learning) Feature contribution in strategy selection (integrated gradient analysis) | Mathematics | (ii) |
[36,37] | Alternative recommendations to students (Open Learner Model, probability models) Personalized text-based explanations for the recommendations | Support of students’ self-regulated skills | (i) |
[38] | Natural Language Processing (NLP) of interaction data in online learning platforms Prediction of academic performance using features extracted from NLP (decision tree, artificial neural network, CNN) Contribution of features in prediction (SHAP) | Language, English as a second language, mathematics | (ii) |
Citation | Main Goals |
---|---|
[22] | Workshops teaching (X)AI to children in combination with visual programming and sports Classification of selected players (voting scheme of J48, RepTree, RandomTree, and FURIA) Text and visual explanations for classification (ExpliClas) |
[21] | Raise awareness and improve students’ comprehension of bias and fairness in AI decision-making Image classification (CNN with LSTM) Visual explanations for classification (Grad-CAM) |
Citation | AI Tasks |
---|---|
[24] | Prediction of student attendance (a black-box AI model) Logical rule-based explanations (Isabelle Insider and Infrastructure framework and the precondition refinement rule algorithm) |
[25] | Prediction of students’ adaptability to online education (Random Forest) Identify learning parameters increasing students’ adaptability (LIME, SHAP, FAMeX) |
[7] | Prediction of the supply and demand of teachers by region (XGBoost) Feature importance How numerical changes in a feature affects prediction (partial dependence plots) Contribution of features in the prediction (SHAP) |
[29] | Investigation of predictive factors for student approaches to math learning (XGBoost), math taught in second language Contribution of predictive factors to math performance (SHAP) |
[5] | Modeling of students’ math performance (CatBoost) Identification of key factors affecting student math performance (SHAP) |
[6] | Prediction of student math learning outcomes (KNN, SVM, Random Forest regressor) Identification of key predictive features (SHAP) |
[4] | Student performance prediction involving primary education learning subjects (linear regression, Random Forest, neural network) Feature importance |
ID | Citation | Study Design | Sample Size | Data Source | Limitations Identified |
---|---|---|---|---|---|
1 | [22] | Design and implementation case study as a workshop (XAI for kids) | Not precisely defined (20 school students in the age range from 6 to 17 years old) | Training data (80 samples) created by the researcher User feedback data | Gender bias in training data Slightly better accuracy of black-box models Simplified interaction design Lack of detailed evaluation with humans |
2 | [23] | Quantitative experimental study using machine learning and eye-tracking data | 381 sixth-grade students (final 280 participants, M:140, F:140) | Eye-tracking data (>3600 samples) created by the researchers Behavioral and physiological measures Questionnaires | Specific context No longitudinal measurement Interpretability challenges No direct correlation to actual computational thinking learning outcomes |
3 | [24] | Explanatory modeling | Synthetic data (gender, location, transport, ethnicity, special needs) | Synthetic data (not sourced from an actual dataset) | No real data used Certain abstract definitions about explanations Not empirically validated |
4 | [25] | AI analysis of parameters affecting | 1205 students at all educational levels in Bangladesh | Survey data from 1205 students in all educational levels | Generalizability |
5 | [26] | Experimental design using AI in a real-world educational setting | 59 first-grade and 227 second-grade students (5 primary schools in South Korea) | Proprietary data from the KOFAC | Data availability Limited evaluation |
6 | [27] | Machine translation model designed to translate social studies notes from Luganda to English, and vice versa | The number of participants is not specified | A dataset (4000 words) created from notes by the researchers (publicly available) and another publicly available dataset | Future extensions to other subjects No extensive evaluation by teachers |
7 | [7] | Predictive modeling | Structured public datasets (data from 17 cities and provinces, 2001–2019) | Korean national public databases, KEDI and NSO | No human evaluation Further research needed with additional features |
8 | [28] | Data mining from e-learning platform (quantitative data) | Students from three countries | Dataset from e-learning platform (random subsample of 5000) | Study prior to and during COVID-19 Quantitative and aggregative approach |
9 | [21] | Mixed-methods experimental study | 76 5th- and 6th-grade students from a primary school | A subset of the Microsoft COCO public dataset | Gender bias in the dataset Sample size and scope Online study constraints Short-term exposure to tool |
10 | [29] | AI-based analysis of test results and questionnaires | 5th- to 9th-grade students in 20 public schools in Abu Dhabi (high-quality data from 1660 students, at least 305 in primary education) | Data from math diagnostic tests and questionnaires (included measures of student self-efficacy, metacognitive strategies, instructional language skills, and math performance outcomes) | Data availability Lack of data on teachers’ knowledge/skills, metacognitive strategies Dependence on data from student questionnaires |
11 | [30] | AI-based analysis of data from students using educational tablets (quantitative study) | 135 1st- to 5th-grade students (after school centers, three states in India) | Interaction data Student questionnaire | No qualitative data Generalizability beyond India |
12 | [5] | AI-based analysis of the UAE students’ math performance in the TIMSS 2019 assessment | 22,163 4th- and 27,342 8th-grade students from the UAE | TIMSS 2019 for the UAE | Economic and metacognitive factors not covered in TIMSS Detailed impact of each learning factor not explored |
13 | [20] | Classroom orchestration and transition taxonomy | 7 teachers from 5 different primary schools teaching various grades (convenience sample) | Insights and feedback gatheredfrom teachers during co-design workshops and prototype evaluations | Sample size Prototype scope No data collected from classrooms |
14 | [31] | Classroom dialogue analysis and impact of classroom utterances in student learning | Data from a 45 min classroom session, 2 classes, 4th and 6th grades in a primary school | Collected from recorded teacher and student dialogues: 193 and 274 utterances (4th and 6th grades, respectively) | Sample size Data from one classroom session |
15 | [32] | Classroom dialogue analysis and impact of classroom utterances in student learning using two models | Data from classroom | Samples of dialogues | Sample size Size of dialogue data |
16 | [33] | Mixed (analyzed item properties, employed both simulation and real-world data analysis, conducted two case studies) | Data from a large-scale learning environment used by tens of thousands students (8–15 years old) | Pool of items Item metadata Students’ responses to items | Sample size details are missing Bias in data (student/system behavior) not considered Detailed analysis only for text items |
17 | [34] | Linguistic modeling (i.e., proficiency and readability modeling) using available corpora | The number of participants is missing | European Portuguese Learner Corpus, Brazilian school materials | Complexity of measures Missing details about human evaluators Class imbalance in corpora Need for enhanced corpora metadata |
18 | [35] | Evaluate the effectiveness of a learning system based on reinforcement learning to enhance math concept learning, 2 case studies | Approximately 300 students | Interaction data from students using the system | Sample size details are missing Remote learning due to COVID-19 Long-term impact |
19 | [6] | AI-based analysis of math learning using PASEC 2019 data | Students from Burkina Faso who participated in PASEC 2019 | Data from the 2019 PASEC for Burkina Faso | Generalizability of findings Selection of features |
20 | [4] | Statistical and AI-based analysis of student performance using open government and large-scale assessment data | Data from Brazilian public education institutions | Open government data (Brazil) Large-scale assessment data (Brazil) | Generalizability beyond Brazil Selection of features Selection of calendar years of collected data |
21, 22 | [36,37] | Conceptual study of a system aiming to enhance self-regulated skills | Conceptual study | Comprehensive review | Empirical validation missing Implementation details are missing |
23 | [38] | Analysis of online live interaction for student performance prediction | >100,000 students in online platform | About 10 million interactive texts | Collection of data from an online platform only Need for more fine-grained dialogue analyses Student only dialogues |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Prentzas, J.; Binopoulou, A. Explainable Artificial Intelligence Approaches in Primary Education: A Review. Electronics 2025, 14, 2279. https://doi.org/10.3390/electronics14112279
Prentzas J, Binopoulou A. Explainable Artificial Intelligence Approaches in Primary Education: A Review. Electronics. 2025; 14(11):2279. https://doi.org/10.3390/electronics14112279
Chicago/Turabian StylePrentzas, Jim, and Ariadni Binopoulou. 2025. "Explainable Artificial Intelligence Approaches in Primary Education: A Review" Electronics 14, no. 11: 2279. https://doi.org/10.3390/electronics14112279
APA StylePrentzas, J., & Binopoulou, A. (2025). Explainable Artificial Intelligence Approaches in Primary Education: A Review. Electronics, 14(11), 2279. https://doi.org/10.3390/electronics14112279