Next Article in Journal
Bidirectional Temporal Attention Convolutional Networks for High-Performance Network Traffic Anomaly Detection
Previous Article in Journal
Analysis of Japanese Twitter Posts Related to COVID-19 Vaccination Focusing on Frequently Occurring Words and Emotional Expressions
Previous Article in Special Issue
Fusion of Deep Reinforcement Learning and Educational Data Mining for Decision Support in Journalism and Communication
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Machine Learning Approaches for Early Student Performance Prediction in Programming Education

1
Department of Software Systems, University of Doha for Science and Technology, Doha 24449, Qatar
2
College of Computing and IT, University of Doha for Science and Technology, Doha 24449, Qatar
*
Author to whom correspondence should be addressed.
Information 2026, 17(1), 60; https://doi.org/10.3390/info17010060
Submission received: 2 December 2025 / Revised: 31 December 2025 / Accepted: 5 January 2026 / Published: 8 January 2026
(This article belongs to the Special Issue Human–Computer Interactions and Computer-Assisted Education)

Abstract

Intelligent recommender systems are essential for identifying at-risk students and personalizing learning through tailored resources. Accurate prediction of student performance enables these systems to deliver timely interventions and data-driven support. This paper presents the application of machine learning models to predict final exam grades in a university-level programming course, leveraging multi-modal student data to improve prediction accuracy. In particular, a recent raw dataset of students enrolled in a programming course across 36 class sections from the Fall 2024 and Winter 2025 terms was initially processed. The data was collected up to one month before the final exam. From this data, a comprehensive set of features was engineered, including the student’s background, assessment grades and completion times, digital learning interactions, and engagement metrics. Building on this feature set, six machine learning prediction models were initially developed using data from the Fall 2024 term. Both training and testing were conducted on this dataset using cross-validation combined with hyperparameter tuning. The XGBoost model demonstrated strong performance, achieving an accuracy exceeding 91%. To assess the generalizability of the considered models, all models were retrained on the complete Fall 2024 dataset. They were then evaluated on an independent dataset from Winter 2025, with XGBoost achieving the highest accuracy, exceeding 84%. Feature importance analysis has revealed that the midterm grade and the average completion duration of lab assessments are the most influential predictors. This data-driven approach empowers instructors to proactively identify and support at-risk students, enabling adaptive learning environments that deliver personalized learning and timely interventions.
Keywords: early prediction; machine learning; personalized learning; programming education; student performance early prediction; machine learning; personalized learning; programming education; student performance
Graphical Abstract

Share and Cite

MDPI and ACS Style

Bouallegue, S.; Omri, A.; Al-Naemi, S. Machine Learning Approaches for Early Student Performance Prediction in Programming Education. Information 2026, 17, 60. https://doi.org/10.3390/info17010060

AMA Style

Bouallegue S, Omri A, Al-Naemi S. Machine Learning Approaches for Early Student Performance Prediction in Programming Education. Information. 2026; 17(1):60. https://doi.org/10.3390/info17010060

Chicago/Turabian Style

Bouallegue, Seifeddine, Aymen Omri, and Salem Al-Naemi. 2026. "Machine Learning Approaches for Early Student Performance Prediction in Programming Education" Information 17, no. 1: 60. https://doi.org/10.3390/info17010060

APA Style

Bouallegue, S., Omri, A., & Al-Naemi, S. (2026). Machine Learning Approaches for Early Student Performance Prediction in Programming Education. Information, 17(1), 60. https://doi.org/10.3390/info17010060

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop