Predicting University Student Dropout Risk Using Deep Learning and Ensemble Voting Mechanism †
Abstract
1. Introduction
2. Methods
2.1. Experimental Setup
2.2. Data Collection and Preprocessing
2.2.1. Data Request
2.2.2. Data Preprocessing
2.3. Model Architecture and Design
2.3.1. AutoKeras Model Construction
2.3.2. Mathematical Formulation of the Neural Network Architecture
2.4. Data Balancing and Model Training Strategy
2.4.1. Handling Class Imbalance and Implementing a Voting Mechanism
2.4.2. Model Training Configuration and Comparison
3. Result and Discussion
3.1. Evaluation of Majority Voting Strategy
3.2. Limitations and Challenges in Model Performance
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Araque, F.; Roldán, C.; Salguero, A. Factors influencing university dropout rates. Comput. Educ. 2009, 53, 563–574. [Google Scholar] [CrossRef]
- Borrella, I.; Caballero-Caballero, S.; Ponce-Cueto, E. Taking action to reduce dropout in MOOCs: Tested interventions. Comput. Educ. 2022, 179, 104412. [Google Scholar] [CrossRef]
- Jin, H.; Song, Q.; Hu, X. Auto-Keras: An Efficient Neural Architecture Search System. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD ’19), Anchorage, AK, USA, 4–8 August 2019; Association for Computing Machinery: New York, NY, USA, 2019; pp. 1946–1956. [Google Scholar]
- Alaiad, A.; Migdady, A.; Al-Khatib, R.M.; Alzoubi, O.; Zitar, R.A.; Abualigah, L. Autokeras Approach: A Robust Automated Deep Learning Network for Diagnosis Disease Cases in Medical Images. J. Imaging 2023, 9, 64. [Google Scholar] [CrossRef] [PubMed]
- Al-Sarem, M.; Saeed, F.; Boulila, W.; Emara, A.H.; Al-Mohaimeed, M.; Errais, M. Feature Selection and Classification Using CatBoost Method for Improving the Performance of Predicting Parkinson’s Disease. In Advances in Intelligent Systems and Computing, Proceedings of First International Conference of Advanced Computing and Informatics (ICACIn 2020), Casablanca, Morocco, 12–13 April 2020; Springer Nature Singapore Pte Ltd.: Singapore, 2021; Volume 1188, pp. 189–199. [Google Scholar]
- Batista, G.E.A.P.A.; Prati, R.C.; Monard, M.C. A study of the behavior of several methods for balancing machine learning training data. ACM SIGKDD Explor. Newsl. 2004, 6, 20–29. [Google Scholar] [CrossRef]
- Ruta, D.; Gabrys, B. Classifier selection for majority voting. Inf. Fusion 2005, 6, 63–81. [Google Scholar] [CrossRef]


| Dataset | Status: No | Status: Yes | Total |
|---|---|---|---|
| AY 100–112 | 2309 | 2309 | 4618 |
| AY 108–112 | 684 | 684 | 1368 |
| Validation Set | 288 | 12 | 300 |
| Dataset | Status: No | Status: Yes | Total Records | Sample |
|---|---|---|---|---|
| Single sample | 684 | 684 | 1368 | 1 |
| Majority voting | 22 | |||
| Sample validation set | 288 | 12 | 300 | 1 |
| Data | Single | Multiple | |||
|---|---|---|---|---|---|
| Actual | |||||
| Label | Yes | No | Yes | No | |
| Predicted (students) | Yes | 12 | 102 | 11 | 80 |
| No | 0 | 186 | 1 | 208 | |
| Evaluation | |||||
| Metrics (%) | Accuracy | 66.67% | 73% | ||
| Precision | 10.53% | 12.09% | |||
| Recall | 100% | 91.67% | |||
| F1-score | 19.05% | 21.36% | |||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Cheng, Y.-H.; Lin, C.-E. Predicting University Student Dropout Risk Using Deep Learning and Ensemble Voting Mechanism. Eng. Proc. 2025, 120, 66. https://doi.org/10.3390/engproc2025120066
Cheng Y-H, Lin C-E. Predicting University Student Dropout Risk Using Deep Learning and Ensemble Voting Mechanism. Engineering Proceedings. 2025; 120(1):66. https://doi.org/10.3390/engproc2025120066
Chicago/Turabian StyleCheng, Yu-Huei, and Che-En Lin. 2025. "Predicting University Student Dropout Risk Using Deep Learning and Ensemble Voting Mechanism" Engineering Proceedings 120, no. 1: 66. https://doi.org/10.3390/engproc2025120066
APA StyleCheng, Y.-H., & Lin, C.-E. (2025). Predicting University Student Dropout Risk Using Deep Learning and Ensemble Voting Mechanism. Engineering Proceedings, 120(1), 66. https://doi.org/10.3390/engproc2025120066

