Predicting the Learning Performance of Minority Students in a Vietnamese High School Using Artificial Intelligence Algorithms †
Abstract
1. Introduction
2. Materials and Methods
Implementation Process
- Step 1: Data collection and preprocessing
- Step 2: Data classification
- Step 3: Building prediction models
- Step 4: Evaluation
- Step 5: Conclusions
3. Results
3.1. Classification
3.2. Feature Selection
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Available online: https://baodantoc.vn/dai-hoi-dai-bieu-cac-dtts-tinh-soc-trang-lan-thu-iv-nam-2024-thanh-cong-tot-dep-1723959639108.htm (accessed on 20 October 2024).
- Regulations on Principles, Criteria, and Norms for Allocation of Central Budget Capital and Ratio of Counterfeit Capital of Local Budget to Implement the National Target Program for Socio-Economic Development in Ethnic Minority and Mountainous Areas for the Period 2021–2030, Phase I: 2021–2025. Available online: https://thuvienphapluat.vn/van-ban/Dau-tu/Quyet-dinh-39-2021-QD-TTg-nguyen-tac-tieu-chi-phan-bo-von-ngan-sach-trung-uong-499362.aspx (accessed on 20 October 2024).
- Zafari, M.; Sadeghi-Niaraki, A.; Choi, S.M.; Esmaeily, A. A practical model for the evaluation of high school student performance based on machine learning. App. Sci. 2021, 11, 11534. [Google Scholar] [CrossRef]
- Zeineddine, H.; Braendle, U.; Farahm, A. Enhancing prediction of student success: Automated machine learning approach. Comput. Electr. Eng. 2020, 89, 106903. [Google Scholar] [CrossRef]
- Bujang, S.D.A.; Selamat, A.; Ibrahim, R.; Krejcar, O.; Herrera-Viedma, E.; Fujita, H.; Ghani, N.A.M. Multiclass prediction model for student grade prediction using machine learning. IEEE Access 2021, 9, 95608–95621. [Google Scholar] [CrossRef]
- Tatar, A.E.; Düştegör, D. Prediction of academic performance at undergraduate graduation: Course grades or Grade point average? App. Sci. 2020, 10, 4967. [Google Scholar] [CrossRef]
- Duong, H.T.H.; Tran, L.T.M.; To, H.Q.; Van Nguyen, K. Academic performance warning system based on data driven for higher education. Neural Comput. Appl. 2022, 35, 5819–5837. [Google Scholar] [CrossRef] [PubMed]
- Dinh-Thanh, N.; Thi-Ngoc-Diem, P. Predicting academic performance of high school students. In Nature of Computation and Communication. ICTCC 2022. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering; Phan, C.V., Nguyen, T.D., Eds.; Springer: Cham, Switzerland, 2023; Volume 473. [Google Scholar] [CrossRef]
- Quynh, T.D.; Dong, N.D.; Thuan, N.Q. A case study of student performance predictions in English course: The data mining approach. In International Congress on Information and Communication Technology; Springer Nature: Singapore, 2024; pp. 419–429. [Google Scholar]
- Huynh-Cam, T.T.; Chen, L.S.; Le, H. Using decision trees and random forest algorithms to predict and determine factors contributing to first-year university students’ learning performance. Algorithms 2021, 14, 318. [Google Scholar] [CrossRef]
Factor ID | Factor Description | Values and Transformed Values | |
---|---|---|---|
Khmer Students | Chinese (Hoa) Students | ||
F1 | Gender | 1 = Girls; 2 = Boys | |
F2 | Age | 1 = 16; 2 = 17; 3 = 18 | |
F3 | Home address | 1 = Town; 2 = Rural; 3 = Remote rural | |
F4 | Father’s career | 0 = Not applicable; 1 = Agricultural worker; 2 = Teacher; 3 = Laborer; 4 = Truck driver; 5 = Officer; 6 = Manager; 7 = Musician; 8 = Small retailer; 9 = Freelancer; 10 = Other | 0 = Not applicable; 1 = Agricultural worker; 2 = Laborer; 3 = Truck driver; 4 = Engineer; 5 = Small retailer; 6 = Freelancer |
F5 | Mother’s career | 0 = Not applicable; 1 = Agricultural worker; 2 = Teacher; 3 = Laborer; 4 = Officer; 5 = Photographer; 6 = Small retailer; 7 = Housewife; 8 = Freelancer | 0 = Not applicable; 1 = Agricultural worker; 2 = Laborer; 3 = Hairdresser; 4 = Small retailer; 5 = Housewife |
F6 | Math score for mid-term test_Semester 1 | 1.4~9.8 | 1.8~8.8 |
F7 | Math score for final exam_Semeter 1 | 1.4~10 | 1.8~9.2 |
F8 | Math score for mid-term test_Semester 2 | 1.8~10 | 2.8~10 |
F9 | Math score for final exam_Semeter 2 | 1.3~10 | 1.8~9.6 |
F10 | Literature score for mid-term test_Semester 1 | 3.0~9.0 | 3.0~8.0 |
F11 | Literature score for final exam_Semeter 1 | 3.0~9.0 | 4.0~8.5 |
F12 | Literature score for mid-term test_Semester 2 | 2.0~10 | 4.5~8.5 |
F13 | Literature score for final exam_Semeter 2 | 3.0~9.0 | 3.0~8.5 |
F14 | English score for mid-term test_Semester 1 | 2.3~9.5 | 2.7~7.3 |
F15 | English score for final exam_Semester 1 | 2.2~9.1 | 3.4~9.0 |
F16 | English score for mid-term test_Semester 2 | 3.5~9.8 | 5.0~10 |
F17 | English score for final exam_Semester 2 | 2.8~9.5 | 4.8~8.8 |
Output | Learning performance | 1 = Very good (VG); 2 = Good (G); 3 = Average (AVG); 4 = Poor |
Cases | Transformed Values | Output Factor | Scores | Number of Samples | Classification | |
---|---|---|---|---|---|---|
Khmer Students (number = 147) | Chinese (Hoa) Students (number = 27) | |||||
Case 1: Origin | 4 | Very good (VG) | 8.0–10 points | 13 (12.1%) | 04 (14.8%) | 4 classes: VG, G, AVG, and poor |
3 | Good (G) | 6.5- 7.9 points | 68 (63.8%) | 13 (48.2%) | ||
2 | Average (AVG) | 5.0–6.4 points | 62 (57.9%) | 9 (33.3%) | ||
1 | Poor | 3.5–4.9 points | 04 (3.7%) | 01 (3.7%) | ||
Case 2: Combine | 3 | VG | 8.0–10 points | 13 (12.1%) | 04 (14.8%) | 3 classes: VG, G, and poor |
2 | G | 6.5- 7.9 points | 68 (63.8%) | 13 (48.2%) | ||
1 | Poor | 3.5–6.4 points | 68 (63.8%) | 10 (37%) | ||
Case 3: Focus | 1 | VG | 8.0–10 points | 13 (12.1%) | 04 (14.8%) | 2 classes: VG and poor |
0 | Poor | 3.5–6.4 points | 68 (63.8%) | 10 (37%) |
Group | Classification Case | Classifier | Accuracy (%) | F1 score (%) | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean (SD) | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean (SD) | |||
Khmer students | Case 1: Origin | RF | 77 | 77 | 83 | 87 | 80 | 80.8 (4.27) | 00 | 00 | 00 | 00 | 00 | 0.0 (0.0) |
GNB | 80 | 80 | 87 | 90 | 80 | 83.4 (4.77) | 00 | 00 | 00 | 00 | 00 | 0.0 (0.0) | ||
Case 2: Combine | RF | 83 | 87 | 83 | 77 | 70 | 80.0 (6.63) | 89 | 88 | 89 | 79 | 81 | 85.2 (4.82) | |
GNB | 90 | 90 | 87 | 73 | 83 | 84.6 (7.09) | 89 | 89 | 85 | 84 | 87 | 86.8 (2.28) | ||
Case 3: Focus | RF | 100 | 94 | 100 | 100 | 100 | 98.8 (2.68) | 100 | 96 | 100 | 100 | 100 | 99.2 (1.79) | |
GNB | 100 | 100 | 100 | 100 | 100 | 100.0 (0.00) | 100 | 100 | 100 | 100 | 100 | 100.0 (0.00) | ||
Chinese (Hoa) students | Case 1: Origin | RF | 67 | 67 | 56 | 78 | 56 | 64.8 (9.2) | 00 | 00 | 00 | 00 | 00 | 0.0 (0.0) |
GNB | 83 | 78 | 78 | 67 | 67 | 74.6 (7.23) | 00 | 00 | 00 | 00 | 00 | 0.0 (0.0) | ||
Case 2: Combine | RF | 67 | 83 | 83 | 100 | 67 | 80.0 (13.75) | 50 | 67 | 67 | 100 | 00 | 56.8 (36.56) | |
GNB | 67 | 33 | 67 | 83 | 50 | 60.0 (19.08) | 00 | 00 | 67 | 00 | 00 | 13.4 (29.96) | ||
Case 3: Focus | RF | 100 | 100 | 100 | 100 | 100 | 100.0 (0.00) | 100 | 100 | 100 | 100 | 100 | 100.0 (0.00) | |
GNB | 100 | 100 | 100 | 100 | 100 | 100.0 (0.00) | 100 | 100 | 100 | 100 | 100 | 100.0 (0.00) |
Rank Order | Factors | |
---|---|---|
Khmer Students | Chinese (Hoa) Students | |
1 | F9. Math score for final exam_Semeter 2 | F11. Literature score for final exam_Semeter 1 |
2 | F6. Math score for mid-term test_Semester 1 | F17. English score for final exam_Semester 2 |
3 | F8. Math score for mid-term test_Semester 2 | F15. English score for final exam_Semester 1 |
4 | F7. Math score for final exam_Semeter 1 | F6. Math score for mid-term test_Semester 1 |
5 | F15. English score for final exam_Semester 1 | F8. Math score for mid-term test_Semester 2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Le, H.-D.; Huynh-Cam, T.-T.; Chen, L.-S.; Ngan, V.P.T.; Lu, T.-C. Predicting the Learning Performance of Minority Students in a Vietnamese High School Using Artificial Intelligence Algorithms. Eng. Proc. 2025, 98, 22. https://doi.org/10.3390/engproc2025098022
Le H-D, Huynh-Cam T-T, Chen L-S, Ngan VPT, Lu T-C. Predicting the Learning Performance of Minority Students in a Vietnamese High School Using Artificial Intelligence Algorithms. Engineering Proceedings. 2025; 98(1):22. https://doi.org/10.3390/engproc2025098022
Chicago/Turabian StyleLe, Hai-Duy, Thao-Trang Huynh-Cam, Long-Sheng Chen, Vo Phan Thu Ngan, and Tzu-Chuen Lu. 2025. "Predicting the Learning Performance of Minority Students in a Vietnamese High School Using Artificial Intelligence Algorithms" Engineering Proceedings 98, no. 1: 22. https://doi.org/10.3390/engproc2025098022
APA StyleLe, H.-D., Huynh-Cam, T.-T., Chen, L.-S., Ngan, V. P. T., & Lu, T.-C. (2025). Predicting the Learning Performance of Minority Students in a Vietnamese High School Using Artificial Intelligence Algorithms. Engineering Proceedings, 98(1), 22. https://doi.org/10.3390/engproc2025098022