Predicting the Cognitive and Social–Emotional Development of Minority Children in Early Education: A Data Science Approach
Abstract
1. Introduction
1.1. Delays in Early Childhood Development
1.2. Interventions
1.3. Machine Learning for Effective Education
2. Materials and Methods
2.1. Data
2.2. Methods
3. Results
3.1. Classification Results
3.2. Predictions Form Weighted Ensemble Regression
4. Discussion
4.1. Linking Outcomes with Theory
4.2. The Role of Socio-Economic Factors
4.3. Implications and Future Directions
4.4. Promoting Interdisciplinary Research
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
PCA | Principal component analysis (dimensionality reduction algorithm) |
LDA | Linear discriminant analysis (dimensionality reduction and classification) |
KNN | k-nearest neighbors (here used for classification and data imputation) |
NB | Naive Bayes (classification algorithm) |
XGBoost | Extreme gradient boosting (classification and regression algorithm) |
CV | Cross-validation (for the evaluation metric) |
WE_L2/L3 | Weighted ensemble, level two/three (stacking of trained models) |
K-SVM | Kernel support vector machine (classification algorithm) |
ROC | Receiver operating characteristic |
AUC | Area under the curve |
MICE | Multiple imputation by chained equations (data imputation algorithm) |
References
- Schweinhart, J.L. The High/Scope Perry Preschool Study Through Age 40. Summary, Conclusions, and Frequently Asked Questions. High/Scope Educational Research Foundation. 2005. Available online: https://highscope.org/wp-content/uploads/2024/07/perry-preschool-summary-40.pdf (accessed on 30 May 2025).
- Grantham-McGregor, S.M.; Cheung, Y.B.; Cueto, S.; Glewwe, P.; Richter, L.; Strupp, B.; The International Child Development Steering Group. Child development in developing countries 1: Developmental potential in the first 5 years for children in developing countries. Lancet 2007, 369, 60–70. [Google Scholar] [CrossRef] [PubMed]
- Giedd, J.N. Structural Magnetic Resonance Imaging of the Adolescent Brain. Ann. N. Y. Acad. Sci. 2004, 1021, 77–85. [Google Scholar] [CrossRef]
- Bishop, D.V.M.; Snowling, M.J. Developmental Dyslexia and Specific Language Impairment: Same or Different? Psychol. Bull. 2004, 130, 858–886. [Google Scholar] [CrossRef]
- Thompson, R.A. Early attachment and later development. In Handbook of Attachment: Theory, Research, and Clinical Applications; Cassidy, J., Shaver, P.R., Eds.; Guilford: New York, NY, USA, 1999; pp. 265–286. [Google Scholar]
- Liu, D.; Diorio, J.; Tannenbaum, B.; Caldji, C.; Francis, D.; Freedman, A.; Sharma, S.; Pearson, D.; Plotsky, P.M.; Meaney, M.J. Maternal care, hippocampal glucocorticoid receptors, and hypothalamic–pituitary– adrenal responses to stress. Science 1997, 277, 1659–1662. [Google Scholar] [CrossRef]
- Diamond, A. Executive Functions. Annu. Rev. Psychol. 2013, 64, 135–168. [Google Scholar] [CrossRef]
- Head Start Impact Study. Final Report. U.S. Department of Health and Human Services. Administration for Children and Families. 2010. Available online: https://acf.gov/sites/default/files/documents/opre/hs_impact_study_final.pdf (accessed on 30 May 2025).
- Marzola, P.; Melzer, T.; Pavesi, E.; Gil-Mohapel, J.; Brocardo, P.S. Exploring the Role of Neuroplasticity in Development, Aging, and Neurodegeneration. Brain Sci. 2023, 13, 1610. [Google Scholar] [CrossRef]
- Hanish, L.D. The grand challenge in social and emotional development: Building a harmonious world for children and adolescents. Front. Dev. Psychol. 2024, 1, 1130880. [Google Scholar] [CrossRef]
- McCoy, D.C.; Peet, E.D.; Ezzati, M.; Danaei, G.; Black, M.M.; Sudfeld, C.R.; Fawzi, W.; Fink, G. Early Childhood Developmental Status in Low- and Middle-Income Countries: National, Regional, and Global Prevalence Estimates Using Predictive Modeling. PLoS Med. 2016, 13, e1002034. [Google Scholar] [CrossRef]
- Prado, E.; Larson, L.; Cox, K.; Bettencourt, K.; Kubes, J.; Shankar, A. Do effects of early life interventions on linear growth correspond to effects on neurobehavioural development? A systematic review and meta-analysis. Lancet Glob. Health 2019, 7, e1398–e1413. [Google Scholar] [CrossRef] [PubMed]
- Wang, W.; Dong, Y.; Liu, X.; Bai, Y.; Zhang, L. The effect of parents’ education on the academic and non-cognitive outcomes of their children: Evidence from China. Child. Youth Serv. Rev. 2020, 117, 105307. [Google Scholar] [CrossRef]
- Cabrera, J.N. Positive Development of Minority Children. Soc. Policy Rep. 2013, 27, 1–22. Available online: https://www.srcd.org/sites/default/files/file-attachments/spr_272_final.pdf (accessed on 30 May 2025). [CrossRef]
- Dodge, K.; Bai, Y.; Ladd, H.; Muschkin, G.C. Impact of North Carolina’s Early Childhood Programs and Policies on Educational Outcomes in Elementary School. Child Dev. 2016, 88, 996–1014. [Google Scholar] [CrossRef]
- Arango, M.; Nimnicht, G.; Penaranda, F. Twenty Years On: A report of the PROMESA programme in Colombia. Early Childhood Development: Practice and Reflections, No. 22, The Hague, Bernard van Leer Foundation. 2004. Available online: https://issuu.com/bernardvanleerfoundation/docs/twenty_years_on_a_report_of_the_promesa_programme_ (accessed on 30 May 2025).
- Mladenov, C.; Dimitrov, E.; Kazakov, B. Demographical development of Bulgaria during the transitional period. Méditerranée 2008, 110, 117–123. [Google Scholar] [CrossRef]
- Imdorf, C.; Ilieva-Trichkova, P.; Stoilova, R.; Boyadjieva, P.; Gerganov, A. Regional and Ethnic Disparities of School-to-Work Transitions in Bulgaria. Educ. Sci. 2022, 12, 233. [Google Scholar] [CrossRef]
- Ersozlu, Z.; Taheri, S.; Koch, I. A review of machine learning methods used for educational data. Educ. Inf. Technol. 2024, 29, 22125–22145. [Google Scholar] [CrossRef]
- Frias-Martinez, E. Machine Learning for Education: Current State and Future Prospects. In Radical Solutions for Artificial Intelligence and Digital Transformation in Education. Lecture Notes in Educational Technology; Burgos, D., Branch, J.W., Tlili, A., Huang, R., Jemni, M., Stracke, C.M., de la Higuera, C., Looi, C.-K., Berrada, K., Eds.; Springer: Singapore, 2024. [Google Scholar] [CrossRef]
- Ren, X. A hybrid model combining environmental analysis and machine learning for predicting AI education quality. Sci. Rep. 2025, 15, 12577. [Google Scholar] [CrossRef] [PubMed]
- Song, C.; Shin, S.-Y.; Shin, K.-S. Implementing the Dynamic Feedback-Driven Learning Optimization Framework: A Machine Learning Approach to Personalize Educational Pathways. Appl. Sci. 2024, 14, 916. [Google Scholar] [CrossRef]
- Menon, H.K.D.; Janardhan, V. Machine Learning Approaches in Education. Mater. Today Proc. 2021, 43, 3470–3480. [Google Scholar] [CrossRef]
- Labadze, L.; Grigolia, M.; Machaidze, L. Role of AI chatbots in education: Systematic literature review. Int. J. Educ. Technol. High Educ. 2023, 20, 56. [Google Scholar] [CrossRef]
- Davar, N.F.; Dewan, M.A.A.; Zhang, X. AI Chatbots in Education: Challenges and Opportunities. Information 2025, 16, 235. [Google Scholar] [CrossRef]
- Mim, M.A.; Khatun, M.R.; Hossain, M.M.; Rahman, W.; Munir, A. Exploring Early Learning Challenges in Children Utilizing Statistical and Explainable Machine Learning. Algorithms 2025, 18, 20. [Google Scholar] [CrossRef]
- Jin, X. Predicting academic success: Machine learning analysis of student, parental, and school efforts. Asia Pac. Educ. Rev. 2023. [Google Scholar] [CrossRef]
- Deng, J.; Huang, X.; Ren, X. A Multidimensional Analysis of Self-Esteem and Individualism: A Deep Learning-Based Model for Predicting Elementary School Students’ Academic Performance. Meas. Sens. 2024, 33, 101147. [Google Scholar] [CrossRef]
- Chu, T.S.; Ashraf, M. Artificial Intelligence in Curriculum Design: A Data-Driven Approach to Higher Education Innovation. Preprints 2025, 2025050041. [Google Scholar] [CrossRef]
- Sharif, S.; Theeng Tamang, M.; Mani, D.; Elmedany, W. Application of Clustering Algorithms to Enhance Personalized Learning through Recommendation Model. In Proceedings of the International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies, Sakhir, Bahrain, 17–19 November 2024. [Google Scholar] [CrossRef]
- Chen, Y.; Sun, J.; Wang, J.; Zhao, L.; Song, X.; Zhai, L. Machine Learning-Driven Student Performance Prediction for Enhancing Tiered Instruction. arXiv 2025. [Google Scholar] [CrossRef]
- Herath, D.; Dinuwan, C.; Ihalagedara, C.; Ambegoda, T. Enhancing Educational Outcomes Through AI Powered Learning Strategy Recommendation System. Int. J. Adv. Comput. Sci. Appl. 2024, 15, 10. [Google Scholar] [CrossRef]
- Hutter, F.; Kotthoff, L.; Vanschoren, J. Automated Machine Learning: Methods, Systems, Challenges; Springer: Berlin/Heidelberg, Germany, 2019; ISBN 9783030053185. [Google Scholar]
- Wolpert, D.H. Stacked Generalization. Neural Netw. 1992, 5, 241–259. [Google Scholar] [CrossRef]
- Knudsen, I.E. Sensitive Periods in the Development of the Brain and Behavior. J. Cogn. Neurosci. 2004, 16, 1412–1425. [Google Scholar] [CrossRef] [PubMed]
- Mercan, Z.; Akpınar, Ü.; Kocaarslan, M. Cognitive abilities in early childhood: An exploration across gender, age group, school type, and parental educational status. Bartın Univ. J. Fac. Educ. 2024, 13, 705–721. [Google Scholar] [CrossRef]
Category | Parent | Teacher | Expert | Tests |
---|---|---|---|---|
adaptation | − | |||
communication | ||||
cognitive | ||||
physical | ||||
emotional |
Feature | Score | AUC | Best Model |
---|---|---|---|
employment father | Random Forest | ||
education mother | Gaussian NB | ||
marital status | LDA | ||
cluster type | XGboost/K-SMV |
Target Feature | Best Model | Background Features | |
---|---|---|---|
social–emotional | NeuralNetTorch | age mother/child | |
communication | NeuralNetTorch | education/age | |
adaptive behavior | WE_L2 | age mother/child | |
PC1 (’soft’ skills) | WE_L3 | age mother/child | |
PC2 (physical) | WE_L2 | education/age | |
PC3 (cognitive) | WE_L2 | education father |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Brezov, D.; Koltcheva, N.; Stoyanova, D. Predicting the Cognitive and Social–Emotional Development of Minority Children in Early Education: A Data Science Approach. AppliedMath 2025, 5, 113. https://doi.org/10.3390/appliedmath5030113
Brezov D, Koltcheva N, Stoyanova D. Predicting the Cognitive and Social–Emotional Development of Minority Children in Early Education: A Data Science Approach. AppliedMath. 2025; 5(3):113. https://doi.org/10.3390/appliedmath5030113
Chicago/Turabian StyleBrezov, Danail, Nadia Koltcheva, and Desislava Stoyanova. 2025. "Predicting the Cognitive and Social–Emotional Development of Minority Children in Early Education: A Data Science Approach" AppliedMath 5, no. 3: 113. https://doi.org/10.3390/appliedmath5030113
APA StyleBrezov, D., Koltcheva, N., & Stoyanova, D. (2025). Predicting the Cognitive and Social–Emotional Development of Minority Children in Early Education: A Data Science Approach. AppliedMath, 5(3), 113. https://doi.org/10.3390/appliedmath5030113