BNMI-DINA: A Bayesian Cognitive Diagnosis Model for Enhanced Personalized Learning
Abstract
:1. Introduction
- To the best of our knowledge, we are the first to integrate Bayesian networks into the DINA model to obtain continuous mastery levels for knowledge points and account for their interdependence. Our proposed model overcomes the limitation of the DINA model by considering the dependencies between knowledge points, enabling a more comprehensive evaluation of students’ cognitive skills.
- We propose a parallelization method that effectively parallelizes the key steps of our BNMI-DINA model, thereby significantly reducing the computational burden associated with model training. This parallelization enables the application of BNMI-DINA to large datasets. Additionally, we provide theoretical proof of the convergence of BNMI-DINA, establishing its validity and ensuring reliable results.
- We conduct extensive experiments using real datasets to validate the performance of BNMI-DINA. The results of our experiments demonstrate the superiority of BNMI-DINA when compared to other baseline models. Specifically, BNMI-DINA outperforms the alternatives in terms of both model accuracy and training efficiency, solidifying its position as a highly effective and efficient cognitive diagnosis approach.
2. Related Work
3. Preliminaries
4. Design of BNMI-DINA Model
4.1. Framework Overview
4.2. Bayesian Network Module
4.3. MI-DINA Module
- 1.
- Perform initialization operations for parameters that cannot be directly observed and estimate model parameter values.
- 2.
- Based on the estimated model parameter values, estimation operations are performed on parameters that cannot be directly observed.
- 3.
- Re-estimate the model parameters based on the parameter values estimated in step 2 that cannot be directly observed.
- 4.
- Repeat steps 2–3 until the parameters converge.
Algorithm 1 Pseudo-code of MI-DINA cognitive diagnostic process |
Input: Student-Question score matrix R; Question-Knowledge point matrix Q; Output: Estimate of the multidimensional knowledge mastery vector for a student user ;
|
5. Convergence of BNMI-DINA Model
6. Experiments
6.1. Datasets
6.2. Visualizing the Relationship of Knowledge Points
6.3. Evaluation Metrics
- Accuracy is calculated as the ratio of successful predictions to the total number of instances in the test set. It quantifies the proportion of correct predictions in relation to the entire test set size.
- Time refers to the duration of a particular process, measured in seconds. It provides insights into the efficiency of the model in terms of computation time.
- The F1 score combines precision and recall, offering a comprehensive assessment of the model’s predictions. Precision evaluates the ratio of correct positive predictions, while recall assesses the ratio of correctly predicted positive instances.
6.4. Experimental Settings
6.5. Experimental Results
- 1.
- The accuracy of BNMI-DINA significantly outperforms other traditional cognitive diagnosis models. On the Junyi dataset, BNMI-DINA enhances model accuracy by an average of 5.72%. This gap becomes even more obvious on the Junyi-scaled dataset with an improvement of the accuracy by up to 9.21%.
- 2.
- BNMI-DINA has proven to be more computationally efficient than other baselines. The training time is reduced by 26.3% on average while achieving comparable model accuracy.
- Improvement in estimation process: MI-DINA breaks down the estimation process into smaller subtasks and runs them in parallel. The main process then combines the results, which speeds up the overall diagnostic process. This improvement enhances the efficiency and effectiveness of the model.
- Enhancements in E-step and M-step: the MI-DINA model enhances the E-step and M-step without affecting the convergence of parameters. Key parameter estimation can still occur iteratively. This enhancement ensures that the estimation process is accurate and reliable.
- Fusion of Bayesian networks: incorporating Bayesian networks in the approach focuses on estimating students’ mastery probabilities for each knowledge point. This allows for the capture of complex dependencies between variables in cognitive diagnosis, resulting in more accurate understanding of the relationships between students’ cognition and knowledge attributes.
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Templin, J.L.; Henson, R.A. Measurement of psychological disorders using cognitive diagnosis models. Psychol. Methods 2006, 11, 287. [Google Scholar] [CrossRef] [PubMed]
- Wang, F.; Huang, Z.; Liu, Q.; Chen, E.; Yin, Y.; Ma, J.; Wang, S. Dynamic Cognitive Diagnosis: An Educational Priors-Enhanced Deep Knowledge Tracing Perspective. IEEE Trans. Learn. Technol. 2023, 16, 306–323. [Google Scholar] [CrossRef]
- Liu, Y.; Zhang, T.; Wang, X.; Yu, G.; Li, T. New development of cognitive diagnosis models. Front. Comput. Sci. 2023, 17, 171604. [Google Scholar] [CrossRef]
- Luo, J.; Hubaux, J.P. A survey of research in inter-vehicle communications. In Embedded Security in Cars: Securing Current and Future Automotive IT Applications; Springer: Berlin/Heidelberg, Germany, 2006; pp. 111–122. [Google Scholar]
- De La Torre, J. DINA model and parameter estimation: A didactic. J. Educ. Behav. Stat. 2009, 34, 115–130. [Google Scholar] [CrossRef]
- Wafa, M.N.; Zia, Z.; Frozan, F. Consistency and Ability of Students Using DINA and DINO Models. Eur. J. Math. Stat. 2023, 4, 7–13. [Google Scholar] [CrossRef]
- Frederiksen, N.; Mislevy, R.J.; Bejar, I.I. Test Theory for a New Generation of Tests; Routledge: London, UK, 2012. [Google Scholar]
- Nichols, P.D.; Chipman, S.F.; Brennan, R.L. Cognitively Diagnostic Assessment; Routledge: London, UK, 2012. [Google Scholar]
- Leighton, J.; Gierl, M. Cognitive Diagnostic Assessment for Education: Theory and Applications; Cambridge University Press: Cambridge, UK, 2007. [Google Scholar]
- Lee, Y.W.; Sawaki, Y. Cognitive diagnosis approaches to language assessment: An overview. Lang. Assess. Q. 2009, 6, 172–189. [Google Scholar] [CrossRef]
- Gu, Z. Maximizing the Potential of Multiple-Choice Items for Cognitive Diagnostic Assessment; University of Toronto Canada: Toronto, ON, Canada, 2011. [Google Scholar]
- Li, H.; Hunter, C.V.; Lei, P.W. The selection of cognitive diagnostic models for a reading comprehension test. Lang. Test. 2016, 33, 391–409. [Google Scholar] [CrossRef]
- Yang, Y. Modeling Nonignorable Missingness with Response Times Using Tree-Based Framework in Cognitive Diagnostic Models; Columbia University: New York, NY, USA, 2023. [Google Scholar]
- Yang, S.; Wei, H.; Ma, H.; Tian, Y.; Zhang, X.; Cao, Y.; Jin, Y. Cognitive diagnosis-based personalized exercise group assembly via a multi-objective evolutionary algorithm. IEEE Trans. Emerg. Top. Comput. Intell. 2023, 7, 829–844. [Google Scholar] [CrossRef]
- Qi, T.; Ren, M.; Guo, L.; Li, X.; Li, J.; Zhang, L. ICD: A new interpretable cognitive diagnosis model for intelligent tutor systems. Expert Syst. Appl. 2023, 215, 119309. [Google Scholar] [CrossRef]
- Ma, H.; Huang, Z.; Tang, W.; Zhu, H.; Zhang, H.; Li, J. Predicting Student Performance in Future Exams via Neutrosophic Cognitive Diagnosis in Personalized E-learning Environment. IEEE Trans. Learn. Technol. 2023, 16, 680–693. [Google Scholar] [CrossRef]
- Gao, W.; Wang, H.; Liu, Q.; Wang, F.; Lin, X.; Yue, L.; Zhang, Z.; Lv, R.; Wang, S. Leveraging Transferable Knowledge Concept Graph Embedding for Cold-Start Cognitive Diagnosis. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, Taipei, Taiwan, 23–27 July 2023; pp. 983–992. [Google Scholar]
- Wang, S.; Zeng, Z.; Yang, X.; Zhang, X. Self-supervised graph learning for long-tailed cognitive diagnosis. In Proceedings of the AAAI Conference on Artificial Intelligence, Washington, DC, USA, 7–14 February 2023; Volume 37, pp. 110–118. [Google Scholar]
- Wu, R.; Liu, Q.; Liu, Y.; Chen, E.; Su, Y.; Chen, Z.; Hu, G. Cognitive modelling for predicting examinee performance. In Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, Buenos Aires, Argentina, 25–31 July 2015. [Google Scholar]
- De La Torre, J.; Douglas, J.A. Higher-order latent trait models for cognitive diagnosis. Psychometrika 2004, 69, 333–353. [Google Scholar] [CrossRef]
- Tu, D.B.; Cai, Y.; Dai Hai-Qi, D. A polytomous cognitive diagnosis model: P-DINA model. Acta Psychol. Sin. 2010, 42, 1011. [Google Scholar] [CrossRef]
- Aryadoust, V. A cognitive diagnostic assessment study of the listening test of the Singapore–Cambridge general certificate of education O-level: Application of DINA, DINO, G-DINA, HO-DINA, and RRUM. Int. J. List. 2021, 35, 29–52. [Google Scholar] [CrossRef]
- Wang, C.; Liu, Q.; Chen, E.H.; Huang, Z.Y.; Zhu, T.Y.; Su, Y.; Hu, G.P. The rapid calculation method of DINA model for large scale cognitive diagnosis. Acta Electonica Sin. 2018, 46, 1047. [Google Scholar]
- Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference; Morgan Kaufmann: Burlington, MA, USA, 1988. [Google Scholar]
- Murphy, K.P. Inference and Learning in Hybrid Bayesian Networks; Citeseer: Berkeley, CA, USA, 1998. [Google Scholar]
- Tang, J.; Liu, X.; Wang, W. COVID-19 medical waste transportation risk evaluation integrating type-2 fuzzy total interpretive structural modeling and Bayesian network. Expert Syst. Appl. 2023, 213, 118885. [Google Scholar] [CrossRef] [PubMed]
- Chan, L.S.; Chu, A.M.; So, M.K. A moving-window bayesian network model for assessing systemic risk in financial markets. PLoS ONE 2023, 18, e0279888. [Google Scholar] [CrossRef] [PubMed]
- Kamil, M.Z.; Taleb-Berrouane, M.; Khan, F.; Amyotte, P.; Ahmed, S. Textual data transformations using natural language processing for risk assessment. Risk Anal. 2023, 43, 2033–2052. [Google Scholar] [CrossRef]
- Yang, H.; Qi, T.; Li, J.; Guo, L.; Ren, M.; Zhang, L.; Wang, X. A novel quantitative relationship neural network for explainable cognitive diagnosis model. Knowl. Based Syst. 2022, 250, 109156. [Google Scholar] [CrossRef]
- Conati, C.; Gertner, A.; Vanlehn, K. Using Bayesian networks to manage uncertainty in student modeling. User Model. User Adapt. Interact. 2002, 12, 371–417. [Google Scholar] [CrossRef]
- VanLehn, K.; Lynch, C.; Schulze, K.; Shapiro, J.A.; Shelby, R.; Taylor, L.; Treacy, D.; Weinstein, A.; Wintersgill, M. The Andes physics tutoring system: Lessons learned. Int. J. Artif. Intell. Educ. 2005, 15, 147–204. [Google Scholar]
- Käser, T.; Klingler, S.; Schwing, A.G.; Gross, M. Beyond knowledge tracing: Modeling skill topologies with bayesian networks. In Proceedings of the Intelligent Tutoring Systems: 12th International Conference, ITS 2014, Honolulu, HI, USA, 5–9 June 2014; Springer: Berlin/Heidelberg, Germany, 2014; pp. 188–198. [Google Scholar]
- Pelánek, R. Bayesian knowledge tracing, logistic models, and beyond: An overview of learner modeling techniques. User Model. User Adapt. Interact. 2017, 27, 313–350. [Google Scholar] [CrossRef]
- Liu, S.; Qu, H.; Chen, Q.; Jian, W.; Liu, R.; You, L. AFMeta: Asynchronous Federated Meta-learning with Temporally Weighted Aggregation. In Proceedings of the 2022 IEEE Smartworld, Ubiquitous Intelligence & Computing, Scalable Computing & Communications, Digital Twin, Privacy Computing, Metaverse, Autonomous & Trusted Vehicles (SmartWorld/UIC/ScalCom/DigitalTwin/PriComp/Meta), Haikou, China, 15–18 December 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 641–648. [Google Scholar]
- Liu, S.; He, T.; Li, J.; Li, Y.; Kumar, A. An effective learning evaluation method based on text data with real-time attribution-a case study for mathematical class with students of junior middle school in China. ACM Trans. Asian Low Resour. Lang. Inf. Process. 2023, 22, 1–22. [Google Scholar] [CrossRef]
- Liu, S.; Yu, X.; Ma, H.; Wang, Z.; Qin, C.; Zhang, X. Homogeneous Cohort-Aware Group Cognitive Diagnosis: A Multi-grained Modeling Perspective. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, Birmingham, UK, 21–25 October 2023; pp. 4094–4098. [Google Scholar]
- Zhang, S.; Huang, S.; Yu, X.; Chen, E.; Wang, F.; Huang, Z. A generalized multi-skill aggregation method for cognitive diagnosis. World Wide Web 2023, 26, 585–614. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Z.; Zhang, J.; Lu, J.; Tao, J. Bayesian estimation of the dina model with Pólya-gamma Gibbs sampling. Front. Psychol. 2020, 11, 384. [Google Scholar] [CrossRef] [PubMed]
- Bi, H.; Chen, E.; He, W.; Wu, H.; Zhao, W.; Wang, S.; Wu, J. BETA-CD: A Bayesian meta-learned cognitive diagnosis framework for personalized learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Washington, DC, USA, 7–14 February 2023; Volume 37, pp. 5018–5026. [Google Scholar]
- McLachlan, G. On Aitken’s method and other approaches for accelerating convergence of the EM algorithm. In Proceedings of the AC Aitken Centenary Conference, Dunedin, New Zealand, 28 August–1 September 1995; pp. 201–209. [Google Scholar]
- Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B (Methodol.) 1977, 39, 1–22. [Google Scholar] [CrossRef]
- Geiger, D.; Pearl, J. Logical and algorithmic properties of independence and their application to Bayesian networks. Ann. Math. Artif. Intell. 1990, 2, 165–178. [Google Scholar] [CrossRef]
- Thiesson, B.; Meek, C.; Heckerman, D. Accelerating EM for large databases. Mach. Learn. 2001, 45, 279–299. [Google Scholar] [CrossRef]
- Chang, H.S.; Hsu, H.J.; Chen, K.T. Modeling Exercise Relationships in E-Learning: A Unified Approach. In Proceedings of the EDM, Madrid, Spain, 26–29 June 2015; pp. 532–535. [Google Scholar]
- Hadi, M.A.; Fard, F.H. Evaluating pre-trained models for user feedback analysis in software engineering: A study on classification of app-reviews. Empir. Softw. Eng. 2023, 28, 88. [Google Scholar] [CrossRef]
- Koren, Y.; Bell, R.; Volinsky, C. Matrix factorization techniques for recommender systems. Computer 2009, 42, 30–37. [Google Scholar] [CrossRef]
- Wang, F.; Liu, Q.; Chen, E.; Huang, Z.; Yin, Y.; Wang, S.; Su, Y. NeuralCD: A general framework for cognitive diagnosis. IEEE Trans. Knowl. Data Eng. 2022, 35, 8312–8327. [Google Scholar] [CrossRef]
- Li, J.; Wang, F.; Liu, Q.; Zhu, M.; Huang, W.; Huang, Z.; Chen, E.; Su, Y.; Wang, S. HierCDF: A Bayesian Network-based Hierarchical Cognitive Diagnosis Framework. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 14–18 August 2022; pp. 904–913. [Google Scholar]
Aspect | Summary |
---|---|
Cognitive Diagnosis | Over 60 cognitive diagnosis models have been developed, including the rule-based model, attribute hierarchy model, DINA model, and various variations such as the Fuzzy CDF model. |
Bayesian Networks | Bayesian Networks were introduced in 1988 and further expanded in the 1990s, leading to their widespread application in various domains. |
DINA Model | The DINA model has recently been a focus of exploration in education for student modeling, knowledge tracing, and skill topology. |
Our Work | Our work not only incorporates a Bayesian network to capture the dependency relationship between knowledge points, but also enhances computational efficiency, particularly for large datasets. |
Symbol | Description |
---|---|
Q | Question-Knowledge point matrix |
R | Student-Question score matrix |
U | Set of student blocks |
i | Student number |
Question score of student i | |
Computed question scores of student block | |
Probability mass function (joint probability distribution function) | |
o | Initial knowledge point mastery probability |
Posterior probability of knowledge point mastery | |
Hidden mastery status matrix | |
r | Student’s score |
k | Sub-knowledge point |
K | Total number of knowledge points |
x | Parent knowledge point |
u | Student user |
t | Iteration number |
l | Combination of knowledge point mastery |
D | Number of combinations of knowledge point mastery |
Matrix of knowledge point mastery for all students | |
Matrix of knowledge point mastery for student user u | |
Matrix of knowledge point mastery for combination l | |
Estimated matrix of knowledge point mastery for student | |
s | Slip parameter |
g | Guess parameter |
v | Question |
V | Total number of questions |
n | Number of parent knowledge points |
N | Total number of student blocks |
Dataset | #Students | #Attributes | #Question Items | #Response Logs |
---|---|---|---|---|
Junyi | 10,000 | 734 | 734 | 408,057 |
Junyi-scaled | 2400 | 10 | 10 | 6100 |
Datasets | Metrics | IRT | MF | Neural CD | DINA | ||||
---|---|---|---|---|---|---|---|---|---|
Original | Hier | Original | Hier | Original | Hier | BNI-DINA | BNMI-DINA | ||
Junyi | ACC | 0.7218 | 0.6951 | 0.7221 | 0.7348 | 0.6821 | 0.6860 | 0.7516 | 0.7517 |
F1 | 0.8125 | 0.8177 | 0.8143 | 0.8210 | 0.8214 | 0.8210 | 0.8289 | 0.8290 | |
Time | 366 | 372 | 478 | 493 | 479 | 488 | 493 | 320 | |
Junyi-scaled | ACC | 0.6874 | 0.6889 | 0.7044 | 0.7228 | 0.6517 | 0.6771 | 0.7503 | 0.7503 |
F1 | 0.8079 | 0.8150 | 0.8171 | 0.8234 | 0.8266 | 0.8270 | 0.8303 | 0.8303 | |
Time | 174 | 201 | 213 | 227 | 229 | 241 | 258 | 158 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chen, Y.; Liang, S. BNMI-DINA: A Bayesian Cognitive Diagnosis Model for Enhanced Personalized Learning. Big Data Cogn. Comput. 2024, 8, 4. https://doi.org/10.3390/bdcc8010004
Chen Y, Liang S. BNMI-DINA: A Bayesian Cognitive Diagnosis Model for Enhanced Personalized Learning. Big Data and Cognitive Computing. 2024; 8(1):4. https://doi.org/10.3390/bdcc8010004
Chicago/Turabian StyleChen, Yiming, and Shuang Liang. 2024. "BNMI-DINA: A Bayesian Cognitive Diagnosis Model for Enhanced Personalized Learning" Big Data and Cognitive Computing 8, no. 1: 4. https://doi.org/10.3390/bdcc8010004
APA StyleChen, Y., & Liang, S. (2024). BNMI-DINA: A Bayesian Cognitive Diagnosis Model for Enhanced Personalized Learning. Big Data and Cognitive Computing, 8(1), 4. https://doi.org/10.3390/bdcc8010004