Predicting Learner Contributions in MOOC Learning Forums Using the Hidden Markov Model
Abstract
1. Introduction
2. Literature Review
2.1. Learner Contributions in MOOC Learning Forums
2.2. The Hidden State of Learners in MOOC Learning Forums
2.3. Application of HMM
2.4. Research Gaps and Opportunities
3. Research Methodology
3.1. Contribution Sequence in MOOC Learning Forums
3.2. Covariates for Posts in MOOC Learning Forums
3.3. HMM Structure
3.4. HMM Modeling
3.5. Systematic Approach
4. Data and Method Implementation
4.1. Data Collection
4.2. Determining the Contribution Sequence of Learners
4.3. Determining the Covariates of Learners’ Posts
4.3.1. Evaluation Criteria for Assessing Henri’s Cognitive Level
4.3.2. Descriptive Statistics
4.4. Determining the Number of Hidden States for Learners
4.5. Predicting Learners’ Contributions in MOOC Learning Forums
5. Conclusions and Implications
5.1. Conclusions
5.2. Theoretical Implications
5.2.1. Exploring Learners’ Contributions Through HMM
5.2.2. Introduction of Henri Cognitive Level Analysis
5.2.3. Efficacy of HMM for Predicting Learners’ Contributions
5.3. Practical Implications
6. Limits and Future Research
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Perifanou, M.; Economides, A.A. The Landscape of MOOC Platforms Worldwide. Int. Rev. Res. Open Distrib. Learn. 2022, 23, 104–133. [Google Scholar] [CrossRef]
- Li, J.S.; Li, L.L.; Zhu, Z.X.; Shadiev, R. Research on the predictive model based on the depth of problem-solving discussion in MOOC forum. Educ. Inf. Technol. 2023, 28, 13053–13076. [Google Scholar] [CrossRef] [PubMed]
- Chen, J.; Fang, B.; Zhang, H.; Xue, X. A systematic review for MOOC dropout prediction from the perspective of machine learning. Interact. Learn. Environ. 2022, 32, 1642–1655. [Google Scholar] [CrossRef]
- Kokkodis, M.; Lappas, T.; Ransbotham, S. From Lurkers to Workers: Predicting Voluntary Contribution and Community Welfare. Inf. Syst. Res. 2020, 31, 607–626. [Google Scholar] [CrossRef]
- Rivera, D.A.; Frenay, M.; Paquot, M. The role of MOOC forum discussion tasks in learners’ cognitive engagement. J. Comput. Assist. Learn. 2024, 40, 2103–2120. [Google Scholar] [CrossRef]
- Almatrafi, O.; Johri, A. Systematic Review of Discussion Forums in Massive Open Online Courses (MOOCs). IEEE Trans. Learn. Technol. 2019, 12, 413–428. [Google Scholar] [CrossRef]
- Wu, B.; Wu, C.C. Research on the mechanism of knowledge diffusion in the MOOC learning forum using ERGMs. Comput. Educ. 2021, 173, 104295. [Google Scholar] [CrossRef]
- Chen, W.; Wei, X.; Zhu, K.X. Engaging Voluntary Contributions in Online Communities: A Hidden Markov Model. MIS Q. 2018, 42, 83–100. [Google Scholar] [CrossRef]
- Mor, B.; Garhwal, S.; Kumar, A.A. Systematic Review of Hidden Markov Models and Their Applications. Arch. Comput. Methods Eng. 2021, 28, 1429–1448. [Google Scholar] [CrossRef]
- Sun, Y.B.; Pfahringer, B.; Gomes, H.M.; Bifet, A. SOKNL: A novel way of integrating K-nearest neighbours with adaptive random forest regression for data streams. Data Min. Knowl. Discov. 2022, 36, 2006–2032. [Google Scholar] [CrossRef]
- Jin, J.; Liu, S.Z.; Ma, T.F. Robust and efficient subsampling algorithms for massive data logistic regression. J. Appl. Stat. 2023, 51, 1427–1445. [Google Scholar] [CrossRef] [PubMed]
- Cook, J.A.; Siddiqui, S. Random forests and selected samples. Bull. Econ. Res. 2020, 72, 272–287. [Google Scholar] [CrossRef]
- Ren, Q.X.; Wang, J.G. Research on Enterprise Digital-Level Classification Based on XGBoost Model. Sustainability 2023, 15, 2699. [Google Scholar] [CrossRef]
- Eggebrecht, P.; Lutkebohmert, E. A hybrid convolutional neural network with long short-term memory for statistical arbitrage. Quant. Financ. 2023, 23, 595–613. [Google Scholar] [CrossRef]
- Jiang, H.X.; Fan, S.K.; Zhang, N.; Zhu, B. Deep learning for predicting patent application outcome: The fusion of text and network embeddings. J. Informetr. 2023, 17, 101402. [Google Scholar] [CrossRef]
- Wu, B.; Chen, X.H. Continuance intention to use MOOCs: Integrating the technology acceptance model (TAM) and task technology fit (TTF) model. Comput. Hum. Behav. 2017, 67, 221–232. [Google Scholar] [CrossRef]
- Gong, J.; Liu, T.X.; Tang, J. How monetary incentives improve outcomes in MOOCs: Evidence from a field experiment. J. Econ. Behav. Organ. 2021, 190, 905–921. [Google Scholar] [CrossRef]
- Yu, S.Z.; Androsov, A.; Yan, H.B.; Chen, Y. Bridging computer and education sciences: A systematic review of automated emotion recognition in online learning environments. Comput. Educ. 2024, 220, 105111. [Google Scholar] [CrossRef]
- Almatrafi, O.; Johri, A. Improving MOOCs Using Information from Discussion Forums: An Opinion Summarization and Suggestion Mining Approach. IEEE Access 2022, 10, 15565–15573. [Google Scholar] [CrossRef]
- Kim, C.; Feng, B. Digital inequality in online reciprocity between generations: A preliminary exploration of ability to use communication technology as a mediator. Technol. Soc. 2021, 66, 101609. [Google Scholar] [CrossRef]
- Yang, B.K.; Tang, H.T.; Hao, L.; Rose, J.R. Untangling chaos in discussion forums: A temporal analysis of topic-relevant forum posts in MOOCs. Comput. Educ. 2022, 178, 104402. [Google Scholar] [CrossRef]
- Nguyen, M.D.; Cho, Y.S. A Hybrid Generative Model for Online User Behavior Prediction. IEEE Access 2020, 8, 3761–3771. [Google Scholar] [CrossRef]
- Mustafa, S.; Zhang, W. Predicting users knowledge contribution behaviour in technical vs non-technical online Q&A communities: SEM-Neural Network approach. Behav. Inf. Technol. 2022, 42, 2521–2544. [Google Scholar] [CrossRef]
- Mousavi, S.; Roper, S. Enhancing Relationships Through Online Brand Communities: Comparing Posters and Lurkers. Int. J. Electron. Commer. 2023, 27, 66–99. [Google Scholar] [CrossRef]
- Nguyen, M.; Malik, A.; Sharma, P. How to motivate employees to engage in online knowledge sharing? Differences between posters and lurkers. J. Knowl. Manag. 2021, 25, 1811–1831. [Google Scholar] [CrossRef]
- Badreddine, B.; Blount, Y.; Quilter, M. The role of personality traits in participation in an Online Cancer Community. Aslib J. Inf. Manag. 2022, 75, 318–341. [Google Scholar] [CrossRef]
- Kuo, T.M.; Tsai, C.C.; Wang, J.C. Linking web-based learning self-efficacy and learning engagement in MOOCs: The role of online academic hardiness. Internet High. Educ. 2021, 51, 100819. [Google Scholar] [CrossRef]
- Deng, R.Q.; Benckendorff, P.; Gannaway, D. Learner engagement in MOOCs: Scale development and validation. Br. J. Educ. Technol. 2020, 51, 245–262. [Google Scholar] [CrossRef]
- Wu, B. Influence of MOOC learners discussion forum social interactions on online reviews of MOOC. Educ. Inf. Technol. 2021, 26, 3483–3496. [Google Scholar] [CrossRef]
- Grewal, J.K.; Krzywinski, M.; Altman, N. Markov models—Training and evaluation of hidden Markov models. Nat. Methods 2020, 17, 121–122. [Google Scholar] [CrossRef]
- Elkimakh, K.; Nasroallah, A. Hidden Markov model steady-state estimation. Commun. Stat.-Simul. Comput. 2022, 51, 6792–6807. [Google Scholar] [CrossRef]
- Ray, S.; Kim, S.S.; Morris, J.G. The Central Role of Engagement in Online Communities. Inf. Syst. Res. 2014, 25, 528–546. [Google Scholar] [CrossRef]
- Amjad, T.; Shaheen, Z.; Daud, A. Advanced Learning Analytics: Aspect Based Course Feedback Analysis of MOOC Forums to Facilitate Instructors. IEEE Trans. Comput. Soc. Syst. 2022, 11, 4698–4706. [Google Scholar] [CrossRef]
- Cohen, A.; Shimony, U.; Nachmias, R.; Soffer, T. Active learners’ characterization in MOOC forums and their generated knowledge. Br. J. Educ. Technol. 2019, 50, 177–198. [Google Scholar] [CrossRef]
- Su, Z.; Yi, B. Research on HMM-Based Efficient Stock Price Prediction. Mob. Inf. Syst. 2022, 2022, 8124149. [Google Scholar] [CrossRef]
- Dridi, N.; Delignon, Y.; Sawaya, W. BIC and AIC criteria for the hidden Markov chain Application to numerical communication. Trait. Signal 2014, 31, 383–400. [Google Scholar] [CrossRef][Green Version]
- Tausczik, Y.R.; Pennebaker, J.W. The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods. J. Lang. Soc. Psychol. 2010, 29, 24–54. [Google Scholar] [CrossRef]
- Google Machine Learning Glossary [EB/OL]. Available online: https://developers.google.cn/machine-learning/glossary (accessed on 26 January 2024).
- Neha; Kim, E. Designing effective discussion forum in MOOCs: Insights from learner perspectives. Front. Educ. 2023, 8, 1223409. [Google Scholar] [CrossRef]
- Deng, Q.; Soffker, D. A Review of HMM-Based Approaches of Driving Behaviors Recognition and Prediction. IEEE Trans. Intell. Veh. 2022, 7, 21–31. [Google Scholar] [CrossRef]
- Zhang, X.L.; Xu, M.L. AUC optimization for deep learning-based voice activity detection. Eurasip J. Audio Speech Music Process. 2022, 1, 27. [Google Scholar] [CrossRef]
- Kim, T.; Lee, J.S. Maximizing AUC to learn weighted naive Bayes for imbalanced data classification. Expert Syst. Appl. 2023, 217, 119564. [Google Scholar] [CrossRef]
- Liu, S.; Liu, S.Q.; Liu, Z.; Peng, X.; Yang, Z.K. Automated detection of emotional and cognitive engagement in MOOC discussions to predict learning achievement. Comput. Educ. 2022, 181, 104461. [Google Scholar] [CrossRef]
- Liu, Z.; Mu, R.; Yang, Z.K.; Peng, X.; Liu, S.Y.Y.; Chen, J. Modeling temporal cognitive topic to uncover learners’ concerns under different cognitive engagement patterns. Interact. Learn. Environ. 2023, 31, 7196–7213. [Google Scholar] [CrossRef]





| Variables | Definition |
|---|---|
| Forum actions | |
| Likes per thread received by a leaner | |
| Subsequent responses per thread received by a leaner | |
| Responses per thread received by a leaner | |
| Learner actions | |
| Original posts created by a learner | |
| Original posts replied to by a learner | |
| Weeks since a learner’s last thread initiation | |
| Mean interval between a learner’s thread initiations | |
| Pre-existing count of replies to the original thread before a learner responds | |
| Sub-replies authored by a learner per reply | |
| Replies contributed by a learner per original thread | |
| Average Henri value of a learner’s original, reply, and sub-reply threads |
| Hierarchy | Description | Indicator | Value |
|---|---|---|---|
| No Cognition | Cognitive level is not involved | Asking questions, seeking help, engaging in polite conversations | 0 |
| Basic Clarification | Observing problems, analyzing basic concepts, sorting out connections, and summarizing understanding | Identifying and defining basic concepts; Involving basic subject knowledge; Restating the problem; Asking relevant questions | 1 |
| In-depth Clarification | Analyzing problems, deeply understanding assumptions, logic, conclusions, and application value | Using and defining terminology; Establishing a reference taxonomy; Utilizing examples and analogies | 2 |
| Inference | Endorsing or presenting a point of view through induction and deduction based on accepted facts | Drawing conclusions, making inferences, and elaborating ideas based on previous statements | 3 |
| Judgment | Making decisions and expressing appreciation, criticism, or support | Assessing relevance, effectiveness, and correctness of solutions; Making value judgments; Evaluating reasonableness | 4 |
| Strategy | Proposing specific solutions or actions | Deciding to act; Proposing solutions | 5 |
| Variables | Average | Std. Deviation | Median | Min | Max |
|---|---|---|---|---|---|
| 0.58 | 1.11 | 0 | 0 | 3 | |
| 0.05 | 0.60 | 0 | 0 | 193 | |
| 0.16 | 1.13 | 0 | 0 | 74 | |
| 0.72 | 21.11 | 0 | 0 | 9964 | |
| 0.18 | 0.53 | 0 | 0 | 20 | |
| 0.23 | 0.70 | 0 | 0 | 75 | |
| 3.33 | 2.54 | 3 | 0 | 10 | |
| 1.42 | 1.32 | 1 | 0 | 10 | |
| 17.21 | 146.86 | 0 | 0 | 3048 | |
| 0.34 | 1.15 | 0 | 0 | 117 | |
| 0.25 | 0.68 | 0 | 0 | 43 | |
| 0.56 | 1.26 | 0 | 0 | 5 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Wu, B.; Xie, R. Predicting Learner Contributions in MOOC Learning Forums Using the Hidden Markov Model. Appl. Sci. 2026, 16, 881. https://doi.org/10.3390/app16020881
Wu B, Xie R. Predicting Learner Contributions in MOOC Learning Forums Using the Hidden Markov Model. Applied Sciences. 2026; 16(2):881. https://doi.org/10.3390/app16020881
Chicago/Turabian StyleWu, Bing, and Ruodan Xie. 2026. "Predicting Learner Contributions in MOOC Learning Forums Using the Hidden Markov Model" Applied Sciences 16, no. 2: 881. https://doi.org/10.3390/app16020881
APA StyleWu, B., & Xie, R. (2026). Predicting Learner Contributions in MOOC Learning Forums Using the Hidden Markov Model. Applied Sciences, 16(2), 881. https://doi.org/10.3390/app16020881

