# A Mixture Hidden Markov Model to Mine Students’ University Curricula

## Abstract

## 1. Introduction

## 2. Data Description

## 3. Mixture Hidden Markov Models for Sequence Data

`seqHMM` [38,39] and details on the related algorithms are provided in [40]. Note that, to reduce the risk of converging to a local maximum, a problem typical of mixture models, we suggest repeating the estimation process several times with random starting values and retaining the solution with the highest log-likelihood.
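The multi-start strategy can be illustrated with a minimal sketch. It does not reproduce the `seqHMM` estimation routines: as a simplifying assumption, it fits a plain latent class model (a mixture of independent Bernoulli items, with no hidden Markov layer) by EM. The function names `em_from_random_start` and `fit_with_restarts` are hypothetical and serve only to show how repeated random starts are compared on the log-likelihood.

```python
import math
import random


def _e_step(data, weights, probs):
    """Return the log-likelihood and the posterior class probabilities."""
    loglik, post = 0.0, []
    for row in data:
        joint = []
        for w, p_k in zip(weights, probs):
            lik = w
            for y, p in zip(row, p_k):
                lik *= p if y == 1 else 1.0 - p
            joint.append(lik)
        total = sum(joint)
        loglik += math.log(total)
        post.append([q / total for q in joint])
    return loglik, post


def em_from_random_start(data, n_classes, rng, max_iter=500, tol=1e-8):
    """Run EM once, starting from randomly drawn item probabilities."""
    n_items = len(data[0])
    weights = [1.0 / n_classes] * n_classes
    probs = [[rng.uniform(0.2, 0.8) for _ in range(n_items)]
             for _ in range(n_classes)]
    previous = -math.inf
    for _ in range(max_iter):
        loglik, post = _e_step(data, weights, probs)
        if loglik - previous < tol:  # stop when the improvement is negligible
            break
        previous = loglik
        # M-step: update class weights and item probabilities
        for k in range(n_classes):
            n_k = max(sum(r[k] for r in post), 1e-12)
            weights[k] = n_k / len(data)
            for j in range(n_items):
                p = sum(r[k] * row[j] for r, row in zip(post, data)) / n_k
                probs[k][j] = min(max(p, 1e-6), 1.0 - 1e-6)
    # Recompute the log-likelihood at the parameters actually returned
    loglik, _ = _e_step(data, weights, probs)
    return loglik, weights, probs


def fit_with_restarts(data, n_classes, n_starts=10, seed=0):
    """Repeat EM from several random starts; keep the highest log-likelihood."""
    rng = random.Random(seed)
    runs = [em_from_random_start(data, n_classes, rng) for _ in range(n_starts)]
    return max(runs, key=lambda run: run[0])
```

Calling `fit_with_restarts(data, n_classes=2)` on a list of binary rows returns the log-likelihood, class weights, and item probabilities of the best run. The number of restarts is a tuning choice: more starts lower the chance of reporting a local maximum at the cost of computing time.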

## 4. Analysis of Student Paths

`seqHMM`. We first illustrate and discuss the results for the HM model (Section 4.2) and then extend the analysis to the MHM model with covariates (Sections 4.3 and 4.4). The discussion focuses on the differences among courses in the tendency to postpone the final tests and on the interpretation of the latent structure of the student population.

### 4.1. Model Specification

### 4.2. Hidden Markov Model

### 4.3. Mixture Hidden Markov Model

### 4.4. Effect of Concomitant Variables

## 5. Conclusions

## Author Contributions

## Funding

## Institutional Review Board Statement

## Informed Consent Statement

## Data Availability Statement

## Conflicts of Interest

## Abbreviations

Abbreviation | Meaning |
---|---|
AIC | Akaike Information Criterion |
BIC | Bayesian Information Criterion |
EM | Expectation-Maximization |
HS | High school |
HM | Hidden Markov |
MHM | Mixture Hidden Markov |

## Appendix A

Symbol | Description |
---|---|
$i=1,\dots ,n$ | individual (student) |
$j=1,\dots ,J$ | channel (exam) |
$t=1,\dots ,{T}_{i}$ | time occasion |
$h=1,\dots ,H$ | discrete state space |
$k=1,\dots ,K$ | latent class |
${y}_{itj}$ | observed binary state for student i at time t on exam j |
${y}_{ij}={({y}_{i1j},\dots ,{y}_{itj},\dots ,{y}_{iTj})}^{\prime}$ | sequence of observed binary states for student i on exam j |
${x}_{i}$ | vector of time-constant characteristics for student i |
${U}_{it}$ | discrete time-dependent latent variable |
${V}_{i}$ | discrete time-constant latent variable |
${u}_{t}$ | hidden state of ${U}_{it}$ at time t |
${v}_{k}$ | support point of ${V}_{i}$ for latent class k |
$p({U}_{i1}={u}_{1})$ | initial probability of starting from hidden state ${u}_{1}$ |
$p({U}_{it}={u}_{t}\mid {U}_{i,t-1}={u}_{t-1})$ | transition probability of moving from hidden state ${u}_{t-1}$ to hidden state ${u}_{t}$ |
${\pi}_{k}=p({V}_{i}={v}_{k})$ | mass probability (or weight) of ${v}_{k}$ |
${\pi}_{k}\left({x}_{i}\right)=p({V}_{i}={v}_{k}\mid {X}_{i}={x}_{i})$ | subject-specific mass probability (or weight) of ${v}_{k}$ |
$p({y}_{itj}\mid {U}_{it}={u}_{t})$ | conditional probability of the observed state given the hidden state |
$p({y}_{ij}\mid {U}_{i}=u,{V}_{i}={v}_{k})$ | conditional probability of the sequence of observed states given the hidden states and the latent class |
${\beta}_{0},{\beta}_{1}$ | regression coefficients |

## References

**Figure 1.** Observed state chronological sequences of first-year exams. Legend: exam taken (purple rectangle); exam not yet taken (green rectangle).

**Figure 2.** Observed state sequences of first-year exams, by gender (**top**), HS type (**middle**), and HS final grade (**bottom**). Legend: exam taken (purple rectangle); exam not yet taken (green rectangle).

**Figure 3.** Observed state sequences of first-year courses, by status at the end of the follow-up period (**top**), exams grade point average (**middle**), and average time to the last exam (**bottom**). Legend: exam taken (purple rectangle); exam not yet taken (green rectangle).

Student | Exam | Status | $\mathit{ts}1$ | $\mathit{ts}2$ | $\mathit{ts}3$ | … | $\mathit{ts}24$ | $\mathit{ts}25$ | … | $\mathit{ts}48$ |
---|---|---|---|---|---|---|---|---|---|---|
1 | Accounting | enrolled | 0 | 0 | 1 | 1 | 1 | 1 | … | 1 |
1 | Mathematics | enrolled | 0 | 0 | 0 | 0 | 0 | 0 | … | 0 |
… | … | … | … | … | … | … | … | … | … | … |
2 | Mathematics | graduates | 0 | 1 | 1 | 1 | 1 | NA | … | NA |
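The layout of this table can be sketched in a few lines of code. The helper below is hypothetical and rests on assumptions read off the rows shown: a cell is 0 while the exam has not yet been taken, 1 from the time slot in which it is taken onwards, and missing (NA, here `None`) after the student leaves the follow-up, for example by graduating. The parameter names `passed_slot` and `exit_slot` are illustrative and do not come from the original data set.

```python
def exam_sequence(n_slots, passed_slot=None, exit_slot=None):
    """Build the observed sequence of one exam for one student.

    n_slots     -- number of time slots (48 in the table above)
    passed_slot -- 1-based slot in which the exam is taken, or None if never
    exit_slot   -- last 1-based slot in which the student is observed, or None
    """
    sequence = []
    for t in range(1, n_slots + 1):
        if exit_slot is not None and t > exit_slot:
            sequence.append(None)  # NA: student no longer under observation
        elif passed_slot is not None and t >= passed_slot:
            sequence.append(1)     # exam taken
        else:
            sequence.append(0)     # exam not yet taken
    return sequence


# Rows resembling those in the table (slot values are read off the table)
accounting_student_1 = exam_sequence(48, passed_slot=3)
mathematics_student_1 = exam_sequence(48)
mathematics_student_2 = exam_sequence(48, passed_slot=2, exit_slot=24)
```

Once an exam is coded as taken the sequence never returns to 0, so each row is fully described by at most two slot indices.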

Course | Hidden State | Emission Prob. |
---|---|---|
Accounting | State 1 | 0.864 |
Accounting | State 2 | 0.982 |
Mathematics | State 1 | 0.319 |
Mathematics | State 2 | 0.607 |
Private law | State 1 | 0.411 |
Private law | State 2 | 0.774 |
Management | State 1 | 0.714 |
Management | State 2 | 0.987 |
Microeconomics | State 1 | 0.400 |
Microeconomics | State 2 | 0.829 |
Statistics | State 1 | 0.523 |
Statistics | State 2 | 0.907 |

**Table 3.** MHM models for $K=1,\dots ,10$: maximum log-likelihood $\widehat{\ell}$, number of free model parameters, BIC values, relative difference between consecutive BIC values (delta).

K | $\widehat{\mathit{\ell}}$ | # par. | BIC | Delta |
---|---|---|---|---|
1 | −93,482.0 | 58 | 187,497.2 | – |
2 | −88,708.0 | 117 | 178,491.6 | −0.048 |
3 | −86,138.3 | 176 | 173,894.7 | −0.026 |
4 | −84,680.2 | 235 | 171,520.8 | −0.014 |
5 | −82,988.5 | 294 | 168,679.9 | −0.017 |
6 | −82,127.9 | 353 | 167,501.0 | −0.007 |
7 | −81,204.7 | 412 | 166,197.0 | −0.008 |
8 | −80,145.0 | 471 | 164,620.1 | −0.009 |
9 | −79,074.0 | 530 | 163,020.5 | −0.010 |
10 | −78,774.5 | 589 | 162,963.9 | −0.000 |
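The BIC and delta columns of Table 3 can be recomputed from the log-likelihoods and the parameter counts. The sketch below assumes the standard definition BIC = −2ℓ + p·log(n). The value of log(n) is not reported in this section, so it is backed out from the K = 1 row; this is an assumption made for illustration, not a statement about the sample size used by the authors. The delta is computed as the difference between consecutive BIC values divided by the earlier one, as described in the caption.

```python
# Values copied from Table 3:
# K -> (maximum log-likelihood, number of free parameters, BIC)
TABLE3 = {
    1: (-93482.0, 58, 187497.2),
    2: (-88708.0, 117, 178491.6),
    3: (-86138.3, 176, 173894.7),
    4: (-84680.2, 235, 171520.8),
    5: (-82988.5, 294, 168679.9),
    6: (-82127.9, 353, 167501.0),
    7: (-81204.7, 412, 166197.0),
    8: (-80145.0, 471, 164620.1),
    9: (-79074.0, 530, 163020.5),
    10: (-78774.5, 589, 162963.9),
}


def implied_log_n(loglik, n_par, bic_value):
    """Solve BIC = -2 * loglik + n_par * log(n) for log(n)."""
    return (bic_value + 2.0 * loglik) / n_par


def bic(loglik, n_par, log_n):
    """Bayesian Information Criterion under the standard definition."""
    return -2.0 * loglik + n_par * log_n


def relative_deltas(bic_values):
    """Relative difference between consecutive BIC values."""
    return [(b1 - b0) / b0 for b0, b1 in zip(bic_values, bic_values[1:])]


log_n = implied_log_n(*TABLE3[1])  # assumption: recovered from the K = 1 row
bics = [bic(ll, p, log_n) for ll, p, _ in TABLE3.values()]
deltas = relative_deltas([row[2] for row in TABLE3.values()])

for k, d in zip(range(2, 11), deltas):
    print(f"K={k}: delta={d:.3f}")
```

The recomputed BIC values agree with the published column up to rounding, and the deltas shrink towards zero as K grows, which is the pattern the table is designed to show.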

**Table 4.** MHM model with concomitant variables: Average estimated class membership probabilities, number of students, estimated transition probabilities from state 1 to state 2.

Quantity | Class 1 | Class 2 | Class 3 | Class 4 | Class 5 |
---|---|---|---|---|---|
Avg. class prob. (${\overline{\widehat{\pi}}}_{k}\left({x}_{i}\right)$) | 0.279 | 0.169 | 0.148 | 0.172 | 0.231 |
# of students | 81 | 49 | 43 | 50 | 67 |
$\widehat{p}({U}_{it}=2\mid {U}_{i,t-1}=1)$ | 0.067 | 0.056 | 0.057 | 0.049 | 0.049 |

**Table 5.** MHM model with concomitant variables: Estimated emission probabilities by latent class (only first-year courses).

Course | Hidden State | Class 1 | Class 2 | Class 3 | Class 4 | Class 5 |
---|---|---|---|---|---|---|
Accounting | State 1 | 0.962 | 0.933 | 0.914 | 0.593 | 0.929 |
Accounting | State 2 | 1.000 | 1.000 | 1.000 | 0.904 | 1.000 |
Mathematics | State 1 | 0.739 | 0.626 | 0.269 | 0.049 | 0.013 |
Mathematics | State 2 | 0.972 | 0.962 | 0.656 | 0.245 | 0.363 |
Private law | State 1 | 0.770 | 0.000 | 0.658 | 0.322 | 0.241 |
Private law | State 2 | 1.000 | 0.461 | 0.998 | 0.767 | 0.725 |
Management | State 1 | 0.760 | 0.734 | 0.746 | 0.504 | 0.802 |
Management | State 2 | 1.000 | 0.981 | 0.981 | 0.958 | 1.000 |
Microeconomics | State 1 | 0.555 | 0.553 | 0.487 | 0.155 | 0.322 |
Microeconomics | State 2 | 0.972 | 0.968 | 0.851 | 0.600 | 0.800 |
Statistics | State 1 | 0.691 | 0.616 | 0.575 | 0.137 | 0.620 |
Statistics | State 2 | 1.000 | 0.974 | 0.934 | 0.280 | 0.994 |

**Table 6.** Performance of students at the end of the follow-up, by latent class: students’ status (proportion), exams grade point average (average values for all students and only for graduates), time to the last exam (average values for all students and only for graduates).

Variable | Class 1 | Class 2 | Class 3 | Class 4 | Class 5 | All |
---|---|---|---|---|---|---|
Status | | | | | | |
Enrolled | 0.037 | 0.102 | 0.093 | 0.560 | 0.299 | 0.207 |
Graduated | 0.963 | 0.898 | 0.907 | 0.440 | 0.672 | 0.786 |
Retired | 0.000 | 0.000 | 0.000 | 0.000 | 0.030 | 0.007 |
Exams grade point avg. (all) | 25.7 | 24.5 | 24.9 | 23.2 | 23.9 | 24.5 |
Exams grade point avg. (grad.) | 25.8 | 24.6 | 25.1 | 23.3 | 24.2 | 24.9 |
Time to the last exam (all) | 1287.8 | 1556.1 | 1409.4 | 1876.9 | 1741.1 | 1557.5 |
Time to the last exam (grad.) | 1281.0 | 1499.5 | 1362.3 | 1797.7 | 1661.2 | 1462.0 |

**Table 7.** Multinomial logit sub-model for latent class membership (reference is class 1): Estimated regression coefficients with standard errors, t statistics, and p-values.

Parameter | Estimate | Std. Error | t-Stat | p-Value |
---|---|---|---|---|
Latent class 2 | | | | |
Intercept | 3.360 | 1.471 | 2.283 | 0.022 |
Male | 0.124 | 0.403 | 0.308 | 0.758 |
HS type (ref.: Humanistic or scientific) | | | | |
Technical | −0.801 | 0.516 | −1.552 | 0.121 |
Vocational or others | 1.387 | 0.677 | 2.048 | 0.041 |
HS final grade | −0.049 | 0.018 | −2.807 | 0.005 |
Latent class 3 | | | | |
Intercept | 2.542 | 1.513 | 1.681 | 0.093 |
Male | 0.225 | 0.428 | 0.526 | 0.599 |
HS type (ref.: Humanistic or scientific) | | | | |
Technical | 0.769 | 0.442 | 1.739 | 0.082 |
Vocational or others | 2.301 | 0.681 | 3.381 | 0.001 |
HS final grade | −0.048 | 0.018 | −2.708 | 0.007 |
Latent class 4 | | | | |
Intercept | 5.686 | 1.380 | 4.119 | 0.000 |
Male | −0.442 | 0.409 | −1.081 | 0.280 |
HS type (ref.: Humanistic or scientific) | | | | |
Technical | 0.287 | 0.458 | 0.626 | 0.531 |
Vocational or others | 1.948 | 0.675 | 2.888 | 0.004 |
HS final grade | −0.081 | 0.017 | −4.880 | 0.000 |
Latent class 5 | | | | |
Intercept | 5.061 | 1.351 | 3.746 | 0.000 |
Male | −0.990 | 0.385 | −2.571 | 0.010 |
HS type (ref.: Humanistic or scientific) | | | | |
Technical | 1.151 | 0.403 | 2.859 | 0.004 |
Vocational or others | 2.427 | 0.638 | 3.805 | 0.000 |
HS final grade | −0.070 | 0.016 | −4.333 | 0.000 |
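To show how the coefficients in Table 7 translate into class membership probabilities, the sketch below evaluates a multinomial logit with class 1 as the reference (all its coefficients set to zero) for a single student profile. It assumes the usual softmax form of the sub-model and that the HS final grade enters the linear predictor on its raw scale; neither detail is confirmed in this section. The function name, the dictionary keys, and the example profile are hypothetical.

```python
import math

# Estimates copied from Table 7 (reference: latent class 1)
COEFS = {
    2: {"intercept": 3.360, "male": 0.124, "technical": -0.801,
        "vocational": 1.387, "hs_grade": -0.049},
    3: {"intercept": 2.542, "male": 0.225, "technical": 0.769,
        "vocational": 2.301, "hs_grade": -0.048},
    4: {"intercept": 5.686, "male": -0.442, "technical": 0.287,
        "vocational": 1.948, "hs_grade": -0.081},
    5: {"intercept": 5.061, "male": -0.990, "technical": 1.151,
        "vocational": 2.427, "hs_grade": -0.070},
}


def class_probabilities(male, hs_type, hs_grade):
    """Class membership probabilities for one student profile.

    male     -- 1 for male, 0 for female
    hs_type  -- "humanistic_scientific" (reference), "technical", "vocational"
    hs_grade -- HS final grade (assumed to enter on its raw scale)
    """
    scores = {1: 0.0}  # linear predictor of the reference class
    for k, b in COEFS.items():
        scores[k] = (b["intercept"]
                     + b["male"] * male
                     + b["technical"] * (hs_type == "technical")
                     + b["vocational"] * (hs_type == "vocational")
                     + b["hs_grade"] * hs_grade)
    top = max(scores.values())  # subtract the maximum for numerical stability
    expo = {k: math.exp(s - top) for k, s in scores.items()}
    total = sum(expo.values())
    return {k: e / total for k, e in expo.items()}


# Hypothetical profile: female, humanistic or scientific HS, final grade 80
probs = class_probabilities(male=0, hs_type="humanistic_scientific", hs_grade=80)
```

Because every HS final grade coefficient is negative, raising the grade while holding the other covariates fixed shifts probability towards the reference class 1.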

Variable | Class 1 | Class 2 | Class 3 | Class 4 | Class 5 | All |
---|---|---|---|---|---|---|
Gender (proportions) | | | | | | |
females | 0.469 | 0.388 | 0.395 | 0.480 | 0.642 | 0.486 |
males | 0.531 | 0.612 | 0.605 | 0.520 | 0.358 | 0.514 |
HS type (proportions) | | | | | | |
humanistic or scientific | 0.642 | 0.714 | 0.419 | 0.480 | 0.328 | 0.521 |
technical | 0.309 | 0.122 | 0.349 | 0.220 | 0.373 | 0.283 |
vocational or others | 0.049 | 0.163 | 0.233 | 0.300 | 0.299 | 0.197 |
HS final grade (averages) | 82.1 | 75.9 | 76.8 | 66.3 | 73.5 | 75.6 |

