Computational Methods for Estimating the Evidence and Bayes Factor in SEIR Stochastic Infectious Diseases Models Featuring Asymmetrical Dynamics of Transmission

Muteb Faraj Alharthi

doi:10.3390/sym15061239

Department of Mathematics and Statistics, College of Science, Taif University, Taif 21944, Saudi Arabia

Symmetry 2023, 15(6), 1239; https://doi.org/10.3390/sym15061239

Abstract

Stochastic epidemic models can be an essential public health tool for understanding and controlling disease progression. Perhaps the best illustration of their importance and usefulness is the substantial influence that these models have had during the global COVID-19 epidemic. Nonetheless, these models are of limited practical use unless they provide an adequate fit to real-life epidemic outbreaks. In this work, we consider the problem of model selection for epidemic models given temporal observations of a disease outbreak. The epidemic models are stochastic individual-based transmission models of the Susceptible–Exposed–Infective–Removed (SEIR) type. The main focus is on the use of the model evidence (or marginal likelihood), and hence the Bayes factor, as a gold-standard measure of merit for comparing the fits of models to data. Even though the Bayes factor has been discussed in the epidemic modeling literature, little focus has been given to the fundamental issues surrounding its utility and computation. Based on various asymmetrical infection mechanism assumptions, we derive analytical expressions for Bayes factors which offer helpful suggestions for model selection problems. We also explore theoretical aspects that highlight the need for caution when utilizing the Bayes factor as a model selection technique, such as when the within-model prior distributions become more asymmetrical (diffuse or informative). Three computational methods for estimating the marginal likelihood, and hence the Bayes factor, are discussed: the arithmetic mean estimator, the harmonic mean estimator, and the power posterior estimator. The theory and methods are illustrated using artificial data.

Keywords:

epidemic model; infectious diseases data; model selection; marginal likelihood; Bayes factors

1. Introduction and Related Background

Stochastic infectious disease models can be a crucial public health tool for understanding and managing disease progression. More specifically, such models may be used to analyze the efficacy of suggested control measures [1,2] or to examine vaccination strategies [3]. The global COVID-19 epidemic serves as the best illustration of the importance of these models, where the results of such epidemiological modeling have been used to estimate the basic reproduction number [4,5], assess the effectiveness of disease-control measures [6,7] and inform policy-making [8]. These models, however, are of limited use in practice unless they allow efficient estimation of their parameters and offer a good fit to actual epidemic outbreaks. Unfortunately, stochastic epidemic models do not lend themselves to simple parameter inference or model selection, because epidemic outbreak data are typically both dependent and incomplete.

Even if a model’s parameters can be inferred, selecting the best-suited model is a challenging task. For instance, as epidemic data are not independent, straightforward and standard measures of model selection cannot be applied directly. Due to these issues, stochastic epidemic model selection is still in its infancy, and there is a clear need for development [9,10,11].

The focus of this paper is on model selection given complete epidemic data. The intention is twofold: first, to derive explicit expressions for the Bayes factor in asymmetrical epidemic settings, in order to gain some insight into the value of Bayes factors as a tool for model selection; and second, to use these expressions to validate Bayes factor estimates obtained from other computational methods proposed in the literature, in order to see how reliable those methods are in situations where complete epidemic data are available but Bayes factors cannot be obtained in closed form.

1.1. The Stochastic SEIR Epidemic Model

The SEIR model is defined as in [12,13,14]. A closed population of size $N = n + k$ is considered, where n and k denote the initial numbers of susceptible and infective individuals, respectively. The population is divided into four compartments, called the susceptible, exposed, infectious and removed classes. Let $S(t)$, $E(t)$, $I(t)$ and $R(t)$ denote the numbers of individuals in the respective classes at time t (so $S(t) + E(t) + I(t) + R(t) = N$ for all t). It is also assumed that the epidemic starts at $t = 0$ with $(S(0), E(0), I(0), R(0)) = (n, 0, k = 1, 0)$. An individual, while infectious, has contacts with other individuals at the time points of an inhomogeneous Poisson process with rate $g(β, t)$, where $β$ is referred to as the infection rate parameter. The overall infection rate function depends on the infection process assumptions. Each such contact with a susceptible results in the susceptible immediately becoming infected (exposed). Individuals that receive an infection are initially latent for a time period of $D_{L}$ with distribution $F_{D_{L}}$, then infectious for a duration of $D_{I}$ with distribution $F_{D_{I}}$, and finally they become recovered for the remaining time period. The disease spreads until, for the first time, there are no exposed or infectious individuals left in the population. At this point, no more individuals can receive the infection, and therefore the epidemic ends. All latent periods, infectious periods, Poisson processes, and uniform contact choices are assumed to be mutually independent.

In this work, we shall focus on assessing the extent to which the infection mechanism is important in the disease spread via asymmetric epidemic scenarios [15,16,17,18]. In particular, we consider comparing the standard SEIR model described above with two alternative SEIR epidemic models. The three models are:

$M_{1}$ : The SEIR with overall infection rate $g (β, t) = β n^{- 1} S (t) I (t)$ .
$M_{2}$ : The SEIR with overall infection rate $g (β, t) = β n^{- 1} e^{- ψ t} S (t) I (t); ψ \in (0, \infty)$ .
$M_{3}$ : The SEIR with overall infection rate $g (β, t) = β n^{- 1} S (t) I^{ζ} (t); ζ \in (0, 0.5)$ .

The second model $M_{2}$ is a variation of the standard SEIR model $M_{1}$ created by relaxing the assumption of a constant infection rate. In epidemic modeling, it is a common assumption that the infection rate, $β$, remains constant throughout time; however, this is not always accurate. Specifically, the implementation of control measures or changes in behavior in reaction to the epidemic could both have an impact on the infection rate over time. In this model, a time-dependent infection rate, $β(t) = n^{-1} β e^{-ψ t}$, is considered, where we shall assume that $ψ = 0.1$ in order to discriminate $M_{2}$ from $M_{1}$, as setting $ψ = 0$ in the former yields the latter.

The third model $M_{3}$ is similarly an adaptation of the standard SEIR model $M_{1}$, where the power parameter $ζ$ represents the degree of interaction among infectious and susceptible individuals. We shall assume that $ζ = 0.3$ [10,19] in order to more clearly distinguish between the two models $M_{3}$ and $M_{1}$. This way of modeling the infection rate is justified by the fact that the rate of new infections need not increase linearly in $S(t)$ and $I(t)$. As the epidemic grows, for instance, susceptibles could become more conscious of the possibility of catching the disease and adjust their behavior accordingly. As a result, the power coefficient parameter $ζ$ is introduced, where the smaller the $ζ$, the less susceptibles are exposed to the infectives.
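The three overall infection rate functions can be written down directly. The following is a minimal sketch, assuming the state $(S(t), I(t))$ is supplied by the caller; the function names and the example state are hypothetical and are not taken from the paper.

```python
import math


def rate_m1(beta, n, s, i):
    """M1: g(beta, t) = beta * S(t) * I(t) / n."""
    return beta * s * i / n


def rate_m2(beta, n, s, i, t, psi=0.1):
    """M2: g(beta, t) = beta * exp(-psi * t) * S(t) * I(t) / n."""
    return beta * math.exp(-psi * t) * s * i / n


def rate_m3(beta, n, s, i, zeta=0.3):
    """M3: g(beta, t) = beta * S(t) * I(t)**zeta / n."""
    return beta * s * (i ** zeta) / n


# Hypothetical state: 50 susceptibles and 10 infectives at time t = 5.
print(rate_m1(3.0, 99, 50, 10))
print(rate_m2(3.0, 99, 50, 10, t=5.0))
print(rate_m3(3.0, 99, 50, 10))
```

Setting `psi=0` in `rate_m2` or `zeta=1` in `rate_m3` recovers `rate_m1`, which is why fixed non-trivial values of $ψ$ and $ζ$ are needed to keep the models distinct.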

1.1.1. Transition Probabilities

For the model $M_{1}$ ($M_{2}$ and $M_{3}$ can be treated similarly), if both $D_{L}$ and $D_{I}$ are exponentially distributed (with rates $δ$ and $γ$, say), the model is called a Markovian SEIR model (see Figure 1). A special case of this SEIR model is one in which the exposed (latent) period $D_{L}$ is not random, as in [12,13,14]. In this study, a fixed latent period of length l is assumed. The epidemic then develops based on the following transition probabilities in a small interval $[t, t + Δt)$,

P[(S(t + Δt), E(t + Δt)) = (S(t) - 1, E(t) + 1)] = β n^{-1} S(t) I(t) Δt + o(Δt),

P[(I(t + Δt), R(t + Δt)) = (I(t) - 1, R(t) + 1)] = γ I(t) Δt + o(Δt).
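These transition probabilities, together with the fixed latent period, suggest an event-driven simulation. The sketch below is one possible implementation under stated assumptions, not the author's code: it handles model $M_{1}$ only, starts from $(n, 0, 1, 0)$ at $t = 0$, and the function name `simulate_seir_m1` and its return format are hypothetical.

```python
import heapq
import random


def simulate_seir_m1(n, beta, gamma, latent, rng):
    """Simulate M1 with a fixed latent period and Exp(gamma) infectious periods.

    Returns the exposure times, the removal times and the final (S, E, I, R).
    """
    s, e, i, r = n, 0, 1, 0
    t = 0.0
    pending = []  # min-heap of times at which exposed individuals become infectious
    exposures, removals = [], []
    while e + i > 0:
        inf_rate = beta * s * i / n
        rem_rate = gamma * i
        total = inf_rate + rem_rate
        # Waiting time to the next random event (infinite if nobody is infectious).
        wait = rng.expovariate(total) if total > 0 else float("inf")
        next_activation = pending[0] if pending else float("inf")
        if t + wait < next_activation:
            t += wait
            if rng.random() * total < inf_rate:
                # S -> E: schedule the deterministic E -> I move after the latent period.
                s, e = s - 1, e + 1
                exposures.append(t)
                heapq.heappush(pending, t + latent)
            else:
                # I -> R
                i, r = i - 1, r + 1
                removals.append(t)
        else:
            # A latent period ends first; the exponential clocks are memoryless,
            # so the random waiting time is simply redrawn on the next pass.
            t = heapq.heappop(pending)
            e, i = e - 1, i + 1
    return exposures, removals, (s, e, i, r)


exposures, removals, final_state = simulate_seir_m1(
    n=99, beta=3.0, gamma=1.0, latent=1.0, rng=random.Random(1)
)
print(len(removals), final_state)
```

The loop stops at the first time with no exposed or infectious individuals, which matches the stopping rule of the model definition.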

Figure 1. The Susceptible (S), Exposed (E), Infectious (I) and Removed (R) (SEIR) model.

1.1.2. Types of Epidemic Data

There are generally two types of epidemic data, temporal data and final size data [20]. Final size data do not show how the disease spreads over time during the epidemic; they only show snapshot information at the beginning and end of the outbreak. Temporal data, on the other hand, reveal details about the state of individuals during the epidemic, and may correspond to the times at which individuals are infected or removed. Data that contain both infection and removal times are referred to as complete temporal data, whereas data that only include removal times are referred to as partial temporal data.

1.1.3. The Basic Reproduction Number $R_{0}$

Epidemic models frequently exhibit threshold behavior. Roughly speaking, this means that during an epidemic either relatively few individuals become infected or a substantial number are infected. The basic reproduction number, $R_{0}$, which is heuristically defined as the average number of new infections brought on by a single infective in a large susceptible population, is a quantity of critical relevance in mathematical epidemic theory (see, e.g., [21]). This quantity is important because, roughly speaking, in a population of infinitely many susceptibles, if $R_{0} \leq 1$ then, with probability one, only a finite number of susceptibles will become infected (i.e., minor outbreak). However, if $R_{0} > 1$ there is a positive probability that infinitely many susceptibles will become infected (i.e., major outbreak) and the disease becomes endemic. Knowledge of the value of $R_{0}$ makes it possible to calculate the proportion of a population that should be vaccinated in order to prevent an epidemic from spreading.

1.2. Marginal Likelihood Estimation

The marginal likelihood [22,23], also referred to as the evidence or integrated likelihood, is a quantity that gives a standard measure of the fit of a model in a Bayesian setting. Given multiple models that give reasonable predictions, the marginal likelihood of each model offers a mechanism to discriminate between them: it measures how likely the data are under the considered model. The difficulty is that computing the marginal likelihood is rarely straightforward, even in simple models, and in more complex models it requires high-dimensional integration. In the majority of models, it is not possible to analytically integrate out the parameters from the joint distribution of the data and the parameters. This means that numerical methods are required to obtain estimates, and these methods may be computationally costly.

The marginal likelihood can be formally defined as follows. Given a data set $y$, a statistical model M with parameter $θ$, a prior distribution $π(θ | M)$, a likelihood $π(y | θ, M)$ and a posterior distribution $π(θ | y, M)$, then $π(y | M)$ is the marginal likelihood of the model. It represents the likelihood of the data given the model. It can be calculated by using the equality

π(y | M) = \int π(y | θ, M) π(θ | M) dθ,

where approximations shall be denoted as $\hat{π}(y)$, and the conditioning on M shall be omitted for notational simplicity.

1.3. Bayes Factors

The Bayes factor [23] is one of the most crucial tools for Bayesian model comparison and hypothesis testing. It can be calculated once the marginal likelihoods of the competing models have been determined. Now, suppose there are two competing epidemic models, $M_{i}$ and $M_{j}$, with parameters $θ_{i}$ and $θ_{j}$, respectively, and $y$ is the observed data set. The Bayes factor in favor of model $M_{i}$ over model $M_{j}$ is then given by

BF_{ij} = \frac{π(y | M_{i})}{π(y | M_{j})} = \frac{\int π(y | θ_{i}) π(θ_{i}) dθ_{i}}{\int π(y | θ_{j}) π(θ_{j}) dθ_{j}},

where here, and throughout the paper, $π$ denotes a density function.

Analytical calculation of Bayes factors is often challenging, and their estimation is, in general, difficult as well. Many approaches exist to estimate Bayes factors, but here we focus specifically on simulation-based and easy-to-implement methods. One appealing feature of these techniques is that they are relatively prescriptive, leaving the user with few implementation options that might substantially impact the performance of the resulting algorithms.

1.4. Model Selection within the Stochastic Epidemic Models Literature

There is currently no favored approach for model selection in the literature of epidemic modeling [11]. In the Bayesian setting, approaches include the use of Bayes factors [24,25,26,27], criteria such as the Deviance Information Criterion (DIC) [26,27,28], and methods based on the predictive distribution of future outbreaks [10,29]. Bayesian model selection tools have been applied to epidemic models with final size data and temporal data in certain scenarios. Here, we focus on the use of Bayes factors via marginal likelihood estimation.

The use of reversible jump Markov chain Monte Carlo techniques has been a common approach for computing Bayes factors for epidemics [24,25], although these methods are often problematic in practice due to the challenge of designing efficient algorithms. Given removal data, a combination of MCMC methods and importance sampling was used to estimate marginal likelihoods from which Bayes factors can be calculated [30]. A path-sampling method [31] was implemented to compare epidemic models given data on the final outcome of the epidemic in [26]. The power posterior approach [32] was extended and implemented for estimating the Bayes factor in the epidemic context where the epidemic data are only partially observed [27]. A more complete picture about this topic can be found in [10,11].

2. Bayesian Inference via MCMC Methods for the SEIR Model with Time-Dependent Infection Rate

2.1. Complete Data Case

Suppose that an SEIR system is considered and let m denote the number of individuals becoming infectious at times $i = (i_{1}, \dots, i_{m})$ and removed from the population at times $r = (r_{1}, \dots, r_{m})$ with $0 = r_{1} \leq r_{2} \leq \dots \leq r_{m}$. Similarly, let $e = \{e_{j} : j \neq ω\}$ denote the set of the $m - 1$ exposure times, that is, the times at which individuals catch the infection but are not yet able to spread it, such that for $1 \leq j \leq m$, $e_{j}$ refers to the exposure time of the individual who becomes infectious at time $i_{j} = e_{j} + l$ and recovers at time $r_{j}$, where the initial exposed individual is labeled by $ω$ such that $e_{ω} < e_{j}$ for all $j \neq ω$.

Now, taking into account model $M_{2}$, the likelihood of the data given the model parameters may be written as

\begin{aligned} L(e, i, r | β, ψ, θ, e_{ω}) = {} & \left(\prod_{j = 1, j \neq ω}^{m} h_{ψ}(β, e_{j}^{-}) I(e_{j}^{-})\right) \times \exp\left(- \int_{e_{ω}}^{r_{m}} h_{ψ}(β, t) S(t) I(t) dt\right) \\ & \times \prod_{j = 1}^{m} f_{D_{I}}(r_{j} - i_{j} | θ), \end{aligned}

(1)

where $h_{ψ}(β, s) = n^{-1} β e^{-ψ s}$, with $e_{k} = i_{k} = r_{k} = \infty$ for $k = m + 1, \dots, N$, and $θ$ indicates the parameter governing the infectious period, $D_{I}$, where $θ$ may be a vector.

Note that, when the infectious period distribution is exponential with rate $γ$ ($r_{j} - i_{j} \sim Exp(γ)$), then

\prod_{j = 1}^{m} f_{D_{I}}(r_{j} - i_{j} | θ) = \prod_{j = 1}^{m} γ \exp(- γ (r_{j} - i_{j}))

(2)

Given complete epidemic data, the Bayesian framework can first be used by assigning the model parameters independent conjugate gamma prior distributions as in [33], namely $β \sim Γ(λ_{β}, ν_{β})$ and $γ \sim Γ(λ_{γ}, ν_{γ})$. Then, the joint posterior density for $β$ and $γ$ (assuming $ψ$ is known) is obtained by multiplying the prior distributions and the likelihood as follows.

\begin{aligned} π(β, γ | e, i, r, e_{ω}) \propto {} & L(e, i, r, e_{ω} | β, γ) π(β) π(γ) \\ = {} & \left(\prod_{j = 1, j \neq ω}^{m} n^{-1} β e^{-ψ e_{j}^{-}} I(e_{j}^{-})\right) \times \exp(- n^{-1} β ξ_{ψ}) \\ & \times \prod_{j = 1}^{m} γ \exp(- γ (r_{j} - i_{j})) \times β^{λ_{β} - 1} e^{- ν_{β} β} \times γ^{λ_{γ} - 1} e^{- ν_{γ} γ}, \end{aligned}

(3)

where

ξ_{ψ} = \int_{e_{ω}}^{r_{m}} e^{-ψ t} S(t) I(t) dt.

The parameters $β$ and $γ$ are a posteriori conditionally independent, so we have

π(β | e, e_{ω}, i, r) \equiv Γ(λ_{β} + m - 1, ν_{β} + n^{-1} ξ_{ψ}),

(4)

π(γ | e, e_{ω}, i, r) \equiv Γ(λ_{γ} + m, ν_{γ} + \sum_{j = 1}^{m} (r_{j} - i_{j})).

(5)

The posterior distributions of the two parameters, $β$ and $γ$, are thus available in closed form, making it simple to obtain any function of these parameters.
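The conjugate updates in (4) and (5) only require the integral $ξ_{ψ}$ and the total infectious time. The sketch below is an illustration under stated assumptions: $S(t)$ and $I(t)$ are treated as piecewise constant between events, so $ξ_{ψ}$ is accumulated interval by interval; the function names and the toy data set are hypothetical, and `exposures` holds only the $m - 1$ non-initial exposure times.

```python
import math


def xi_psi(exposures, infections, removals, n, psi=0.0):
    """Integral of exp(-psi * t) * S(t) * I(t) over the epidemic."""
    events = [(t, "E") for t in exposures]
    events += [(t, "I") for t in infections]
    events += [(t, "R") for t in removals]
    events.sort()
    s, i = n, 0
    total = 0.0
    prev = events[0][0]
    for t, kind in events:
        # Exact integral of exp(-psi * u) over [prev, t); S and I are constant there.
        if psi > 0:
            weight = (math.exp(-psi * prev) - math.exp(-psi * t)) / psi
        else:
            weight = t - prev
        total += s * i * weight
        if kind == "E":
            s -= 1
        elif kind == "I":
            i += 1
        else:
            i -= 1
        prev = t
    return total


def posterior_params(exposures, infections, removals, n, psi,
                     lam_beta, nu_beta, lam_gamma, nu_gamma):
    """Gamma (shape, rate) parameters of Eqs. (4) and (5)."""
    m = len(removals)
    xi = xi_psi(exposures, infections, removals, n, psi)
    infectious_time = sum(removals) - sum(infections)  # sum of (r_j - i_j)
    beta_post = (lam_beta + m - 1, nu_beta + xi / n)
    gamma_post = (lam_gamma + m, nu_gamma + infectious_time)
    return beta_post, gamma_post


# Toy data (hypothetical): n = 2 susceptibles, latent period l = 1, m = 2 cases.
print(posterior_params([1.0], [0.0, 2.0], [3.0, 4.0], n=2, psi=0.0,
                       lam_beta=1.0, nu_beta=0.1, lam_gamma=1.0, nu_gamma=0.1))
```

With `psi=0` the same routine yields the quantity $ξ$ needed for model $M_{1}$.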

Note that the inference for models $M_{1}$ and $M_{3}$, given complete data, can be performed in a similar manner (see, e.g., [10,14]).

2.2. Incomplete Data Case

When the exposure times, and hence the infection times, are not observed, the likelihood of the removal times becomes intractable. A common way to handle this issue is to use a data augmentation technique [34,35], treating the missing data as extra parameters to be inferred from the data [33].

We again derive the following full conditional posterior distributions by assigning independent conjugate gamma prior distributions for the model parameters $β$ and $γ$, and further assuming $e_{ω}$ has an improper uniform prior density on $(- \infty, r_{1})$,

π(β | γ, e, e_{ω}, i, r) \equiv Γ(λ_{β} + m - 1, ν_{β} + n^{-1} ξ_{ψ}),

(6)

π(γ | β, e, e_{ω}, i, r) \equiv Γ(λ_{γ} + m, ν_{γ} + \sum_{j = 1}^{m} (r_{j} - i_{j})),

(7)

and

π(i, e, e_{ω} | β, γ, r) \propto \prod_{j = 1, j \neq ω}^{m} e^{-ψ e_{j}^{-}} I(e_{j}^{-}) \times \exp(- β n^{-1} ξ_{ψ}) \times \prod_{j = 1}^{m} \exp(- γ (r_{j} - i_{j})).

(8)

Given that the full conditional posterior distributions of the model parameters $β$ and $γ$ are available in closed form, they can be updated via Gibbs sampling steps [10,33,36]. However, as the conditional distribution of the component $(i, e, e_{ω})$ is not available in closed form, it needs to be updated using a Metropolis–Hastings step. In general, it is possible to apply (with minor adjustments) the MCMC technique in [10,33] to update both the model parameters and the missing exposure and infection times. Bayesian inference for models $M_{1}$ and $M_{3}$ can be performed in a similar way (see, e.g., [10,27]).

However, the focus of this paper is on model choice given complete epidemic data, where we aim to derive explicit expressions for the Bayes factor in asymmetric infection process settings to validate the estimate of the Bayes factor derived from other proposed methods.

2.3. The Behavior of the SEIR Models Featuring Asymmetrical Dynamics of Transmission

One crucial issue of special significance when modeling an infectious disease is how infectious the disease is, in other words, the rate at which individuals contract the disease. Having access to such knowledge facilitates the use of strategies to stop the spread of a disease in a population. The effect of having different infection processes can be observed by comparing the removal trajectories and the epidemic duration.

The behavior of models $M_{1}$ and $M_{3}$ was investigated in a simulation study using the removal trajectories and the outbreak duration. We simulated 5000 removal trajectories from each SEIR model with $β = 0.5, γ = 0.2$ for the $M_{1}$ model and $β = 3, γ = 0.2, ζ = 0.3$ for the $M_{3}$ model, using a population of size $N = 100$ ($n = 99, k = 1$).

The parameter values were chosen so that the major outbreak components of the final size distributions were comparable, in the sense that they peaked at around the same values for both models, and we set the mean infectious period to be the same in both models ($1 / γ = 5$). The length of the latent period was set at $l = 1$ in both models.

It is clear from Figure 2 that the removal trajectory distributions of the two SEIR models differ in terms of their position and shape. In particular, the $M_{1}$ model peaks more quickly and shows more variability, whereas the $M_{3}$ model tends to be more spread out and has a longer epidemic duration.

Figure 2. The distributions of removal trajectories and epidemic duration are represented by the top and bottom rows, respectively. The distributions of 5000 removal curves that were simulated from each SEIR model are displayed in the first row. The distributions of the epidemic duration ($r_{m}$) from each SEIR model based on the 5000 simulated epidemics are displayed in the second row.

3. Bayes Factors and Marginal Likelihood Estimation for SEIR Epidemic Models Given Complete Data

In this section, assuming complete outbreak data, we shall derive explicit expressions for the Bayes factor across several epidemic scenarios. This assumption could be unrealistic in practice, although it may hold when outbreaks are being closely monitored. For instance, complete data can be gathered in the early phases of a suspected major epidemic, or in experimental settings for animal diseases. The motivations for looking at Bayes factor formulae are that they are of interest in their own right and that they provide us with some insights into how to better comprehend this Bayesian model selection tool. In addition, we can use them to validate Bayes factor estimates obtained from other methods proposed in the literature, such as the arithmetic mean and harmonic mean methods, in order to see how reliable those methods are in situations where complete epidemic data are available but Bayes factors cannot be obtained in closed form.

3.1. Bayes Factors: Theoretical Aspects

The Bayes factor in favor of model $M_{1}$ over model $M_{2}$ can be calculated as follows.

\begin{aligned} BF_{12} & = \frac{π(e, e_{ω}, i, r | M_{1})}{π(e, e_{ω}, i, r | M_{2})} \\ & = \frac{\int_{γ} \int_{β} π(e, e_{ω}, i, r | β, γ) π(β) π(γ) dβ dγ}{\int_{γ} \int_{β} π(e, e_{ω}, i, r | β, γ, ψ) π(β) π(γ) dβ dγ} \end{aligned}

(9)

The epidemic likelihood factorizes into two independent parts (infection and removal) and, as the removal processes are the same for both SEIR models, the removal parts cancel. Therefore, we have the following Bayes factor expression:

\begin{aligned} BF_{12} & = \frac{ν_{β}^{λ_{β}} / Γ(λ_{β})}{ν_{β}^{λ_{β}} / Γ(λ_{β})} \times \frac{\prod_{j = 1, j \neq ω}^{m} n^{-1} I(e_{j}^{-})}{\prod_{j = 1, j \neq ω}^{m} n^{-1} e^{-ψ e_{j}^{-}} I(e_{j}^{-})} \times \frac{\int_{β} β^{m - 1} \times e^{- β n^{-1} ξ} \times β^{λ_{β} - 1} e^{- ν_{β} β} dβ}{\int_{β} β^{m - 1} \times e^{- β n^{-1} ξ_{ψ}} \times β^{λ_{β} - 1} e^{- ν_{β} β} dβ} \\ & = \exp\left(ψ \sum_{j = 1, j \neq ω}^{m} e_{j}^{-}\right) \times \left(\frac{ν_{β} + n^{-1} ξ_{ψ}}{ν_{β} + n^{-1} ξ}\right)^{m - 1 + λ_{β}}, \end{aligned}

(10)

where

ξ = \int_{e_{ω}}^{r_{m}} S(t) I(t) dt; \quad ξ_{ψ} = \int_{e_{ω}}^{r_{m}} e^{-ψ t} S(t) I(t) dt.
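The step from the integral ratio to the closed form in (10) rests on the standard gamma integral. As a brief worked sketch (the shorthand $a$ is introduced here for illustration only and stands for either $n^{-1} ξ$ or $n^{-1} ξ_{ψ}$):

```latex
\int_{0}^{\infty} β^{m - 1} e^{- β a} \, β^{λ_{β} - 1} e^{- ν_{β} β} \, dβ
  = \int_{0}^{\infty} β^{(m - 1 + λ_{β}) - 1} e^{- (ν_{β} + a) β} \, dβ
  = \frac{Γ(m - 1 + λ_{β})}{(ν_{β} + a)^{m - 1 + λ_{β}}} .
```

Taking the ratio with $a = n^{-1} ξ$ in the numerator and $a = n^{-1} ξ_{ψ}$ in the denominator cancels the gamma functions and leaves the power term, while the ratio of the two products contributes the factor $\exp(ψ \sum_{j \neq ω} e_{j}^{-})$.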

Now, we have $ξ_{ψ} < ξ$ as $0 < e^{-ψ t} < 1$, therefore $BF_{12}$ has an upper bound, that is

BF_{12} < \exp\left(ψ \sum_{j = 1, j \neq ω}^{m} e_{j}^{-}\right).

In addition, to see the impact of the infection rate prior distribution on the behavior of $BF_{12}$, we rewrite $BF_{12}$ in terms of the mean and variance of the prior distribution, that is,

E[Γ(λ_{β}, ν_{β})] = λ_{β} / ν_{β} = μ \quad and \quad Var[Γ(λ_{β}, ν_{β})] = λ_{β} / ν_{β}^{2} = σ^{2},

which implies $λ_{β} = μ^{2} / σ^{2}$ and $ν_{β} = μ / σ^{2}$. Substituting these values into (10) yields

BF_{12} = \exp\left(ψ \sum_{j = 1, j \neq ω}^{m} e_{j}^{-}\right) \times \left(\frac{\frac{μ}{σ^{2}} + \frac{ξ_{ψ}}{n}}{\frac{μ}{σ^{2}} + \frac{ξ}{n}}\right)^{m - 1 + \frac{μ^{2}}{σ^{2}}}.

(11)

Consequently, as $σ^{2} \to \infty$ or $σ^{2} \to 0$, we have the following limits for the Bayes factor $BF_{12}$, namely

BF_{12} \to (ξ_{ψ} / ξ)^{m - 1} \times \exp\left(ψ \sum_{j = 1, j \neq ω}^{m} e_{j}^{-}\right), \quad as \ σ^{2} \to \infty,

(12)

and

BF_{12} \to \exp\left(ψ \sum_{j = 1, j \neq ω}^{m} e_{j}^{-}\right), \quad as \ σ^{2} \to 0.

(13)

It can be noticed that as the prior becomes more and more concentrated at $μ$ ($σ^{2} \to 0$), the Bayes factor becomes more decisive in supporting $M_{1}$.

Similarly, the Bayes factor in favor of model $M_{2}$ over model $M_{3}$ can be obtained as follows.

\begin{aligned} BF_{23} & = \frac{π(e, e_{ω}, i, r | M_{2})}{π(e, e_{ω}, i, r | M_{3})} \\ & = \frac{ν_{β}^{λ_{β}} / Γ(λ_{β})}{ν_{β}^{λ_{β}} / Γ(λ_{β})} \times \frac{\prod_{j = 1, j \neq ω}^{m} n^{-1} e^{-ψ e_{j}^{-}} I(e_{j}^{-})}{\prod_{j = 1, j \neq ω}^{m} n^{-1} I^{ζ}(e_{j}^{-})} \times \frac{\int_{β} β^{m - 1} \times e^{- β n^{-1} ξ_{ψ}} \times β^{λ_{β} - 1} e^{- ν_{β} β} dβ}{\int_{β} β^{m - 1} \times e^{- β n^{-1} ξ_{ζ}} \times β^{λ_{β} - 1} e^{- ν_{β} β} dβ} \\ & = \exp\left(- ψ \sum_{j = 1, j \neq ω}^{m} e_{j}^{-}\right) \times \left(\prod_{j = 1, j \neq ω}^{m} I^{1 - ζ}(e_{j}^{-})\right) \times \left(\frac{ν_{β} + n^{-1} ξ_{ζ}}{ν_{β} + n^{-1} ξ_{ψ}}\right)^{m - 1 + λ_{β}}, \end{aligned}

(14)

where

ξ_{ζ} = \int_{e_{ω}}^{r_{m}} S(t) I^{ζ}(t) dt.

By rewriting $BF_{23}$ in terms of the mean and variance of the prior distribution, we have

BF_{23} = \exp\left(- ψ \sum_{j = 1, j \neq ω}^{m} e_{j}^{-}\right) \times \left(\prod_{j = 1, j \neq ω}^{m} I^{1 - ζ}(e_{j}^{-})\right) \times \left(\frac{\frac{μ}{σ^{2}} + \frac{ξ_{ζ}}{n}}{\frac{μ}{σ^{2}} + \frac{ξ_{ψ}}{n}}\right)^{m - 1 + \frac{μ^{2}}{σ^{2}}}.

(15)

Consequently, as $σ^{2} \to \infty$ or $σ^{2} \to 0$, we have the following limits for the Bayes factor $BF_{23}$, namely

BF_{23} \to (ξ_{ζ} / ξ_{ψ})^{m - 1} \times \exp\left(- ψ \sum_{j = 1, j \neq ω}^{m} e_{j}^{-}\right) \times \left(\prod_{j = 1, j \neq ω}^{m} I^{1 - ζ}(e_{j}^{-})\right), \quad as \ σ^{2} \to \infty,

(16)

and

BF_{23} \to \exp\left(- ψ \sum_{j = 1, j \neq ω}^{m} e_{j}^{-}\right) \times \left(\prod_{j = 1, j \neq ω}^{m} I^{1 - ζ}(e_{j}^{-})\right), \quad as \ σ^{2} \to 0.

(17)

It is evident that decreasing the prior uncertainty leads to favoring the $M_{2}$ model.

These theoretical aspects and explicit Bayes factor expressions are of particular assistance in gaining some insights into what estimated values of Bayes factors might be expected, specifically under different prior assumptions. For this particular epidemic setting, we may conclude that increasing (decreasing) the prior uncertainty of the model parameters reduces (increases) the Bayes factor value.

As an aside, one should be aware of any potential connections to Lindley’s paradox [37] when employing the Bayes factor criteria to conduct a Bayesian model choice task. However, in this epidemic setting, models are not nested and have the same level of complexity.

Prior Sensitivity Simulation Study

It is well known that Bayes factors can exhibit strong dependence on the within-model prior distributions [38]. In our epidemic setting, we explore this issue in detail, considering what happens to the Bayes factors for an asymmetrical diffuse prior distribution. A typical case in practice is when $λ_{β} = 1$ and $ν_{β} \to 0$ (see, e.g., [27,39,40]).

Figure 3 (left) shows $\log(BF_{12})$ values for a complete epidemic data set generated under model $M_{1}$ in a closed population of size $N = 100$ ($n = 99, k = 1$), 92 of whom were infected, with parameter values $β = 3, γ = 1, l = 1$. The parameter $ψ$ in model $M_{2}$ was set to $0.1$, and then $\log(BF_{12})$ was calculated for various values of $ν_{β}$. It is clear that as the prior becomes more informative (i.e., $ν_{β}$ increases), $\log(BF_{12})$ becomes more supportive of the data-generating model $M_{1}$.

Figure 3. Values of $\log(BF_{12})$ (left) and $\log(BF_{23})$ (right), calculated for various $ν_{β}$ (the rate parameter of the $β$ prior) with $λ_{β} = 1$ fixed. The horizontal dashed lines denote the lower and upper limiting values of $\log(BF_{12})$ and $\log(BF_{23})$ as $ν_{β}$ varies.

Figure 3 (right) shows $\log(BF_{23})$ values for a complete epidemic data set generated under model $M_{3}$ in a closed population of size $N = 100$ ($n = 99, k = 1$), 92 of whom were infected, with parameter values $β = 7.5, γ = 1, l = 1, ζ = 0.3$. Again, we set $ψ = 0.1$ in model $M_{2}$, and then $\log(BF_{23})$ was calculated for various values of $ν_{β}$. It is evident that as $ν_{β}$ increases (the prior becomes more informative), $\log(BF_{23})$ tends to favor the true model $M_{3}$.
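A prior sensitivity scan of this kind can be reproduced directly from the closed form (10). The sketch below is illustrative only: the summary statistics in `stats` are hypothetical placeholder values, not the data behind Figure 3, and the function name `log_bf12` is an assumption.

```python
import math


def log_bf12(sum_exposure_times, xi, xi_psi, m, n, psi, lam, nu):
    """Logarithm of the closed-form Bayes factor in Eq. (10)."""
    power_term = (m - 1 + lam) * (
        math.log(nu + xi_psi / n) - math.log(nu + xi / n)
    )
    return psi * sum_exposure_times + power_term


# Hypothetical summary statistics (NOT taken from the paper's data sets).
stats = dict(sum_exposure_times=250.0, xi=3000.0, xi_psi=2000.0,
             m=92, n=99, psi=0.1)

# Scan the prior rate nu_beta with the shape lambda_beta = 1 held fixed.
for nu in (0.001, 0.01, 0.1, 1.0, 10.0, 100.0):
    value = log_bf12(lam=1.0, nu=nu, **stats)
    print(f"nu_beta = {nu:>7}: log BF12 = {value:.3f}")
```

Because $ξ_{ψ} < ξ$, the power term is negative and shrinks toward zero as $ν_{β}$ grows with $λ_{β}$ fixed, so the printed values increase toward the upper bound $ψ \sum_{j \neq ω} e_{j}^{-}$.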

3.2. Computational Methods for Marginal Likelihood Estimation

As introduced earlier, $(e, e_{ω})$, $i$ and $r$ denote the observed exposure, infection and removal times, respectively, with $β$ and $γ$ representing the models' infection and removal rate parameters. Here we discuss three computational methods for estimating the marginal likelihood that have been presented in the model choice literature but have not been applied to model selection for stochastic epidemic models. The rationale is to see how robust and reliable they are when considering dependent data with complex likelihoods, such as those involved in our epidemic model settings.

3.2.1. Arithmetic Mean Estimator (Naive Monte Carlo Estimator)

Dropping the notational dependence on M, the marginal likelihood becomes

π(e, e_{ω}, i, r) = \int_{γ} \int_{β} π(e, e_{ω}, i, r | β, γ) π(β, γ) dβ dγ.

(18)

The arithmetic mean (AM) estimator for $π(e, e_{ω}, i, r)$ can be obtained by a simple Monte Carlo integration method, that is

\hat{π}(e, e_{ω}, i, r) = \frac{1}{K} \sum_{h = 1}^{K} π(e, e_{ω}, i, r | β^{(h)}, γ^{(h)}),

(19)

where $β^{(1)}, \dots, β^{(K)}$ and $γ^{(1)}, \dots, γ^{(K)}$ are samples generated from the priors $π(β)$ and $π(γ)$, respectively.

It is well known [23] that since the likelihood is typically sharply peaked, most of the prior samples have very small likelihood values, in particular when the prior is diffuse. Therefore, unless K is very large, the prior samples will contain virtually no points from the high-likelihood region, leading to poor estimation of the marginal likelihood. However, by the law of large numbers, this estimator converges to the true expectation as the number of independent samples, K, drawn from the prior tends to infinity [41]. Additionally, this estimator can perform well when the dimensionality of the parameter space is modest, so that a very large number of prior samples can be generated quickly.
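The AM estimator is straightforward to code, and working on the log scale avoids numerical underflow when likelihood values are tiny. The sketch below is a minimal illustration under stated assumptions: it uses only the $β$-dependent infection part of the likelihood, $β^{m - 1} e^{- β a}$ with $a$ standing for $n^{-1} ξ$ (constant factors dropped), a $Γ(λ_{β}, ν_{β})$ prior, and small hypothetical values for which the exact answer is available from the gamma integral. All function names are hypothetical.

```python
import math
import random


def log_infection_lik(beta, n_events, a):
    """log of beta**n_events * exp(-beta * a), the beta-dependent likelihood part."""
    return n_events * math.log(beta) - beta * a


def log_am_estimate(log_lik_values):
    """log of the arithmetic mean of likelihood values, via the log-sum-exp trick."""
    top = max(log_lik_values)
    mean_scaled = sum(math.exp(v - top) for v in log_lik_values) / len(log_lik_values)
    return top + math.log(mean_scaled)


def exact_log_marginal(n_events, a, lam, nu):
    """Closed form of the same integral under a Gamma(lam, nu) prior (rate nu)."""
    return (lam * math.log(nu) - math.lgamma(lam)
            + math.lgamma(n_events + lam) - (n_events + lam) * math.log(nu + a))


def am_demo(n_events=3, a=2.0, lam=1.0, nu=1.0, K=200_000, seed=1):
    rng = random.Random(seed)
    # random.gammavariate takes (shape, scale), so the scale is 1 / rate.
    draws = [rng.gammavariate(lam, 1.0 / nu) for _ in range(K)]
    estimate = log_am_estimate([log_infection_lik(b, n_events, a) for b in draws])
    return estimate, exact_log_marginal(n_events, a, lam, nu)


estimate, exact = am_demo()
print(f"AM estimate: {estimate:.4f}, exact: {exact:.4f}")
```

With realistic values of $m$ the likelihood is far more sharply peaked than in this toy setting, so many more prior draws would be needed for comparable accuracy, in line with the caveat above.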

Illustrative simulation study

In what follows, we conduct a simulation study to illustrate the effectiveness of the AM estimator of the marginal likelihood in approximating the true marginal likelihood and hence the Bayes factor. This simulation example was performed under different prior assumptions for each model parameter. As we have a prior of two dimensions in each model, $M_{1}$, $M_{2}$ and $M_{3}$, we expect the arithmetic mean estimates of the marginal likelihoods, namely

\hat{π}(e, e_{ω}, i, r | M_{1}) = \frac{1}{K} \sum_{h = 1}^{K} π(e, e_{ω}, i, r | β^{(h)}, γ^{(h)}),

(20)

\hat{π}(e, e_{ω}, i, r | M_{2}) = \frac{1}{K} \sum_{h = 1}^{K} π(e, e_{ω}, i, r | ψ, β^{(h)}, γ^{(h)}),

(21)

and

\hat{π}(e, e_{ω}, i, r | M_{3}) = \frac{1}{K} \sum_{h = 1}^{K} π(e, e_{ω}, i, r | ζ, β^{(h)}, γ^{(h)}),

(22)

to be effective in estimating and recovering the true value of the Bayes factor. Here, $β^{(h)}$ and $γ^{(h)}$, where $h = 1, 2, \dots, K$, are independent samples drawn from the gamma prior distributions.

We simulated two typical epidemic data sets: one data set was generated under

M_{1}

and the other data set was simulated from

M_{3}

. Each outbreak was in a closed population consisting of 99 initially susceptible individuals and a single initially infected case. The model parameters for the simulation were (

β = 3, γ = 1, l = 1

) for model

M_{1}

and (

β = 7, γ = 1, l = 1, ζ = 0.3

) for model

M_{3}

. The resulting final size was 92 under both models. The parameter

ψ

was set to

0.1

when fitting

M_{2}

.

Under each prior assumption, namely

Γ (1, 1), Γ (1, 0.1)

and

Γ (1, 0.01)

, the true values of the logarithm of Bayes factors were calculated using (10) and (14). The estimated values of the logarithm of Bayes factors, under different prior distributions, were computed as follows. Initially, we simulated K samples from the prior for each parameter and evaluated the marginal likelihoods based on these prior samples using (20)–(22). Then, the logarithm of Bayes factors was estimated using the following equations.

log (\hat{B F_{12}}) = log [\hat{π (e, e_{ω}, i, r | M_{1})}] - log [\hat{π (e, e_{ω}, i, r | M_{2})}],

(23)

log (\hat{B F_{23}}) = log [\hat{π (e, e_{ω}, i, r | M_{2})}] - log [\hat{π (e, e_{ω}, i, r | M_{3})}] .

(24)
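Equations (23) and (24) are differences of log marginal likelihood estimates, so they can be evaluated entirely on the log scale. The sketch below assumes that the log-likelihood of each model has already been evaluated at its K prior draws; the function names and the numerical values are hypothetical and serve only to show the arithmetic.

```python
import math
import numpy as np

def log_mean_exp(log_values):
    """Return log(mean(exp(log_values))) without underflow."""
    log_values = np.asarray(log_values, dtype=float)
    peak = log_values.max()
    return peak + math.log(np.mean(np.exp(log_values - peak)))

def log_bayes_factor(loglik_a, loglik_b):
    """log BF_ab: log AM evidence of model a minus log AM evidence of model b."""
    return log_mean_exp(loglik_a) - log_mean_exp(loglik_b)

# Hypothetical log-likelihood values at prior draws; exp() of these underflows to 0.0,
# so averaging the raw likelihoods directly would fail.
loglik_m1 = np.array([-1200.0, -1201.0, -1250.0])
loglik_m2 = np.array([-1203.0, -1204.0, -1253.0])
log_bf_12 = log_bayes_factor(loglik_m1, loglik_m2)
print(log_bf_12)
```

Subtracting the largest log-likelihood before exponentiating leaves the result unchanged mathematically but keeps every intermediate value representable.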

The results are displayed in Table 1. Provided that K is large enough, increasing the prior uncertainty of the model parameters has little effect on the precision of the Bayes factor estimates obtained with the AM method, because the prior in our case is low-dimensional and therefore fast to sample from; with the smallest sample size, however, the estimates under the most diffuse prior are noticeably less accurate. These results suggest that a diffuse prior distribution, which expresses neutrality and a lack of previous knowledge, can be used in the present application. A more informative prior (e.g.,

Γ (1, 1)

) also has the advantage of estimating

log (B F_{12})

and

log (B F_{23})

more accurately. These results indicate that the arithmetic mean estimator of the marginal likelihood is a simple but effective way of approximating Bayes factors when complete epidemic data are available but the Bayes factors cannot be obtained in closed form, for instance, when comparing these SEIR models with unknown

ψ

or when comparing SEIR models with different infection mechanisms and an unknown power parameter

ζ

.

Table 1. The logarithm of Bayes factor results obtained using two data sets simulated based on models

M_{1}

and

M_{3}

. For each case, model parameters were assigned prior distributions to varying degrees of diffuseness to determine the effect on the logarithm of Bayes factor estimation. Three sizes of prior parameter sample (

K = 10^{4}, K = 10^{5}

and

K = 10^{6}

) were used to determine the effective sample size required to reasonably estimate the logarithm of Bayes factor.

3.2.2. Harmonic Mean Method

Estimating the marginal likelihood by the harmonic mean approach has become popular owing to its simplicity. The harmonic mean (HM) estimator was first proposed in [42], where it was shown that the marginal likelihood can be estimated by the harmonic mean of the likelihood values evaluated at posterior samples. Under the assumption that the prior distribution of the parameters is proper, we have

1 = \int_{γ} \int_{β} π (β, γ) d β d γ = π (e, e_{ω}, i, r) \int_{γ} \int_{β} \frac{1}{π (e, e_{ω}, i, r | β, γ)} π (β, γ | e, e_{ω}, i, r) d β d γ .

Therefore,

\frac{1}{π (e, e_{ω}, i, r)} = \int_{γ} \int_{β} \frac{1}{π (e, e_{ω}, i, r | β, γ)} π (β, γ | e, e_{ω}, i, r) d β d γ = E_{(β, γ | e, e_{ω}, i, r)} (\frac{1}{π (e, e_{ω}, i, r | β, γ)}),

where

E_{(β, γ | e, e_{ω}, i, r)}

denotes the expected value with respect to the posterior distribution of the model parameters.

The harmonic mean estimator is given by

\hat{π (e, e_{ω}, i, r)} = {(\frac{1}{K} \sum_{h = 1}^{K} \frac{1}{π (e, e_{ω}, i, r | β^{(h)}, γ^{(h)})})}^{- 1},

(25)

where

{β^{(1)}, \dots, β^{(K)}}

and

{γ^{(1)}, \dots, γ^{(K)}}

are sets of MCMC draws from the posterior distribution.

The main advantage of the HM approach over more specialized techniques is its simplicity: it is easy to implement and fast. It uses only within-model posterior samples and likelihood evaluations, which are usually available anyway as a by-product of posterior sampling. The estimator is consistent, but its variance can be large, because a posterior sample with a very low likelihood occasionally enters the sum and, through the reciprocal in (25), dominates the result. The HM estimator is often biased and tends to overestimate the true marginal likelihood [43]. It is included here because of its simplicity and because it has not previously been applied as a model selection tool in stochastic epidemic modeling.
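A minimal sketch of the estimator in (25) is given below, again on a toy conjugate model (exponential observations, gamma prior on the rate) rather than the SEIR likelihood. Exact draws from the toy posterior stand in for MCMC output, and all names, data and seeds are assumptions made for illustration. For this toy model the reciprocal likelihood has infinite posterior variance, so the estimate may differ noticeably from the exact value and change from seed to seed, which mirrors the instability described above.

```python
import math
import numpy as np

def hm_log_evidence(log_likelihoods):
    """Harmonic mean estimate of the log marginal likelihood, in the spirit of (25)."""
    neg = -np.asarray(log_likelihoods, dtype=float)
    peak = neg.max()
    # log of the mean reciprocal likelihood, computed stably on the log scale
    log_mean_reciprocal = peak + math.log(np.mean(np.exp(neg - peak)))
    return -log_mean_reciprocal

data = np.array([0.5, 1.2, 0.3, 0.8, 1.0])  # hypothetical observations
n, s = len(data), float(data.sum())
shape, rate = 1.0, 1.0                      # gamma prior, rate form assumed

rng = np.random.default_rng(2)
# Toy posterior is Gamma(shape + n, rate + s); NumPy expects scale = 1 / rate.
theta = rng.gamma(shape + n, 1.0 / (rate + s), size=50_000)
loglik = n * np.log(theta) - theta * s

hm_estimate = hm_log_evidence(loglik)
exact_value = (shape * math.log(rate) - math.lgamma(shape)
               + math.lgamma(shape + n) - (shape + n) * math.log(rate + s))
print(hm_estimate, exact_value)
```

Because a few very small likelihood values dominate the sum of reciprocals, repeating the run with different seeds is a simple way to see how unstable the estimate is.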

3.2.3. Power Posterior Method

An approach for estimating the marginal likelihood based on the idea of thermodynamic integration [31] was developed in [32]. The authors introduced an auxiliary variable (temperature parameter)

p \in [0, 1]

and defined the power posterior (PP) as

π_{p} (β, γ | e, e_{ω}, i, r) \propto π {(e, e_{ω}, i, r | β, γ)}^{p} π (β, γ),

(26)

where by construction the normalizing constant of the power posterior is

z_{p} (e, e_{ω}, i, r) = \int_{γ} \int_{β} π {(e, e_{ω}, i, r | β, γ)}^{p} π (β, γ) d β d γ .

(27)

Clearly, as p moves from

p = 0

to

p = 1

, the power posterior follows a path from the prior to the posterior density, where

z_{p = 0} (e, e_{ω}, i, r) = 1

is the integral of the proper prior of

(β, γ)

and

z_{p = 1} (e, e_{ω}, i, r)

is the marginal likelihood. It can be shown that

log [π (e, e_{ω}, i, r)] = log [\frac{z_{p = 1} (e, e_{ω}, i, r)}{z_{p = 0} (e, e_{ω}, i, r)}] = \int_{0}^{1} E_{β, γ | e, e_{ω}, i, r, p} log [π (e, e_{ω}, i, r | β, γ)] d p .

(28)
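The step behind (28) can be made explicit by differentiating the logarithm of the normalizing constant in (27) with respect to p. The sketch below assumes that differentiation under the integral sign is permitted and writes z_{p} for z_{p} (e, e_{ω}, i, r).

```latex
\frac{d}{dp} \log z_{p}
  = \frac{1}{z_{p}} \int_{\gamma} \int_{\beta}
      \pi(e, e_{\omega}, i, r \mid \beta, \gamma)^{p}
      \log\left[\pi(e, e_{\omega}, i, r \mid \beta, \gamma)\right]
      \pi(\beta, \gamma) \, d\beta \, d\gamma
  = E_{\beta, \gamma \mid e, e_{\omega}, i, r, p}
      \log\left[\pi(e, e_{\omega}, i, r \mid \beta, \gamma)\right],
\qquad
\log z_{1} - \log z_{0} = \int_{0}^{1} \frac{d}{dp} \log z_{p} \, dp .
```

Since the prior is proper, z_{0} = 1 and log z_{0} = 0, so the left-hand side of the second identity is the log marginal likelihood.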

In [44], the approach was refined so that the power posterior estimate of

log [π (e, e_{ω}, i, r)]

becomes

log [π (e, e_{ω}, i, r)] \approx \sum_{j = 1}^{r} \frac{1}{2} (p_{j} - p_{j - 1}) \times [Δ_{p_{j}} + Δ_{p_{j - 1}}] - \sum_{j = 1}^{r} \frac{1}{12} {(p_{j} - p_{j - 1})}^{2} \times [Φ_{p_{j}} - Φ_{p_{j - 1}}],

(29)

where

Δ_{q} = E_{β, γ | e, e_{ω}, i, r, q} log [π (e, e_{ω}, i, r | β, γ)]; q = p_{j}, p_{j - 1},

and

Φ_{q} = V_{β, γ | e, e_{ω}, i, r, q} log [π (e, e_{ω}, i, r | β, γ)]; q = p_{j}, p_{j - 1},

are the expectation and the variance of

log [π (e, e_{ω}, i, r | β, γ)]

at q, respectively; and r is the number of points in the interval

(0, 1]

in the temperature schedule.
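The corrected trapezium rule in (29) can be sketched as follows. The example is illustrative only: it uses a toy conjugate model (exponential observations, gamma prior on the rate) whose power posterior is again a gamma distribution, so direct draws replace the MCMC run needed at each temperature. The temperature schedule, sample sizes and function names are assumptions made for this sketch.

```python
import math
import numpy as np

def pp_log_evidence(temps, means, variances):
    """Corrected trapezium rule of (29), from the mean and variance of the
    log-likelihood at each temperature."""
    temps, means, variances = map(np.asarray, (temps, means, variances))
    widths = np.diff(temps)
    trapezium = 0.5 * widths * (means[1:] + means[:-1])
    correction = (widths ** 2 / 12.0) * (variances[1:] - variances[:-1])
    return float(np.sum(trapezium) - np.sum(correction))

data = np.array([0.5, 1.2, 0.3, 0.8, 1.0])  # hypothetical observations
n, s = len(data), float(data.sum())
shape, rate = 1.0, 1.0                      # gamma prior, rate form assumed

rng = np.random.default_rng(3)
temps = np.linspace(0.0, 1.0, 21) ** 5      # assumed schedule, dense near p = 0

means, variances = [], []
for p in temps:
    # Toy power posterior: Gamma(shape + p * n, rate + p * s); scale = 1 / rate.
    theta = rng.gamma(shape + p * n, 1.0 / (rate + p * s), size=20_000)
    loglik = n * np.log(theta) - theta * s
    means.append(loglik.mean())
    variances.append(loglik.var())

pp_estimate = pp_log_evidence(temps, means, variances)
exact_value = (shape * math.log(rate) - math.lgamma(shape)
               + math.lgamma(shape + n) - (shape + n) * math.log(rate + s))
print(pp_estimate, exact_value)
```

Placing more temperatures near p = 0 is a common design choice because the expected log-likelihood usually changes fastest close to the prior.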

For model

M_{2}

(models

M_{1}

and

M_{3}

are treated similarly), at temperature p, the power posterior densities for

β

and

γ

can be obtained using (26), which gives

π_{p} (β | e, e_{ω}, i, r) \equiv Γ (λ_{β} + p (m - 1), ν_{β} + (p / n) ξ_{ψ}),

(30)

π_{p} (γ | e, e_{ω}, i, r) \equiv Γ (λ_{γ} + p m, ν_{γ} + p \sum_{j = 1}^{m} (r_{j} - i_{j})) .

(31)

More details about this method when applied to stochastic epidemic outbreak data can be found in [10,27], where the performance of this approach in estimating Bayes factors was investigated and assessed.

4. Extensive Simulation Study

In this section, we present an extensive simulation study for the three considered approaches in which the true values of the marginal likelihoods and hence Bayes factors are known (i.e., can be calculated analytically). Throughout this section, we assumed a fixed latent period of

l = 1

, and we chose

γ = 1

. Two simulation scenarios were then performed, each based on 120 epidemic outbreak data sets. These data sets were simulated using various population sizes (

N = 10, 50, 100

) and fitted under two asymmetric prior assumptions, namely

Γ (1, 1)

and

Γ (1, 0.01)

.

To make the approaches (AM, HM and PP) for computing the evidence as comparable as possible, we tried to make the algorithms equivalent in terms of the number of iterations. Therefore, the results were based on a run of 100,000 iterations after a burn-in period of 5000 iterations, with the chain thinned by keeping every 5th value. Table 2 summarizes the two simulated scenarios.
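The post-processing described above can be sketched as follows. The chain here is a placeholder array of random numbers, not real MCMC output, and the sketch assumes that the 100,000 iterations are counted after the burn-in period; the names are hypothetical.

```python
import numpy as np

BURN_IN = 5_000       # iterations discarded at the start
RUN_LENGTH = 100_000  # iterations kept before thinning (assumed to exclude burn-in)
THIN = 5              # keep every 5th value

def post_process(chain, burn_in=BURN_IN, thin=THIN):
    """Drop the burn-in draws, then keep every `thin`-th remaining draw."""
    return np.asarray(chain)[burn_in::thin]

# Placeholder chain with one row per iteration and columns (beta, gamma)
rng = np.random.default_rng(0)
raw_chain = rng.normal(size=(BURN_IN + RUN_LENGTH, 2))
kept = post_process(raw_chain)
print(kept.shape)
```

Under these assumptions, the thinned chain retains 20,000 draws per parameter.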

Table 2. Details of the extensive simulation study in which each simulation scenario consists of 120 outbreak data sets generated under various population sizes and fitted under different prior assumptions.

4.1. Scenario I

In this simulated scenario, the data-generating model was

M_{1}

. For each population size,

N = 10, 50

and 100, 20 outbreak data sets were generated. Then, under the two prior assumptions, namely

Γ (1, 1)

and

Γ (1, 0.01)

, the true values of the

log B F_{12}

were calculated using (10) and the

log (\hat{B F_{12}})

values were computed using the AM, HM and PP methods. Their means over the 20 data sets are presented in Table 3, where each row gives results from 20 simulated epidemics whose final sizes are matched to m.

Table 3. Results of the true mean values of

log B F_{12}

and the estimated mean of

log (\hat{B F_{12}})

computed using AM, HM and PP methods using different population sizes and two prior distribution settings. Each row gives results from 20 simulated epidemics in which the final sizes are matched to m.

Surprisingly, in this experiment, the AM approach performed very well. This might be a result of having low-dimensional parameter space in our setting. The poorest performance was from the HM estimator. The PP method proved its robustness in estimating Bayes factors.
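One simple way to summarize this comparison is the mean absolute deviation of each method's estimates from the true values. The sketch below copies the six rows of Table 3; the choice of summary statistic and the variable names are assumptions made for illustration, not part of the original analysis.

```python
import numpy as np

# Mean log BF_12 values copied from Table 3, rows in table order
true_vals = np.array([0.1363, -0.4487, 2.2709, 0.8469, 1.8638, 0.8393])
estimates = {
    "AM": np.array([0.1373, -0.4738, 2.2569, 0.6860, 1.8609, 0.8841]),
    "HM": np.array([0.1172, -0.3657, 1.4543, 1.5133, 0.5455, 1.4024]),
    "PP": np.array([0.1367, -0.4499, 2.2715, 0.8486, 1.8627, 0.8391]),
}

# Mean absolute deviation from the true value, per method
mean_abs_error = {
    method: float(np.mean(np.abs(values - true_vals)))
    for method, values in estimates.items()
}
ranking = sorted(mean_abs_error, key=mean_abs_error.get)  # most accurate first
print(mean_abs_error, ranking)
```

On this summary, the PP estimates are closest to the true values, followed by AM and then HM, which agrees with the qualitative assessment above.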

The mean values of the

log B F_{12}

and the means of

log (\hat{B F_{12}})

indicated that, as the prior becomes more diffuse, the true values of the Bayes factors and their AM and PP estimates decrease, whereas the HM estimates do not follow this pattern consistently. In addition, the values of the Bayes factor were sensitive to the epidemic data but showed no systematic dependence on the epidemic size. It is also worth noting that, for small outbreaks, a vague prior distribution is not recommended because it can impair this diagnostic tool for model choice, as seen in Table 3 (the row for N = 10 under the most diffuse prior).

4.2. Scenario II

In the second scenario, the epidemic data were simulated from model

M_{3}

. For each population size, namely

N = 10, 50

and 100, we generated 20 outbreak data sets. Then, under the two prior assumptions, namely

Γ (1, 1)

and

Γ (1, 0.01)

, the true values of the

log B F_{23}

were calculated using (14) and their mean is shown in Table 4. The

log (\hat{B F_{23}})

values were estimated using AM, HM and PP methods, and their means over the 20 data sets are presented in Table 4, where we matched the final size of the 20 outbreak data sets to m.

Table 4. Results of the true mean values of

log B F_{23}

and the estimated mean of

log (\hat{B F_{23}})

computed using AM, HM and PP methods using different population sizes and two prior distribution settings. Each row gives results from 20 simulated epidemics from the true model in which the final sizes are matched to m.

The AM approach performed very well in this simulated example as a result of the relatively low number of parameters. The HM estimator was poor in recovering the true mean values of the

log B F_{23}

. The estimates produced by the PP method had a high degree of accuracy.

Again, mean values of the

log B F_{23}

and the mean of

log (\hat{B F_{23}})

showed that, for the two larger population sizes, the true values of the Bayes factors and their AM and PP estimates decrease as the prior becomes more diffuse. Moreover, unlike in the first scenario, the mean values of

log B F_{23}

were impacted by epidemic sizes. In other words, as the final epidemic size increases, the evidence in support of the true model becomes clearer.

5. Concluding Remarks

In this paper, we have demonstrated that Bayes factors can be obtained analytically when complete epidemic data are available, under various asymmetrical dynamics of infection transmission. In addition, three computational methods for estimating the marginal likelihood and hence the Bayes factor have been discussed: a Monte Carlo estimate based on samples drawn from the prior distribution, known as the arithmetic mean estimate; a Monte Carlo estimate based on MCMC samples drawn from the posterior distribution, known as the harmonic mean estimate; and a discretized numerical approximation of thermodynamic integration, known as the power posterior estimate. Various asymmetric prior distribution assumptions have been used to assess their effects on the values of the Bayes factors.

The Bayes factor expressions derived from complete outbreak data were particularly helpful in indicating what estimated values of the Bayes factors might be expected, specifically under different prior assumptions. The results also indicate that the arithmetic mean (AM) estimator of the marginal likelihood is a simple but effective way of approximating Bayes factors when complete epidemic data are available but closed forms of the Bayes factors cannot be obtained.

The results presented in this research should be interpreted as a broad indicator of the level of accuracy of these approaches. There are various ways in which some of the algorithms could be optimized, such as tuning the temperature schedule in the power posterior method and considering adjusted versions of the harmonic mean approach. Although the AM method clearly performed well here, it remains to be seen how well it performs with more complicated models and different epidemic data. In the author’s opinion, the PP method showed the most promise in terms of accuracy and generality.

The evidence is typically a challenging quantity to estimate, especially if the prior distribution is dispersed, so it is not surprising that estimating it may require considerable effort in terms of computation time and coding. As the harmonic mean estimator shows, the simplest approach may not yield the most precise results.

For further work, it would be interesting to assess the efficacy of the methods for estimating the marginal likelihood and Bayes factors in other epidemic settings, such as epidemic models with different removal process assumptions. Extending this work to incomplete epidemic data is a natural next step, which we leave for future work. Comparing these results with other model selection criteria, such as DIC, would be another area of investigation.

Funding

The author would like to thank the Deanship of Scientific Research at Taif University for supporting this research work.

Data Availability Statement

Simulated data were used to obtain our results.

Conflicts of Interest

The author declares no conflict of interest.

References

1. Adrakey, H.K.; Streftaris, G.; Cunniffe, N.J.; Gottwald, T.R.; Gilligan, C.A.; Gibson, G.J. Evidence-based controls for epidemics using spatio-temporal stochastic models in a Bayesian framework. J. R. Soc. Interface 2017, 14, 20170386.
2. Butt, A.I.K.; Imran, M.; Batool, S.; Nuwairan, M.A. Theoretical analysis of a COVID-19 CF-fractional model to optimally control the spread of pandemic. Symmetry 2023, 15, 380.
3. Khajji, B.; Boujallal, L.; Balatif, O.; Rachik, M. Mathematical Modelling and Optimal Control Strategies of a Multistrain COVID-19 Spread. J. Appl. Math. 2022, 2022, 9071890.
4. Huisman, J.S.; Scire, J.; Angst, D.C.; Li, J.; Neher, R.A.; Maathuis, M.H.; Bonhoeffer, S.; Stadler, T. Estimation and worldwide monitoring of the effective reproductive number of SARS-CoV-2. eLife 2022, 11, e71345.
5. Locatelli, I.; Trächsel, B.; Rousson, V. Estimating the basic reproduction number for COVID-19 in Western Europe. PLoS ONE 2021, 16, e0248731.
6. Aleta, A.; Martin-Corral, D.; Pastore y Piontti, A.; Ajelli, M.; Litvinova, M.; Chinazzi, M.; Dean, N.E.; Halloran, M.E.; Longini, I.M., Jr.; Merler, S.; et al. Modelling the impact of testing, contact tracing and household quarantine on second waves of COVID-19. Nat. Hum. Behav. 2020, 4, 964–971.
7. Kucharski, A.J.; Klepac, P.; Conlan, A.J.; Kissler, S.M.; Tang, M.L.; Fry, H.; Gog, J.R.; Edmunds, W.J.; Emery, J.C.; Medley, G.; et al. Effectiveness of isolation, testing, contact tracing, and physical distancing on reducing transmission of SARS-CoV-2 in different settings: A mathematical modelling study. Lancet Infect. Dis. 2020, 20, 1151–1160.
8. Ferguson, N.M.; Laydon, D.; Nedjati-Gilani, G.; Imai, N.; Ainslie, K.; Baguelin, M.; Bhatia, S.; Boonyasiri, A.; Cucunubá, Z.; Cuomo-Dannenburg, G.; et al. Impact of Non-Pharmaceutical Interventions (NPIs) to Reduce COVID-19 Mortality and Healthcare Demand; Imperial College COVID-19 Response Team: London, UK, 2020.
9. O’Neill, P.D. Introduction and snapshot review: Relating infectious disease transmission models to data. Stat. Med. 2010, 29, 2069–2077.
10. Alharthi, M. Bayesian Model Assessment for Stochastic Epidemic Models. Ph.D. Thesis, University of Nottingham, Nottingham, UK, 2016.
11. Gibson, G.J.; Streftaris, G.; Thong, D. Comparison and assessment of epidemic models. Stat. Sci. 2018, 33, 19–33.
12. O’Neill, P.D.; Becker, N.G. Inference for an epidemic when susceptibility varies. Biostatistics 2001, 2, 99–108.
13. Boys, R.J.; Giles, P.R. Bayesian inference for stochastic epidemic models with time-inhomogeneous removal rates. J. Math. Biol. 2007, 55, 223–247.
14. Alharthi, M. Model discrimination for epidemiological SEIR-type models with different transmission mechanisms. JP J. Biostat. 2022, 20, 27–50.
15. Severo, N.C. Generalizations of some stochastic epidemic models. Math. Biosci. 1969, 4, 395–402.
16. Liu, W.M.; Hethcote, H.W.; Levin, S.A. Dynamical behavior of epidemiological models with nonlinear incidence rates. J. Math. Biol. 1987, 25, 359–380.
17. O’Neill, P.; Wen, C. Modelling and inference for epidemic models featuring non-linear infection pressure. Math. Biosci. 2012, 238, 38–48.
18. Roberto Telles, C.; Lopes, H.; Franco, D. SARS-COV-2: SIR model limitations and predictive constraints. Symmetry 2021, 13, 676.
19. Aristotelous, G. Topics in Bayesian Inference and Model Assessment for Partially Observed Stochastic Epidemic Models. Ph.D. Thesis, University of Nottingham, Nottingham, UK, 2020.
20. Britton, T.; Kypraios, T.; O’Neill, P.D. Inference for epidemics with three levels of mixing: Methodology and application to a measles outbreak. Scand. J. Stat. 2011, 38, 578–599.
21. Andersson, H.; Britton, T. Stochastic Epidemic Models and Their Statistical Analysis; Springer: New York, NY, USA, 2000; Volume 4.
22. Aitkin, M. Posterior Bayes factors. J. R. Stat. Soc. Ser. B Methodol. 1991, 53, 111–128.
23. Kass, R.E.; Raftery, A.E. Bayes factors. J. Am. Stat. Assoc. 1995, 90, 773–795.
24. Neal, P.J.; Roberts, G.O. Statistical inference and model selection for the 1861 Hagelloch measles epidemic. Biostatistics 2004, 5, 249–261.
25. O’Neill, P.D.; Marks, P.J. Bayesian model choice and infection route modelling in an outbreak of Norovirus. Stat. Med. 2005, 24, 2011–2024.
26. Knock, E.S.; O’Neill, P.D. Bayesian model choice for epidemic models with two levels of mixing. Biostatistics 2014, 15, 46–59.
27. Alharthi, M.; Kypraios, T.; O’Neill, P.D. Bayes factors for partially observed stochastic epidemic models. Bayesian Anal. 2019, 14, 907–936.
28. Worby, C.J. Statistical Inference and Modelling for Nosocomial Infections and the Incorporation of Whole Genome Sequence Data. Ph.D. Thesis, University of Nottingham, Nottingham, UK, 2013.
29. Zhang, L. Time-Varying Individual-Level Infectious Disease Models. Ph.D. Thesis, University of Guelph, Guelph, ON, Canada, 2014.
30. Touloupou, P.; Alzahrani, N.; Neal, P.; Spencer, S.E.; McKinley, T.J. Efficient model comparison techniques for models requiring large scale data augmentation. Bayesian Anal. 2018, 13, 437–459.
31. Gelman, A.; Meng, X.L. Simulating normalizing constants: From importance sampling to bridge sampling to path sampling. Stat. Sci. 1998, 13, 163–185.
32. Friel, N.; Pettitt, A.N. Marginal likelihood estimation via power posteriors. J. R. Stat. Soc. Ser. B Stat. Methodol. 2008, 70, 589–607.
33. O’Neill, P.D.; Roberts, G.O. Bayesian inference for partially observed stochastic epidemics. J. R. Stat. Soc. Ser. A Stat. Soc. 1999, 162, 121–129.
34. Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B Methodol. 1977, 39, 1–38.
35. Gelfand, A.E.; Smith, A.F. Sampling-based approaches to calculating marginal densities. J. Am. Stat. Assoc. 1990, 85, 398–409.
36. Kypraios, T. Efficient Bayesian Inference for Partially Observed Stochastic Epidemics and a New Class of Semi-Parametric Time Series Models. Ph.D. Thesis, Lancaster University, Lancaster, UK, 2007.
37. Lindley, D.V. A statistical paradox. Biometrika 1957, 44, 187–192.
38. Robert, C.P. The Bayesian Choice: From Decision-Theoretic Foundations to Computational Implementation; Springer: New York, NY, USA, 2007; Volume 2.
39. Clancy, D.; O’Neill, P.D. Bayesian estimation of the basic reproduction number in stochastic epidemic models. Bayesian Anal. 2008, 3, 737–757.
40. Alharthi, M.F. The Basic Reproduction Number for the Markovian SIR-Type Epidemic Models: Comparison and Consistency. J. Math. 2022, 2022, 1925202.
41. Robert, C.P.; Casella, G. Monte Carlo Statistical Methods; Springer: New York, NY, USA, 2004; Volume 3.
42. Newton, M.A.; Raftery, A.E. Approximate Bayesian inference with the weighted likelihood bootstrap. J. R. Stat. Soc. Ser. B Methodol. 1994, 56, 3–48.
43. Lartillot, N.; Philippe, H. Computing Bayes factors using thermodynamic integration. Syst. Biol. 2006, 55, 195–207.
44. Friel, N.; Hurn, M.; Wyse, J. Improving power posterior estimation of statistical evidence. Stat. Comput. 2014, 24, 709–723.

Figure 1. The Susceptible (S), Exposed (E), Infectious (I) and Removed (R) (SEIR) model.

Figure 2. The distributions of removal trajectories and epidemic duration are represented by the top and bottom rows, respectively. The distributions of 5000 removal curves that were simulated from each SEIR model are displayed in the first row. The distributions of the epidemic duration (

r_{m}

) from each SEIR model based on the 5000 simulated epidemics are displayed in the second row.

Figure 3. Values of

log (B F_{12})

(left) and

log (B F_{23})

(right), calculated for various

ν_{β}

(the

β

prior scale parameter) with

λ_{β} = 1

fixed. The horizontal dashed lines denote the lower and upper limiting values of

log (B F_{12})

and

log (B F_{23})

as

ν_{β}

varies.

Table 1. The logarithm of Bayes factor results obtained using two data sets simulated based on models

M_{1}

and

M_{3}

. For each case, model parameters were assigned prior distributions to varying degrees of diffuseness to determine the effect on the logarithm of Bayes factor estimation. Three sizes of prior parameter sample (

K = 10^{4}, K = 10^{5}

and

K = 10^{6}

) were used to determine the effective sample size required to reasonably estimate the logarithm of Bayes factor.

Model	Prior	True $\log ({BF}_{12})$	$\log (\hat{{BF}_{12}})$, $K = 10^{4}$	$\log (\hat{{BF}_{12}})$, $K = 10^{5}$	$\log (\hat{{BF}_{12}})$, $K = 10^{6}$
$M_{1}$	$Γ (1, 1)$	$3.894706$	$3.995201$	$3.835836$	$3.899106$
$M_{1}$	$Γ (1, 0.1)$	$1.811504$	$2.035148$	$1.817458$	$1.815973$
$M_{1}$	$Γ (1, 0.01)$	$1.593932$	$0.9698732$	$1.859154$	$1.6351788$
Model	Prior	True $\log ({BF}_{23})$	$\log (\hat{{BF}_{23}})$, $K = 10^{4}$	$\log (\hat{{BF}_{23}})$, $K = 10^{5}$	$\log (\hat{{BF}_{23}})$, $K = 10^{6}$
$M_{3}$	$Γ (1, 1)$	$- 28.08603$	$- 28.59146$	$- 28.3244$	$- 28.04024$
$M_{3}$	$Γ (1, 0.1)$	$- 30.5115$	$- 31.30335$	$- 30.35625$	$- 30.49217$
$M_{3}$	$Γ (1, 0.01)$	$- 30.767$	$- 35.95646$	$- 31.66247$	$- 30.71537$

Table 2. Details of the extensive simulation study in which each simulation scenario consists of 120 outbreak data sets generated under various population sizes and fitted under different prior assumptions.

Scenario	True Model	Fitted Models	Parameter Values
I	$M_{1}$	$M_{1}, M_{2}$	$β = 2, 2.5, 3; ψ = 0.1; r = 20, 40$
II	$M_{3}$	$M_{2}, M_{3}$	$β = 3, 5, 7.5; ψ = 0.1; ζ = 0.3; r = 20, 40$

Table 3. Results of the true mean values of

log B F_{12}

and the estimated mean of

log (\hat{B F_{12}})

computed using AM, HM and PP methods using different population sizes and two prior distribution settings. Each row gives results from 20 simulated epidemics in which the final sizes are matched to m.

Model	N	m	Prior	True mean $\log ({BF}_{12})$	AM mean $\log (\hat{{BF}_{12}})$	HM mean $\log (\hat{{BF}_{12}})$	PP mean $\log (\hat{{BF}_{12}})$
$M_{1}$	10	8	$Γ (1, 1)$	0.1363	0.1373	0.1172	0.1367
$M_{1}$	10	8	$Γ (1, 0.01)$	−0.4487	−0.4738	−0.3657	−0.4499
$M_{1}$	50	45	$Γ (1, 1)$	2.2709	2.2569	1.4543	2.2715
$M_{1}$	50	45	$Γ (1, 0.01)$	0.8469	0.6860	1.5133	0.8486
$M_{1}$	100	90	$Γ (1, 1)$	1.8638	1.8609	0.5455	1.8627
$M_{1}$	100	90	$Γ (1, 0.01)$	0.8393	0.8841	1.4024	0.8391

Table 4. Results of the true mean values of

log B F_{23}

and the estimated mean of

log (\hat{B F_{23}})

computed using AM, HM and PP methods using different population sizes and two prior distribution settings. Each row gives results from 20 simulated epidemics from the true model in which the final sizes are matched to m.

Model	N	m	Prior	True mean $\log ({BF}_{23})$	AM mean $\log (\hat{{BF}_{23}})$	HM mean $\log (\hat{{BF}_{23}})$	PP mean $\log (\hat{{BF}_{23}})$
$M_{3}$	10	8	$Γ (1, 1)$	−0.2644	−0.2651	−0.1929	−0.2632
$M_{3}$	10	8	$Γ (1, 0.01)$	−0.1416	−0.1520	−0.2231	−0.1430
$M_{3}$	50	45	$Γ (1, 1)$	−6.4499	−6.4552	−7.2083	−6.4518
$M_{3}$	50	45	$Γ (1, 0.01)$	−6.6124	−6.6772	−6.1098	−6.6116
$M_{3}$	100	90	$Γ (1, 1)$	−17.3413	−17.3440	−18.8693	−17.3454
$M_{3}$	100	90	$Γ (1, 0.01)$	−19.5665	−19.4738	−19.1056	−19.5655

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
