You are currently viewing a new version of our website. To view the old version click .
Symmetry
  • Article
  • Open Access

10 June 2023

Computational Methods for Estimating the Evidence and Bayes Factor in SEIR Stochastic Infectious Diseases Models Featuring Asymmetrical Dynamics of Transmission

Department of Mathematics and Statistics, College of Science, Taif University, Taif 21944, Saudi Arabia

Abstract

Stochastic epidemic models may offer a vitally essential public health tool for comprehending and regulating disease progression. The best illustration of their importance and usefulness is perhaps the substantial influence that these models have had on the global COVID-19 epidemic. Nonetheless, these models are of limited practical use unless they provide an adequate fit to real-life epidemic outbreaks. In this work, we consider the problem of model selection for epidemic models given temporal observation of a disease outbreak through time. The epidemic models are stochastic individual-based transmission models of the Susceptible–Exposed–Infective–Removed (SEIR) type. The main focus is on the use of model evidence (or marginal likelihood), and hence the Bayes factor is a gold-standard measure of merit for comparing the fits of models to data. Even though the Bayes factor has been discussed in the epidemic modeling literature, little focus has been given to the fundamental issues surrounding its utility and computation. Based on various asymmetrical infection mechanism assumptions, we derive analytical expressions for Bayes factors which offer helpful suggestions for model selection problems. We also explore theoretical aspects that highlight the need for caution when utilizing the Bayes factor as a model selection technique, such as when the within-model prior distributions become more asymmetrical (diffuse or informative). Three computational methods for estimating the marginal likelihood and hence Bayes factor are discussed, which are the arithmetic mean estimator, the harmonic mean estimator, and the power posterior estimator. The theory and methods are illustrated using artificial data.

2. Bayesian Inference via MCMC Methods for the SEIR Model with Time-Dependent Infection Rate

2.1. Complete Data Case

Suppose that an SEIR system is considered and let m denote the number of individuals becoming infectious at times i = ( i 1 , , i m ) and removed from the population at times r = ( r 1 , , r m ) with 0 = r 1 r 2 r m . Similarly, let e = { e j : j ω } denote the set of the m 1 exposure times, that are the times at which individuals catch the infection but are unable to spread it, such that for 1 j m , e j refers to the exposure time of the individual who becomes infectious at time i j = e j + l and recovered at time r j , where the initial exposed individual is labeled by ω such that e ω < e j for all j ω .
Now, taking into account model M 2 , the likelihood of the data given the model parameters may be written as
L ( e , i , r | β , ψ , θ , e ω ) = j = 1 , j ω m h ψ ( β , e j ) I ( e j ) × exp e ω r m h ψ ( β , t ) S ( t ) I ( t ) d t × j = 1 m f D I r j i j | θ ,
where h ψ ( β , s ) = n 1 β e ψ s , with e k = i k = r k = for k = m + 1 , , N , and θ indicates the parameter governing the infectious period, D I , where θ may be a vector.
Note that, when the infectious period distribution is exponential with rate γ ( r j i j E x p ( γ ) ), then
j = 1 m f D I r j i j | θ = j = 1 m γ exp γ ( r j i j )
Given complete epidemic data, the Bayesian framework can first be used by assigning the model parameters’ independent conjugate gamma prior distributions as in [], namely β Γ ( λ β , ν β ) , and γ Γ ( λ γ , ν γ ) . Then, the joint posterior density for β and γ (assuming ψ is known) is obtained by multiplying the prior distributions and the likelihood as follows.
π ( β , γ | e , i , r , e ω ) L ( e , i , r , e ω | β , γ ) π ( β ) π ( γ ) = j = 1 , j ω m n 1 β e ψ e j I ( e j ) × exp n 1 β ξ ψ × j = 1 m γ exp γ ( r j i j ) × β λ β 1 e ν β β × γ λ γ 1 e ν γ γ ,
where
ξ ψ = e ω r m e ψ t S ( t ) I ( t ) d t .
The parameters β and γ are a posteriori conditionally independent, so we have
π ( β | e , e ω , i , r ) Γ λ β + m 1 , ν β + n 1 ξ ψ ,
π ( γ | e , e ω , i , r ) Γ λ γ + m , ν γ + j = 1 m ( r j i j ) .
The posterior distributions of the two parameters, β and γ , are thus easily accessible, making it simple to obtain any function of these parameters.
Note that the inference for models M 1 and M 3 , given complete data, can be performed in a similar manner (see, e.g., [,]).

2.2. Incomplete Data Case

When the exposure times and hence the infection times are not observed, the likelihood of the removal times becomes intractable. A common way to handle this issue is to use data augmentation technique [,] by treating the missing data as extra parameters to be inferred from the data [].
We again derive the following full conditional posterior distributions by assigning independent conjugate gamma prior distributions for the model parameters β and γ , and further assuming e ω has an improper uniform prior density on ( , r 1 ) ,
π ( β | γ , e , e ω , i , r ) Γ λ β + m 1 , ν β + n 1 ξ ψ ,
π ( γ | β , e , e ω , i , r ) Γ λ γ + m , ν γ + j = 1 m ( r j i j ) ,
and
π ( i , e , e ω | β , γ , r ) j = 1 , j ω m e ψ e j I ( e j ) × exp β n 1 ξ ψ × j = 1 m exp γ ( r j i j ) .
Given that the model parameters β and γ have closed forms of the posterior distribution, updating them can be performed via Gibbs sampling steps [,,]. However, as the posterior distribution of the component ( i , e , e ω ) is not explicitly provided, it needs to be updated using the Metropolis–Hastings step. In general, it is possible to apply (with minor adjustments) the MCMC technique in [,] to update both the model parameters and the missing exposure and infection times. Bayesian inference for models M 1 and M 3 can be performed in a similar way (see, e.g., [,]).
However, the focus of this paper is on model choice given complete epidemic data, where we aim to derive explicit expressions for the Bayes factor in asymmetric infection process settings to validate the estimate of the Bayes factor derived from other proposed methods.

2.3. The Behavior of the SEIR Models Featuring Asymmetrical Dynamics of Transmission

One crucial issue of special significance when modeling an infectious disease is how infectious the disease is. The rate at which individuals contract a disease, in other words. The use of strategies to stop the spread of a disease among a population can be facilitated by having access to such knowledge. The effective of having different infection processes can be observed by comparing the removal trajectories and the epidemic duration for an epidemic.
The behavior of models M 1 and M 3 was investigated using the removal trajectories and the outbreak duration. A simulation study was conducted to investigate how the two SEIR models behave. We simulated 5000 removal trajectories from each SEIR model with β = 0.5 , γ = 0.2 for the M 1 model and β = 3 , γ = 0.2 , ψ = 0.3 for the M 3 model, using a population of size N = 100 ( n = 99 , k = 1 ) .
The major outbreak components of the final size distributions were permitted to be comparable in the sense that they peaked at around the same values for both models, and we set the mean infectious period to be the same in both models ( 1 / γ = 5 ). The length of the latent period was set at l = 1 in both models.
It is clear from Figure 2 that the removal trajectory distributions of the two SEIR models differ in terms of their position and shape. In particular, the M 1 model peaks quicker and has more variants, whereas the M 3 model tends to be more widespread and has a longer epidemic duration.
Figure 2. The distributions of removal trajectories and epidemic duration are represented by the top and bottom rows, respectively. The distributions of 5000 removal curves that were simulated from each SEIR model are displayed in the first row. The distributions of the epidemic duration ( r m ) from each SEIR model based on the 5000 simulated epidemics are displayed in the second row.

3. Bayes Factors and Marginal Likelihood Estimation for SEIR Epidemic Models Given Complete Data

In this section, assuming complete outbreak data, we shall derive explicit expressions for the Bayes factor across several epidemic scenarios. This assumption could be unrealistic in practice, although it may arise when outbreaks are being closely monitored. For instance, complete data can be gathered in the early phases of a suspected major epidemic, or in experimental settings for animal diseases. The motivations for looking at Bayes factor formula are that it is of interest in its own right and it provides us of some insights into how to better comprehend this Bayesian model selection tool. In addition, we can validate the estimate of the Bayes factor derived from other methods proposed in the literature such as the arithmetic mean and harmonic mean methods in order to see how reliable they are in situations where complete epidemic data are available but Bayes factors cannot be obtained in closed forms.

3.1. Bayes Factors: Theoretical Aspects

The Bayes factor in favor of model M 1 over model M 2 can be calculated as follows.
B F 12 = π ( e , e ω , i , r | M 1 ) π ( e , e ω , i , r | M 2 ) = γ β π ( e , e ω , i , r | β , γ ) π ( β ) π ( γ ) d β d γ γ β π ( e , e ω , i , r | β , γ , ψ ) π ( β ) π ( γ ) d β d γ
The epidemic likelihood may be broken into two independent (infection and removal) parts and, as the removal processes are the same for both SEIR models, they are canceled out. Therefore, we have the following Bayes factor expression:
B F 12 = ν β λ β Γ ( λ β ) ν β λ β Γ ( λ β ) × j = 1 , j ω m n 1 I ( e j ) j = 1 , j ω m n 1 e ψ e j I ( e j ) × β β m 1 × e β n 1 ξ × β λ β 1 e ν β β d β β β m 1 × e β n 1 ξ ψ × β λ β 1 e ν β β d β = exp ψ j = 1 , j ω m e j × ν β + n 1 ξ ψ ν β + n 1 ξ m 1 + λ β ,
where
ξ = e ω r m S ( t ) I ( t ) d t ; ξ ψ = e ω r m e ψ t S ( t ) I ( t ) d t .
Now, we have ξ ψ < ξ as 0 < e ψ t < 1 , therefore B F 12 has an upper bound, that is
B F 12 < exp ψ j = 1 , j ω m e j .
In addition, to see the impact of the infection rate prior distribution on the behavior of B F 12 , we rewrite B F 12 in terms of the mean and variance of the prior distribution, that are
E Γ ( λ β , ν β ) = λ β / ν β = μ and Var Γ ( λ β , ν β ) = λ β / ν β 2 = σ 2 ,
which implies λ β = μ 2 / σ 2 and ν β = μ / σ 2 . Substituting these values into (10) yields
B F 12 = exp ψ j = 1 , j ω m e j × μ σ 2 + ξ ψ n μ σ 2 + ξ n m 1 + μ 2 σ 2 .
Consequently, as σ 2 or σ 2 0 , we have the following limits for the Bayes factor B F 12 , namely
B F 12 ( ξ ψ / ξ ) m 1 × exp ψ j = 1 , j ω m e j , as σ 2 ,
and
B F 12 exp ψ j = 1 , j ω m e j , as σ 2 0 .
It can be noticed that as the prior becomes more and more concentrated at μ ( σ 2 0 ), the Bayes factor becomes more decisive in supporting M 1 .
Similarly, the Bayes factor in favor of model M 2 over model M 3 can be obtained as follows.
B F 23 = π ( e , e ω , i , r | M 2 ) π ( e , e ω , i , r | M 3 ) = ν β λ β Γ ( λ β ) ν β λ β Γ ( λ β ) × j = 1 , j ω m n 1 e ψ e j I ( e j ) j = 1 , j ω m n 1 I ζ ( e j ) × β β m 1 × e β n 1 ξ ψ × β λ β 1 e ν β β d β β β m 1 × e β n 1 ξ ζ × β λ β 1 e ν β β d β = exp ψ j = 1 , j ω m e j × j = 1 , j ω m I 1 ζ ( e j ) × ν β + n 1 ξ ζ ν β + n 1 ξ ψ m 1 + λ β ,
where,
ξ ζ = e ω r m S ( t ) I ζ ( t ) d t .
By rewriting B F 23 in terms of the mean and variance of the prior distribution, we have
B F 23 = exp ψ j = 1 , j ω m e j × j = 1 , j ω m I 1 ζ ( e j ) × μ σ 2 + ξ ζ n μ σ 2 + ξ ψ n m 1 + μ 2 σ 2 .
Consequently, as σ 2 or σ 2 0 , we have the following limits for the Bayes factor B F 23 , namely
B F 23 ( ξ ζ / ξ ψ ) m 1 × exp ψ j = 1 , j ω m e j × j = 1 , j ω m I 1 ζ ( e j ) , as σ 2 ,
and
B F 23 exp ψ j = 1 , j ω m e j × j = 1 , j ω m I 1 ζ ( e j ) , as σ 2 0 .
It is evident that decreasing the prior uncertainty leads to favoring the M 2 model.
These theoretical aspects and explicit Bayes factor expressions are of particular assistance in gaining some insights into what estimated values of Bayes factors might be expected, specifically under different prior assumptions. For this particular epidemic setting, we may conclude that increasing (decreasing) the prior uncertainty of the model parameters reduces (increases) the Bayes factor value.
As an aside, one should be aware of any potential connections to Lindley’s paradox [] when employing the Bayes factor criteria to conduct a Bayesian model choice task. However, in this epidemic setting, models are not nested and have the same level of complexity.

Prior Sensitivity Simulation Study

It is well known that Bayes factors can exhibit strong dependence on the within-model prior distributions []. In our epidemic setting, we explore this issue in detail, considering what happens to the Bayes factors for an asymmetrical diffuse prior distribution. A typical case in practice is when λ β = 1 and ν β 0 (see, e.g., [,,]).
Figure 3 (left) shows log ( B F 12 ) values for some complete epidemic data generated under model M 1 in a closed population of size N = 100 ( n = 99 , k = 1 ) , 92 of whom were infected, with parameter values β = 3 , γ = 1 , l = 1 . The parameter ψ in model M 2 was set to 0.1 , and then log ( B F 12 ) was calculated for various values of ν β . It is clear that as the prior become more informative (i.e., ν β increases), the log ( B F 12 ) becomes more supportive to the data-generating model M 1 .
Figure 3. Values of log ( B F 12 ) (left) and log ( B F 23 ) (right), calculated for various ν β (the β prior scale parameter) with λ β = 1 fixed. The horizontal dashed lines denote the lower and upper limiting values of log ( B F 12 ) and log ( B F 23 ) as ν β varies.
Figure 3 (right) shows log ( B F 23 ) values for some complete epidemic data generated under model M 3 in a closed population of size N = 100 ( n = 99 , k = 1 ) , 92 of whom were infected, with parameter values β = 7.5 , γ = 1 , l = 1 , ζ = 0.3 . Again, we set ψ = 0.1 in model M 2 , and then log ( B F 23 ) was calculated for various values of ν β . It is evident that as ν β increases (the prior becomes more informative), the log ( B F 23 ) tends to favor the true model M 3 .

3.2. Computational Methods for Marginal Likelihood Estimation

As introduced earlier, ( e , e ω ) , i and r denote the observed exposure, infection and removal times, respectively, with β and γ representing the models’ infection and removal rates parameters. Here we discuss three computational methods to estimate the marginal likelihood presented in the literature of model choice but not applied in model selection of stochastic epidemic models. The rationale is to see how robust and reliable they are when considering dependent data with complex likelihoods such as the ones involved in our epidemic models settings.

3.2.1. Arithmetic Mean Estimator (Naive Monte Carlo Estimator)

Dropping the notational dependence on M, the marginal likelihood becomes
π ( e , e ω , i , r ) = γ β π ( e , e ω , i , r | β , γ ) π ( β , γ ) d β d γ .
The arithmetic mean (AM) estimator for π ( e , e ω , i , r ) can be obtained by a simple Monte Carlo integration method, that is
π ( e , e ω , i , r ) ^ = 1 K h = 1 K π ( e , e ω , i , r | β ( h ) , γ ( h ) ) ,
where β ( 1 ) , , β ( K ) and γ ( 1 ) , , γ ( K ) are samples generated from the priors π ( β ) and π ( γ ) , respectively.
It is well-known [] that since the likelihood is typically sharply peaked, most of the prior samples have very small likelihood values, in particular when the prior is diffuse. Therefore, unless K is very large, the prior samples will contain virtually no points from the high-likelihood region, leading to poor and inadequate estimation of the marginal likelihood. However, by the law of large numbers, this estimator converges to the true expectation as the number of independent samples, K, drawn from the prior, tends to infinity []. Additionally, this estimate can perform well when the dimensionality of the parameter space is modest, so that enormous prior samples can be generated very quickly.
  • Illustrative simulation study
In what follows, we conducted a simulation study to illustrate the effectiveness of the AM estimator of the marginal likelihood in approximating the true marginal likelihood and hence the Bayes factor. This simulation example was performed under different prior assumptions for each model parameter. As we have a prior of two dimensions in each model, M 1 , M 2 and M 3 , we expect the arithmetic mean estimates of the marginal likelihoods, namely
π ( e , e ω , i , r | M 1 ) ^ = 1 K h = 1 K π ( e , e ω , i , r | β ( h ) , γ ( h ) ) ,
π ( e , e ω , i , r | M 2 ) ^ = 1 K h = 1 K π ( e , e ω , i , r | ψ , β ( h ) , γ ( h ) ) ,
and
π ( e , e ω , i , r | M 3 ) ^ = 1 K h = 1 K π ( e , e ω , i , r | ζ , β ( h ) , γ ( h ) ) ,
to be effective in the estimating and recovering the true value of the Bayes factor. Here, β ( h ) and γ ( h ) , where h = 1 , 2 , , K , are independent samples drawn from the gamma prior distribution.
We simulated two typical epidemic data sets, one data set was generated under M 1 and the other data set was simulated from M 3 . Each outbreak was in a closed population consisting of 99 initially susceptible individuals and a single initially infected case. The model parameters for the simulation were ( β = 3 , γ = 1 , l = 1 ) for model M 1 and ( β = 7 , γ = 1 , l = 1 , ζ = 0.3 ) for model M 3 . The resulting final size was 92 under both models. The parameter ψ was set to 0.1 when fitting M 2 .
Under each prior assumption, namely Γ ( 1 , 1 ) , Γ ( 1 , 0.1 ) and Γ ( 1 , 0.01 ) , the true values of the logarithm of Bayes factors were calculated using (10) and (14). The estimated values of the logarithm of Bayes factors, under different prior distributions, were computed as follows. Initially, we simulated K samples from the prior for each parameter and evaluated the marginal likelihoods based on these prior samples using (20)–(22). Then, the logarithm of Bayes factors was estimated using the following equations.
log ( B F 12 ^ ) = log [ π ( e , e ω , i , r | M 1 ) ^ ] log [ π ( e , e ω , i , r | M 2 ) ^ ] ,
log ( B F 23 ^ ) = log [ π ( e , e ω , i , r | M 2 ) ^ ] log [ π ( e , e ω , i , r | M 3 ) ^ ] .
The results are displayed in Table 1. It was found that increasing prior uncertainty of the model parameters does not have much effect on the precision of Bayes factor estimation via AM method, as the prior in our case is low-dimensional and so fast to sample from. These results suggest that as the flat prior distribution expresses a form of neutrality and lack of previous knowledge, its use is possible in the present application. In contrast, a more informative prior (e.g., Γ ( 1 , 1 ) ) also has the advantage of estimating log ( B F 12 ) and log ( B F 23 ) more accurately. These results indicate the possibility of using the arithmetic mean estimator of the marginal likelihood as a simple but effective way of approximating Bayes factors in situations where complete epidemic data are available but Bayes factors cannot be obtained in a closed form. For instance, when comparing these SEIR models with unknown ψ or when comparing the SEIR model with different infection mechanisms and unknown power parameter ζ .
Table 1. The logarithm of Bayes factor results obtained using two data sets simulated based on models M 1 and M 3 . For each case, model parameters were assigned prior distributions to varying degrees of diffuseness to determine the effect on the logarithm of Bayes factor estimation. Three sizes of prior parameter sample ( K = 10 4 , K = 10 5 and K = 10 6 ) were used to determine the effective sample size required to reasonably estimate the logarithm of Bayes factor.

3.2.2. Harmonic Mean Method

Estimation of the marginal likelihood by harmonic mean approach has become a popular method due to its simplicity. The harmonic mean (HM) estimator was first proposed by [] who showed that the marginal likelihood can be estimated from the harmonic mean of the likelihood, given posterior samples. Under the assumption that the prior distribution of the parameters is proper, we have
1 = γ β π ( β , γ ) d β d γ = π ( e , e ω , i , r ) γ β 1 π ( e , e ω , i , r | β , γ ) π ( β , γ | e , e ω , i , r ) d β d γ .
Therefore,
1 π ( e , e ω , i , r ) = γ β 1 π ( e , e ω , i , r | β , γ ) π ( β , γ | e , e ω , i , r ) d β d γ = E ( β , γ | e , e ω , i , r ) 1 π ( e , e ω , i , r | β , γ ) ,
where E ( β , γ | e , e ω , i , r ) denotes the expected value with respect to the posterior distribution of the models parameters.
The harmonic mean estimator is given by
π ( e , e ω , i , r ) ^ = 1 K h = 1 K 1 π ( e , e ω , i , r | β ( h ) , γ ( h ) ) 1 ,
where { β ( 1 ) , , β ( K ) } and { γ ( 1 ) , , γ ( K ) } are sets of MCMC draws from the posterior distribution.
The simplicity (easy and fast) of the HM approach is its main advantage over other more specialized techniques. It uses only within-model posterior samples and likelihood evaluations, which are often available anyway as part of posterior sampling. This estimator is consistent, but sometimes its variance can be large, since occasionally a sample may be taken into consideration with low likelihood, which significantly affects the result due to the reciprocal presented in (25). The HM estimator is often biased and results in overestimating the true marginal likelihood []. It is included here because of its simplicity and being not applied as a model selection tool in stochastic epidemic modeling.

3.2.3. Power Posterior Method

An approach for estimating the marginal likelihood based on the idea of thermodynamic integration [] was developed in []. The authors introduced an auxiliary variable (temperature parameter) p [ 0 , 1 ] and defined the power posterior (PP) as
π p ( β , γ | e , e ω , i , r ) π ( e , e ω , i , r | β , γ ) p π ( β , γ ) ,
where by construction the normalizing constant of the power posterior is
z p ( e , e ω , i , r ) = γ β π ( e , e ω , i , r | β , γ ) p π ( β , γ ) d β d γ .
Clearly, as p moves from p = 0 to p = 1 , the power posterior follows a path from the prior to the posterior density, where z p = 0 ( e , e ω , i , r ) = 1 is the integral of the proper prior of ( β , γ ) and z p = 1 ( e , e ω , i , r ) is the marginal likelihood. It can be shown that
log [ π ( e , e ω , i , r ) ] = log z p = 1 ( e , e ω , i , r ) z p = 0 ( e , e ω , i , r ) = 0 1 E β , γ | e , e ω , i , r , p log π ( e , e ω , i , r | β , γ ) d p .
In [], the approach was refined so that the power posterior estimate of log [ π ( e , e ω , i , r ] ) becomes
log [ π ( e , e ω , i , r ) ] j = 1 r 1 2 ( p j p j 1 ) × [ Δ p j + Δ p j 1 ] j = 1 r 1 12 ( p j p j 1 ) 2 × [ Φ p j Φ p j 1 ] ,
where
Δ q = E β , γ | e , e ω , i , r , q log π ( e , e ω , i , r | β , γ ) ; q = p j , p j 1 ,
and
Φ q = V β , γ | e , e ω , i , r , q log π ( e , e ω , i , r | β , γ ) ; q = p j , p j 1 ,
are the expectation and the variance of log π ( e , e ω , i , r | β , γ ) at q, respectively; and r is the number of points in the interval ( 0 , 1 ] in the temperature schedule.
When considering M 2 model ( M 1 and M 3 are performed similarly), at temperature p, the power posterior densities for β and γ can be obtained using (26), which gives
π p ( β | e , e ω , i , r ) Γ λ β + p ( m 1 ) , ν β + ( p / n ) ξ ψ ,
π p ( γ | e , e ω , i , r ) Γ λ γ + m , ν γ + p j = 1 m ( r j i j ) .
More details about this method when applied to stochastic epidemic outbreak data can be found in [,], where the performance of this approach in estimating Bayes factors was investigated and assessed.

4. Extensive Simulation Study

In this section, we present an extensive simulation study for the three considered approaches in which the true values of the marginal likelihoods and hence Bayes factors are known (i.e., can be calculated analytically). Throughout this section, we assumed a fixed latent period of l = 1 , and we chose γ = 1 . Then, two simulated scenarios were preformed, each scenario was based on 120 epidemic outbreak data sets. These epidemic data sets were simulated using various population sizes ( N = 10 , 50 , 100 ) and fitted under two asymmetric prior assumptions, namely Γ ( 1 , 1 ) and Γ ( 1 , 0.01 ) .
To make each of the approaches (AM, HM and PP) for computing the evidence as comparable as possible, we tried to make each of the algorithms equivalent in terms of the number of iterations. Therefore, the results were based on a run of 100,000 after a burn-in period of 5000 iterations, and thinning the chain by taking each 5th value. Table 2 summarized the two simulated scenarios.
Table 2. Details of the extensive simulation study in which each simulation scenario consists of 120 outbreak data sets generated under various population sizes and fitted under different prior assumptions.

4.1. Scenario I

In this simulated scenario, the data-generating model was M 1 , where, using each population size, N = 10 , 50 and 100, 20 outbreak data sets were generated. Then, under the two prior assumptions, namely Γ ( 1 , 1 ) and Γ ( 1 , 0.01 ) , the true values of the log B F 12 were calculated using (10) and the log ( B F 12 ^ ) values were computed using AM, HM and PP methods and their means over the 20 data sets are presented in Table 3, where each row gives results from 20 simulated epidemics in which the final sizes are matched to m.
Table 3. Results of the true mean values of log B F 12 and the estimated mean of log ( B F 12 ^ ) computed using AM, HM and PP methods using different population sizes and two prior distribution settings. Each row gives results from 20 simulated epidemics in which the final sizes are matched to m.
Surprisingly, in this experiment, the AM approach performed very well. This might be a result of having low-dimensional parameter space in our setting. The poorest performance was from the HM estimator. The PP method proved its robustness in estimating Bayes factors.
The mean values of the log B F 12 and the means of log ( B F 12 ^ ) indicated that as the prior becomes diffuse, the true value of Bayes factors and their estimates using all approaches decrease. In addition to that, the values of Bayes factor were sensitive to epidemic data and were independent of the epidemic sizes. Another point to be mentioned here is that, with small outbreaks, having a vague prior distribution is not recommended and can affect this diagnostic tool for model choice as seen in Table 3 (the fourth row).

4.2. Scenario II

In the second scenario, the epidemic data were simulated from model M 3 . Using each population sizes, namely N = 10 , 50 and 100, we generated 20 outbreak data sets. Then, under the two prior assumptions, namely Γ ( 1 , 1 ) and Γ ( 1 , 0.01 ) , the true values of the log B F 23 were calculated using (14) and their mean is shown in Table 4. The log ( B F 23 ^ ) values were estimated using AM, HM and PP methods, and their means over the 20 data sets are presented in Table 4, where we matched the final size of the 20 outbreak data sets to m.
Table 4. Results of the true mean values of log B F 23 and the estimated mean of log ( B F 23 ^ ) computed using AM, HM and PP methods using different population sizes and two prior distribution settings. Each row gives results from 20 simulated epidemics from the true model in which the final sizes are matched to m.
The AM approach performed very well in this simulated example as a result of the relatively low number of parameters. The HM estimator was poor in recovering the true mean values of the log B F 23 . The estimates produced by the PP method had a high degree of accuracy.
Again, mean values of the log B F 23 and the mean of log ( B F 23 ^ ) showed that the true value of Bayes factors and their estimates using all approaches decrease as the prior becomes diffuse. Moreover, unlike in the first scenario, the mean values of log B F 23 were impacted by epidemic sizes. In other words, as the final epidemic increases the evidence becomes clear in supporting the true model.

5. Concluding Remarks

In this paper, we have demonstrated that Bayes factors can be obtained analytically for situations where complete epidemic data are available based on various asymmetrical dynamics of infection transmission. In addition, three computational methods to estimate the marginal likelihood and hence Bayes factor have been discussed: A Monte Carlo estimate employing samples drawn from the prior distribution, known as the arithmetic mean estimate; a Monte Carlo estimate employing samples drawn from the MCMC posterior distribution, also known as the harmonic mean estimate; and a discretized numerical approximation of the thermodynamic integration known as the power posterior estimate. Various asymmetric prior distribution assumptions have been used to see their effects on values of Bayes factors.
Bayes factor expressions that have been derived based on complete outbreak data were of particular assistance in gaining some insights into what estimated values of Bayes factors might be expected, specifically under different prior assumptions. The results also indicated the possibility of using the arithmetic mean estimator (AME) of the marginal likelihood as a simple but effective way of approximating Bayes factors in situations where complete epidemic data are available but closed forms of Bayes factors cannot be obtained.
The results presented in this research should be interpreted as a broad indicator of the level of accuracy of these approaches. It is acknowledged that there are various ways in which some of the algorithms could be optimized, such as the tempering scale in power posterior method and considering other adjusted versions of the harmonic mean approach. Although saying that the AM method clearly performs well, it remains to see how well it performs with more complicated models and different epidemic data. In the author’s opinion, the PP method showed the most promise in terms of accuracy and generality.
The evidence is typically a challenging quantity to estimate, especially if the prior distribution is dispersed, thus it is not surprising that its estimation process may need considerable work in terms of computation time and computer coding. As is the case with the harmonic mean estimator, sometimes the simplest approach may not yield the most precise results.
For further work, it would be interesting to see the efficacy of the methods for estimating the marginal likelihood and Bayes factors in other epidemic settings such as epidemic models with different removal process assumptions. It has not passed our attention to apply this work for incomplete epidemic data; however, we leave this for future work. The comparison of model selection by other model selection criterion such as DIC would be another area of investigation.

Funding

The author would like to thank the Deanship of Scientific Research at Taif University for supporting this research work.

Data Availability Statement

Simulated data were used to obtain our results.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Adrakey, H.K.; Streftaris, G.; Cunniffe, N.J.; Gottwald, T.R.; Gilligan, C.A.; Gibson, G.J. Evidence-based controls for epidemics using spatio-temporal stochastic models in a Bayesian framework. J. R. Soc. Interface 2017, 14, 20170386. [Google Scholar] [CrossRef]
  2. Butt, A.I.K.; Imran, M.; Batool, S.; Nuwairan, M.A. Theoretical analysis of a COVID-19 CF-fractional model to optimally control the spread of pandemic. Symmetry 2023, 15, 380. [Google Scholar] [CrossRef]
  3. Khajji, B.; Boujallal, L.; Balatif, O.; Rachik, M. Mathematical Modelling and Optimal Control Strategies of a Multistrain COVID-19 Spread. J. Appl. Math. 2022, 2022, 9071890. [Google Scholar] [CrossRef]
  4. Huisman, J.S.; Scire, J.; Angst, D.C.; Li, J.; Neher, R.A.; Maathuis, M.H.; Bonhoeffer, S.; Stadler, T. Estimation and worldwide monitoring of the effective reproductive number of SARS-CoV-2. eLife 2022, 11, e71345. [Google Scholar] [CrossRef]
  5. Locatelli, I.; Trächsel, B.; Rousson, V. Estimating the basic reproduction number for COVID-19 in Western Europe. PLoS ONE 2021, 16, e0248731. [Google Scholar] [CrossRef] [PubMed]
  6. Aleta, A.; Martin-Corral, D.; Pastore y Piontti, A.; Ajelli, M.; Litvinova, M.; Chinazzi, M.; Dean, N.E.; Halloran, M.E.; Longini, I.M., Jr.; Merler, S.; et al. Modelling the impact of testing, contact tracing and household quarantine on second waves of COVID-19. Nat. Hum. Behav. 2020, 4, 964–971. [Google Scholar] [CrossRef]
  7. Kucharski, A.J.; Klepac, P.; Conlan, A.J.; Kissler, S.M.; Tang, M.L.; Fry, H.; Gog, J.R.; Edmunds, W.J.; Emery, J.C.; Medley, G.; et al. Effectiveness of isolation, testing, contact tracing, and physical distancing on reducing transmission of SARS-CoV-2 in different settings: A mathematical modelling study. Lancet Infect. Dis. 2020, 20, 1151–1160. [Google Scholar] [CrossRef]
  8. Ferguson, N.M.; Laydon, D.; Nedjati-Gilani, G.; Imai, N.; Ainslie, K.; Baguelin, M.; Bhatia, S.; Boonyasiri, A.; Cucunubá, Z.; Cuomo-Dannenburg, G.; et al. Impact of Non-Pharmaceutical Interventions (NPIs) to Reduce COVID-19 Mortality and Healthcare Demand; Imperial College COVID-19 Response Team: London, UK, 2020. [Google Scholar]
  9. O’Neill, P.D. Introduction and snapshot review: Relating infectious disease transmission models to data. Stat. Med. 2010, 29, 2069–2077. [Google Scholar] [CrossRef]
  10. Alharthi, M. Bayesian Model Assessment for Stochastic Epidemic Models. Ph.D. Thesis, University of Nottingham, Nottingham, UK, 2016. [Google Scholar]
  11. Gibson, G.J.; Streftaris, G.; Thong, D. Comparison and assessment of epidemic models. Stat. Sci. 2018, 33, 19–33. [Google Scholar] [CrossRef]
  12. O’Neill, P.D.; Becker, N.G. Inference for an epidemic when susceptibility varies. Biostatistics 2001, 2, 99–108. [Google Scholar] [CrossRef]
  13. Boys, R.J.; Giles, P.R. Bayesian inference for stochastic epidemic models with time-inhomogeneous removal rates. J. Math. Biol. 2007, 55, 223–247. [Google Scholar] [CrossRef]
  14. Alharthi, M. Model discrimination for epidemiological SEIR-type models with different transmission mechanisms. JP J. Biostat. 2022, 20, 27–50. [Google Scholar] [CrossRef]
  15. Severo, N.C. Generalizations of some stochastic epidemic models. Math. Biosci. 1969, 4, 395–402. [Google Scholar] [CrossRef]
  16. Liu, W.M.; Hethcote, H.W.; Levin, S.A. Dynamical behavior of epidemiological models with nonlinear incidence rates. J. Math. Biol. 1987, 25, 359–380. [Google Scholar] [CrossRef] [PubMed]
  17. O’Neill, P.; Wen, C. Modelling and inference for epidemic models featuring non-linear infection pressure. Math. Biosci. 2012, 238, 38–48. [Google Scholar] [CrossRef]
  18. Roberto Telles, C.; Lopes, H.; Franco, D. SARS-COV-2: SIR model limitations and predictive constraints. Symmetry 2021, 13, 676. [Google Scholar] [CrossRef]
  19. Aristotelous, G. Topics in Bayesian Inference and Model Assessment for Partially Observed Stochastic Epidemic Models. Ph.D. Thesis, University of Nottingham, Nottingham, UK, 2020. [Google Scholar]
  20. Britton, T.; Kypraios, T.; O’Neill, P.D. Inference for epidemics with three levels of mixing: Methodology and application to a measles outbreak. Scand. J. Stat. 2011, 38, 578–599. [Google Scholar] [CrossRef]
  21. Andersson, H.; Britton, T. Stochastic Epidemic Models and Their Statistical Analysis; Springer: New York, NY, USA, 2000; Volume 4. [Google Scholar]
  22. Aitkin, M. Posterior bayes factors. J. R. Stat. Soc. Ser. B Methodol. 1991, 53, 111–128. [Google Scholar] [CrossRef]
  23. Kass, R.E.; Raftery, A.E. Bayes factors. J. Am. Stat. Assoc. 1995, 90, 773–795. [Google Scholar] [CrossRef]
  24. Neal, P.J.; Roberts, G.O. Statistical inference and model selection for the 1861 Hagelloch measles epidemic. Biostatistics 2004, 5, 249–261. [Google Scholar] [CrossRef] [PubMed]
  25. O’Neill, P.D.; Marks, P.J. Bayesian model choice and infection route modelling in an outbreak of Norovirus. Stat. Med. 2005, 24, 2011–2024. [Google Scholar] [CrossRef]
  26. Knock, E.S.; O’Neill, P.D. Bayesian model choice for epidemic models with two levels of mixing. Biostatistics 2014, 15, 46–59. [Google Scholar] [CrossRef] [PubMed]
  27. Alharthi, M.; Kypraios, T.; O’Neill, P.D. Bayes factors for partially observed stochastic epidemic models. Bayesian Anal. 2019, 14, 907–936. [Google Scholar] [CrossRef]
  28. Worby, C.J. Statistical Inference and Modelling for Nosocomial Infections and the Incorporation of Whole Genome Sequence Data. Ph.D. Thesis, University of Nottingham, Nottingham, UK, 2013. [Google Scholar]
  29. Zhang, L. Time-Varying Individual-Level Infectious Disease Models. Ph.D. Thesis, University of Guelph, Guelph, ON, Canada, 2014. [Google Scholar]
  30. Touloupou, P.; Alzahrani, N.; Neal, P.; Spencer, S.E.; McKinley, T.J. Efficient model comparison techniques for models requiring large scale data augmentation. Bayesian Anal. 2018, 13, 437–459. [Google Scholar] [CrossRef]
  31. Gelman, A.; Meng, X.L. Simulating normalizing constants: From importance sampling to bridge sampling to path sampling. Stat. Sci. 1998, 13, 163–185. [Google Scholar] [CrossRef]
  32. Friel, N.; Pettitt, A.N. Marginal likelihood estimation via power posteriors. J. R. Stat. Soc. Ser. B Stat. Methodol. 2008, 70, 589–607. [Google Scholar] [CrossRef]
  33. O’Neill, P.D.; Roberts, G.O. Bayesian inference for partially observed stochastic epidemics. J. R. Stat. Soc. Ser. A Stat. Soc. 1999, 162, 121–129. [Google Scholar] [CrossRef]
  34. Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B Methodol. 1977, 39, 1–38. [Google Scholar]
  35. Gelfand, A.E.; Smith, A.F. Sampling-based approaches to calculating marginal densities. J. Am. Stat. Assoc. 1990, 85, 398–409. [Google Scholar] [CrossRef]
  36. Kypraios, T. Efficient Bayesian Inference for Partially Observed Stochastic Epidemics and a New Class of Semi-Parametric Time Series Models. Ph.D. Thesis, Lancaster University, Lancaster, UK, 2007. [Google Scholar]
  37. Lindley, D.V. A statistical paradox. Biometrika 1957, 44, 187–192. [Google Scholar] [CrossRef]
  38. Robert, C.P. The Bayesian Choice: From Decision-Theoretic Foundations to Computational Implementation; Springer: New York, USA, 2007; Volume 2. [Google Scholar]
  39. Clancy, D.; O’Neill, P.D. Bayesian estimation of the basic reproduction number in stochastic epidemic models. Bayesian Anal. 2008, 3, 737–757. [Google Scholar] [CrossRef]
  40. Alharthi, M.F. The Basic Reproduction Number for the Markovian SIR-Type Epidemic Models: Comparison and Consistency. J. Math. 2022, 2022, 1925202. [Google Scholar] [CrossRef]
  41. Robert, C.P.; Casella, G.; Casella, G. Monte Carlo Statistical Methods; Springer: New York, NY, USA, 2004; Volume 3. [Google Scholar]
  42. Newton, M.A.; Raftery, A.E. Approximate Bayesian inference with the weighted likelihood bootstrap. J. R. Stat. Soc. Ser. B Methodol. 1994, 56, 3–48. [Google Scholar] [CrossRef]
  43. Lartillot, N.; Philippe, H. Computing Bayes factors using thermodynamic integration. Syst. Biol. 2006, 55, 195–207. [Google Scholar] [CrossRef]
  44. Friel, N.; Hurn, M.; Wyse, J. Improving power posterior estimation of statistical evidence. Stat. Comput. 2014, 24, 709–723. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.