1. Introduction
The SARS-CoV-2 virus was first reported in December 2019 in Wuhan, Hubei Province, China, and its subsequent rapid global spread gave rise to the COVID-19 pandemic [
1]. Despite the fact that various new vaccines have been developed and deployed since 2019, many countries have continued to experience epidemiological waves characterized by high infection and hospitalization rates. Nearly five years later, these challenges remain significant, and the virus continues to circulate. According to the World Health Organization (WHO) [
2], as of 28 September 2025, a total of 778,741,840 COVID-19 cases had been reported, with the number of deaths globally reaching 7,102,636.
The spread of SARS-CoV-2 is typically modeled mathematically using either deterministic or stochastic approaches. Deterministic models assume a continuous host population, whereas stochastic models usually consider a discrete host population [
3,
4,
5,
6]. The stochastic approach allows for the capture of randomness and heterogeneity in disease transmission. In particular, it supports early-phase modeling for relatively smaller populations. Deterministic methods, however, offer important advantages in terms of transparency, interpretability, and computational efficiency. They scale very well to large populations and often perform better when forecasting overall pandemic trajectories. Moreover, their analytical tractability allows for the derivation of key epidemiological quantities such as the basic reproduction number, the herd-immunity threshold, and the timing of peak infectiousness.
Motivated by the proven capabilities of SIR-based frameworks and building on our own experience in advancing and refining their features, the present study focuses on the further development of this class of models.
To apply a deterministic model described by differential equations, one must know the time behavior of all involved parameters. In COVID-19 modeling, it is sometimes reasonable to assume that all parameters in certain models are known and even constant, although typically only for short periods of time [
7,
8,
9]. However, when modeling the spread of a novel virus such as SARS-CoV-2, most of the parameters are initially unknown. This requires first solving an appropriate inverse problem to estimate the parameters from reported data. It is well known that inverse epidemiological problems involving nonlinear ordinary differential equations (ODEs) are generally ill-posed [
10,
11,
12]. For this reason, it is necessary to determine parameters or combinations of parameters that can be uniquely identified from the available data. Typically, parameters are identified by solving an appropriate minimization problem [
10,
13,
14,
15,
16,
17]. For example, an extended SEIR model with hospitalization was studied in [
10], and the corresponding inverse problem was reduced to the minimization of a quadratic objective functional. In [
14], the parameter identification problem for a modified SEIR model was reduced to the minimization of a Tikhonov functional. In [
15,
16], the inverse problems for the classical nonlinear SIR and for an adaptive SIR model were solved by an iterative minimization procedure, resulting in well-posed linear systems. In the case of COVID-19, in particular, the inverse problems that arise from reported case data are nonstandard. Some functions and parameters in the model are known from empirical evidence, whereas the remaining functions and parameters must be determined. However, the compartment models used in COVID-19 modeling are typically extensions of the classical SIR model introduced by Kermack and McKendrick in [
18]. The disruption of vaccination campaigns around the world necessitated the inclusion of a group representing vaccinated individuals in the models [
19,
20,
21,
22]. On the other hand, the limited capacity of hospitals and availability of intensive care unit (ICU) beds posed a serious challenge during the peak periods of the epidemic waves. Therefore, it was essential to employ models that account for hospital occupancy [
23,
24,
25,
26,
27]. That is why, in our previous articles, we also introduced models with vaccination [
28,
29] and hospitalization [
30]. Another important aspect of the COVID-19 pandemic, analyzed through deterministic models, is its impact on the economy, social life, and the progression of other diseases [
31,
32,
33,
34,
35].
Let us mention also some results for the SEIR epidemic model for COVID-19 transmission using a Caputo derivative of fractional order [
36,
37,
38,
39].
In our previous paper [
30], we studied a complex SEIRS-VBHC model in which hospitalized individuals are divided into two categories: non-critical and critical (requiring intensive care). Actually, the officially reported COVID-19 data are not enough to validate this model. Nevertheless, as we demonstrated in [
30], it is possible to identify certain parameters within the SEIRS-VBHC model that are sufficient to compute both the basic and effective reproduction numbers.
Motivated by these limitations in real-world data availability, in this article, we present a unified framework for the mathematical analysis, modeling, computer simulation, and visualization of the COVID-19 pandemic based on real epidemiological observations. This framework provides a consistent basis for the formulating inverse problems, estimating time-dependent epidemiological parameters, and validating the resulting models. In the present paper, we introduce a new SEIRS-VBH model in which hospitalized critical cases are not treated as a separate group for which we do not have enough information. This formulation enables the use of additional information on new hospital admissions (or the cumulative total number) to identify all model parameters and validate the model. More precisely, in
Section 2, the SEIRS-VBH model is introduced, and the corresponding direct problem is stated. Special extended direct and inverse problems are formulated based on reported COVID-19 data in Bulgaria. The biologically reasonable mathematical properties of the model are established in
Section 3. The practical identifiability of the model is discussed in
Section 4. In
Section 5, following the approaches described in [
40,
41], a special family of nonstandard semi-implicit variants of the model is proposed, and the biologically meaningful properties of the discrete variants are studied. The employed nonstandard schemes significantly improve the accuracy of the results. Furthermore, a family of time-discrete inverse problems is formulated, and a unique solving algorithmis developed. In
Section 6, this algorithm is applied to the reported COVID-19 data in Bulgaria. Then, by numerically solving the direct differential problem, we identify the discrete inverse problem whose solution provides the best approximation of the reported data with respect to the
and
sub-norms.
Section 7 presents the simulation tool developed by the authors of present paper for implementing the concept and algorithm outlined in
Section 5 and
Section 6. A visualization of the results obtained from the model validation are presented in
Appendix A. A discussion and conclusions are outlined in
Section 8 and
Section 9, respectively.
2. Differential Model SEIRS-VBH and Statement of the Inverse Problem
In order to study the transmission dynamics of the SARS-CoV-2 strain of coronavirus, we introduce a new deterministic SEIRS-VBH model with time-dependent parameters. Although developed for COVID-19, the model could potentially also be applied to the spread of other viral infections. According to the new model, the host population is divided into the following compartments: Susceptible (
S), Exposed (
E), Infectious (
I), Recovered (
R), Vaccinated Susceptible (
V), Individuals with vaccination-acquired immunity (
B), and Hospitalized (
H). For clarity, we note that (
V) is the group of individuals who have been vaccinated but remain susceptible to infection for a short period until antibodies develop, while (
B) denotes the group of individuals who have developed antibodies as a result of vaccination and are therefore protected. The movement of individuals among the SEIRS-VBH model compartments is illustrated in
Figure 1. As can be seen, the proposed model also incorporates vital dynamics and accounts for the loss of immunity in both vaccinated and recovered individuals.
The model is described mathematically by the following Cauchy problem for a system of nonlinear ordinary differential equations:
with non-negative initial conditions of
Here,
is the total population size (the number of all living individuals), and
. The coefficients in the system (
1) are time-dependent, and they involve the parameters listed in
Table 1.
We call this deterministic problem the direct problem.
Direct problem: Suppose that the initial data, i.e.,
and all parameters, i.e.,
are known for
. Solve the Cauchy problem (
1), (
2) and find the unknown function, i.e.,
for
.
Unfortunately, in practice, the parameters are unknown, except a subset that can be determined based on statistical data. On the other hand, officially reported data provide information about certain functions in the model or about other functions closely related to the model. Thus, depending on the available data, various interesting inverse problems arise, some of which can be solved.
Official COVID-19 data are typically available for the following:
—The number of active cases;
—The cumulative number of individuals who have recovered from the disease up to time t. In contrast to , this group includes individuals who have recovered but may have since lost their immunity acquired from the disease.
—The cumulative number of individuals hospitalized due to the disease up to time t. Typically, the reported data are for new hospital admissions. Their sum up to time t corresponds to .
—The cumulative number of COVID-19 deaths up to time t;
—The cumulative number of vaccinated individuals up to time t;
This allows us, similarly to the SIR-type models (see [
7,
28]), to expand the SEIRS-VBH model with the following linear equations:
with initial conditions of
The
equation in (
5) represents the influx of newly recovered individuals originating both from the infectious population (captured by the
term) and from the hospitalized compartment (represented by
). The
equation in (
5) accounts for the number of vaccinated individuals who have not yet acquired immunity and are susceptible to the disease. The
equation in (
5) describes the dynamics of newly hospitalized cases, while the
equation in (
5) represents the flow of newly deceased individuals.
For convenience, we introduce the following notations for the functions:
Next, we formulate the following:
Extended direct problem: Suppose that the initial data, i.e.,
, and all parameters, i.e.,
, are known for
. Solve Cauchy problems (
1), (
2) and (
5), (
6) and find all functions (
and
) for
.
To formulate the appropriate inverse problem, we introduce the function expressed as
which represents quantities for which official data are typically unavailable and will therefore be treated as unknown. In contrast, the
function corresponds to quantities for which official data are generally available and will, thus, be considered to be known.
In the same manner, we divide the parameters (
) into two separate groups: a group of known parameters, i.e.,
where we add vaccine effectiveness (
), and a group of unknown parameters, i.e.,
Finally, we formulate a special inverse problem corresponding to the extended direct problem described above.
Inverse Problem (
)
: Let the initial data (
), the functions (
), and the parameters (
) be given on the interval of
. Then, using the system of differential Equation (
1), find the functions (
) and the parameters (
) for
.
Remark 1. The required smoothness of the functions will be specified as needed in the corresponding context.
Since the reported data provide the functions () as piece-wise constant (step) functions of time, it is natural to also search for and as step functions. For this reason, we introduce a family of time-discrete variants of the SEIRS-VBH model and formulate an appropriate family of discrete variants (problems) of problem .
3. Mathematical Analysis of the Differential SEIRS-VBH Model
Let us consider the SEIRS-VBH model for the time frame of
, where
. In the traditional SIR and SEIR models [
18], as well as in several SIR/SEIR-based models with fixed coefficients, it is assumed that the population size (
N) remains constant. However, in model (
1), the population size (
) is a time-varying function.
Clearly, for all , as the total population size represents the sum of biologically meaningful and non-negative subpopulations (e.g., susceptible, exposed, infected, etc.). Moreover, from a biological perspective, must remain strictly positive, since the model assumes an initially non-zero population (i.e., ).
Our first goal is to show that the proposed model possesses biologically reasonable properties. In particular, it has a unique solution that remains non-negative and is bounded above by a function depending on the initial population size, as well as on the birth and mortality rates. Such boundedness guarantees the viability of the population.
To study the biologically meaningful properties of the model under the assumption of a fixed total population size (
1), following [
28,
42], we introduce the following functions:
For the new unknown functions, using (
1) and (
2), we obtain the following reduced Cauchy problem:
with non-negative initial conditions of
where
Subsequently, let us introduce the following notations:
It is clear that if problem (
8), (
9) is obtained from problem (
1), (
2) by the substitution of (
7), then
for
However, if we consider Cauchy problem (
8), (
9) as a problem independent of Cauchy problem (
1), (
2), then the property of
does not necessarily hold, and the following lemma is particularly useful.
Lemma 1. Let for some and the initial data () satisfy the condition of Suppose that there exists a solution () of Cauchy problem (8), (9). Then, for the sum of the components of , we have for all . Proof. By summing up all equations in system (
8), we find that
satisfies the following first-order linear ordinary differential equation and the initial condition of
:
Obviously,
is a solution of the obtained Cauchy problem. We also note that this Cauchy problem has a unique solution, since the coefficients in Equation (
11) are continuous for
.
The proof is complete. □
Remark 2. According to the notation used here and throughout, we say that a real vector () is non-negative (positive) if all its components () are non-negative (positive). Furthermore, we use the norm defined as Actually, for , the norm is
Furthermore, we study Cauchy problem (
8), (
9), as we are focused on a real-world epidemic scenario in which, at the initial time, the numbers of susceptible individuals (
) and infectious individuals (
) are both positive. For this purpose, we introduce the following set:
We now prove the following existence and uniqueness theorem, in which neither the condition of
nor the restriction to
imposed in Theorem 1 in [
30] is required.
Theorem 1. Let , , and for . Then, there exists a unique solution () of Cauchy problem (8), (9), which is well-defined and bounded for all . In addition, for . Proof. Owing to the specific structure of the vector-valued function appearing on the right-hand side of system (
8) (whose components are second-order polynomials with respect to
components), this function is continuous and differentiable. Then, according to Picard’s existence and uniqueness theorem (Theorem 1.1., p. 8 in [
43] or Theorem 2., p. 20 in [
44]), there exists a unique solution (
) of Cauchy problem (
8) with initial conditions (
9) defined in the interval of
for some
.
Our next goal is to prove that .
Step 1. Here, we prove that and , for .
Let us consider the
-th, the
-th, the
-th, the
-th, the
-th, and the
-th equation from system (
8). All these equations are linear first-order differential equations in the form of
where
and
are continuous functions over the interval of
. The unique solution to the Cauchy problem for Equation (
13) with an initial condition of
is given by the following formula:
From (
15), it is then obvious that if
and
over the interval of integration
, then
at all points
. If, in addition,
, then
.
Now, taking into account , we assume that vanishes at some points in . Let denote the first time such that . Then,
(i) The
equation of system (
8) is in the form of (
13), and we can use (
15) with
and
in
. Therefore,
throughout the interval of
.
(ii) The
equation of system (
8) is in the form of (
13), with
and
in
. Hence,
for
.
(iii) The
equation of system (
8) is in the form of (
13), with
and
in
. It follows that
for
.
(iv) The
equation from system (
8) conforms to the form of (
13), with
and
in
. Therefore,
.
(v) The
equation from system (
8) is not in the form of (
13), but it is a Riccati equation with respect to
. Taking into account
, we assume there exists a point (
) such that
. Then, since
is a continuous function on
, there exists a point (
) such that
and
for all
. Then, the
equation from system (
8) gives
for
. This leads to a contradiction with
and
for
. Therefore,
for
.
(vi) The
equation from system (
8) conforms to the form of (
13), with
and
in
. Therefore,
for
(vii) Now, since the
equation from system (
8) conforms to the form of (
13) with
and
in
, it follows that
.
The results in (iv) and (vii) lead to a contradiction with . Hence, it follows that and for .
Now, it easy to repeat stages (i), (ii), (iii), (v), and (vi) on the interval of instead of and conclude that for .
Step 2. Since
and
is a solution of Cauchy problem (
8), (
9) with
,
,
in
and
, Lemma 1 gives
for all
Therefore, the solution is
for
, and it is uniformly bounded on
. According to the Extension Theorem (Theorem 3.1, p. 12 in [
43] or Proposition (A), p. 189 in [
44]), it follows that
, which means solution
exists on the interval of
and
for
.
This completes the proof. □
In order to study the original Cauchy problem (
1), (
2) for
, we introduce the following sets:
where
.
We are now prepared to formulate the following existence and uniqueness theorem, which also provides an upper bound for the population size ().
Theorem 2. Let the initial data be , and for . Then, there exists a solution () of Cauchy problem (1) and (2), which is defined for , and for . The solution with such properties is unique. Proof. To prove the existence and uniqueness of such a solution, we use Theorem 1.
- (i)
Existence.
As
with
and
, let us consider Cauchy problem (
8), (
9) with an initial condition of
. Since
and
are both strictly positive,
and all other conditions required by Theorem 1 are satisfied, it follows that Theorem 1 guarantees the existence of a function (
) that is a unique solution of problem (
8), (
9) defined for
.
Next, we consider the following Cauchy problem:
It follows directly that the unique solution to problem (
16) is
and is defined for
.
As shown earlier, system (
1) can be transformed into (
8). Now, by reversing this transformation using
, we find that the function expressed as
is a solution to the Cauchy problem defined by (
1), (
2) on the interval of
and satisfies
and
for all
.
Furthermore, by summing up all equations in system (
1) and the initial conditions (
2), we derive the following Cauchy problem:
which is, in fact, problem (
16) with
Problem (
17) has a unique solution, i.e.,
Since
in
, it follows that
Therefore, for .
- (ii)
Uniqueness. The uniqueness of the solution is proven in a manner analogous to that used in Theorem 2 [
28].
The proof is complete. □
4. Identifiability of the SEIRS-VBH Model
Mathematical models in epidemiology consist of many different coefficients (for example, see ODE system (
1)). We are now attempting to solve ODE system (
1) with the given initial conditions (
2) in a case where all coefficients (epidemiological parameters) in system (
1) are known. As usual, we refer to this as the
direct problem. In contrast to the direct problem, we now seek to determine some unknown coefficients using additional information about the groups in solution
. With this information at hand, it is possible to find the coefficients, which means that we solve a problem opposite to the direct problem (the so-called
inverse problem). Such inverse problems arise in many practical situations (see [
45,
46] and references therein). These inverse problems are sometimes
ill-posed in the sense of Hadamard, and in the current situation, we actually encounter such a case. More information about this procedure for solving epidemiological problems can be found in [
46]. Each such problem requires a specific research approach.
In the present paper, we develop a methodology for approximating the reported COVID-19 data (five series of data points: active, recovered, vaccinated, hospitalized, and deceased cases) for Bulgaria using the differential SEIRS-VBH model. Here, we note that we are looking only for biologically reasonable values. As explained above, we first identify the parameters in the model by solving the
inverse problem corresponding to the direct problem (
1), (
2). A criterion for assessing how well the parameters of the differential model are estimated from the reported data is the quality of the approximation. To measure approximation accuracy, we use relative errors with respect to the
and
norms. We perform a perturbation analysis (using both
and
c parameters) and find that the best approximation of the reported data with respect to the
norm coincides with that obtained using the
norm. Moreover, the results presented in
Figure 2 and
Figure 3 show very good agreement between the corresponding error margins of the
and
norms. These observations suggest the robustness of the proposed SEIRS-VBH model with respect to the norm of choice within the
family.
The available data for Bulgaria lead to a special inverse problem. To describe this complex situation more precisely and clearly, we provide a brief overview of the developed methodology:
We construct a family of discrete inverse problems that are uniquely solvable (see the Unique Solving Algorithm in
Section 5). This is achieved by using a family of discrete variants of the differential model that involve two new parameters (
and
c; see the difference scheme (
20) and the function (
40)).
We perturb the
and
c parameters, and for each pair
, we solve the corresponding discrete inverse problem (see Step 1 in
Section 6). In this way, for each
, we find a set of model parameters
. Each parameter set is specific to both the particular reported data and the corresponding discrete model.
We numerically solve the extended
direct problems (
1), (
2) and (
5), (
6) using each of the identified parameter sets. These computations are performed using the ode45 solver in Matlab R2024a, which implements the Runge–Kutta 4th/5th-order method. As a result, we obtain numerical solutions (with a step of 0.01) of the original SEIRS-VBH model (see Step 2 in
Section 6).
For each pair
, we compute the corresponding relative approximation error with respect to the
and
norms (see Step 3 in
Section 6).
The minimum of the computed relative errors shows the best approximation of the official data with respect to the chosen error measure (see Step 4 in
Section 6). This serves as our criterion for solving both the inverse and direct problems (for comparison, see
Figure 4 and
Figure 5 and
Appendix A.1).
In such a way, we fix the model parameters and all model curves, including those for the active, recovered, vaccinated, hospitalized, and deceased cases.
5. Discrete Variants of the Differential SEIRS-VBH Model and Related Discrete Inverse Problems
Let us note, again, that inverse epidemiological problems involving nonlinear ordinary differential equations are generally ill-posed.
Now, in order to formulate an appropriate family of discrete inverse problems related to the differential inverse problem
, we introduce a family of discrete variants of the differential problem (
1), (
2), extended by (
5), (
6).
Let us consider a time frame of
, where
and a step size of
. If
, then
represents fixed consecutive days of the epidemic (actually, the official data for COVID-19 are reported once per day). Next, we introduce the notation for the values of functions in the model for
, i.e.,
as well as the values of the parameters:
Remark 3. Furthermore, each parameter value () is hereinafter considered the value corresponding to day required to obtain from
Here, we propose time-discrete analogs of the differential SEIRS-VBH model. More precisely, for
, we introduce the following family of modified semi-implicit difference schemes with a weight of
and a non-negative function (
) that substitutes the step size (
h) such that
as
:
where
and the initial values are
.
Remark 4. In [40,41], a similar modified difference scheme was investigated but in the context of the classical SIR model. The proposed functions are , , . In this study, we find and use different functions that yield better results for day (which corresponds to the reported data). Since we apply the discrete model (20) with a step of day , only the value of is relevant. Therefore, in the numerical experiments, we use specific simple functions, which will be defined later. Our goal is to construct an appropriate function () that reduces the relative error of the computations. Our first objective is to show that the modified discrete problem (
20), when provided with appropriate initial data, exhibits biologically plausible behavior comparable to that of the differential SEIRS-VBH model. Consequently, the next step involves determining sufficient conditions on the parameter values (
) given by (
19) and the
function in the difference scheme (
20) to ensure the component-wise non-negativity of the numerical solution.
Theorem 3. Let and be fixed and with for all . Let the condition ofwithhold. Then, for the values of obtained from (20), we find that with , for , and the estimate ofalso holds for all . Proof. We prove the statements for and separately.
Step 1. First, we prove, by induction, that with and for
The statement of the theorem for holds. Now, according to the induction hypothesis, with for some . Thus, we obviously have and .
Now we express (
20) in the following form:
Applying (
21), we proceed in the following manner:
From the I Equation (
23), it follows that
According to our inductive assumption, we have
, and condition (
21) ensures that
. Hence,
.
Now, we consider the S equation in (
23). Since
, by applying (
24) and
, we obtain
Furthermore, condition (
21) implies that
, and we conclude that
.
By applying (
24) and
to the E equation and V equation in (
23), we obtain
and
. The R equation and B equation yield
and
, respectively.
Finally, let us consider the H equation in (
23). Condition (
21) ensures that the coefficient is
. Additionally, by taking into account that
, we obtain
Therefore, the statement of the theorem for holds.
Step 2. Now, we prove (
22). By summing all the equations from (
20), we derive the following:
.
Consequently, we find that
From (
25), taking into account that
and
, by removing the negative term, it follows directly that
Since, the inequalities (
26) hold for
, it follows that (
22)holds as well.
The proof is complete. □
Our next goal is to formulate an appropriate discrete inverse problem related to the difference scheme (
20).
In accordance with this, we define the following quantities for ease of notation:
By discretizing with a step size of
and applying Equations (
27)–(
32), we rewrite (
20) in the following form:
Furthermore, discretizing Equation (
5) with a step size of
and again applying Equations (
27)–(
32), we derive the following supplementary relationships:
where
,
,
and
.
Next, following [
28,
29], we introduce the notations for the officially reported data:
and for the unknown values
where
Then, we categorize the parameters () into two separate groups:
- -
A group consisting of parameters whose values are naturally assumed to be known:
where we add the daily values (
) of vaccine effectiveness;
- -
A group consisting of parameters whose values we need to find:
To analyze the dynamics of COVID-19, we now formulate an inverse problem. To do this, we determine the parameters of system (
1) using official reported data.
Inverse Problem (
): Let
be fixed and the values of
,
and
be given. Then, find the quantities of
such that relations (
33)–(
37) hold.
It is important to note again that inverse problems are typically ill-posed.
To solve inverse problem
, first of all, we sum up the second and third equations in the discretized scheme (
33). Then, applying (
27) and notations (
27)–(
32), we obtain the following relations:
Based on these findings, we can now outline the following procedure.
Unique Solving Algorithm for the Inverse Problem : Assume that the non-negative data values of , and are given for , along with the initial non-negative values of , and and their sum (), where and . Using this information, is also determined, and we compute the sequences of and according to the steps described below.
According to (
27), non-hospitalized active cases are expressed as
According to (
35), we find that
From the
equation in (
38), we obtain
According to (
34), we find that
Let us now assume that the values of , and therefore for some are calculated and . Then,
Taking into account that
and according to (
36), for
, we obtain the vaccination parameter:
Using the
equation from (
38), we calculate
By means of the
equation,
equation,
equation,
equation, and
equation in (
38), we compute
, and
sequentially.
Finally, the value of
is also obtained. In this way, the values of
are calculated sequentially.
Now, according to Equations (
28)–(
32), the parameters are computed as follows for
:
Should any step yield a biologically unreasonable value, the algorithm must terminate.
Finally, taking into account Remark 4, we consider the following family of simple functions:
where a constant is
. Furthermore, in
Section 6 we apply the Algorithm for Solving the Inverse Problem (
) using different functions (
), i.e., different
c values. Since
, the computed function values and parameter values obtained by this algorithm depend on the choice of
c.
Remark 5. The backward Euler scheme is a widely used method for finite-difference approximation of time-derivative terms in complex evolution models. Its primary advantage is that it is unconditionally stable. However, it offers only first-order accuracy. It is also well known that higher-order schemes generally require sufficiently small time steps to maintain stability. In this study, this approach is not applicable due to the fact that we fixed the time step to 1 day. Under this constraint, the choice of becomes natural. Enhanced accuracy is then achieved by optimizing the parameter to within the family of finite-difference schemes generated by the perturbation (). In the next section, we show that this approach yields a substantial improvement in accuracy while preserving the non-negativity of the numerical solution.
6. Solving the Discrete Inverse Problems for the SEIRS-VBH Model and Validation
To validate the SEIRS-VBH model, we use officially reported data for COVID-19 in Bulgaria [
47,
48,
49,
50] for the time frame of 7 June 2020–12 March 2023. The reported cases for 7 June 2020 are used as the initial data, and we consider an additional 1008 days (144 weeks) for the simulation, i.e.,
,
,
. We assume that the official information provided each morning refers to the data from the previous day; for example, the 1019 reported active cases on 8 June 2020 are considered as the
value corresponding to 7 June 2020.
We use data on Bulgaria’s population, as well as annual birth and mortality rates from [
51], together with data on COVID-19 deaths reported in [
47], to calculate
and the following daily values for birth and natural mortality rates:
and
for the days of 2020, 2021, 2022, and 2023, respectively. Then, in accordance with [
30], we assume that the reinfection rates of recovered and vaccinated individuals are equal to
and the antibody rate is
. Finally, for the latency rate, we use
, and for the vaccine effectiveness rates, we use
for the
1 Wuhan variant;
2 Alpha, Beta, and Gamma variants;
3 Delta variant; and
4 Omicron variant of SARS-CoV-2.
It should be noted that the group includes individuals who completed the full recommended course of vaccination, as well as those who received the booster dose. The latter are considered newly vaccinated individuals who have lost their vaccine-acquired immunity.
To solve the discrete inverse problem (
), then to validate the model, we introduce the grid expressed as
Now, we describe the results from the conducted numerical experiments.
First of all, for each node of the grid (), we perform the following steps:
Step 1 (Parameter identification). Using the Unique Solving Algorithm from
Section 5, we solve the discrete inverse problem (
) with the
function and
day. As a result, we obtain the following non-negative parameter values:
and
. Graphics of the parameter values obtained for particular values of
and
c are provided in
Appendix A.2.
Step 2 (Extended direct problem solving). We use the parameter values obtained in Step 1 and numerically solve the extended direct problem (
1), (
2) and (
5), (
6) (with
) with initial conditions of
,
,
,
,
,
,
and
,
,
,
. More precisely, we solve the extended Cauchy problem for the second day (
) (using the initial data from
). Then for each next day (
), the problem is solved again with constant parameters of
, where the computed values of the
functions at the end of the previous day (
) serve as the initial data for the current day. This means that only the initial data for the first day are prescribed, while those for the following days are computed. This computation is performed using the ode45 solver in Matlab, which implements the Runge–Kutta 4th/5th-order method with a step size of
on each interval (
), and as a result, we obtain the numerical solution:
and
for
,
.
Graphics of the components of
for special values of
and
c, as defined in Step 4, are presented in
Appendix A.3. Let us mention that all values obtained by this procedure are biologically reasonable.
Step 3 ( and relative errors calculation). Let us introduce the notation for the official reported numbers of active cases, hospitalized individuals, total recoveries, total vaccinations, total hospitalizations, and total deaths:
For the corresponding computed values at the end of each day, we use the following notations:
where the calculated active cases are
Then, we compute the relative errors as
and
The results are presented in
Figure 2 and
Figure 3. We observe that the proposed modified discrete difference schemes improve both the standard explicit (
) and semi-implicit (
) schemes. The relative errors (
and
) for the modified explicit (forward) Euler scheme (
) are lowest for small positive values of
c. In contrast, for the modified semi-implicit schemes (
), both errors are smallest for small negative values of
c. On the other hand, for each fixed
c value, the obtained results show that the explicit Euler scheme produces the largest relative errors, but weighted convex linear combinations of backward and forward Euler schemes can provide higher accuracy while, at the same time, maintaining stability.
Step 4 (Model validation). To validate the SEIRS-VBH model, we minimize both relative errors computed in Step 3 on the grid (
):
and
This result shows that both relative errors achieved their minimum on the grid (
) at one and the same node, i.e.,
Hence, among all considered discrete inverse problems (
) the solution expressed as
,
of
, where
with
, provides the best approximation of the reported data.
The documentation of the relative errors (
and
) in
Table 2 shows that relative errors provided by the modified difference schemes with
are about four times lower than the relative error provided by the standard difference scheme (
).
By minimizing the investigated relative errors on the grid (), we identify the best of the considered approximations to the officially reported data. In this way, the SEIRS-VBH model is validated.
The curves of reported (
) and computed (
) active cases are compared in
Figure 4. A comparison of remaining reported and computed data is shown in
Appendix A.1 as follows:
in
Figure A1,
in
Figure A2,
in
Figure A3, and
in
Figure A4. In all these figures, we observe very good agreement between the reported data and the data computed with the SEIRS-VBH model.
Remark 6. In Figure 4, the reported and model plots appear to coincide mainly because of the length of the time period and large changes in values. However, in reality, this is not the case, as clearly shown in Figure 5, where the values are visualized only for the period of 6–13 February 2022, which, in the real situation, corresponds to the highest peak of active cases on 10 February 2022. The situation is similar with the plots in the figures presented in Appendix A.1, where, at first glance, the plots appear identical. They are consistent with the low values of the relative errors computed in our analysis. One can easily verify this by executing the corresponding scripts from the GitHub repository (see Data Availability Statement) and inspecting the plots in greater detail by zooming in. The obtained parameter values of
are presented in
Appendix A.2. The behavior of the parameters over time, as well as the 7-day moving averages, are shown as follows:
in
Figure A5,
in
Figure A6,
in
Figure A7,
in
Figure A8,
in
Figure A9, and
in
Figure A10.
The solution (
) of the SEIRS-VBH model (direct problem (
1), (
2)) is presented graphically in
Appendix A.3. Graphics of the components are shown as follows: infectious (
) and vaccinated susceptible (
) in
Figure A11, exposed (
) and hospitalized (
) in
Figure A11, disease-acquired (
) and vaccine-acquired immunity (
) in
Figure A13, and susceptible (
) and total population size (
) in
Figure A14.
In order to demonstrate and quantify the advantage of the proposed SEIRS-VBH model and the performance of its validation methodology, we compare our results with a reference simulation based on a standard SEIR model. The inverse problem for the SEIR model was formulated and analyzed in our previous work [
52], where an explicit Euler scheme was employed. Following the latter approach here, we identify the parameters of the SEIR model and subsequently solve the corresponding direct problem. For the SEIR model, the comparison between the computed and reported data for active cases and recovered individuals yields the following errors: an
error of 0.3544, and an
error of 0.3612. These errors are more than ten times larger than the relative errors (
and
) for the SEIRS-VBH model obtained in the present paper. This reduction in error highlights the clear advantage of the SEIRS-VBH model, which provides a substantially better fit to the reported data.
7. SEIRSVBH Simulator: Description and Usage
The SEIRSVBH Simulator is a MATLAB-based tool developed by the authors of this paper, referred to hereinafter as “the simulator”. The simulator consists of two distinct components:
A script-based computational module intended for large-scale data processing, model calibration, and reproduction of the results presented in this research;
A graphical user interface (GUI), which enables intuitive interaction, parameter manipulation, and visual exploration of simulation results (see
Figure 6).
All results presented in this paper were obtained exclusively through the script-based component of the simulator.
The simulator operates using officially reported data on the COVID-19 pandemic, caused by the SARS-CoV-2 virus, for Bulgaria for the period of 7 June 2020–12 March 2023, enabling data-driven analysis of the pandemic’s evolution across this full time frame.
The simulator further integrates MATLAB’s parallel processing framework as a core computational feature, allowing for concurrent evaluation of multiple parameter combinations. This design substantially reduces computation time and enhances the scalability and performance of error quantification and parameter inference procedures, enabling efficient large-scale simulations.
The SEIRSVBH Simulator adds scientific value by integrating all stages of the computational workflow—error analysis, parameter evaluation, and model validation—within a consistent graphical environment. It serves to facilitate transparent and reproducible analysis of the COVID-19 disease dynamics using officially reported COVID-19 data for Bulgaria, providing researchers with an accessible platform for conducting, visualizing, and validating epidemiological simulations.
8. Discussion
Here, we note that the SEIRS-VBH model does not explicitly incorporate the values of the transmission or recovery rates of non-hospitalized and hospitalized individuals, as these parameters depend on factors such as social distancing measures, treatment methods, and the intrinsic characteristics of the virus. The precise functional dependence among these factors remains unknown. To overcome this limitation, we address this issue by formulating and solving an appropriate inverse problem to estimate the daily parameter values (see
Appendix A.2) that mathematically represent the complex behavior of the official reported data. This is why the parameter values are sensitive to changes in the data. For example, the rapid decline in the reported number of active cases (see
Figure 4), which results from the sharp increase in the reported number of recovered individuals (
Figure A1) during June 2022 leads to an elevated recovery rate among non-hospitalized individuals (
Figure A7). However, this does not significantly affect the recovery rate of hospitalized individuals (
Figure A9), as the number of hospitalized patients at that time (
Figure A12) was much lower than the number of individuals reported to have recovered. These events are followed by increases in the transmission rate (
Figure A6) and hospitalization rate (
Figure A8) caused by the emergence of the omicron variant in Bulgaria. This example of parameter behavior demonstrates that the model works effectively with real-world data, even in the presence of abrupt changes.
It is also worth noting that, with the exception of hospitalized individuals, there are no officially reported data for the components of the SEIRS-VBH model solution. The proposed methodology for solving an extended inverse problem allows the size of important groups to be calculated (see
Appendix A.3) as exposed, infectious, and susceptible (vaccinated and non-vaccinated) individuals, as well as people with vaccine-acquired immunity and disease-acquired immunity.
All these results can be very useful in the accurate calculation of key epidemiological characteristics, such as basic and effective reproduction values and the herd immunity threshold.