Next Article in Journal
Oil Market Volatility Forecasting Under Uncertainty Theory: A Joint Modeling Framework via Uncertain Vector Autoregression
Previous Article in Journal
New Families of Asymmetric Quantum MDS Codes via Affine and Projective Partitions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bayesian Analysis of Tuberculosis Spread Scenarios in Regions of Russian Federation

1
Research Center for Information Technologies and Artificial Intelligence, Sirius University of Science and Technology, 354340 Sirius, Russia
2
Sobolev Institute, Mathematics of SB RAS, 630090 Novosibirsk, Russia
3
Novosibirsk TB Research Institute, the Ministry of Health RF, 630040 Novosibirsk, Russia
4
Tula Regional Clinical Center for the Prevention and Control of AIDS and Infectious Diseases, 300002 Tula, Russia
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2026, 14(10), 1600; https://doi.org/10.3390/math14101600
Submission received: 25 March 2026 / Revised: 29 April 2026 / Accepted: 6 May 2026 / Published: 8 May 2026
(This article belongs to the Special Issue Recent Advances in Mathematical Epidemiology and Applications)

Abstract

Understanding the heterogeneous spread of tuberculosis (TB), particularly multidrug-resistant (MDR) forms and the role of subclinical infection, is critical for achieving the WHO End TB strategy. This study develops a novel compartmental model that explicitly incorporates incipient and subclinical TB together with MDR forms, and links them to case detection and treatment pathways. The key innovation lies in integrating a sensitivity-based identifiability analysis with a Bayesian MCMC framework to quantify parameter uncertainty and correlations directly from regional surveillance data. Applied to five high-burden regions of the Russian Federation (2009–2020), the approach reveals strong heterogeneity in epidemic drivers: wide credible intervals for contagiousness, the rate of progression to bacterio-positive (BE+) states, and detection rates. The probabilistic forecasts up to 2025 are validated against 2021–2023 data. The region-specific differences in these correlated parameters dictate transmission dynamics, and improving detection of BE+ cases is the most effective lever for control.

Graphical Abstract

1. Mathematical Modeling of the Tuberculosis Spread

Despite some success in the fight against tuberculosis (TB) in the Russian Federation and globally, the disease remains one of the main challenges to national healthcare systems and is the leading global cause of death from infectious diseases, one of the 10 main causes of death and the predominant cause of death in people infected with HIV. According to WHO, in 2025, 10.6 million people fell ill with TB for the first time, 1.3 million people died, and the number of MDR cases increased by 3% [1,2]. For successful control over the spread of socially significant and epidemically dangerous infectious diseases such as TB, predictive mathematical modeling of the epidemic process is necessary, capable of providing a reliable forecast of the probability of developing an unfavorable and favorable scenario of the epidemic situation in specific territories and early planning/implementation of necessary interventions [3].
The difficulty of developing a model of slow infections dynamics (HIV/AIDS, TB) comes when considering the heterogeneity of the population with various climatic, environmental, epidemiological, medical, biological, and socio-economic characteristics, including migration, demographic characteristics of regions, availability of medical services, the presence of high-risk groups (HIV-infected people, people with chronic diseases, addiction to psychoactive substances, etc.), and a high proportion of drug-resistant strains.
The models used for the dynamics of infectious diseases in populations have varying degrees of complexity, but three main approaches and their combinations can be distinguished [4]. The first approach is to divide the population into non-overlapping, homogeneously mixed groups with respect to infection and to introduce rules for transition from one group to another [5]. Typically, models of this type are systems of integral, integro-differential, and difference equations (SIR models), interconnected by the mass action law. In the review by Avilov and Romanyukha [5], examples of SIR models and their features in describing the dynamics of TB in a population are given. Such models are characterized by their parameters (coefficients and initial conditions), and describe an outbreak of an infectious disease if coefficients are constant. Accounting for medical and socio-economic interventions is possible in a generalized manner, while detailing the features of related processes remains difficult.
In work by Melnichenko and Romanyukha [6] it is shown that the parameter of TB infectiousness is sensitive to morbidity; therefore, demographic and socio-economic indicators were introduced as a step function into the change of the infectiousness parameter. This approach is demonstrated in predicting morbidity in the regions of the Russian Federation and confirms that the change in socio-economic conditions in the regions is one of the reasons for the decrease in TB spread. Furthermore, this effect is combined with the results of the work of the anti-TB services. In work by Kabanikhin et al. [7] for the model of TB and HIV co-infection, fundamental scenarios of disease spread were constructed with regulation of the parameter of the effectiveness of treatment in the regions of the Russian Federation. However, including socio-economic processes in an explicit form requires modification of the model (introduction of additional equations) and data. The second approach is agent-based modeling (e.g., see work by Vlad et al. [8]). Within this framework, each individual of the population is described by a set of parameters (age, immune status, time elapsed since infection, etc.), the rules of interaction of individuals, and the influence of the environment. This approach allows us to obtain a more detailed picture of the epidemic process: age stratification of morbidity, the influence of social status and the environment on the incidence and prevalence of infection, changes in immunity as a result of the spread of infection, etc. To specify models of this type, both differential equations and algorithmic descriptions are used. Such models are computationally expensive and require a detailed description specific to the region under consideration; however, the results of modeling and forecasting are not only advisory, but also practical for medical organizations (e.g., see work by Romanyukha et al. [9]).
The third approach to describing and forecasting infectious disease spread relies on statistical data and deep learning models [10,11]. However, their implementation requires complete data of the process under study, which is difficult to obtain for slowly progressing infections (for example, the effectiveness of COVID-19 modeling and forecasting is studied in the works by Krivorotko et al. [12], Krivorotko and Zyatkov [13]).
To build scenarios for the spread of TB in the regions of the Russian Federation, it is necessary to take into account the heterogeneity of its distribution [14], as well as the large percentage of multidrug-resistant forms of TB (MDR-TB) in the Siberian regions of the Russian Federation among newly diagnosed patients (see Figure 1). For example, more than 40% of newly diagnosed patients were MDR-TB-infected in the Novosibirsk region in 2024, while the average figure for the Siberian Federal District fluctuates around 30%.
The inclusion of the incipient and subclinical TB (ISbTB) compartment is crucial for modeling TB elimination, as it represents a major obstacle to ending TB alongside latent forms of TB [15]. This form is characterized by a low, oscillating bacterial load, qualifying as paucibacillary disease [16], yet remains sufficient for transmission, especially in prolonged close contact [17]. Such individuals are significant concealed community sources of infection [18], eluding diagnosis due to low bacterial burdens that cause false-negative results [19] and present a “hard to determine” category [20].
ISbTB escalates the total epidemic contagiousness; its basic reproduction number, even if low, contributes to the continuity of the overall TB epidemic process. Furthermore, these forms can lead to drug resistance [21] and relapse post-treatment [22]. Despite its proven epidemiological significance, many models omit this parameter. Our model aims to address this gap.
In recent years, the modeling of epidemics has also been developed with a stochastic approach, especially with stochastic control strategies [23,24].
The interest in mathematical modeling of TB was strongly developed after the issue of the WHO concept of ending TB by 2030 [25]. WHO TB researchers published a mathematical model for the prediction of TB incidence decline with fulfillment of the Sustainable Development Goal (SDG) subtargets [26]. In the study by Xu et al. [27] the susceptible-exposed-infectious-recovered (SEIR) model with different age groups was developed. It allowed assessment of the effect of age as a factor on TB transmission and to more accurately determine parameters for ending TB. Separate studies have considered the effects of drug-resistant cases [28], time lag [29], and age structure.
Questions of the hidden reservoirs of TB infection and the burden of overall drug resistance inside them were not significantly involved in mathematical modeling of TB infection in the current literature.
It has recently become clear that traditional notions of dividing TB infection into latent and active TB are a simplification that fails to reflect the biological process of transition from infection to disease. To more realistically characterize this process, the concept of two main intermediate groups of patients, existing between infection and active TB, has been introduced (ISbTB). Therefore, it is now believed that the immediate precursors of the development of active TB are not so much the infection state itself, but rather those stages of infection progression that directly precede active TB, i.e., ISbTB [30,31,32,33]. Recently, the first evidence has emerged of the important role of ISbTB in the spread of infection; subclinical TB has been shown to account for approximately 70% of global TB transmission [34,35]. However, the actual contribution of these conditions to the TB epidemic remains unclear, as diagnosing them in patients is difficult. It is clear that real strategies to combat TB must include the diagnosis and treatment of ISbTB to lead to effective progress towards TB elimination. Molecular biological methods currently being developed for detecting ISbTB will soon be introduced into healthcare practice. These methods will enable a more accurate understanding of the true contribution to the spread of TB and the prognosis of the epidemic [36,37,38,39,40,41]. Since this contribution depends on numerous dynamic variables influencing the spread of the epidemic (human population characteristics, drug resistance of circulating strains, access to medical care, etc.), it is appropriate to describe scenarios for the impact of iISbTB on the epidemic process using mathematical modeling. In our paper we construct the model, that, for the first time, examines the impact of ISbTB in TB epidemiology.
This mathematical model is based on the comprehensive SEIS-type compartmental model for TB dynamics in the Russian Federation, incorporating MDR forms, detection processes, and treatment pathways. In the methods we combine differential equation modeling based on M.I. Perelman and G.I. Marchuk’s ideas [42] as well as Y. Yi et al.’s model [43], Sobol sensitivity analysis, Bayesian optimization, and MCMC-based forecasting. We apply the framework to several high-burden regions that are of epidemiological interest in the Russian Federation as the obstacle of ending TB in Russia. The key points of the current publication that allow achieving additional results, as compared with the other authors, are as follows:
  • Parameter identification and optimization with the use of the concept and paradigm of inverse and ill-posed problems allowed us to evaluate the intensity of the epidemic process in the risk regions of Russia, that we need to identify with the aim to combat TB as the regional and national high stream.
  • Sensitivity-based identifiability analysis allows us to find correlated TB parameters that are involved in the accuracy of parameter identifiability and forecast uncertainty.
  • We construct the posterior distribution of sensitive epidemiological parameters of the TB compartmental mathematical model (such as contagiousness of TB contact with bacterioexcretion, the rate of TB activation, the rate of undetected TB contact with bacterioexcretion per year) that allows us to evaluate the expected TB-infected people in Russian Federation regions for three years ahead.
If we get robust estimations of epidemic force parameters, we have enough scientific potential to plan counterforce actions including efficient treatment of drug resistance and drug-sensitive forms of TB in the amounts enough to concur transmission. The results of the expected heterogeneity of regions and the assessment of key epidemiological parameters are consistent with the results of the works of A.A. Romanyukha [9] and O.A. Melnichenko [6], and also expand and clarify the uncertainty due to the inclusion of incipient and subclinical TB forms, i.e., the key ideas of this paper are
  • Inclusion of detection stages based on papers by Romanyukha [6], Romanyukha et al. [9], and new compartments of MDR-TB based on the paper by Yu et al. [43];
  • Inclusion of ISbTB as a compartment based on the paper by Avilov et al. [44];
  • Usage of Sobol sensitivity analysis and Bayesian MCMC for uncertainty quantification;
  • Region-specific posterior parameter estimation.
The main goal of this research is identify the key epidemic parameters of TB propagation that influence TB heterogeneously in the Russian Federation as justification for real strategies to combat TB that must include the diagnosis and treatment of ISbTB in order to lead to effective progress towards TB elimination.
The paper is organized as follows: In Section 2 we formulate the differential SIR-type model, give analysis of the stability of stationary points of the model, formulate the problem of parameter identification, and provide sensitivity analysis. In Section 3 the statistical data used on TB incidence, socio-economic characteristics of the regions of the Russian Federation, and their processing for use in modeling are presented. In Section 4 numerical estimates of the probability distribution of epidemiological parameters based on statistical data and SIR model data, as well as scenarios for the spread of TB in the regions of the Russian Federation, are presented.

2. SEIS Model of TB Dynamics with Multiple Drug-Resistant Forms

The mathematical model of TB spread and control used in this paper is based on the approach proposed by G.I. Marchuk and A.A. Romanyukha in 2004 [42], the paper by A.A. Romanyukha [44] concerning TB detection, and papers describing the interaction with MDR-TB [45,46]. Using the law of mass action, we formulate the SEIS model [43], characterized by a system of nine ordinary differential equations (the model diagram is shown in Figure 2):
d S d t = β S ( I s + I γ ) / N + γ 1 ( E γ T + I γ T ) + γ 2 ( E s T + I s T ) + b N m N 2 μ S ; d E s d t = β S I s / N ω E s φ E s β E s I γ / N μ E s ; d E s T d t = γ 2 E s T + γ 2 I s T + φ E s E s T ( β I γ / N + t γ ) μ E s T ; d E γ d t = β S I γ / N ω E γ φ E γ + β E s I γ / N μ E γ ; d E γ T d t = γ 1 I γ T γ 1 E γ T + φ E γ + E s T ( β I γ / N + t γ ) μ E γ T ; d I s d t = ω E s φ + I s β I s I γ / N ( α + μ ) I s ; d I s T d t = 2 γ 2 I s T + φ + I s I s T ( β I γ / N + t γ ) ( α + μ ) I s T ; d I γ d t = ω E γ φ + I γ + β I s I γ / N ( α + μ ) I γ ; d I γ T d t = 2 γ 1 I γ T + φ + I γ + I s T ( β I γ / N + t γ ) ( α + μ ) I γ T
with initial conditions
S ( 0 ) = S 0 , E s ( 0 ) = E 0 s , E s T ( 0 ) = E 0 T s , E γ ( 0 ) = E 0 γ , E γ T ( 0 ) = E 0 T γ , I s ( 0 ) = I 0 s , I s T ( 0 ) = I 0 T s , I γ ( 0 ) = I 0 γ , I γ T ( 0 ) = I 0 T γ .
The right-hand side functions are analytical in ( t , S , E s , E s T , E γ , E γ T , I s , I s T , I γ , I γ T ) [ 0 , T ] × [ 0 , k ] 9 . The Cauchy problem (1)–(2) has a unique solution according to the Cauchy–Kovalevskaya theorem [47].
Here N = S ( t ) + E s ( t ) + E s T ( t ) + E γ ( t ) + E γ T ( t ) + I s ( t ) + I s T ( t ) + I γ ( t ) + I γ T ( t ) is a volume of the studied population that satisfies the logistic equation:
d N d t = ( b μ ) 1 N k N α ( I s + I γ + I s T + I γ T ) .
Here b μ > 0 , b N m N 2 refers to the influx of susceptible individuals into the population due to the birth rate or immigration; m = ( b μ ) / k , k is a population potential. We note that if N | t = 0 < k , then d N d t < 0 if N ( t ) ( k δ , k ) for some δ > 0 . Thus, t > 0 : N ( t ) < k .
From the other side, if the compartment reaches 0, its derivative stays non-negative, i.e.:
d S d t | S = 0 = γ 1 ( E γ T + I γ T ) + γ 2 ( E s T + I s T ) + N ( b ( k N ) + μ N ) / k ; d E s d t | E s = 0 = β S I s / N ; d E s T d t | E s T = 0 = γ 2 I s T + φ E s ; d E γ d t | E γ = 0 = β S I γ / N + β E s I γ / N ; d E γ T d t | E γ T = 0 = γ 1 I γ T + φ E γ + E s T ( β I γ / N + t γ ) ; d I s d t | I s = 0 = ω E s ; d I s T d t | I s T = 0 = φ + I s ; d I γ d t | I γ = 0 = ω E γ + β I s I γ / N ; d I γ T d t | I γ T = 0 = φ + I γ + I s T ( β I γ / N + t γ )
Thus, if the compartment is non-negative at the initial time, it stays non-negative. In combination with the fact that the sum of all compartments N < k , then all the compartments stay less than k.
The population is divided into nine non-overlapping groups according to Figure 2 (the subscript T denotes the group of detected infected individuals). We note that we do not depict mortality rates in the diagram in Figure 2, so as not to overload it. The description of the variables and parameters of the model is given in Table 1.
This model is based on the following principles for the Russian Federation:
  • TB BE− and TB BE+ forms are distinguished, and TB BE− is considered an early stage of the disease preceding the development of severe TB BE+ stages;
  • We assume treated individuals return to full susceptibility S, consistent with evidence of high reinfection rates in high-incidence settings [49,50];
  • Patients are divided into detected and undetected;
  • the proportion of those cured from TB with bacterial excretion can transit to a form without bacterial excretion;
  • the probability of transition from forms without MDR to TB-MDR is nonlinear. Infection with MDR-TB occurs both primarily through contact with an infected and in the case of incorrect treatment (with the T index);
  • During treatment of TB BE+, infected people can transit to both latent forms S and TB BE− with equal probability.
A number of TB patients who successfully complete treatment, when the symptoms have disappeared and bacterial excretion has stably stopped, undergo recurrence. In the Siberian Federal District in 2024 the recurrence rate among those cured averaged 33% (in different regions—from 20 to 43% [49]). This means that cured patients remain susceptible to developing TB infection. TB recurrence results from immunological deficiency (dependent on risk factors such as unfavorable social, medical, and stress factors, etc.) and is caused by either relapse of an original infection or exogenous reinfection with a new or the same strain of M. tuberculosis. The source of relapse is bacteria that remain in a persistent, viable state in cured (BE−) persons for many months and years within post-TB residual lung foci, lymph nodes, or even within hematopoietic or mesenchymal stem cells [51,52,53,54]. The source of reinfection is bacteria acquired through continued contacts of cured patients with people with subclinical or active TB [35,50]. In addition, indirect evidence that cured patients remain in a TB-sensitive risk group is that Interferon-Gamma Release Assay tests after cure remain positive in the absolute majority of cases for decades and even until the end of life [55].
Within the framework of the model, all populations, regardless of status, are subject to an average mortality rate with the parameter μ . A susceptible or latent infected individual from group S moves to the infected non-bacterial stage E s (or closed form of TB) after contact with a TB-infected without MDR I s governing the contagiousness parameter β or to the TB-infected non-bacterial stage with MDR E γ . Infected closed-form TB E s , E γ migrate to the bacterial form with ω probability. Infected patients without MDR can migrate to infected MDR-TB with t γ probability, and also with rate β when in contact with infected TB without bacterioexcretion. In the model, each group of TB-infected people is divided into detected (with index T) and undetected with expression coefficients φ for TB BE−, MDR-TB BE− and φ + for TB BE+, MDR-TB BE+, respectively. In the model, additional mortality of infected people with bacterioexcretion α is observed and they move to the state S with probabilities γ 1 (for form with MDR) and γ 2 .
The model principle construction was based on real infection reservoirs. This is TB with bacterioexcretion and TB without bacterioexcretion—the incipient and subclinical TB, the latter always bearing a real hazard of converting to the bacterioexcretion clinical form. That does not mean that we exclude the significance of latent TB, especially in HIV-infected individuals, but consider this of lower value and as a perspective of further studies with the foci being the ending TB.
The limits of the change in the parameters of the models that are used in the linear parts of equations were determined from statistics (TB-infected registration form No. 33). For example, the proportion of identified non-MDR TB cured in 1 year, γ 2 , in a linear approximation is inversely proportional to the time spent in therapy. That is, the limits γ 2 [ 0.3 ,   2 ] correspond to [ 0.5 ,   3.33 ] years undergoing therapy for TB+ non-MDR patients.
Restrictions on initial conditions in the regions of the Russian Federation are also assumed, namely the percentage of undetected infected people is less than 50% due to the absence of a sharp outbreak of TB:
E 0 s = σ E 0 T s , E 0 γ = σ E 0 T γ , I 0 s = σ + I 0 T s , I 0 γ = σ + I 0 T γ .

2.1. ODE System Analysis

The behavior of the ODE systems is broadly studied from the perspective of asymptotic properties, and this paper is also the case. We note that for such systems it is often possible to find an analytical solution using Lie algebra [56]; however, it is effectively applicable only for smaller systems, or systems with symmetries. Thus, the model considered is too complicated for this approach, so we present only the results of stationary points stability.
For the derivation of stationary points of System (1), in this section we only assume that γ 1 = γ 2 = γ . Let us consider the system obtained from System (1) by adding the equations and replacing:
I = I s + I γ , I T = I s T + I γ T , E = E s + E γ , E T = E s T + E γ T .
Then we obtain the following system:
d S d t = b N m N 2 β S I / N + γ I T μ S ; d E d t = β S I / N ( μ + ω + φ ) E ; d E T d t = φ E + γ I T μ E T ; d I d t = ω E ( α + μ + φ + ) I ; d I T d t = φ + I ( 2 γ + α + μ ) I T .
Lemma 1.
System (4) has the only stationary state that is trivial.
Proof. 
By equating the right-hand sides to 0, we obtain a system for finding stationary solutions:
β S I / N + μ S = b N m N 2 + γ I T ; ( μ + ω + φ ) E = β S I / N ; μ E T + γ I T = φ E ; ( α + μ + φ + ) I = ω E ; ( 2 γ + α + μ ) I T = φ + I .
From the second and fourth equations of System (5), we obtain:
S = ( μ + ω + φ ) ( α + μ + φ + ) β ω N .
Therefore, we are left with Equations (1), (3), and (5) of System (5) with a single nonlinear term N 2 .
Note that if N ( 0 ) < k , then t : N ( t ) < k . In the edge cases, when N k or N k , we obtain a linear system for the stationary point; thus, it is unique (except for the trivial point). As soon as the solution is confined, it may tend towards one of these solutions.
We rewrite System (5) the following way:
( μ + ω + φ ) E + μ S = b N m N 2 + γ I T ; E T = I ( φ ( α + μ + φ + ) / ω   γ φ + / ( 2 γ + α + μ ) ) / μ = I c E t ; E = I ( α + μ + φ + ) / ω = I c E ; I T = I φ + / ( 2 γ + α + μ ) = I c I t .
As soon as for stationary point d N / d t : b N m N 2 = α ( I + I T ) = 0, and for the first equation, we get:
( μ + ω + φ ) I c E + μ S = α I ( 1 + c I t ) + γ I c I t .
Thus,
I = μ S ( μ + ω + φ ) c E α ( 1 + c I t ) γ c I t = c I S .
Then, as soon as d N / d t = 0 , we get:
( b μ ) 1 N k N α ( I + I T ) = 0
Thus,
N = k 1 α c S c I ( 1 + c I t ) b μ .
That means that there is a unique stationary point, except for the trivial and disease-free point. However, by the substitution of coefficients c S , c I , c E , c I t into (6), we obtain N = 0 and this point coincides with the trivial point (checked with Wolfram Mathematica, the code is available in GiHub repository). □
Our model operates in absolute values and reflects the logistic curve of the population growth. Most human populations in the regions of the Russian Federation show slow population dynamics and in these dynamics the nontrivial stationary state of TB prevalence can exist. However, we can apply the model to the risk groups where fast growth and diminution can occur.

2.1.1. Stationary Points’ Stability

The matrix of the linearized System (5):
b 2 m N β I ( N S ) N 2 μ b 2 m N b 2 m N b 2 m N β S ( N I ) N 2 b 2 m N + γ β I ( N S ) N 2 μ ω φ 0 β S ( N I ) N 2 0 0 φ μ 0 γ 0 ω 0 α μ φ + 0 0 0 0 φ + 2 γ α μ
There are two stationary points N = 0 and S = N = k .
Then from the form of (7) with I = 0 , point S = N = k is stable, when
μ < b a n d ( μ + ω + φ ) ( α + μ + φ + ) < β ω
and point N = 0 is stable when b μ < 0 , or μ > b .
In our examples we will consider cases b < μ and b μ , which depict the situation in the Russian Federation [57]. In that case the point N = 0 is globally stable for admissible area 0 < N < k , as soon as t : d N d t < 0 if N < k , and d N d t 0 only when N 0 or k. The stability analysis of stationary points is not informative, since the difference μ b is quite small. Thus the convergence to the point N = 0 is slow, so we are interested in short-term dynamics (compared to the convergence to a stationary point).
The 10-year window may seem too large for short-term forecasting; however, that is not the case for TB and the current difference between birth and mortality rates. TB has an infectivity rate of around 2–4 and recovery rate around 0.1 (for annual time scale). It indicates that the TB infection characteristic duration time is around 10 years. The difference between the birth and mortality rates is too small (0.003), which implies that the solution will reach the point with N = 0 in more than 4000 years if the rates stay constant.

2.1.2. Basic Reproduction Number R 0

To derive the basic reproduction number R 0 , we consider two states of people in System (1): infected without MDR x 1 = ( E s , E s T , I s , I s T ) T and with MDR-TB x 2 = ( E γ , E γ T , I γ , I γ T ) T at the disease free equilibrium (DFE) point E 0 = ( k , 0 , 0 , 0 , 0 , 0 , 0 , 0 , 0 ) , i.e., S 0 = N = k . Following the approach from Article [58], we derive the basic reproduction number for non-MDR TB R 1 and MDR-TB R 2 .
We rewrite System (1) as follows:
x i ˙ = ( f i v i ) x i , i = 1 , 2 ,
where f i characterize infection in the group due to external flows, v i – recovery, mortality in the group. Then we have:
f 1 = β S I s / N 0 0 0 , v 1 = β E s I γ N + ω E s + φ E s + μ E s β E s T I γ N + γ 2 E s T γ 2 I s T φ E s + t γ E s T + μ E s T β I s I γ N ω E s + φ + I s + ( α + μ ) I s β I s T I γ N + 2 γ 2 I s T φ + I s + t γ I s T + ( α + μ ) I s T .
f 2 = β S I γ N + β E s I γ N β E s T I γ N β I s I γ N β I s T I γ N , v 2 = ω E γ + φ E γ + μ E γ γ 1 E γ T γ 1 I γ T φ E γ t γ E s T + μ E γ T ω E γ + φ + I γ + ( α + μ ) I γ 2 γ 1 I γ T φ + I γ t γ I s T + ( α + μ ) I γ T .
For the linearized system x i ˙ = ( F i V i ) x i , i = 1 , 2 , we calculate the Jacobian matrices
F i = f k x i j ( E 0 ) , V i = v k x i j ( E 0 ) , j , k = 1 , , 4 ,
after which we find the eigenvalues of the matrices product: λ ( F i V i 1 ) , i = 1 , 2 . The matrices F i and V i are presented in Appendix A.1. Thus, we get
R 1 = R 2 = β ω ( ω + φ + μ ) ( φ + + α + μ ) .
As soon as R 0 = max { R 1 , R 2 } , the basic reproduction number in the model (1) has the form
R 0 = β ω ( ω + φ + μ ) ( φ + + α + μ ) .
The basic reproduction number R 0 is based on infectious rates (contagiousness of TB, rate of undetected TB, rate of TB BE− turning into BE+). The difference between the MDR and non-MDR strains in the model (1) is the rate of treatment, which is not reflected in the basic reproduction number (since it describes the rate of infection in a non-immune population). The identical R 0 suggests that, in the absence of treatment, MDR and non-MDR spread similarly. This highlights the role of treatment in controlling MDR.
Lemma 2.
The stationary point without infection E 0 is locally asymptotically stable when basic reproduction number (9) R 0 < 1 .
The R 0 < 1 in Lemma 2 means that β ω < ( ω + φ + μ ) ( φ + + α + μ ) . It coincides with the part of (8) that represents stability with respect to infection outbreak.
The exclusion of latent forms of TB increases the basic reproduction number R 0 ; namely, due to undetected infectiousness and the transition to an active form (it is possible that such a transition may not occur during the simulation), the R 0 will decrease (see details in Appendix A.1).

2.2. Inverse Problem

We assume that additional information about the solution in fixed points in time is given:
E s T ( t j ) + E γ T ( t j ) + ε j = E j , I s T ( t j ) + ε j = I s j , I γ T ( t j ) + ε j = I γ j , j = 1 , , M .
Then the inverse problem (1), (2), (10) is to identify the parameters of the SEIS model q = ( β , γ 1 , γ 2 , ω , t γ , α , φ , φ + , σ , σ + ) R + 10 with the help of additional information y j d a t a = ( E j , I s j , I γ j ) (10).
We assume that observations y t d a t a = ( E , I s , I γ ) are subject to independent additive Gaussian errors:
y t d a t a = f t ( q ) + ε t , ε t N ( 0 , Σ ) .
Here f t ( q ) = ( E s T ( t ; q ) + E γ T ( t ; q ) , I s T ( t ; q ) , I γ T ( t ; q ) ) , Σ = diag ( σ 1 2 , σ 2 2 , σ 3 2 ) , and σ i are estimated jointly with q.
The inverse problem (1), (2), (10) could be reduced to the problem of the minimization of the misfit function:
J ( q , σ ) = n = 1 3 j = 1 M | y n ( t j ; q ) y n , j d a t a | 2 σ n 2 .
The model (1) is theoretically identifiable based on algebra analysis [59].
Due to the instability and non-uniqueness of the solution to the inverse problem, the following section presents a sensitivity analysis of the model, which will allow us to estimate the degree of sensitivity of the parameters q to the measured states of the system (10) [60].

2.3. Sensitivity-Based Identifiability Analysis for SEIS Model

The sensitivity analysis of the model (1) was carried out using the Sobol method [61], with a uniform distribution of parameters, the variation limits of which are given in Table 1 (the literature and statistical evaluated parameters). This method quantifies how the variance in model outputs (predictions of TB diagnoses) can be apportioned to individual parameters and their interactions across the entire a priori feasible parameter space. The initial conditions (2) approximate the epidemiological situation of the Novosibirsk region in 2009:
N = 2 263 383 , E s = 300 , E γ = 500 , E s T = 1 100 , E γ T = 1 549 , I s = 100 , I γ = 400 , I s T = 438 , I γ T = 1 330 .
Thus, S = N E s E γ E s T E γ T I s I γ I s T I γ T = 2 257 666 people. The modeling time is 10 years.
For model outputs that we consider in the inverse problems y 1 = E s T ( t ; q ) + E γ T ( t ; q ) , y 2 = I s T ( t ; q ) , y 3 = I γ T ( t ; q ) (10), the Sobol decomposition expresses the total variance V ( y n ) , n = 1 , 2 , 3 :
V ( n ) = V ( y n ) = i V i ( n ) + i < j V i j ( n ) + + V 1 m ( n ) , n = 1 , 2 , 3 .
Here V i ( n ) = V [ E ( y n | q i ) ] , i = 1 , , 10 , is the first-order (main) effect variance, measuring the contribution of parameter q i alone, V i j ( n ) is the second-order interaction variance between q i and q j , and so on for higher-order interactions.
We compute the total-order Sobol index
S T i ( n ) = V ( n ) V [ E ( y n | q i ) ] V ( n ) , n = 1 , 2 , 3 ,
and the first-order and second-order Sobol indexes
S 1 i ( n ) = V i ( n ) V ( n ) , S i j ( n ) = V i j ( n ) V ( n ) , n = 1 , 2 , 3 ,
where q i denotes all parameters except q i , 1 = 1 , , 10 . S T i ( n ) measures the total contribution of q i , including all its interactions with other parameters. A parameter with very low total-order indices S T i ( n ) (e.g., < 0.01 ) has a negligible influence on the measurements y n , n = 1 , 2 , 3 . Its value cannot be inferred from the data. If S T i ( n ) > 0.5 for some q i to data y n , then the total contribution of q i , including all its interactions with other parameters, is significant. However, if S T i ( n ) > 0.5 and S 1 i ( n ) < 0.5 , then parameter q i may still be identifiable, but its estimate will be strongly correlated with others.
We use Sensitivity Analysis Library in Python 3.14 (SALib package) and set N = 1024 · ( 2 · 8 + 2 ) = 18,432 Sobol samples in the range of parameter changes relative to their initial value (Table 1) by 20%. In Figure 3 (right) the total-order Sobol coefficients are given for the model coefficients q ˜ = ( β , γ 1 , γ 2 , ω , t γ , α , φ , φ + ) R + 8 (without initial conditions), including the first indices, the second (correlation), etc., with 95% confidence intervals. The most sensitive to measurements of detected infected is the TB contagiousness parameter β . The TB detection parameter BE+ φ + is sensitive to infected TB BE+ without MDR I s T ( t j ) . The proportion of infected TB converting to the BE+ form ω is sensitive to infected TB-MDR BE+ with and without MDR. The additional mortality rate α due to TB is sensitive to MDR-TB BE+ measurements.
Note that the first Sobol index for β coefficient S 1 ( 1 ) for TB BE− measurement is more than 0.5 (this parameter is sensitive to the y 1 ), while others are less than 0.5, which means the parameter’s influence is primarily through interactions (see Figure 3 left). It may still be identifiable, but its estimate will be strongly correlated with others. The second Sobol indexes S β ω ( 2 ) , S β ω ( 3 ) , S β φ + ( 2 ) are more than 0.2 and S β j ( 2 ) close to 0.1–0.2 for all parameters j (see correlation matrices in Appendix A.2).
As we note that Sobol indices show strong parameter correlations, the formal practical identifiability analysis is performed, i.e., we construct the correlation matrix (see Table 2 for Novosibirsk region) based on the Fisher information matrix
ρ i j = F I M i j 1 F I M i i 1 F I M j j 1 ,
where
F I M = f t q T Σ 1 f t q
and f t q is a sensitivity matrix.
Table 2 shows that the parameters with | ρ i j | > 0.95 are difficult to distinguish. For example, ρ β φ + > 0.98 indicates that the data cannot independently inform the parameters β and φ + , only their product is well-identified. This is shown during the sensitivity Sobol analysis as well.

3. Data Analysis

The birth b and mortality μ rates are presented in Table 3 and Table 4 for five regions of the Russian Federation.
To identify parameters of the model for regions of the Russian Federation, we used the following data:
  • Let p γ = I γ T / ( I γ T + I s T ) ; then from [49], we have data of the fraction and absolute number of patients with MDR among the patients with bacillary forms of TB, which represent p γ and I γ T respectively.
  • From data of f.33 (Information about patients with TB from the Order of Rosstat dated 31.12.2010 № 483), we have cohorts that represent I s T + I γ T + E s T + E γ T = C E I . Then we may obtain the following:
    • I γ T is given in data;
    • I s T = I γ T ( 1 / p γ 1 ) ;
    • E γ T + E s T = C E T I γ T I s T .
For demonstration of the numerical results we choose five regions of the Russian Federation with an unfavorable situation in terms of the prevalence of TB: Novosibirsk and Irkutsk regions, Tyva Republic, and Altai and Zabaikalsky districts.

4. Model Scenarios of TB Spread

As a result of solving the inverse problem (1), (2), (10), the vector of SEIS model parameters q is determined for each region, the substitution of which into the Models (1)–(2) leads to a comparison of the model and real data on identified infected people.

Bayesian Approach

To obtain the distribution of the model parameters and to construct a forecast considering the uncertainty, we use the Bayesian MCMC (Markov Chain Monte Carlo) approach, in which the posterior distribution for the parameters q R + 10 has the form:
p ( q y ) p ( y q ) · i = 1 10 p ( q i ) .
Here p ( q i ) is the prior distributions of the SEIS model parameters; p ( y q ) is the likelihood of the data.
The likelihood functional is defined by the logarithmic density function of the normal distribution with the mean being the difference of the real data and solution of the direct problem for the given current parameters (11). The variance of likelihood functional is determined during the MCMC algorithm (see Appendix A.3).
The results of the inverse problem solution are in Table 5. Parameters with a total Sobol index less than 0.01 for all outputs were considered practically non-identifiable and assigned informative Normal priors centered at Optuna values with width 10% of the prior range (see results for mean values in Appendix A.4). The prior distribution p ( q i ) for sensitive parameters β , φ + , ω is proposed as uniform (see Table 1).
We apply the Bayesian approach to find the posteriority distribution of the parameters q with 20,000 draws of four Markov chains (i.e., 80,000 samples for each parameter), mean (column “Mean”), and 95% credible interval (CI). Differences in the parameters of pathogen contagiousness β , the proportion of infected TB converting to the BE+ form ω , and the rate of detection φ + of TB-infected individuals with bacterial excretion indicate heterogeneity in the prevalence of TB in the Russian Federation regions.
The Pareto front (see Figure A3, Appendix A.5) for three measured statistics (10) shows the uniqueness of the given solution of the inverse problem (TB parameter identification); then the CI values depend on the numerical accuracy of the correlated parameters.
Figure 4 demonstrates the mean and CI of model parameters as noted in Table 5. Note that all parameters β , ω , and φ + with a wide CI are sensitive to the measurement with their interaction with each other (see Section 2.3), i.e., the total-order Sobol indexes are greater than 0.5, but the first-order indexes are sufficiently small. Scenarios of the number who are TB BE+ infectious may be unreliable because I s T and I γ T depend on the combination of all these parameters. Small errors in identifying any one of them lead to unrealistic forecasts. Therefore, when analyzing forecast scenarios for TB BE+ groups, it is appropriate to use interval estimates rather than point estimates, as was done in this study. While the model is structurally identifiable, strong parameter correlations (seen in Sobol indices) affect practical identifiability.
In Figure 5, Figure 6 and Figure 7 the probability distribution of TB-infected based on a Bayesian approach is demonstrated for Novosibirsk region and Tyva Republic of the Russian Federation (Figure 5), Altai district and Irkutsk region (Figure 6), as well as Zabaikalsky district (Figure 7). The approximation for Novosibirsk region is more accurate because of small 95% credible intervals that contain the real data (dashed dot line). We choose statistical standards for all five regions: Mean Absolute Percentage Error (MAPE) and Root Mean Square Error (RMSE):
M A P E n = 1 M j = 1 M y n ( t j ) y n , j d a t a y n , j , R M S E n = 1 M j = 1 M ( y n ( t j ) y n , j d a t a ) 2 , n = 1 , 2 , 3 .
MAPE describes the percentage deviation of the simulated data from the actual data and was approximately 25–28% (see Table 6). This value is due to the low base effect: the number of TB patients is small compared to the region’s population. For example, in Novosibirsk region, the number of TB patients is approximately 1000 out of a population of 2.7 million, or 37 out of every 100,000 people. Therefore, we additionally present the RMSE metric, which shows the average deviation of the modeled results from the actual data in absolute terms (number of people). For example, for data I s T RMSE = 5 · 10 5 means that the error is 5 per 100,000 people. The single situation when the mean prediction is worse is the modeling results for I s T , which is compensated by adjustments in other groups. The I s T forecasts are less accurate only for the Zabaikalsky district, and the data is poorly fit for the Tyva Republic. The Zabaikalsky district data has two “falls” in data that cannot be described with the chosen model, as it is able to describe only one peak of infection with a descent without an additional “fall”. The Tyva Republic data has an unusual feature of I s T being significantly lower than I γ T . This may be due to a simplification of the model by ignoring latent TB infections, which may actually account for some of the detected cases. The basic reproduction number R 0 , calculated using Formula (9), for all considered regions is close to 0.9. The most important terms in R 0 are the contagiousness of TB contact with bacterioexcretion β and the rate of TB BE− turning into BE+ per year ω . The larger these parameters, the larger R 0 . The most unfavorable situation in terms of TB incidence among the regions examined is unfolding in the Tyva Republic due to the overestimated value of the ω parameter and its confidence intervals, which is confirmed by social data on population overcrowding in the region.
We also point out that the data points of 2019 are not used in the inverse problem because of a change in the rules of statistical records of TB in Russia [62], which made figures from 2019 unrepresentative. However, that does not affect the epidemiological process, and thus the following years are not affected significantly (see Appendix A.6).
In 2018–early 2019, a surge in TB detection occurred in the Russian Federation due to the emergence of regulatory official orders that significantly expand the coverage of prophylactic medical examination/observation service and the introduction of modern diagnostic and preventive treatment methods [63,64,65]. We suggest that from 2020 onwards, the impact of these official acts was significantly reduced due to the emergence of the COVID-19 pandemic and, as a result, a decrease in TB detection [66].
A short-term forecast for 3 years ahead was constructed for TB-infected with bacterioexcretion BE+. The credible interval and forecast accuracy are higher for TB BE+ without MDR. To improve the accuracy of the MDR-TB BE+ forecast, it is necessary to use a priori information on treatment, immunology, etc. The model cannot capture sudden spikes; forecasts assume stable epidemiological conditions.

5. Discussion

The literature on mathematical modeling of the early stages of untreated pulmonary TB, i.e., its incipient and subclinical forms, is extremely scarce. We found modeling of the early stages of TB in the works of Avilov et al. [44,67], where two models were used, one of which assumes the infectiousness of the initial stages of the disease, and the second considers all cases to be initially non-infectious, but some of them subsequently become infectious. The authors do not give preference to either model, although they consider the first model more plausible. Our model is based on the latest ideas about the very high contagiousness of the early stages of the TB process [35,50], and combines both models, assuming the infectiousness of TB cases in the early stages and the possible gradual development of some cases to an infectious state. At the same time, like Avilov et al. [5], we defined the development of infectiousness as the transition of patients from the BE− to the BE+ state, which allows us to predict the hidden incidence and prevalence of TB within the framework of the model and to better understand the real epidemic situation in different regions.
Coming through recent ideas in TB forecasting based on statistical methods and statistically based machine learning [3], in our work we also relied on long-term statistical data of departmental reporting for the territories of the Siberian and Far Eastern Federal Districts, being the most unfavorable districts in the Russian Federation. However, we used the generalized SEIS-type compartment model with the law of mass action, and machine learning was added not only to the statistical analysis but also integrated directly into the mathematical model.
We consider the closest work to our research to be Study [68]. This prototype model comprises five differential equations, while our model has nine differential equations. The authors used common methodology to derive the formula of R 0 and calculate the sensitivity of R 0 . However, the methodology of comparing the real and the model data is completely different in our works. Our R 0 values (from 0.876 to 0.974) are within the range of estimates available in the literature (from 0.24 to 4.3) [69]. It should be noted that the lack of direct analogs—calculated for individual regions of the Russian Federation—prevents a direct quantitative comparison, highlighting the novelty of our study.
The very recent work by Ochieng [70] presented the sensitivity analysis that has shown that a major positive influence on R 0 is induced by the per-capita TB transmission rate, rate of bacteria spread to the environment by infectious individuals, and the aerosol-transmission coefficient. The negative influence on R 0 is induced by the mycobacterium environmental death rate and the transition rate from the low-risk latent TB to high-risk latent TB. Even though, in terms of clinical epidemiology, the terminology and methodology in our works are quite different, we achieve very similar results.
An analysis of the model sensitivity in our model shows that the TB contagiousness parameter β , the proportion of infected TB converting to the BE+ form ω , and the TB detection parameter BE+ φ + are globally sensitive to measurements of TB BE+ I s T , and the first two parameters β and ω are sensitive to MDR-TB BE+ and TB BE−. The sensitivity is also based on correlations.
Here we fully correspond to [43], where the simulation results proved that the key to better prevent and control the spread and development of TB is to improve the detection rate, and the conversion rate from TB to MDR-TB.
On the Bayesian approach we calculated the credible intervals (CIs) of the posterior distribution of ten model parameters based on additional information of the number of identified patients with TB BE−, TB BE+ without MDR, and MDR-TB BE+ from 2009 to 2021. The less sensitive parameters give small CIs for five considered regions that describe the TB propagation and detection in RF regions as being in common. More sensitive and correlated parameters have huge CIs that prove the heterogeneity of TB propagation in RF regions as well as open ways for its control. The Pareto front (see Figure A3) for three measured statistics shows the uniqueness of the given solution of the inverse problem (TB parameter identification), and then the CI values depend on the numerical accuracy of the correlated parameters.
Based on identified parameters and their CIs, we construct the forecasting scenarios of detected patients with TB BE−, TB BE+ without MDR, and MDR-TB BE+ from 2022 to 2026 years for five regions of the Russian Federation, with the approach being close to the results [71].

6. Conclusions

The aim of our study was to learn the new patterns and laws of TB spreading worldwide based on the example of the Russian Federation, and to evaluate combat strategies. Over the past decade and a half, the TB epidemic in the Russian Federation, including the Siberian and Far Eastern Federal Districts, has been steadily improving, meaning that TB is gradually halting its spread, and thus, the R 0 < 1 . Despite this improvement, given the significant number of people living with HIV infection, who are highly susceptible to TB, we estimate the R 0 value to be 0.9. This improvement is likely due to anti-epidemic measures taken by the TB service, as well as a reduction in socio-economic and other risk factors.
In the compartment model with revealed and unrevealed cases, incipient and subclinical TB (BE−) and clinical TB (BE+), drug-sensitive and drug-resistant TB, we obtained the formula for the basic reproduction number R 0 . The formula comprises the TB contagiousness parameter β , the rate of TB BE− converting into BE+ ω , and the rate of undetected TB BE+ φ + . Using the sensitivity analysis, we discovered the leading pattern, determining the prevalence of TB clinical forms with bacterial excretion, which are the most dangerous for humans from the aspect of lethality and disease transmission. The pattern is based on the same parameters as R 0 determinacy, which are TB contagiousness parameter β , the rate of TB BE− converting into BE+ ω , and the rate of undetected TB BE+ clinical forms φ + . By solving the inverse problem, we proved the variability of these crucial parameters in high-incidence regions of the Russian Federation. This heterogeneity is due to differences in medical and social systems. The TB contagiousness parameter β depends on the intensity of contact tracing. The rate of TB BE− converting into BE+ ω is influenced by healthcare activity—the coverage of the population with X-ray film examination. The detection (reveal) indicator of BE+ clinical forms is based on the same parameter (coverage of the population with X-ray films) and also the application of sensitive methods of TB diagnostics, including molecular methods. Intensification of these activities will help to fulfill the strategy of ending TB. By addressing the prior gaps: the absence of the incipient and subclinical disease compartment, case detection rate, consideration of transitions between compartments, use of the Bayesian approach for the detection of scenarios, we came closer to understanding the TB epidemic and the measures for counteraction.
In the future, to plan detailed parameters of possible interventions and their application in specific regions, additional processes should be included, for example, by introducing quantitative trajectories of socio-economic, medical–social, molecular–epidemiological, and other indicators connected to the territories. The most complex and convenient model for use in practical healthcare will have to include an assessment of the financial and economic efficiency of possible interventions. We plan to consider a fractional-order extension of the mathematical model to capture memory effects in TB progression and solve the uncertainly of forecasting [72,73].

Author Contributions

Conceptualization, O.K. and Y.S.; methodology, O.K. and A.N.; software, O.K., A.N. and N.Z.; validation, N.Z. and A.N.; formal analysis, G.K.; investigation, O.K., Y.S., A.N. and G.K.; resources, O.K.; data curation, G.K. and Z.L.; writing—original draft preparation, A.N. and O.K.; writing—review and editing, O.K.; visualization, A.N.; supervision, Y.S.; project administration, O.K.; funding acquisition, O.K. All authors have read and agreed to the published version of the manuscript.

Funding

The research was performed according to the Government research assignment for the Sobolev Institute of Mathematics SB RAS, project FWNF-2024-0002 “Inverse ill-posed problems and machine learning in biological, socio-economic and ecological processes” (Section 1 and Section 2) and by the Russian Science Foundation, project No. 23-71-10068 (Section 3, Section 4 and Section 5).

Data Availability Statement

The open source Python code is available on the GitHub: https://github.com/KrivorotkoOI/tb-bayes (accessed on 19 March 2026).

Conflicts of Interest

The authors declare that they have no known competing financial interest or personal relationships that could have appeared to influence the work reported in this paper.

Appendix A

Appendix A.1. Basic Reproduction Number

The Jacobian matrices ( i = 1 , 2 )
F i = f k x i j ( E 0 ) , V i = v k x i j ( E 0 ) , j , k = 1 , , 4 ,
for computing basic reproduction number R 0 are as follows
F 1 = F 2 = 0 0 β 0 0 0 0 0 0 0 0 0 0 0 0 0 ,
V 1 = ω + φ + μ 0 0 0 φ γ 2 + t γ + μ 0 0 ω 0 φ + + α + μ 0 0 0 φ + 2 γ 2 + t γ + α + μ .
The basic reproduction numbers R 0 for models with the inclusion of TB latent compartment [5,42,70] contain the rate of transmission from TB latent to subclinical form E (for linear transmission rate σ ) as follows:
R 0 β ω σ ( ω + μ + Treatment ) ( σ + μ + ) .
Mathematically it closes to R 0 without the TB latent form (9) with the factor σ σ + μ + < 1 connected to transmission rate σ from the TB latent form to a subclinical one. The R 0 becomes less because the latent infectiousness can remain in the incubation period for decades, which exceeds the modeling period.

Appendix A.2. Second-Order Sobol Indices

Based on the sensitivity analysis of the TB model (1), we get valuable second-order Sobol indices for sensitive parameters (see Section 2.3 for details). Table A1 and Table A2 demonstrate the second-order Sobol indices S j k ( i ) , i = 2 , 3 , for measured data y i ( y 2 = I s T , y 3 = I γ T ) and all pairs ( j , k ) of parameters q = ( β , γ 1 , γ 2 , ω , t γ , α , φ , φ + ) for some measured time.
Table A1. Second-order Sobol indices S ( 2 ) of TB model to TB diagnosed with BE+ I s T for 2019 year.
Table A1. Second-order Sobol indices S ( 2 ) of TB model to TB diagnosed with BE+ I s T for 2019 year.
β γ 1 γ 2 ω φ φ + t γ α
β 10.150.250.180.200.210.160.18
γ 1 1000000
γ 2 1000.020.020
ω 10.0200.010
φ 10.010.010.02
φ + 10.030.04
t γ 10
α 1
Table A2. Second-order Sobol indices S ( 3 ) of TB model to MDR-TB diagnosed with BE+ I γ T for 2009 year.
Table A2. Second-order Sobol indices S ( 3 ) of TB model to MDR-TB diagnosed with BE+ I γ T for 2009 year.
β γ 1 γ 2 ω φ φ + t γ α
β 1000.2000.0100
γ 1 1000000
γ 2 100000
ω 100.0200
φ 1000
φ + 10.040.04
t γ 10
α 1
High interaction second-order Sobol coefficients ( S > 0.2 ) affect the identifiability of parameters and the uncertainty of the forecast (see Figure 4, Figure 5, Figure 6 and Figure 7), especially at the stage of activation of the form with bacterial excretion ( ω coefficient) and MDR-TB forms ( β coefficient).
Firstly, we found the correlation between the intensity of the TB reveal ( φ , φ + ) and contagiousness β , with the MDR TB reveal correlation being slightly more strong. That means that the fewer hidden TB causes are revealed, the greater the intensity of transmission is and visa versa. Secondly, we found the correlation between TB contagiousness parameter β and the proportion of infected TB converting to the BE+ form ω and to MDR forms t γ , with the first parameter being of slightly more importance. Together, conversion to BE+ and MDR are the main challenges of the contemporary epidemic.

Appendix A.3. MCMC Convergence

A necessary condition for the convergence of Markov chains in the Bayesian method is the analysis of the R-hat statistic. It has been empirically established that if R ^ > 1.1 , then either the Markov chains are insufficiently long (strongly dependent on the a priori assumption) or the chains are “stuck” in certain regions of the parameter space and cannot effectively explore it in the case of complex models and high parameter correlations. The achieved R-hat values for Altai district are displayed in Table A3. All R-hat values are less than 1.1, indicating convergence of the MCMC method.
The convergence processes for Monte Carlo trajectories of the sensitive parameters are shown in Figure A1.
Figure A1. Monte Carlo trajectories for posterior parameter distribution.
Figure A1. Monte Carlo trajectories for posterior parameter distribution.
Mathematics 14 01600 g0a1
Table A3. R-hat values for parameters of SEIS model of TB spread.
Table A3. R-hat values for parameters of SEIS model of TB spread.
ParameterIrkutskNovosibirskTyvaZabaikalskyAltai
β 1.0541.0261.0271.0111.030
t γ 1.0041.0171.0101.0131.005
ω 1.0141.0211.0101.0041.016
α 1.0061.0091.0071.0091.006
φ 1.0161.0101.0011.0041.013
φ + 1.1001.0361.0351.0161.055
γ 1 1.0151.0211.0071.0031.006
γ 2 1.0051.0101.0041.0091.012
σ 1.0141.0091.0051.0131.070
σ + 1.0081.0191.0181.0271.045

Appendix A.4. Inverse Problem: Numerical Approach

The inverse problem (1), (2), (10) may be reduced to a minimization problem for the following cost functional:
J ( q ) = j = 1 M ( E s T ( t j ; q ) + E γ T ( t j ; q ) E j ) 2 ( E j ) 2 + ( I s T ( t j ; q ) I s j ) 2 ( I s j ) 2 + ( I γ T ( t j ; q ) I γ j ) 2 ( I γ j ) 2 .
We note that this form of cost functional makes us implicitly assume that noise in data is distributed normally.
As a method for solving the problem of minimizing the functional (A1), we use the tree-based Parzen optimization approach included in the Optuna software package implemented in Python (the numerical solution algorithm for the related problem is given in [7], and the general results for the algorithm are described in [74]).
The main advantages of the chosen method are its effectiveness for high dimensions, parallelization or even distributed evaluation, availability for categorical variables and discontinuous functions, and ease of implementation. The limitations are the weak convergence guarantees, with these being surrogate modeling, a disregard of properties of objective function, and a lack of uncertainty estimations.
The convergence results for Novosibirsk region are presented in Figure A2.
Figure A2. Tree-based Parzen estimator convergence results for Novosibirsk region.
Figure A2. Tree-based Parzen estimator convergence results for Novosibirsk region.
Mathematics 14 01600 g0a2

Appendix A.5. The Pareto Front of the Inverse Problem Solution

The Pareto front for TPE optimization is shown in Figure A3.
Figure A3. Pareto front for Altai district optimization with TPE algorithm.
Figure A3. Pareto front for Altai district optimization with TPE algorithm.
Mathematics 14 01600 g0a3

Appendix A.6. Data Spike in 2019

The effect of the 2019 spike in data is shown in Figure A4. In comparison to the resulits in Figure 5, the difference in results is not big; however, the 2019 data anomaly is not describable with a differential model, and thus it is removed from the dataset.
Figure A4. Restored forecast of infected individuals in the Novosibirsk region, compared with actual data (dash-dot lines) with 2019 year data.
Figure A4. Restored forecast of infected individuals in the Novosibirsk region, compared with actual data (dash-dot lines) with 2019 year data.
Mathematics 14 01600 g0a4

References

  1. Global Tuberculosis Report, 2024. Available online: https://www.who.int/teams/global-programme-on-tuberculosis-and-lung-health/tb-reports/global-tuberculosis-report-2024 (accessed on 25 November 2024).
  2. Goletti, D.; Meintjes, G.; Andrade, B.; Zumla, A.; Lee, S.S. Insights from the 2024 WHO Global Tuberculosis Report—More Comprehensive Action, Innovation, and Investments required for achieving WHO End TB goals. Int. J. Infect. Dis. 2024, 150, 107325. [Google Scholar] [CrossRef] [PubMed]
  3. Hamna Mariyam, K.; Sayooj Aby, J.; Anuwat, J.; Karuna, M. A comprehensive study on tuberculosis prediction models: Integrating machine learning into epidemiological analysis. J. Theor. Biol. 2025, 597, 111988. [Google Scholar] [CrossRef]
  4. Krivorotko, O.; Kabanikhin, S. Artificial intelligence for COVID-19 spread modeling. J. Inverse-Ill-Posed Probl. 2024, 32, 297–332. [Google Scholar] [CrossRef]
  5. Avilov, K.; Romanyukha, A. Mathematical models of spread and control of tuberculosis (review). Mat. Biol. Bioinform. 2007, 2, 188–318. (In Russian) [Google Scholar] [CrossRef]
  6. Melnichenko, O.; Romanyukha, A. Model of the tuberculosis epidemiology. Data analysis and parameters estimation. Math. Model. 2008, 20, 107–128. (In Russian) [Google Scholar] [CrossRef]
  7. Kabanikhin, S.; Krivorotko, O.; Neverov, A.; Kaminskiy, G.; Semenova, O. Identification of the Mathematical Model of Tuberculosis and HIV Co-Infection Dynamics. Mathematics 2024, 12, 3636. [Google Scholar] [CrossRef]
  8. Vlad, A.I.; Romanyukha, A.A.; Sannikova, T.E. Parameter Tuning of Agent-Based Models: Metaheuristic Algorithms. Mathematics 2024, 12, 2208. [Google Scholar] [CrossRef]
  9. Romanyukha, A.A.; Karkach, A.S.; Borisov, S.; Belilovsky, E.M.; Sannikova, T.E.; Krivorotko, O. Small-scale stable clusters of elevated tuberculosis incidence in Moscow, 2000–2015. Discovery and spatio-temporal analysis. Int. J. Infect. Dis. IJID Off. Publ. Int. Soc. Infect. Dis. 2019, 91, 156–161. [Google Scholar] [CrossRef]
  10. He, W.; Li, X.; Liang, X.; Liu, Z.; Zeng, Z.; Yang, Z.; Hon, C. FaXNet: A frequency-adaptive, explainable, and uncertainty-aware network for influenza forecasting. Front. Public Health 2026, 14, 1746529. [Google Scholar] [CrossRef]
  11. He, W.; Li, X.; He, H.; Li, Y.; Sun, N.; Hon, C. LSTCNet: Timely prediction of influenza in China using Baidu index and LSTM-TCN hybrid network. BMC Public Health 2026, 26, 855. [Google Scholar] [CrossRef]
  12. Krivorotko, O.I.; Zyatkov, N.Y.; Kabanikhin, S.I. Epidemics modelling: Neural net based on data and SIR-model. Comput. Math. Math. Phys. 2023, 63, 1733–1746. (In Russian) [Google Scholar] [CrossRef]
  13. Krivorotko, O.; Zyatkov, N. The Forecasting of the Spread of Infectious Diseases Based on Conditional Generative Adversarial Networks. Mathematics 2024, 12, 3044. [Google Scholar] [CrossRef]
  14. Meshkov, I.A.; Petrenko, T.; Keiser, O.; Estill, J.; Revyakina, O.V.; Felker, I.; Raviglione, M.C.; Krasnov, V.A.; Schwartz, Y. Variations in tuberculosis prevalence, Russian Federation: A multivariate approach. Bull. World Health Organ. 2019, 97, 737–745A. [Google Scholar] [CrossRef]
  15. Drain, P.K.; Bajema, K.L.; Dowdy, D.W.; Dheda, K.; Naidoo, K.; Schumacher, S.G.; Ma, S.; Meermeier, E.W.; Lewinsohn, D.M.; Sherman, D.R. Incipient and Subclinical Tuberculosis: A Clinical Review of Early Stages and Progression of Infection. Clin. Microbiol. Rev. 2018, 31, e00021-18. [Google Scholar] [CrossRef]
  16. Janssen, S.; Murphy, M.; Upton, C.; Allwood, B.; Diacon, A.H. Tuberculosis: An Update for the Clinician. Respirology 2025, 30, 196–205. [Google Scholar] [CrossRef]
  17. Dowdy, D.; Basu, S.; Andrews, J. Is passive diagnosis enough? The impact of subclinical disease on diagnostic strategies for tuberculosis. Am. J. Respir. Crit. Care Med. 2013, 187, 543–551. [Google Scholar] [CrossRef] [PubMed]
  18. Calligaro, G.L.; Zijenah, L.S.; Peter, J.; Theron, G.; Buser, V.; McNerney, R.; Bara, W.; Bandason, T.; Govender, U.; Tomasicchio, M.; et al. Effect of new tuberculosis diagnostic technologies on community-based intensified case finding: A multicentre randomised controlled trial. Lancet. Infect. Dis. 2017, 17, 441–450. [Google Scholar] [CrossRef]
  19. Sabiiti, W.; Mtafya, B.; de Lima, D.A.; Dombay, E.; Baron, V.O.; Azam, K.; Oravcová, K.; Sloan, D.J.; Gillespie, S.H. A Tuberculosis Molecular Bacterial Load Assay (TB-MBLA). J. Vis. Exp. JoVE 2020, 158, e60460. [Google Scholar] [CrossRef]
  20. Gillespie, S.H.; Sabiiti, W. Tuberculosis molecular bacterial load assay in the management of tuberculosis. Curr. Opin. Infect. Dis. 2024, 38, 176–181. [Google Scholar] [CrossRef]
  21. Wang, J.; Zhang, X.; Huo, F.; Qin, L.; Liu, R.; Shang, Y.; Yao, C.; Ma, L.; Pang, Y. Analysis of Xpert MTB/RIF results in retested patients with very low initial bacterial loads: A retrospective study in China. J. Infect. Public Health 2023, 16, 911–916. [Google Scholar] [CrossRef] [PubMed]
  22. Mbelele, P.M.; Sabiiti, W.; Heysell, S.K.; Sauli, E.; Mpolya, E.A.; Mfinanga, S.; Gillespie, S.H.; Addo, K.K.; Kibiki, G.S.; Sloan, D.J.; et al. Use of a molecular bacterial load assay to distinguish between active TB and post-TB lung disease. Int. J. Tuberc. Lung Dis. 2022, 26, 276–278. [Google Scholar] [CrossRef] [PubMed]
  23. Din, A. Bifurcation analysis of a delayed stochastic HBV epidemic model: Cell-to-cell transmission. Chaos Solitons Fractals 2024, 181, 114714. [Google Scholar] [CrossRef]
  24. Din, A.; Li, Y. Optimizing HIV/AIDS dynamics: Stochastic control strategies with education and treatment. Eur. Phys. J. Plus 2024, 139, 812. [Google Scholar] [CrossRef]
  25. Kim, P.S.; Swaminathan, S. Ending TB: The world’s oldest pandemic. J. Int. AIDS Soc. 2021, 24, e25698. [Google Scholar] [CrossRef]
  26. Wallace, D.; Wallace, R. Problems with the WHO TB model. Math. Biosci. 2019, 313, 71–80. [Google Scholar] [CrossRef]
  27. Xu, C.; Cheng, K.; Wang, Y.; Liu, M.; Wang, X.; Yang, Z.; Guo, S. Analysis of the current status of TB transmission in China based on an age heterogeneity model. Math. Biosci. Eng. 2023, 20, 19232–19253. [Google Scholar] [CrossRef]
  28. Lin, Y.J.; Liao, C.M. Seasonal dynamics of tuberculosis epidemics and implications for multidrug-resistant infection risk assessment. Epidemiol. Infect. 2014, 142, 358–370. [Google Scholar] [CrossRef]
  29. Wang, H.; Jiang, L.; Wang, G. Stability of a tuberculosis model with a time delay in transmission. J. North Univ. China 2014, 35, 238–242. [Google Scholar] [CrossRef]
  30. Achkar, J.M.; Jenny-Avital, E.R. Incipient and Subclinical Tuberculosis: Defining Early Disease States in the Context of Host Immune Response. J. Infect. Dis. 2011, 204, S1179–S1186. [Google Scholar] [CrossRef] [PubMed]
  31. Migliori, G.B.; Ong, C.W.; Petrone, L.; D’Ambrosio, L.; Centis, R.; Goletti, D. The definition of tuberculosis infection based on the spectrum of tuberculosis disease. Breathe 2021, 17, 210079. [Google Scholar] [CrossRef] [PubMed]
  32. Zaidi, S.M.; Coussens, A.K.; Seddon, J.A.; Kredo, T.; Warner, D.; Houben, R.M.; Esmail, H. Beyond latent and active tuberculosis: A scoping review of conceptual frameworks. eClinicalMedicine 2023, 66, 102332. [Google Scholar] [CrossRef]
  33. Herrera, M.; Taguiam, E.; Laupland, K.B.; Rueda, Z.V.; Keynan, Y. Public health implications of the evolving understanding of tuberculosis natural history. J. Assoc. Med. Microbiol. Infect. Dis. Can. 2023, 8, 241–244. [Google Scholar] [CrossRef] [PubMed]
  34. Choi, Y.; Park, S.J.; An, H.S.; Kim, H.M.; Yoo, J.Y.; Pyo, S.W.; Song, J.S.; Lee, S.E. Epidemiological Analysis of Tuberculosis Transmission, Risk Factors, and Subclinical Tuberculosis Management in a High School Outbreak, South Korea. Open Forum Infect. Dis. 2025, 12, ofaf452. [Google Scholar] [CrossRef] [PubMed]
  35. Emery, J.C.; Dodd, P.J.; Banu, S.; Frascella, B.; Garden, F.L.; Horton, K.C.; Hossain, S.; Law, I.; Van Leth, F.; Marks, G.B.; et al. Estimating the contribution of subclinical tuberculosis disease to transmission: An individual patient data analysis from prevalence surveys. eLife 2023, 12, e82469. [Google Scholar] [CrossRef]
  36. Escalante, P.; Vadiyala, M.R.; Pathakumari, B.; Marty, P.K.; Van Keulen, V.P.; Hilgart, H.R.; Meserve, K.; Theel, E.S.; Peikert, T.; Bailey, R.C.; et al. New diagnostics for the spectrum of asymptomatic TB: From infection to subclinical disease. Int. J. Tuberc. Lung Dis. 2023, 27, 499–505. [Google Scholar] [CrossRef] [PubMed]
  37. Mahmoudi, S.; García, M.J.; Drain, P.K. Current approaches for diagnosis of subclinical pulmonary tuberculosis, clinical implications and future perspectives: A scoping review. Expert Rev. Clin. Immunol. 2024, 20, 715–726. [Google Scholar] [CrossRef]
  38. Cho, S.; Shin, S.; Kim, Y.; Song, W.; Hong, S.; Jeong, S.; Kang, M.; Lee, K. A novel approach for tuberculosis diagnosis using exosomal DNA and droplet digital PCR. Clin. Microbiol. Infect. 2020, 26, 942.e1–942.e5. [Google Scholar] [CrossRef]
  39. Gupta, R.K.; Turner, C.T.; Venturini, C.; Esmail, H.; Rangaka, M.X.; Copas, A.; Lipman, M.; Abubakar, I.; Noursadeghi, M. Concise whole blood transcriptional signatures for incipient tuberculosis: A systematic review and patient-level pooled meta-analysis. Lancet Respir. Med. 2020, 8, 395–406. [Google Scholar] [CrossRef]
  40. Rotundo, S.; Tassone, M.T.; Serapide, F.; Russo, A.; Trecarichi, E.M. Incipient tuberculosis: A comprehensive overview. Infection 2024, 52, 1215–1222. [Google Scholar] [CrossRef]
  41. Sivakumaran, D.; Jenum, S.; Srivastava, A.; Steen, V.M.; Vaz, M.; Doherty, T.M.; Ritz, C.; Grewal, H.M.S. Host blood-based biosignatures for subclinical TB and incipient TB: A prospective study of adult TB household contacts in Southern India. Front. Immunol. 2023, 13, 1051963. [Google Scholar] [CrossRef]
  42. Perelman, M.; Marchuk, G.; Borisov, S.; Kazennykh, B.; Avilov, K.; Karkach, A.; Romanyukha, A. Tuberculosis epidemiology in Russia: The mathematical model and data analysis. Russ. J. Numer. Anal. Math. Model. 2004, 19, 305–314. [Google Scholar] [CrossRef]
  43. Yu, Y.; Shi, Y.; Yao, W. Dynamic model of tuberculosis considering multi-drug resistance and their applications. Infect. Dis. Model. 2018, 3, 362–372. [Google Scholar] [CrossRef]
  44. Avilov, K.; Romanyukha, A.; Belilovsky, E.; Borisov, S. Mathematical modelling of the progression of active tuberculosis: Insights from fluorography data. Infect. Dis. Model. 2022, 7, 374–386. [Google Scholar] [CrossRef] [PubMed]
  45. Mulugeta, K.; Dawed, M.; Teklu, S. Mathematical model analysis of drug resistance TB dynamics with immigrants and impact of media efficiency. Sci. Rep. 2025, 15, 42375. [Google Scholar] [CrossRef] [PubMed]
  46. Kabanikhin, S.; Krivorotko, O.; Takuadina, A.; Andornaya, D.; Zhang, S. Geo-information system of tuberculosis spread based on inversion and prediction. J. Inverse Ill-Posed Probl. 2021, 29, 65–79. [Google Scholar] [CrossRef]
  47. von Kowalevsky, S. Zur Theorie der partiellen Differentialgleichung. J. Für Die Reine Angew. Math. 1875, 80, 32. [Google Scholar]
  48. Unified Interdepartmental Information and Statistical System. Available online: https://www.fedstat.ru/ (accessed on 19 March 2026).
  49. Stavitskaya, N.; Nemkova, E.; Tashkova, G. Key Indicators of Anti-Tuberculosis Activities in the Siberian and Far Eastern Federal Districts (Statistical Materials); Novosibirsk TB Research Institute: Novosibirsk, Russia, 2025; p. 156. Available online: https://elibrary.ru/item.asp?id=82569907 (accessed on 19 March 2026). (In Russian)
  50. Nguyen, H.V.; Tiemersma, E.; Nguyen, N.V.; Nguyen, H.B.; Cobelens, F. Disease Transmission by Patients with Subclinical Tuberculosis. Clin. Infect. Dis. 2023, 76, 2000–2006. [Google Scholar] [CrossRef]
  51. Tyulkova, T.Y.; Mezentseva, A.V. Latent Tuberculosis Infection and Residual Post-Tuberculous Changes in Children. Curr. Pediatr. 2017, 16, 452–456. [Google Scholar] [CrossRef]
  52. Tornack, J.; Reece, S.T.; Bauer, W.M.; Vogelzang, A.; Bandermann, S.; Zedler, U.; Stingl, G.; Kaufmann, S.H.E.; Melchers, F. Human and Mouse Hematopoietic Stem Cells Are a Depot for Dormant Mycobacterium tuberculosis. PLoS ONE 2017, 12, e0169119. [Google Scholar] [CrossRef]
  53. Belogorodtsev, S.N.; Lykov, A.P.; Nemkova, E.K.; Schwartz, Y.S. Interaction between mesenchymal stromal cells and tuberculous mycobacteria in vitro. Tuberc. Lung Dis. 2023, 101, 57–63. [Google Scholar] [CrossRef]
  54. Khan, A.; Jagannath, C. Interactions of Mycobacterium tuberculosis with Human Mesenchymal Stem Cells. In Tuberculosis Host-Pathogen Interactions; Cirillo, J.D., Kong, Y., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 95–111. [Google Scholar] [CrossRef]
  55. Dale, K.D.; Schwalb, A.; Coussens, A.K.; Gibney, K.B.; Abboud, A.J.; Watts, K.; Denholm, J.T. Overlooked, dismissed, and downplayed: Reversion of Mycobacterium tuberculosis immunoreactivity. Eur. Respir. Rev. 2024, 33, 240007. [Google Scholar] [CrossRef]
  56. Shang, Y. Lie algebraic discussion for affinity based information diffusion in social networks. Open Phys. 2017, 15, 705–711. [Google Scholar] [CrossRef]
  57. Demography statistics of Russian Federation by Rosstat, 2025. Available online: https://rosstat.gov.ru/folder/12781 (accessed on 1 December 2025).
  58. van den Driessche, P.; Watmough, J. Further Notes on the Basic Reproduction Number. In Mathematical Epidemiology; Springer: Berlin/Heidelberg, Germany, 2008; pp. 159–178. [Google Scholar] [CrossRef]
  59. Kabanikhin, S.I.; Voronov, D.A.; Grodz, A.A.; Krivorotko, O.I. Identifiability of mathematical models in medical biology. Russ. J. Genet. Appl. Res. 2016, 6, 838–844. [Google Scholar] [CrossRef]
  60. Krivorotko, O.; Kabanikhin, S.; Petrakova, V. Identifiablility of mathematical models of epidemiology: Tuberculosis, HIV, COVID-19. Math. Biol. Bioinform. 2023, 18, 177–214. [Google Scholar] [CrossRef]
  61. Sobol, I. Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Math. Comput. Simul. 2001, 55, 271–280. [Google Scholar] [CrossRef]
  62. Order On Approval of Methodological Recommendations for Improving the Diagnosis and Treatment of Tuberculosis of the Respiratory Organs. 2014. (Dated 29.12.2014 No. 951). Available online: https://www.garant.ru/products/ipo/prime/doc/70749840/ (accessed on 13 March 2026). (In Russian)
  63. Federal Law No. 314-FZ Dated August 3, 2018, 2018. Available online: https://www.garant.ru/products/ipo/prime/doc/71905584/ (accessed on 13 March 2026). (In Russian)
  64. Order of the Ministry of Health of the Russian Federation Dated 13.03.2019 N 127n “On Approval of the Procedure for Dispensary Monitoring of Tuberculosis Patients, Persons in or Who Have Been in Contact with a Source of Tuberculosis, as Well as Persons with Suspected Tuberculosis and Those Who Have Been Treated for Tuberculosis, and on the Repeal of Items 16–17 of the Procedure for Providing Medical Care to Tuberculosis Patients, Approved by Order of the Ministry of Health of the Russian Federation Dated 15.11.2012 N 932n. Available online: https://base.garant.ru/72275106/ (accessed on 13 March 2026). (In Russian)
  65. Order of the Russian Ministry of Health Dated 05.04.2019 N 199 “On Approval of the Departmental Target Program “Prevention and Control of Socially Significant Infectious Diseases”. Available online: https://www.garant.ru/products/ipo/prime/doc/72127892/ (accessed on 13 March 2026). (In Russian)
  66. Trajman, A.; Felker, I.; Alves, L.C.; Coutinho, I.; Osman, M.; Meehan, S.A.; Singh, U.B.; Schwartz, Y. The COVID-19 and TB syndemic: The way forward. Int. J. Tuberc. Lung Dis. 2022, 26, 710–719. [Google Scholar] [CrossRef]
  67. Avilov, K.; Romanyukha, A.; Belilovsky, E.; Borisov, S. Comparison of simulation schemes for the natural history of respiratory tuberculosis. Math. Biol. Bioinform. 2019, 14, 570–587. [Google Scholar] [CrossRef]
  68. Guo, Z.K.; Huo, H.F.; Xiang, H.; Ren, Q.Y. Global dynamics of a tuberculosis model with age-dependent latency and time delays in treatment. J. Math. Biol. 2023, 87, 66. [Google Scholar] [CrossRef]
  69. Ma, Y.; Horsburgh, C.R.; White, L.F.; Jenkins, H.E. Quantifying TB transmission: A systematic review of reproduction number and serial interval estimates for tuberculosis. Epidemiol. Infect. 2018, 146, 1478–1494. [Google Scholar] [CrossRef]
  70. Ochieng, F.O. SEIRS model for TB transmission dynamics incorporating the environment and optimal control. BMC Infect. Dis. 2025, 25, 490. [Google Scholar] [CrossRef] [PubMed]
  71. Villalva-Serra, K.; Barreto-Duarte, B.; Rodrigues, M.M.; Queiroz, A.T.; Martinez, L.; Croda, J.; Rolla, V.C.; Kritski, A.L.; Cordeiro-Santos, M.; Sterling, T.R.; et al. Impact of strategic public health interventions to reduce tuberculosis incidence in Brazil: A Bayesian structural time-series scenario analysis. Lancet Reg. Health-Am. 2025, 41, 100963, Corrigendum in Lancet Reg. Health-Am. 2026, 54, 101381. https://doi.org/10.1016/j.lana.2026.101381. [Google Scholar] [CrossRef] [PubMed]
  72. Saber, S.; Alahmari, A. Mathematical Insights into Zoonotic Disease Spread: Application of the Milstein Method. Eur. J. Pure Appl. Math. 2025, 18, 5881. [Google Scholar] [CrossRef]
  73. Saber, S.; Solouma, E.; Althubyani, M.; Messaoudi, M. Statistical Insights into Zoonotic Disease Dynamics: Simulation and Control Strategy Evaluation. Symmetry 2025, 17, 733. [Google Scholar] [CrossRef]
  74. Akiba, T.; Sano, S.; Yanase, T.; Ohta, T.; Koyama, M. Optuna: A Next-generation Hyperparameter Optimization Framework. arXiv 2019, arXiv:1907.10902. [Google Scholar] [CrossRef]
Figure 1. Proportion of MDR-TB cases among newly diagnosed patients in regions of the Russian Federation (%).
Figure 1. Proportion of MDR-TB cases among newly diagnosed patients in regions of the Russian Federation (%).
Mathematics 14 01600 g001
Figure 2. Schematic diagram of the TB dynamics model with MDR (1), based on the mass conservation law, excluding mortality rates.
Figure 2. Schematic diagram of the TB dynamics model with MDR (1), based on the mass conservation law, excluding mortality rates.
Mathematics 14 01600 g002
Figure 3. The first Sobol indices S 1 i ( n ) (left) and the total Sobol indices S T i ( n ) (right) for parameters q ˜ i , i = 1 , , 8 , with 95% confidence intervals for 10 years of simulation for TB BE− (the first row), TB BE+ without MDR I s T ( t ) (the middle row), and MDR-TB BE+ I γ T ( t ) (the last row).
Figure 3. The first Sobol indices S 1 i ( n ) (left) and the total Sobol indices S T i ( n ) (right) for parameters q ˜ i , i = 1 , , 8 , with 95% confidence intervals for 10 years of simulation for TB BE− (the first row), TB BE+ without MDR I s T ( t ) (the middle row), and MDR-TB BE+ I γ T ( t ) (the last row).
Mathematics 14 01600 g003
Figure 4. Restored parameters of SEIS model of TB spread (1) in regions of Russian Federation with estimations by TPE Optuna optimizer.
Figure 4. Restored parameters of SEIS model of TB spread (1) in regions of Russian Federation with estimations by TPE Optuna optimizer.
Mathematics 14 01600 g004
Figure 5. Probabilistic forecast of infected individuals in the Irkutsk region (left) and the Novosibirsk region (right), compared with actual data (dash-dot lines).
Figure 5. Probabilistic forecast of infected individuals in the Irkutsk region (left) and the Novosibirsk region (right), compared with actual data (dash-dot lines).
Mathematics 14 01600 g005
Figure 6. Probabilistic forecast of infected individuals in the Altai district (left) and the Tyva Republic (right), compared with actual data (dash-dot lines).
Figure 6. Probabilistic forecast of infected individuals in the Altai district (left) and the Tyva Republic (right), compared with actual data (dash-dot lines).
Mathematics 14 01600 g006
Figure 7. Probabilistic forecast of infected individuals in the Zabaikalsky district, compared with actual data (dash-dot lines).
Figure 7. Probabilistic forecast of infected individuals in the Zabaikalsky district, compared with actual data (dash-dot lines).
Mathematics 14 01600 g007
Table 1. Variables and parameters of SEIS model of TB spread.
Table 1. Variables and parameters of SEIS model of TB spread.
SymbolDescriptionValueRef.
Variables (ppl.)
Ssusceptibles and carries of latent infection
E s , E s T TB-infected without bacterioexcretion (TB BE−)
E γ , E γ T MDR-TB-infected without bacterioexcretion (MDR-TB BE−)
I s , I s T TB-infected with bacterioexcretion (TB BE+)
I γ , I γ T MDR-TB-infected with bacterioexcretion (MDR-TB BE+)
Main parameters (parameters of inflow)
bbirth ratesee Section 3[48]
β contagiousness of TB contact with bacterioexcretion [ 0 ,   10 ] [42]
kpopulation potential (ppl.) 4 · 10 6
t γ fraction of non-MDR-infected turning into MDR-infected per year [ 0.001 ,   0.1 ] f.33 *
ω rate of TB BE− turning into BE+ per year [ 0.1 ,   4 ] [42]
Outflow parameters
μ speed of population outflowsee Section 3[48]
α additional death rate of TB-infected with bacterioexcretion [ 0.03 ,   0.2 ] f.33
Treatment parameters
φ rate of undetected TB BE− infected that is detected per year [ 0.2 ,   0.5 ] f.33
φ + rate of undetected TB BE+ infected that is detected per year [ 0.5 ,   3 ] f.33
γ 1 rate of detected MDR-TB-infected that is treated per year [ 0.25 ,   0.5 ] f.33
γ 2 rate of detected TB-infected that is treated per year [ 0.3 ,   2 ] f.33
Initial state
σ fraction of undetected TB BE− infected in the initial moment [ 0 ,   0.5 ]
σ + fraction of undetected TB BE+ infected in the initial moment [ 0 ,   0.5 ]
* f.33 stands for statistical reports form 33 “Information on tuberculosis patients”.
Table 2. Correlation matrix ρ i j for the Novosibirsk region posterior.
Table 2. Correlation matrix ρ i j for the Novosibirsk region posterior.
β γ 1 γ 2 ω φ φ + t γ α
β 1−0.609−0.7390.309−0.8860.9840.8810.869
γ 1 −0.60910.947−0.4810.875−0.506−0.669−0.849
γ 2 −0.7390.9471−0.5590.926−0.639−0.855−0.935
ω 0.309−0.481−0.5591−0.2940.1450.6440.666
φ −0.8860.8750.926−0.2941−0.843−0.825−0.901
φ + 0.984−0.506−0.6390.145−0.84310.8000.771
t γ 0.881−0.669−0.8550.644−0.8250.80010.943
α 0.869−0.849−0.9350.666−0.9010.7710.9431
Table 3. Number of deaths in Russian Federation regions per 1000 residents.
Table 3. Number of deaths in Russian Federation regions per 1000 residents.
Region20092010201120122013201420152016201720182019202020212022
Zabaikalsky district13.813.813.313.112.512.512.912.311.712.312.413.715.813.8
Tyva Republic1211.61111.210.910.910.39.88.78.88.39.498.6
Novosibirsk region1413.913.613.613.413.313.11312.912.912.715.31713.7
Irkutsk region14.314.41413.913.613.713.613.412.91313.21517.714.1
Altai district14.71514.614.614.214.214.114.11414.31416.519.115.8
Table 4. Number of births in Russian Federation regions per 1000 residents.
Table 4. Number of births in Russian Federation regions per 1000 residents.
Region20092010201120122013201420152016201720182019202020212022
Zabaikalsky district16.115.915.616.316.116.215.714.913.71312.212.211.911.2
Tyva Republic26.926.827.426.62625.223.723.121.820.118.42019.717.7
Novosibirsk region12.913.213.113.914.11414.213.712.311.710.710.310.19.6
Irkutsk region15.615.215.315.915.715.215.314.713.412.811.811.31110.4
Altai district12.712.712.813.813.613.412.912.411.210.39.498.78.2
Table 5. Restored parameters of SEIS model of TB spread (1) in regions of Russian Federation.
Table 5. Restored parameters of SEIS model of TB spread (1) in regions of Russian Federation.
NovosibirskTyvaAltaiIrkutskZabaikalsky
SymbolA Priori IntervalMean95% CIMean95% CIMean95% CIMean95% CIMean95% CI
R 0 -0.973[0.594; 1.598]0.974[0.683; 1.400]0.942[0.557; 1.595]0.876[0.518; 1.502]0.926[0.540; 1.593]
β [ 0.03 ,   0.2 ] 2.676[2.369; 2.928]2.972[2.718; 3.171]3.566[3.199; 3.909]2.927[2.628; 3.228]2.128[1.876; 2.375]
t γ [ 0 ,   10 ] 0.057[0.046; 0.068]0.022[0.018; 0.026]0.019[0.015; 0.023]0.049[0.039; 0.058]0.068[0.056; 0.081]
ω [ 0.25 ,   0.5 ] 3.164[2.727; 3.588]3.725[3.272; 3.983]1.368[1.143; 1.597]1.605[1.348; 1.871]0.844[0.725; 0.971]
α [ 0.3 ,   2 ] 0.065[0.053; 0.077]0.038[0.031; 0.045]0.150[0.120; 0.181]0.066[0.054; 0.079]0.125[0.101; 0.149]
φ [ 0 ,   0.5 ] 0.326[0.265; 0.391]0.267[0.212; 0.321]0.488[0.422; 0.556]0.295[0.243; 0.350]0.275[0.235; 0.318]
φ + [ 0 ,   0.5 ] 2.407[2.123; 2.634]2.787[2.536; 2.957]2.610[2.349; 2.836]2.724[2.442; 2.968]1.577[1.376; 1.772]
γ 1 [ 0.1 ,   4 ] 0.373[0.326; 0.419]0.290[0.238; 0.342]0.328[0.273; 0.390]0.307[0.258; 0.356]0.286[0.245; 0.326]
γ 2 [ 0.2 ,   0.5 ] 0.519[0.455; 0.585]0.870[0.741; 1.012]0.674[0.552; 0.816]0.275[0.227; 0.323]0.292[0.249; 0.338]
σ [ 0.5 ,   3 ] 0.278[0.230; 0.322]0.186[0.152; 0.218]0.482[0.447; 0.498]0.254[0.213; 0.295]0.281[0.234; 0.326]
σ + [ 0.001 ,   0.1 ] 0.354[0.298; 0.411]0.381[0.321; 0.440]0.476[0.429; 0.499]0.376[0.319; 0.433]0.471[0.420; 0.497]
Table 6. Quantitative forecast accuracy metrics (MAPE and RMSE) for measured data.
Table 6. Quantitative forecast accuracy metrics (MAPE and RMSE) for measured data.
E s T + E γ T I s T I γ T
MAPE0.25720.2850.2476
RMSE 2.7 · 10 4 5 · 10 5 8.7 · 10 5
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Krivorotko, O.; Neverov, A.; Schwartz, Y.; Kaminskiy, G.; Zyatkov, N.; Laushkina, Z. Bayesian Analysis of Tuberculosis Spread Scenarios in Regions of Russian Federation. Mathematics 2026, 14, 1600. https://doi.org/10.3390/math14101600

AMA Style

Krivorotko O, Neverov A, Schwartz Y, Kaminskiy G, Zyatkov N, Laushkina Z. Bayesian Analysis of Tuberculosis Spread Scenarios in Regions of Russian Federation. Mathematics. 2026; 14(10):1600. https://doi.org/10.3390/math14101600

Chicago/Turabian Style

Krivorotko, Olga, Andrei Neverov, Yakov Schwartz, Grigoriy Kaminskiy, Nikolay Zyatkov, and Zhanna Laushkina. 2026. "Bayesian Analysis of Tuberculosis Spread Scenarios in Regions of Russian Federation" Mathematics 14, no. 10: 1600. https://doi.org/10.3390/math14101600

APA Style

Krivorotko, O., Neverov, A., Schwartz, Y., Kaminskiy, G., Zyatkov, N., & Laushkina, Z. (2026). Bayesian Analysis of Tuberculosis Spread Scenarios in Regions of Russian Federation. Mathematics, 14(10), 1600. https://doi.org/10.3390/math14101600

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop