Next Article in Journal
A Comprehensive Review of Robinetin: Distribution, Biological Activity and Pharmacokinetic Parameters
Previous Article in Journal
Evaluation of Five Plasma miRNAs as Biomarkers for Minimally Invasive Staging of Liver Fibrosis in β-Thalassaemia Patients
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Genetic Susceptibility to Tuberculosis and the Utility of Polygenic Scores in Population Stratification

by
Mariia A. Dashian
1,2,
German A. Shipulin
1 and
Andrei A. Deviatkin
1,3,*
1
Federal State Budgetary Institution ‘Centre for Strategic Planning and Management of Biomedical Health Risks’ of the Federal Medical and Biological Agency, 119121 Moscow, Russia
2
Department of Biomedicine, Pirogov Medical University, 117997 Moscow, Russia
3
Laboratory of Postgenomic Technologies, Izmerov Research Institute of Occupational Health, 105275 Moscow, Russia
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(19), 9544; https://doi.org/10.3390/ijms26199544
Submission received: 25 August 2025 / Revised: 24 September 2025 / Accepted: 27 September 2025 / Published: 30 September 2025
(This article belongs to the Special Issue Tuberculosis: Host Immunity, Diagnosis and Treatment)

Abstract

Tuberculosis (TB) is one of the leading infectious causes of mortality worldwide. Although a significant proportion of the population (up to 36%, depending on the region) is infected with the latent form of TB, only about one in ten of these people will develop an active form of the disease in their lifetime. This is due to a complex interaction between the host’s genetic predisposition and environment. However, the genetic determinants of TB are not well established and have been insufficiently explored in previous genome-wide association studies (GWAS) with sparse and incongruent results. We reviewed recent evidence on host genetic susceptibility to TB, highlighting population-specific characteristics, host–pathogen coevolution, and the limitations of conventional GWAS approaches in terms of clinical and genetic heterogeneity. While rare variants with high penetrance, such as TYK2 P1104A, lead to monogenic susceptibility, most heritable risk results from the cumulative effect of numerous common variants. This cumulative effect may be summarized using polygenic risk scores (PRSs). Although their use has been proven for non-communicable diseases, PRSs are not applied to infectious disease susceptibility. To date, no PRS model for susceptibility to tuberculosis has been consistently validated. The development of PRSs for TB susceptibility is limited by phenotypic heterogeneity, population structure, and co-adaptation between host and pathogen. Another major challenge is to take into account the considerable influence of environmental factors. This difficulty in modeling environmental influences probably explains the current lack of a clinically applicable PRS for TB susceptibility. However, taking these caveats into account, polygenic models could improve risk stratification at the individual level compared to single-variant association and allow for earlier targeted treatment and prophylaxis.

1. Introduction

Some people have severe symptoms of infectious diseases, while others are asymptomatic or recover quickly. It is noteworthy that infected asymptomatic individuals usually do not require medical treatment. As a result, these cases usually remain unnoticed. Therefore, the actual incidence of asymptomatic infections is unknown. For example, it is estimated that approximately 25% of COVID-19 cases worldwide were asymptomatic [1]. Furthermore, studies have shown that unvaccinated and previously uninfected individuals can carry antibodies to dangerous infections such as tick-borne encephalitis [2,3,4] and rabies [5,6]. These cases are rarely studied, and current immunological knowledge largely excludes information about asymptomatic cases. This means that the current understanding of the actual epidemiology of infectious diseases is rather fragmented, as it is unclear what causes differences in disease outcomes.
Tuberculosis (TB) is one of the most socially significant diseases, caused by Mycobacterium tuberculosis (MTB) [6]. It is the leading cause of death from an infectious agent [7]. According to estimates, about one-third of the world’s population is infected with MTB; however, only around 5 to 10 percent of these individuals will develop active TB [8]. Importantly, the proportion of individuals with latent infection varies significantly across regions. Rough estimates suggest that 14% of people in North and South America carry latent infections, compared to 36% in Southeast Asia [9]. This disparity likely reflects region-specific combinations of environmental exposures and genetic backgrounds. However, the relative weight of these factors remains unclear. The reasons for variations in susceptibility to infections, including TB, remain poorly understood. Both environmental [10] and genetic factors [11] have an impact on the disease development. Environmental factors such as smoking, poor nutrition, stressful situations, and air pollution can weaken the immune system by impairing lymphocyte function and antibody production [12,13,14,15,16]. At the same time, each person has a unique set of genes that determines individual and population differences in the functioning of immune systems and susceptibility to infections [17]. In Lübeck, Germany, during the 1930 bacille Calmette–Guérin (BCG) vaccination program, 251 newborn infants were mistakenly given the vaccine containing both the attenuated BCG strain and live MTB. Of these, 228 developed some clinical or radiographic evidence of TB disease. Of these children, 72 died, while 156 demonstrated positive tuberculin skin test results but no clinical manifestations of TB [18]. Although the exact MTB dose likely varied among the children, it is reasonable to assume that genetics played a larger role than environmental factors in the manifestation of the disease, given that the children lived in similar conditions and received treatment from the same doctors.
Twin studies are the gold standard for assessing the genetic influence on any trait since monozygotic twins have identical genomes. If the hereditary component of the phenotype is significant, the concordance rate for a disease is higher in monozygotic twins than in dizygotic twins. Twin studies have shown that genetics undoubtedly plays a role in susceptibility to TB. In 1936, the heritability of the incidence of TB was investigated by observing 205 twin pairs, and higher concordance for TB was found in monozygotic (65%) rather than in dizygotic twin pairs (25%) [19]. Similar findings were reported by Comstock et al. [20], in which a multiple regression analysis was conducted in order to eliminate differences caused by factors other than zygosity. Noteworthy, younger twins show lower concordance rates, indicating a stronger environmental role early in life [19,21].
Both genetics and environmental factors [22,23] contribute to TB development, but their relative influence varies among individuals. Schurz et al. [24] indicated a substantial genetic influence on TB risk, with a pooled polygenic heritability estimate of 26.3% across different ancestries. In the presence of genetic predisposition, even minimal environmental exposure may be sufficient to trigger disease onset, whereas individuals with low genetic susceptibility may remain unaffected despite substantial environmental exposure.
Genetic susceptibility to TB has been extensively studied. Previous research has focused on the immunology of MTB infection [8], the impact of rare monogenic variants (e.g., TUK2), and summarizing the results of GWAS performed [24]. However, the applicability of PRS to susceptibility to TB or the challenges that establishing a PRS for susceptibility to TB might pose were not discussed. Therefore, the aim of this review is to summarize recent genetic findings to provide a realistic assessment of the possibilities and limitations of PRS for TB susceptibility.

2. Host–Pathogen Coevolution

The genomics of the infection process can be viewed from two perspectives: the pathogen and the host. Different MTB lineages exhibit a high degree of genetic similarity. Moreover, even closely related species within the Mycobacterium tuberculosis complex (MTBC)—such as M. africanum, M. bovis, and others—are genetically nearly indistinguishable (pairwise digital DNA–DNA hybridization—91.2–99.2%; Average Nucleotide Identity: 99.21–99.92%). As a result, it has been proposed to reclassify all members of the MTBC as variants of M. tuberculosis [25] (“https://lpsn.dsmz.de/species/mycobacterium-tuberculosis” assessed on 24 September 2025).
Despite such genetic similarity, different Mycobacterium lineages vary in infectivity across ethnic groups [26,27,28,29,30], likely reflecting long-term host–pathogen coevolution. For example, the restricted geographic range of M. africanum has been attributed to its coadaptation with specific West African subpopulations [31]. Over evolutionary time, human populations have accumulated unique immune-related genetic variants. Concurrently, certain MTB strains may have adapted to the immune profiles of their local human hosts, shaping population-specific susceptibility to TB.
Experimental evidence for host–pathogen coadaptation was demonstrated using a mouse model of TB susceptibility. The authors infected mice from various inbred lineages with a library of MTB mutants and quantified the relative abundance of each mutant in the input library versus the bacterial population recovered from the spleens one month after infection [32]. Lineage-specific sets of MTB genes required for successful infection were identified for each mouse genotype. These results suggest that different genetic backgrounds of the host exert different selection pressures on the pathogen, supporting the hypothesis of host–pathogen coadaptation. Such mechanisms could explain the regional dominance of certain MTB lineages that may have adapted to the genetic profiles of local human populations. In addition, the population-specific effects of certain human genetic variants on TB susceptibility may be shaped in part by long-term exposure to regionally predominant MTB strains.

3. Limited Success of Genome-Wide Association Studies (GWAS) in TB

Totally, 18 genome-wide association studies (GWAS) have been performed to determine the genetic variants responsible for TB risk (Table 1). A systematic PubMed search with the query “Tuberculosis”[MeSH] AND GWAS [MeSH Terms] AND human[MeSH Terms] NOT “Review”[Publication Type] (accessed on 23 May 2025) yielded 116 publications, 17 of which were found to be eligible after manual screening. One additional relevant study [33] was manually included in this meta-analysis, bringing the total number of GWAS on TB susceptibility to 18. The design and sample size of the studies varied considerably, with the number of cases ranging from 197 to 13,692 and the number of healthy controls ranging from 91 to 283,250 participants. These studies covered a variety of populations, including individuals of African (Ghanaian, Gambian, South African), Asian (Han Chinese, Thai, Japanese, Korean), European (Icelandic, Russian), and admixed (Peruvian) origin.
Table 1. Overview of GWAS investigating genetic susceptibility to TB.
Table 1. Overview of GWAS investigating genetic susceptibility to TB.
ReferenceSample Size (Cases/Healthy Controls)Significant SNPsPopulation AncestryYear
[34] rs2243639 (SFTPD) 2014
rs2569190 (CD14)South African Colored (SAC)
918/507rs34069356 (CTSZ)
[35]410/405not statistically significantSouth African Colored (SAC)2019
[36]642/91rs2057178South African Colored (SAC) population2014
[37]5530/5607rs4733781 (ASAP1)
rs10956514 (ASAP1)
No associations were observed for rs10956514 and rs11774633 (ASAP1) with TB in Chinese populations [38]
Russian (replication in the African population)2015
[39]Four datasets:
young Japanese (60/249),
young Thai (137/295),
old Japanese (123/685),
old Thai (300/295)
rs6071980Thai/Japanese2012
[40]4310/6386rs41553512 (HLA-DRB5)Han Chinese2017
[41]2237/3122rs4331426 (LOC124904262: Intron Variant)
No associations were observed for rs4331426 with TB in Chinese populations [38]
Ghanaian/Gambian2010
[33]8821/13,859rs2057178Ghanaian/Gambian (Russian/Indonesian)2012
[42]333/616rs17155120 (C10orf90: Intron Variant)Vietnamese/French/South African2021
[43]13,692/283,250rs557011Icelandic/Russian/Croatian2016
[44]646/1813rs9365798Korean2017
[45]616/709rs3751112 (RAB17), rs141645096 (DCTN4)Han Chinese2021
[46]2175/1827rs73226617 (LINC02618: Intron Variant)Peruvian (Lima)2019
[47]2949/5090rs12437118 (ESRRB), rs6114027 (TGM6: Intron Variant)Han Chinese2018
[48]556/650rs916943 (AGMO: Intron Variant)Moroccan/Russian2016
[49]2653/2537rs2273051 (COL4A5)Indonesian/Russian2012
[50]4426/84,290rs2894257European2017
[51]43 patients with active TB and 49 with latent TB/313 controlsrs62292160Chinese Han2021
Key findings include loci in HLA genes, macrophage regulators (ASAP1, MAFB), and monocyte-specific enhancers (ATP1B3). However, only a few SNPs were replicated across different studies (Table 1). The WT1 locus, identified in cohorts from Ghana and Gambia, was replicated in cohorts from South Africa and Russia. The ASAP1 locus, first reported in a Russian cohort, was later validated through a reanalysis of previous datasets [27,52]. The human leukocyte antigen (HLA) region, particularly the HLA class II subregion, has been identified as a significant correlate of TB susceptibility [24,53].
It should be noted that GWAS findings often contradicted each other and failed to be replicated in independent studies (Table 1). There are several possible explanations for this observation.
Firstly, TB is a complex polygenic disease influenced by many genetic variants, each exerting a small effect. Detecting these weak signals is challenging with conventional GWAS methods. Genetic susceptibility to TB results from mutations that impair the function of numerous genes that are important for immune protection. Significant genetic changes not only take place in regions of genes that code for proteins but also in non-coding regions. These regions and regulatory protein-coding genes, such as transcription factors, play a significant role in immune function [54,55]. In theory, different groups of people exposed to TB can carry different mutations in different genes—for example, one group may carry mutations in interferon genes, another may carry mutations in HLA variants, and a third group may carry mutations in transcription factors regulating other immune genes. Once different groups of people with different genetic mutations are mixed, the signals from single variants overlap, and it becomes more challenging for GWAS to detect them.
Furthermore, TB presentation differs among individuals, diverging throughout investigations. Active TB includes various clinical forms, potentially exhibiting distinct genetic origins [56]. Although pulmonary TB persists as the predominant manner of its manifestation, the infection may essentially involve each organ system. Even within pulmonary cases, the disease progression differs, with some individuals developing symptoms shortly after primary infection, whereas others experience a reactivation of latent TB years later. This clinical and biological diversity complicates the definition of a consistent phenotype for genetic analysis [57] and may account for the poor replicability observed in TB GWAS.
GWAS have been performed across populations with diverse ethnic backgrounds (Table 1), which differ in allele frequencies and patterns of linkage disequilibrium. Consequently, genetic associations identified in one population are frequently not applicable in others. Different populations may have distinct components of the immune response affected, resulting in varied associated genetic variants [53,58]. For example, the IL-10 promoter polymorphism rs1800896 increases TB susceptibility in Caucasian but not in African populations [59]. Moderate inflammation generally protects against the development of TB, while chronically increased inflammation is strongly associated with disease progression, regardless of bacterial load [60]. Ethnic differences in inflammatory profiles have been reported, with individuals of African descent generally exhibiting higher systemic inflammation than those of European ancestry [61]. This tentatively suggests that higher anti-inflammatory cytokine levels may suppress inflammatory responses in individuals of European origin, thereby increasing their susceptibility by impairing host defense. Conversely, elevated pro-inflammatory cytokine levels in individuals of African ancestry may exacerbate inflammation, increasing TB risk in genetically predisposed persons.
Population heterogeneity and limited sample sizes likely contribute to inconsistent GWAS findings. However, even larger studies combining multiple GWAS datasets reported ethnic disparities. There was only one SNP that reached genome-wide significance for the mixed ancestry dataset, but once the dataset was split into different populations, more prospective SNPs were found for the European and Asian populations separately [24]. Ancestral migrations, infectious disease selective pressures, genetic drift, and local adaptations have led to unique genetic susceptibility patterns among ethnic groups for TB. Thus, populations harbor distinct alleles modulating immune responses to MTB. Moreover, identical SNPs may be functionally distinct in individuals with different ancestry [59]. Such variability further complicates the replication of SNP–disease associations across ethnically diverse studies. Environmental factors additionally modulate the expression of genetic susceptibility. Factors such as nutrition, air quality, co-infections, and access to healthcare can mask or amplify the effect of genetic variants [44]. These variables add complexity to genetic studies of TB. The most plausible explanation for the poor reproducibility of GWAS results in TB susceptibility is therefore the high heterogeneity of the disease. Broader approaches, such as polygenic risk scores (PRS), may offer a more accurate and robust way to assess inherited susceptibility to TB across various populations.

4. Genetic Susceptibility to TB Is Caused by a Continuum Ranging from Monogenic to Polygenic Risk

Unlike severe congenital immunodeficiencies, TB in developed countries with access to effective treatment exerts only modest selective pressure. Even untreated, this disease typically progresses slowly over several years, gradually impairing quality of life rather than causing rapid mortality. Although maternal morbidity in individuals with active TB is increased 2.8-fold and perinatal mortality is 4.2 times higher, the majority of offspring survive, allowing for deleterious alleles to persist in the population without being effectively purged by selection [62].
However, in contexts of intense transmission and high mortality, strong negative selection may occur. For example, after TB was introduced to a Native American reservation in Saskatchewan in 1890, the first-year mortality rate reached 10% of the population. Over the subsequent three generations, more than half of the initial families perished. However, by 40 years after the epidemic began, annual mortality had dropped to 0.2%, consistent with a possible selective sweep that presumably removed highly susceptible individuals [63].
Genetic evidence of such selection was demonstrated for the TYK2 P1104A variant (rs34536443), identified as a monogenic risk factor for TB [64]. TYK2 encodes a non-receptor tyrosine kinase of the Janus kinase family, mediating signal transduction downstream of cytokines including IL-12, IL-23, IFN I, IL-6, IL-10, and IL-13. TYK2-deficient T cells display impaired responses to IL-12, IL-23, IFN-α, and IL-10, resulting in increased susceptibility to MTB and viral infections [65]. At the same time, the rs34536443 polymorphism is associated with protection against multiple autoimmune diseases, such as systemic lupus erythematosus and multiple sclerosis. However, individuals homozygous for P1104A are at a significantly increased risk for clinical TB progression due to IL-23-dependent antimycobacterial IFN-g immunity, which these individuals lack. Homozygosity for P1104A is the first common monogenic cause of TB that has been identified. The current frequency of the P1104A allele in European populations has significantly decreased compared to ancient European DNA samples. These findings suggest that negative selection against the TYK2 P1104A allele by endemic TB in Europe may have contributed to a slow genetic purge of this susceptibility allele during recent millennia [55,64].
Mendelian susceptibility to mycobacterial disease (MSMD) is a rare genetic disorder characterized by impaired immunity against intracellular pathogens, such as MTB, attenuated vaccine strains, and environmental mycobacteria in otherwise healthy individuals. Genetic variants in 19 genes (IFNG, IFNGR1, IFNGR2, STAT1, IL12B, IL12RB1, IL12RB2, IL23R, RORC, TBX21, IRF8, SPPL2A, ISG15, USP18, TYK2, JAK1, ZNFX1, NEMO, and CYBB) lead to MSMD and define 34 disorders that reflect high levels of allelic heterogeneity. The pathogenesis of MSMD depends mostly on the gene variant that leads to either insufficient production or inadequate response to interferon gamma (IFN-γ), which is an indispensable part of an efficient immune response against mycobacteria [66,67]. Biallelic variants in IL12RB1 are the most frequent genetic cause, being present in approximately 60% of patients diagnosed with MSMD [68].

5. The Potential and Challenges of PRS for TB

TB infection is a phenotypic marker of susceptibility that manifests when environmental and infectious exposures align with permissive host factors. The genetic causes can vary across populations and individuals due to immune system complexity, genetic heterogeneity, and environmental variability. Despite efforts, GWAS have not identified reproducible loci for TB sensitivity. These inconsistencies highlight the complex genetic architecture of TB sensitivity, whereby numerous common variants with small effects collectively influence susceptibility. More integrative approaches, such as PRS, aim to estimate genetic predisposition to a specific trait by aggregating the effects of numerous common genetic variants [69]. This metric is calculated as the weighted sum of risk alleles, with each allele representing an estimated contribution to disease risk. It is important to note that current PRS models for complex traits are primarily derived from populations of European ancestry. Their transferability to other populations, especially those most affected by TB, is a major unsolved challenge [70]. More sophisticated models of PRS estimation incorporate linkage disequilibrium to account for correlations between SNPs. It should be noted that unlike monogenic mutations with high penetrance, a high PRS is not a precise criterion for the presence of the selected trait. Rather, as illustrated in Figure 1, PRS provides a probabilistic stratification that may be particularly informative for tuberculosis, where the cumulative burden of many common variants rather than a single strong-effect allele underlies disease susceptibility. Indeed, PRSs facilitate genetic risk stratification by quantifying the cumulative burden of risk alleles.

6. Limited Application of PRSs to Infectious Diseases

PRSs are widely used to stratify genetic risk for non-communicable diseases—cardiovascular and metabolic disorders, multiple types of cancer, and autoimmune conditions such as rheumatoid arthritis and type 1 diabetes. In some cases, individuals in the top decile of PRS distribution exhibit risks comparable to those seen in monogenic disorders [71]. It should be noted that attempts are already being made to introduce PGS for ten diseases into routine clinical practice [72]. Importantly, PRS-based stratification enables targeted preventive interventions. For example, the onset of type 1 diabetes (T1D) may be prevented by immunomodulatory treatment such as teplizumab, a monoclonal anti-CD3 antibody, when administered at the preclinical stage identified by the presence of autoantibodies [73]. However, population-wide autoantibody screening is cost-inefficient. By contrast, pre-screening based on PRS enables the selection of a genetically high-risk subgroup for further immunological testing [74]. Notably, the T1D PRS model demonstrated that the top 10% of genetically at-risk individuals accounted for over 77% of all T1D cases [75].
Despite progress in other disease areas, PRSs remain largely unused in the context of infectious diseases. The PGS Catalog (https://www.pgscatalog.org/ (accessed on 1 August 2025)), a curated repository of published PRS models, shows a highly uneven distribution across disease classes (see Table 1). As of May 2025, infectious diseases were grossly underrepresented compared to, for example, cancer, cardiovascular disease, and digestive system disease.
This underrepresentation is likely attributable to several factors: the historical focus on monogenic germline errors of immunity, the complexity of incorporating pathogen exposure into genetic models, and the assumption that infections are primarily environmental rather than polygenic traits. However, TB demonstrates consistent evidence of heritable susceptibility and variability in immune response, as do other infections. TB does not follow a purely stochastic course but, rather, emerges at the intersection of exposure and host permissiveness. This provides a strong rationale for applying PRS-based modelling to TB. The lack of reproducible associations in GWAS should not be interpreted as a methodological failure but, rather, as an indicator of the underlying polygenic architecture. Many variants may have a small effect; each one is insufficient on its own, but they contribute to susceptibility when considered together. In such settings, PRSs offer a more realistic framework than attempts to identify “the TB susceptibility” gene. The field of infectious disease genetics may benefit from shifting toward cumulative models that do not rely on strong-effect variants alone.
Despite the well-documented genetic component of infectious disease susceptibility, no PRS for TB had been registered in the PGS Catalog as of May 2025. Meanwhile, hundreds of PRS models for non-communicable conditions have been developed and curated. This underrepresentation is also evident in the scientific literature. A PubMed query “(tuberculosis) AND (polygenic risk score)” returned twenty-nine articles, of which only one [76] was relevant. A broader search that included all infectious diseases was used with the following query: “(“communicable diseases”[MeSH Terms] OR (“communicable”[All Fields] AND “diseases”[All Fields]) OR “communicable diseases”[All Fields] OR (“infectious”[All Fields] AND “disease”[All Fields]) OR “infectious disease”[All Fields]) AND ((“polygenic”[All Fields] AND “risk”[All Fields] AND “score”[All Fields]) OR “polygenic risk score”[All Fields])”. This search yielded ninety-five papers, of which only four [77,78,79,80] are relevant to the application of PRS in infectious disease settings. This reflects both a conceptual gap and a missed opportunity to extend risk stratification tools into the domain of communicable diseases.
Despite the well-documented genetic component of infectious disease susceptibility, no PRS for TB had been registered in the PGS Catalog as of May 2025, which explains the ‘0’ entry in Table 2. At the same time, a few studies [43,72] have tested polygenic models for TB prediction, but these remain proof-of-concept and are not included in curated resources such as the PGS Catalog. One study showed that combining ten common genetic variants with basic clinical information, such as age, sex, and BMI, noticeably improved the ability to predict pulmonary TB development [44]. Another study revealed that variants in TLR1, TLR2, and MMP8 genes helped distinguish between acute and chronic forms of TB. Using these variants, the authors created a PRS that successfully classified individuals into the two clinical groups [76].

7. Shared Immunogenetic Components in PRS Across Infectious Diseases

Although PRSs are often specific to a particular trait, susceptibility to infectious diseases may share common genetic components, especially in genes involved in immune signaling. Such susceptibility may arise from impairments in genes encoding proteins with broad defensive functions. For example, one variant may disrupt epitope recognition, and another may interfere with innate immune signaling, both increasing vulnerability to multiple pathogens that rely on these mechanisms. At the same time, infections differ in tissue specificity and pathogenesis. Intestinal infections, for example, may be more frequent in subjects with genetically weakened local immunity of the gut, while respiratory infections may preferentially attack patients with weakened pulmonary defense. Therefore, the character “susceptibility to infection” should include both loci shared by many diseases and loci specific to certain pathogens.
A striking example comes from a study of the TLR1/6/10 locus, where the TLR10 N241H (rs11096957) polymorphism was associated with reduced monocyte activation compared to individuals without this SNP [81]. This highlights how variation in immune-related genes can alter the cellular response to infection and potentially shift disease outcomes. The authors developed an algorithm capable of inferring immune cell activation states from bulk RNA-seq data of patients infected with Salmonella. They then applied this model to longitudinal transcriptomic data from TB patients (Berry London: GSE107991; Berry South Africa: GSE107992; Leicester: GSE107994). Remarkably, the monocyte activation profile distinguished latent individuals who remained healthy from those who progressed to active TB already at baseline, before any clinical symptoms emerged. This illustrates how genetically programmed immune responses can influence infection trajectory across different diseases, supporting the notion that some components of PRS for infectious diseases may reflect shared underlying immunogenetic architecture.

8. Methodological Challenges in Constructing PRS for Infectious Diseases

The construction of PRS for infectious diseases requires a methodological rethinking, as the direct transfer of approaches successfully applied to non-infectious and chronic conditions will most likely be insufficient. This is due both to the biological nature of infection and to the architecture of interactions between host genes, the pathogen, and the environment.
First, infectious diseases rarely represent a single phenotype. For example, tuberculosis can be asymptomatic (latent), manifest as an active or disseminated form, have a chronic course, or cause rapid progression. Different forms of the disease may be associated with different sets of genetic factors. Therefore, when forming cohorts, it is important to account for clinical heterogeneity—combining all “TB” cases without specifying the forms leads to signal dilution and reduced association power.
Second, ethnic and population structure play a special role. Unlike oncology or diabetes, infectious diseases develop in the context of active interaction with a pathogen. There is compelling evidence of the co-evolution of MTB lineages and human populations. The same gene variant may have different effects on risk depending on the pathogen lineage it interacts with; analyzing without accounting for population stratification and the pathogen’s genetic background is methodologically vulnerable. Therefore, in the early stages of PRS analysis, it is preferable to use phenotypically and genetically homogeneous subsamples.
Third, predisposition to infectious diseases often has a pronounced polygenic nature: the contribution of an individual variant is small, but the cumulative effect across multiple SNPs can be significant. At the same time, different individuals may be susceptible for different reasons: some due to impaired innate immunity (e.g., signaling through TLR1/TLR2), and others due to adaptive immunity, or, say, features of inflammatory regulation. This requires careful SNP selection and a preference for methods that account for linkage disequilibrium (LD), such as PRS-CS [70], LDpred [82], lassosum [83], and other LD-aware approaches.
The fourth important aspect is the combination of genetic and non-genetic factors. Models that use only genetics may lack predictive power. Adding age, sex, HIV status, body mass index, socioeconomic status, and environmental variables can improve the results. This is especially important for infectious diseases whose prevalence is predetermined by external factors to a significant extent.
Finally, the effectiveness of PRS is proportional to the size and diversity of the training set. For most infectious diseases, including TB, such large cohorts with high-quality genetic and phenotypic data are extremely scarce today. The lack of multi-ethnic data is particularly acute, without which it is impossible to build universal models suitable for application outside a specific population. In this context, international consortia and multicenter initiatives play a particularly important role.

9. Conclusions

TB remains one of the most common infectious pathogens in the world despite the availability of effective treatment. Meanwhile, it is known that not all patients infected with MTB develop the active form of the disease: in the majority of cases, the disease is latent. Identification of people with increased genetic predisposition towards developing from latent to active infection would most helpfully impact early prevention and target medical attention; e.g., preventive treatment or intensified surveillance. TB is a classic multifactorial disease, where the contribution of individual genetic variants can be relatively small but the cumulative effect of multiple polymorphisms can be significant. This makes it a suitable strategy to use PRS as a risk prediction tool. If the aforementioned challenges can be overcome, a reliable PRS might substantially improve the effectiveness of screening programs, especially in endemic regions.
The development of PRS for TB has been previously hindered for various reasons. First, there was a lack of large, well-defined, and ethnically diverse cohorts available for genetic studies: TB occurs most frequently in resource-poor settings, where the infrastructure for large-scale genetic research is underdeveloped. Second, due to high ethnic heterogeneity, the genetic architecture of risk for TB susceptibility varies significantly between populations, complicating the creation of a universal scoring system. Third, TB exposure is highly dependent on environmental factors—social, economic, and epidemiological status could mask genetic correlations, reducing the predictive power of purely genetic models. Finally, in polygenic risk prediction, priority has long been given to diseases with more apparent genetic contribution to pathogenesis, i.e., cardiovascular diseases, diabetes, or cancer. However, in the new era of the accumulation of large genetic data, the advancement of multi-ethnic research, and the construction of methods for integrating genetic and clinical factors, it has become possible to check the validity of a PRS for TB susceptibility. Such a system could significantly increase the identification of probable future cases of the disease, maximize prevention, and, in the long term, contribute to reducing the global burden of TB, though this remains a speculative goal requiring further validation.

Author Contributions

Conceptualization, G.A.S. and A.A.D.; writing—original draft preparation, M.A.D., G.A.S. and A.A.D.; writing—review and editing, G.A.S., M.A.D. and A.A.D.; visualization, A.A.D.; supervision, G.A.S. and A.A.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Alene, M.; Yismaw, L.; Assemie, M.A.; Ketema, D.B.; Mengist, B.; Kassie, B.; Birhan, T.Y. Magnitude of asymptomatic COVID-19 cases throughout the course of infection: A systematic review and meta-analysis. PLoS ONE 2021, 16, e0249090. [Google Scholar] [CrossRef] [PubMed]
  2. Kriz, B.; Hubalek, Z.; Marek, M.; Daniel, M.; Strakova, P.; Betasova, L. Results of the Screening of Tick-Borne Encephalitis Virus Antibodies in Human Sera from Eight Districts Collected Two Decades Apart. Vector-Borne Zoonotic Dis. 2015, 15, 489–493. [Google Scholar] [CrossRef]
  3. Stjernberg, L.; Holmkvist, K.; Berglund, J. A newly detected tick-borne encephalitis (TBE) focus in south-east Sweden: A follow-up study of TBE virus (TBEV) seroprevalence. Scand. J. Infect. Dis. 2008, 40, 4–10. [Google Scholar] [CrossRef]
  4. Maikova, G.B.; Chernokhaeva, L.L.; Rogova, Y.V.; Kozlovskaya, L.I.; Kholodilov, I.S.; Romanenko, V.V.; Esyunina, M.S.; Ankudinova, A.A.; Kilyachina, A.S.; Vorovitch, M.F.; et al. Ability of inactivated vaccines based on far-eastern tick-borne encephalitis virus strains to induce humoral immune response in originally seropositive and seronegative recipients. J. Med. Virol. 2018, 91, 190–200. [Google Scholar] [CrossRef] [PubMed]
  5. Gilbert, A.T.; Petersen, B.W.; Recuenco, S.; Niezgoda, M.; Gómez, J.; Laguna-Torres, V.A.; Rupprecht, C. Evidence of rabies virus exposure among humans in the Peruvian Amazon. Am. J. Trop. Med. Hyg. 2012, 87, 206–215. [Google Scholar] [CrossRef] [PubMed]
  6. Follmann, E.H.; Ritter, D.G.; Beller, M. Survey of fox trappers in northern Alaska for rabies antibody. Epidemiol. Infect. 1994, 113, 137–141. [Google Scholar] [CrossRef]
  7. World Health Organization. Global Tuberculosis Report; World Health Organization: Geneva, Switzerland, 2024. [Google Scholar]
  8. O’Garra, A.; Redford, P.S.; McNab, F.W.; Bloom, C.I.; Wilkinson, R.J.; Berry, M.P.R. The immune response in tuberculosis. Annu. Rev. Immunol. 2013, 31, 475–527. [Google Scholar] [CrossRef]
  9. Cohen, A.; Mathiasen, V.D.; Schön, T.; Wejse, C. The global prevalence of latent tuberculosis: A systematic review and meta-analysis. Eur. Respir. J. 2019, 54, 1900655. [Google Scholar] [CrossRef]
  10. Eisenberg, J.N.; Desai, M.A.; Levy, K.; Bates, S.J.; Liang, S.; Naumoff, K.; Scott, J.C. Environmental determinants of infectious disease: A framework for tracking causal links and guiding public health research. Environ. Health Perspect. 2007, 115, 1216–1223. [Google Scholar] [CrossRef]
  11. Abel, L.; Dessein, A.J. The impact of host genetics on susceptibility to human infectious diseases. Curr. Opin. Immunol. 1997, 9, 509–516. [Google Scholar] [CrossRef]
  12. Borbour, S.E.; Nakashima, K.; Zhang, J.-B.; Tangada, S.; Hahn, C.-L.; Schenkein, H.A.; Tew, J.G. Tobacco and smoking: Environmental factors that modify the host response (immune system) and have an impact on periodontal health. Crit. Rev. Oral Biol. Med. 1997, 8, 437–460. [Google Scholar] [CrossRef] [PubMed]
  13. Tran, H.M.; Tsai, F.J.; Lee, Y.L.; Chang, J.H.; Chang, L.T.; Chang, T.Y.; Chung, K.F.; Kuo, H.P.; Lee, K.Y.; Chuang, K.J.; et al. The impact of air pollution on respiratory diseases in an era of climate change: A review of the current evidence. Sci. Total. Environ. 2023, 898, 166340. [Google Scholar] [CrossRef]
  14. Guarnieri, G.; Olivieri, B.; Senna, G.; Vianello, A. Relative Humidity and Its Impact on the Immune System and Infections. Int. J. Mol. Sci. 2023, 24, 9456. [Google Scholar] [CrossRef]
  15. Barrea, L.; Muscogiuri, G.; Frias-Toral, E.; Laudisio, D.; Pugliese, G.; Castellucci, B.; Garcia-Velasquez, E.; Savastano, S.; Colao, A. Nutrition and immune system: From the Mediterranean diet to dietary supplementary through the microbiota. Crit. Rev. Food Sci. Nutr. 2020, 61, 3066–3090. [Google Scholar] [CrossRef]
  16. Zefferino, R.; Di Gioia, S.; Conese, M. Molecular links between endocrine, nervous and immune system during chronic stress. Brain Behav. 2020, 11, e01960. [Google Scholar] [CrossRef] [PubMed]
  17. Brodin, P.; Jojic, V.; Gao, T.; Bhattacharya, S.; Angel, C.J.L.; Furman, D.; Shen-Orr, S.; Dekker, C.L.; Swan, G.E.; Butte, A.J.; et al. Variation in the human immune system is largely driven by non-heritable influences. Cell 2015, 160, 37–47. [Google Scholar] [CrossRef] [PubMed]
  18. Fox, G.J.; Orlova, M.; Schurr, E. Tuberculosis in Newborns: The Lessons of the “Lübeck Disaster” (1929–1933). PLoS Pathog. 2016, 12, e1005271. [Google Scholar] [CrossRef]
  19. Hill, H.G. Der erbeinfluss bei der tuberkulose (zwillingstuberkulose II). Eugen. Rev. 1937, 29, 207. [Google Scholar]
  20. Comstock, G. Tuberculosis in twins: A re-analysis of the Prophit survey. Am. Rev. Respir. Dis. 1978, 117, 621–624. [Google Scholar] [PubMed]
  21. Ndong Sima, C.A.A.; Smith, D.; Petersen, D.C.; Schurz, H.; Uren, C.; Möller, M. The immunogenetics of tuberculosis (TB) susceptibility. Immunogenetics 2022, 75, 215–230. [Google Scholar] [CrossRef] [PubMed]
  22. Lee, J.-Y.; Kwon, N.; Goo, G.-Y.; Cho, S.-I. Inadequate housing and pulmonary tuberculosis: A systematic review. BMC Public Health 2022, 22, 622. [Google Scholar] [CrossRef]
  23. Cardona, P.-J. Pathogenesis of tuberculosis and other mycobacteriosis. Enfermedades Infecc. Microbiol. Clin. 2018, 36, 38–46. [Google Scholar] [CrossRef]
  24. Schurz, H.; Naranbhai, V.; Yates, T.A.; Gilchrist, J.J.; Parks, T.; Dodd, P.J.; Möller, M.; Hoal, E.G.; Morris, A.P.; Hill, A.V.; et al. Multi-ancestry meta-analysis of host genetic susceptibility to tuberculosis identifies shared genetic architecture. eLife 2024, 13, e84394. [Google Scholar] [CrossRef]
  25. Riojas, M.A.; McGough, K.J.; Rider-Riojas, C.J.; Rastogi, N.; Hazbón, M.H. Phylogenomic analysis of the species of the mycobacterium tuberculosis complex demonstrates that mycobacterium africanum, mycobacterium bovis, mycobacterium caprae, mycobacterium microti and mycobacterium pinnipedii are later heterotypic synonyms of mycobacterium tuberculosis. Int. J. Syst. Evol. Microbiol. 2018, 68, 324–332. [Google Scholar] [CrossRef]
  26. Müller, S.J.; Schurz, H.; Tromp, G.; van der Spuy, G.D.; Hoal, E.G.; van Helden, P.D.; Owusu-Dabo, E.; Meyer, C.G.; Muntau, B.; Thye, T.; et al. A multi-phenotype genome-wide association study of clades causing tuberculosis in a Ghanaian- and South African cohort. Genomics 2021, 113, 1802–1815. [Google Scholar] [CrossRef]
  27. Correa-Macedo, W.; Cambri, G.; Schurr, E. The Interplay of Human and Mycobacterium Tuberculosis Genomic Variability. Front. Genet. 2019, 10, 865. [Google Scholar] [CrossRef]
  28. Salie, M.; van der Merwe, L.; Möller, M.; Daya, M.; van der Spuy, G.D.; van Helden, P.D.; Martin, M.P.; Gao, X.-J.; Warren, R.M.; Carrington, M.; et al. Associations between human leukocyte antigen class I variants and the mycobacterium tuberculosis subtypes causing disease. J. Infect. Dis. 2013, 209, 216–223. [Google Scholar] [CrossRef] [PubMed]
  29. Luo, T.; Comas, I.; Luo, D.; Lu, B.; Wu, J.; Wei, L.; Yang, C.; Liu, Q.; Gan, M.; Sun, G.; et al. Southern East Asian origin and coexpansion of Mycobacterium tuberculosis Beijing family with Han Chinese. Proc. Natl. Acad. Sci. USA 2015, 112, 8136–8141. [Google Scholar] [CrossRef] [PubMed]
  30. Gagneux, S.; DeRiemer, K.; Van, T.; Kato-Maeda, M.; de Jong, B.C.; Narayanan, S.; Nicol, M.; Niemann, S.; Kremer, K.; Gutierrez, M.C.; et al. Variable host–pathogen compatibility in Mycobacterium tuberculosis. Proc. Natl. Acad. Sci. USA 2006, 103, 2869–2873. [Google Scholar] [CrossRef]
  31. Otchere, I.D.; Coscollá, M.; Sánchez-Busó, L.; Asante-Poku, A.; Brites, D.; Loiseau, C.; Meehan, C.; Osei-Wusu, S.; Forson, A.; Laryea, C.; et al. Comparative genomics of Mycobacterium africanum Lineage 5 and Lineage 6 from Ghana suggests distinct ecological niches. Sci. Rep. 2018, 8, 11269. [Google Scholar] [CrossRef] [PubMed]
  32. Smith, C.M.; Baker, R.E.; Proulx, M.K.; Mishra, B.B.; Long, J.E.; Park, S.W.; Lee, H.-N.; Kiritsy, M.C.; Bellerose, M.M.; Olive, A.J.; et al. Host-pathogen genetic interactions underlie tuberculosis susceptibility in genetically diverse mice. eLife 2022, 11, e74419. [Google Scholar] [CrossRef] [PubMed]
  33. Thye, T.; Owusu-Dabo, E.; Vannberg, F.O.; Van Crevel, R.; Curtis, J.; Sahiratmadja, E.; Balabanova, Y.; Ehmen, C.; Muntau, B.; Ruge, G.; et al. Common variants at 11p13 are associated with susceptibility to tuberculosis. Nat. Genet. 2012, 44, 257–259. [Google Scholar] [CrossRef]
  34. Daya, M.; van der Merwe, L.; Gignoux, C.R.; van Helden, P.D.; Möller, M.; Hoal, E.G. Using multi-way admixture mapping to elucidate TB susceptibility in the South African Coloured population. BMC Genom. 2014, 15, 1021. [Google Scholar] [CrossRef]
  35. Schurz, H.; Kinnear, C.J.; Gignoux, C.; Wojcik, G.; van Helden, P.D.; Tromp, G.; Henn, B.; Hoal, E.G.; Möller, M. A sex-stratified genome-wide association study of tuberculosis using a multi-ethnic genotyping array. Front. Genet. 2019, 9, 678. [Google Scholar] [CrossRef]
  36. Chimusa, E.R.; Zaitlen, N.; Daya, M.; Möller, M.; van Helden, P.D.; Mulder, N.J.; Price, A.L.; Hoal, E.G. Genome-wide association study of ancestry-specific TB risk in the South African coloured population. Human Mol. Genet. 2013, 23, 796–809. [Google Scholar] [CrossRef]
  37. Curtis, J.; Luo, Y.; Zenner, H.L.; Cuchet-Lourenço, D.; Wu, C.; Lo, K.; Maes, M.; Alisaac, A.; Stebbings, E.; Liu, J.Z.; et al. Susceptibility to tuberculosis is associated with variants in the ASAP1 gene encoding a regulator of dendritic cell migration. Nat. Genet. 2015, 47, 523–527. [Google Scholar] [CrossRef]
  38. Miao, R.; Huang, S.; Li, C.; Ding, S.; Wang, R.; Xu, K.; Yang, C.; Xu, F.; Ge, H. An HLA class II locus, previously identified by a genome-wide association study, is also associated with susceptibility to pulmonary tuberculosis in a Chinese population. Infect. Genet. Evol. 2018, 64, 164–167. [Google Scholar] [CrossRef]
  39. Mahasirimongkol, S.; Yanai, H.; Mushiroda, T.; Promphittayarat, W.; Wattanapokayakit, S.; Phromjai, J.; Yuliwulandari, R.; Wichukchinda, N.; Yowang, A.; Yamada, N.; et al. Genome-wide association studies of tuberculosis in Asians identify distinct at-risk locus for young tuberculosis. J. Human Genet. 2012, 57, 363–367. [Google Scholar] [CrossRef]
  40. Qi, H.; Zhang, Y.-B.; Sun, L.; Chen, C.; Xu, B.; Xu, F.; Liu, J.-W.; Liu, J.-C.; Jiao, W.-W.; Shen, C.; et al. Discovery of susceptibility loci associated with tuberculosis in Han Chinese. Human Mol. Genet. 2017, 26, 4752–4763. [Google Scholar] [CrossRef] [PubMed]
  41. Thye, T.; Vannberg, F.O.; Wong, S.H.; Owusu-Dabo, E.; Osei, I.; Gyapong, J.; Hill, A.V.S. Genome-wide association analyses identifies a susceptibility locus for tuberculosis on chromosome 18q11.2. Nat. Genet. 2010, 42, 739–741. [Google Scholar] [CrossRef] [PubMed]
  42. Quistrebert, J.; Orlova, M.; Kerner, G.; Ton, L.T.; Luong, N.T.; Danh, N.T.; Vincent, Q.B.; Jabot-Hanin, F.; Seeleuthner, Y.; Bustamante, J.; et al. Genome-wide association study of resistance to Mycobacterium tuberculosis infection identifies a locus at 10q26.2 in three distinct populations. PLoS Genet. 2021, 17, e1009392. [Google Scholar] [CrossRef]
  43. Sveinbjornsson, G.; Gudbjartsson, D.F.; Halldorsson, B.V.; Kristinsson, K.G.; Gottfredsson, M.; Barrett, J.C.; Gudmundsson, L.J.; Blondal, K.; Gylfason, A.; Gudjonsson, S.A.; et al. HLA class II sequence variants influence tuberculosis risk in populations of European ancestry. Nat. Genet. 2016, 48, 318–322. [Google Scholar] [CrossRef]
  44. Hong, E.P.; Go, M.J.; Kim, H.-L.; Park, J.W. Risk prediction of pulmonary tuberculosis using genetic and conventional risk factors in adult Korean population. PLoS ONE 2017, 12, e0174642. [Google Scholar] [CrossRef]
  45. Li, M.; Hu, Y.; Zhao, B.; Chen, L.; Huang, H.; Huai, C.; Zhang, X.; Zhang, J.; Zhou, W.; Shen, L.; et al. A next generation sequencing combined genome-wide association study identifies novel tuberculosis susceptibility loci in Chinese population. Genomics 2021, 113, 2377–2384. [Google Scholar] [CrossRef]
  46. Luo, Y.; Suliman, S.; Asgari, S.; Amariuta, T.; Baglaenko, Y.; Martínez-Bonet, M.; Ishigaki, K.; Gutierrez-Arcelus, M.; Calderon, R.; Lecca, L.; et al. Early progression to active tuberculosis is a highly heritable trait driven by 3q23 in Peruvians. Nat. Commun. 2019, 10, 3765. [Google Scholar] [CrossRef]
  47. Zheng, R.; Li, Z.; He, F.; Liu, H.; Chen, J.; Chen, J.; Xie, X.; Zhou, J.; Chen, H.; Wu, X.; et al. Genome-wide association study identifies two risk loci for tuberculosis in Han Chinese. Nat. Commun. 2018, 9, 4072. [Google Scholar] [CrossRef] [PubMed]
  48. Grant, A.V.; Sabri, A.; Abid, A.; Abderrahmani Rhorfi, I.; Benkirane, M.; Souhi, H.; Naji Amrani, H.; Alaoui-Tahiri, K.; Gharbaoui, Y.; Lazrak, F.; et al. A genome-wide association study of pulmonary tuberculosis in Morocco. Human Genet. 2016, 135, 299–307. [Google Scholar] [CrossRef] [PubMed]
  49. Png, E.; Alisjahbana, B.; Sahiratmadja, E.; Marzuki, S.; Nelwan, R.; Balabanova, Y.; Nikolayevskyy, V.; Drobniewski, F.; Nejentsev, S.; Adnan, I.; et al. A genome wide association study of pulmonary tuberculosis susceptibility in Indonesians. BMC Med. Genet. 2012, 13, 5. [Google Scholar] [CrossRef] [PubMed]
  50. Tian, C.; Hromatka, B.S.; Kiefer, A.K.; Eriksson, N.; Noble, S.M.; Tung, J.Y.; Hinds, D.A. Genome-wide association and HLA region fine-mapping studies identify susceptibility loci for multiple common infections. Nat. Commun. 2017, 8, 599. [Google Scholar] [CrossRef]
  51. Ai, J.-W.; Zhang, H.; Zhou, Z.; Weng, S.; Huang, H.; Wang, S.; Shao, L.; Gao, Y.; Wu, J.; Ruan, Q.; et al. Gene expression pattern analysis using dual-color RT-MLPA and integrative genome-wide association studies of eQTL for tuberculosis suscepitibility. Respir. Res. 2021, 22, 23. [Google Scholar] [CrossRef]
  52. Möller, M.; Kinnear, C.J. Human global and population-specific genetic susceptibility to Mycobacterium tuberculosis infection and disease. Curr. Opin. Pulm. Med. 2020, 26, 302–310. [Google Scholar] [CrossRef] [PubMed]
  53. Harishankar, M.; Selvaraj, P.; Bethunaickan, R. Influence of genetic polymorphism towards pulmonary tuberculosis susceptibility. Front. Med. 2018, 5, 213. [Google Scholar] [CrossRef]
  54. Azad, A.K.; Sadee, W.; Schlesinger, L.S. Innate immune gene polymorphisms in tuberculosis. Infect. Immun. 2012, 80, 3343–3359. [Google Scholar] [CrossRef]
  55. Boisson-Dupuis, S.; Ramirez-Alejo, N.; Li, Z.; Patin, E.; Rao, G.; Kerner, G.; Lim, C.K.; Krementsov, D.N.; Hernandez, N.; Ma, C.S.; et al. Tuberculosis and impaired IL-23–dependent IFN-γ immunity in humans homozygous for a common TYK2 missense variant. Sci. Immunol. 2018, 3, eaau8714. [Google Scholar] [CrossRef]
  56. Koo, H.-K.; Min, J.; Kim, H.W.; Ko, Y.; Oh, J.Y.; Jeong, Y.-J.; Kang, H.H.; Kang, J.Y.; Lee, S.-S.; Seo, M.; et al. Cluster analysis categorizes five phenotypes of pulmonary tuberculosis. Sci. Rep. 2022, 12, 10084. [Google Scholar] [CrossRef]
  57. Stein, C.M.; Baker, A.R. Tuberculosis as a complex trait: Impact of genetic epidemiological study design. Mamm. Genome 2010, 22, 91–99. [Google Scholar] [CrossRef]
  58. Bruiners, N.; Schurz, H.; Daya, M.; Salie, M.; van Helden, P.D.; Kinnear, C.J.; Hoal, E.G.; Möller, M.; van Pittius, N.C.G. A regulatory variant in the C1Q gene cluster is associated with tuberculosis susceptibility and C1qA plasma levels in a South African population. Immunogenetics 2020, 72, 305–314. [Google Scholar] [CrossRef] [PubMed]
  59. Areeshi, M.Y.; Mandal, R.K.; Dar, S.A.; Jawed, A.; Wahid, M.; Lohani, M.; Panda, A.K.; Mishra, B.N.; Akhter, N.; Haque, S. IL-10-1082 A > G (rs1800896) polymorphism confers susceptibility to pulmonary tuberculosis in Caucasians but not in Asians and Africans: A meta-analysis. Biosci. Rep. 2017, 37. [Google Scholar] [CrossRef] [PubMed]
  60. Lyadova, I.V.; Tsiganov, E.N.; Kapina, M.A.; Shepelkova, G.S.; Sosunov, V.V.; Radaeva, T.V.; Majorov, K.B.; Shmitova, N.S.; Ham, H.-J.V.D.; Ganusov, V.V.; et al. In mice, tuberculosis progression is associated with intensive inflammatory response and the accumulation of Gr-1dim cells in the lungs. PLoS ONE 2010, 5, e10469. [Google Scholar] [CrossRef]
  61. Stepanikova, I.; Bateman, L.B.; Oates, G.R. Systemic Inflammation in Midlife: Race, Socioeconomic Status, and Perceived Discrimination. Am. J. Prev. Med. 2017, 52, S63–S76. [Google Scholar] [CrossRef]
  62. Sobhy, S.; Babiker, Z.; Zamora, J.; Khan, K.; Kunst, H. Maternal and perinatal mortality and morbidity associated with tuberculosis during pregnancy and the postpartum period: A systematic review and meta-analysis. BJOG Int. J. Obstet. Gynaecol. 2016, 124, 727–733. [Google Scholar] [CrossRef]
  63. Motulsky, A.G. Metabolic polymorphisms and the role of infectious diseases in human evolution. Human Biol. 1960, 32, 28–62. Available online: http://www.jstor.org/stable/41448424 (accessed on 24 September 2025).
  64. Kerner, G.; Laval, G.; Patin, E.; Boisson-Dupuis, S.; Abel, L.; Casanova, J.-L.; Quintana-Murci, L. Human ancient DNA analyses reveal the high burden of tuberculosis in Europeans over the last 2000 years. Am. J. Human Genet. 2021, 108, 517–524. [Google Scholar] [CrossRef]
  65. Gorman, J.A.; Hundhausen, C.; Kinsman, M.; Arkatkar, T.; Allenspach, E.J.; Clough, C.; West, S.E.; Thomas, K.; Eken, A.; Khim, S.; et al. The Tyk2-p1104A autoimmune protective variant limits coordinate signals required to generate specialized t cell subsets. Front. Immunol. 2019, 10, 44. [Google Scholar] [CrossRef] [PubMed]
  66. Boisson-Dupuis, S.; Bustamante, J. Mycobacterial diseases in patients with inborn errors of immunity. Curr. Opin. Immunol. 2021, 72, 262–271. [Google Scholar] [CrossRef] [PubMed]
  67. Peñafiel Vicuña, A.K.; Yamazaki Nakashimada, M.; León Lara, X.; Mendieta Flores, E.; Nuñez Núñez, M.E.; Lona-Reyes, J.C.; Nieto, L.H.; Vázquez, M.G.R.; Santos, J.B.; Iñiguez, Á.L.; et al. Mendelian Susceptibility to Mycobacterial Disease: Retrospective Clinical and Genetic Study in Mexico. J. Clin. Immunol. 2022, 43, 123–135. [Google Scholar] [CrossRef]
  68. Bustamante, J. Mendelian susceptibility to mycobacterial disease: Recent discoveries. Human Genet. 2020, 139, 993–1000. [Google Scholar] [CrossRef] [PubMed]
  69. Choi, S.W.; Mak, T.S.-H.; O’Reilly, P.F. Tutorial: A guide to performing polygenic risk score analyses. Nat. Protoc. 2020, 15, 2759–2772. [Google Scholar] [CrossRef]
  70. Kachuri, L.; Chatterjee, N.; Hirbo, J.; Schaid, D.J.; Martin, I.; Kullo, I.J.; Kenny, E.E.; Pasaniuc, B.; Witte, J.S.; Ge, T. Principles and methods for transferring polygenic risk scores across global populations. Nat. Rev. Genet. 2023, 25, 8–25. [Google Scholar] [CrossRef]
  71. Khera, A.V.; Chaffin, M.; Aragam, K.G.; Haas, M.E.; Roselli, C.; Choi, S.H.; Natarajan, P.; Lander, E.S.; Lubitz, S.A.; Ellinor, P.T.; et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 2018, 50, 1219–1224. [Google Scholar] [CrossRef]
  72. Lennon, N.J.; Kottyan, L.C.; Kachulis, C.; Abul-Husn, N.S.; Arias, J.; Belbin, G.; Below, J.E.; Berndt, S.I.; Chung, W.K.; Cimino, J.J.; et al. Selection, optimization and validation of ten chronic disease polygenic risk scores for clinical implementation in diverse US populations. Nat. Med. 2024, 30, 480–487. [Google Scholar] [CrossRef]
  73. Herold, K.C.; Gitelman, S.E.; Gottlieb, P.A.; Knecht, L.A.; Raymond, R.; Ramos, E.L. Teplizumab: A Disease-Modifying Therapy for Type 1 Diabetes That Preserves β-Cell Function. Diabetes Care 2023, 46, 1848–1856. [Google Scholar] [CrossRef]
  74. Akil, A.A.-S.; Yassin, E.; Al-Maraghi, A.; Aliyev, E.; Al-Malki, K.; Fakhro, K.A. Diagnosis and treatment of type 1 diabetes at the dawn of the personalized medicine era. J. Transl. Med. 2021, 19, 137. [Google Scholar] [CrossRef] [PubMed]
  75. Sharp, S.A.; Rich, S.S.; Wood, A.R.; Jones, S.E.; Beaumont, R.N.; Harrison, J.W.; Schneider, D.A.; Locke, J.M.; Tyrrell, J.; Weedon, M.N.; et al. Development and standardization of an improved type 1 diabetes genetic risk score for use in newborn screening and incident diagnosis. Diabetes Care 2019, 42, 200–207. [Google Scholar] [CrossRef] [PubMed]
  76. Lee, M.-R.; Chen, Y.-L.; Wu, C.-W.; Chen, L.-C.; Chang, L.-Y.; Chen, J.-Y.; Huang, Y.-T.; Wang, J.-Y.; Shih, J.-Y.; Yu, C.-J. Toll-like receptor and matrix metalloproteinase single-nucleotide polymorphisms, haplotypes, and polygenic risk score differentiated between tuberculosis disease and infection. Int. J. Infect. Dis. 2022, 125, 61–66. [Google Scholar] [CrossRef] [PubMed]
  77. Bodro, M.; Cervera, C.; Linares, L.; Suárez, B.; Llopis, J.; Sanclemente, G.; Casadó-Llombart, S.; Fernández-Ruiz, M.; Fariñas, M.C.; Cantisan, S.; et al. Polygenic Innate Immunity Score to Predict the Risk of Cytomegalovirus Infection in CMV D+/R- Transplant Recipients. A Prospective Multicenter Cohort Study. Front. Immunol. 2022, 13, 897912. [Google Scholar] [CrossRef]
  78. Adebamowo, S.N.; Adeyemo, A.; Adebayo, A.; Achara, P.; Alabi, B.; Bakare, R.A.; Famooto, A.O.; Obende, K.; Offiong, R.; Olaniyan, O.; et al. Genome, HLA and polygenic risk score analyses for prevalent and persistent cervical human papillomavirus (HPV) infections. Eur. J. Human Genet. 2024, 32, 708–716. [Google Scholar] [CrossRef]
  79. Pare, G.; Neupane, B.; Eskandarian, S.; Harris, E.; Halstead, S.; Gresh, L.; Kuan, G.; Balmaseda, A.; Villar, L.; Rojas, E.; et al. Genetic risk for dengue hemorrhagic fever and dengue fever in multiple ancestries. EBioMedicine 2020, 51, 102584. [Google Scholar] [CrossRef]
  80. Domdom, M.-A.; Brest, P.; Grosjean, I.; Roméo, B.; Landi, M.T.; Gal, J.; Klionsky, D.J.; Hofman, P.; Mograbi, B. A multifactorial score including autophagy for prognosis and care of COVID-19 patients. Autophagy 2020, 16, 2276–2281. [Google Scholar] [CrossRef]
  81. Ben-Moshe, N.B.; Hen-Avivi, S.; Levitin, N.; Yehezkel, D.; Oosting, M.; Joosten, L.A.B.; Netea, M.G.; Avraham, R. Predicting bacterial infection outcomes using single cell RNA-sequencing analysis of human immune cells. Nat. Commun. 2019, 10, 3266. [Google Scholar] [CrossRef]
  82. Vilhjálmsson, B.J.; Yang, J.; Finucane, H.K.; Gusev, A.; Lindström, S.; Ripke, S.; Genovese, G.; Loh, P.-R.; Bhatia, G.; Do, R.; et al. Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores. Am. J. Human Genet. 2015, 97, 576–592. [Google Scholar] [CrossRef] [PubMed]
  83. Mak, T.S.H.; Porsch, R.M.; Choi, S.W.; Zhou, X.; Sham, P.C. Polygenic scores via penalized regression on summary statistics. Genet. Epidemiol. 2017, 41, 469–480. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Visualization of possible PRS distribution among cases and healthy controls. (A) Dot plot of individual PRS values for cases (red) and healthy controls (blue). Each point represents one individual; points are jittered vertically for clarity. The dashed green vertical line indicates the 90th percentile threshold (top 10% PRS). (B) Kernel density estimates of PRS distributions for cases and controls. The distribution for cases is shifted rightward, indicating higher genetic risk scores in this group. The dashed green line marks the top 10% PRS cutoff. (C) Jitter plot of PRS values for all individuals grouped by status, showing the spread and overlap of scores between cases and controls. (D) Jitter plot of PRS values restricted to individuals within the top 10% PRS percentile. This highlights the enrichment of cases among high-risk individuals. Although this is a generic example, it illustrates why PRSs are relevant for tuberculosis: while no single variant confers a strong effect, the cumulative distribution of many variants can stratify individuals by their probability of progressing from latent infection to active TB.
Figure 1. Visualization of possible PRS distribution among cases and healthy controls. (A) Dot plot of individual PRS values for cases (red) and healthy controls (blue). Each point represents one individual; points are jittered vertically for clarity. The dashed green vertical line indicates the 90th percentile threshold (top 10% PRS). (B) Kernel density estimates of PRS distributions for cases and controls. The distribution for cases is shifted rightward, indicating higher genetic risk scores in this group. The dashed green line marks the top 10% PRS cutoff. (C) Jitter plot of PRS values for all individuals grouped by status, showing the spread and overlap of scores between cases and controls. (D) Jitter plot of PRS values restricted to individuals within the top 10% PRS percentile. This highlights the enrichment of cases among high-risk individuals. Although this is a generic example, it illustrates why PRSs are relevant for tuberculosis: while no single variant confers a strong effect, the cumulative distribution of many variants can stratify individuals by their probability of progressing from latent infection to active TB.
Ijms 26 09544 g001
Table 2. Number of polygenic scores (PGS) by disease category (PGS Catalog, May 2025).
Table 2. Number of polygenic scores (PGS) by disease category (PGS Catalog, May 2025).
Cancer746 PGS
Cardiovascular disease413 PGS
Digestive system disorder430 PGS
Immune system disorder232 PGS
Infectious disease8 PGS (https://www.pgscatalog.org/trait/EFO_0005741/) (accessed on 1 August 2025)
COVID-193 PGS
Tuberculosis0 PGS
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dashian, M.A.; Shipulin, G.A.; Deviatkin, A.A. Genetic Susceptibility to Tuberculosis and the Utility of Polygenic Scores in Population Stratification. Int. J. Mol. Sci. 2025, 26, 9544. https://doi.org/10.3390/ijms26199544

AMA Style

Dashian MA, Shipulin GA, Deviatkin AA. Genetic Susceptibility to Tuberculosis and the Utility of Polygenic Scores in Population Stratification. International Journal of Molecular Sciences. 2025; 26(19):9544. https://doi.org/10.3390/ijms26199544

Chicago/Turabian Style

Dashian, Mariia A., German A. Shipulin, and Andrei A. Deviatkin. 2025. "Genetic Susceptibility to Tuberculosis and the Utility of Polygenic Scores in Population Stratification" International Journal of Molecular Sciences 26, no. 19: 9544. https://doi.org/10.3390/ijms26199544

APA Style

Dashian, M. A., Shipulin, G. A., & Deviatkin, A. A. (2025). Genetic Susceptibility to Tuberculosis and the Utility of Polygenic Scores in Population Stratification. International Journal of Molecular Sciences, 26(19), 9544. https://doi.org/10.3390/ijms26199544

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop