Previous Article in Journal
Acclimation Time Enhances Adaptation of Heterotrophic Nitrifying-Aerobic Denitrifying Microflora to Linear Anionic Surfactant Stress
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Immunoinformatics Predictions on Variable Mycobacterium tuberculosis Lineage 6 T Cell Epitopes and HLA Interactions in West Africa

by
Marta L. Silva
1,2,
Nuno S. Osório
3,4,* and
Margarida Saraiva
1,5,*
1
i3S—Instituto de Investigação e Inovação em Saúde, 4200-135 Porto, Portugal
2
Doctoral Program in Molecular and Cellular Biology, Instituto de Ciências Biomédicas Abel Salazar (ICBAS), Universidade do Porto, 4050-313 Porto, Portugal
3
Life and Health Sciences Research Institute (ICVS), School of Medicine, Campus Gualtar, University of Minho, 4710-057 Braga, Portugal
4
ICVS/3B’s—PT Government Associate Laboratory, 4710-057 Braga, Portugal
5
IBMC—Instituto de Biologia Molecular e Celular, Universidade do Porto, 4200-135 Porto, Portugal
*
Authors to whom correspondence should be addressed.
Microorganisms 2025, 13(5), 1032; https://doi.org/10.3390/microorganisms13051032 (registering DOI)
Submission received: 4 April 2025 / Revised: 24 April 2025 / Accepted: 26 April 2025 / Published: 29 April 2025
(This article belongs to the Section Molecular Microbiology and Immunology)

Abstract

:
Tuberculosis (TB), caused by Mycobacterium tuberculosis (Mtb), remains a global health challenge. The human-adapted TB-causing bacteria are distributed into ten lineages with distinct global distributions and clinical outcomes. Mtb lineages 4 (L4) and L6 are good prototypes of these differences, because L4 is globally prevalent, whereas L6 is geographically restricted to West Africa and associated with slower disease progression. Given the fundamental role of T cells for the control of TB, we questioned whether Mtb L4 or L6 antigens and HLA interactions would be disrupted in West African hosts. Here, we selected variable and validated antigens and demonstrate their expression during in vivo Mtb L4 or L6 infections. We then compared the predicted number of IFN-γ-inducing and HLA high-binding-affinity peptides in Mtb ancestral, L4, or L6 proteins, considering HLA alleles of high or low frequency in West Africa. Our immunoinformatics approach predicts that non-synonymous substitutions of high variance in Mtb L6 strains diminish binding affinities to HLA alleles prevalent in West African populations, suggesting specific adaptations of these strains to their preferred hosts. Future functional studies will advance our knowledge on lineage-specific evolution and inform strategies to enhance TB control in endemic regions.

1. Introduction

Tuberculosis (TB) remains a health problem, with an estimated 10.8 million new cases and 1.25 million deaths reported in 2023 alone [1] The causative agent of TB, Mycobacterium tuberculosis (Mtb), is an obligate human pathogen that likely originated thousands of years ago [2,3,4] and that belongs to the Mycobacterium tuberculosis complex (MTBC) [5]. Selective pressures from host immune responses and environmental factors have driven the diversification of Mtb into ten phylogenetic lineages (L1–L10) [5,6] that show a striking geographic structuring, even in the face of increased human mobility and globalization [3,5]. For example, MTBC L4 is globally distributed [7], displaying high incidence across Europe, Africa, and the Americas, while L2 dominates in East Asia [8]. In contrast, MTBC L5, L6, and L9 (formerly known as Mycobacterium africanum [9]) are largely restricted to specific regions of West and East Africa [10,11,12]. In these regions, MTBC L5, L6, and L9 may co-exist with globally spread lineages, most notably L4, without evidence of being out-competed [11,12].
Host–Mtb sympatry highlights the adaptation of the pathogen to specific hosts, and may be explained by different mechanisms, including co-evolution [13], genetic drift [14], and host restriction [15,16,17]. Several studies reported specific association of Mtb L5 infections with the Ewe ethnic group [18,19,20], with an effect visible on macrophage responses [21,22]. Furthermore, Mtb strains of L6 displayed reduced infectivity in allopatric macrophages and reduced transmission in allopatric populations [15] despite similar transmission rates in its West African niche [23]. A major gap in knowledge relates to the impact of Mtb–host sympatric adaptations in T cell responses, which are key players for TB control in humans and experimental models [24,25]. Mtb T cell epitopes are generally hyperconserved, a feature that may however be disrupted in Mtb L6, as T cell epitopes in Mtb strains of this lineage are more variable, as shown by a significantly higher average nucleotide diversity and a greater number of single nucleotide polymorphisms (SNPs) between sequences [10,26,27]. Furthermore, T cells from TB patients infected with Mtb L6 strains showed reduced production of key cytokines, such as interferon (IFN)-γ and interleukin (IL)-2, in response to antigens like ESAT-6 and CFP-10 [28], which are critical for robust T cell-mediated immunity [29,30,31]. This reduced immune activation may potentially explain the ability of Mtb L6 to cause TB disease despite being a slow-growing pathogen [32,33]. On the other hand, it could also lead to decreased inflammation and less tissue damage, allowing the pathogen to persist in the host while minimizing detection.
We hypothesized that the higher antigen variability in Mtb L6 may reflect an evolutionary adaptation of this lineage to evade immune recognition by human leukocyte antigen (HLA) molecules predominant in West African populations. This hypothesis would help to explain the geographic restriction of Mtb L6, whilst also fitting with the high sympatry of Mtb L6 pathogens and their West African host niche [11,12]. Leveraging the potential of immunoinformatics as a transformative tool in immunology [34,35,36], we focused on a set of seven experimentally validated antigens [27], previously described as variable in Mtb L6 genomic sequences, and conducted in silico analyses of HLA binding affinity and predictions of functional immunogenicity. The Mtb L6 variant proteins were predicted to associate with a reduction in the number of high-binding-affinity peptides to both HLA class I and class II alleles of high frequency in West African populations. In contrast, no predicted trend for increased or reduced binding affinity was detected for the Mtb L4 version of the candidate proteins, irrespective of the tested HLA. These in silico findings warrant future experimental validation so that they may potentially impact the development of more efficient TB vaccines, informed by host–pathogen interactions.

2. Materials and Methods

2.1. Ethics Statement

Animal experiments complied with the ARRIVE guidelines, followed the 2010/63/EU Directive and were approved by the i3S Animal Ethics Committee and the Portuguese National Authority for Animal Health (DGAV; #018413/2021-11-24).

2.2. Animal Housing and Infection

C57BL/6 mice were bred and housed at the i3s animal facility. Males and females of 8 to 11 weeks old were used for infections, and maintained under contention conditions in the animal biosafety level 3 facility at i3s, at a controlled temperature (20–24 °C) and humidity (45–65%), and a light cycle of 12 h (light/dark). Water and food were provided ad libitum. Mice were infected with Mtb (clinical isolates from L4 or L6) through the aerosol route using an inhalation exposure system (Glas-Col) [37,38]. To determine the dose of infection, the bacterial load in the lungs of 3 mice was quantified 3 days post-infection. Infected mice were weighed every week. Mice were euthanized by CO2 inhalation at the indicated time points.

2.3. Lung Processing, CFU Determination, and mRNA Analysis

Lungs were aseptically excised and processed as previously described [37,38]. Briefly, lungs were digested with Collagenase D (Roche, Basel, Switzerland, Cat. #11088882001) followed by physical disruption and filtration through 70 μM cell strainers (Falcon, Corning, NY, USA Cat. #352350.0). Cell suspensions were used for colony forming unit (CFU) determination and mRNA analysis. Lung cell suspensions were lysed with saponin serial dilutions prepared and plated in Middlebrook 7H11 agar (BD Biosciences, Franklin Lakes, NJ, USA, Cat. #212203.0) supplemented with OADC and PANTA (BD BioSciences, Franklin Lakes, NJ, USA, Cat. #245114). CFUs were enumerated after 21–28 days of incubation at 37 °C. Total RNA was extracted from infected mouse lungs using TRIzol reagent (GRiSP, Porto, Portugal, Cat. #GB23.0100) according to the manufacturer’s instructions. Briefly, samples were stored in TRIzol at −80 °C, homogenized, and subjected to phase separation using chloroform. RNA was precipitated with isopropanol, washed in 70% ethanol, and resuspended in RNase/DNase-free water. Glycogen was used as a carrier to enhance RNA yield. RNA concentration and purity were assessed by Nanodrop spectrophotometry. For qPCR analysis, cDNA was synthesized with the SuperScript First-Strand Synthesis System using Random Primer Mix (60 µM) for RT-PCR (ThermoScientific, Waltham, MA, USA, Cat. #E6300L). Target gene mRNA expression was quantified by real-time PCR, using SYBR green (Biorad, Hercules, CA, USA, Cat. #1725121). Gene expression was normalized to SigA. Oligonucleotide sequences are listed in Supplementary Table S1. Data from the qPCR (Figure 2) were analyzed using GraphPad Prism software v10.1.1. Differences were considered significant at p ≤ 0.05 and represented as follows: * p ≤ 0.05; ** p ≤ 0.01; and *** p ≤ 0.001.

2.4. Dataset and Genome Sequence Analysis

Publicly available MTBC genome sequences from Napier et al. were utilized [39]. A subset of 247 genomes, comprehensively representing Mtb L6 and diverse MTBC lineages, was selected. Raw sequencing reads were downloaded from the NCBI SRA and processed using a standardized bioinformatic pipeline. This pipeline included quality control with BBDuk (version 39.01); mapping to the inferred Mtb ancestral reference genome sequence (MTB_anc.fasta) derived from Comas et al. [2] using BWA-MEM (version 0.7.17); duplicate marking with samtools markdup (version 1.17); and targeted variant calling focused on specific regions of interest using bcftools mpileup (parameters --min-MQ 30, --min-BQ 20) and bcftools call (version 1.17). Variant filtering was applied using bcftools filter (version 1.17) to remove variants flagged as “LowQual” (criteria: QUAL < 15). Functional annotation of variants was performed using bcftools csq (version 1.17) with the GFF3 annotation file corresponding to the Mtb H37Rv reference genome (NC_000962.3).

2.5. Variant Allele Frequency Analysis and Visualization

Variants identified within the target genes of interest were extracted from the annotated VCF files. For each variant position in each sample, the variant allele frequency (VAF) was calculated using the DP4 (depth per allele) field from the VCF output of bcftools mpileup. This VAF represents the proportion of reads supporting the alternative allele at a given variant position. VAF values were analyzed using Python (version 3.12.2) with the pandas (version 2.2.3) and numpy (version 2.2.4) libraries. To identify variants differing significantly in frequency between lineages, the mean VAF per genomic position was calculated separately for Mtb L6 and L4 isolates, utilizing the lineage classifications provided in [40]. The difference between the mean VAF in Mtb L6 and the mean VAF in Mtb L4 was computed for each variant position. Variants were then ranked based on this VAF difference (VAF_Difference_L6_L4) to identify positions with the largest frequency discrepancies. Distributions of VAF values for selected high-difference variants across specific lineages were visualized using heatmaps generated with the seaborn (version 0.13.2) and matplotlib (version 3.10.1) libraries in Python. These heatmaps displayed the mean VAF per variant position (columns) for each lineage (rows).

2.6. IFNepitope Analysis

The IFNepitope server (https://webs.iiitd.edu.in, IFNepitope, module “Scan”, accessed on 23 March 2025) [40] was utilized to predict IFN-γ inducing epitopes for Mtb L6 and L4 candidate gene variants and the corresponding ancestral sequences, considering a length of 15 residues. This tool enables the identification of IFN-γ-inducing epitopes from a set of peptide sequences or a peptide library [41]. The search model employed was the support vector machine (SVM) motif hybrid, which integrates the SVM-based model with motif contributions relevant to IFN-γ induction.

2.7. HLA Selection for T Cell Epitope Prediction

Allele frequency data were retrieved from the publicly available database allelefrequencies.net to identify HLA alleles of high or low frequency in West Africa, as compared to other regions (Europe, North America, and Asia). In parallel, data from the Immune Epitope Database (IEDB; www.iedb.org) were consulted to confirm the availability of experimental binding data and to aid in the verification of allele representation in the target populations.

2.8. T Cell Epitope Prediction

Peptides of different lengths (8 to 14 mers for HLA class I and 9 to 18 mers for HLA class II) were generated for each candidate antigen. NetMHCpan-4.1b and NetMHCIIpan-4.0 were used to predict the binding potential of ancestral, Mtb L4, and Mtb L6 peptide variants to the selected HLAs. These machine learning-based tools were trained with a comprehensive number of mass spectrometry eluted ligands and have been considered the best overall predictors [42,43,44], namely in the HLA-DR locus [44]. HLA-binders were selected based on the half maximal inhibitory concentrations (IC50). IC50 < 50 nM was used as the cutoff for high-binding-affinity peptides.

3. Results and Discussion

3.1. Selection of Highly Variable Antigens in Mtb L6 Strains

Given the sympatry between Mtb isolates belonging to L6 and their West African hosts [11,12,15], we hypothesized that specific host–pathogen interactions may be in place and mediated by alterations in epitope recognition by HLAs. To investigate this hypothesis, we followed the approach summarized in Figure 1.
Figure 1. Schematic representation of the study. Mtb, Mycobacterium tuberculosis. HLA, human leucocyte antigen. Antigens were retrieved from [27].
Figure 1. Schematic representation of the study. Mtb, Mycobacterium tuberculosis. HLA, human leucocyte antigen. Antigens were retrieved from [27].
Microorganisms 13 01032 g001
We reasoned that particular epitope–HLA interactions would be more likely for the rare variable Mtb antigens than for the more common hyperconserved ones. Thus, we selected for our study seven proteins previously identified as rare variant epitopes across 216 Mtb strains and experimentally validated as immunogenic in samples from individuals with active TB [27]. Although epitopes for the selected proteins had been tested for their immunogenicity in human samples [27], whether the corresponding genes were expressed during Mtb L6 infections had not been tested. We aerosol-infected C57BL/6 mice with a Mtb clinical isolate belonging to L6 [33] or with one belonging to L4 [45]. Analyses were performed at day 120 or day 30 post-infection for the Mtb L6 or Mtb L4 infection, respectively (Figure 2A). At these time points, the infections reach a chronic phase with stabilized bacterial burdens. As expected from previous studies [33], the bacterial burden in the slow-progressing Mtb L6-infected mice was lower than that observed in Mtb L4-infected mice (Figure 2B). The transcriptional analysis of the candidate genes by qRT-PCR showed their in vivo expression, irrespective of the Mtb isolate used (Figure 2C). The pattern of expression was similar for all tested genes, with the exception of Rv0990c, which at the indicated time points displayed higher expression in the lungs of mice infected with the Mtb L6 isolate as compared to Mtb L4 infections (Figure 2C). While B6 mice may not be the best models to study TB pathogenesis, our findings show that the selected genes are expressed in the lungs of Mtb-infected mice in the presence of a protective immune response, which positions them as possible natural antigens.
The genes encoding these seven proteins were inspected for genomic variants using a collection of 247 MTBC genomes from [39] that comprehensively represent L6 and diverse MTBC lineages. For each sample and genomic position within these genes, we determined the presence of variants and calculated their VAF. Mean VAF analysis across all lineages revealed a predominantly low VAF across the majority of positions (Supplementary Table S2), consistent with the expected conservation of T cell epitopes in Mtb [26,27]. To specifically highlight variants with differential prevalence between L6 and L4, we ranked genomic positions based on the difference in mean VAF between these two lineages. To visualize the top 10 L6 versus L4 VAF patterns, we generated a heatmap (Figure 2D) displaying mean VAF for each genomic position (columns) across different MTBC lineage level 3 classifications (rows). The color intensity in each cell corresponds to the mean VAF for that position within that lineage group, ranging from dark blue for low VAF to bright yellow for high VAF (Figure 2D). Four genomic mutations identified as high mean VAF in L6 versus L4 strains, or vice versa, led to non-synonymous amino acid substitutions in the corresponding proteins (Figure 2D). Genomic positions 1107897 (containing the SNP resulting in the A68V substitution in protein Rv0990c), 2122380 (containing the SNP resulting in the L258V substitution in protein LldD2/Rv1872c), and 3597581 (containing the SNP resulting in the Q62H substitution in protein Rv3221c) exhibited markedly elevated mean VAF values in several L6 sub-lineages (e.g., 6, 6.1, 6.2, 6.3) compared to other lineages shown in the heatmap (Figure 2D). On the other hand, the non-synonymous substitutions in Rv0010c, Rv0012, RimJ, and P8L in Rv00990c exhibited elevated mean VAF values in only one of the L6 sub-lineages (Figure 2D). Thus, while the selected proteins are generally conserved across MTBC, specific genomic positions within these candidate genes display increased variability specifically within L6 strains. These positions, characterized by higher VAF in L6, may represent sites under positive selection within this lineage, potentially influencing epitope presentation and subsequent T cell recognition within West African populations. Regarding the mutations detected as high mean VAF in L4 versus L6, we also identified some that are L4-specific and others that also span other lineages, most commonly L2 and L3 (Figure 2D).
Figure 2. Expression and selection of variant antigens. (A) Experimental infection model. C57BL/6 mice were infected via aerosol with Mtb lineage 4 or lineage 6, and at the indicated time points, lung bacterial burdens (B) and candidate gene expression (C) were assessed. Data are represented as mean ± SEM. Each dot represents an individual mouse. Statistical analysis was performed using 2-tailed unpaired Student’s t test. (D) Heat map representing the genomic regions with the highest and lowest mean variant allele frequencies (VAFs) in Mtb L6 compared to Mtb L4. L, lineage. Mtb, Mycobacterium tuberculosis. MTBC, Mycobacterium tuberculosis complex. Ns represents synonymous substitutions. Differences were considered significant at p ≤ 0.05 and represented as follows: ** p ≤ 0.01; and *** p ≤ 0.001.
Figure 2. Expression and selection of variant antigens. (A) Experimental infection model. C57BL/6 mice were infected via aerosol with Mtb lineage 4 or lineage 6, and at the indicated time points, lung bacterial burdens (B) and candidate gene expression (C) were assessed. Data are represented as mean ± SEM. Each dot represents an individual mouse. Statistical analysis was performed using 2-tailed unpaired Student’s t test. (D) Heat map representing the genomic regions with the highest and lowest mean variant allele frequencies (VAFs) in Mtb L6 compared to Mtb L4. L, lineage. Mtb, Mycobacterium tuberculosis. MTBC, Mycobacterium tuberculosis complex. Ns represents synonymous substitutions. Differences were considered significant at p ≤ 0.05 and represented as follows: ** p ≤ 0.01; and *** p ≤ 0.001.
Microorganisms 13 01032 g002

3.2. Immunogenicity Predictions for the Variant Epitopes

We then assessed the predicted potential of the ancestral, Mtb L4, and Mtb L6 variant proteins to elicit IFN-γ responses. We used the IFNepitope tool to predict IFN-γ-inducing peptides (15 amino acid residues) across the 10 highest or lowest VAFs in Mtb L6 strains, causing non-synonymous substitutions (Figure 2D). Of the top 10 L6 VAFs, Rv0010c and TB7.3 displayed no predicted IFN-γ-inducing epitopes that included the mutated sites. The pattern for the other candidate L6 variants varied, with a loss of predicted IFN-γ-inducing epitopes in RimJ (Figure 3A) and LldD2 (Figure 3B), a gain of epitopes in the case of Rv0012 (Figure 3C), and no change for Rv0990c (Figure 3D). In the case of the variants with a low VAF in Mtb L6, a mixed pattern of increase and decrease was detected for LldD2, depending on the amino acid substitution (Figure 3B), while Rv0012 showed a predicted decrease (Figure 3C), and an increase in the number of predicted IFN-γ-inducing peptides was detected for the Mtb L4 variants of Rv0990c (Figure 3D) and Rv2719c (Figure 3E). Therefore, no specific pattern of gain or loss of predicted IFN-γ-inducing epitopes was detected as being common across each protein or for each lineage. This suggests that any potential functional impact of the antigen variants may be primarily dependent on the binding of the variable peptide to the host HLA. Importantly, HLA genes are highly polymorphic in human populations and several alleles have been previously associated with TB susceptibility in different populations [46,47,48,49,50,51,52,53,54]. Thus, it is possible that the binding affinity of the variable antigens to the matching host HLAs may vary, resulting in altered T cell responses.

3.3. HLA Binding Affinity Predictions for Mtb L6 Epitopes

To investigate whether the peptide variants of highest or lowest prevalence in Mtb L6 versus L4 would impact their predicted interactions with host HLAs, we started by retrieving a comprehensive set of HLA alleles, based on the geographic restriction of Mtb L6 to West Africa [10,11,12] and the global prevalence of L4 [7]. From the 10387 HLA class I alleles available in NetMHCpan-4.1, and the 660 HLA-DRB alleles (class II) available in NetMHCIIpan-4.0, we selected those with
  • high frequency in at least one of the five West African countries (Guinea-Bissau, Mali, Senegal, The Gambia, and Burkina Faso), but not at a global level;
  • high frequency in Europe, North America, and Asia, and low frequency in West African countries.
We excluded HLA-C class I alleles due to their limited polymorphism and lower expression levels (about 90% less) compared to the other two loci [55,56,57]. Within the HLA class II -DR locus, only the DRB1 alleles were selected, as they bind most Mtb epitopes and were previously predicted as displaying a five-times-higher expression in comparison to other alleles [58]. HLA-DP and HLA-DQ class II loci were not considered for the study due to the lack of experimental binding data available [59]. Thus, we selected five HLA-A and five HLA-B class I alleles (Table 1), and six HLA-DRB1 class II alleles (Table 2) of high or low frequency in West Africa.
Using computational predictions from NetMHCpan-4.1 and NetMHCIIpan-4.0, we assessed the number of high-binding-affinity epitopes (IC50 < 50 nM) of the ancestral or variant protein for each selected HLA. Different patterns of variation were detected between the Mtb L6 or L4 variants and the ancestral haplotype, depending on the HLA/peptide pair. Non-synonymous variants of Rv0010c, which were only detected for Mtb L6, did not impact the number of high-binding-affinity epitopes predicted for class I HLAs but reduced the predicted number of high-binding-affinity epitopes for HLA class II alleles, independently of them having high or low prevalence in West Africa (Table 3). In the case of RimJ, the identified non-synonymous variant of L6 did not impact the predicted number of high-binding-affinity epitopes to either class of HLA (Table 3). The last Mtb L6 unique non-synonymous mutation, in TB7.3, led to a loss in the predicted number of high-binding-affinity epitopes only to HLA class I and only for those highly prevalent in West Africa (Table 3). Regarding Rv0012, Rv0990c, and LldD2, the three proteins showing non-synonymous mutations in both Mtb L6 and Mtb L4 strains, a pattern of loss of the predicted number of high-binding-affinity epitopes for the Mtb L6 variants to HLA class I and II alleles of high prevalence in West Africa was detected (Table 3). However, this pattern was not recapitulated for HLAs alleles of low frequency in West Africa, nor for the Mtb L4 strain variants (Table 3). Finally, no difference in the number of high-binding-affinity epitopes was predicted for the Rv2719c variant identified for the Mtb L4 strains as compared to the ancestral protein (Table 3). In summary, the epitope substitutions with high VAF in Mtb L6 were predicted to display an overall reduced binding affinity to both HLA class I and II alleles of high prevalence in West Africa. This pattern was not visible in the case of HLA alleles of low prevalence in West Africa, nor for Mtb L4 variants independently of the HLA allele frequency.
Thus, the findings so far support the hypothesis that Mtb L6 strains may have evolved to persist within a specific host genetic landscape, potentially explaining its confinement to West Africa. Of note, although we did not detect a difference in the number of high-binding-affinity peptides for RimJ when comparing Mtb L6 and the ancestral sequences, our in silico analysis predicted a loss of IFN-γ-producing peptides in the case of the Mtb L6 protein (Figure 3A). Similarly, a loss of IFN-γ-producing peptides was predicted for the Mtb L6 variant of LldD2 (Figure 3B). However, this loss of IFN-γ-producing peptides was not seen for Rv0012 and Rv0990c (Figure 3C,D), despite the decrease in the number of predicted high-binding-affinity peptides (Table 3). It is possible that different levels of adaptation exist to modulate T cell responses and confer a selective advantage by reducing immune detection while maintaining sufficient infectivity to ensure transmission within sympatric populations.
Given our analysis suggesting a possible match between variants of Rv0012, Rv0990c, and LldD2 in Mtb L6 strains and the populational HLA alleles, we next inspected with more detail the predictions obtained for these proteins with respect to the individual HLAs tested. In the case of Rv0012 Mtb L6 variants, the loss of two HLA class I high-binding-affinity peptides was caused by the mutation E234Q, specifically on HLA-B*15:03 and 35:01, both of high prevalence in West Africa (Figure 4A). This mutation in Rv0012 was also predicted to decrease the binding of peptides to HLA class II alleles HLA-DRB1*11:01 and HLA-DRB1*11:02, of high frequency in West Africa, and HLA-DRB1*07:01, of low frequency in West Africa (Figure 4D). In this protein, the Mtb L4 variants affected several HLA class I and II alleles, of high or low frequency in West Africa, with a global gain of predicted high-affinity peptides (Figure 4A,D). With respect to LldD2, the Mtb L6 variant L258V was predicted to display less high-binding-affinity epitopes to the high-frequency HLA class I alleles HLA-A*02:01 and HLA-A*02:02 and the class II HLA-DRB1*11:02 allele, whereas it associated with an increase in the number of high-binding-affinity peptides for the HLA-B*58:01 (class I) allele (Figure 4B,E). Mtb L6 variants of LldD2 did not impact the predicted binding to HLA class I or II alleles of low frequency in West Africa (Figure 4B,E), whereas Mtb L4 variants of this protein impacted the binding predictions to several HLA alleles, again with no specific pattern between high- or low-prevalence alleles and with a net gain in predicted high-binding-affinity epitopes (Figure 4B,E). Finally, the mutations affecting the Mtb L6 protein Rv0990c had only a minor effect on HLA class I (HLA-A*02:02) binding predictions (Figure 4C). However, both Mtb L6 and L4 Rv0990c variants widely impacted the predicted number of high-binding-affinity peptides in the context of HLA class II, specifically those of high prevalence in West Africa (Figure 4F). In both cases, a predicted loss in high-binding-affinity epitopes was detected (Figure 4F). Only the Mtb L6 variant of Rv0990c affected the binding to HLA class II alleles of low frequency in West Africa, in this case increasing the number of high-binding-affinity peptides (Figure 4F). Of note, the variants of Rv0012 (E234Q and G258D) and P8L of Rv0990c are represented in sub-lineage 6.2.3, whereas variants A68V of Rv0990c and L258V of LldD2 are broadly represented in L6 (Figure 2D). Therefore, it is tempting to speculate that the variant L258V of LldD2 might have the widest impact on the interaction of Mtb L6 strains with West African hosts. Future functional studies will need to take this potentially differential effect of antigen mutations in distinct sub-lineages into account.
Globally, whereas variations of Mtb L6 proteins in the predicted number of HLA class I high-affinity peptides did not affect a particular allele (within those selected for the study), in the case of HLA class II, the Mtb L6 mutations analyzed for the three selected antigens commonly affected the HLA-DRB1*11:02 allele, being invariably associated with a reduction in the number of high-affinity predicted epitopes. It would be interesting in the future to investigate whether individuals bearing this HLA class II allele show differences in susceptibility to TB disease or in the type and quality of CD4 T cell responses, particularly upon infection by Mtb L6 strains. Interestingly, in a study performed in East Asia, the HLA-DRB1*11 gene polymorphism was suggested to be a protective factor for TB risk [60]. Despite the many studies investigating possible association of HLA genetic variants with TB susceptibility [46,47,48,49,50,51,52,53,54], the results remain inconsistent. Our data highlight the genetic background of the infecting Mtb as a variable to consider in these studies. Supporting this view, a study conducted in 682 TB patients and 836 healthy controls in Thailand found an association of the HLA-DRB1∗09:01 and HLA-DQB1∗03:03 alleles with TB susceptibility in patients infected with modern Mtb strains and suggested strain specific susceptibility to TB [61]. Also relevant, only a few studies on HLA association with TB were performed in individuals of African ancestry, despite the high burden of TB in Africa [1].
Although outside of the scope of this study, experimental validation of the reported predictions is needed to confirm their functional impact. Future studies employing in vitro assays with peripheral blood mononuclear cells from West African individuals typed for specific HLA alleles will be crucial to elucidate the extent to which the variant Mtb L6 proteins may modulate immune responses in a natural infection setting. Moreover, expanding our analysis to additional Mtb antigens under positive selection in L6 could reveal broader immune evasion strategies that contribute to its epidemiological niche. Our study is based on the extensively validated pan-specific tools NetMHCpan and NetMHCIIpan. Also important as a valuable avenue for future work will be the development of a dedicated QSAR model.

4. Conclusions

The immunoinformatics-driven analysis presented in this study offers insights into the potential host–pathogen interactions in place between Mtb L6 strains and their preferred hosts. Focusing on a rare subset of highly variant Mtb antigens, we revealed possible alterations in binding patterns of Mtb L6 antigen variants to host HLAs that may underlie the restricted geographic distribution of Mtb L6. Further understanding the functional impact of these findings may have important implications for TB research, from vaccine design to genetic susceptibility studies, specifically those involving TB caused by Mtb L6 strains. It will be important in the future to experimentally validate the predictions presented in this study. That will include testing the immunogenicity of the Mtb L4 and L6 variant antigens in cells from TB patients stratified for the infecting bacteria (i.e., Mtb L4 or L6) and for their HLA allele. Also important will be the application of other in silico tools, such as QSAR, to improve the power of our predictions. Furthermore, integrating in silico predictions with high-throughput in vitro epitope screens or single-cell TCR repertoire analyses will expand our understanding of how antigenic variation shapes T cell recognition across populations. Coupling HLA–epitope binding data with genome-wide association studies (GWAS) holds the potential to uncover host genetic markers of susceptibility or resistance to Mtb L6, guiding lineage- or region-specific vaccine design and precision medicine strategies. Together, these approaches underscore the need to account for both pathogen diversity and host genetic background when developing next-generation TB control measures. Combined, this knowledge can also guide researchers in identifying broad-spectrum antigens that promote appropriate immune responses in different populations, increasing the effectiveness of universal vaccines. Collectively, this will contribute to improve TB control efforts in endemic regions.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/microorganisms13051032/s1, Table S1: List of oligonucleotides used in this study; Table S2: Mean VAF analysis across all lineages.

Author Contributions

M.L.S.: conceptualization, data curation, formal analysis, methodology, writing. N.S.O.: conceptualization, data curation, formal analysis, supervision, writing. M.S.: conceptualization, funding acquisition, project administration, supervision, writing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by La Caixa Foundation, grant HR21-00415 to M.S. M.L.S. is funded by Fundação Ciência e Tecnologia (FCT), under the reference PhD scholarship 2020.05061.BD.

Institutional Review Board Statement

Animal experiments complied with the ARRIVE guidelines, followed the 2010/63/EU Directive and were approved by the i3S Animal Ethics Committee and the Portuguese National Authority for Animal Health (DGAV; #018413/2021-11-24).

Data Availability Statement

The data on variant allele frequencies are available in Supplementary Table S2. All genome sequences used for analyses were previously deposited online [39].

Acknowledgments

We thank the i3S scientific platform Animal House and Ana I. Fernandes for the help with the in vivo experiments; and Tiago Beites and Gil Castro for critically reading the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CFUColony forming unit
HLAHuman leucocyte antigen
ICInhibitory concentration
IFNInterferon
ILInterleukin
LLineage
MtbMycobacterium tuberculosis
MTBCMycobacterium tuberculosis complex
SNPsSingle nucleotide polymorphism
SVMSupport vector machine
TBTuberculosis
VAFVariant allele frequency

References

  1. WHO. Global Tuberculosis Report; WHO: Geneva, Switzerland, 2023. [Google Scholar]
  2. Comas, I.; Coscolla, M.; Luo, T.; Borrell, S.; Holt, K.E.; Kato-Maeda, M.; Parkhill, J.; Malla, B.; Berg, S.; Thwaites, G.; et al. Out-of-Africa Migration and Neolithic Coexpansion of Mycobacterium Tuberculosis with Modern Humans. Nat. Genet. 2013, 45, 1176–1182. [Google Scholar] [CrossRef]
  3. Gagneux, S. Ecology and Evolution of Mycobacterium Tuberculosis. Nat. Rev. Microbiol. 2018, 16, 202–213. [Google Scholar] [CrossRef] [PubMed]
  4. Orgeur, M.; Sous, C.; Madacki, J.; Brosch, R. Evolution and Emergence of Mycobacterium Tuberculosis. FEMS Microbiol. Rev. 2024, 48, fuae006. [Google Scholar] [CrossRef]
  5. Goig, G.A.; Windels, E.M.; Loiseau, C.; Stritt, C.; Biru, L.; Borrell, S.; Brites, D.; Gagneux, S. Ecology, Global Diversity and Evolutionary Mechanisms in the Mycobacterium Tuberculosis Complex. Nat. Rev. Microbiol. 2025. [Google Scholar] [CrossRef] [PubMed]
  6. Guyeux, C.; Senelle, G.; Le Meur, A.; Supply, P.; Gaudin, C.; Phelan, J.E.; Clark, T.G.; Rigouts, L.; de Jong, B.; Sola, C.; et al. Newly Identified Mycobacterium Africanum Lineage 10, Central Africa. Emerg. Infect. Dis. 2024, 30, 560–563. [Google Scholar] [CrossRef]
  7. Stucki, D.; Brites, D.; Jeljeli, L.; Coscolla, M.; Liu, Q.; Trauner, A.; Fenner, L.; Rutaihwa, L.; Borrell, S.; Luo, T.; et al. Mycobacterium Tuberculosis Lineage 4 Comprises Globally Distributed and Geographically Restricted Sublineages. Nat. Genet. 2016, 48, 1535–1543. [Google Scholar] [CrossRef] [PubMed]
  8. van Soolingen, D.; Qian, L.; de Haas, P.E.; Douglas, J.T.; Traore, H.; Portaels, F.; Qing, H.Z.; Enkhsaikan, D.; Nymadawa, P.; van Embden, J.D. Predominance of a Single Genotype of Mycobacterium Tuberculosis in Countries of East Asia. J. Clin. Microbiol. 1995, 33, 3234–3238. [Google Scholar] [CrossRef]
  9. Riojas, M.A.; McGough, K.J.; Rider-Riojas, C.J.; Rastogi, N.; Hazbón, M.H. Phylogenomic Analysis of the Species of the Mycobacterium Tuberculosis Complex Demonstrates That Mycobacterium Africanum, Mycobacterium Bovis, Mycobacterium Caprae, Mycobacterium Microti and Mycobacterium Pinnipedii Are Later Heterotypic Synonyms of Mycob. Int. J. Syst. Evol. Microbiol. 2018, 68, 324–332. [Google Scholar] [CrossRef]
  10. Coscolla, M.; Gagneux, S.; Menardo, F.; Loiseau, C.; Ruiz-Rodriguez, P.; Borrell, S.; Otchere, I.D.; Asante-Poku, A.; Asare, P.; Sánchez-Busó, L.; et al. Phylogenomics of Mycobacterium Africanum Reveals a New Lineage and a Complex Evolutionary History. Microb. Genom. 2021, 7, 000477. [Google Scholar] [CrossRef]
  11. Silva, M.L.; Cá, B.; Osório, N.S.; Rodrigues, P.N.S.; Maceiras, A.R.; Saraiva, M. Tuberculosis Caused by Mycobacterium Africanum: Knowns and Unknowns. PLoS Pathog. 2022, 18, e1010490. [Google Scholar] [CrossRef]
  12. de Jong, B.C.; Antonio, M.; Gagneux, S. Mycobacterium Africanum-Review of an Important Cause of Human Tuberculosis in West Africa. PLoS Negl. Trop. Dis. 2010, 4, e744. [Google Scholar] [CrossRef] [PubMed]
  13. Brites, D.; Gagneux, S. Co-Evolution of Mycobacterium Tuberculosis and Homo Sapiens. Immunol. Rev. 2015, 264, 6–24. [Google Scholar] [CrossRef]
  14. Hershberg, R.; Lipatov, M.; Small, P.M.; Sheffer, H.; Niemann, S.; Homolka, S.; Roach, J.C.; Kremer, K.; Petrov, D.A.; Feldman, M.W.; et al. High Functional Diversity in Mycobacterium Tuberculosis Driven by Genetic Drift and Human Demography. PLoS Biol. 2008, 6, e311. [Google Scholar] [CrossRef]
  15. Gröschel, M.I.; Pérez-Llanos, F.J.; Diel, R.; Vargas, R.; Escuyer, V.; Musser, K.; Trieu, L.; Meissner, J.S.; Knorr, J.; Klinkenberg, D.; et al. Differential Rates of Mycobacterium Tuberculosis Transmission Associate with Host–Pathogen Sympatry. Nat. Microbiol. 2024, 9, 2113–2127. [Google Scholar] [CrossRef]
  16. Gehre, F.; Antonio, M.; Faïhun, F.; Odoun, M.; Uwizeye, C.; de Rijk, P.; de Jong, B.C.; Affolabi, D. The First Phylogeographic Population Structure and Analysis of Transmission Dynamics of M. Africanum West African 1—Combining Molecular Data from Benin, Nigeria and Sierra Leone. PLoS ONE 2013, 8, 8–13. [Google Scholar] [CrossRef]
  17. Yeboah-Manu, D.; de Jong, B.C.; Gehre, F. The Biology and Epidemiology of Mycobacterium Africanum. Adv. Exp. Med. Biol. 2017, 1019, 117–133. [Google Scholar] [CrossRef]
  18. Asante-Poku, A.; Yeboah-Manu, D.; Otchere, I.D.; Aboagye, S.Y.; Stucki, D.; Hattendorf, J.; Borrell, S.; Feldmann, J.; Danso, E.; Gagneux, S. Mycobacterium Africanum Is Associated with Patient Ethnicity in Ghana. PLoS Negl. Trop. Dis. 2015, 9, e3370. [Google Scholar] [CrossRef] [PubMed]
  19. Otchere, I.D.; Coscolla, M.; Sánchez-Busó, L.; Asante-Poku, A.; Brites, D.; Loiseau, C.; Meehan, C.J.; Osei-Wusu, S.; Forson, A.; Laryea, C.; et al. Comparative Genomics of Mycobacterium Africanum Lineage 5 and Lineage 6 from Ghana Suggests Distinct Ecological Niches. Sci. Rep. 2018, 8, 11269. [Google Scholar] [CrossRef] [PubMed]
  20. Asante-Poku, A.; Otchere, I.D.; Osei-Wusu, S.; Sarpong, E.; Baddoo, A.; Forson, A.; Laryea, C.; Borrell, S.; Bonsu, F.; Hattendorf, J.; et al. Molecular Epidemiology of Mycobacterium Africanum in Ghana. BMC Infect. Dis. 2016, 16, 385. [Google Scholar] [CrossRef]
  21. Hiza, H.; Zwyer, M.; Hella, J.; Arbués, A.; Sasamalo, M.; Borrell, S.; Xu, Z.M.; Ross, A.; Brites, D.; Fellay, J.; et al. Bacterial Diversity Dominates Variable Macrophage Responses of Tuberculosis Patients in Tanzania. Sci. Rep. 2024, 14, 9287. [Google Scholar] [CrossRef]
  22. Osei-Wusu, S.; Tetteh, J.K.A.; Musah, A.B.; Ntiamoah, D.O.; Arthur, N.; Adjei, A.; Arbues, A.; Ofori, E.A.; Mensah, K.A.; Galevo, S.E.A.; et al. Macrophage Susceptibility to Infection by Ghanaian Mycobacterium Tuberculosis Complex Lineages 4 and 5 Varies with Self-Reported Ethnicity. Front. Cell. Infect. Microbiol. 2023, 13, 1163993. [Google Scholar] [CrossRef] [PubMed]
  23. de Jong, B.C.; Hill, P.C.; Aiken, A.; Awine, T.; Antonio, M.; Adetifa, I.M.; Jackson-Sillah, D.J.; Fox, A.; Deriemer, K.; Gagneux, S.; et al. Progression to Active Tuberculosis, but Not Transmission, Varies by Mycobacterium Tuberculosis Lineage in The Gambia. J. Infect. Dis. 2008, 198, 1037–1043. [Google Scholar] [CrossRef]
  24. Scriba, T.J.; Maseeme, M.; Young, C.; Taylor, L.; Leslie, A.J. Immunopathology in Human Tuberculosis. Sci. Immunol. 2024, 9, eado5951. [Google Scholar] [CrossRef] [PubMed]
  25. O’Garra, A.; Redford, P.S.; McNab, F.W.; Bloom, C.I.; Wilkinson, R.J.; Berry, M.P.R. The Immune Response in Tuberculosis. Annu. Rev. Immunol. 2013, 31, 475–527. [Google Scholar] [CrossRef]
  26. Comas, I.; Chakravartti, J.; Small, P.M.; Galagan, J.; Niemann, S.; Kremer, K.; Ernst, J.D.; Gagneux, S. Human T Cell Epitopes of Mycobacterium Tuberculosis are Evolutionarily Hyperconserved. Nat. Genet. 2010, 42, 498–503. [Google Scholar] [CrossRef]
  27. Coscolla, M.; Copin, R.; Sutherland, J.; Gehre, F.; de Jong, B.; Owolabi, O.; Mbayo, G.; Giardina, F.; Ernst, J.D.; Gagneux, S.M. Tuberculosis T Cell Epitope Analysis Reveals Paucity of Antigenic Variation and Identifies Rare Variable TB Antigens. Cell Host Microbe 2015, 18, 538–548. [Google Scholar] [CrossRef]
  28. Tientcheu, L.D.; Sutherland, J.S.; de Jong, B.C.; Kampmann, B.; Jafali, J.; Adetifa, I.M.; Antonio, M.; Dockrell, H.M.; Ota, M.O. Differences in T-Cell Responses between Mycobacterium Tuberculosis and Mycobacterium Africanum-Infected Patients. Eur. J. Immunol. 2014, 44, 1387–1398. [Google Scholar] [CrossRef] [PubMed]
  29. Philips, J.A.; Ernst, J.D. Tuberculosis Pathogenesis and Immunity. Annu. Rev. Pathol. 2012, 7, 353–384. [Google Scholar] [CrossRef]
  30. Orme, I.M.; Andersen, P.; Boom, W.H. T Cell Response to Mycobacterium Tuberculosis. J. Infect. Dis. 1993, 167, 1481–1497. [Google Scholar] [CrossRef]
  31. Roach, D.R.; Bean, A.G.D.; Demangel, C.; France, M.P.; Briscoe, H.; Britton, W.J. TNF Regulates Chemokine Induction Essential for Cell Recruitment, Granuloma Formation, and Clearance of Mycobacterial Infection. J. Immunol. 2002, 168, 4620–4627. [Google Scholar] [CrossRef]
  32. Gehre, F.; Otu, J.; DeRiemer, K.; de Sessions, P.F.; Hibberd, M.L.; Mulders, W.; Corrah, T.; de Jong, B.C.; Antonio, M. Deciphering the Growth Behaviour of Mycobacterium Africanum. PLoS Negl. Trop. Dis. 2013, 7, e2220. [Google Scholar] [CrossRef]
  33. Cá, B.; Fonseca, K.L.; Sousa, J.; Maceiras, A.R.; Machado, D.; Sanca, L.; Rabna, P.; Rodrigues, P.N.S.; Viveiros, M.; Saraiva, M. Experimental Evidence for Limited in Vivo Virulence of Mycobacterium Africanum. Front. Microbiol. 2019, 10, 2102. [Google Scholar] [CrossRef] [PubMed]
  34. Yu, J.; Peng, J.; Chi, H. Systems Immunology: Integrating Multi-Omics Data to Infer Regulatory Networks and Hidden Drivers of Immunity. Curr. Opin. Syst. Biol. 2019, 15, 19–29. [Google Scholar] [CrossRef]
  35. Chatanaka, M.K.; Ulndreaj, A.; Sohaei, D.; Prassas, I. Immunoinformatics: Pushing the Boundaries of Immunology Research and Medicine. ImmunoInformatics 2022, 5, 100007. [Google Scholar] [CrossRef]
  36. Wang, X.; Fan, D.; Yang, Y.; Gimple, R.C.; Zhou, S. Integrative Multi-Omics Approaches to Explore Immune Cell Functions: Challenges and Opportunities. iScience 2023, 26, 106359. [Google Scholar] [CrossRef]
  37. Maceiras, A.R.; Silvério, D.; Gonçalves, R.; Cardoso, M.S.; Saraiva, M. Infection with Hypervirulent Mycobacterium Tuberculosis Triggers Emergency Myelopoiesis but Not Trained Immunity. Front. Immunol. 2023, 14, 1211404. [Google Scholar] [CrossRef] [PubMed]
  38. Fonseca, K.L.; Maceiras, A.R.; Matos, R.; Simoes-Costa, L.; Sousa, J.; Cá, B.; Barros, L.; Fernandes, A.I.; Mereiter, S.; Reis, R.; et al. Deficiency in the Glycosyltransferase Gcnt1 Increases Susceptibility to Tuberculosis through a Mechanism Involving Neutrophils. Mucosal Immunol. 2020, 13, 836–848. [Google Scholar] [CrossRef] [PubMed]
  39. Napier, G.; Campino, S.; Merid, Y.; Abebe, M.; Woldeamanuel, Y.; Aseffa, A.; Hibberd, M.L.; Phelan, J.; Clark, T.G. Robust Barcoding and Identification of Mycobacterium Tuberculosis Lineages for Epidemiological and Clinical Studies. Genome Med. 2020, 12, 114. [Google Scholar] [CrossRef]
  40. Dhanda, S.K.; Vir, P.; Raghava, G.P.S. Designing of Interferon-Gamma Inducing MHC Class-II Binders. Biol. Direct 2013, 8, 30. [Google Scholar] [CrossRef]
  41. Dhall, A.; Patiyal, S.; Raghava, G.P.S. A Hybrid Method for Discovering Interferon-Gamma Inducing Peptides in Human and Mouse. Sci. Rep. 2024, 14, 26859. [Google Scholar] [CrossRef]
  42. Yang, Y.; Wei, Z.; Cia, G.; Song, X.; Pucci, F.; Rooman, M.; Xue, F.; Hou, Q. MHCII-Peptide Presentation: An Assessment of the State-of-the-Art Prediction Methods. Front. Immunol. 2024, 15, 1293706. [Google Scholar] [CrossRef] [PubMed]
  43. Trevizani, R.; Yan, Z.; Greenbaum, J.A.; Sette, A.; Nielsen, M.; Peters, B. A Comprehensive Analysis of the IEDB MHC Class-I Automated Benchmark. Brief. Bioinform. 2022, 23, bbac259. [Google Scholar] [CrossRef]
  44. Reynisson, B.; Alvarez, B.; Paul, S.; Peters, B.; Nielsen, M. NetMHCpan-4.1 and NetMHCIIpan-4.0: Improved Predictions of MHC Antigen Presentation by Concurrent Motif Deconvolution and Integration of MS MHC Eluted Ligand Data. Nucleic Acids Res. 2020, 48, W449–W454. [Google Scholar] [CrossRef]
  45. Sousa, J.; Cá, B.; Maceiras, A.R.; Simões-Costa, L.; Fonseca, K.L.; Fernandes, A.I.; Ramos, A.; Carvalho, T.; Barros, L.; Magalhães, C.; et al. Mycobacterium Tuberculosis Associated with Severe Tuberculosis Evades Cytosolic Surveillance Systems and Modulates IL-1β Production. Nat. Commun. 2020, 11, 1949. [Google Scholar] [CrossRef] [PubMed]
  46. Medhasi, S.; Chantratita, N. Human Leukocyte Antigen (HLA) System: Genetics and Association with Bacterial and Viral Infections. J. Immunol. Res. 2022, 2022, 9710376. [Google Scholar] [CrossRef]
  47. Harishankar, M.; Selvaraj, P.; Bethunaickan, R. Influence of Genetic Polymorphism Towards Pulmonary Tuberculosis Susceptibility. Front. Med. 2018, 5, 213. [Google Scholar] [CrossRef]
  48. Oliveira-Cortez, A.; Melo, A.C.; Chaves, V.E.; Condino-Neto, A.; Camargos, P. Do HLA Class II Genes Protect against Pulmonary Tuberculosis? A Systematic Review and Meta-Analysis. Eur. J. Clin. Microbiol. Infect. Dis. Off. Publ. Eur. Soc. Clin. Microbiol. 2016, 35, 1567–1580. [Google Scholar] [CrossRef] [PubMed]
  49. Li, C.-P.; Zhou, Y.; Xiang, X.; Zhou, Y.; He, M. Relationship of HLA-DRB1 Gene Polymorphism with Susceptibility to Pulmonary Tuberculosis: Updated Meta-Analysis. Int. J. Tuberc. Lung Dis. Off. J. Int. Union Against Tuberc. Lung Dis. 2015, 19, 841–849. [Google Scholar] [CrossRef]
  50. Li, C.; Zhou, Y.; Xiang, X.; Zhou, Y.; He, M. The Relationship of HLA-DQ Alleles with Tuberculosis Risk: A Meta-Analysis. Lung 2015, 193, 521–530. [Google Scholar] [CrossRef]
  51. Yuliwulandari, R.; Sachrowardi, Q.; Nakajima, H.; Kashiwase, K.; Hirayasu, K.; Mabuchi, A.; Sofro, A.S.M.; Tokunaga, K. Association of HLA-A, -B, and -DRB1 with Pulmonary Tuberculosis in Western Javanese Indonesia. Hum. Immunol. 2010, 71, 697–701. [Google Scholar] [CrossRef]
  52. Kettaneh, A.; Seng, L.; Tiev, K.P.; Tolédano, C.; Fabre, B.; Cabane, J. Human Leukocyte Antigens and Susceptibility to Tuberculosis: A Meta-Analysis of Case-Control Studies. Int. J. Tuberc. Lung Dis. Off. J. Int. Union Against Tuberc. Lung Dis. 2006, 10, 717–725. [Google Scholar]
  53. Vejbaesya, S.; Chierakul, N.; Luangtrakool, K.; Srinak, D.; Stephens, H.A.F. Associations of HLA Class II Alleles with Pulmonary Tuberculosis in Thais. Eur. J. Immunogenet. Off. J. Br. Soc. Histocompat. Immunogenet. 2002, 29, 431–434. [Google Scholar] [CrossRef]
  54. Brahmajothi, V.; Pitchappan, R.M.; Kakkanaiah, V.N.; Sashidhar, M.; Rajaram, K.; Ramu, S.; Palanimurugan, K.; Paramasivan, C.N.; Prabhakar, R. Association of Pulmonary Tuberculosis and HLA in South India. Tubercle 1991, 72, 123–132. [Google Scholar] [CrossRef] [PubMed]
  55. Snary, D.; Barnstable, C.J.; Bodmer, W.F.; Crumpton, M.J. Molecular Structure of Human Histocompatibility Antigens: The HLA-C Series. Eur. J. Immunol. 1977, 7, 580–585. [Google Scholar] [CrossRef]
  56. McCutcheon, J.A.; Gumperz, J.; Smith, K.D.; Lutz, C.T.; Parham, P. Low HLA-C Expression at Cell Surfaces Correlates with Increased Turnover of Heavy Chain MRNA. J. Exp. Med. 1995, 181, 2085–2095. [Google Scholar] [CrossRef] [PubMed]
  57. Kulkarni, S.; Savan, R.; Qi, Y.; Gao, X.; Yuki, Y.; Bass, S.E.; Martin, M.P.; Hunt, P.; Deeks, S.G.; Telenti, A.; et al. Differential MicroRNA Regulation of HLA-C Expression and Its Association with HIV Control. Nature 2011, 472, 495–498. [Google Scholar] [CrossRef] [PubMed]
  58. Davila, J.; McNamara, L.A.; Yang, Z. Comparison of the Predicted Population Coverage of Tuberculosis Vaccine Candidates Ag85B-ESAT-6, Ag85B-TB10.4, and Mtb72f via a Bioinformatics Approach. PLoS ONE 2012, 7, e40882. [Google Scholar] [CrossRef]
  59. Karosiene, E.; Rasmussen, M.; Blicher, T.; Lund, O.; Buus, S.; Nielsen, M. NetMHCIIpan-3.0, a Common Pan-Specific MHC Class II Prediction Method Including All Three Human MHC Class II Isotypes, HLA-DR, HLA-DP and HLA-DQ. Immunogenetics 2013, 65, 711–724. [Google Scholar] [CrossRef]
  60. Tong, X.; Chen, L.; Liu, S.; Yan, Z.; Peng, S.; Zhang, Y.; Fan, H. Polymorphisms in HLA-DRB1 Gene and the Risk of Tuberculosis: A Meta-Analysis of 31 Studies. Lung 2015, 193, 309–318. [Google Scholar] [CrossRef]
  61. Toyo-oka, L.; Mahasirimongkol, S.; Yanai, H.; Mushiroda, T.; Wattanapokayakit, S.; Wichukchinda, N.; Yamada, N.; Smittipat, N.; Juthayothin, T.; Palittapongarnpim, P.; et al. Strain-Based HLA Association Analysis Identified HLA-DRB1*09:01 Associated with Modern Strain Tuberculosis. HLA 2017, 90, 149–156. [Google Scholar] [CrossRef]
Figure 3. Prediction of IFN-y-inducing epitopes for the selected proteins. Data represent computational predictions performed using IFNepitope (SVM) to discriminate IFN-y-inducing epitopes. These are selected based on reaching a predicted positive score. Non-synonymous substitutions and predicted IFN-y-inducing epitopes are indicated for the selected proteins: RimJ (A), LldD2 (B), Rv0012 (C), Rv0990c (D), and Rv2719c (E), across the two Mtb variants (L4 and L6) in comparison with the ancestral sequence. In each peptide, the non-synonymous amino acid substitution is represented in bold; X represents loss of IFN-γ-inducing epitopes. Peptides with non-synonymous substitutions only in Mtb L6 are shaded blue; those common to both Mtb L6 and L4 are shaded orange; and those only in Mtb L4 are shaded pink.
Figure 3. Prediction of IFN-y-inducing epitopes for the selected proteins. Data represent computational predictions performed using IFNepitope (SVM) to discriminate IFN-y-inducing epitopes. These are selected based on reaching a predicted positive score. Non-synonymous substitutions and predicted IFN-y-inducing epitopes are indicated for the selected proteins: RimJ (A), LldD2 (B), Rv0012 (C), Rv0990c (D), and Rv2719c (E), across the two Mtb variants (L4 and L6) in comparison with the ancestral sequence. In each peptide, the non-synonymous amino acid substitution is represented in bold; X represents loss of IFN-γ-inducing epitopes. Peptides with non-synonymous substitutions only in Mtb L6 are shaded blue; those common to both Mtb L6 and L4 are shaded orange; and those only in Mtb L4 are shaded pink.
Microorganisms 13 01032 g003
Figure 4. Predicted number of high-binding-affinity epitopes across lineages and HLA alleles. Number of high-binding-affinity epitopes predicted for Rv0012 (A,D), LldD2 (B,E), and Rv0990c (C,F) across the considered HLA class I (AC) or HLA class II (DF) alleles, as indicated on the x axis. Predictions for the ancestral sequences are represented in grey bars, for Mtb L4 variants in orange, and for Mtb L6 variants in green. Graphs on the left represent predictions for the indicated HLA alleles detected with high frequency in West Africa. Graphs on the right represent predictions for the HLA alleles detected with low frequency in West Africa. The non-synonymous substitutions that affect the number of high-binding-affinity predicted epitopes are indicated above each corresponding bar. Predictions were performed using NetMHCpan (HLA class I) and NetMHCIIpan (HLA class II), considering 8 to 14 mers for HLA class I and 9 to 18 mers for HLA class II. HLA frequencies were retrieved from allelefrequency.net. * (grey, orange, or green) indicates the absence of predicted high-binding-affinity peptides for the ancestral sequences, Mtb L4 variants, and Mtb L6 variants, respectively.
Figure 4. Predicted number of high-binding-affinity epitopes across lineages and HLA alleles. Number of high-binding-affinity epitopes predicted for Rv0012 (A,D), LldD2 (B,E), and Rv0990c (C,F) across the considered HLA class I (AC) or HLA class II (DF) alleles, as indicated on the x axis. Predictions for the ancestral sequences are represented in grey bars, for Mtb L4 variants in orange, and for Mtb L6 variants in green. Graphs on the left represent predictions for the indicated HLA alleles detected with high frequency in West Africa. Graphs on the right represent predictions for the HLA alleles detected with low frequency in West Africa. The non-synonymous substitutions that affect the number of high-binding-affinity predicted epitopes are indicated above each corresponding bar. Predictions were performed using NetMHCpan (HLA class I) and NetMHCIIpan (HLA class II), considering 8 to 14 mers for HLA class I and 9 to 18 mers for HLA class II. HLA frequencies were retrieved from allelefrequency.net. * (grey, orange, or green) indicates the absence of predicted high-binding-affinity peptides for the ancestral sequences, Mtb L4 variants, and Mtb L6 variants, respectively.
Microorganisms 13 01032 g004
Table 1. HLA class I allele frequencies across five West African countries. Values represent the total number of copies of each allele in the population sample (Alleles/2n) in decimal format. Alleles are categorized into those with high or low prevalence in West Africa.
Table 1. HLA class I allele frequencies across five West African countries. Values represent the total number of copies of each allele in the population sample (Alleles/2n) in decimal format. Alleles are categorized into those with high or low prevalence in West Africa.
Allele Frequency *
Guinea-BissauMaliSenegalThe GambiaBurkina Faso
HLAs with high prevalence in West AfricaHLA-A * 23:010.18580.2280.177n.d.n.d.
HLA-A * 02:010.10840.0830.081n.d.0.0945
HLA-A* 33:030.08820.0940.081n.d.n.d.
HLA-A * 74:010.131750.0360.022n.d.0.032333
HLA-A* 02:020.07140.0760.091n.d.0.053
HLA-B * 15:030.1080.069n.d.n.d.0.1045
HLA-B * 35:010.1440.1270.122n.d.n.d.
HLA-B * 53:010.10.1590.053n.d.0.171
HLA-B * 58:010.0780.0220.069n.d.n.d.
HLA-B * 08:010.0770.0070.048n.d.n.d.
HLAs with low prevalence in West AfricaHLA-A * 24:020.022400.005n.d.n.d.
HLA-A * 01:010.0510.0070.027n.d.n.d.
HLA-A * 02:07n.d.n.d.n.d.n.d.0.005
HLA-A * 24:07n.d.n.d.n.d.n.d.n.d.
HLA-A * 26:0800n.d.n.d.n.d.
HLA-B * 07:020.0230.0580.058n.d.n.d.
HLA-B * 18:01n.d.0.00700.0270n.d.n.d.
HLA-B * 27:0500n.d.n.d.n.d.
HLA-B * 40:0100n.d.n.d.0.014
HLA-B * 15:0100n.d.n.d.0
n.d. indicates that the allele was not detected in the respective population; * data retrieved from allelefrequency.net.
Table 2. HLA class II allele frequencies across five West African countries. Values represent the total number of copies of each allele in the population sample (alleles/2n) in decimal format. Alleles are categorized into those with high or low prevalence in West Africa.
Table 2. HLA class II allele frequencies across five West African countries. Values represent the total number of copies of each allele in the population sample (alleles/2n) in decimal format. Alleles are categorized into those with high or low prevalence in West Africa.
Allele Frequency *
Guinea-BissauMaliSenegalThe GambiaBurkina Faso
HLAs with high prevalence in West AfricaHLA-DRB1 * 11:010.1080n.d.0.03300.0846n.d.
HLA-DRB1 * 10:010.100n.d0.18900.06280.0900
HLA-DRB1 * 04:050.0770n.d.n.d.0.0475n.d.
HLA-DRB1 * 09:010.0770n.d.0.03300.0742n.d.
HLA-DRB1 * 11:020.0770n.d.0.14400.0753n.d.
HLA-DRB1 * 13:020.0770n.d. 0.04400.1440n.d.
HLA-DRB1 * 15:030.0093n.d.0.0220n.d.n.d.
HLAs with low prevalence in West AfricaHLA-DRB1 * 04:010n.d.n.d.0.0055n.d.
HLA-DRB1 * 07:01n.d.n.d.0.01100.0557n.d
HLA-DRB1 * 11:04n.d.n.d.n.d.0.0005n.d.
HLA-DRB1 * 14:02n.d.n.d.n.d.n.d.n.d.
HLA-DRB1 * 15:010.0080n.d.n.d.n.d.n.d.
HLA-DRB1 * 01:010n.d.0.0110n.d.n.d.
n.d. indicates that the allele was not detected in the respective population; * data retrieved from allelefrequency.net.
Table 3. Predicted differences in high-affinity HLA class I and II binding peptides for Mtb L6 and L4 amino acid variants vs. ancestral strain, using NetMHCpan and NetMHCIIpan, in the West African HLA frequency context.
Table 3. Predicted differences in high-affinity HLA class I and II binding peptides for Mtb L6 and L4 amino acid variants vs. ancestral strain, using NetMHCpan and NetMHCIIpan, in the West African HLA frequency context.
HLA Class IFrequency in West Africa
HighLow
ProteinL6 VariantsL4 VariantsL6–Anc.L4–Anc.L6–Anc.L4–Anc.
Rv0010cA26V-0-0-
RimJL123V-0-0-
TB7.3Q62H-−2-*-
Rv0012E234Q; G258DD55N; K105E−2+10+1
Rv0990cP8L: A68VA54S; H61Q−2000
LldD2L258VA59G; V176A; V253M−2+40+4
Rv2719c-H124Y-0-0
HLA Class IIFrequency in West Africa
HighLow
ProteinL6 VariantsL4 VariantsL6–Anc.L4–Anc.L6–Anc.L4–Anc.
Rv0010cA26V-−4-−5-
RimJL123V-0-0-
TB7.3Q62H-0-0-
Rv0012E234Q; G258DD55N; K105E−2+4−50
Rv0990cP8L: A68VA54S; H61Q−7−5+30
LldD2L258VA59G; V176A; V253M−1+1+30
Rv2719c-H124Y-0-0
Orangecolor indicates loss of predicted high-binding-affinity peptides, yellow no difference, and green a gain for the Mtb L6 or L4 variant. *, indicates that no high-binding-affinity epitopes were detected for the HLA/protein pair.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Silva, M.L.; Osório, N.S.; Saraiva, M. Immunoinformatics Predictions on Variable Mycobacterium tuberculosis Lineage 6 T Cell Epitopes and HLA Interactions in West Africa. Microorganisms 2025, 13, 1032. https://doi.org/10.3390/microorganisms13051032

AMA Style

Silva ML, Osório NS, Saraiva M. Immunoinformatics Predictions on Variable Mycobacterium tuberculosis Lineage 6 T Cell Epitopes and HLA Interactions in West Africa. Microorganisms. 2025; 13(5):1032. https://doi.org/10.3390/microorganisms13051032

Chicago/Turabian Style

Silva, Marta L., Nuno S. Osório, and Margarida Saraiva. 2025. "Immunoinformatics Predictions on Variable Mycobacterium tuberculosis Lineage 6 T Cell Epitopes and HLA Interactions in West Africa" Microorganisms 13, no. 5: 1032. https://doi.org/10.3390/microorganisms13051032

APA Style

Silva, M. L., Osório, N. S., & Saraiva, M. (2025). Immunoinformatics Predictions on Variable Mycobacterium tuberculosis Lineage 6 T Cell Epitopes and HLA Interactions in West Africa. Microorganisms, 13(5), 1032. https://doi.org/10.3390/microorganisms13051032

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop