Next Article in Journal
Mixed Methods Evaluation of a Youth-Friendly Clinic for Young People Living with HIV Transitioning from Pediatric Care
Previous Article in Journal
Convergence between Cardiometabolic and Infectious Diseases in Adults from a Syndemic Perspective: A Scoping Review
Previous Article in Special Issue
Prevalence of Drug-Resistant Tuberculosis in HIV-Positive and Diabetic Patients in Sinaloa, Mexico: A Retrospective Cross-Sectional Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparative Genomic Analysis of Mycobacterium tuberculosis Isolates Circulating in North Santander, Colombia

by
Diana Patricia Bohada-Lizarazo
1,2,3,*,
Karen Dayana Bravo-Sanabria
1,3,
Paola Cárdenas-Malpica
2,3 and
Raúl Rodríguez
1,2
1
Centro Experimental de Diagnóstico e Investigación Molecular-CEDIMOL, Universidad de Pamplona, Pamplona 503050, Colombia
2
Grupo de Investigación en Recursos Naturales, Universidad de Pamplona, Pamplona 503050, Colombia
3
Departamento de Bacteriología y Laboratorio Clínico, Facultad de Salud, Universidad de Pamplona, Pamplona 503050, Colombia
*
Author to whom correspondence should be addressed.
Trop. Med. Infect. Dis. 2024, 9(9), 197; https://doi.org/10.3390/tropicalmed9090197
Submission received: 29 July 2024 / Revised: 17 August 2024 / Accepted: 27 August 2024 / Published: 28 August 2024
(This article belongs to the Special Issue Burden of Tuberculosis in Different Countries)

Abstract

:
Tuberculosis (TB) is an important infectious disease in relation to global public health and is caused species of the Mycobacterium tuberculosis complex (MTBC). In this study, we used whole-genome sequencing (WGS) and comparative genomics to investigate the genetic diversity of M. tuberculosis (Mtb) isolates circulating in North Santander (NS), Colombia. WGS was used for the phylogenetic and lineage characterization of 18 isolates of Mtb typed with orphan genotypes from 11 municipalities of NS between 2015 and 2018. The isolates studied were included in six sublineages from L4; the most frequent were 4.1.2.1, 4.3.3, and 4.3.4.2, corresponding to a proportion of 22.2%. The genome analysis conducted allowed the identification of a set of genetic variants mainly associated with determinants of virulence and evasion of the immune system (PPE34 and PE_PGRS2); adaptation and survival (PGL/p-HBAD); stress response (sigJ and sigM); geographic variability (PPE34); and carbohydrate and lipid metabolism (aldA, rocA, and cyp144). This is the first description of the molecular epidemiology of Mtb isolates circulating in NS achieved through WGS. It was possible to perform comparative genomics analyses between Mtb isolates against the universal reference H37Rv and Colombian UT205 genome, which can help us to understand the local genetic diversity and is relevant for epidemiological studies, providing insight into TB transmission dynamics in NS.

1. Introduction

Tuberculosis (TB) is a public health problem that has recently increased in scale; in its last report, the World Health Organization (WHO) reported 7.5 million new cases of TB and 1.3 million deaths caused by this disease in 2022 [1], revealing it to be the infectious disease with the second-highest death toll after Coronavirus Disease 2019 (COVID-19), in addition to being the cause of twice as many deaths as Human Immunodeficiency Virus/Acquired Immunodeficiency Syndrome (HIV/AIDS) and causing around 20% of deaths among people resistant to antituberculosis drugs [2]. This disease is caused by species of the Mycobacterium tuberculosis complex (MTBC), which includes ten lineages (L1–L10) adapted to humans [3,4,5], in addition to the species that infect animals, which are distributed phylogeographically [6].
North Santander (NS) is a department of the Republic of Colombia with 1.7 million inhabitants located in the northeastern region and is geographically bordered by the Republic of Venezuela. In NS, the incidence of TB was 42.02 per 100,000 inhabitants in 2023 [7], a value higher than the national average rate (35.17), placing this department among those with the highest TB burden in the nation.
Regarding the mortality rate, Colombia showed relatively stable behavior in 2013–2021, with values between 2.0 and 2.2 deaths per 100,000 inhabitants; however, in 2022, it registered an increasing rate, reaching 2.6 deaths per 100,000 inhabitants [8]. In 2022, 1351 people were registered to have died from TB in this country, affecting men in a greater proportion (71.9%); the older adult group (60 years and older) registered the highest mortality rate for TB, with 10.1 per 100,000 inhabitants; at the same time, NS reported a rate of 2.5 per 100,000 inhabitants for the same year [9].
According to the Colombian study carried out in 2010–2018 [9] that estimated Disability-Adjusted Life Years (DALYs) to TB, it was found that a large part of the national territory (51.5%), including NS, has a high burden, and it results mainly from premature mortality in men [10]. NS reported 9496 cases of sick people and 814 cases of people dying from TB, with a DALY rate of 865.9 per 100,000 inhabitants, which, together with the rate of years of life lost (YLL), amounts to 774.2 cases per 100,000 inhabitants and, for years of lived with disability (YLD), 90.7 cases per 100,000 inhabitants [10]. These data are indicators that reflect the need to improve networking care and the health system in regard to the surveillance of TB events in this region of the country. All of the above information justifies the strengthening of public health policies, reducing the impact of this pathology on the population.
Genotyping has provided evidence of the genotypic diversity of MTBC isolates, such as the findings reported by Jia et al. [11], who, in a comparative analysis between 12 genomes, found large variations in the phenotypes, including regarding association with the host, virulence, and immunoreactivity, which could be the result of long-term coevolution between diverse human and animal populations [12,13,14]. Another example is the information provided by Zwyer et al. [15], who studied the numerous Mtb genotypes introduced anciently or recently in Dar es-Salam (Tanzania) from different regions around the world using whole-genome sequencing (WGS) and clinical data, detecting differences in transmission rates and the duration of the infectious period, while, in regard to virulence, variation was not observed between the most common circulating genotypes during an active TB infection, showing that genomes have evolved at sites that affect the transmission capacity of MTBC strains of diverse genetic origins [15].
On the other hand, a study carried out on 36 genomes belonging to lineages L4, L5, and L6 in Ghana [16] found that Mtb isolates from lineage 4 spread faster than those from M. africanum lineages, because the latter presented mutations in essential growth genes such as ftsE, pstP, whiB3, and suhB, which are associated with cell division and are responsible for cellular control [16]; furthermore, the diversity of the carbohydrate metabolic cycle was confirmed, where, in M. africanum lineages, evidenced a greater preference for pyruvate than for glycerol as a carbon source, similar to that for M. bovis [17]. The preference of L6 for pyruvate was related to modifications in the pykA and eno genes [16], demonstrating that gene variations have implications for the physiology of pathogens.
The growing use of molecular tools such as the WGS of Mtb and bioinformatic systems with clinical and epidemiological applications has made it easier to obtain information on genes associated with virulence [18], transmission [19], and multiplication which phenotypes are the result of mutations such as single-nucleotide polymorphisms (SNPs), i.e., smalls insertions or deletions (indels) of the different types of circulating strains [6], allowing researchers to focus on clinical and surveillance activities regarding TB in different regions of the world in order to improve the impact of the corresponding control programs.
Thus, the genomic study of Mtb isolates has allowed researchers to carry out comparisons using different approaches, such as searching for individual genomic variants or variants shared by all isolates through mapping against the H37Rv genome, contributing the comprehension of its evolution, adaptation to human populations, and the immune response induced by its host. For example, in a study conducted in Thailand [20], Mtb isolates from patients with meningeal TB and patients with pulmonary TB were compared, and it was found that 242 gene variants were common in isolates of meningeal TB and rare in isolates from pulmonary TB, highlighting 28 nonsense SNPs that affected genes such as pks and some of the PE/PPE genes, in addition to a greater number of structural variants in the isolates that cause pulmonary TB. Likewise, the authors found that the distribution of the lineages differed between the two types of disease; that is, the East Asian lineage predominated in both types, while the proportion of the Indo-Oceanic lineage was greater in the cases of meningeal TB, demonstrating some potential factors associated with the pathogenesis of meningeal TB [20,21].
In this work, we start with a genotyping study of Mtb isolates circulating in NS [22], where 18 isolates with an orphan genotype (based on spoligotyping), were taken, to which WGS and comparison were applied against the H37Rv and Colombian UT205 genomes, describing their genomic characteristics and thus determining if they have any relationship with the high transmission of TB in NS, Colombia.

2. Materials and Methods

2.1. Type of Study

This carried out a descriptive, prospective, cross-sectional study that included Mtb isolates stemming from cases of TB investigated and reported in the 2015–2018 period in NS, Colombia.

2.2. Study Population

Patients who were investigated and reported to have TB at the clinical laboratory of the ESE Hospital Universitario Erasmo Meoz (HUEM) in the city of Cúcuta, NS, Colombia, between 2015 and 2018 were included.

2.3. Sample Design

This study was carried out using non-probabilistic convenience sampling. The sample was formed using 215 cases of pulmonary and extrapulmonary TB; among which, 207 cultures were genotyped as Mtb based on spoligotyping and the number of tandem repeats of interspersed variable repetitive units (MIRU-VNTRs) of the mycobacteria of 24 loci. From the example, 18 isolates were taken among the 72 isolates that had an orphan genotype to perform WGS, taking into account the quality, integrity, and content of the DNA obtained from them.

2.4. Bacterial Culture and DNA Extraction

Initially, sputum samples were cultured in Ogawa Kudoh o Lowenstein Jensen Microgen® Culture medium [23]. The samples were incubated at 37 °C, with growth observed weekly until 8 weeks, but they were not discarded as negative before 12 weeks. The cultures that showed colony growth were treated using by Ziehl Neelsen (ZN) staining to confirm the presence of Acid-Fast Bacilli (AFB), and an immunochromatographic assay (BD MGIT TBc ID, Beckton Dickinson Diagnostic, Sparks, NV, USA) was performed, validating that the isolates belonged to the MTBC. In addition, banks of both culture colony samples and genomic DNA were created, all of which were stored at −80 °C as counter samples to be used in case they were needed or in future analyses.
DNA extraction was performed on the isolates identified as belonging to the MTBC using the cetyltrimethyl ammonium bromide (CTAB) method, as previously described [24]. The amount of DNA obtained was determined using a Nanodrop 2000C Thermo Scientific® (Waltham, MA, USA) with measurements at 260 nm/280 nm; calculating the equivalence to 50 µg/mL per absorbance unit, the relation 260/280 equal to 1.8 is considered the ideal ratio for pure DNA. Likewise, the quality of the extracted DNA was evaluated via electrophoresis in 0.8% agarose gel with TBE 1X buffer stained with SYBR Safe, Invitrogen (10,000×). The DNA samples that maintained integrity, purity, and quantity were subjected to sequencing [25].

2.5. Data Management

2.5.1. Study of Variables

This study includes sociodemographic (age, sex, country and municipality of origin, and ethnicity); socioeconomic (inhabitant of a commune or township, health regime, overcrowding, and homelessness); microbiological (type of sample, smear microscopy result, culture result, and immunochromatographic test result); clinical (type of TB, comorbidities, and treatment); and epidemiological (type of patient and TB risk municipality) variables of the participants, which were taken from the notification forms of the 813-TB Event from the HUEM Clinical Laboratory and medical records.
Also, genotypic variables corresponding to the spoligo international type (SIT) of spoligotyping, lineages or families, and subfamilies and mycobacterial interspersed repetitive unit (MIT) of the MIRU-VNTR data were included. All this information was recorded in a Microsoft Office Excel 2016 (Microsoft Corporation—Redmond, WA, USA) spreadsheet that was developed for this purpose and used to generate the NS TB database presented in this study.

2.5.2. Analysis of Variables

Initially, an exploratory analysis of the data was carried out, describing all qualitative and quantitative variable studies using absolute and percentage frequencies, as appropriate. Afterward, the grouped pattern category was compared to the single-pattern category using Fisher’s exact test for both types of variables.
Finally, for variables that presented a p < 0.05 in the previous analysis, the association between them and the grouped pattern was determined by calculating the prevalence ratios (PRs) with their respective confidence intervals (95% CIs) utilizing univariate Poisson regression. Additionally, a multivariate analysis was performed to determine independent variables associated with the grouped pattern. The analyses were performed using Stata 16.1 software (Stata Corp LLC, College Station, TX, USA).

2.6. Total Sequencing of Mtb Genomes

Sequencing was performed at the High-Throughput Genomics core of the University of Utah. DNA libraries were prepared using an Illumina DNA Prep library kit and sequenced using Illumina NextSeq (2 × 151 bp). The sequences were generated as paired ends, trimming was performed to remove adapters, and a quality analysis was run with QIIME v1.9.1 [26] and DADA3 v1.6.0 [27], revealing that all sequences exceeded a score of Q20 (Table S1).
The filtered reads were assembled with MegaHit Assembler v1.0 [28] and aligned to the reference H37Rv genome sequence (NCBI Accession: NC_000962.3) using Kraken software v2.5.0 [29] via the Python v3.12 computational language. The raw sequences were deposited in the European Nucleotide Archive under bioproject PRJEB70559 (ERP155524) and the accessions numbers ERS17287192 to ERS17287209 (available at https://www.ebi.ac.uk/ena/browser/view/PRJEB70599, accessed on 27 November 2023).

2.7. Annotation and Identity Analysis of Mtb Genomes

Structural annotations of the genes and other regions in the Mtb isolates were carried out against UT205 (GenBank accession: NC_016934.1, GI: 392384721) and H37Rv (NCBI Accession: NC_000962.3). Multiple alignment of Mtb genomes was performed utilizing “MAFFT Alignment” tools [30] within Geneious Prime 2023 software (https://www.geneious.com/ accessed on 10 November 2023). During this alignment process, an identity matrix and similarity was sought between the genome sequences with H37Rv and UT205 to identify the genes, regulatory regions, and other characteristic sequences.

2.8. Phylogenetic Analysis

In order to evaluate the genomes behavior of interest and their homology with respect to the reference genomes, a maximum likelihood phylogenetic tree was generated with the 20 genomes using the RAxML tool v8.0 [31].
Phylogenetic analysis was performed using the general time-reversible model (GTR), which evaluates phylogenetic inferences in considerations of the reversibility of nucleotide substitutions over time. A bootstrap of 1000 resampling inferences was employed, and M. canettii (GenBank access: ASM25337v1) was used as an outgroup due to its divergent position and greater genetic variability in comparison with the main MTBC subspecies.

2.9. Identification of Mtb Variants

Mtb genomics variants were identified from WGS data using an algorithm available at https://github.com/ksw9/mtb-call2, accessed on 10 November 2023. Briefly, the quality of the bases was verified. BWA v.0.7.15 (bwa mem) [32] was used to map the reading against the H37Rv genome (NCBI Accession: NC_000962.3), and duplicates were removed using Sambamba software v0.5.0 [33].

2.10. Comparative Genomics Analysis

A comparative genomics analysis was performed through the usage of the LASTZ [34] and MAUVE [35] genome alignment tools available as part of Geneious Prime 2023. This analysis was performed using both tools, conducting two global alignments of the 18 genomes isolated from patients, comparing the UT205 (GenBank accession: NC_016934.1, GI: 392384721) with the H37Rv genome (NCBI Accession: NC_000962.3).

2.11. Ethical Considerations

The study protocol was approved by the Ethics Committee of the HUEM (Protocol 2015-136-003097-2) for the development of this study. The ethical principles of biomedical research were followed in accordance with resolution 008430 of 1993 of the Ministry of Health of Colombia, where it is classified as a risk-free study, taking into account the optimal clinical practices of the Declaration of Helsinki [36]. All participants in this study were informed of the risk it posed and voluntarily signed the form (with the study identifier code PR130-00-051), providing access to the sputum samples from each patient.

3. Results

3.1. Population Characteristics

Among the 18 individuals included in this study, the predominant proportion was male, corresponding to 11 individuals (61.2%); a range of ages between 17 and 73 years was represented, and the majority (83.5%) came from the municipality of Cúcuta. In addition, five of the participants in this study (27.8%) had a negative result in their smear microscopy test, only two individuals (11.1%) presented a positive HIV test, and in one of the cases (5.6%), resistance to rifampicin was found (Table 1).

3.2. Annotation and Identity Analysis of Mtb Genomes

For the 20 sequences analyzed that included the 18 Mtb isolates from NS and evaluated against the UT205 and the H37Rv strains, a total of 3814 genes were found, of which 48 coded for RNAs, and the rest coded for proteins (CDS). The overall identity was found to be 95.9%, with an average alignment length of 4,411,532 base pairs (bps). A total of 4,277,886 identical sites were identified, representing 96.6% similarity.
In relation to the distance matrix, it was found that all Mtb genomes from NS showed high homology with the H37Rv genome, with identity percentages that ranged between 98.70% and 99.56%. The 21088X3-col119, 21088X6-col124, 21088X16-col207, 2108X18-col213, 21088X10-col174, 21088X13-col179, 21088X4-col121, 21088X12-col178, 21088X15-col201, and 21088X7-col132 genomes presented the highest identity values (Figure S1).
On the other hand, the global similarity between genomes from NS and the UT205 genome was 98%, with an average alignment length of 4,411,532 bp and a total of 19 sequences evaluated. In this case, 4,276,412 identical sites were reported, representing 96.5% similarity.
When the homology of the genomes was evaluated individually in comparison with the UT205 and H37Rv genomes, identity percentages that varied between 94.80% and 95.41% were found, as reported in the distance matrix.

3.3. Phylogenetic Analysis

The results of the phylogenetic analysis revealed three clusters of Mtb isolates, including the UT205 genome, which was compared to the H37Rv genome (Figure 1).
The first group is made up of the genomes of the isolates 21088X13-col179, 21088X12-col178, 21088X4-col121, and 21088X5-col123; the second group includes the H37Rv genome and the isolates 21088X9-col173, 21088X6-col124, 21088X16-col207, 21088X18-col213, 21088X3-col119, 21088X7-col132, and 21088X10-col174; and group three consists of the isolates 21088X15-col201, 21088X1-col41, 21088X2-col117, 21088X11-col177, and 21088X8-col137 and the UT205 genome. Isolates 21088X17-col210 and 21088X14-col180 are not related to any group within the phylogenetic tree.
These results show a grouping according to the homology between genomes. The isolates of group 2 present a greater similarity to the H37Rv genome, while the isolates of group 3 present a greater similarity to the genome of the UT205 strain.

3.4. Identification of Mtb Variants

The genomes of the 18 sequenced isolates could be included in six L4 sublineages and were distributed as follows: 4.1.2.1 (H1) with four isolates; 4.3.2 (LAM3) with two isolates; 4.3.3 (LAM9) with four isolates; 4.3.4.1 (LAM1) with three isolates, 4.3.4.2 (LAM11) with four isolates, and 4.8 (T) with a single isolate (Figure 2). The most frequent sublineages were 4.1.2.1, 4.3.3, and 4.3.4.2, and each one has four isolates (22.2%).
Of the 11 municipalities in which this study was carried out, 4 contributed Mtb isolates to the 18 isolates sequenced, and Cúcuta was the municipality that had the highest number of isolates with 15 (83.5%), including all sublineages, while, in the remaining three municipalities of Chinácota, El Zulia, and Tibú, only the 4.1.2.1 and 4.3.4.2 sublineages were found.

3.5. Comparative Genomics Analysis

3.5.1. Comparative Genomics Analysis of Mtb Isolates against the H37Rv Genome

The comparative genomics analysis against the H37Rv genome revealed a total of 27,151 genetic modifications, of which 25,200 correspond to single-nucleotide polymorphisms (SNPs); of these, 8707 were synonymous, and 16,493 conferred changes in the amino acid residues involved that could have various consequences for the protein. The other gene modifications found correspond to insertions events and deletions of single nucleotides or tandem repetitions, constituting SNPs that confer a loss of the starting codon and generate a truncation in the protein, affecting its length.
A total of 44 genes involved in this group of gene modifications encode proteins that are responsible for the biosynthesis of phenolic glycolipids and derivatives of p-hydroxybenzoic acid (PGL/p-HBAD), proteins related to virulence and the host’s immune response, integral proteins of the membrane, enzymes and metabolic proteins, and proteins associated with gene regulation and transcription (Table 2).
The frequencies of the genetic modifications were between 2.9% and 100%, where the proteins PstA1, LpdA, and PE35 represented 100% of the frequency; that is, the variants detected were in all the Mtb isolates from NS collected in the present study and in the UT205 strain as well. In the case of the rv0930 gene that codes for the PstA1 protein, truncation was found in the protein (Figure 3).
In addition, the rv3303c and rv3872 genes coding for the proteins LpdA and PE35, respectively, presented SNPs that conferred truncation in the coding proteins, modifying their extension, as shown in Figures S2 and S3.
The other genes presented gene modifications with lower frequencies affecting specific isolates, for example, the rv1917c gene that codes for the PPE34 protein, which has a size of 4366 bp, and in the case of the UT205 genome, two tandem insertions were identified at positions 463 and 583, respectively, that were not in the 18 NS Mtb isolates, in addition to two deletions, one of 207 bp (position 964) and the other of 69 bp (position 1248), and a 75 bp insertion (position 2895), as shown in Figure 4.
Similar behavior was observed in the genome of isolate 21088X11-col177, in which a 138 bp insertion at position 1014 and a deletion of the same size at position 1204 were identified (Figure 4), while, in the genome of isolate 21088X6-col124, two of the previously reported deletions in the UT205 strain were identified (207 bp at position 964 and 69 bp at position 1248), in addition to a 207 bp insertion at position 1388 (Figure 4).

3.5.2. Comparative Genomics Analysis of Mtb Isolates against to UT205 Genome

The genomic analysis of the 18 isolates from NS against the UT205 genome revealed the presence of various gene variants, identifying a total of 503 variants, of which 196 were located in 123 involved genes. Mutations due to SNPs (141 in total) were the most frequent, of which 57 resulted in synonymous changes, while the 84 SNPs remaining generated modifications in the amino acid residues of related proteins. Furthermore, 37 insertion events were identified, of which 12 involved tandem repeat sequences.
The frequency of the gene variants found was between 22% and 100%, where the maximum value indicates that the variant was presented in the 18 genomes obtained from the isolates from the NS patients analyzed in the present study. In this way, it was determined that 28 variation events presented a frequency of 100% (Table 3), and the genes involved were classified into three main groups according to their biological function in Mtb.
The first group encompasses genes that are related to the coding of PE/PPE family proteins (PPE34, PPE8, PPE15, PPE59, PE_PGRS5, PE_PGRS42, PE_PGRS59, PE_PGRS60, and PE_PGRS51), as described in Table 3, where the rv1917c gene that codes for the PPE34 protein is highlighted again, presenting two deletions of 69 nucleotides towards the 3′ region and three insertions of tandem repetitions (two towards the 3′ region and one in the middle region of the sequence) involving a frameshift of the protein, leading to a significant alteration in the sequence of the amino acid and therefore possibly significantly impacting the function of the corresponding protein. In the same way, the PE_PGRS2 gene presents a genetic modification in UT205 that includes a 185 bp deletion, which generates a significant change in the reading frame of the protein (Figure 5).
The second group of genes includes those associated with lipid biosynthesis and the stress response of mycobacteria. The sigJ and sigM genes are highlighted, which present a SNP and a short tandem insertion, respectively.
Finally, the third group of genes that presented modifications in all the genomes evaluated consisted of genes related to lipid and carbohydrate metabolism (aldA, rocA, cyp144, pykA, gnd1, and glpQ1). These are genes that conferred the change in the amino acid residue involved, except for the gnd1 gene, which presented a 5 bp insertion towards the 3′ region, and the glpQ1 gene, which had a deletion of the same size.
The other genes are associated with various biological processes in Mtb, such as the synthesis and metabolism of lipids and fatty acids (plsB1, pks1, pks5, pks7, pks9, fadB3, fadE28, and fadD35) which frequency remained in the range of 27.8% to 94.4% and wherein the representative gene changes were simple-nucleotide substitutions that conferred a change in the amino acid residues involved.
Other genes that draw attention are ctpG and PPE59, because they presented deletions of 3650 bp and 419 bp, respectively, in the UT205 strain, which generated a truncation in the coding protein (Figure 6).

4. Discussion

This research is the first study carried out in NS, Colombia, in which WGS was used to analyze TB cases. The results allowed the identification of Mtb isolates with an orphan genotype (based on spoligotyping) in L4 or Euro-American lineages with sublineages 4.1.2.1, 4.3.2, 4.3.3, 4.3.4.1, 4.3.4.2, and 4.8. These results obtained are similar to those reported in previous studies conducted using WGS [38,39] and traditional methods found in the rest of the country [40,41] and the Latin American continent [42].
In the gene identity analysis, it is notable that the 18 isolates present greater homology to H37Rv than UT205. On the other hand, the results of the phylogenetic analysis allowed us to detect three clusters among the Mtb isolates, perceiving some degree of greater similarity with the reference strains. However, we recommend conducting a more robust phylogenetic analysis that allows the use of other tools (PhyML and FastTree) and involves various genomic regions that allow the acquisition of precise and efficient phylogenetic reconstructions, which will contribute significantly to the understanding of the evolutionary relationships between microorganisms at the genomic level.
In the sociodemographic characterization of the 18 sequenced isolates obtained from the participants, it was found that all those in sublineage 4.1.2.1 were female, while all were male in sublineages 4.3.4.1, 4.3.4.2, and 4.8. At the same time, in sublineages 4.3.2 and 4.3.3, the participants were equally distributed according to sex. These data show that neither of the two sexes presented exclusivity in terms of carrying a sublineage in this group.
Regarding the relationship between sublineages and the municipalities of origin of the participants, it was found that sublineage 4.1.2.1 corresponded to participants from communes 3, 4, and 6 of Cúcuta, as well as from the municipality of El Zulia; sublineage 4.3.3 corresponded to participants from communes 1, 3, 5, and 6; sublineage 4.3.4.2 corresponded to participants from communes 3, 9, and the municipalities of Chinácota and Tibú; sublineage 4.3.4.1 corresponded to participants from communes 2, 6, and the township of Aguaclara; sublineage 4.3.2 corresponded to participants from commune 9 and the township of Buena Esperanza; and the sublineage 4.8 corresponded to participants from commune 7. The results obtained indicate that the transmission of this disease is not determined by a specific sublineage, since these were found to be indiscriminately present in high- or low-risk areas of the department of NS.
On the other hand, the presence of sublineages 4.3.3, 4.3.4.1, and 4.3.2 was found without distinction in population groups such as homeless and HIV-positive and migrants, being consistent with the findings of classical epidemiology [22], which highlights these population groups as important, because they can become spreaders of the infection in the community, and therefore, it is necessary to prioritize control measures to increase early detection both in studies of contacts and the general population that allow a rapid and timely diagnosis.
Another important finding was that 3 of the 18 participants had negative smear microscopy results in sublineages 4.1.2.1, 4.3.3, and 4.3.4.2. It is important to mention that people with negative bacilloscopy results can transmit TB and thus constitute a hidden prevalence of the disease and promote the maintenance of the chain of transmission in a community [43].
In regard to the grouping of the isolates, sublineages 4.1.2.1, 4.3.2, and 4.3.3 presented grouping in half of their isolates, while sublineages 4.3.4.1 and 4.3.4.2 showed grouping in a lower proportion (25%). However, these results show that five of the six sublineages identified in this study are associated with the recent transmission of TB in Cúcuta and that the six sublineages are related to relapses and reinfections both in the communes of Cúcuta and neighboring municipalities.
Nevertheless, only 1 of the 18 samples was observed to be resistant to rifampicin, corroborating the fact that rifampicin resistance does not constitute a cause for concern in the region [44].
As consequence of the different genetic modifications observed when performing the comparative genomics analysis of the 18 isolates of NS Mtb and UT205 against the H37Rv genome, changes in biological function were inferred, such as those generated in a critical site of the protein, potentially negatively or positively affecting the metabolic pathways associated with it, as well as protein–protein interactions and enzymatic activity, among others, which could be associated with adaptation and survival events within the host, as well as evasion of the immune system and susceptibility to drugs [45].
One of the gene modifications found in the NS Mtb isolates involves the gene encoding the PstA1 protein that is associated with the high-affinity phosphate transport system in Mtb. This phosphate transport system called Pst (“phosphate-specific transport”) is essential for the efficient uptake and utilization of phosphate, a nutrient essential for the growth and survival of bacteria [46]. In the Pst system, PstA1 functions as a periplasmic protein that contributes to the adaptation and survival of mycobacteria in environments where phosphate is a limited resource [46]. It has been shown that the Pi uptake system component of Mtb, PstA1, is essential to counteract the IFNγ-dependent immune response [46], and its deficiency could trigger high sensitization of the mycobacteria to the host immune response [46,47,48].
Another modification found was observed in the rv3303c gene, which encodes the enzyme lipoamide dehydrogenase (LpdA), which performs an important role in the metabolism (specifically in regard to the generation of energy under anaerobic conditions [49]) and in biosynthesis of the mycolic acids, key components of its cell wall; this modification could be associated with mycobacteria´s resistance against reactive metabolites of nitrogen and nitric oxide, as this enzyme plays an important role in this process [50].
Likewise, the rv3872 gene, which encodes the PE35 protein belonging to the PE/PPE family, presented a SNP at position 295 that caused the loss in an amino acid at the external C-terminal, producing a truncation of the expression of the corresponding information. As it is a protein associated with the virulence of the microorganism, it could be thought that this modification is a function of the evasion of the immune system; however, more specific studies are required to support this hypothesis. In a study carried out by Jiang and colleagues, among 161 isolates of Mtb, mutations in the PE35 protein that generated drastic changes resulting in the increased virulence of the mycobacteria were identified [51].
Additionally, SNPs were found in 2 of the 18 isolates (21088X06-col124 and 21088X11-col177), and they were related to the rv1917c gene, which encodes the PPE34 protein, which is associated with the virulence and evasion of the host immune system [52]. This protein helps dendritic cells mature by inducing a Th2 cellular response that favors immune evasion [53]; in addition, together with the rest of the PE/PPE proteins, it exhibits high genetic variability within Mtb isolates that may contribute to the adaptation of the bacteria to different environments and immunological conditions, facilitating its survival and persistence [54].
The behavior of the genetic modifications found in this gene in the NS isolates and the UT205 genome could be associated with strain-specific variability, where factors such as a patient’s immunity could influence such changes. Another phenomenon that could be associated is the geographical distribution of the isolates involved in this study (in the NS region) and the Colombian UT205 genome (Medellin) significantly influencing the genetic variability in the gene coding for PPE34. Therefore, it is necessary to highlight that research on PE/PPE proteins continues to shed light on their specific functions and contribution to Mtb pathogenesis. A detailed understanding of the biology of these proteins may have important implications for the development of therapeutic strategies and vaccines against TB [52,55].
Similarly, modifications in the proteins associated with the biosynthesis of phenolic products of PGL/p-HBAD were identified in the other genes, which presented frequencies between 11% and 38%, wherein truncation and extension events of the proteins were observed. These changes could be related to the adaptation of mycobacteria to different environments, drug resistance, and interaction with the immune system.
The phenolic products of PGL/p-HBAD in Mtb play an essential role in the structure and virulence of the corresponding bacterium, because they form crucial components of the cell wall, contributing to its integrity and resistance [56]. Also, their presence allows modulating the immune response and the ability to evade the host’s defenses. Furthermore, they can contribute to the adaptation of bacteria to different environments within the host, playing a crucial role in the pathogenesis of TB [56,57].
As described, most of the modifications occurred in genes associated with the PE/PPE protein family, presenting frequencies between 2% and 100% (Table 2). These proteins are known for their diversity in sequence and structure, as well as for their role in the interaction with the host immune system and in their contribution to bacterial virulence [52]. The adaptation of Mtb to different human populations and geographic environments may be reflected in specific patterns of genetic variability in PE/PEE proteins, suggesting an evolutionary response to host selective pressures and providing deep insight into the complexity of the interaction between this bacterium and the immune system, as well as its ability to persist and cause disease in various populations and epidemiological contexts [54,55,58,59].
Regarding the frequency of occurrence, between 5% and 47% (Table 2) of the modifications present in the genes that encode proteins that interact with the host’s immune response play a role in selection pressure events that could be influenced by the host or environmental factors.
Finally, some variants were identified in genes encoding integral membrane proteins. The frequencies of occurrence are between 5% and 72%, and their possible effects on mycobacteria are associated with resistance or evasion of the immune system due to truncations in these proteins, which could have significant implications for the pathogenesis of TB. These mutations could allow these mycobacteria to avoid recognition by and the effective response of the immune system, thus facilitating their survival and persistence in the host. By avoiding the defenses of the immune system, Mtb can establish chronic and recurrent infections by evading immunological and therapeutic interventions. Furthermore, immunological resistance could have consequences for the transmissibility of the disease, affecting the spread and epidemiological burden of TB in affected populations [60].
On the other hand, regarding the gene variants found in the Mtb isolates in this study, when performing the comparative analysis against the UT205 genome, it is important to highlight the first group of genes related to the PE/PPE family due to their importance in pathogenesis and in the interaction between the pathogen and the host’s immune system. The literature suggests that this group of proteins could be involved in the evasion of the immune system, affecting the antigen presentation and playing a role in the adaptation of Mtb to the intracellular environment [61]. Similarly, in the PE_PGRS2 gene, there is a genetic modification in the Colombian UT205 strain, not found in the NS isolates, and its effect seems to be similar to those effects found in the PPE34 protein, which were related to the interaction between the host immune system and Mtb [62,63].
The second group, which involves mutations in genes that code for factors called “sigma” involved in gene expression and response to stress, may have significant consequences on the ability of the mycobacteria to adapt to their environment. These “sigma” factors play a key role in the regulation of genes related to virulence and response to adverse conditions [64]. A change could alter the expression of genes critical for the survival and persistence of Mtb, compromising its ability to evade the host immune system and resist hostile conditions. This could influence the virulence of the bacteria and their ability to establish chronic infections, thus affecting the dynamics of TB. Furthermore, such a change could have implications for drug resistance and the effectiveness of therapeutic strategies, highlighting the importance of understanding at a molecular level how these changes affect the adaptation of mycobacteria to their changing environment [64,65].
In the third group, gene modifications were found in all 18 isolates related to lipid and carbohydrate metabolism. Mutations in these genes could have various consequences that impact the adaptation of mycobacteria to their environment. For example, SNPs in molecules such as aldolase (aldA) and regulatory protease (rocA), which are involved in central metabolism and protein regulation, respectively [66], could affect the ability of mycobacteria to utilize carbon sources and nitrogen. Similarly, mutations in cyp144 (cytochrome P450) could influence the resistance to xenobiotic compounds and the modulation of virulence. On the whole, mutations in these genes could alter critical aspects of Mtb metabolism, stress response, and virulence, influencing Mtb´s ability to adapt and persist in its host.
Likewise, other mutations related to the degradation, synthesis, and transport of fatty acids were found (the plsB1, pks1, pks5, pks7, pks9, fadB3, fadE28, and fadD3 genes), which could alter the ability of the bacteria in question to use and metabolize fatty acids, essential components for their survival and persistence in the host [67]. Such mutations could impact lipid biosynthesis, affecting the integrity of the cell wall and, therefore, the bacteria’s resistance to environmental conditions and the host’s immune system. Since lipid homeostasis is essential for Mtb virulence, mutations in fadB3, fadE28, and fadD35 could influence the analyzed bacteria’s ability to establish chronic infections and evade immune responses, indicating the importance of these genes in the pathogenesis of TB [68].
Finally, it is important to mention that the ctpG gene presented an insertion with a frequency of 63.6%, a relatively high value, and was detected in some of the NS isolates in this study (21088X10-col174, 21088X12-col178, 21088X13-col179, 21088X4-col121, 21088X5-col123, 21088X7-col132, and 21088X9-col173). This gene encodes proteins involved in the active transport of metal ions and other cations across the cell membrane, a process with the maintenance of ionic homeostasis and resistance to heavy metals, playing a role in the survival of mycobacteria [69,70].
On the other hand, the gene modification in PPE59 presented a much lower frequency compared to ctpG (36.4%) and was present in only four of the clinical isolates (21088X4-col121, 21088X5-col123, 21088X12-col178, and 21088X18-col213). According to the literature, this gene encodes proteins that are expressed on the cell surfaces of the bacteria in question and may play a role in the evasion of the immune system or in resistance to fluoroquinolones [71].
The genomic comparison of the Mtb isolates with the Colombian UT205 strain revealed various genetic modifications that are different from those found when comparing all the isolates with the H37Rv genome. It can be seen that the mutations identified in the comparison with the UT205 genome in all cases correspond to this strain—that is, they are typical of the Colombian strains. The analysis against H37Rv allows to conclude that the majority of the isolates retain more characteristics than the UT205 strain.
This study has limitations regarding the relatively small number of isolates included in the analysis. It is possible that a larger sample may reveal more information about the genetic variations and evolution of Mtb strains circulating in the department of NS, Colombia. In addition, future efforts are needed to perform more robust phylogenetic reconstruction analyses to map the genetic diversity present in Mtb isolates from the region.

5. Conclusions

This is the first description of the molecular epidemiology of Mtb isolates circulating in NS achieved through WGS. It was possible to perform a comparative genomics analysis between Mtb isolates against the universal references H37Rv and Colombian UT205 genome, which revealed various changes in each genome evaluated that can help us to understand the local genetic diversity and is relevant for epidemiological studies, providing insight into TB transmission dynamics in NS, Colombia.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/tropicalmed9090197/s1. Figure S1. Identity matrix. The image, corresponding to the results obtained using the LASTZ tool, shows the identity matrix generated for the Mtb genomes of this study and the Colombian UT205 strain, and, compared to the universal reference, the H37Rv genome. Figure S2. Gene modification in the gene that codes for the LpdA protein. As revealed using LASTZ and Mauve, the image shows a SNP in position 1416 observed during comparison against the H37Rv genome, where a termination codon is generated in the protein, leading to a loss of 21 amino acids towards the C-terminus. Figure S3. Genetic modification in the gene that codes for the PE35 protein. As revealed using LASTZ and Mauve and in comparison with the H37Rv genome, the image reveals a SNP in position 295, where a termination codon is generated in the protein, leading to a loss of one amino acid towards the C-terminus. Table S1. Summary of the total genome sequencing process of the 18 Mtb isolates from NS, Colombia.

Author Contributions

Conceptualization, D.P.B.-L.; methodology, D.P.B.-L., R.R., K.D.B.-S., and P.C.-M.; acquisition and analysis of data, D.P.B.-L., K.D.B.-S., and R.R.; interpretation of data, D.P.B.-L. and K.D.B.-S.; writing—original draft preparation, D.P.B.-L. and K.D.B.-S.; writing—review and editing, R.R. and P.C.-M.; funding acquisition, D.P.B.-L. and R.R. All authors have read and agreed to the published version of the manuscript.

Funding

This project was financed by the Vice-Rector’s Office of Research, Universidad de Pamplona, through the Internal Announcement Program Projects Bank 2013 (PR130-00-051; GA150-BP-II-2013-2.1.2.2.1) and 2017 (400-156.012-041; GA313-BP-2017). D.P.B.-L. received a study commission from the Universidad de Pamplona (Agreement 005 of February; 2016) for her Doctorate in Biotechnology from the Universidad de Antioquia.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of the HUEM (protocol 2015-136-003097-2) for studies involving humans.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The raw dataset generated and analyzed during this study will be provided by the authors upon reasonable request.

Acknowledgments

We would like to express our gratitude to the staff members of the Mycobacteria section of the Clinical Laboratory, as well as the Epidemiology section of the ESE HUEM in the city of Cúcuta, for their support in the samples collection and for granting access to information. We would like to thank all the patients who gave their consent to participate in this project and Martha Inírida Guerrero for her contributions and unconditional support in carrying out this study.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

  1. WHO. Global Tuberculosis Report 2023. 2023. Available online: https://www.who.int/teams/global-tuberculosis-programme/tb-reports/global-tuberculosis-report-2023 (accessed on 7 February 2024).
  2. WHO. Global Tuberculosis Report 2023. Tuberculosis Data. Interactive Visualizations. Diagnosis Notification and Treatment of Rifampicin-Resistant TB. 2023. Available online: https://www.who.int/teams/global-tuberculosis-programme/data (accessed on 3 March 2024).
  3. Gagneux, S. Ecology and evolution of Mycobacterium tuberculosis. Nat. Rev. Microbiol. 2018, 16, 202–213. [Google Scholar] [CrossRef] [PubMed]
  4. Coscolla, M.; Gagneux, S.; Menardo, F.; Loiseau, C.; Ruiz-Rodriguez, P.; Borrell, S.; Otchere, I.D.; Asante-Poku, A.; Asare, P.; Sánchez-Busó, L.; et al. Phylogenomics of Mycobacterium africanum reveals a new lineage and a complex evolutionary history. Microb. Genom. 2021, 7, 000477. [Google Scholar] [CrossRef] [PubMed]
  5. Guyeux, C.; Senelle, G.; Meur, A.L.; Supply, P.; Gaudin, C.; Phelan, J.E.; Clark, T.G.; Rigouts, L.; de Jong, B.; Sola, C.; et al. Newly Identified Mycobacterium africanum Lineage 10, Central Africa-Volume 30, Number 3—March 2024-Emerging Infectious Diseases Journal-CDC. Available online: https://wwwnc.cdc.gov/eid/article/30/3/23-1466_article (accessed on 3 March 2024).
  6. Napier, G.; Campino, S.; Merid, Y.; Abebe, M.; Woldeamanuel, Y.; Aseffa, A.; Hibberd, M.L.; Phelan, J.; Clark, T.G. Robust barcoding and identification of Mycobacterium tuberculosis lineages for epidemiological and clinical studies. Genome Med. 2020, 12, 114. [Google Scholar] [CrossRef] [PubMed]
  7. MinSalud; INS. Informe Evento Tuberculosis PEXII 2023 [Internet]. 2023. Available online: https://www.ins.gov.co/buscador-eventos/Informesdeevento/TUBERCULOSIS%20PE%20XII%202023.pdf (accessed on 12 July 2024).
  8. DANE. Estadísticas Vitales (EEVV). 2022. Available online: https://www.dane.gov.co/files/investigaciones/poblacion/pre_estadisticasvitales_IIItrim_2022pr.pdf (accessed on 7 March 2024).
  9. DANE. Estadísticas Vitales. Cifra de Defunciones. 2023. Available online: https://www.dane.gov.co/files/operaciones/EEVV/pres-EEVV-Defunciones-IIItrim2023.pdf (accessed on 7 March 2024).
  10. Plata-Casas, L.; Gutierrez-Lesmes, O.; Cala-Vitery, F. Tuberculosis Disability Adjusted Life Years, Colombia 2010–2018. Trop. Med. Infect. Dis. 2022, 7, 250. [Google Scholar] [CrossRef]
  11. Jia, X.; Yang, L.; Dong, M.; Chen, S.; Lv, L.; Cao, D.; Fu, J.; Yang, T.; Zhang, J.; Zhang, X.; et al. The Bioinformatics Analysis of Comparative Genomics of Mycobacterium tuberculosis Complex (MTBC) Provides Insight into Dissimilarities between Intraspecific Groups Differing in Host Association, Virulence, and Epitope Diversity. Front. Cell Infect. Microbiol. 2017, 7, 88. [Google Scholar] [CrossRef]
  12. Brites, D.; Gagneux, S. The Nature and Evolution of Genomic Diversity in the Mycobacterium tuberculosis Complex. In Strain Variation in the Mycobacterium tuberculosis Complex: Its Role in Biology, Epidemiology and Control; Gagneux, S., Ed.; Springer International Publishing: Cham, Switzerland, 2017; pp. 1–26. [Google Scholar] [CrossRef]
  13. Gagneux, S. Host–pathogen coevolution in human tuberculosis. Philos. Trans. R. Soc. B Biol. Sci. 2012, 367, 850–859. [Google Scholar] [CrossRef]
  14. McHenry, M.L.; Bartlett, J.; Igo, R.P.; Wampande, E.M.; Benchek, P.; Mayanja-Kizza, H.; Fluegge, K.; Hall, N.B.; Gagneux, S.; Tishkoff, S.A.; et al. Interaction between host genes and Mycobacterium tuberculosis lineage can affect tuberculosis severity: Evidence for coevolution? PLoS Genet. 2020, 16, e1008728. [Google Scholar] [CrossRef]
  15. Zwyer, M.; Rutaihwa, L.K.; Windels, E.; Hella, J.; Menardo, F.; Sasamalo, M.; Sommer, G.; Schmülling, L.; Borrell, S.; Reinhard, M.; et al. Back-to-Africa introductions of Mycobacterium tuberculosis as the main cause of tuberculosis in Dar es Salaam, Tanzania. PLoS Pathog. 2023, 19, e1010893. [Google Scholar] [CrossRef]
  16. Osei-Wusu, S.; Otchere, I.D.; Morgan, P.; Musah, A.B.; Siam, I.M.; Asandem, D.; Afum, T.; Asare, P.; Asante-Poku, A.; Kusi, K.A.; et al. Genotypic and phenotypic diversity of Mycobacterium tuberculosis complex genotypes prevalent in West Africa. PLoS ONE 2021, 16, e0255433. [Google Scholar] [CrossRef]
  17. Keating, L.A.; Wheeler, P.R.; Mansoor, H.; Inwald, J.K.; Dale, J.; Hewinson, R.G.; Gordon, S.V. The pyruvate requirement of some members of the Mycobacterium tuberculosis complex is due to an inactive pyruvate kinase: Implications for in vivo growth. Mol. Microbiol. 2005, 56, 163–174. [Google Scholar] [CrossRef]
  18. Forrellad, M.A.; Klepp, L.I.; Gioffré, A.; Sabio y García, J.; Morbidoni, H.R.; Santangelo, M.d.l.P.; Cataldi, A.A.; Bigi, F. Virulence factors of the Mycobacterium tuberculosis complex. Virulence 2013, 4, 3–66. [Google Scholar] [CrossRef] [PubMed]
  19. Sobkowiak, B.; Banda, L.; Mzembe, T.; Crampin, A.C.; Glynn, J.R.; Clark, T.G. Bayesian reconstruction of Mycobacterium tuberculosis transmission networks in a high incidence area over two decades in Malawi reveals associated risk factors and genomic variants. Microb. Genom. 2020, 6, e000361. [Google Scholar] [CrossRef] [PubMed]
  20. Faksri, K.; Xia, E.; Ong, R.T.-H.; Tan, J.H.; Nonghanphithak, D.; Makhao, N.; Thamnongdee, N.; Thanormchat, A.; Phurattanakornkul, A.; Rattanarangsee, S.; et al. Comparative whole-genome sequence analysis of Mycobacterium tuberculosis isolated from tuberculous meningitis and pulmonary tuberculosis patients. Sci. Rep. 2018, 8, 4910. [Google Scholar] [CrossRef]
  21. Hernández Pando, R.; Aguilar, D.; Cohen, I.; Guerrero, M.; Ribon, W.; Acosta, P.; Orozco, H.; Marquina, B.; Salinas, C.; Rembao, D.; et al. Specific bacterial genotypes of Mycobacterium tuberculosis cause extensive dissemination and brain infection in an experimental model. Tuberculosis 2010, 90, 268–277. [Google Scholar] [CrossRef]
  22. Bohada-Lizarazo, D.P. Variabilidad Genómica de Mycobacterium tuberculosis como Elemento Explicativo del alto Riesgo para Tuberculosis del Departamento de Norte de Santander. Doctoral Thesis, Universidad de Antioquia, Medellín, Colombia, 2024. [Google Scholar]
  23. INS. Tuberculosis. Manual de Procedimientos. Segunda edición. Instituto Nacional de Salud. 1987. Available online: https://www.minsalud.gov.co/sites/rid/Lists/BibliotecaDigital/RIDE/IA/INS/tuberculosis-manual-procedimientos.pdf (accessed on 10 November 2023).
  24. van Soolingen, D. Molecular epidemiology of tuberculosis and other mycobacterial infections: Main methodologies and achievements. J. Intern. Med. 2001, 249, 1–26. [Google Scholar] [CrossRef]
  25. Votintseva, A.A.; Pankhurst, L.J.; Anson, L.W.; Morgan, M.R.; Gascoyne-Binzi, D.; Walker, T.M.; Quan, T.P.; Wyllie, D.H.; Del Ojo Elias, C.; Wilcox, M.; et al. Mycobacterial DNA Extraction for Whole-Genome Sequencing from Early Positive Liquid (MGIT) Cultures. J. Clin. Microbiol. 2015, 53, 1137–1143. [Google Scholar] [CrossRef] [PubMed]
  26. Bolyen, E.; Rideout, J.R.; Dillon, M.R.; Bokulich, N.A.; Abnet, C.C.; Al-Ghalith, G.A.; Alexander, H.; Alm, E.J.; Arumugam, M.; Asnicar, F.; et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 2019, 37, 852–857. [Google Scholar] [CrossRef]
  27. Prodan, A.; Tremaroli, V.; Brolin, H.; Zwinderman, A.H.; Nieuwdorp, M.; Levin, E. Comparing bioinformatic pipelines for microbial 16S rRNA amplicon sequencing. PLoS ONE 2020, 15, e0227434. [Google Scholar] [CrossRef]
  28. Li, D.; Luo, R.; Liu, C.-M.; Leung, C.-M.; Ting, H.-F.; Sadakane, K.; Yamashita, H.; Lam, T.-W. MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices. Methods 2016, 102, 3–11. [Google Scholar] [CrossRef]
  29. Wood, D.E.; Salzberg, S.L. Kraken: Ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014, 15, R46. [Google Scholar] [CrossRef]
  30. Katoh, K.; Rozewicki, J.; Yamada, K.D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2019, 20, 1160–1166. [Google Scholar] [CrossRef]
  31. Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30, 1312–1313. [Google Scholar] [CrossRef] [PubMed]
  32. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef] [PubMed]
  33. Tarasov, A.; Vilella, A.J.; Cuppen, E.; Nijman, I.J.; Prins, P. Sambamba: Fast processing of NGS alignment formats. Bioinformatics 2015, 31, 2032–2034. [Google Scholar] [CrossRef]
  34. Gundabolu, S.C.; Vijaykumar, T.N.; Thottethodi, M. FastZ: Accelerating gapped whole genome alignment on GPUs. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, St. Louis, MI, USA, 14–19 November 2021; ACM: New York, NY, USA, 2021; pp. 1–13. [Google Scholar] [CrossRef]
  35. Darling, A.C.E.; Mau, B.; Blattner, F.R.; Perna, N.T. Mauve: Multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004, 14, 1394–1403. [Google Scholar] [CrossRef] [PubMed]
  36. WMA-The World Medical Association-WMA Declaration of Helsinki–Ethical Principles for Medical Research Involving Human Subject. 2022. Available online: https://www.wma.net/policies-post/wma-declaration-of-helsinki-ethical-principles-for-medical-research-involving-human-subjects/ (accessed on 7 June 2024).
  37. Narvaiz de Kantor, I.; Kim, S.J.; Frieden, T.R.; Laszlo, A.; Luelmo, F.; Norval, P.Y.; Rieder, H.; Valenzuela, P.; Weyer, K. Laboratory Services in Tuberculosis Control. 1998. Available online: https://iris.who.int/handle/10665/65942 (accessed on 17 July 2024).
  38. Sánchez-Corrales, L.; Tovar-Aguirre, O.L.; Galeano-Vanegas, N.F.; Jiménez, P.A.C.; Martínez-Vega, R.A.; Maldonado-Londoño, C.E.; Hernández-Botero, J.S.; Siller-López, F. Phylogenomic analysis and Mycobacterium tuberculosis antibiotic resistance prediction by whole-genome sequencing from clinical isolates of Caldas, Colombia. PLoS ONE 2021, 16, e0258402. [Google Scholar] [CrossRef]
  39. Baena, A.; Cabarcas, F.; Ocampo, J.C.; Barrera, L.F.; Alzate, J.F. Large genomic deletions delineate Mycobacterium tuberculosis L4 sublineages in South American countries. PLoS ONE 2023, 18, e0285417. [Google Scholar] [CrossRef]
  40. Puerto, G.; Erazo, L.; Wintaco, M.; Castro, C.; Ribón, W.; Guerrero, M.I. Mycobacterium tuberculosis Genotypes Determined by Spoligotyping to Be Circulating in Colombia between 1999 and 2012 and Their Possible Associations with Transmission and Susceptibility to First-Line Drugs. PLoS ONE 2015, 10, e0124308. [Google Scholar] [CrossRef]
  41. Realpe, T.; Correa, N.; Rozo, J.C.; Ferro, B.E.; Gómez, V.; Zapata, E.; Ribon, W.; Puerto, G.; Castro, C.; Nieto, L.M.; et al. Population structure among Mycobacterium tuberculosis isolates from pulmonary tuberculosis patients in Colombia. PLoS ONE 2014, 9, e93848. [Google Scholar] [CrossRef]
  42. Ritacco, V.; Iglesias, M.-J.; Ferrazoli, L.; Monteserin, J.; Dalla Costa, E.R.; Cebollada, A.; Morcillo, N.; Robledo, J.; de Waard, J.H.; Araya, P.; et al. Conspicuous multidrug-resistant Mycobacterium tuberculosis cluster strains do not trespass country borders in Latin America and Spain. Infect. Genet. Evol. 2012, 12, 711–717. [Google Scholar] [CrossRef]
  43. Asadi, L.; Croxen, M.; Heffernan, C.; Dhillon, M.; Paulsen, C.; Egedahl, M.L.; Tyrrell, G.; Doroshenko, A.; Long, R. How much do smear-negative patients really contribute to tuberculosis transmissions? Re-examining an old question with new tools. eClinicalMedicine 2022, 43, 101250. [Google Scholar] [CrossRef] [PubMed]
  44. Ruíz-Roa, S.L.; Rojas, S.M.M.; Galeano, I.A.F.S. Tuberculosis en la frontera colombo-venezolana: Distribución geoespacial. Rev. Peru Med. Exp. Salud Pública 2022, 39, 362–367. [Google Scholar] [CrossRef] [PubMed]
  45. Shastry, B.S. SNPs: Impact on Gene Function and Phenotype. In Single Nucleotide Polymorphisms: Methods and Protocols; Komar, A.A., Ed.; Methods in Molecular BiologyTM; Humana Press: Totowa, NJ, USA, 2009; pp. 3–22. [Google Scholar] [CrossRef]
  46. Tischler, A.D.; Leistikow, R.L.; Kirksey, M.A.; Voskuil, M.I.; McKinney, J.D. Mycobacterium tuberculosis Requires Phosphate-Responsive Gene Regulation To Resist Host Immunity. Infect. Immun. 2013, 81, 317–328. [Google Scholar] [CrossRef]
  47. White, D.W.; Elliott, S.R.; Odean, E.; Bemis, L.T.; Tischler, A.D. Mycobacterium tuberculosis Pst/SenX3-RegX3 Regulates Membrane Vesicle Production Independently of ESX-5 Activity. mBio 2018, 9, e00778-18. [Google Scholar] [CrossRef]
  48. Tischler, A.D.; Leistikow, R.L.; Ramakrishnan, P.; Voskuil, M.I.; McKinney, J.D. Mycobacterium tuberculosis Phosphate Uptake System Component PstA2 Is Not Required for Gene Regulation or Virulence. PLoS ONE 2016, 11, e0161467. [Google Scholar] [CrossRef] [PubMed]
  49. Argyrou, A.; Vetting, M.W.; Blanchard, J.S. Characterization of a New Member of the Flavoprotein Disulfide Reductase Family of Enzymes from Mycobacterium tuberculosis. J. Biol. Chem. 2004, 279, 52694–52702. [Google Scholar] [CrossRef] [PubMed]
  50. Leung, K.S.S.; Siu, G.K.H.; Tam, K.K.G.; To, S.W.C.; Rajwani, R.; Ho, P.L.; Wong, S.S.-Y.; Zhao, W.W.; Ma, O.C.-K.; Yam, W.-C. Comparative Genomic Analysis of Two Clonally Related Multidrug Resistant Mycobacterium tuberculosis by Single Molecule Real Time Sequencing. Front. Cell. Infect. Microbiol. 2017, 7, 478. [Google Scholar] [CrossRef]
  51. Jiang, Y.; Wei, J.; Liu, H.; Li, G.; Guo, Q.; Qiu, Y.; Zhao, L.; Li, M.; Zhao, X.; Dou, X.; et al. Polymorphisms in the PE35 and PPE68 antigens in Mycobacterium tuberculosis strains may affect strain virulence and reflect ongoing immune evasion. Mol. Med. Rep. 2016, 13, 947–954. [Google Scholar] [CrossRef]
  52. D’Souza, C.; Kishore, U.; Tsolaki, A.G. La familia PE-PPE de Mycobacterium tuberculosis: Proteínas camufladas. Immunobiology 2023, 228, 152321. [Google Scholar] [CrossRef]
  53. Bansal, K.; Sinha, A.Y.; Ghorpade, D.S.; Togarsimalemath, S.K.; Patil, S.A.; Kaveri, S.V.; Balaji, K.N.; Bayry, J. Src Homology 3-interacting Domain of Rv1917c of Mycobacterium tuberculosis Induces Selective Maturation of Human Dendritic Cells by Regulating PI3K-MAPK-NF-κB Signaling and Drives Th2 Immune Responses. J. Biol. Chem. 2010, 285, 36511–36522. [Google Scholar] [CrossRef]
  54. Gómez-González, P.J.; Grabowska, A.D.; Tientcheu, L.D.; Tsolaki, A.G.; Hibberd, M.L.; Campino, S.; Phelan, J.E.; Clark, T.G. Functional genetic variation in pe/ppe genes contributes to diversity in Mycobacterium tuberculosis lineages and potential interactions with the human host. Front. Microbiol. 2023, 14, 1244319. [Google Scholar] [CrossRef] [PubMed]
  55. McEvoy, C.R.E.; Cloete, R.; Müller, B.; Schürch, A.C.; van Helden, P.D.; Gagneux, S.; Warren, R.M.; van Pittius, N.C.G. Comparative Analysis of Mycobacterium tuberculosis pe and ppe Genes Reveals High Sequence Variation and an Apparent Absence of Selective Constraints. PLoS ONE 2012, 7, e30593. [Google Scholar] [CrossRef] [PubMed]
  56. Guilhot, C.; Chalut, C.; Daffé, M. Biosynthesis and Roles of Phenolic Glycolipids and Related Molecules in Mycobacterium tuberculosis. In The Mycobacterial Cell Envelope; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2008; pp. 271–289. [Google Scholar] [CrossRef]
  57. Pérez, E.; Constant, P.; Laval, F.; Lemassu, A.; Lanéelle, M.-A.; Daffé, M.; Guilhot, C. Molecular dissection of the role of two methyltransferases in the biosynthesis of phenolglycolipids and phthiocerol dimycoserosate in the Mycobacterium tuberculosis complex. J. Biol. Chem. 2004, 279, 42584–42592. [Google Scholar] [CrossRef] [PubMed]
  58. Tundup, S.; Mohareer, K.; Hasnain, S.E. Mycobacterium tuberculosis PE25/PPE41 protein complex induces necrosis in macrophages: Role in virulence and disease reactivation? FEBS Open Bio 2014, 4, 822–828. [Google Scholar] [CrossRef]
  59. Stucki, D.; Brites, D.; Jeljeli, L.; Coscolla, M.; Liu, Q.; Trauner, A.; Fenner, L.; Rutaihwa, L.; Borrell, S.; Luo, T.; et al. Mycobacterium tuberculosis lineage 4 comprises globally distributed and geographically restricted sublineages. Nat. Genet. 2016, 48, 1535–1543. [Google Scholar] [CrossRef]
  60. Noda, G.S.; Barrios, M.F.; Díaz, Y.R. Evasión del sistema inmune por Mycobacterium tuberculosis: Mecanismos moleculares. Rev. Cuba Tecnol. Salud 2018, 9, 191–204. [Google Scholar]
  61. Fishbein, S.; van Wyk, N.; Warren, R.M.; Sampson, S.L. Phylogeny to function: PE/PPE protein evolution and impact on Mycobacterium tuberculosis pathogenicity. Mol. Microbiol. 2015, 96, 901–916. [Google Scholar] [CrossRef]
  62. Baena, A.; Cabarcas, F.; Alvarez-Eraso, K.L.F.; Isaza, J.P.; Alzate, J.F.; Barrera, L.F. Differential determinants of virulence in two Mycobacterium tuberculosis Colombian clinical isolates of the LAM09 family. Virulence 2019, 10, 695–710. [Google Scholar] [CrossRef]
  63. Isaza, J.P.; Duque, C.; Gomez, V.; Robledo, J.; Barrera, L.F.; Alzate, J.F. Whole genome shotgun sequencing of one Colombian clinical isolate of Mycobacterium tuberculosis reveals DosR regulon gene deletions. FEMS Microbiol. Lett. 2012, 330, 113–120. [Google Scholar] [CrossRef]
  64. Baruzzo, G.; Serafini, A.; Finotello, F.; Sanavia, T.; Cioetto-Mazzabò, L.; Boldrin, F.; Lavezzo, E.; Barzon, L.; Toppo, S.; Provvedi, R.; et al. Role of the Extracytoplasmic Function Sigma Factor SigE in the Stringent Response of Mycobacterium tuberculosis. Microbiol. Spectr. 2023, 11, e0294422. [Google Scholar] [CrossRef]
  65. Sachdeva, P.; Misra, R.; Tyagi, A.K.; Singh, Y. The sigma factors of Mycobacterium tuberculosis: Regulation of the regulators. FEBS J. 2010, 277, 605–626. [Google Scholar] [CrossRef] [PubMed]
  66. Xu, Y.; Pooja Borah, K. Mycobacterium tuberculosis carbon and nitrogen metabolic fluxes. Biosci. Rep. 2022, 42, BSR20211215. [Google Scholar] [CrossRef]
  67. Xu, J.Y.; Zhao, L.; Xu, Y.; Li, B.; Zhai, L.; Tan, M.; Ye, B.C. Dynamic Characterization of Protein and Posttranslational Modification Levels in Mycobacterial Cholesterol Catabolism. mSystems 2020, 5, e00424-19. [Google Scholar] [CrossRef] [PubMed]
  68. Abou-Fadel, J.S. Steroid-CoA Ligases, FadD17A1 and FadD19A1, and Their Role in Cholesterol Side-Chain Degradation in Mycobacterium tuberculosis. ETD Collect Univ Tex El Paso. 2018, pp. 1–129. Available online: https://scholarworks.utep.edu/dissertations/AAI10814786 (accessed on 6 July 2024).
  69. López, M.; Quitian, L.V.; Calderón, M.N.; Soto, C.Y. The P-type ATPase CtpG preferentially transports Cd2+ across the Mycobacterium tuberculosis plasma membrane. Arch. Microbiol. 2018, 200, 483–492. [Google Scholar] [CrossRef] [PubMed]
  70. Botella, H.; Peyron, P.; Levillain, F.; Poincloux, R.; Poquet, Y.; Brandli, I.; Wang, C.; Tailleux, L.; Tilleul, S.; Charrière, G.M.; et al. Mycobacterial P1-Type ATPases Mediate Resistance to Zinc Poisoning in Human Macrophages. Cell Host Microbe 2011, 10, 248–259. [Google Scholar] [CrossRef]
  71. Chaiyachat, P.; Chaiprasert, A.; Nonghanphithak, D.; Smithtikarn, S.; Kamolwat, P.; Pungrassami, P.; Reechaipichitkul, W.; Ong, R.T.-H.; Teo, Y.-Y.; Faksri, K. Whole-genome analysis of drug-resistant Mycobacterium tuberculosis reveals novel mutations associated with fluoroquinolone resistance. Int. J. Antimicrob. Agents 2021, 58, 106385. [Google Scholar] [CrossRef]
Figure 1. Phylogenetic tree. Phylogeny of the genomes of Mtb evaluated against the Colombian UT205 strain and the reference H37Rv genome using the RAxML tool. Group 1 is shown in red, group 2 is shown in blue, and group 3 is shown in green; isolates in black are not related to any group.
Figure 1. Phylogenetic tree. Phylogeny of the genomes of Mtb evaluated against the Colombian UT205 strain and the reference H37Rv genome using the RAxML tool. Group 1 is shown in red, group 2 is shown in blue, and group 3 is shown in green; isolates in black are not related to any group.
Tropicalmed 09 00197 g001
Figure 2. L4 Sublineages of Mtb circulating in the municipalities of NS, Colombia.
Figure 2. L4 Sublineages of Mtb circulating in the municipalities of NS, Colombia.
Tropicalmed 09 00197 g002
Figure 3. Genetic modification in the gene that codes for the PstA1 protein. In the image, which shows the results generated using LASTZ and Mauve tools, it is observed comparatively against the H37Rv genome a SNP at position 915, a termination codon in the protein causing a loss of 3 amino acids towards the C-terminal region.
Figure 3. Genetic modification in the gene that codes for the PstA1 protein. In the image, which shows the results generated using LASTZ and Mauve tools, it is observed comparatively against the H37Rv genome a SNP at position 915, a termination codon in the protein causing a loss of 3 amino acids towards the C-terminal region.
Tropicalmed 09 00197 g003
Figure 4. Gene modifications in the gene coding the PPE34 protein. In the image, which shows the results generated using LASTZ and Mauve tools, it is observed comparatively against the H37Rv genome that the UT205 genome is shown, where two tandem insertions of 69 bp were identified, in addition to two deletions of 207 and 69 bp. In the 21088X11 genome, an insertion and a deletion can be observed, both amounting to 138 bp, while, in the 21088X6 genome, two deletions appear, one of 207 bp and the other of 69 bp, as well as an insertion of 207 bp.
Figure 4. Gene modifications in the gene coding the PPE34 protein. In the image, which shows the results generated using LASTZ and Mauve tools, it is observed comparatively against the H37Rv genome that the UT205 genome is shown, where two tandem insertions of 69 bp were identified, in addition to two deletions of 207 and 69 bp. In the 21088X11 genome, an insertion and a deletion can be observed, both amounting to 138 bp, while, in the 21088X6 genome, two deletions appear, one of 207 bp and the other of 69 bp, as well as an insertion of 207 bp.
Tropicalmed 09 00197 g004
Figure 5. Genetic modifications (SNPs) in the PE_PGRS2 gene. The image, showing the results obtained using LASTZ and Mauve tools, depicts a deletion in the UT205 genome.
Figure 5. Genetic modifications (SNPs) in the PE_PGRS2 gene. The image, showing the results obtained using LASTZ and Mauve tools, depicts a deletion in the UT205 genome.
Tropicalmed 09 00197 g005
Figure 6. Gene modifications in the ctpG and PPE59 genes. The image, showing results obtained using LASTZ and Mauve tools, shows the following: (A) gene deletion of 3650 bp in the ctpG gene; (B) deletion of 419 bp in the PPE59 gene present in the UT205 genome.
Figure 6. Gene modifications in the ctpG and PPE59 genes. The image, showing results obtained using LASTZ and Mauve tools, shows the following: (A) gene deletion of 3650 bp in the ctpG gene; (B) deletion of 419 bp in the PPE59 gene present in the UT205 genome.
Tropicalmed 09 00197 g006
Table 1. Study population characteristics.
Table 1. Study population characteristics.
SampleSexAgeMunicipality
of Origen
Commune/Township of CúcutaOvercrowdingHomelessType
of TB
ComorbidityHIV
Result
BK
Result *
Drug
Susceptibility
col180M42Cúcuta6NoHLPulmonaryHIVPositive2+Sensitive
col179M42Cúcuta6NoHLPulmonaryHIVPositive3+Sensitive
col119F45Cúcuta4NoNoPulmonaryNoneNegativeNegativeSensitive
col41M68CúcutaAguaclaraNoNoPulmonaryNoneNegative2+Sensitive
col207F56Cúcuta6NoNoPulmonaryNoneNegativeNegativeSensitive
col178M32Cúcuta5NoNoPulmonaryNoneNegative2+Sensitive
col137M71Cúcuta3NoNoPulmonaryDMNegative1+Sensitive
col177M73ChinácotaWithout communeNoNoPulmonaryNoneNegative1+Sensitive
col117M26Cúcuta9NoNoPulmonaryNoneNegativeNegativeSensitive
col213F17El ZuliaWithout communeNoNoPulmonaryNoneNegative3+Sensitive
col201M72TibúWithout communeNoNoPulmonaryNoneNegative1+Sensitive
col174F28CúcutaBuena
Esperanza
NoNoPulmonaryMalnutritionNegative1+Sensitive
col173M51Cúcuta7NoNoPulmonaryMalnutritionNegative1+Sensitive
col132M71Cúcuta9NoHLPulmonaryMalnutritionNegative3+Sensitive
col124F33Cúcuta3NoNoPulmonaryNoneNegativeNegativeSensitive
col123F33Cúcuta3NoNoPulmonaryNoneNegativeNegativeSensitive
col210M32Cúcuta2NoNoExtrapulmonaryNoneNegativeWithout DataRR
col121F22Cúcuta1NoNoPulmonaryNoneNegative3+Sensitive
Note. HL: homeless, DM: diabetes mellitus, and RR: rifampicin resistance. * WHO scale: Negative: No AFB found in at least 100 microscopic fields, 1+: 10 to 99 AFB per 100 microscopic fields, 2+: 1 to 10 AFB per field in at least 50 microscopic fields, and 3+: more than 10 AFB per microscopic field in at least 20 fields [37].
Table 2. Genetic variants found in Mtb isolates compared to the H37Rv genome.
Table 2. Genetic variants found in Mtb isolates compared to the H37Rv genome.
Gene InvolvedProtein CodingType of ModificationVariant FrequencyIsolates InvolvedProtein EffectBiological Function
rv2958cPGL/p-HBAD biosynthesis glycosyltransferaseSubstitutions11.10%21088X12, X15Protein spreadPGL/p-HBAD Biosynthesis
rv2962cPGL/p-HBAD biosynthesis rhamnosyltransferaseSNP Transition22%21088X4, X5, X12, X13Truncation
rv2959cRhamnosyl O-methyltransferaseSNP Transition38.9%21088X1, X2, X8, X11, X14, X15, X17Truncation
rv0279cPE PGRS4Substitutions33.3%21088X1, X6, X7, X9, X10, X11, X12, X14, X17Protein spreadVirulence and evasion of the host immune response
rv0978cPE PGRS17SNP transversion41.90%21088X1, X2, X3, X4, X5, X6, X7, X8, X9, X11, X13, X16, X18Protein spread
rv0980cPE PGRS18SNP transversion21.70%21088X10, X12, X14, X15, X17Truncation
rv3344cPE PGRS49Substitutions35.70%21088X1, X2, X4, X5, X7, X9, X10, X12, X13, X14Loss of start codon
rv3508PE PGRS54Substitutions5.3%21088X5Protein spread
23.50%21088X1, X2, X3, X6, X12, X13, X15, X18Truncation
rv0304cPPE5Substitutions50%21088X2, X3, X4, X5, X6, X7, X8, X9, X10, X11, X12, X13, X14, X15, X16, X17, X18Loss of start codon
rv0305cPPE6Substitutions2.9%21088X11Truncation
45.70%21088X2, X3, X4, X5, X6, X7, X8, X9, X10, X12, X13, X14, X15, X16, X17, X18Protein spread
rv0354cPPE7SNP transversion47.10%21088X2, X3, X4, X5, X6, X7, X8, X9, X10, X12, X13, X14, X15, X16, X17, X18Protein spread
rv1917cPPE34SNP8.20%21088X12, X13, X17Protein spread
rv3125cPPE49SNP transversion22.20%21088X3, X6, X16, X18Truncation
rv3872PE35SNP transversion100%21088X1-18Truncation
rv0180cTransmembrane proteinSNP Transition5.60%21088X9TruncationMembrane proteins
rv0446cTransmembrane proteinSNP Transition72%21088X1, X2, X4, X5, X7, X8, X10, X11, X12, X13, X14, X15, X17Truncation
rv1624c Integral membrane proteinSNP Transition11.10%21088X4, X5 Truncation
rv2120cIntegral membrane proteinSNP Transition11.10%21088X10, X11Truncation
rv2395Membrane proteinSNP Transition11.10%21088X12, X13Truncation
rv3870EccCa1 CDSSubstitutions35.70%21088X3, X5, X6, X7, X8, X9, X12, X14, X16, X17Protein spreadProteins related to the immune response
rv0305cPPE6 CDSSubstitutions2.90%21088X11Truncation
45%21088X2, X3, X4, X5, X6, X7, X8, X9, X10, X12, X13, X14, X15, X16, X17, X18Protein spread
rv0354cPPE7 CDSSNP Transition47.10%21088X2, X3, X4, X5, X6, X7, X8, X9, X10, X12, X13, X14, X15, X16, X17, X18Extension
rv0577TB27.3SNP transversion5.60%21088X11Truncation
rv0113GmhA CDSSNP transversion5.90%21088X2Truncation
rv2221cGlnE CDSSNP Transition5.60%21088X9Loss of start codon
rv2673AftC CDSSNP Transition33.30%21088X3, X4, X7, X8, X9, X12, X13, X14, X18Truncation
rv0197OxidoreductaseSNP transversion88.90%21088X1, X2, X3, X6, X7, X8, X9, X10, X11, X12, X13, X14, X15, X16, X17, X18TruncationEnzymes and metabolic proteins
rv0389PurT CDSSNP Transition5.60%21088X9Truncation
rv0873FadE10 CDSSNP Transversion5.60%21088X17Truncation
rv0930PstA1 CDSSNP Transition100%21088X1-18Truncation
rv3287cRsbW CDSSNP Transversion5.60%21088X17Truncation
rv3303cLpdA CDSSNP Transversion100%21088X1-18Truncation
rv3618MonooxygenaseSNP Transition11.10%21088X4, X5 Truncation
rv3378cDiterpene synthaseSNP Transversion11.10%21088X1, X14Truncation
rv3701cEgtD CDSSNP Transversion5.60%21088X9Truncation
rv3097cLipY CDSSNP Transition5.60%21088X6Truncation
rv1167cTranscriptional regulatorSNP Transversion5.60%21088X2Protein spreadRegulation and transcription proteins
rv3050cAsn C family transcriptional regulatorSNP Transition5.60%21088X17Truncation
rv1189SigI CDSSNP Transversion5.60%21088X7Truncation
rv1329cDinG CDSSNP Transversion5.60%21088X10Truncation
rv1493MutB CDSSNP Transition47.10%21088X2, X3, X4, X5, X6, X7, X8, X9, X10, X11, X12, X13, X15, X16, X17, X18Truncation
rv1494MazE4 CDSSNP Transversion47.10%21088X2, X3, X4, X5, X6, X7, X8, X9, X10, X11, X12, X13, X15, X16, X17, X18Truncation
rv1574Phage proteinSNP Transversion50%21088X3, X6, X9, X16, X18Protein spread
rv1575Phage proteinSNP Transversion50%21088X3, X6, X9, X16, X18Truncation
Table 3. Variants found in Mtb isolates compared to the UT205 genome.
Table 3. Variants found in Mtb isolates compared to the UT205 genome.
Gene InvolvedProtein CodingType of ModificationVariant FrequencyProtein EffectBiological Function
rv1917cPPE34 CDSNucleotide insertion100%Protein spreadVirulence and evasion of the host immune response
rv0355cPPE8 CDS
rv1039cPPE15 CDS
rv19cPPE59 CDS
rv0297PE PGRS5 CDS
rv2487cPE PGRS42 CDS
rv0376PE PGRS59 CDS
rv3652PE PGRS60 CDS
rv356PE PGRS51 CDS
ppsAPpsA CDSSubstitution100%No effectLipid biosynthesis and stress response
kdtBKdtB CDSSubstitution
sigJSigJ CDSSubstitution
sigMSigM CDSTandem deletionChange in reading frame
aldAAldA CDSSNPs100%Amino acide residue change
No effects of truncation or change in reading frames have been observed
Lipid and carbohydrate metabolism
rocARocA CDSSNPs
cyp144Cyp144 CDSSNPs
pykAPykA CDSSNPs
gnd1Gnd1 CDSSNPs
gnd1Gnd1 CDSInsertionChange in reading frame
glpQ1GlpQ1 CDSTandem deletion
plsB1PlsB1 CDSSNPs100%Amino acide residue change
No effects of truncation or change in reading frames have been observed
Lipid and carbohydrate metabolism
pks1Pks1 CDS
pks5Pks5 CDS
pks7Pks7 CDS
pks9Pks9 CDS
fadB3FadB3 CDS
fadE28FadE28 CDS
fadD35FadD35 CDS
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bohada-Lizarazo, D.P.; Bravo-Sanabria, K.D.; Cárdenas-Malpica, P.; Rodríguez, R. Comparative Genomic Analysis of Mycobacterium tuberculosis Isolates Circulating in North Santander, Colombia. Trop. Med. Infect. Dis. 2024, 9, 197. https://doi.org/10.3390/tropicalmed9090197

AMA Style

Bohada-Lizarazo DP, Bravo-Sanabria KD, Cárdenas-Malpica P, Rodríguez R. Comparative Genomic Analysis of Mycobacterium tuberculosis Isolates Circulating in North Santander, Colombia. Tropical Medicine and Infectious Disease. 2024; 9(9):197. https://doi.org/10.3390/tropicalmed9090197

Chicago/Turabian Style

Bohada-Lizarazo, Diana Patricia, Karen Dayana Bravo-Sanabria, Paola Cárdenas-Malpica, and Raúl Rodríguez. 2024. "Comparative Genomic Analysis of Mycobacterium tuberculosis Isolates Circulating in North Santander, Colombia" Tropical Medicine and Infectious Disease 9, no. 9: 197. https://doi.org/10.3390/tropicalmed9090197

APA Style

Bohada-Lizarazo, D. P., Bravo-Sanabria, K. D., Cárdenas-Malpica, P., & Rodríguez, R. (2024). Comparative Genomic Analysis of Mycobacterium tuberculosis Isolates Circulating in North Santander, Colombia. Tropical Medicine and Infectious Disease, 9(9), 197. https://doi.org/10.3390/tropicalmed9090197

Article Metrics

Back to TopTop