Next Article in Journal
The Role of SLC22A1 and Genomic Ancestry on Toxicity during Treatment in Children with Acute Lymphoblastic Leukemia of the Amazon Region
Next Article in Special Issue
Origin and Global Expansion of Mycobacterium tuberculosis Complex Lineage 3
Previous Article in Journal
Analysis of Long Non-Coding RNA (lncRNA) uc.38 and uc.63 Expression in Breast Carcinoma Patients
Previous Article in Special Issue
Molecular Epidemiology of Mycobacterium tuberculosis Complex Strains in Urban and Slum Settings of Nairobi, Kenya
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

SNPs in Genes Related to DNA Damage Repair in Mycobacterium Tuberculosis: Their Association with Type 2 Diabetes Mellitus and Drug Resistance

by
Damián E. Pérez-Martínez
1,
Gustavo A. Bermúdez-Hernández
1,
Carlos F. Madrazo-Moya
2,
Irving Cancino-Muñoz
2,3,
Hilda Montero
4,
Cuauhtemoc Licona-Cassani
5,6,7,
Raquel Muñiz-Salazar
6,8,
Iñaki Comas
2,3 and
Roberto Zenteno-Cuevas
4,6,*
1
Programa de Doctorado en Ciencias de la Salud, Instituto de Ciencias de la Salud, Universidad Veracruzana, Av. Luis, Dr. Castelazo Ayala s/n, Col. Industrial Animas, Xalapa 91190, Mexico
2
Biomedical Institute of Valencia IBV-CSIC, C. de Jaume Roig, 11, 46010 Valencia, Spain
3
CIBER of Epidemiology and Public Health, 08908 Madrid, Spain
4
Instituto de Salud Pública, Universidad Veracruzana, Av. Luis Castelazo Ayala s/n, A.P. 57, Col. Industrial Animas, Xalapa 91190, Mexico
5
Escuela de Ingeniería y Ciencias, Tecnológico de Monterrey, Ave. Eugenio Garza Sada 2501 Sur, Monterrey 64849, Mexico
6
Red Multidisciplinaria de Investigación en Tuberculosis, Mexico City 14080, Mexico
7
Division of Integrative Biology, The Institute for Obesity Research, Tecnológico de Monterrey, Monterrey 64849, Mexico
8
Laboratorio de Epidemiología y Ecología Molecular, Escuela de Ciencias de la Salud, Universidad Autónoma de Baja California, Ensenada 22890, Mexico
*
Author to whom correspondence should be addressed.
Genes 2022, 13(4), 609; https://doi.org/10.3390/genes13040609
Submission received: 8 February 2022 / Revised: 19 March 2022 / Accepted: 24 March 2022 / Published: 29 March 2022

Abstract

:
Genes related to DNA damage repair in Mycobacterium tuberculosis are critical for survival and genomic diversification. The aim of this study is to compare the presence of SNPs in genes related to DNA damage repair in sensitive and drug-resistant M. tuberculosis genomes isolated from patients with and without type 2 diabetes mellitus (T2DM). We collected 399 M. tuberculosis L4 genomes from several public repositories; 224 genomes belonging to hosts without T2DM, of which 123 (54.9%) had drug sensitive tuberculosis (TB) and 101 (45.1%) had drug resistance (DR)-TB; and 175 genomes from individuals with T2DM, of which 100 (57.1%) had drug sensitive TB and 75 (42.9%) had DR-TB. The presence of SNPs in the coding regions of 65 genes related to DNA damage repair was analyzed and compared with the resistance profile and the presence/absence of T2DM in the host. The results show the phylogenetic relationships of some SNPS and L4 sub-lineages, as well as differences in the distribution of SNPs present in DNA damage repair-related genes related to the resistance profile of the infecting strain and the presence of T2DM in the host. Given these differences, it was possible to generate two discriminant functions to distinguish between drug sensitive and drug resistant genomes, as well as patients with or without T2DM.

1. Introduction

With more than 10 million cases and 1.2 million deaths, tuberculosis (TB) remains an infectious disease of global importance [1]. Currently, drug resistance (DR), HIV, and type 2 diabetes mellitus (T2DM) are elements that contribute to the status of TB as a severe health problem. In this regard, T2DM leads to a 3.1 (2.27–4.26) fold increased risk of developing TB [2], through reactivation of latent infection or progression of recent infection [3], and the TB-T2DM binomial has been identified as a risk factor for unfavorable treatment outcomes, increasing the probability of failure, relapse, or death [4]. Patients with TB-T2DM are also more frequently found within the recent TB transmission groups [5], having a 4.7 (1.4–11.3) fold increased risk of becoming single-drug resistant, and a 2.8 (2.2–3.4) to 3.5 (1.1–11.1) fold increased risk of becoming multi-drug resistant (TB-MDR) [6,7]. This behavior has been explained as a consequence of the decrease in plasma concentrations of anti-tuberculosis drugs [8] and the interference from medications taken for glycemic control [9]. Besides, high glucose concentrations and immunological alterations in patients with T2DM [10,11,12] have been shown to promote the elongation of active infection and increased proliferation of TB bacilli [13].
From a molecular perspective, Single Nucleotide Polymorphisms (SNPs) are the primary source of variation in M. tuberculosis (Mtb) [14]. Through whole-genome sequencing (WGS) analysis of serial samples taken from affected patients, it was recently observed that a significant number of the genetic variations in Mtb are greatly influenced by the host environment [15,16]. In this sense, genes related to DNA damage repair (GRDDR) in Mtb have been the subject of increasing interest in recent years [17], since they play a fundamental role in maintaining DNA integrity and genomic diversification of Mtb [18,19], mainly in damage generated by the environment in which the bacterium is found, and by the host immune response. Mtb presents multiple DNA damage repair mechanisms, some redundant and others specifically related to induced mutagenesis [17]. It has been documented that the presence of SNPs in some Mycobacterium GRDDR can increase the mutation rate [20] and improve the adaptability of the bacteria [21] and can also be associated with increased DR or drug susceptibility [22].
It remains unknown if T2DM in the host could influence the generation of SNPs in Mtb GRDDRs and consequently drug resistance. Therefore, the aim of this study is to compare the presence of SNPs in GRDDRs in sensitive and drug-resistant Mtb genomes isolated from patients with and without T2DM.

2. Materials and Methods

2.1. Genome Compilation

A search for Mtb genomes was performed in the following public repositories: GenBank, TB Portals, ENA, and Patric. Additionally, a collection of genomes provided by Dr. Iñaki Comas of the Tuberculosis Genomics Unit of the Instituto de Biomedicina de Valencia, Spain, was also considered.
Inclusion of the genomes in the analysis was contingent on the metadata information meeting the following criteria: (1) presence or absence of T2DM in the host, (2) detailed description of the genotypic profile of resistance to first-line drugs (rifampicin, isoniazid, pyrazinamide and ethambutol), (3) coverage values > 99% and depth > 100×, and (4) exclusively belonging to L4, due to its high frequency and global distribution [23]. The final sample consisted of 399 genomes, which were organized into sensitive and drug-resistant TB from hosts with and without T2DM (Supplementary Table S1).

2.2. Bioinformatics Analysis of Genomes

First, the low-quality ends in the sequences (<30) were trimmed using Fastp [24], then Kraken V.2 [25,26] was used to filter reads belonging to the MTBC complex and avoid false variants as a result of DNA contamination. The reads were aligned with the BWA program [27] using a MTBC reference sequence [28], considering the default parameters. Variant calling (SNPs and INDELS) was performed following a previously described and validated pipeline [5,29], which is available online at http://tgu.ibv.csic.es/?page_id=1794, accessed on 4 October 2021. Variants present in at least 20 reads and at ≥90% frequency within each isolate were used to detect phylogenetic mutations and confirm pertinence to L4. In contrast, variants in at least 10 reads with a frequency of ≥10% to ≤90% were termed non-fixed SNPs and used to detect the first- and second-line drug resistance profile.
From the call of variants previously obtained for each one of the genomes, variants with an allelic frequency ≥ 10% in the coding regions of the 65 genes related to DNA damage repair were identified and selected (Supplementary Table S2). A database was then constructed with the non-synonymous SNPs identified in the 65 GRDDRs, the resistance profiles, and the presence/absence of DMT2 in the host.

2.3. Statistical Testing, Clustering, and Discriminant Analysis

To identify differences between resistant and sensitive TB genomes from hosts with and without T2DM, and the various sub-lineages comprising the sample, clustering analyses were performed on the non-synonymous SNPs in the GRDDRs only using the IQ-TREE software [30] (http://iqtree.cibiv.univie.ac.at/, accessed on 1 November 2021) with the default parameters for binary data. The generated consensus trees were visualized using iTOL [31] (https://itol.embl.de/, accessed on 1 November 2021).
SNPs present in >99% of genomes were considered as redundant sites. Identification of sub-lineage-related SNPs was performed using the fixation index (Fts = 1), which indicates that the SNP is fixed to the sub-lineage and is not present outside of it. The fixation index was calculated using the Genepop package for Rstudio [32], excluding mixed infections and single sub-lineages (n = 1).
Inter-group differences were analyzed by chi-square test using IBM SPSS V21 [33] (95% confidence level), excluding redundant SNPs and those fixed in sub-lineages.
For the discriminant analysis, two functions were generated; one to discriminate the presence or absence of T2DM in the host and another to determine DR-TB and sensibility. Both models were developed using the presence or absence of non-synonymous SNPs in each of the 65 GRDDRs analyzed (0 for absence and 1 for presence of SNPs in the gene). For this analysis, redundant SNPs and those fixed in sub-lineages were excluded, as well as genomes that did not present SNPs other than those related to sub-lineages. The eigenvalue, canonical correlation and Wilks’ lambda were calculated as summary values for each discriminant function. Finally, the generated functions were validated by classifying one case (cross-validation) and a 10% random sample (random sample validation) after exclusion from the discriminant function calculation. SPSS® V21 [33] software was used to perform this analysis and the validation.

3. Results

3.1. Population Characteristics

A total of 399 Mtb genomes were recovered, with collection periods ranging from 2010 to 2019, from 10 countries, predominantly Georgia (33.6%), Moldova (13.8%), and Indonesia (12.8%). The individuals carrying these isolates were a mean age of 42.5 years (±15.1), 226 (61.4%) were male, and 175 (43.8%) presented T2DM. A total of 223 strains (55.9%) were classified as sensitive, and 176 (44.1%) were classified as mono-resistant, poly-resistant, multi-drug resistant (MDR-TB), pre-extreme-drug resistant (pre-XDR-TB), or extreme-drug resistant (XDR-TB). Resistance to isoniazid was observed in 35%, to rifampicin in 32%, and to ethambutol in 21%. Phylogenetic analysis confirmed that all strains belonged to L4. Nineteen sub-lineages were identified, of which the most frequent were 4.1 (25.3%), 4.1.2.1 (20.3%), and 4.3.3 (17.8%). Mixed infection by L4 strains was found in 2.3% of the genomes (Table 1).
Regarding the analysis groups, 224 genomes belonged to hosts without T2DM, of which 123 (54.9%) had drug sensitive TB and 101 (45.1%) had DR-TB. On the other hand, 175 genomes were from individuals with T2DM, of which 100 (57.1%) had drug sensitive TB and 75 (42.9%) had DR-TB.
In the 65 GRDDRs analyzed, 352 non-synonymous SNPs were identified. Of these, 346 (98.3%) generated amino acid change, 5 (1.4%) generated early stop codons, and 1 (0.3%) caused stop codon loss. In addition, 241 SNPs (68.5%) were identified as unique in the genomes and 28 SNPs (7.9%) had a frequency ≥ 5% of the sample.

3.2. SNPs in Genes Related to DNA Damage Repair

The genes with the highest number of non-synonymous SNPs were DnaE2 (18 SNPs), RecC (14 SNPs), and LigD (14 SNPs), whereas the genes Dut, Ku, Prim-PolC, RecR, and SSBa presented only one non-synonymous SNP. When contrasting the length of the genes, a positive correlation was observed in which the greater the length of the gene, the greater the number of non-synonymous SNPs (r = 0.639, p < 0.01). This is evidence that nonsynonymous SNPs are variable among GRDDRs, and this seems to be related to their length.
Ninety-nine percent of the genomes analyzed presented SNPs in the genes AlkA (position 1479085G>A), AdnA (3577958C>T), and ImuA (3811629T>C) (Supplementary Table S3). Additionally, <1% of the genomes presented non-synonymous SNPs in SSBa, SSBb, Ku, RecR, RecX, Mpg, DinB2, MazG, Prim-PolC, PolD2, RuvA, RuvC, MutT1, MutT4, and Ung. Notably, RuvX, which encodes a protein responsible for repairing branched DNA structures (Holliday structures) [17], was the only gene lacking non-synonymous SNPs. Likely, the SNPs observed with a frequency > 99% of the sample were previously acquired by a shared ancestor among the different L4 members. On the other hand, genes with lower presence of non-synonymous SNPs could suggest better function or some advantage in terms of conservation.

3.3. Cluster Analysis

The dendrogram created with the 352 non-synonymous SNPs identified in the GRDDRs showed that the clusters generated were in complete agreement with the 19 sub-lineages that comprised the sample, but no association was observed with the presence of T2DM in the host (Figure 1). Only lineages 4.2.1 and 4.10 showed differences in the presence of some clusters observed between sensitive and drug resistant genomes, reflected in the presence of non-synonymous SNPs in RecGwed (3011653A>G) and MutT2 (1286927C>A), respectively. The only genome belonging to lineage 4.6.1.1 was placed within the 4.3 lineage group due to the similarity between these lineages. Through the selection of 16 non-synonymous SNPs in 13 GRDDRs (Fts = 1) (Table 2), it was possible to generate a dendrogram that more clearly defined the sub-lineages comprised by the sample (Figure 2). This indicates that certain SNPs in GRDDR are specific to the sub-lineages and strongly related to their evolutionary development.
Given this observation, both sub-lineage-fixed SNPs and redundant SNPs in the sample were given no further consideration in the comparative analysis between host groups or resistance profiles. It should be noted that 11.7% of the genomes (n = 47) did not present non-synonymous SNPs, other than those related to the sub-lineage.

3.4. Comparison between Drug Sensitive and Drug Resistant Genomes of T2DM and Non-T2DM Hosts

With respect to drug resistance, the distribution of SNPs showed significant differences among groups (Table 3). In the genomes of drug resistant isolates, no non-synonymous SNPs were present in the genes RecR, DinB2, Prim-PolC, RNaseH1, RNaseH2, ImuB, or MutT2, but a high frequency of non-synonymous SNPs was found in RecGwed (p < 0.001) (only present in L4.2.1 and mixed infection), MutY (p < 0.001), and UvrA (p = 0.003). In contrast, drug sensitive TB isolates showed an absence of non-synonymous SNPs in the genes Mpg, SSBa, and Ku, but a high frequency was observed in ImuB (p = 0.028), RNaseH2 (p = 0.028), RNaseH1 (p = 0.046), and MutT2 (p < 0.001) (only present in L4.10 and mixed infection).
In relation to the presence/absence of DMT in the host, significant differences were observed in the distribution of genes with SNPs (Table 3). The genomes of individuals with T2DM presented no non-synonymous SNPs in the genes Fpg2 or Ung, but showed a higher frequency of non-synonymous SNPs in LigB (p = 0.006), ImuA (p = 0.024) (only present in L4.3 and L4.3.3), Prim-PolC (p = 0.049) (only present in L4.4.1.1), MazG (p = 0.049), RecX (p = 0.049) (only present in L4.10), LigD (p = 0.006), RuvB (p = 0.008), and RecG (p = 0.015). The genomes of individuals without T2DM had no non-synonymous SNPs in the genes Prim-PolC, MazG, RecX, RecR, SSBa, and Ku, but showed a higher frequency of non-synonymous SNPs in Cho (p < 0.001), PolA (p < 0.001), Nei2 (p < 0.001), RecGwed (p = 0.013), MutT2 (p = 0.008), and RecO (p = 0.001). This indicates that the occurrence of SNPs in GRDDR is influenced by the presence or absence of T2DM in the host, with a higher presence in isolates from individuals with T2DM.

3.5. Discriminant Analysis

Discriminant function analysis was performed on the 333 non-synonymous SNPs present in the 65 GRDDR genes, assigning values of 0 and 1, respectively, to the absence and presence of non-synonymous SNPs in the gene. Sub-lineage-related SNPs and redundant SNPs in the sample were excluded from the analysis, as well as those genomes that had no SNPs other than those related to sub-lineages. The discriminant function for TB from individuals with or without T2DM showed a moderate but statistically significant discrimination value (eigenvalue = 0.499, canonical correlation = 0.577, Wilks’ lambda = 0.667, df = 63, p = 0.000), with functions at the centroids to differentiate between genomes from TB isolates from hosts without T2DM = 0.600 and genomes from TB isolates from hosts with T2DM = −0.827. This analysis correctly classified 73.6% of the genomes of TB from hosts with T2DM, with a sensitivity of 0.69 and a specificity of 0.80. However, it presented lower accuracy in the cross-validation (64.2% correct classification) and validation through a random sample including 10% of the genomes (64.4% correct classification) (Table 4A). From a selection of 63 Mtb GRDDRs, we can duplicate the model (Table 5A), establishing the presence of SNPs in RecR, LigB, and DnaE1 as the most discriminant genes associated with the presence of T2DM in the host.
On the other hand, the discriminant function for sensitive and drug resistant isolates showed moderate but significant values (eigenvalue = 0.788, Canonical correlation = 0.664, Wilks’ lambda = 0.559, df = 63, p = 0.000), with functions at the centroids to differentiate between genomes from drug sensitive TB = −0.803 and genomes with drug resistant TB = 0.975. This discriminant function correctly classified 74.8% of the resistant genomes with sensitivity and specificity values of 0.82 and 0.81, respectively (Table 4B), while the cross-validation presented a classification that was 64.8% correct and the random sample validation was 68.1% correct. The model generated with a selection of 63 GRDDRs in the drug resistant and drug sensitive isolates shows that the presence of non-synonymous SNPs in the gene RecGwed had the highest discriminant value, followed by MutY, DnaE2, and Mfd (Table 5B). This demonstrates their high frequency in genomes with some level of drug resistance and thus their utility as discriminants.
Finally, it is important to note that the differences observed between the proportions of isolates correctly classified by the discriminant functions and their validations confirms that the sample used was limited, but still generates functional models with which it is possible to determine the presence of pharmacological resistance and T2DM in the host through analysis of these genes.

4. Discussion

Analysis of the polymorphisms in GRDDR in a set of Mtb genomes (lineage 4) allowed the identification of several relationships among the identified SNPs, presence of T2DM in the host, pharmacological resistance and, unexpectedly, the L4 sub-lineages. The results represent the first approach to the study of GRDDRs and evidence an unexpected diversification of SNPs in these Mtb genes, as well as the possible influence of T2DM and DR on their development.
Some of the SNPs identified were highly specific for L4 sub-lineages and could be of further use for classification, while some Mtb genotyping systems refer to the use of certain SNPs from GRDDR, such as: UvrD2 (3570528C>G) [39], MutT3 (500224G>T), MutT4 (4393839C>T), MutY (4031203C>A) [40], DnaE1 (1750465T>C), RuvB (2923264G>A), DinB2 (3416734G>A), and SSBa (58786G>C) [41]. This is the first report of a new panel of 16 non-synonymous SNPs distributed in 13 GRDDRs that allows a correct sub-clustering of L4 sub-linages (Table 2). Although the limited size of the analyzed sample and the low representativeness of some sub-lineages are recognized, the clustering capacity observed with the SNPs present in GRDDR poses new questions about the coevolution of these mutations and the sub-lineages with which they are associated, and raises the possibility of their use as markers for phylogenetic classification.
It has been documented that the genes MutY and UvrA play an important role in DNA repair pathways and that their deficiency sensitizes Mtb to different clastogens [42,43,44]. While MutM (Fpg) and MutY are involved in the cleavage of oxidized guanine and its paired base [20], UvrA (with UvrB) plays an essential role in DNA damage recognition and initiates nucleotide excision repair [17]. On the other hand, RecGwed has a binding affinity for branched DNA structures, mainly in the stationary phase [37], and is involved in drug resistance acquisition pathways of the L4.2.1 [45]. Considering the above, the presence of a more significant number of SNPs in these genes in drug resistant strains could indicate that they result of positive selection induced by anti-TB treatment, which confers advantages when facing the effects of the drugs. On the other hand, the higher frequency of SNPs in the MutT2 (only present in L4.10 and mixed infection), RNaseH1, RNaseH2, and ImuB genes predominantly found in sensitive strains raises several questions regarding their origin and utility.
Differences in the non-synonymous SNPs in the GRDDR observed between individuals with and without T2DM suggest that the presence of T2DM could influence a greater diversity of polymorphisms in the GRDDR in Mtb. This could be explained by the influence of the metabolic and immune system alterations characteristic of patients with T2DM [16,46], reduced plasma concentrations of anti-tuberculosis drugs [8,47], consumption of drugs for glycemic control [9], and the increased formation of cavitary lesions [46], factors that also promote drug resistance [8,46], some of them due to the appearance of mutations generated by the increase of oxidative stress in bacteria [9].
In this sense, the higher frequency of SNPs in LigB ligase and multifunctional LigD ligase, which acts with Ku in the repair of double-strand breaks by non-homologous end joining [48]; Prim-PolC polymerase, which participates in abasic site filling [49]; the accessory protein ImuA, involved in DnaE2-induced mutagenesis [50]; MazG, the protein product of which degrades 5-OH-dCTP to prevent C>T (G>A) mutation when incorporated into DNA [51]; RecX, which regulates the functions of the RecA gene [17], which is fundamental to the SOS response, induced mutagenesis and drug resistance [52] and the RecG and RuvB genes, involved in the resolution of branched DNA structures [53], could explain the increased risk of individuals with the TB-T2DM binomial generating drug resistance or other negative outcomes to anti-tuberculosis treatment. However, further studies are required to determine the specific participation of these or other genes in the process of drug resistance in individuals with the TB-T2DM binomial.
The variations observed between the SNPs of sensitive and resistant isolates from individuals with or without T2DM made it possible to develop two discriminant functions with which to distinguish these characteristics. Besides providing the first description of their use in this context, these functions showed acceptable levels of sensitivity and specificity, which were higher than the random classification values (0.5) in terms of identifying both drug resistant isolates and those from hosts with T2DM. Although there are molecular markers for the diagnosis of drug resistance with better levels of sensitivity and specificity [54], we consider that the discriminant function for drug resistant strains using GRDDR non-synonymous SNPs could be used as a surrogate marker, since it does not use genes commonly related to drug resistance.
We consider that the misclassifications generated by both functions could be influenced by factors related to the time of evolution of the infection, different levels of drug resistance, and clinical control of T2DM in the host. Nevertheless, the presence of non-synonymous SNPs in GRDDR could be used as a tool to distinguish the presence or absence of T2DM in the host, based on Mtb genome analysis. However, further studies are required to confirm the true utility of this proposal.
The main limitation of this study was related to the reduced number of genomes and, consequently, of the members forming the study groups, which acted to generate an unpaired distribution in the sample. This number was related to the limited and non-uniform clinical and epidemiological information of the hosts in the databases. In most cases, information regarding T2DM is limited to presence/absence without incorporating pharmacological treatment for glycemic control, glycemic testing, time elapsed with T2DM condition, and presence of other comorbidities/addictions, which also limited the depth of analysis. Based on the above, incorporation of sufficient clinical and epidemiological information pertaining to the host should be considered a mandatory requirement for the registration of genomes in the repositories. Availability of this information will undoubtedly be key for subsequent studies.

5. Conclusions

The present study is the first analysis of polymorphisms present in GRDDR in the context of drug resistance and T2DM. The results show differences in the distribution of non-synonymous SNPs present in GRDDRs that were related to the resistance profile of the infecting strain and the presence of T2DM in the host. This provides evidence that the environment of a patient with T2DM influences the development of SNPs that could be involved in the development of drug resistance. However, further studies are required in order to confirm this possibility.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes13040609/s1, Table S1: Data of the analyzed sequences, Table S2: Genes related to DNA damage repair in M. Tuberculosis, Table S3: Distribution of non-synonymous SNPs in genes related to DNA damage repair according to drug resistance and the presence/absence of T2DM in the host, Table S4: Distribution of genes with SNPs according to drug resistance and absence/presence of T2DM in the host (continuation).

Author Contributions

Conceptualization, R.Z.-C.; methodology, D.E.P.-M., I.C.-M., C.F.M.-M., I.C. and R.Z.-C.; software, D.E.P.-M., G.A.B.-H., I.C.-M. and C.F.M.-M.; validation, D.E.P.-M., I.C.-M. and C.F.M.-M.; formal analysis, D.E.P.-M.; research, D.E.P.-M.; resources, R.Z.-C.; data curation, I.C.-M., C.F.M.-M. and D.E.P.-M.; original writing-drafting, D.E.P.-M. and R.Z.-C.; writing-revising and editing, I.C.-M., C.F.M.-M., G.A.B.-H., I.C., H.M., R.M.-S. and C.L.-C.; visualization, D.E.P.-M. and R.Z.-C.; supervision, H.M., R.M.-S., C.L.-C., R.Z.-C. and I.C.; fundraising, R.Z.-C. All authors have read and agreed to the published version of the manuscript.

Funding

This project financed with resources from CONACYT-Basic Sciences Fund: Influence of type 2 diabetes mellitus in the development of mutations associated with multidrug resistance in Tuberculosis, A1-S-22956. The doctoral students were granted scholarships by CONACYT, call “Becas Nacional (Tradicional) 2019-02” with registration number (“CVU”) 411155 and 710466.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and was approved on 23 June 2020 by the Research Ethics Committee of the Institute of Health Sciences, Universidad Veracruzana (30CEI0012120180131), with protocol code 010/2020.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article or Supplementary Material.

Acknowledgments

This work was submitted in partial fulfillment of the requirements for Pérez-Martínez D.E. at Programa de Doctorado en Ciencias de la Salud, Universidad Veracruzana.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

  1. World Health Organization (WHO). Global Tuberculosis Report 2020; World Health Organization: Geneva, Switzerland, 2020. [Google Scholar]
  2. Jeon, C.Y.; Murray, M.B. Diabetes Mellitus Increases the Risk of Active Tuberculosis: A Systematic Review of 13 Observational Studies. PLoS Med. 2008, 5, e152. [Google Scholar]
  3. Ponce-De-Leon, A.; Garcia-Garcia, M.D.L.; Garcia-Sancho, M.C.; Gomez-Perez, F.J.; Valdespino-Gomez, J.L.; Olaiz-Fernandez, G.; Rojas, R.; Ferreyra-Reyes, L.; Cano-Arellano, B.; Bobadilla, M.; et al. Tuberculosis and Diabetes in Southern Mexico. Diabetes Care 2004, 27, 1584–1590. [Google Scholar] [CrossRef] [Green Version]
  4. Baker, M.A.; Harries, A.D.; Jeon, C.Y.; Hart, J.E.; Kapur, A.; Lönnroth, K.; Ottmani, S.-E.; Goonesekera, S.D.; Murray, M.B. The impact of diabetes on tuberculosis treatment outcomes: A systematic review. BMC Med. 2011, 9, 81. [Google Scholar] [CrossRef] [Green Version]
  5. Xu, Y.; Cancino-Muñoz, I.; Torres-Puente, M.; Villamayor, L.M.; Borrás, R.; Borrás-Máñez, M.; Bosque, M.; Camarena, J.J.; Colomer-Roig, E.; Colomina, J.; et al. High-resolution mapping of tuberculosis transmission: Whole genome sequencing and phylogenetic modelling of a cohort from Valencia Region, Spain. PLoS Med. 2019, 16, e1002961. [Google Scholar] [CrossRef] [Green Version]
  6. Pérez-Navarro, L.M.; Fuentes-Domínguez, F.; Morales-Romero, J.; Zenteno-Cuevas, R. Factors associated to pulmonary tuberculosis in patients with diabetes mellitus from Veracruz, México. Gac. Med. Mex. 2011, 147, 219–225. [Google Scholar] [PubMed]
  7. Pérez-Navarro, L.M.; Fuentes-Domínguez, F.J.; Zenteno-Cuevas, R. Type 2 diabetes mellitus and its influence in the development of multidrug resistance tuberculosis in patients from southeastern Mexico. J. Diabetes Complicat. 2015, 29, 77–82. [Google Scholar] [CrossRef] [PubMed]
  8. Babalik, A.; Ulus, I.H.; Bakirci, N.; Kuyucu, T.; Arpag, H.; Dagyildizi, L.; Capaner, E. Plasma Concentrations of Isoniazid and Rifampin Are Decreased in Adult Pulmonary Tuberculosis Patients with Diabetes Mellitus. Antimicrob. Agents Chemother. 2013, 57, 5740–5742. [Google Scholar] [CrossRef] [Green Version]
  9. Singhal, A.; Jie, L.; Kumar, P.; Hong, G.S.; Leow, M.K.-S.; Paleja, B.; Tsenova, L.; Kurepina, N.; Chen, J.; Zolezzi, F.; et al. Metformin as adjunct antituberculosis therapy. Sci. Transl. Med. 2014, 6, 263ra159. [Google Scholar] [CrossRef] [PubMed]
  10. Jafar, N.; Edriss, H.; Nugent, K. The Effect of Short-Term Hyperglycemia on the Innate Immune System. Am. J. Med. Sci. 2016, 351, 201–211. [Google Scholar] [CrossRef]
  11. Pal, R.; Ansari, M.A.; Hameed, S.; Fatima, Z. Diabetes Mellitus as Hub for Tuberculosis Infection: A Snapshot. Int. J. Chronic Dis. 2016, 2016, 5981574. [Google Scholar] [CrossRef] [Green Version]
  12. Kumari, P.; Meena, L.S. Factors Affecting Susceptibility to Mycobacterium tuberculosis: A Close View of Immunological Defence Mechanism. Appl. Biochem. Biotechnol. 2014, 174, 2663–2673. [Google Scholar] [CrossRef] [PubMed]
  13. Mehrotra, P.; Jamwal, S.V.; Saquib, N.; Sinha, N.; Siddiqui, Z.; Manivel, V.; Chatterjee, S.; Rao, K.V.S. Pathogenicity of Mycobacterium tuberculosis Is Expressed by Regulating Metabolic Thresholds of the Host Macrophage. PLoS Pathog. 2014, 10, e1004265. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Godfroid, M.; Dagan, T.; Merker, M.; Kohl, T.A.; Diel, R.; Maurer, F.P.; Niemann, S.; Kupczok, A. Insertion and deletion evolution reflects antibiotics selection pressure in a Mycobacterium tuberculosis outbreak. PLoS Pathog. 2020, 16, e1008357. [Google Scholar] [CrossRef] [PubMed]
  15. Liu, Q.; Wei, J.; Li, Y.; Wang, M.; Su, J.; Lu, Y.; López, M.G.; Qian, X.; Zhu, Z.; Wang, H.; et al. Mycobacterium tuberculosis clinical isolates carry mutational signatures of host immune environments. Sci. Adv. 2020, 6, eaba4901. [Google Scholar] [CrossRef] [PubMed]
  16. Nimmo, C.; Brien, K.; Millard, J.; Grant, A.D.; Padayatchi, N.; Pym, A.S.; O’Donnell, M.; Goldstein, R.; Breuer, J.; Balloux, F. Dynamics of within-host Mycobacterium tuberculosis diversity and heteroresistance during treatment. EBioMedicine 2020, 55, 102747. [Google Scholar] [CrossRef] [PubMed]
  17. Singh, A. Guardians of the mycobacterial genome: A review on DNA repair systems in Mycobacterium tuberculosis. Microbiology 2017, 163, 1740–1758. [Google Scholar] [CrossRef]
  18. Chopra, I.; O’Neill, A.J.; Miller, K. The role of mutators in the emergence of antibiotic-resistant bacteria. Drug Resist. Updat. 2003, 6, 137–145. [Google Scholar] [CrossRef]
  19. Ebrahimi-Rad, M.; Bifani, P.; Martin, C.; Kremer, K.; Samper, S.; Rauzier, J.; Kreiswirth, B.; Blazquez, J.; Jouan, M.; van Soolingen, D.; et al. Mutations in Putative Mutator Genes of Mycobacterium tuberculosis Strains of the W-Beijing Family. Emerg. Infect. Dis. 2003, 9, 838–845. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Hassim, F.; Papadopoulos, A.O.; Kana, B.; Gordhan, B.G. A combinatorial role for MutY and Fpg DNA glycosylases in mutation avoidance in Mycobacterium smegmatis. Mutat. Res. Mol. Mech. Mutagen. 2015, 779, 24–32. [Google Scholar] [CrossRef]
  21. Castañeda-García, A.; Martín-Blecua, I.; Cebrián-Sastre, E.; Chiner-Oms, A.; Torres-Puente, M.; Comas, I.; Blázquez, J. Specificity and mutagenesis bias of the mycobacterial alternative mismatch repair analyzed by mutation accumulation studies. Sci Adv. 2020, 6, eaay4453. [Google Scholar] [CrossRef] [Green Version]
  22. Sander, P.; Papavinasasundaram, K.G.; Dick, T.; Stavropoulos, E.; Ellrott, K.; Springer, B.; Colston, M.J.; Böttger, E.C. Mycobacterium bovis BCG recA Deletion Mutant Shows Increased Susceptibility to DNA-Damaging Agents but Wild-Type Survival in a Mouse Infection Model. Infect Immun. 2001, 69, 3562–3568. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Stucki, D.; Brites, D.; Jeljeli, L.; Coscolla, M.; Liu, Q.; Trauner, A.; Fenner, L.; Rutaihwa, L.; Borrell, S.; Luo, T.; et al. Mycobacterium tuberculosis lineage 4 comprises globally distributed and geographically restricted sublineages. Nat. Genet. 2016, 48, 1535–1543. [Google Scholar] [CrossRef] [PubMed]
  24. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef] [PubMed]
  25. Goig, G.A.; Blanco, S.; Garcia-Basteiro, A.L.; Comas, I. Contaminant DNA in bacteriARal sequencing experiments is a major source of false genetic variability. BMC Biol. 2020, 18, 24. [Google Scholar] [CrossRef] [PubMed]
  26. Wood, D.E.; Salzberg, S.L. Kraken: Ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014, 15, R46. [Google Scholar] [CrossRef] [Green Version]
  27. Picard Tools—By Broad Institute. [Internet]. Available online: https://broadinstitute.github.io/picard/ (accessed on 5 June 2021).
  28. Comas, I.; Chakravartti, J.; Small, P.; Galagan, J.; Nieman, S.; Kremer, K.; Ernst, J.D.; Gagneux, S. Human T cell epitopes of Mycobacterium tuberculosis are evolutionarily hyperconserved. Nat. Genet. 2010, 42, 498–503. [Google Scholar] [CrossRef] [Green Version]
  29. Comas, I.; Coscolla, M.; Luo, T.; Borrell, S.; Holt, K.; Kato-Maeda, M.; Parkhill, J.; Malla, B.; Berg, S.; Thwaites, G.; et al. Out-of-Africa migration and Neolithic coexpansion of Mycobacterium tuberculosis with modern humans. Nat. Genet. 2013, 45, 1176–1182. [Google Scholar] [CrossRef] [PubMed]
  30. Nguyen, L.T.; Schmidt, H.A.; Von Haeseler, A.; Minh, B.Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef] [PubMed]
  31. Letunic, I.; Bork, P. Interactive Tree of Life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021, 49, W293–W296. [Google Scholar] [CrossRef]
  32. Rousset, F. Genepop’007: A complete re-implementation of the genepop software for Windows and Linux. Mol. Ecol. Resour. 2008, 8, 103–106. [Google Scholar] [CrossRef] [PubMed]
  33. IBM Corp. IBM SPSS Statistics for Windows; IBM Corp: New York, NY, USA, 2012. [Google Scholar]
  34. Arif, S.M.; Patil, A.G.; Varshney, U.; Vijayan, M. Biochemical and structural studies of Mycobacterium smegmatis MutT1, a sanitization enzyme with unusual modes of association. Acta Cryst. Sect. D Struct. Biol. 2017, 73, 349–364. [Google Scholar] [CrossRef] [PubMed]
  35. Verhoeven, E.E.A. The presence of two UvrB subunits in the UvrAB complex ensures damage detection in both DNA strands. EMBO J. 2002, 21, 4196–4205. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Moolla, N.; Goosens, V.J.; Kana, B.D.; Gordhan, B.G. The contribution of Nth and Nei DNA glycosylases to mutagenesis in Mycobacterium smegmatis. DNA Repair 2014, 13, 32–41. [Google Scholar] [CrossRef] [PubMed]
  37. Singh, A.; Vijayan, M.; Nagaraju, G. RecG wed: A probable novel regulator in the resolution of branched DNA structures in mycobacteria. IUBMB Life 2018, 70, 786–794. [Google Scholar] [CrossRef] [Green Version]
  38. Boshoff, H.I.M.; Reed, M.B.; Barry, C.E.; Mizrahi, V. DnaE2 Polymerase Contributes to In Vivo Survival and the Emergence of Drug Resistance in Mycobacterium tuberculosis. Cell 2003, 113, 183–193. [Google Scholar] [CrossRef] [Green Version]
  39. Coll, F.; McNerney, R.; Guerra-Assunção, J.A.; Glynn, J.R.; Perdigão, J.; Viveiros, M.; Portugal, I.; Pain, A.; Martin, N.; Clark, T.G. A robust SNP barcode for typing Mycobacterium tuberculosis complex strains. Nat. Commun. 2014, 5, 4812. [Google Scholar] [CrossRef] [Green Version]
  40. Comas, I.; Homolka, S.; Niemann, S.; Gagneux, S. Genotyping of genetically monomorphic bacteria: DNA sequencing in Mycobacterium tuberculosis highlights the limitations of current methodologies. PLoS ONE 2009, 4, e7815. [Google Scholar] [CrossRef] [Green Version]
  41. Napier, G.; Campino, S.; Merid, Y.; Abebe, M.; Woldeamanuel, Y.; Aseffa, A.; Hibberd, M.L.; Phelan, J.; Clark, T.G. Robust barcoding and identification of Mycobacterium tuberculosis lineages for epidemiological and clinical studies. Genome Med. 2020, 12, 114. [Google Scholar] [CrossRef] [PubMed]
  42. Kurthkoti, K.; Srinath, T.; Kumar, P.; Malshetty, V.S.; Sang, P.B.; Jain, R.; Manjunath, R.; Varshney, U. A distinct physiological role of MutY in mutation prevention in mycobacteria. Microbiology 2010, 156, 88–93. [Google Scholar] [CrossRef] [Green Version]
  43. Kurthkoti, K.; Kumar, P.; Jain, R.; Varshney, U. Important role of the nucleotide excision repair pathway in Mycobacterium smegmatis in conferring protection against commonly encountered DNA-damaging agents. Microbiology 2008, 154, 2776–2785. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Cordone, A.; Audrain, B.; Calabrese, I.; Euphrasie, D.; Reyrat, J.-M. Characterization of a Mycobacterium smegmatis uvrA mutant impaired in dormancy induced by hypoxia and low carbon concentration. BMC Microbiol. 2011, 11, 231. [Google Scholar] [CrossRef] [Green Version]
  45. Muzondiwa, D.; Hlanze, H.; Reva, O. The Epistatic Landscape of Antibiotic Resistance of Different Clades of Mycobacterium tuberculosis. Antibiotics 2021, 10, 857. [Google Scholar] [CrossRef]
  46. Chao, W.-C.; Yen, C.-L.; Wu, C.H.; Shieh, C.-C. How mycobacteria take advantage of the weakness in human immune system in the modern world. J. Microbiol. Immunol. Infect. 2020, 53, 209–215. [Google Scholar] [CrossRef]
  47. Ruslami, R.; Nijland, H.M.; Adhiarta IG, N.; Kariadi, S.H.; Alisjahbana, B.; Aarnoutse, R.E.; van Crevel, R. Pharmacokinetics of Antituberculosis Drugs in Pulmonary Tuberculosis Patients with Type 2 Diabetes. Antimicrob. Agents Chemother. 2010, 54, 1068–1074. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Pitcher, R.S.; Brissett, N.C.; Picher, A.J.; Andrade, P.; Juarez, R.; Thompson, D.; Fox, G.C.; Blanco, L.; Doherty, A.J. Structure and Function of a Mycobacterial NHEJ DNA Repair Polymerase. J. Mol. Biol. 2007, 366, 391–405. [Google Scholar] [CrossRef] [Green Version]
  49. Płociński, P.; Brissett, N.C.; Bianchi, J.; Brzostek, A.; Korycka-Machała, M.; Dziembowski, A.; Dziadek, J.; Doherty, A.J. DNA Ligase C and Prim-PolC participate in base excision repair in mycobacteria. Nat. Commun. 2017, 8, 1251. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Warner, D.F.; Ndwandwe, D.E.; Abrahams, G.L.; Kana, B.D.; Machowski, E.E.; Venclovas, Č.; Mizrahi, V. Essential roles for imuA’- and imuB-encoded accessory factors in DnaE2-dependent mutagenesis in Mycobacterium tuberculosis. Proc. Natl. Acad. Sci. USA 2010, 107, 13093–13098. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  51. Lyu, L.D.; Tang, B.K.; Fan, X.Y.; Ma, H.; Zhao, G.P. Mycobacterial MazG Safeguards Genetic Stability via Housecleaning of 5-OH-dCTP. PLoS Pathog. 2013, 9, e1003814. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Wipperman, M.; Heaton, B.; Nautiyal, A.; Adefisayo, O.; Evans, H.; Gupta, R.; Ditmarsch, D.; Soni, R.; Hendrickson, R.; Johnson, J.; et al. Mycobacterial Mutagenesis and Drug Resistance Are Controlled by Phosphorylation- and Cardiolipin-Mediated Inhibition of the RecA Coprotease. Mol. Cell 2018, 72, 152–161.e7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Thakur, R.S.; Basavaraju, S.; Khanduja, J.S.; Muniyappa, K.; Nagaraju, G. Mycobacterium tuberculosis RecG Protein but Not RuvAB or RecA Protein Is Efficient at Remodeling the Stalled Replication Forks: Implications for multiple mechanisms of replication restart in mycobacteria. J. Biol. Chem. 2015, 290, 24119–24139. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. World Health Organization (WHO). Catalogue of Mutations in Mycobacterium Tuberculosis Complex and Their Association with Drug Resistance; World Health Organization: Geneva, Switzerland, 2021. [Google Scholar]
Figure 1. Dendrogram generated using 534 non-synonymous SNPs in genes related to DNA damage repair.
Figure 1. Dendrogram generated using 534 non-synonymous SNPs in genes related to DNA damage repair.
Genes 13 00609 g001
Figure 2. Sub-lineage clustering using 16 non-synonymous SNPs in 13 genes related to DNA damage repair.
Figure 2. Sub-lineage clustering using 16 non-synonymous SNPs in 13 genes related to DNA damage repair.
Genes 13 00609 g002
Table 1. Sociodemographic and epidemiological characteristics of the individuals comprising the group of genomes.
Table 1. Sociodemographic and epidemiological characteristics of the individuals comprising the group of genomes.
Host without T2DMTotal
without T2DM
(n = 224)
n (%)
Host with T2DMTotal
with T2DM
(n = 175)
n (%)
Total
Sensitive Isolate
(n = 223)
n (%)
Total
Drug Resistant Isolate
(n = 176)
n (%)
Total
(n = 399)
n (%)
Sensitive Isolate
(n = 123)
n (%)
Drug Resistant Isolate
(n = 101)
n (%)
Sensitive Isolate
(n = 100)
n (%)
Drug Resistant Isolate
(n = 75)
n (%)
Sex *,†
 Female40 (32.5)34 (33.7)74 (33.0)38 (50.0)30 (44.1)68 (47.2)78 (39.2)64 (37.9)142 (38.6)
 Male83 (67.5)67 (66.3)150 (67.0)38 (50.0)38 (55.9)76 (52.8)121 (60.8)105 (62.1)226 (61.4)
Age *,†
 Mean
(±SD)
38.5
(±15.6)
36.8
(±14.3)
37.7
(±15.0)
50.2
(±11.8)
50.4
(±11.8)
50.3
(±11.8)
42.8
(±15.3)
42.2
(±14.9)
42.5
(±15.1)
Country †
 Georgia90 (73.2)37 (36.6)127 (56.7)0 (0.0)7 (9.3)7 (4.0)90 (40.4)44 (25.0)134 (33.6)
 Moldova19 (15.4)27 (26.7)46 (20.5)3 (3.0)6 (8.0)9 (5.1)22 (9.9)33 (18.8)55 (13.8)
 Indonesia0 (0.0)0 (0.0)0 (0.0)42 (42.0)9 (12.0)51 (29.1)42 (18.8)9 (5.1)51 (12.8)
 Mexico7 (5.7)11 (10.9)18 (8.0)6 (6.0)23 (30.7)29 (16.6)13 (5.8)34 (19.3)47 (11.8)
 Spain0 (0.0)0 (0.0)0 (0.0)27 (27.0)8 (10.7)35 (20.0)27 (12.1)8 (4.5)35 (8.8)
 Others a7 (5.7)26 (25.7)33 (14.7)22 (22.0)22 (29.3)44 (25.1)29 (13.0)48 (27.3)77 (19.3)
Lineage †
 4.148 (39.0)24 (23.8)72 (32.1)19 (19.0)10 (13.3)29 (16.6)67 (30.0)34 (19.3)101 (25.3)
 4.1.2.124 (19.5)12 (11.9)36 (16.1)27 (27.0)18 (24.0)45 (25.7)51 (22.9)30 (17.0)81 (20.3)
 4.3.331 (25.2)22 (21.8)53 (23.7)8 (8.0)10 (13.3)18 (10.3)39 (17.5)32 (18.2)71 (17.8)
 4.2.111 (8.9)29 (28.7)40 (17.9)0 (0.0)7 (9.3)7 (4.0)11 (4.9)36 (20.5)47 (11.8)
 4.1.1.30 (0.0)2 (2.0)2 (0.9)1 (1.0)10 (13.3)11 (6.3)1 (0.4)12 (6.8)13 (3.3)
 L4 Mix-inf1 (0.8)3 (3.0)4 (1.8)4 (4.0)1 (1.3)5 (2.9)5 (2.2)4 (2.3)9 (2.3)
 Others b8 (6.5)9 (8.9)17 (7.6)41 (41.0)19 (25.3)60 (34.3)49 (22.0)28 (15.9)77 (19.3)
Resistance profile
 Sensitive123 (100.0)-123 (54.9)100 (100.0)-100 (57.1)223 (100.0)-223 (55.9)
 Mono-resistant-21 (20.8)21 (9.4)-20 (26.7)20 (11.4)-41 (23.3)41 (10.3)
 Poly-resistant-11 (10.9)11 (4.9)-13 (17.3)13 (7.4)-24 (13.6)24 (6.0)
 MDR-20 (19.8)20 (8.9)-18 (24.0)18 (10.3)-38 (21.6)38 (9.5)
 Pre-XDR-31 (30.7)31 (13.8)-17 (22.7)17 (9.7)-48 (27.3)48 (12.0)
 XDR-18 (17.8)18 (8.0)-7 (9.3)7 (4.0)-25 (14.2)25 (6.3)
T2DM: Type 2 diabetes mellitus. MDR: Multidrug Resistant. XDR: Extensively Drug Resistant. Mix-inf: Mixed infection. * Patients without information are excluded. † Statistically significant difference between hosts (chi square test, p < 0.05). a Other countries: Peru, Romania, Belarus, Azerbaijan, and Kazakhstan. b Other lineages: lineage 4.3.1, 4.3.4.2, 4.4.1.1, 4.3.2, 4.1.2, 4.3.4.1, 4.1.1, 4, 4.4.1.2, 4.5, 4.1.1.1, 4.3, 4.2.2, and 4.6.1.1.
Table 2. Nonsynonymous SNPs in DNA damage repair-related genes used to classify sub-lineages.
Table 2. Nonsynonymous SNPs in DNA damage repair-related genes used to classify sub-lineages.
GeneGene FunctionSiteNucleotide ChangeSub-Lineage
RecCParticipates in the single-strand annealing pathway [17]726703G>T4.3, 4.3.3, 4.3.1, 4.3.4.1, and 4.3.4.2
726816C>G4.1.1, 4.1.1.1, and 4.1.1.3
LigDDNA ligase [17]1047165C>T4.10
1047683G>T4.3.2
MfdRecognizes transcription problems and recruits UvrABC [17]1139102G>A4.1.1.1
Ogt/adaBRepairs alkylated guanine in DNA [34]1477588C>G4.1.2.1
DinB1DNA polymerase [17]1740771A>C4.3, 4.3.3, 4.3.1, 4.3.4.1, and 4.3.4.2
UvrBParticipates in recognition of DNA damage (with UvrA) and initiates nucleotide excision repair [35]1838153G>A4.3.2
Nei1Excises oxidized pyrimidines [36]2767631G>A4.4.1.1
RecGwedBinds to branched DNA structures [37]3011692T>G4.1.2.1
DutInvolved in Nucleotide Pool Sanitization [17]3013784G>C4.1.2.1
LigBDNA ligase [17]3426025G>A4.5
AdnBInvolved in repair by homologous recombination [17]3574504C>T4.2.1 and 4.2.2
3575106T>C4.4.1.1
DnaE2Error-prone DNA polymerase [38]3781574C>G4.4.1.2
LigCDNA ligase [17]4182695A>G4.10
AlkAExhibits methyltransferase activity [17]1479085G>APresent in >99% of the sample
AdnAWith AdnB, initiates DNA double-strand break repair by RecA-dependent homologous recombination [17]3577958C>TPresent in >99% of the sample
ImuAEncodes a DnaE2 accessory protein [17]3811629T>CPresent in >99% of the sample
Table 3. Distribution of genes with SNPs according to drug resistance and absence/presence of T2DM in the host.
Table 3. Distribution of genes with SNPs according to drug resistance and absence/presence of T2DM in the host.
GeneHost without T2DMTotal
without T2DM
(n = 224)
n (%)
Host with T2DMTotal
with T2DM
(n = 175)
n (%)
Total
Sensitive Isolate
(n = 223)
n (%)
Total
Drug-Resistant Isolate
(n = 176)
n (%)
Total
(n = 399)
n (%)
Exclusive Sub-Linage Diversification
Sensitive Isolate
(n = 123)
n (%)
Drug Resistant Isolate
(n = 101)
n (%)
Sensitive Isolate
(n = 100)
n (%)
Drug Resistant Isolate
(n = 75)
n (%)
LigD •5 (4.1)2 (2.0)7 (3.1)12 (12.0)5 (6.7)* 17 (9.7)17 (7.6)7 (4.0)24 (6.0)No
Mfd •2 (1.6)4 (4.0)6 (2.7)4 (4.0)4 (5.3)8 (4.6)6 (2.7)8 (4.5)14 (3.5)No
MazG0 (0.0)0 (0.0)0 (0.0)2 (2.0)1 (1.3)* 3 (1.7)2 (0.9)1 (0.6)3 (0.8)No
MutT217 (13.8)0 (0.0)* 17 (7.6)3 (3.0)0 (0.0)3 (1.7)** 20 (9.0)0 (0.0)20 (5.0)4.10 and Mix-inf
DnaE14 (3.3)1 (1.0)5 (2.2)4 (4.0)5 (6.7)9 (5.1)8 (3.6)6 (3.4)14 (3.5)No
PolA32 (26.0)29 (28.7)* 61 (27.2)5 (5.0)8 (10.7)13 (7.4)37 (16.6)37 (21)74 (18.5)No
UvrA4 (3.3)15 (14.9)19 (8.5)6 (6.0)7 (9.3)13 (7.4)10 (4.5)** 22 (12.5)32 (8.0)No
Cho41 (33.3)22 (21.8)* 63 (28.1)5 (5.0)9 (12.0)14 (8.0)46 (20.6)31 (17.6)77 (19.3)No
RNaseH14 (3.3)0 (0.0)4 (1.8)1 (1.0)0 (0.0)1 (0.6)** 5 (2.2)0 (0.0)5 (1.3)No
RecO7 (5.7)9 (8.9)* 16 (7.1)1 (1.0)0 (0.0)1 (0.6)8 (3.6)9 (5.1)17 (4.3)No
RuvB1 (0.8)2 (2.0)3 (1.3)8 (8.0)3 (4.0)* 11 (6.3)9 (4.0)5 (2.8)14 (3.5)No
RecGwed •0 (0.0)24 (23.8)* 24 (10.7)1 (1.0)6 (8.0)7 (4.0)1 (0.4)** 30 (17)31 (7.8)4.2.1 and Mix-inf
RecX0 (0.0)0 (0.0)0 (0.0)2 (2.0)1 (1.3)* 3 (1.7)2 (0.9)1 (0.6)3 (0.8)4.10.
RNaseH24 (3.3)0 (0.0)4 (1.8)2 (2.0)0 (0.0)2 (1.1)** 6 (2.7)0 (0.0)6 (1.5)No
RecG2 (1.6)1 (1.0)3 (1.3)8 (8.0)2 (2.7)* 10 (5.7)10 (4.5)3 (1.7)13 (3.3)No
LigB •1 (0.8)0 (0.0)1 (0.4)6 (6.0)2 (2.7)* 8 (4.6)7 (3.1)2 (1.1)9 (2.3)No
Nei228 (22.8)22 (21.8)* 50 (22.3)4 (4.0)8 (10.7)12 (6.9)32 (14.3)30 (17)62 (15.5)No
DnaE2 •15 (12.2)21 (20.8)36 (16.1)20 (20.0)17 (22.7)37 (21.1)35 (15.7)38 (21.6)73 (18.3)No
ImuB2 (1.6)0 (0.0)2 (0.9)4 (4.0)0 (0.0)4 (2.3)** 6 (2.7)0 (0.0)6 (1.5)No
ImuA •1 (0.8)0 (0.0)1 (0.4)4 (4.0)2 (2.7)* 6 (3.4)5 (2.2)2 (1.1)7 (1.8)4.3 and 4.3.3 (site 3811327)
MutY0 (0.0)17 (16.8)17 (7.6)2 (2.0)6 (8.0)8 (4.6)2 (0.9)** 23 (13.1)25 (6.3)No
RecR0 (0.0)0 (0.0)0 (0.0)2 (2.0)0 (0.0)2 (1.1)2 (0.9)0 (0.0)2 (0.5)No
Prim-PolC0 (0.0)0 (0.0)0 (0.0)3 (3.0)0 (0.0)* 3 (1.7)3 (1.3)0 (0.0)3 (0.8)4.4.1.1
T2DM: Type 2 diabetes mellitus. Only genes with significant differences between groups are presented. • Lineage-related SNPs and SNPs > 99% of the sample are excluded. * Statistically significant difference between hosts with/without T2DM (chi-square test, p < 0.05). ** Statistically significant difference between sensitive and drug-resistant strains (chi-square test, p < 0.05). Genes not shown in the table are described in Supplementary Table S4.
Table 4. Discriminant analysis to predict diversification associated with the presence of T2DM in the host and drug resistance according to SNPs in genes related to DNA damage repair.
Table 4. Discriminant analysis to predict diversification associated with the presence of T2DM in the host and drug resistance according to SNPs in genes related to DNA damage repair.
Predicted GroupTotalClassified CorrectlySensitivitySpecificityPositive Predictive ValueNegative Predictive Value
GroupHost with T2DM
n (%)
Host without T2DM
n (%)
(A) Original
Host with T2DM109 (73.6)39 (26.4)14875.3%0.690.8074%76%
Host without T2DM48 (26.5)156 (69.6)204
Cross-validation *
Host with T2DM94 (63.5)54 (36.5)14864.2%0.570.7164%65%
Host without T2DM72 (35.3)132 (64.7)204
Random sample validation (10%) **---64.4%----
(B) OriginalDrug-resistant
n (%)
Sensitive
n (%)
Total
Drug-resistant119 (74.8)40 (25.2)15981.3%0.820.8175%87%
Sensitive26 (13.5)167 (86.5)193
Cross-validation *
Drug-resistant98 (61.6)61 (38.4)15964.8%0.610.6862%67%
Sensitive63 (32.6)130 (67.4)193
Random sample validation (10%) **---68.1%----
T2DM: Type 2 diabetes mellitus. * In cross-validation, each case is classified using the functions derived from the rest of the cases. ** 10% of the sample is excluded from the function calculation and is classified using the functions derived from the rest of the cases; the average obtained from 10 repetitions is presented. The standardized coefficients for both discriminant functions are detailed in Table 5.
Table 5. Standardized coefficients of the canonical discriminant function for hosts with/without T2DM and drug-resistant TB. (A) Negative values indicate the presence of T2DM in the host and positive values indicate the absence of T2DM in the host. (B) Positive values indicate drug resistance and negative values indicate drug-susceptible TB.
Table 5. Standardized coefficients of the canonical discriminant function for hosts with/without T2DM and drug-resistant TB. (A) Negative values indicate the presence of T2DM in the host and positive values indicate the absence of T2DM in the host. (B) Positive values indicate drug resistance and negative values indicate drug-susceptible TB.
(A) Discriminant Function Coefficients for Hosts with/without T2DM(B) Discriminant Function Coefficients for Drug-Resistant and Sensitive TB
GeneCoefficientGeneCoefficientGeneCoefficientGeneCoefficientGeneCoefficientGeneCoefficient
RecR−0.422UvrD2−0.052MutT40.076RecGwed0.777RuvB0.114RadA−0.055
LigB−0.414RNaseH2−0.051DinB10.08MutY0.609End (Nfo)0.074AdnB−0.065
DnaE1−0.243RuvA−0.043AdnA0.093DnaE20.329UvrD20.072RecN−0.072
LigD−0.237MutT1−0.043Nei10.1Mfd0.305RecA0.068LigA−0.077
MazG−0.233PolD2−0.038TagA0.103XthA0.3MutT10.031RecX−0.089
ImuA−0.203ImuB−0.034Rv21190.103RecF0.291UvrC0.031RecC−0.097
End (Nfo)−0.194SSBb−0.032UdgB0.104UvrB0.249PolA0.028RuvA−0.098
SSBa−0.173XthA−0.03RecD0.111TagA0.24SSBb0.026MutT3−0.099
Cho−0.164Mfd−0.009Ogt/adaB0.12Mpg0.222Ung0.021Fpg2−0.11
MutT3−0.151RecC0.001Nth0.129NucS0.221AdnA0.017RecG−0.137
RecX−0.144RecN0.006UvrD10.134AlkA0.216UvrA0.013Nth−0.157
MutY−0.133Mpg0.009Ung0.137Nei20.203LigD0.011UvrD1−0.162
AlkA−0.129DnaE20.01DinB20.144RecD0.195RuvC0.004Nei1−0.17
Prim-PolC−0.113NucS0.02LigA0.178RecO0.166MutT4−0.001RNaseH2−0.173
UvrB−0.111AdnB0.025Fpg20.181Ku0.157DinB1−0.001MutT2−0.175
Ku−0.108RNaseH10.026RecB0.259SSBa0.154DnaE1−0.004Cho−0.181
MutM (Fpg)−0.096UvrA0.032Nei20.298LigC0.151MazG−0.01Prim−PolC−0.186
RuvB−0.094RuvC0.037RecGwed0.363UdgB0.141ImuA−0.025ImuB−0.195
UvrC−0.081RecF0.057RecO0.391MutM (Fpg)0.134Rv2119−0.033RNaseH1−0.206
RecA−0.071LigC0.058PolA0.501PolD20.133Ogt/adaB−0.044RecB−0.236
RecG−0.053RadA0.062MutT20.549RecR0.116DinB2−0.047LigB−0.315
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Pérez-Martínez, D.E.; Bermúdez-Hernández, G.A.; Madrazo-Moya, C.F.; Cancino-Muñoz, I.; Montero, H.; Licona-Cassani, C.; Muñiz-Salazar, R.; Comas, I.; Zenteno-Cuevas, R. SNPs in Genes Related to DNA Damage Repair in Mycobacterium Tuberculosis: Their Association with Type 2 Diabetes Mellitus and Drug Resistance. Genes 2022, 13, 609. https://doi.org/10.3390/genes13040609

AMA Style

Pérez-Martínez DE, Bermúdez-Hernández GA, Madrazo-Moya CF, Cancino-Muñoz I, Montero H, Licona-Cassani C, Muñiz-Salazar R, Comas I, Zenteno-Cuevas R. SNPs in Genes Related to DNA Damage Repair in Mycobacterium Tuberculosis: Their Association with Type 2 Diabetes Mellitus and Drug Resistance. Genes. 2022; 13(4):609. https://doi.org/10.3390/genes13040609

Chicago/Turabian Style

Pérez-Martínez, Damián E., Gustavo A. Bermúdez-Hernández, Carlos F. Madrazo-Moya, Irving Cancino-Muñoz, Hilda Montero, Cuauhtemoc Licona-Cassani, Raquel Muñiz-Salazar, Iñaki Comas, and Roberto Zenteno-Cuevas. 2022. "SNPs in Genes Related to DNA Damage Repair in Mycobacterium Tuberculosis: Their Association with Type 2 Diabetes Mellitus and Drug Resistance" Genes 13, no. 4: 609. https://doi.org/10.3390/genes13040609

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop