Next Article in Journal
Is Toxoplasma gondii Infection Associated with Sexual Promiscuity? A Cross-Sectional Study
Previous Article in Journal
The Persistence of Bacterial Pathogens in Surface Water and Its Impact on Global Food Safety
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Mycobacterium intracellulare subsp. chimaera from Cardio Surgery Heating-Cooling Units and from Clinical Samples in Israel Are Genetically Unrelated

1
National Public Health Laboratory, Ministry of Health, Tel Aviv-Yafo 6810416, Israel
2
Government Central Laboratories, Ministry of Health, Jerusalem 9134302, Israel
3
National Center for Infection Control, Ministry of Health, Tel Aviv-Yafo 6423906, Israel
4
Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv-Yafo 6997801, Israel
5
National Mycobacterium Reference Centre, Tel Aviv-Yafo 6810416, Israel
*
Author to whom correspondence should be addressed.
Pathogens 2021, 10(11), 1392; https://doi.org/10.3390/pathogens10111392
Submission received: 8 August 2021 / Revised: 20 October 2021 / Accepted: 20 October 2021 / Published: 27 October 2021
(This article belongs to the Section Emerging Pathogens)

Abstract

:
Non-tuberculous mycobacteria (NTM) are opportunistic pathogens that cause illness primarily in the elderly, in the immunocompromised or in patients with underlying lung disease. Since 2013, a global outbreak of NTM infection related to heater-cooler units (HCU) used in cardio-thoracic surgery has been identified. This outbreak was caused by a single strain of Mycobacterium intracellulare subsp. chimaera. In order to estimate the prevalence of this outbreak strain in Israel, we sampled Mycobacterium intracellulare subsp. chimaera from several HCU machines in Israel, as well as from patients, sequenced their genomes and compared them to the outbreak strain. The presence of mixed mycobacteria species in the samples complicated the analysis of obtained sequences. By applying a metagenomic binning strategy, we were able to obtain, and characterize, genomes of single strains from the mixed samples. Mycobacterium intracellulare subsp. chimaera strains were compared to each other and to previously reported genomes from other countries. The strain causing the outbreak related to the HCU machines was identified in several such machines in Israel but not in any clinical sample.

1. Introduction

Non-tuberculous mycobacteria (NTM) are ubiquitous environmental bacteria found primarily in soil and water. They are considered opportunistic pathogens; however, the occurrences of disease and even death caused by these bacteria in recent years are increasing [1,2,3,4,5]. These bacteria are resistant to many drugs and disinfectants [3,6,7]. Mycobacterium intracellulare subsp. chimaera (M. chimaera, previously classified as the separate species Mycobacterium chimaera) [8] is a slow-growing NTM, which is now considered a subspecies of Mycobacterium intracellulare, with high similarity to the subspecies intracellulare [8,9].
In the second decade of the 21 century, a global outbreak of M. chimaera disseminated infection and endocarditis occurred among patients who had undergone cardiopulmonary bypass surgeries [10,11,12,13]. The source of infection was identified as contaminated water in heater-cooler units (HCU), which regulate the temperature of blood and cardioplegia solution during open-heart surgery. Aerosols from the HCU, containing the pathogenic bacteria, spread in the operating room, infecting the patients’ open chest or grafts [14]. M. chimaera detected at the manufacturing site of the LivaNova (formerly Sorin) HCU suggested this was the source of the contamination [15].
In order to identify the source of the outbreak, the European Union launched an epidemiological investigation, in which genomes of 250 isolates, from patients, HCUs and the water supply in five different countries were sequenced, and the results were published in 2017 by Van Ingen et al. [16]. A phylogenetic analysis of the genomes divided most of the strains into two main groups. Almost all the strains obtained from patients who had undergone cardiopulmonary bypass surgery, belonged to a closely related subgroup which the authors designated subgroup 1.1. Van Ingen et al. also identified specific SNP signatures for some of the phylogenetic groups, including the outbreak subgroup 1.1 [16]. Other studies supported the finding that a single strain of M. chimaera was involved in this outbreak [17,18,19].
In addition to the clinical strains from patients related to the outbreak, the outbreak subgroup 1.1 included most of the strains from HCUs manufactured by LivaNova as well as samples taken from the LivaNova production site. Interestingly, other strains of M. chimaera were detected in water taken from HCUs manufactured by another company, Maquet, but these strains were genetically different from the strain causing the outbreak [16]. The mechanical characteristics of the LivaNova/Sorin 3T model of HCUs favored the spread of bacteria containing aerosols [14]. Following these findings, LivaNova modified this HCU device to improve its safety.
The Israel Ministry of Health (MOH) was notified in November 2016 by LivaNova of the NTM infections associated with use of its Stöckert 3T heater-cooler devices. At the time, 25 such devices were distributed among 11 hospitals throughout Israel performing open-heart surgery. The following steps were undertaken:
1. General hospital CEOs were notified, and asked to inform relevant staff members of the outbreak and nature of invasive NTM infections associated with use of these machines;
2. The directors of the clinical microbiology laboratories in general hospitals were asked to retroactively relay information on any pertinent cultures from patients who had undergone cardiac surgery with use of a heater-cooler device;
3. The public was notified via press release and a media interview of the outbreak and those having undergone open-heart surgery since 2011 were asked to report any suspicious symptoms or signs to their medical providers for evaluation;
4. CEOs of hospitals with the heater-cooler devices in use were asked to ensure that updated safety instructions issued by the manufacturer, and additional measures requested by the MOH, were strictly followed;
5. Hospitals using the machines were required to submit water samples for NTM, document device maintenance, and report bimonthly bacterial colony counts from water in the devices.
6. The manufacturer, in conjunction with the Ministry of Health, embarked a staggered recall of the devices to their plant in Europe, where they underwent cleaning, as well as installation of safety features that would render patients no longer vulnerable to infection due to contaminated exhaust from the water. This process was completed by mid-2018.
In order to estimate the infection risk, water samples from HCUs at 10 medical centers were sent to the Israeli public health laboratories for total bacterial count and specific identification of Mycobacterium contamination.
Fortunately, no clinical infections with M. chimaera related to use of the HCU were reported in Israel. Nevertheless, to characterize the epidemiology of M. chimaera in Israel and to evaluate the possibility of HCU-related M. chimaera infections, we sequenced the genomes of M. chimaera from the HCUs, as well as all the clinical Mycobacterium isolates in the Israeli mycobacterial reference laboratory that were identified as M. chimaera.
We found that some of the HCU cultures contained mixed mycobacterial species, which complicated the identification of M. chimaera strains. Environmental samples are often comprised of mixed mycobacteria strains. Isolating each NTM type is difficult, which makes deep sequencing analysis challenging. Several approaches have been applied to the task of deciphering bacterial strains from whole genome sequencing of a mixed culture. Eyre et al. [20] used a maximum likelihood-based model to identify two different strains of Clostridium difficile in short read WGS from mixed infection samples. However, their method relies on a previously constructed panel of known haplotypes that ideally include the strains. The QuantTB method [21] that detects mixed infection tuberculosis also uses a reference dataset of MTB genomes. Yang et al. introduced a tool for identifying strains of Salmonella enterica in samples from mixed infections, which also uses a database of known strains [22]. DESMAN [23] uses metagenomic binning and core genes to identify strains in mixed genome samples without a strain database. Metagenomic binning is a process that clusters environmental shotgun reads or their assembled contigs back into the taxa composing the sample. It is usually applied to reconstruct nearly complete genomes from metagenomic samples containing a large number of microorganisms. To address the challenge of mixed samples we formulated an in-house method, which involved de novo assembly and metagenomic binning followed by SNP identification and genotyping, by comparing our draft genomes to both a reference genome and a SNP-containing reference genome. Herein, we report the results of this analysis.

2. Results

2.1. Whole Genome Sequencing and Species Identification

We included in our study all eight M. chimaera isolates obtained from clinical samples during 2017 in Israel. These isolates were all obtained from sputum taken from patients suffering from chimaera pulmonary disease. In addition, we included one previous M. chimaera isolate obtained from chest biopsy in 2014 that was initially designated as M. intracellulare, and one M. intracellulare sample isolated from a patient’s pleural fluid in 2013. Besides the clinical samples, we also included thirteen samples of HCU water from ten medical centers that were cultured in the Israeli Mycobacterial Reference Laboratory, and identified as Mycobacterium intracellulare subsp. chimaera. All the sampled HCU devices were of the 3T model manufactured by LivaNova/Sorin, the same model that was linked to the global outbreak of M. chimaera following open-heart surgery (Table 1).
For initial identification, we used a molecular kit that is based on PCR and proprietary probes (GenoType NTM-DR assay, Hain Lifescience, Nehren, Germany). Two of the HCU samples were identified as mixed samples containing M. chimaera (Table 1).
Short reads whole genome sequencing (WGS) of these samples were obtained. We further identified the bacterial species in each sample by assigning taxa to the short sequence reads. The most abundant annotation in all samples was the genus Mycobacterium (53–83% of the reads, data not shown). On the species level, in clinical sample M10, originally identified as M. intracellulare by Hain Lifescience GenoType assay, the most abundant species-level annotation was indeed M. intracellulare. In two other clinical samples, M8 and M9, the most abundant species-level annotations were Shewanella decolorationis and Bacillus azotoformans, respectively, and M. chimaera was only the second most abundant species. However, these samples had 60% and 58% of their reads annotated as Mycobacterium, respectively. Therefore, it is safe to conclude that M. chimaera was the most abundant species in these samples as well. In seven of the samples from HCU devices, the most abundant species-level annotation was Mycobacterium gordonae, another NTM species (Table 2).
Since some of our samples were not pure M. chimaera, it was impossible to use the short read sequence for identifying the outbreak M. chimaera strain in a straightforward manner. We, therefore, applied the following steps:
Short reads in each sample were assembled de novo into contigs.
Contigs were binned into bacterial species, using metagenomic binning. Each sample resulted in one or two bins (Table 2 and Table S1). Each bin can be seen as an equivalent to genome sequencing of an isolate.
The species of each bin was identified by finding the genome most similar to it, from the collection of all publicly available genomes (Table 2).
A single bin was retrieved in each of our clinical samples, corresponding to one species. Most of these bins were more similar to M. chimaera genomes than any other genome, except for two cases. The bin produced from sample M10 was, as expected, most similar to a M. intracellulare genome, and sample M2, which was surprisingly most similar to a Mycobacterium sp. TKK-01-0059 genome, a poorly characterized species of the Mycobacterium tuberculosis complex (Table 2). However, this bin also showed high similarity to a genome of M. yongonense, a sub species of M. intracellulare (839/1000 k-mers in MASH analysis [24]), and lower similarity to a genome of M. chimaera (588/1000 k-mers). It cannot be ruled out that sample M2 is a strain of M. chimaera, which is distant from the strains with publicly available genomes.
The situation was different among samples taken from HCU devices. Samples M16-M19 and M21 had two metagenomic bins in each of them, one corresponding to M. chimaera and one to M. gordonae. The metagenomic binning process resulted in a single assembly bin, identified as M. gordonae by comparison to publicly available genomes, in samples M22 and M23. While a more sensitive PCR-based method (Hain Lifescience GenoType NTM-DR assay) was able to identify two different Mycobacterium species in samples M22 and M23, it is possible that the amount of M. chimaera in these samples was too low to be detected in the NGS sequencing by metagenomic sequencing. M19, M21, M22 and M23 had a relatively high proportion of reads annotated as M. gordonae (Table 2 column 5). Sample M20, also taken from a HCU, was divided into one bin of M. chimaera and one bin most similar to the genome of a Mycobacterium species isolated from a drinking water system in Illinois, USA [25]. HCU samples M11–M15 all had a single bin corresponding to M. chimaera.
For the rest of our study, we used only the assembly bins identified as Mycobacerium intracellulare, including the assembly bin from sample M2. The total length of these assemblies ranged between 5.3 Mbp and 6.9 Mbp, and the GC content between 67.3% and 68.1% (Table S1). In comparison, the published complete genome length of the M. chimaera reference strain DSM-44623 and strains Zuerich-1 and Zuerich-2 are 6.1 Mbp, 6.4 Mbp and 6.5 Mbp respectively, and their GC content are 67.7%, 67.5% and 67.4%, respectively.

2.2. Identification of Known Phylogenetic Groups by Specific SNP Signatures

We searched our assembled genomes for the SNP signatures defined by van Ingen et al. [16]. Twelve of the samples had a SNP signature assigning them to phylogenetic group 1, the largest phylogenetic group in that study, which included 200 isolates (Figure 1a). Within these 12 samples, two samples were further assigned to branch 2, one to subgroup 1.8, and six to subgroup 1.1, a tightly related phylogenetic subgroup related to the outbreak, by this subgroup’s SNP signature (Figure 1b). Three samples were assigned to group 1 but not to any of its subgroups or branches. In addition, one of our isolates was assigned to van Ingen et al.’s group 2 and, within it, to subgroup 2.1, by their specific SNP signatures (Figure 1a).
The remaining eight genomes were not assigned to any of the groups defined by specific SNP signature. Note that not every phylogenetic sub-type of M. chimaera has a SNP signature that can be used for its identification [9].
All isolates in our study that were assigned to subgroup 1.1, the outbreak subgroup, originated from HCU devices, which is in accordance with the fact that no clinical sample was taken from patients who had undergone cardio-thoracic surgery in the past.

2.3. Phylogeny

We identified SNP loci, relative to the M. chimaera reference strain DSM-44623, in the assembled bin of our isolates, as well as in some isolates analyzed by van Ingen et al. [16] (see next section), and M. intracellulare strain MOTT-2. These SNPs were used for building a maximum likelihood phylogenetic tree (Figure 1a). The phylogenetic tree branching pattern agreed with the assignment of isolates by van Ingen et al.’s group-specific SNP signatures [16].
Our samples have a wide genetic variability (Figure 1a). Sample M10, identified as M. intracellulare in all previous analyses, is most similar to other M. intracellulare genomes. Perhaps surprisingly, the same is true for sample M2 obtained from clinical sputum. Four of our samples, M6–M9, were obtained from clinical samples clustered in one similar, yet not identical, group. The largest group of our samples, comprised of 12 samples, clustered together with isolates assigned by van Ingen et al. [16] to group 1.
To explore the diversity within group 1, a separate maximum likelihood phylogenetic tree was built based on these isolates (Figure 1b). This tree too revealed a close agreement with SNP signatures. Most importantly, six isolates from HCUs clustered together with isolates from the outbreak subgroup described by van Ingen et al. [16]. These isolates also belong to this subgroup based on their signature SNPs.

2.4. Validation of the Results

To validate our procedures, we downloaded the raw genomic sequences of some isolates analyzed by van Ingen et al. [16], representing the full genetic variability in that study, and applied the same methods used on our isolates, including taxon annotation of short reads (not shown), de novo assembly followed by metagenomic binning and SNP identification. We used group-specific SNP signatures to assign these genomes back to their phylogenetic groups, and the original groups and subgroups were retrieved. Isolates that were part of the same branch in Ingen et al.’s phylogenetic analyses, exhibited similar clustering when analyzed with this study pipeline and used in phylogenetic trees (Figure 1).
We performed an in silico simulation, creating mixtures of short reads from M. chimaera and M. gordonae and applied the same method used here to identify SNPs, produce genotype calls and construct a phylogenetic tree, in order to explore the capacity of our method to identify strains from mixed samples. The Appendix A gives a detailed report of this analysis. In short, we were able to identify the original strains from the mixtures, and differentiate between different strains (Figures S1 and S2).
Ten of our samples, M1-10, were obtained from clinical samples, and can be assumed to belong to a single clone of Mycobacterium. To validate our results further, we mapped the sequence reads from theses samples to the M. chimaera reference genome and looked for SNPs. Among the SNPs identified in both methods, the base call of our clinical samples is identical in 95–100% of the cases, depending on the strain (average of 99%). The subtype signature SNPs were identical to those obtained with metagenomic binning (data not shown).

3. Discussion

The aim of this study was to characterize the M. chimaera strains in Israel, in both HCUs and clinical samples, and find out whether the global outbreak strain is present among them.
The mixed nature of samples in this study made it difficult to characterize the strains comprising them, and led us to use an un-orthodox path of bioinformatics analysis. The relative similarity between Mycobacterium genomes made the use of mapping to a reference genome inappropriate, since reads from one species of bacterium in the sample could be cross-mapped to a genome of a different bacterium. In addition, the mix of intra-species and inter-species variation may interfere with the identification of strains. In contrast, the use of assembled contigs, which are much longer, enabled the separation of sequences into bins representing the different bacteria. The metagenomic binning approach served as an in silico isolation method. The agreement in genome size and GC content of the bins we annotated as M. chimaera, with those characteristics of known genomes of M. chimaera, supports this annotation. Moreover, our confidence in our results relies on the fact that we re-captured the results of van Ingen et al. [16] for some of the isolates analyzed by them, using our unique analysis. For the clinical samples, each of which is clonal, we repeated our main results with the traditional strategy of mapping short reads to reference. Our simulation of mycobacteria mixtures (Appendix A) demonstrated highly accurate strain identification using the same method. We therefore have confidence in our in silico isolation strategy, and believe it can be used in other similar scenarios, in which in vitro isolation is not practical.
Many of the samples taken from HCU water contained Mycobacterium gordonae. Interestingly, other groups also found M. gordonae in cultures taken from HCU devices [18]. The culture medium used to grow Mycobacterium favors the growth of both species, and if the water contained more M. gordonae than M. chimaera, this would explain why it constituted the majority of sequences in some of our samples.
The global outbreak strain was found in some of the LivaNova/Sorin T3 HCUs in Israel but not in patients. As far as we know, no case of M. chimaera infection was diagnosed in a patient who had undergone cardio-thoracic surgery in Israel. This fortunate lack of infected patients, despite the presence of the outbreak strain in local HCU devices, may be due to several factors. The location and orientation of the HCU has a major effect on the chance of infection. If the infected devices were placed outside the operation room, or even inside it, but in an orientation such that the airflow direction is away from the surgery bed, this could diminish the infection risk [14]. Even in medical centers where cardio-thoracic surgery caused infections, these infections were very rare [26,27]. Lastly, several years can pass after the surgery before the infection manifests clinically [11,15].
Studies similar to ours were conducted globally (e.g., [12,17,18,19]). Our findings are in agreement with other studies, confirming the single strain of M. chimaera related to the outbreak and its common source.
Our study included nine Mycobacterium intracellulare subsp. chimaera clinical isolates from lungs (sputum or thoracic biopsy). Lung NTM infections are increasing worldwide [28,29,30]. In a previous study, 28% of Mycobacterium avium complex infections identified in human pulmonary samples were caused by Mycobacterium intracellulare subsp. chimaera [30]. The importance of this pathogen to human health is, therefore, beyond the cardio-thoracic surgery related outbreak, mostly regarding lung infection (for example [31,32,33]). Our knowledge of the population structure, mode of infection and virulence mechanism of this emerging pathogen is still lacking.

4. Materials and Methods

4.1. Bacterial Sampling

Sputum and chest biopsy samples were decontaminated for 30 min at room temperature with a 1:1 volume of 4% NaOH or 1:4 volume of 4% H2SO4, respectively. DDW was added to stop the process, and the samples were centrifuged at 3000× g for 20 min and re-suspended in 5 mL of the supernatant. 0.5 mL of the processed sample was inoculated on solid Löwenstein–Jensen (LJ) medium and incubated at 30 °C until observation of growth (up to 8 weeks).
Mycobacterium sampling from water was executed according to Public Health England guidelines [34]. One liter from each water sample was centrifuged at 3000× g for 20 min, the supernatant discarded and about 4 mL of the remaining was decontaminated using 1:4 volume of 4% H2SO4 with gentle agitation for 15 min. Decontamination stop, inoculation and incubation of the water samples were identical to the treatment of the clinical samples. Acid fast staining confirmed positive cultures [35].

4.2. DNA Extraction and Species Identification by Hain Lifescience Assay Kit

Crude DNA was extracted from the positive cultures by suspending a loop-full of bacteria from LJ medium in 300 µL water and heat inactivation for 45 min at 95 °C, followed by 15 min of sonication in an ultrasonic bath and centrifuging at 13,000 rpm for 5 min. A 5-µL aliquot of the supernatant was used for molecular identification using the GenoType NTM-DR assay (Hain Lifescience, Nehren, Germany) that was performed according to the manufacturer’s instructions.

4.3. Genomic DNA Isolation

Two different protocols were used for the isolation of genomic DNA. Manual RFLP-grade DNA extraction was performed, as previously described [36].
An automated DNA extraction was performed by suspending a confluent portion of bacteria from LJ media in 400 µL TE, heat inactivated for 30 min at 90 ℃ followed by incubation with 1 mg/mL lysozyme at 37 ℃ overnight. An amount of 400 µL of the bacterial lysate was transferred to the automated MagNA Pure Compact system for DNA extraction according to the manufacturer’s instructions (Roche Life Science, Penzberg, Germany).

4.4. Whole Genome Sequencing

Whole genome sequencing was performed at Hylabs LTD, Israel. Libraries were prepared using the NEB Ultra DNA library prep kit. Ten of the samples were sequenced on Illumina HiSeq instrument and 13 of the samples were sequenced on Illumina MiSeq instrument. Both sequencing generated 2 × 150 bp paired end reads.

4.5. Metagenomic Taxon Annotation of Reads

Annotating each short-read sequence to a taxon was undertaken with Kaiju [37]. In essence, each read was compared to a database of publically available DNA sequences to assign a taxon to it. When a high-resolution taxon level assignment was not possible, a lower resolution level was used. (For example, when it could not be decided to which species a read belongs, Kaiju would try to decipher to which genus it belongs. If a genus level annotation could not be made, a family level annotation would be tried, and so on).

4.6. Metagenomic Binning

Short reads were de novo assembled into contigs using SPAdes v3.11.1 [38]. The Assemblies were uploaded to the Pathosystems Resource Integration Center (PATRIC, https://www.patricbrc.org/; https://www.patricbrc.org/, accessed on 24 January 2019) [39], and PATRIC ‘Metagenomic Binning’ service was applied. The metagenomic algorithm used by PATRIC is explained in [38]. Briefly, a data base of representing sequences of a protein, encoded by a single gene in all prokaryotic genomes, is searched against for initial identification of the species present in the sample (the ‘bins’). Each bin has a reference genome associated with it. The assembly contigs are then assigned to the bins by Blast against these reference genomes.
To identify the species of each retrieved assembly bin, we used PATRIC service ‘Similar Genome Finder’, which implements the Mash tool [24]. The Mash algorithm represents each genome by a sketch, containing 1000 k-mers from this genome. The higher the similarity between two genomes, the more k-mers their sketches have in common.

4.7. SNP Calling and Genotype Calling

The dnadiff command in the MUMmer suit [40] was used to align each genome assembly to the genome assembly of M. chimaera reference strain DSM-44623 (RefSeq accession NZ_CP015278.1), and find variant positions. For each sample, the output of this command was a list of variant positions, relative to the reference. SNPs were filtered if another SNP existed within a window of 12 bp in the same genome. We constructed a dataset of all the SNPs identified from all the isolates in this study, and from a selection of 40 isolates analyzed by van Ingen et al. [16]. The database contains a total of 67,762 SNPs.
Since dnadiff only reports variant positions, it did not enable us to distinguish between cases where the sample genome is identical to the reference at a certain locus, and cases where this locus is not covered in the sample genome.
To distinguish between these two cases, we applied the following procedure (Figure 2):
A. We created a ‘dummy’ reference genome, in which the original bases in the reference genome of strain DSM-44623 were changed in all SNP loci, to some other base. All other loci in the ‘dummy’ reference genome are identical to the original reference genome. There is no significance to the base chosen to replace the reference base, as long as it is not the same (Figure 2a).
Genotype calls were collected as follows:
B. As mentioned above, for each isolate, it assembly bin was aligned to the reference genome, and SNPs were identified using dnadiff command in the MUMmer suit. For each SNP in the dataset, if the isolate differs from the reference genome in this base position, this base position appears in the output with the genotype call of the isolate.
C. Similar to the previous step, each isolate was aligned to the ‘dummy’ reference genome, and SNPs were identified. Since the ‘dummy’ genome differs from the true reference genome in all the SNPs in the dataset, for each SNP in the dataset where the isolate base is identical to the true reference genome, it differs from the ‘dummy’ reference genome and this base call appears in the output of dnadiff (Figure 2c).
D. Genotype call for each isolate from all dataset SNP loci was integrated from both B and C (Figure 2d).

4.8. SNP Based Phylogenetic Tree

After merging isolates of identical genotype, the SNPs were concatenated into a DNA sequence. A maximum likelihood phylogenetic tree was calculated using RAxML [41] with General Time Reversible model of nucleotide substitution under the Gamma model of rate heterogeneity with ascertainment bias correction, followed by 100 bootstrap iterations.

4.9. Group Specific SNP Signatures

The researchers van Ingen et al. identified SNPs signatures specific to some of their genotype groups and sub-groups and included them in Table S2 in their paper [16]. We searched our genomes for these signatures. We compared the bases in Table S2 from van Ingen et al. to the genome sequences strains DSM-44623 and ZUERICH-1 (RefSeq accessions NZ_CP015278.1 and NZ_CP015272.1, respectively). We noticed a mistake in one of the SNPs in the table, the reference allele in position 4,050,336 is C, not G, and the table was corrected accordingly. SNP genotypes were called, as described above. Isolates were identified which contain group-specific SNP signatures.

4.10. Group Specific Signature SNP in Clinical Samples, Using Mapping of Reads to Referecne Genome

In order to validate our results, we repeated the SNP identification in our clinical samples only, in a more mainstream approach. Reads were aligned to DSM-44623 reference genome with botwie2 [42], Samtools [43] and Varscan [44] were used for SNP identification. The genotype in each signature SNP loci was compared to the one obtained from metagenomic binning strategy.

5. Conclusions

The global outbreak strain was found in some of the LivaNova/Sorin T3 HCUs in Israel but not in patients. The use of metagenomic binning enabled strain identification from mixed cultures. The characterization of clinical M. chimaera isolates in Israel is important for our ability to surveil this emerging pathogen.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/pathogens10111392/s1. Table S1: The metagenomic bins and their quality [39]. Table S2: Metadata of M. chimaera samples used for in silico mix simulation. Table S3: Metadata of M. gordonae samples used in the in silico mix simulation. Table S4: Bins constructed from the in silico mixtures of reads from both Mycobacterium chimaera and Mycobacterium gordonae. Table S5: The metagenomic bins from the in silico mixture samples and their quality. Table S6: SNP Genotype call identity between M. chimaera bins from the read mixtures, and bins from the corresponding non-mixed M. chimaera sample. Table S7: SNP Genotype call identity between M. chimaera bins from the read mixtures and mapping non-mixed reads from the same sample to reference genome. Figure S1: SNP based phylogenetic tree with bins from mixed samples and the non-mixed mapped samples. Figure S2: Genotype call identity between bins from mixed samples and the non-mixed mapped strain. Tables S2–S7 and Figure S1 and S2: Results of the simulation [16,42,45].

Author Contributions

Conceptualization, E.R., I.N., M.R., M.J.S. and Y.C.; methodology, I.N., R.G. and M.R.; software, M.R.; validation, M.R.; formal analysis, M.R.; investigation, R.G.; resources, H.K.-S., R.G., Z.D., M.J.S. and Y.C.; data curation, R.G. and M.R.; writing-original draft preparation, M.R. and R.G.; writing-review and editing, M.R., R.G., I.N., M.J.S., Y.C., H.K.-S., Z.D. and E.R.; visualization, M.R.; supervision, E.R.; project administration, E.R.; funding acquisition, E.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are openly available in ENA at doi:10.15468/avkgwm, project accession number: PRJEB39994.

Acknowledgments

We would like to thank the Israeli National Mycobacterium laboratory team for carrying out the growth, identification and DNA extraction of bacilli.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Validation of SNP genotype call and strain identification from mixtures using metagenomic binning.
Since our Environmental samples contained mixtures of Mycobacterium intracellulare subsp. chimaera (M. chimaera) and Mycobacterium gordonae (M. gordonae), we have developed an in-house method to separate sequence data of mixed samples and identify the strain of M. chimaera. Our method includes de novo assembly of reads and metagenomic binning of contigs into different species, followed by genome alignment for SNP detection and genotype call (see methods section in the manuscript). In order to identify this method’s strength and limitations, we applied it to simulated in silico mixed samples with pre-defined proportions of M. chimaera and M. gordonae.
Short reads from 14 genetically diverse M. chimaera samples from van Ingen et al., [16] (Table S2) and 3 different M. gordonae from a Mycobacteria species sequencing project (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA401515, accessed on 17 September 2021 Table S3) were used to build 126 mixtures. Each pair of M. chimaera and M. gordonae constructed three mixtures, with 1:1, 3:1 and 9:1 proportions of chimaera:gordonae, respectively.
Each mixture was de novo assembled and subjected to metagenomic binning, as described in the method section. The PATRIC metagenomic binning service (see methods) constructed one M. gordonae bin in each of the 1:1 mixtures. In some of them, a M. chimaera bin was also constructed (Tables S4 and S5). In almost all 3:1 mixtures, one M. chimaera bin was constructed. In a few, a M. gordonae bin was also constructed. In a few, no bin was constructed. In almost all 9:1 mixtures, one M. chimaera bin was constructed. A M. gordonae bin was constructed in none of them. In a few mixtures, no bin was constructed. No bin annotated other than M. chimaera or M. gordonae was constructed in any mix (Tables S4 and S5). The ability of the Metagenomic binning service to identify and construct bins is, therefore, influenced by the amount of sequence from each species, as well as characteristic of the sample itself, either its quality or the strain of the species, or a combination of these factors. This explains why a more sensitive PCR-based method (Hain Lifescience GenoType NTM-DR assay) was able to identify two different Mycobacterium species in samples M22 and M23 in the main manuscript, whereas metagenomic binning was only able to retrieve an M. gordonae bin. However, as shown below, whenever a M. chimaera bin was constructed, it was possible to correctly identify its strain.
Subsequently, we continued to evaluate how well our pipeline performs in identifying the strain of the M. chimaera bins.
All bins annotated as M. chimaera originating from mixtures were used for SNP call and genotype call, as described in the manuscript method section. A total of 30242 SNPs were identified, relative to the reference genome of strain DSM-44623.
In our manuscript, we assigned some M. chimaera isolates, obtained from HCU water in Israel, to the outbreak strain. We based the assignment on their position in the phylogenetic tree, which was among samples previously shown by van Ingen et al. [16] to belong to this outbreak strain. We, therefore, wanted to validate that M. chimaera bins obtained from in silico mixtures indeed group together on a phylogenetic tree, according to the strain present in the mixture. Figure S1 shows a phylogenetic tree of the M. chimaera bins from the mixtures as well as non-mixed samples, color coded by the identity of the original strain in the mixture. Note that strains indeed cluster together correctly. Moreover, strains 16 and 31, and strains 206 and 207, which clustered together in this phylogenetic tree, also cluster together in the phylogenetic tree constructed by van Ingen et al. in their original paper [16]. The phylogenetic analysis, thus, supports our confidence in the method we have chosen.
Among the 30242 SNPs identified in the simulation, the genotype call of each bin was compared to that of its corresponding non-mixed M. chimaera strain. The rates of genotype call identity are presented in Table S6. The average genotype call identity between a bin from a mixed sample, and a bin from the clean sample of the same strain, is 99.7%. In comparison, the average identity between a bin from a mixed sample and a bin from a different strain is only 75.0%. Since strain identification is based on SNP genotype calls, this strongly supports the validity of this method for strain identification.
We also wanted to compare the genotype calls obtained from mixtures going through the binning process to those obtained by a more conventional method, namely, mapping the short sequence reads from non-mixed sample to the reference genome. The short reads from the 14 M. chimaera samples used in the simulation were re-mapped to the DSM-44623 reference genome with botwie2 [42]. Samtools and BCFtools [45] were used for per-position genotype call. Genotype calls were filtered for cover higher than 4, and ratio higher than 0.9 between major allele likelihood and minor allele likelihood. Genotype calls in the loci of SNPs identified among bins were obtained. We compared the genotype calls from the mixtures undergoing assembly and metagenomic binning, to the genotype calls obtained from their respective non-mixed samples, undergoing re-mapping (Table S7, Figure S2). On average, the genotype call identity was 98%. In comparison, the genotype call identity between a mixture-originating bin and a non-mixed re-mapped sample of a different strain is on average 73% (Table S7), again supporting the validity of strain identification using our method.
In summary, the validation demonstrates that our method is capable of separating different strains, and correctly identifying samples belonging to the same strain, thus assuring the validity of our results.
Our method is, however, not without limitations. The main limitation is the failure to identify bins of all species present in some mixtures.

References

  1. Larsson, L.-O.; Polverino, E.; Hoefsloot, W.; Codecasa, L.R.; Diel, R.; Jenkins, S.G.; Loebinger, M.R. Pulmonary disease by non-tuberculous mycobacteria-clinical management, unmet needs and future perspectives. Expert Rev. Respir. Med. 2017, 11, 1–13. [Google Scholar] [CrossRef] [PubMed]
  2. Lu, M.; Saddi, V.; Britton, P.N.; Selvadurai, H.; Robinson, P.D.; Pandit, C.; Marais, B.J.; Fitzgerald, D.A. Disease caused by non-tuberculous mycobacteria in children with cystic fibrosis. Paediatr. Respir. Rev. 2019, 29, 42–52. [Google Scholar] [CrossRef] [PubMed]
  3. Misch, E.A.; Saddler, C.; Davis, J.M. Skin and Soft Tissue Infections Due to Nontuberculous Mycobacteria. Curr. Infect. Dis. Rep. 2018, 20, 6. [Google Scholar] [CrossRef] [PubMed]
  4. Ratnatunga, C.N.; Lutzky, V.P.; Kupz, A.; Doolan, D.L.; Reid, D.W.; Field, M.; Bell, S.C.; Thomson, R.M.; Miles, J.J. The Rise of Non-Tuberculosis Mycobacterial Lung Disease. Front. Immunol. 2020, 11, 303. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Saifi, M.; Jabbarzadeh, E.; Bahrmand, A.R.; Karimi, A.; Pourazar, S.; Fateh, A.; Masoumi, M.; Vahidi, E. HSP65-PRA identification of non-tuberculosis mycobacteria from 4892 samples suspicious for mycobacterial infections. Clin. Microbiol. Infect. 2013, 19, 723–728. [Google Scholar] [CrossRef] [Green Version]
  6. Araj, G.F.; Baba, O.Z.; Itani, L.Y.; Avedissian, A.Z.; Sobh, G.M. Non-tuberculous mycobacteria profiles and their anti-mycobacterial resistance at a major medical center in Lebanon. J. Infect. Dev. Ctries. 2019, 13, 612–618. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Maurer, F.P.; Pohle, P.; Kernbach, M.; Sievert, D.; Hillemann, D.; Rupp, J.; Hombach, M.; Kranzer, K. Differential drug susceptibility patterns of Mycobacterium chimaera and other members of the Mycobacterium avium-intracellulare complex. Clin. Microbiol. Infect. 2019, 25, 379e1–379e7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Nouioui, I.; Carro, L.; Garcia-Lopez, M.; Meier-Kolthoff, J.P.; Woyke, T.; Kyrpides, N.C.; Pukall, R.; Klenk, H.P.; Goodfellow, M.; Goker, M. Genome-Based Taxonomic Classification of the Phylum Actinobacteria. Front. Microbiol. 2018, 9, 2007. [Google Scholar] [CrossRef] [Green Version]
  9. Van Ingen, J.; Turenne, C.Y.; Tortoli, E.; Wallace, R.J., Jr.; Brown-Elliott, B.A. A definition of the Mycobacterium avium complex for taxonomical and clinical purposes, a review. Int. J. Syst. Evol. Microbiol. 2018, 68, 3666–3677. [Google Scholar] [CrossRef]
  10. Achermann, Y.; Rossle, M.; Hoffmann, M.; Deggim, V.; Kuster, S.; Zimmermann, D.R.; Bloemberg, G.; Hombach, M.; Hasse, B. Prosthetic valve endocarditis and bloodstream infection due to Mycobacterium chimaera. J. Clin. Microbiol. 2013, 51, 1769–1773. [Google Scholar] [CrossRef] [Green Version]
  11. Kohler, P.; Kuster, S.P.; Bloemberg, G.; Schulthess, B.; Frank, M.; Tanner, F.C.; Rossle, M.; Boni, C.; Falk, V.; Wilhelm, M.J.; et al. Healthcare-associated prosthetic heart valve, aortic vascular graft, and disseminated Mycobacterium chimaera infections subsequent to open heart surgery. Eur. Heart. J. 2015, 36, 2745–2753. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Perkins, K.M.; Lawsin, A.; Hasan, N.A.; Strong, M.; Halpin, A.L.; Rodger, R.R.; Moulton-Meissner, H.; Crist, M.B.; Schwartz, S.; Marders, J.; et al. Notes from the Field: Mycobacterium chimaera Contamination of Heater-Cooler Devices Used in Cardiac Surgery-United States. Morb. Morta.l Wkly. Rep. 2016, 65, 1117–1118. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Williamson, D.; Howden, B.; Stinear, T. Mycobacterium chimaera Spread from Heating and Cooling Units in Heart Surgery. N. Engl. J. Med. 2017, 376, 600–602. [Google Scholar] [CrossRef] [PubMed]
  14. Kuehl, R.; Banderet, F.; Egli, A.; Keller, P.M.; Frei, R.; Dobele, T.; Eckstein, F.; Widmer, A.F. Different Types of Heater-Cooler Units and Their Risk of Transmission of Mycobacterium chimaera During Open-Heart Surgery: Clues From Device Design. Infect. Control Hosp. Epidemiol. 2018, 39, 834–840. [Google Scholar] [CrossRef] [PubMed]
  15. Haller, S.; Holler, C.; Jacobshagen, A.; Hamouda, O.; Abu Sin, M.; Monnet, D.L.; Plachouras, D.; Eckmanns, T. Contamination during production of heater-cooler units by Mycobacterium chimaera potential cause for invasive cardiovascular infections: Results of an outbreak investigation in Germany, April 2015 to February 2016. Euro. Surveill. 2016, 21. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Van Ingen, J.; Kohl, T.A.; Kranzer, K.; Hasse, B.; Keller, P.M.; Katarzyna Szafranska, A.; Hillemann, D.; Chand, M.; Schreiber, P.W.; Sommerstein, R.; et al. Global outbreak of severe Mycobacterium chimaera disease after cardiac surgery: A molecular epidemiological study. Lancet Infect. Dis. 2017, 17, 1033–1041. [Google Scholar] [CrossRef]
  17. Hasan, N.A.; Epperson, L.E.; Lawsin, A.; Rodger, R.R.; Perkins, K.M.; Halpin, A.L.; Perry, K.A.; Moulton-Meissner, H.; Diekema, D.J.; Crist, M.B.; et al. Genomic Analysis of Cardiac Surgery-Associated Mycobacterium chimaera Infections, United States. Emerg. Infect. Dis. 2019, 25, 559–563. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Svensson, E.; Jensen, E.T.; Rasmussen, E.M.; Folkvardsen, D.B.; Norman, A.; Lillebaek, T. Mycobacterium chimaera in Heater-Cooler Units in Denmark Related to Isolates from the United States and United Kingdom. Emerg. Infect. Dis. 2017, 23, 507–509. [Google Scholar] [CrossRef] [PubMed]
  19. Zhang, X.; Lin, J.; Feng, Y.; Wang, X.; McNally, A.; Zong, Z. Identification of Mycobacterium chimaera in heater-cooler units in China. Sci. Rep. 2018, 8, 7843. [Google Scholar] [CrossRef] [Green Version]
  20. Eyre, D.W.; Cule, M.L.; Griffiths, D.; Crook, D.W.; Peto, T.E.; Walker, A.S.; Wilson, D.J. Detection of mixed infection from bacterial whole genome sequence data allows assessment of its role in Clostridium difficile transmission. PLoS Comput. Biol. 2013, 9, e1003059. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  21. Anyansi, C.; Keo, A.; Walker, B.J.; Straub, T.J.; Manson, A.L.; Earl, A.M.; Abeel, T. QuantTB-a method to classify mixed Mycobacterium tuberculosis infections within whole genome sequencing data. BMC Genomics 2020, 21, 80. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Yang, W.; Huang, L.; Shi, C.; Wang, L.; Yu, R. UltraStrain: An NGS-Based Ultra Sensitive Strain Typing Method for Salmonella enterica. Front. Genet. 2019, 10, 276. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Quince, C.; Delmont, T.O.; Raguideau, S.; Alneberg, J.; Darling, A.E.; Collins, G.; Eren, A.M. DESMAN: A new tool for de novo extraction of strains from metagenomes. Genome Biol. 2017, 18, 181. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Ondov, B.D.; Treangen, T.J.; Melsted, P.; Mallonee, A.B.; Bergman, N.H.; Koren, S.; Phillippy, A.M. Mash: Fast genome and metagenome distance estimation using MinHash. Genome Biol. 2016, 17, 132. [Google Scholar] [CrossRef] [Green Version]
  25. Zhang, Y.; Kitajima, M.; Whittle, A.J.; Liu, W.T. Benefits of Genomic Insights and CRISPR-Cas Signatures to Monitor Potential Pathogens across Drinking Water Production and Distribution Systems. Front. Microbiol. 2017, 8, 2036. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Chand, M.; Lamagni, T.; Kranzer, K.; Hedge, J.; Moore, G.; Parks, S.; Collins, S.; Del Ojo Elias, C.; Ahmed, N.; Brown, T.; et al. Insidious Risk of Severe Mycobacterium chimaera Infection in Cardiac Surgery Patients. Clin. Infect. Dis. 2017, 64, 335–342. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Scriven, J.E.; Scobie, A.; Verlander, N.Q.; Houston, A.; Collyns, T.; Cajic, V.; Kon, O.M.; Mitchell, T.; Rahama, O.; Robinson, A.; et al. Mycobacterium chimaera infection following cardiac surgery in the United Kingdom: Clinical features and outcome of the first 30 cases. Clin. Microbiol. Infect. 2018, 24, 1164–1170. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Adjemian, J.; Olivier, K.N.; Seitz, A.E.; Holland, S.M.; Prevots, D.R. Prevalence of nontuberculous mycobacterial lung disease in U.S. Medicare beneficiaries. Am. J. Respir. Crit. Care. Med. 2012, 185, 881–886. [Google Scholar] [CrossRef] [Green Version]
  29. Boyle, D.P.; Zembower, T.R.; Reddy, S.; Qi, C. Comparison of Clinical Features, Virulence, and Relapse among Mycobacterium avium Complex Species. Am. J. Respir. Crit. Care. Med. 2015, 191, 1310–1317. [Google Scholar] [CrossRef]
  30. Koh, W.-J.; Chang, B.; Jeong, B.-H.; Jeon, K.; Kim, S.-Y.; Lee, N.Y.; Ki, C.-S.; Kwon, O.J. Increasing Recovery of Nontuberculous Mycobacteria from Respiratory Specimens over a 10-Year Period in a Tertiary Referral Hospital in South Korea. Tuberc. Respir. Dis. 2013, 75, 199–204. [Google Scholar] [CrossRef] [PubMed]
  31. Hagiya, H. Mycobacterium avium complex infection mimicking lung cancer. BMJ Case Rep. 2019, 12, e228847. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Kim, S.-Y.; Shin, S.H.; Moon, S.M.; Yang, B.; Kim, H.; Kwon, O.J.; Huh, H.J.; Ki, C.-S.; Lee, N.Y.; Shin, S.J.; et al. Distribution and clinical significance of Mycobacterium avium complex species isolated from respiratory specimens. Diagn. Microbiol. Infect. Dis. 2017, 88, 125–137. [Google Scholar] [CrossRef] [PubMed]
  33. Rosero, C.I.; Shams, W.E. Mycobacterium chimaera infection masquerading as a lung mass in a healthcare worker. IDCases 2019, 15, e00526. [Google Scholar] [CrossRef] [PubMed]
  34. Kranzer, K.; Ahmed, N.; Hoffman, P.; Pink, F. Protocol for Environmental Sampling, Processing and Culturing of Water and Air Samples for the Isolation of Slow-Growing Mycobacteria Standard Operating Procedure. 2016. Available online: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/540325/Air_water_environmental_sampling_SOP_V2.pdf (accessed on 20 October 2021).
  35. Kent, P.T.; Kubica, G.P. Public Health Mycobacteriology: A Guide for the Level III Laboratory; Centers for Disease Control: Atlanta, GA, USA, 1985; p. 207.
  36. Freidlin, P.J.; Goldblatt, D.; Kaidar-Shwartz, H.; Rorman, E. Polymorphic exact tandem repeat A (PETRA): A newly defined lineage of mycobacterium tuberculosis in israel originating predominantly in Sub-Saharan Africa. J. Clin. Microbiol. 2009, 47, 4006–4020. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Menzel, P.; Ng, K.L.; Krogh, A. Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat. Commun. 2016, 7, 11257. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Nurk, S.; Bankevich, A.; Antipov, D.; Gurevich, A.A.; Korobeynikov, A.; Lapidus, A.; Prjibelski, A.D.; Pyshkin, A.; Sirotkin, A.; Sirotkin, Y.; et al. Assembling single-cell genomes and mini-metagenomes from chimeric MDA products. J. Comput. Biol. 2013, 20, 714–737. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  39. Wattam, A.R.; Davis, J.J.; Assaf, R.; Boisvert, S.; Brettin, T.; Bun, C.; Conrad, N.; Dietrich, E.M.; Disz, T.; Gabbard, J.L.; et al. Improvements to PATRIC, the all-bacterial Bioinformatics Database and Analysis Resource Center. Nucleic Acids Res. 2017, 45, D535–D542. [Google Scholar] [CrossRef] [PubMed]
  40. Kurtz, S.; Phillippy, A.; Delcher, A.L.; Smoot, M.; Shumway, M.; Antonescu, C.; Salzberg, S.L. Versatile and open software for comparing large genomes. Genome Biol. 2004, 5, R12. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30, 1312–1313. [Google Scholar] [CrossRef]
  42. Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef] [Green Version]
  43. Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 2011, 27, 2987–2993. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Koboldt, D.C.; Chen, K.; Wylie, T.; Larson, D.E.; McLellan, M.D.; Mardis, E.R.; Weinstock, G.M.; Wilson, R.K.; Ding, L. VarScan: Variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics 2009, 25, 2283–2285. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M.; et al. Twelve years of SAMtools and BCFtools. Gigascience 2021, 10. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Phylogeny. Maximum likelihood phylogenetic trees based on SNPs identified in the bin assemblies of isolates from this study (Table 1) and selected isolates from the EU study published in 2017 in Lancet Infect Dis [16]. The shapes of the tips represent the isolate source. HTS: HCU device, this study; HVI: HCU device, van Ingen et al.; RPVI: related patient (a patient which has undergone a cardio thoracic surgery), van Ingen et al.; UPVI: unrelated patient, van Ingen et al.; UPTS: unrelated patient, this study; WVI: water dispenser, van Ingen et al. Sample names are colored by specific SNP signatures. Nodes supported by more than 90% bootstraps are marked in green dots. (a) All samples; (b) samples assigned to group 1 by its specific SNP signature.
Figure 1. Phylogeny. Maximum likelihood phylogenetic trees based on SNPs identified in the bin assemblies of isolates from this study (Table 1) and selected isolates from the EU study published in 2017 in Lancet Infect Dis [16]. The shapes of the tips represent the isolate source. HTS: HCU device, this study; HVI: HCU device, van Ingen et al.; RPVI: related patient (a patient which has undergone a cardio thoracic surgery), van Ingen et al.; UPVI: unrelated patient, van Ingen et al.; UPTS: unrelated patient, this study; WVI: water dispenser, van Ingen et al. Sample names are colored by specific SNP signatures. Nodes supported by more than 90% bootstraps are marked in green dots. (a) All samples; (b) samples assigned to group 1 by its specific SNP signature.
Pathogens 10 01392 g001aPathogens 10 01392 g001b
Figure 2. The process of genotype call. The assembled genome from each sample was aligned against the reference genome and SNPs were called. The SNPs from all the samples were used to build one dataset of SNPs. In order to differentiate SNP loci where the isolate genome is identical to the reference genome, from loci not covered or not properly aligned in the isolate, each isolate was also compared to the ‘dummy’ reference genome. (a) The ‘dummy’ reference genome was created by changing the base in each of the dataset loci. (b) Positions from the dataset in which the isolate is different from the reference genome were identified when aligning the isolate genome to the true reference genomes and the isolate base call in these positions was recorded. Positions with a question mark are those where the isolate is either identical to the reference genome, or the two genomes did not align. (c) The alignment step was repeated using the ‘dummy’ reference genome. (d) The genotype calls from (b) and (c) are integrated so a final genotype call is given to all SNPs in the dataset where the isolate genome aligns with the reference genome.
Figure 2. The process of genotype call. The assembled genome from each sample was aligned against the reference genome and SNPs were called. The SNPs from all the samples were used to build one dataset of SNPs. In order to differentiate SNP loci where the isolate genome is identical to the reference genome, from loci not covered or not properly aligned in the isolate, each isolate was also compared to the ‘dummy’ reference genome. (a) The ‘dummy’ reference genome was created by changing the base in each of the dataset loci. (b) Positions from the dataset in which the isolate is different from the reference genome were identified when aligning the isolate genome to the true reference genomes and the isolate base call in these positions was recorded. Positions with a question mark are those where the isolate is either identical to the reference genome, or the two genomes did not align. (c) The alignment step was repeated using the ‘dummy’ reference genome. (d) The genotype calls from (b) and (c) are integrated so a final genotype call is given to all SNPs in the dataset where the isolate genome aligns with the reference genome.
Pathogens 10 01392 g002
Table 1. A list of the samples sequenced and analyzed in this study.
Table 1. A list of the samples sequenced and analyzed in this study.
Sample NumberSource 1DNA Extraction 2Illumina Sequencing Instrument 3PCR Based Identification 4Year of SamplingHCU Production Date 5HCU NumberMedical Center 6
M1cl.: sputumRFLPHiSeqM. chimaera2017 Tm
M2cl.: sputumRFLPHiSeqM. chimaera2017 Ma
M3cl.: sputumRFLPHiSeqM. chimaera2017 Abct
M4cl.: sputumMagnapureMiSeqM. chimaera2017 Tlm
M5cl.: sputumMagnapureMiSeqM. chimaera2017 Ma
M6cl.: sputumMagnapureMiSeqM. chimaera2017 Nt
M7cl.: sputumRFLPMiSeqM. chimaera2017 Rm
M8cl.: sputumRFLPMiSeqM. chimaera2017 Nt
M9cl.: thoracic biopsyRFLPMiSeqM. chimaera2014 -
M10cl.: pleural fluidRFLPHiSeqM. intracellulare2013 -
M11HCUMagnapureMiSeqM. chimaera2017201116S12916Lc
M12HCUMagnapureMiSeqM. chimaera2017200416S10462Lc
M13HCUMagnapureMiSeqM. chimaera2017 16S12007Nk
M14HCUMagnapureMiSeqM. chimaera2017200416S10468Ahs
M15HCUMagnapureMiSeqM. chimaera2017201116S12918Rs
M16HCURFLPHiSeqM. chimaera2017200716S10888Lc
M17HCURFLPHiSeqM. chimaera2017200416S10448Nw
M18HCURFLPHiSeqM. chimaera2017200716S10890As
M19HCURFLPHiSeqM. chimaera2017200416S10395Hh
M20HCURFLPHiSeqM. chimaera2017 16S10082Ks
M21HCURFLPHiSeqM. chimaera2017201416S14090Aa
M22HCUMagnapureMiSeqM. chimaera + M. gordonae2017201516S15449Ap
M23HCUMagnapureMiSeqM. chimaera + NTM2017201516S15447Nw
1 Source: cl. = clinical. 2 The genomic DNA was extracted using two methods (‘RFLP’ and ‘Magnapure’, see methods). 3 Sequencing was performed using Illumina short reads technology in either a HiSeq or a MiSeq instrument (see methods). 4 PCR based identification-species identification by Hain Lifescience GenoType NTM-DR assay. 5 The production date of the HCU device, when available. 6 The hospital or the clinic in which the HCU was used, or the clinical sample was obtained, when available.
Table 2. Species identification.
Table 2. Species identification.
Sample NumberSource 1PCR Based Identification 2*Reads’ Most Abundant Species 3*Reads’ M. chimaera Abundance 4*Number of Bins 5Most Similar Genome to Bin (# K-mers) 6*
M1cl.: sputumM. chimaeraM. chimaera (3.6%)3.6%1M. chimaera strain MCIMRL2 (920)
M2cl.: sputumM. chimaeraM. chimaera (0.8%)0.8%1M. sp. TKK-01-0059 (874)
M3cl.: sputumM. chimaeraM. chimaera (6.4%)6.4%1M. chimaera strain ZUERICH-2 (896)
M4cl.: sputumM. chimaeraM. chimaera (4.4%)4.4%1M. chimaera strain ZUERICH-1 (805)
M5cl.: sputumM. chimaeraM. chimaera (3.8%)3.8%1M. chimaera strain ZUERICH-1 (325)
M6cl.: sputumM. chimaeraM. chimaera (4.6%)4.6%1M. chimaera strain DSM 44,623 (880)
M7cl.: sputumM. chimaeraM. chimaera (4.4%)4.4%1M. chimaera strain DSM 44,623 (893)
M8cl.: sputumM. chimaeraShewanella decolorationis (5.6%)4.2%1M. chimaera strain DSM 44,623 (903)
M9cl.: thoracic biopsyM. chimaeraBacillus azotoformans (6.6%)3.9%1M. chimaera strain DSM 44,623 (873)
M10cl.: pleural fluidM. intracellulareM. intracellulare (6.2%)0.2%1M. intracellulare MIN_061107_1834 (801)
M11HCUM. chimaeraM. chimaera (3.4%)3.4%1M. chimaera strain CDC 2015-22-71 (987)
M12HCUM. chimaeraM. chimaera (3.6%)3.6%1M. chimaera strain CDC 2015-22-71 (987)
M13HCUM. chimaeraM. chimaera (10.3%)10.3%1M. chimaera strain ZUERICH-2 (963)
M14HCUM. chimaeraM. chimaera (4.7%)4.7%1M. chimaera strain ZUERICH-1 (1000)
M15HCUM. chimaeraM. chimaera (4.1%)4.1%1M. chimaera strain ZUERICH-1 (999)
M16HCUM. chimaeraM. gordonae (15.1%)0.6%2M. gordonae strain 1275229.4 (567)
M. chimaera strain ZUERICH-1 (900)
M17HCUM. chimaeraM. gordonae (18.5%)0.2%2M. gordonae strain 1275229.4 (609)
M. chimaera strain WCHMC000032 (841)
M18HCUM. chimaeraM. gordonae (17.7%)0.3%2M. gordonae strain 1275229.4 (609)
M. chimaera strain WCHMC000032 (841)
M19HCUM. chimaeraM. gordonae (12.5%)0.2%2M. gordonae strain 1275229.4 (352)
M. chimaera strain SJ42 (743)
M20HCUM. chimaeraM. sp. (6.8%)0.9%2M. sp. strain DS2.013 (767)
M. chimaera strain WCHMC000030 (897)
M21HCUM. chimaeraM. gordonae (17%)0.3%2M. chimaera strain WCHMC000032 (867)
M. gordonae strain 1275229.4 (645)
M22HCUM. chimaera + M. gordonaeM. gordonae (18.6%)0.9%1M. gordonae strain 1275229.4 (605)
M23HCUM. chimaera + NTMM. gordonae (16.6%)0.3%1M. gordonae strain 1275229.4 (562)
1 Sample Source: cl. = clinical. 2 Species identification by Hain Lifescience GenoType NTM-DR assay. 3 The most abundant annotation of short reads on the species level (proportion of reads annotated to this species). 4 proportion of reads annotated as M. chimaera.5 Number of assembly bins retrieved from metagenomic binning. 6 The most similar genome to each bin, as identified by running MASH [24] against a database of all publicly available genomes. MASH uses a sketch in the form of 1000 K-mers to represent each genome. The more K-mers two genomes have in common- the more similar they are. # (Number of common K-mers out of maximum 1000 K-mers). * M. = Mycobacterium.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Rubinstein, M.; Grossman, R.; Nissan, I.; Schwaber, M.J.; Carmeli, Y.; Kaidar-Shwartz, H.; Dveyrin, Z.; Rorman, E. Mycobacterium intracellulare subsp. chimaera from Cardio Surgery Heating-Cooling Units and from Clinical Samples in Israel Are Genetically Unrelated. Pathogens 2021, 10, 1392. https://doi.org/10.3390/pathogens10111392

AMA Style

Rubinstein M, Grossman R, Nissan I, Schwaber MJ, Carmeli Y, Kaidar-Shwartz H, Dveyrin Z, Rorman E. Mycobacterium intracellulare subsp. chimaera from Cardio Surgery Heating-Cooling Units and from Clinical Samples in Israel Are Genetically Unrelated. Pathogens. 2021; 10(11):1392. https://doi.org/10.3390/pathogens10111392

Chicago/Turabian Style

Rubinstein, Mor, Rona Grossman, Israel Nissan, Mitchell J. Schwaber, Yehuda Carmeli, Hasia Kaidar-Shwartz, Zeev Dveyrin, and Efrat Rorman. 2021. "Mycobacterium intracellulare subsp. chimaera from Cardio Surgery Heating-Cooling Units and from Clinical Samples in Israel Are Genetically Unrelated" Pathogens 10, no. 11: 1392. https://doi.org/10.3390/pathogens10111392

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop