Next Article in Journal
In Silico Characterisation of Putative Prophages in Lactobacillaceae Used in Probiotics for Vaginal Health
Next Article in Special Issue
Preclinical Evidence for the Role of Botulinum Neurotoxin A (BoNT/A) in the Treatment of Peripheral Nerve Injury
Previous Article in Journal
Screening of Antimicrobial Activities and Lipopeptide Production of Endophytic Bacteria Isolated from Vetiver Roots
Previous Article in Special Issue
Extensive Genome Exploration of Clostridium botulinum Group III Field Strains
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparative Genomics of Clostridium baratii Reveals Strain-Level Diversity in Toxin Abundance

by
Claudia Silva-Andrade
1,2,
Alberto J. Martin
1,* and
Daniel Garrido
2,*
1
Laboratorio de Biología de Redes, Centro de Genómica y Bioinformática, Facultad de Ciencias, Universidad Mayor, Santiago 8580000, Chile
2
Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820436, Chile
*
Authors to whom correspondence should be addressed.
Microorganisms 2022, 10(2), 213; https://doi.org/10.3390/microorganisms10020213
Submission received: 24 September 2021 / Revised: 29 October 2021 / Accepted: 10 November 2021 / Published: 20 January 2022
(This article belongs to the Special Issue Gram Positive Toxins Producing Organisms)

Abstract

:
Clostridium baratii strains are rare opportunistic pathogens associated with botulism intoxication. They have been isolated from foods, soil and be carried asymptomatically or cause botulism outbreaks. Is not taxonomically related to Clostridium botulinum, but some strains are equipped with BoNT/F7 cluster. Despite their relationship with diseases, our knowledge regarding the genomic features and phylogenetic characteristics is limited. We analyzed the pangenome of C. baratii to understand the diversity and genomic features of this species. We compared existing genomes in public databases, metagenomes, and one newly sequenced strain isolated from an asymptomatic subject. The pangenome was open, indicating it comprises genetically diverse organisms. The core genome contained 28.49% of the total genes of the pangenome. Profiling virulence factors confirmed the presence of phospholipase C in some strains, a toxin capable of disrupting eukaryotic cell membranes. Furthermore, the genomic analysis indicated significant horizontal gene transfer (HGT) events as defined by the presence of prophage genomes. Seven strains were equipped with BoNT/F7 cluster. The active site was conserved in all strains, identifying a missing 7-aa region upstream of the active site in C. baratii genomes. This analysis could be important to advance our knowledge regarding opportunistic clostridia and better understand their contribution to disease.

1. Introduction

Species belonging to the Firmicutes phylum are the most numerous in the gut microbiota, with several Clostridium species among the most characterized and studied of its members [1]. Clostridium is an important genus of Gram-positive, obligate anaerobes, bar-shaped spore-forming rods. Clostridium species could be found in soil and residing in several animals’ intestines, showing their widespread distribution [2]. This taxonomic group includes important animal and human pathogens that cause dangerous and potentially deadly diseases such as tetanus [3] (produced by tetanus toxin from C. tetani), gastroenteritis [4] (C. perfringens), Clostridioides difficile infection, and botulism [5,6] (produced by botulism neurotoxin from C. botulinum), among several others [7]. In addition to their medical importance, Clostridium species are well studied for their industrial properties in producing important bioproducts such as ethanol, organic acids, and therapeutic proteins [8].
Interestingly, some members of the genus seem to be essential in the establishment of the immune tolerance to the gut microbiota, an effect mediated in part by regulatory T-cells and butyrate as a fermentation end-product [9]. Taxonomically, Clostridium sensu stricto species belong to the Clostridium Cluster I, based on 16S rRNA gene sequences, according to Collins et al. [10]. This group includes C. botulinum and other less-studied species such as Clostridium baratii.
C. baratii strains are Gram-positive rods, non-motile spore-forming bacteria. C. baratii is not taxonomically related to C. botulinum, but some members of this species have caused botulism outbreaks associated with food consumption, as well as sporadic infant botulism cases (less than 1% of total cases). These outbreaks are described as sporadic and rare episodes [11,12,13,14,15]; however, the strains that possess the neurotoxin may pose a significant threat, and it is essential to further study the properties of these microorganisms. C. baratii strains have been found in liver pastes, meat, soil, and other foods. The species is considered an opportunistic pathogen that can cause disease in subjects with altered gut microbiomes or undergoing antibiotherapy, both infants and adults.
Botulism intoxication is characterized by flaccid paralysis due to the blocking of neurotransmitter release (usually acetylcholine) at the neuromuscular junction. There are eight serotypes of BoNT toxins (A–G, and X) [16], which differ in their protein targets [17]. The C. baratii isolates responsible for the outbreaks have been shown to carry the botulinum neurotoxin (BoNT) and its gene cluster responsible for the production of the Type F botulinum toxin [12,13,14,15,16,18,19,20,21,22,23]. The F serotype is generally produced by C. botulinum strains and some strains of C. baratii. There are nine subtypes of the botulinum toxin F serotype [24]. These subtypes exhibit unique catalytic properties, sequence diversity, and substrate specificities [21,24]. The BoNT gene forms a cluster including UviA, UviB, ORF-X3, ORF-X2, ORF-X1, p47, NTNH, and BoNT genes [13]. The non-toxic non-hemagglutinin (NTNH) interacts directly with BoNT and plays an important role in protecting the toxin in the gastrointestinal environment, facilitating its cellular transport and release [25]. The UviA gene appears to perform a regulatory role similar to BotR in other serotypes [18].
Clostridium species play a pivotal and intricate role in the gut microbiota. Under normal circumstances, they promote gut homeostasis and contribute to the barrier effect and balance in mucosal immunity [26,27]. However, several members produce a wide array of toxins and contribute to severe diseases in certain conditions. In this study, we aimed to determine relevant genomic determinants of C. baratii genomes to better understand their diversity and their contribution to disease. To do so, we compared and studied all available C. baratii genomes, including one derived from a new isolate obtained by us, focusing on the identification of genes associated with the pathogenicity of this species.

2. Materials and Methods

2.1. DNA Isolation, Library Preparation, and Illumina Sequencing

Clostridium baratii C3 strain was isolated from fecal samples from a healthy subject in a previous study [28]. The strain was isolated using a YCFA medium [29] after three rounds of purification. Bacteria were cultured in Reinforced Clostridium Medium under anaerobic conditions in an anaerobic jar (Anaerocult), and genomic DNA was extracted using a modified phenol-chloroform protocol. DNA was submitted to the sequencing service MicrobesNG (Birmingham, UK) for sequencing using Illumina MiSeq with a minimum coverage of 30×, and 250 bp pair-end reads.

2.2. De Novo Assembly and Functional Annotation

Reads were trimmed with Trimmomatic [30]. Trimmed reads were assembled into contigs using the next genome assemblers: Mira [31], MaSuRCa [32], and SPAdes [33]. Each assembler was executed with default parameters. Finally, the result of the three assembly tools was merged with the option untrusted-contigs of SPAdes. Prodigal [34] was used to identify predicted open reading frames (ORFs) in all contigs from assembly genomes. Assembly contigs were annotated using BLAST+ [35] for Clusters of Orthologous Groups (COG) [36], Carbohydrate-Active enZYmes (CAZy) [37], TransportDB [38,39], Victors [40], and VFDB [41,42] databases and using HMMER [43] v3.1b2 against Pfam [44] database as available on July, 2020. Following, eggNOG-mapper [45] was used for annotation through orthology assignment. Finally, Interproscan [46] was used with default parameters to assign functional annotations based on sequence homology detection.

2.3. Genomes

The genome of our isolated C. baratii strain was used to perform comparative genomics with the existent genomes available in the Reference sequence (RefSeq) database at NCBI available on September 2021 [47]. The data used for comparative genome analysis are described in Supplementary Table S1. The data used for Multi Locus Sequence Analysis were the genomes used in Kiu et al. [7].

2.4. Multi Locus Sequence Analysis (MLSA)

To obtain a better resolution of the phylogenetic relationships of our isolated C. baratii strain, we included in our analysis all Clostridium pathogenic strains used in Kiu et al. [7]. Then performed a Multi Locus sequence analysis [48] with the 15 ribosomal protein data sets (ribosomal proteins L2, L3, L4, L5, L6, L14, L16, L18, L22, L24, S3, S8, S10, S17, and S19) used in Hug et al. [49]. We used Orthofinder [50] to search orthologous genes between the strains, followed by RaxML [51], a tool that calculates the phylogenetic distance and creates a Maximum Likelihood phylogenetic tree. Finally, the phylogenetic tree was visualized with Figtree [52].

2.5. Comparative Genome Analysis

Homologous groups of protein-coding genes from multiple bacterial genomes were built using Roary [53]. Roary was run with the following parameters: “roary-p 10-r-e-i 90-f./data/*.gff”. We visualized the output of Roary with roary_plots.py and create_pan_genome_plots.R scripts, provided with the software.
We employed the reference pangenome created by Roary as an input to EggNOG 4.5.1 eggNOG-mapper v2 genome-wide functional annotation tool [45], ran default parameters, and the DIAMOND [54] mapping mode.
We evaluate the Clusters of Orthologous Groups (COGs) [36] found in the pangenome, sorting the results from EggNOG by COG categories. The values associated with the COG categories represent the percentage belonging to the core genome out of the total COG found. Each category is counted independently.
We used a blastp search against the Carbohydrate-Active Enzymes (CaZy) [37] Database Available online: http://www.cazy.org/ (accessed on 5 January 2021) to identify genes of carbohydrate metabolism. CaZy hits were considered positive if they had a 60% identity and coverage in blast result.
Genes related to the GH (glycoside hydrolases) CaZy category in the pangenome were extracted from each strain using in-house Perl and bash scripts. Bar plots were generated using the R ggplot2 package [55].

2.6. Genomic Analysis

To identify the virulence potentials, we performed an in silico analysis of 29 identified toxins and virulent enzyme sequence data found in different Clostridium strains from the Virulence Factor Database (VFDB, as available on July 2020) [41,42]. To identity prophage profiles, we used the online tool PHASTER [56,57] (PHAge Search Tool Enhanced Release). To identify antimicrobial resistance, we download the sequences from the Comprehensive Antimicrobial Resistance Database [58] (CARD), as available in January 2021. These sequences were used to build a BLAST database.
To identify the plasmid in C. baratii strains, we performed an in silico analysis using the Comprehensive Database of Plasmid Sequences [59]. With the result of plasmid, we searched the BoNT/F7 cluster and evaluated the proteins from the plasmid present in C. baratii strains with BLAST+ [35].
BLAST+ [35] was employed for sequence similarity search (blastP) using thresholds for percentage identity and coverage of 80% and e value of 1 × 10−5 were applied. Heat maps were generated using the R [55] pheatmap package.
BoNT gene clusters were downloaded from the NCBI database from the code JX847735 and used to build a BLAST database. BLAST+ was employed for sequence similarity search (blastP) using thresholds for percentage identity and coverage of 60% and e value of 1 × 10−5. Heat maps were generated using the R pheatmap package. To understand the phylogenetic relationship between the BoNT genes in C. baratii and C. botulinum strains, we used RAxML [51] to calculate the phylogenetic distance and create a Maximum Likelihood phylogenetic tree with all sequences of BoNT genes. Finally, the phylogenetic tree was visualized with Figtree [52].
To evaluate the possible effect on the three-dimensional structure of the loss of seven amino acids upstream of the active site in BoNT toxin in C. baratii genomes, we used the BoNT/F7 gene from AGR53840.1 and performed a tertiary structure prediction with Phyre2 [60]. We compared this gene to the known three-dimensional structure of BoNT/A1 (PDB: 3BTA) in VMD [61] using Multiseq tools to align the structure of both proteins.

2.7. Microbiological Assays

C. baratii C3 was routinely cultured in Reinforced Clostridium Medium (RCM, Becton-Dickinson, Franklin Lakes, NJ, USA) supplemented with 0.5 g/L of L-cysteine (RCM-cys; Loba Chemie, India). All incubations were performed at 37 °C for 24–48 h in an anaerobic jar (Anaerocult, Merck, Darmstadt, Germany) with anaerobic packs (Gaspak EM, Becton-Dickinson, Franklin Lakes, NJ, USA). Two overnight cultures of C. baratii were made on Blood Agar and Starch Agar plates (Winkler, Santiago, Chile). The bacteria were also inoculated in API-20A strips (Biomerieux, Craponne, France) following the manufacturer’s instructions. Finally, antibiotic resistance patterns were evaluated using antibiotic discs in RCM agar plates incubated for 48 h (Lilfilchem, Roseto degli Abruzzi TE, Italy) for 48 h. Cefazolin (30 µg), ciprofloxacin (5 µg), chloramphenicol (30 µg), erythromycin (15 µg), penicillin G (10 µg), piperacill/tazobactam (110 µg), trimethoprim-sulfamethoxazole (25 µg), and vancomycin (30 µg) were used.

3. Results

3.1. General Features

Our analysis included fourteen public genomes of C. baratii available from IMG [62,63,64] and the Reference sequence (RefSeq) database. Genomes were obtained from single species or public metagenomes [47,65], and a novel strain isolated from an asymptomatic subject fecal samples, C. baratii C3. In publicly accessible metagenomes studies from the NCBI [66] database, we observed that the frequency of finding C. baratii genomes across human gut metagenomes was very low: from 2588 human gut meta-genome bioprojects only four contained C. baratii genomes (Supplementary Table S1).
On average, using the fifteen genomes employed in this work, the genome size of C. baratii was 3,102,724 bp and it contained 2911 predicted protein-coding genes (Supplementary Table S1). The genome size of C. baratii C3 was 3,119,424 bp with 2972 predicted protein-coding genes, similar to other C. baratii genomes.

3.2. Probing Evolutionary Relationships between Clostridium Strains

We used the multi-locus sequence analysis approach (MLSA [48]) to understand the phylogenetic relationship between the fifteen genomes of C. baratii described above and the pathogenic clostridia used in Kiu et al. [7]. The pathogenic clostridia used in this study and toxins or diseases that produce are listed in Supplementary Table S2 This analysis showed that all C. baratii strains formed a monophyletic lineage in the phylogenetic tree, and the closest relative genomes were found in a cluster containing Clostridium carnis and Clostridium butyricum genomes [67]. C. baratii C3 appeared to be closely related to a monophyletic lineage in the phylogenetic tree formed by C. baratii strains L3_128_029G1, XCM, 693-15, 796-15, 2789STDY5834956, 695-15, 694-15, MCC332, and 771-14 (Figure 1).

3.3. Comparative Genomics Analysis

To identify variable and conserved functions in C. baratii, we constructed the pangenome of this species from all fifteen available genomes. The pangenome of C. baratii comprised 6122 genes, 1744 core genes, and 4378 accessory genes (Figure 2A). Remarkably, 28.49% of the pangenome represented core genes, and 24.43% (1.496 genes) were unique (defined as genes present only in one strain; Figure 2A–C). This analysis indicated that C. baratii comprises genetically diverse organisms since it has a highly variable pangenome (Figure 2D). Given available genomes, we estimate that new genes could potentially be added at an average of 226 genes for each new genome sequenced (Supplementary Figure S1).

3.4. COG Analysis

We then analyzed the functions of the core and accessory genes of the C. baratii pangenome by assigning them to “Clusters of Orthologous Groups” (COG) database [36]. Genes with unknown functions were more abundant among accessory genes than in the core pangenome (332 core genes vs. 790 accessory genes). Interestingly, 242 accessory genes were assigned to a functional cluster associated with “replication, recombination and repair” (“L” COG code) versus 88 core genes (Figure 3), the majority of genes from accessory genes in this category were associated with transposases, integrases, and phages. This difference suggests that C. baratii genomes have variations in their repair mechanisms and adaptations to stress. Similarly, genes associated with cell wall/membrane/envelope biogenesis (“M” COG code) were found more often in the accessory pangenome (233 genes) than in the core genes (68 genes). Finally, the number of genes associated with carbohydrate transport and metabolism (“G” COG code) was higher among accessory genes than in core genes (accessory: 288 genes vs. core: 127 genes). This suggests that carbohydrate utilization functions are conserved among strains of C. baratii.

3.5. Carbohydrate-Active Enzyme Analysis

Glycoside hydrolases were identified as the enzyme family with more members versus other carbohydrate-active enzymes among C. baratii genomes (Supplementary Figure S2). We found that the GH1 family was the most represented in C. baratii strains (Figure 4). Among GH1, 21.43% were predicted as β-galactosidases according to their EC numbers, and 78.57% were predicted as β-glucosidases. The second most represented family was GH13, a family represented by α-amylases. Their abundance indicates a conserved starch-utilization capability among C. baratii, especially strains C3 and XCM. The third most represented family was GH18, known as hexosaminidases releasing N-acetyl-β-D-glucosaminide/galactosaminide linkages in host-derived glycans and chitins. This observation suggests a limited ability to metabolize host-derived glycans, considering the absence of other important GH families including sialidases and fucosidases.

3.6. Virulence Factors

We later examined the presence of virulence factors among all analyzed C. baratii genomes and other bacteria in the Clostridium genus that are known to cause diseases to be pathological (Figure 5). The BoNT gene F7 serotype, encoding the botulinum toxin, was present in seven out of fifteen genomes of C. baratii, being absent in strains MGYG-HGUT-00064, 2789STDY5834956, 2789STDY5834907, XCM, L3_128_029G1, MCC332, L2_013_037G1, and C3. The membrane-active alpha-toxin phospholipase C (plc) was present in all C. baratii genomes (Supplementary Table S3). C. perfringens ATCC 13,124 contained the most diverse repertoire of toxin genes, including plc, sialidases (nanH, nanI, and nanJ), theta-toxin/perfringolysin O (pfoA), mu-toxin (NagH, NagI, NagJ, nagK), and microbial collagenase (colA).
Additionally, we evaluated the presence of antimicrobial resistance genes (AMR) and phage content among these Clostridium genomes (Figure 5). All but two clostridial pathogens and three C. baratii strains had a rpoB gene (CARD code ARO:3004563) with point mutations conferring putative resistance to rifampin and rifampicin. The genome of C. sordelli contains the most diverse repertoire of AMR genes, including EF-Tu mutations (CARD code ARO:3003357) conferring resistance to elfamycin, gyrA (fluoroquinolones), rpoC (rifampicin), tetA, and tetB (tetracycline resistance). C. baratii strains 2789STDY5834956, C3, and MGYG-HGUT-00064 had a tetA gene. Strain C3 carried point mutations in gyrB (CARD code ARO:3004562) with point mutations conferring resistance to fluoroquinolone antibiotics. Strain XCM carried tetA and tetB genes (Figure 5). These results indicate that certain C. baratii strains could carry relevant AMR genes. However, they were scattered among C. baratii genomes.
Finally, genome scanning for prophage elements revealed that phage phiSM101 and vB_Cpes_CP51, bacteriophages that belong to the Siphoviridae family. The Siphoviridae family is a diverse family that infects the Enterobacteriaceae bacterial family and carries virulence genes acquired by other microorganisms [68]; they are present in C. perfringens [25] and only present in C. baratii strains 771-14, XCM, Sullivan 2789STDY5834956, and 2789STDY5834907. The PHASTER Putative prophages analysis showed that all C. baratii genomes except strain 695-15 contain several prophage proteins, ranging from eight to more than a hundred (Supplementary Table S4).
Plasmids were found only in Sullivan and CDC51267 strains, which have finished genomes sequences. C. baratii strain l Sullivan carries the plasmid pCBJ, with a size of 185,364 bp and 212 coding regions [18], and C. baratii strain CDC51267 carries the plasmid pNPD11_1, a circular plasmid with 119 coding regions and a size of 120,667 bp [69]. Both strains have the BoNT/F7 cluster in the plasmid. We found evidence of protein from plasmids pNPD11_1 in strains 693-15, 694-15, 695-15, 771-14, and 796-15, (Supplementary Table S5)

3.7. Phenotypical Assays of C. baratii C3

Strain C3 was shown to have a β-hemolytic activity revealing the presence of exotoxins, and positive for starch degradation, indicating amylase activity (Figure 6). Biochemical tests indicated several features in common with other C. baratii strains, but compared to other studies C3 was unusual in the utilization of xylose, arabinose, melezitose, and raffinose. Disk diffusion tests indicated that C3 was resistant to trimethoprim-sulfamethoxazole.

3.8. Distribution of the BoNT Gene Cluster in C. baratii

We finally studied the distribution and diversity of the BoNT gene cluster in this species. C. baratii strains 693-15, 694-15, 695-15, 771-14, 796-15, Sullivan, and CDC51267 carried the BoNT/F cluster, while it was absent in the other eight genomes studied (Figure 7). The proteins encoded by genes in this cluster, represented by genes BoNT/F, NTNH, UviA, UviB, P47, ORF-X1, ORF-X2, and ORF-X3, synthesize the BoNT/F toxin. C. baratii 693-15, 694-15, 695-15, 771-14, 796-15, CDC51267 and Sullivan strains showed an organization and gene cluster contents identical to serotype E and similar to serotype F6, which do not contain the botR gene. Serotype F6 differs from F7 and E as it contains an is element between the orfX3 and orfX2 genes. The proteolytic clostridia F1, F2, F4, and F5 contain botR [18].
The NTNH gene was also found in all C. baratii genomes carrying the BoNT cluster, as well as UviA and UviB, suggesting that the UviA/B protein complex participates in the regulation of the production of botulinum neurotoxin [18] in C. baratii, similar to other Clostridium pathogens [70,71], but this remains to be shown experimentally.
At the amino acid sequence level, the BoNT gene clustered separately between C. baratii and C. botulinum genomes (Supplementary Figure S3). BoNT sequences were well conserved among C. baratii genomes that contained the gene set. The BoNT proteins from C. baratii strains had on average a 72% identity sequence with the BoNT/F from C. botulinum strains (Supplementary Figure S3).
The active site of the type botulinum toxin is the HEXXH sequence, located in positions 211–215 [72,73]. This motif and surrounding positions were well conserved among all C. baratii and C. botulinum strains (HELIH sequence). Only a 7-aa region upstream of the active site appeared to be missing across C. baratii genomes compared with C. botulinum strains (Figure 8). It has been shown that BoNT/F from C. baratii can cleave VAMP at the same site as the other BoNT/F serotypes [74,75,76]. Therefore, the 7 amino acids gap present in C. baratii genomes is not likely to affect the activity of the toxin and its production since this gap is located far from the active site of the protein described in Breidenbach et al. [77] (Supplementary Figure S4).

4. Discussion

The species C. baratii includes strains that are known to be responsible for botulism outbreaks, a dangerous form of food poisoning [15,19,23]. The BoNT serotype F has been associated with outbreaks caused by the consumption of meat-derived foods [13,78]. However, our knowledge of the diversity and genome features of this species is limited. Here, we compared the fifteen genomes of C. baratii, including two obtained from metagenomes and the genome of a strain we previously isolated from an asymptomatic subject. Unfortunately, the availability of genome sequences from outbreaks and clinical cases is very low, limiting our analysis to a reduced set of genomes. In addition, the isolation of a neurotoxigenic organism from feces does not necessarily imply that living bacteria have colonized the intestinal area since inert spores passing through the intestinal tract may be the source of the isolate. However, it is uncertain if this applies to C. baratii since it has been found in different environments.
C. baratii appears to be a very infrequent species in the gut microbiota (Supplementary Table S1). C. baratii genomes form a monophyletic lineage, not phylogenetically related to any known pathogen clostridia. The comparative analysis shows an open pangenome, indicating that more genomes of this species are needed to understand its full genomic diversity. This observation is in the same line as previous studies indicating C. baratii represents genetically diverse organisms [3,7]. Similarly, several other pathogenic microorganisms have open pangenomes, including C. perfringens [7] and Legionella pneumophila [79]. Different bacterial pathogens, such as Bacillus anthracis [80] and Yersinia pestis [81] were reported to have closed pangenomes. Rouli et al. [80] proposed that the pangenome’s nature reflects the organism’s lifestyle, and species that have an open pangenome can thrive in human and animal guts. However, it is uncertain if this applies to C. baratii as it does not colonize temporarily the gut microbiota and it has been found in several environments. It should be noted that this pangenome analysis might be limited due to the low number of C. baratii genomes available in the databases.
The pangenomic variation of the C. baratii species studied here could be driven by horizontal gene transfer (HGT) events, as suggested by the high number of genes associated with transposases, integrases, and phages (“L” COG category). This high unusual number could explain a high potential for gene gain or loss events (Figure 3). Interestingly, genes involved in defense mechanisms, recombination, and repair processes were encoded at a higher percentage within the accessory genes [82,83,84]. This also suggests variation in the genomic adaptations to stress among C. baratii genomes, which in addition, correlates with the variations found in AMR genes in this species. In contrast, genes encoding functions related to metabolism (i.e., amino acid, carbohydrates, butyrate production) were more represented within the core pangenome, indicating that these functions are conserved among C. baratii species.
To understand the pathogenicity of C. baratii and other pathogen clostridia described in Kiu et al. [7], we evaluated the toxins present in these genomes. Figure 5 shows that alpha-toxin plc (62.8% identity on average with plc from C. perfringens, Supplementary Table S3) is present in all C. baratii strains. The phospholipase C (plc) is a hydrolase involved in signal transduction processes [85]. Bacterial plc interacts with eukaryotic cell membranes and hydrolyzes phosphatidylcholine and sphingomyelin, causing cell lysis [86], and under certain thresholds oxidative stress [87] and gas gangrene [88]. We found evidence of the presence of virulence factors well conserved in C. baratii genomes. And we found only two C. baratii strains with complete plasmids (plasmid pNPD11_1 in CDC51267 and plasmid pCBJ in Sullivan strains). Additionally, we only found proteins from a plasmid (plasmid pNPD11_1) in strains that have the BoNT cluster (693-15, 694-15, 695-15, 771-14, and 796-15). This is relevant because these are the only genomes that are closed, the other strains have a draft genome. Knowing how they were acquired is a big challenge because is necessary to have more information on these strains and ideally have closed the genomes instead of what is available today. This bioinformatics prediction correlates with the β-hemolytic activity of C. baratii C3. Finally, predicted AMR results should be taken with caution, and should be validated experimentally. For example, CARD predicted fluoroquinolone resistance in C. baratii C3 through gyrB mutations, however, this microorganism was sensitive to ciprofloxacin. Apparently, C3 is resistant to tetracycline and rifampin, but we were not able to confirm this estimation. Clostridium species appear to be resistant in great numbers to sulfonamides.
The BoNT/F gene cluster was present in seven out of fifteen strains (C. baratii 693-15, 694-15, 695-15, 771-14, 796-15, Sullivan, and CDC51267 strains). Considering the clinical importance of this microorganism and that most studies reporting genomic information about this taxa are related to botulinum outbreaks, the proportion of C. baratii genomes with BoNT production potential is unclear and could be overlooked. The phylogeny of BoNT genes (Supplementary Figure S3) in C. baratii and C. botulinum strains showed a monophyletic lineage, indicating that the amino acid sequence is well conserved between these species. These findings agree with the high degree of conservation of the active site of the BoNT protein. In addition, BoNT sequences within C. baratii strains analyzed in this study, have a lack of genetic variation among BoNT sequences when compared to other Clostridium species and its presence in seven of fifteen genomes suggest a recent horizontal gene transfer event that is also supported by a large number of prophage proteins. That C. baratii strains acquired the BoNT cluster by HGT was proposed before based on the evidence of the presence of the same toxins in different groups of bacterial species [20] and that the genomic architecture showed that the BoNT cluster is flanked by two IS1182 gene copies [14,20].
The percentage of identity between the BoNT type F of C. botulinum and C. baratii strains was on average 72%, and the active site HEXXH was conserved between proteins of both species (Figure 8). We found a seven amino acid region upstream of the active site appeared to be missing across C. baratii genomes. However, this gap is not likely to impact the toxicity of BoNT because these amino acids do not interact with the molecular target of BoNT, the VAMP2 protein [71]. It is relevant to consider that the BoNT/F7 of C. baratii needs to recognize a longer peptide of the VAMP sequence to achieve cleavage [89] when identifying the toxin. In addition, there is a 30% difference in identity with other F serotypes of C. botulinum.

5. Conclusions

C. baratii is an understudied clostridial species that could pose a potential threat to human health considering some isolates could produce the botulinum toxin. Our comparative genomic analysis of fifteen genomes of C. baratii indicated a heterogeneous open pangenome and diversity in adaptation to stress processes. Certain C. baratii species seem to be resistant to tetracycline, the majority of C. baratii strains were equipped with phospholipase C. Seven out of fifteen genomes carried a complete BoNT/F7 cluster that is highly conserved among C. baratii genomes and displayed distinct differences compared to C. botulinum BoNT. The analysis we report helps to understand the properties of rare clostridial taxa and their understudied clinical relevance, highlighting their toxin-producing capabilities and likely resistance and adaptation mechanisms.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/microorganisms10020213/s1. Figure S1. C. baratii pan-genome computational statistics. Figure S1. Conserved and unique genes in C. baratii pangenome. Figure S2. Percentage of all CAZy categories identified in C. baratii pangenomes. Figure S3. BoNT phylogeny between C. botulinum and C. baratii. Figure S4. BoNT three-dimensional structure. Table S1. List of Clostridium baratii strains genomes from NCBI database [90,91,92,93,94,95,96,97,98,99,100,101]. Table S2. Genomes of pathogenic clostridia. List of pathogen Clostridium genomes used. Table S3. plc protein identity. % identity based on amino acid sequence identity (using blastp) with plc from C. perfringens as reference. Table S4. Prophages found in C. baratii strains. On a separate file. Table S5. Quantity of proteins from plasmids found in C. baratii strains.

Author Contributions

C.S.-A. performed the analyses. C.S.-A. and D.G. wrote the article. C.S.-A., A.J.M. and D.G. conceived the study and revised the manuscript. D.G. and A.J.M. provided materials. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by ANID Doctoral Fellowship [21191605], FONDEQUIP EQM190070, FONDECYT Regular Project [1190074], and FONDECYT Regular Project [1181089]. Powered@NLHPC: this research was partially supported by the supercomputing infrastructure of the NLHPC (ECM-02); and by the computing infrastructure of the Centro de Genómica y Bioinformática, Universidad Mayor.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The genome of C. baratii strain C3 are available on https://www.ncbi.nlm.nih.gov/bioproject/PRJNA716081/ accessed on 11 November 2021.

Acknowledgments

The authors thank Pamela Thomson, Belen Hirmas, Guillermo Orellana, and Alexis Torres for their support in the study.

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Eckburg, P.B.; Bik, E.M.; Bernstein, C.N.; Purdom, E.; Dethlefsen, L.; Sargent, M.; Gill, S.R.; Nelson, K.E.; Relman, D.A. Diversity of the human intestinal microbial flora. Science 2005, 308, 1635–1638. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Cruz-Morales, P.; Orellana, C.; Moutafis, G.; Moonen, G.; Rincon, G.; Nielsen, L.; Marcellin, E. Revisiting the Evolution and Taxonomy of Clostridia, a Phylogenomic Update. Genome Biol. Evol. 2019, 11, 2035–2044. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Cohen, J.E.; Wang, R.; Shen, R.-F.; Wu, W.W.; Keller, J.E. Comparative pathogenomics of Clostridium tetani. PLoS ONE 2017, 12, e0182909. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Shrestha, A.; Uzal, F.A.; McClane, B.A. Enterotoxic Clostridia: Clostridium perfringens Enteric Diseases. Microbiol. Spectr. 2018, 6, 1–17. [Google Scholar] [CrossRef]
  5. Le Gratiet, T.; Poezevara, T.; Rouxel, S.; Houard, E.; Mazuet, C.; Chemaly, M.; Le Maréchal, C. Development of An Innovative and Quick Method for the Isolation of Clostridium botulinum Strains immInvolved in Avian Botulism Outbreaks. Toxins 2020, 12, 42. [Google Scholar] [CrossRef] [Green Version]
  6. Lonati, D.; Schicchi, A.; Crevani, M.; Buscaglia, E.; Scaravaggi, G.; Maida, F.; Cirronis, M.; Petrolini, V.M.; Locatelli, C.A. Foodborne Botulism: Clinical Diagnosis and Medical Treatment. Toxins 2020, 12, 509. [Google Scholar] [CrossRef]
  7. Kiu, R.; Caim, S.; Alexander, S.; Pachori, P.; Hall, L.J. Probing Genomic Aspects of the Multi-Host Pathogen Clostridium perfringens Reveals Significant Pangenome Diversity, and a Diverse Array of Virulence Factors. Front. Microbiol. 2017, 8, 2485. [Google Scholar] [CrossRef] [Green Version]
  8. Wong, Y.M.; Juan, J.C.; Ting, A.; Wu, T.Y.; Gan, H.M.; Austin, C.M. Draft Genome Sequence of Clostridium sp. Strain Ade.TY, a New Biohydrogen- and Biochemical-Producing Bacterium Isolated from Landfill Leachate Sludge. Genome Announc. 2014, 2, e00078-14. [Google Scholar] [CrossRef] [Green Version]
  9. Atarashi, K.; Tanoue, T.; Shima, T.; Imaoka, A.; Kuwahara, T.; Momose, Y.; Cheng, G.; Yamasaki, S.; Saito, T.; Ohba, Y.; et al. Induction of colonic regulatory T cells by indigenous Clostridium species. Science 2011, 331, 337–341. [Google Scholar] [CrossRef] [Green Version]
  10. Collins, M.D.; Lawson, P.A.; Willems, A.; Cordoba, J.J.; Fernandez-Garayzabal, J.; Garcia, P.; Cai, J.; Hippe, H.; Farrow, J.A.E. The phylogeny of the genus Clostridium: Proposal of five new genera and eleven new species combinations. Int. J. Syst. Evol. Microbiol. 1994, 44, 812–826. [Google Scholar] [CrossRef] [Green Version]
  11. Castor, C.; Mazuet, C.; Saint-Leger, M.; Vygen, S.; Coutureau, J.; Durand, M.; Popoff, M.R.; Da Silva, N.J. Cluster of two cases of botulism due to Clostridium baratii type F in France, November 2014. Eurosurveillance 2015, 20, 21031. [Google Scholar] [CrossRef]
  12. Paisley, J.W.; Lauer, B.A.; Arnon, S.S. A second case of infant botulism type F caused by Clostridium baratii. Pediatr. Infect. Dis. J. 1995, 14, 912–914. [Google Scholar]
  13. Mazuet, C.; Legeay, C.; Sautereau, J.; Bouchier, C.; Criscuolo, A.; Bouvet, P.; Trehard, H.; Da Silva, N.J.; Popoff, M. Characterization of Clostridium Baratii Type F Strains Responsible for an Outbreak of Botulism Linked to Beef Meat Consumption in France. PLoS Curr. 2017, 9. [Google Scholar] [CrossRef]
  14. Lima, C.O.G.X.; Rocha, V.; Ferreira, E.D.O.; Filho, J.S.; Serradas, L.R.; Silva, R.O.S.; Lobato, F.C.F.; Domingues, R.M.C.P. Clostridium baratii: A rare case of pneumonia associated with an Alzheimer patient in Rio de Janeiro, Brazil. JMM Case Rep. 2016, 3, e005041. [Google Scholar] [CrossRef]
  15. Lafuente, S.; Nolla, J.; Valdezate, S.; Tortajada, C.; Vargas-Leguas, H.; Parron, I.; Sáez-Nieto, J.A.; Portaña, S.; Carrasco, G.; Moguel, E.; et al. Two simultaneous botulism outbreaks in Barcelona: Clostridium baratii and Clostridium botulinum. Epidemiol. Infect. 2013, 141, 1993–1995. [Google Scholar] [CrossRef] [Green Version]
  16. Zhang, S.; Masuyer, G.; Zhang, J.; Shen, Y.; Lundin, D.; Henriksson, L.; Miyashita, S.I.; Martínez-Carranza, M.; Dong, M.; Stenmark, P. Identification and characterization of a novel botulinum neurotoxin. Nat. Commun. 2017, 8, 14130. [Google Scholar] [CrossRef]
  17. Masuyer, G.; Zhang, S.; Barkho, S.; Shen, Y.; Henriksson, L.; Košenina, S.; Dong, M.; Stenmark, P. Structural characterisation of the catalytic domain of botulinum neurotoxin X-High activity and unique substrate specificity. Sci. Rep. 2018, 8, 4518. [Google Scholar] [CrossRef] [Green Version]
  18. Dover, N.; Barash, J.R.; Burke, J.N.; Hill, K.K.; Detter, J.C.; Arnon, S.S. Arrangement of the Clostridium baratii F7 toxin gene cluster with identification of a σ factor that recognizes the botulinum toxin gene cluster promoters. PLoS ONE 2014, 9, e97983. [Google Scholar] [CrossRef] [Green Version]
  19. Harvey, S.M.; Sturgeon, J.; Dassey, D.E. Botulism due to Clostridium baratii type F toxin. J. Clin. Microbiol. 2002, 40, 2260–2262. [Google Scholar] [CrossRef] [Green Version]
  20. Smith, T.J.; Hill, K.K.; Xie, G.; Foley, B.T.; Williamson, C.H.; Foster, J.T.; Johnson, S.L.; Chertkov, O.; Teshima, H.; Gibbons, H.S.; et al. Genomic sequences of six botulinum neurotoxin-producing strains representing three clostridial species illustrate the mobility and diversity of botulinum neurotoxin genes. Infect. Genet. Evol. 2015, 30, 102–113. [Google Scholar] [CrossRef] [Green Version]
  21. Raphael, B.H.; Choudoir, M.J.; Lúquez, C.; Fernández, R.; Maslanka, S.E. Sequence diversity of genes encoding botulinum neurotoxin type F. Appl. Environ. Microbiol. 2010, 76, 4805–4812. [Google Scholar] [CrossRef] [Green Version]
  22. Tréhard, H.; Poujol, I.; Mazuet, C.; Blanc, Q.; Gillet, Y.; Rossignol, F.; Popoff, M.-R.; Da Silva, N.J. A cluster of three cases of botulism due to Clostridium baratii type F, France, August 2015. Eurosurveillance 2016, 21, 30117. [Google Scholar] [CrossRef]
  23. Huang, W.-C.; Lee, W.-S.; Chang, T.; Ou, T.-Y.; Lam, C. Emphysematous cholecystitis complicating liver abscess due to Clostridium baratii infection. J. Microbiol. Immunol. Infect. 2012, 45, 390–392. [Google Scholar] [CrossRef] [Green Version]
  24. Sikorra, S.; Skiba, M.; Dorner, M.B.; Weisemann, J.; Weil, M.; Valdezate, S.; Davletov, B.; Rummel, A.; Dorner, B.G.; Binz, T. Botulinum Neurotoxin F Subtypes Cleaving the VAMP-2 Q(58)K(59) Peptide Bond Exhibit Unique Catalytic Properties and Substrate Specificities. Toxins 2018, 10, 311. [Google Scholar] [CrossRef] [Green Version]
  25. Feng, Y.; Fan, X.; Zhu, L.; Yang, X.; Liu, Y.; Gao, S.; Jin, X.; Liu, D.; Ding, J.; Guo, Y.; et al. Phylogenetic and genomic analysis reveals high genomic openness and genetic diversity of Clostridium perfringens. Microb. Genom. 2020, 6, e000441. [Google Scholar] [CrossRef]
  26. Geirnaert, A.; Calatayud, M.; Grootaert, C.; Laukens, D.; Devriese, S.; Smagghe, G.; De Vos, M.; Boon, N.; Van de Wiele, T. Butyrate-producing bacteria supplemented in vitro to Crohn’s disease patient microbiota increased butyrate production and enhanced intestinal epithelial barrier integrity. Sci. Rep. 2017, 7, 11450. [Google Scholar] [CrossRef] [Green Version]
  27. Takahashi, K.; Nishida, A.; Fujimoto, T.; Fujii, M.; Shioya, M.; Imaeda, H.; Inatomi, O.; Bamba, S.; Andoh, A.; Sugimoto, M. Reduced Abundance of Butyrate-Producing Bacteria Species in the Fecal Microbial Community in Crohn’s Disease. Digestion 2016, 93, 59–65. [Google Scholar] [CrossRef]
  28. Thomson, P.; Santibañez, R.; Aguirre, C.; Galgani, J.E.; Garrido, D. Short-term impact of sucralose consumption on the metabolic response and gut microbiome of healthy adults. Br. J. Nutr. 2019, 122, 856–862. [Google Scholar] [CrossRef] [Green Version]
  29. Das, P.; Ji, B.; Kovatcheva-Datchary, P.; Bäckhed, F.; Nielsen, J. In vitro co-cultures of human gut bacterial species as predicted from co-occurrence network analysis. PLoS ONE 2018, 13, e0195161. [Google Scholar] [CrossRef]
  30. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [Green Version]
  31. Chevreux, B.; Wetter, T.; Suhai, S. Genome Sequence Assembly Using Trace Signals and Additional Sequence Information. Comput. Sci. Biol. Proc. Ger. Conf. Bioinform. 1999, 99, 45–56. [Google Scholar]
  32. Zimin, A.V.; Marçais, G.; Puiu, D.; Roberts, M.; Salzberg, S.L.; Yorke, J.A. The MaSuRCA genome assembler. Bioinformatics 2013, 29, 2669–2677. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Bankevich, A.; Nurk, S.; Antipov, D.; Gurevich, A.A.; Dvorkin, M.; Kulikov, A.S.; Lesin, V.M.; Nikolenko, S.I.; Pham, S.; Prjibelski, A.D.; et al. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 2012, 19, 455–477. [Google Scholar] [CrossRef] [Green Version]
  34. Hyatt, D.; Chen, G.-L.; Locascio, P.F.; Land, M.L.; Larimer, F.W.; Hauser, L.J. Prodigal: Prokaryotic gene recognition and translation initiation site identification. BMC Bioinform. 2010, 11, 119. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST+: Architecture and applications. BMC Bioinform. 2009, 10, 421. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Tatusov, R.L.; Koonin, E.V.; Lipman, D.J. A genomic perspective on protein families. Science 1997, 278, 631–637. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Lombard, V.; Golaconda Ramulu, H.; Drula, E.; Coutinho, P.M.; Henrissat, B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014, 42, D490–D495. [Google Scholar] [CrossRef] [Green Version]
  38. Ren, Q.; Chen, K.; Paulsen, I.T. TransportDB: A comprehensive database resource for cytoplasmic membrane transport systems and outer membrane channels. Nucleic Acids Res. 2007, 35, D274–D279. [Google Scholar] [CrossRef]
  39. Elbourne, L.D.H.; Tetu, S.G.; Hassan, K.A.; Paulsen, I.T. TransportDB 2.0: A database for exploring membrane transporters in sequenced genomes from all domains of life. Nucleic Acids Res. 2017, 45, D320–D324. [Google Scholar] [CrossRef]
  40. Sayers, S.; Li, L.; Ong, E.; Deng, S.; Fu, G.; Lin, Y.; Yang, B.; Zhang, S.; Fa, Z.; Zhao, B.; et al. Victors: A web-based knowledge base of virulence factors in human and animal pathogens. Nucleic Acids Res. 2019, 47, D693–D700. [Google Scholar] [CrossRef]
  41. Chen, L.; Yang, J.; Yu, J.; Yao, Z.; Sun, L.; Shen, Y.; Jin, Q. VFDB: A reference database for bacterial virulence factors. Nucleic Acids Res. 2005, 33, D325–D328. [Google Scholar] [CrossRef] [Green Version]
  42. Liu, B.; Zheng, D.; Jin, Q.; Chen, L.; Yang, J. VFDB 2019: A comparative pathogenomic platform with an interactive web interface. Nucleic Acids Res. 2019, 47, D687–D692. [Google Scholar] [CrossRef]
  43. Johnson, L.S.; Eddy, S.R.; Portugaly, E. Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinform. 2010, 11, 431. [Google Scholar] [CrossRef] [Green Version]
  44. Sonnhammer, E.L.; Eddy, S.R.; Durbin, R. Pfam: A comprehensive database of protein domain families based on seed alignments. Proteins 1997, 28, 405–420. [Google Scholar] [CrossRef]
  45. Huerta-Cepas, J.; Forslund, K.; Coelho, L.P.; Szklarczyk, D.; Jensen, L.J.; Von Mering, C.; Bork, P. Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper. Mol. Biol. Evol. 2017, 34, 2115–2122. [Google Scholar] [CrossRef] [Green Version]
  46. Zdobnov, E.M.; Apweiler, R. InterProScan—An integration platform for the signature-recognition methods in InterPro. Bioinformatics 2001, 17, 847–848. [Google Scholar] [CrossRef] [Green Version]
  47. O’Leary, N.A.; Wright, M.W.; Brister, J.R.; Ciufo, S.; Haddad, D.; McVeigh, R.; Rajput, B.; Robbertse, B.; Smith-White, B.; Ako-Adjei, D.; et al. Reference sequence (RefSeq) database at NCBI: Current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 2015, 44, D733–D745. [Google Scholar] [CrossRef] [Green Version]
  48. Glaeser, S.P.; Kämpfer, P. Multilocus sequence analysis (MLSA) in prokaryotic taxonomy. Syst. Appl. Microbiol. 2015, 38, 237–245. [Google Scholar] [CrossRef]
  49. Hug, L.A.; Baker, B.; Anantharaman, K.; Brown, C.T.; Probst, A.; Castelle, C.; Butterfield, C.N.; Hernsdorf, A.W.; Amano, Y.; Ise, K.; et al. A new view of the tree of life. Nat. Microbiol. 2016, 1, 16048. [Google Scholar] [CrossRef] [Green Version]
  50. Emms, D.M.; Kelly, S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 2019, 20, 238. [Google Scholar] [CrossRef] [Green Version]
  51. Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30, 1312–1313. [Google Scholar] [CrossRef]
  52. Rambaut, A. FigTree v1.3.1. Institute of Evolutionary Biology, University of Edinburgh, Edinburgh. 2010. Available online: http://tree.bio.ed.ac.uk/software/figtree/ (accessed on 20 September 2021).
  53. Sitto, F.; Battistuzzi, F.U. Estimating Pangenomes with Roary. Mol. Biol. Evol. 2020, 37, 933–939. [Google Scholar] [CrossRef]
  54. Buchfink, B.; Xie, C.; Huson, D.H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 2015, 12, 59–60. [Google Scholar] [CrossRef]
  55. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2018. [Google Scholar]
  56. Zhou, Y.; Liang, Y.; Lynch, K.H.; Dennis, J.J.; Wishart, D.S. PHAST: A fast phage search tool. Nucleic Acids Res. 2011, 39, W347–W352. [Google Scholar] [CrossRef]
  57. Arndt, D.; Grant, J.R.; Marcu, A.; Sajed, T.; Pon, A.; Liang, Y.; Wishart, D.S. PHASTER: A better, faster version of the PHAST phage search tool. Nucleic Acids Res. 2016, 44, W16–W21. [Google Scholar] [CrossRef] [Green Version]
  58. Alcock, B.P.; Raphenya, A.R.; Lau, T.T.Y.; Tsang, K.K.; Bouchard, M.; Edalatmand, A.; Huynh, W.; Nguyen, A.-L.V.; Cheng, A.A.; Liu, S.; et al. CARD 2020: Antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Res. 2020, 48, D517–D525. [Google Scholar] [CrossRef]
  59. Brooks, L.; Kaze, M.; Sistrom, M. A Curated, Comprehensive Database of Plasmid Sequences. Microbiol. Resour. Announc. 2019, 8, e01325-18. [Google Scholar] [CrossRef] [Green Version]
  60. Kelley, L.A.; Mezulis, S.; Yates, C.M.; Wass, M.N.; Sternberg, M.J.E. The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 2015, 10, 845–858. [Google Scholar] [CrossRef] [Green Version]
  61. Humphrey, W.; Dalke, A.; Schulten, K. VMD: Visual molecular dynamics. J. Mol. Graph. 1996, 14, 33–38. [Google Scholar] [CrossRef]
  62. Markowitz, V.M.; Chen, I.-M.A.; Palaniappan, K.; Chu, K.; Szeto, E.; Grechkin, Y.; Ratner, A.; Jacob, B.; Huang, J.; Williams, P.; et al. IMG: The Integrated Microbial Genomes database and comparative analysis system. Nucleic Acids Res. 2012, 40, D115–D122. [Google Scholar] [CrossRef] [Green Version]
  63. Chen, I.-M.A.; Chu, K.; Palaniappan, K.; Pillay, M.; Ratner, A.; Huang, J.; Huntemann, M.; Varghese, N.; White, J.R.; Seshadri, R.; et al. IMG/M v.5.0: An integrated data management and comparative analysis system for microbial genomes and microbiomes. Nucleic Acids Res. 2019, 47, D666–D677. [Google Scholar] [CrossRef] [PubMed]
  64. Chen, I.-M.A.; Chu, K.; Palaniappan, K.; Ratner, A.; Huang, J.; Huntemann, M.; Hajek, P.; Ritter, S.; Varghese, N.; Seshadri, R.; et al. The IMG/M data management and analysis system v.6.0: New tools and advanced capabilities. Nucleic Acids Res. 2021, 49, D751–D763. [Google Scholar] [CrossRef] [PubMed]
  65. Pruitt, K.D.; Tatusova, T.; Maglott, D.R. NCBI reference sequences (RefSeq): A curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2007, 35, D61–D65. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  66. Barrett, T.; Wilhite, S.E.; Ledoux, P.; Evangelista, C.; Kim, I.F.; Tomashevsky, M.; Marshall, K.A.; Phillippy, K.H.; Sherman, P.M.; Holko, M.; et al. NCBI GEO: Archive for functional genomics data sets—Update. Nucleic Acids Res. 2012, 41, D991–D995. [Google Scholar] [CrossRef] [Green Version]
  67. Chen, D.; Jin, D.; Huang, S.; Wu, J.; Xu, M.; Liu, T.; Dong, W.; Liu, X.; Wang, S.; Zhong, W.; et al. Clostridium butyricum, a butyrate-producing probiotic, inhibits intestinal tumor development through modulating Wnt signaling and gut microbiota. Cancer Lett. 2020, 469, 456–467. [Google Scholar] [CrossRef]
  68. Brüssow, H.; Desiere, F. Comparative phage genomics and the evolution of Siphoviridae: Insights from dairy phages. Mol. Microbiol. 2001, 39, 213–222. [Google Scholar] [CrossRef]
  69. Halpin, J.L.; Hill, K.; Johnson, S.L.; Bruce, D.C.; Shirey, T.B.; Dykes, J.K.; Lúquez, C. Finished Whole-Genome Sequences of Clostridium butyricum Toxin Subtype E4 and Clostridium baratii Toxin Subtype F7 Strains. Genome Announc. 2017, 5, e00375-17. [Google Scholar] [CrossRef] [Green Version]
  70. Dupuy, B.; Matamouros, S. Regulation of toxin and bacteriocin synthesis in Clostridium species by a new subgroup of RNA polymerase σ-factors. Res. Microbiol. 2006, 157, 201–205. [Google Scholar] [CrossRef]
  71. Brüggemann, H. Genomics of clostridial pathogens: Implication of extrachromosomal elements in pathogenicity. Curr. Opin. Microbiol. 2005, 8, 601–605. [Google Scholar] [CrossRef]
  72. Silvaggi, N.R.; Boldt, G.E.; Hixon, M.; Kennedy, J.P.; Tzipori, S.; Janda, K.D.; Allen, K.N. Structures of Clostridium botulinum Neurotoxin Serotype A Light Chain Complexed with Small-Molecule Inhibitors Highlight Active-Site Flexibility. Chem. Biol. 2007, 14, 533–542. [Google Scholar] [CrossRef] [Green Version]
  73. Agarwal, R.; Binz, T.; Swaminathan, S. Structural Analysis of Botulinum Neurotoxin Serotype F Light Chain:  Implications on Substrate Binding and Inhibitor Design. Biochemistry 2005, 44, 11758–11765. [Google Scholar] [CrossRef]
  74. Schmidt, J.J.; Stafford, R.G. Botulinum neurotoxin serotype F: Identification of substrate recognition requirements and development of inhibitors with low nanomolar affinity. Biochemistry 2005, 44, 4067–4073. [Google Scholar] [CrossRef]
  75. Schiavo, G.; Shone, C.C.; Rossetto, O.; Alexander, F.C.; Montecucco, C. Botulinum neurotoxin serotype F is a zinc endopeptidase specific for VAMP/synaptobrevin. J. Biol. Chem. 1993, 268, 11516–11519. [Google Scholar] [CrossRef]
  76. Agarwal, R.; Schmidt, J.J.; Stafford, R.G.; Swaminathan, S. Mode of VAMP substrate recognition and inhibition of Clostridium botulinum neurotoxin F. Nat. Struct. Mol. Biol. 2009, 16, 789–794. [Google Scholar] [CrossRef]
  77. Breidenbach, M.A.; Brunger, A.T. Substrate recognition strategy for botulinum neurotoxin serotype A. Nature 2004, 432, 925–929. [Google Scholar] [CrossRef]
  78. Hannett, G.E.; Schaffzin, J.K.; Davis, S.W.; Fage, M.P.; Schoonmaker-Bopp, D.; Dumas, N.B.; Musser, K.A.; Egan, C. Two cases of adult botulism caused by botulinum neurotoxin producing Clostridium baratii. Anaerobe 2014, 30, 178–180. [Google Scholar] [CrossRef]
  79. D’Auria, G.; Jiménez-Hernández, N.; Peris-Bondia, F.; Moya, A.; Latorre, A. Legionella pneumophila pangenome reveals strain-specific virulence factors. BMC Genom. 2010, 11, 181. [Google Scholar] [CrossRef] [Green Version]
  80. Rouli, L.; Merhej, V.; Fournier, P.-E.; Raoult, D. The bacterial pangenome as a new tool for analysing pathogenic bacteria. New Microbes New Infect. 2015, 7, 72–85. [Google Scholar] [CrossRef] [Green Version]
  81. Eppinger, M.; Worsham, P.L.; Nikolich, M.P.; Riley, D.R.; Sebastian, Y.; Mou, S.; Achtman, M.; Lindler, L.E.; Ravel, J. Genome sequence of the deep-rooted Yersinia pestis strain Angola reveals new insights into the evolution and pangenome of the plague bacterium. J. Bacteriol. 2010, 192, 1685–1699. [Google Scholar] [CrossRef] [Green Version]
  82. Li, X.-Z.; Nikaido, H. Efflux-mediated drug resistance in bacteria. Drugs 2004, 64, 159–204. [Google Scholar] [CrossRef]
  83. Andersen, J.L.; He, G.-X.; Kakarla, P.; KC, R.; Kumar, S.; Lakra, W.S.; Mukherjee, M.M.; Ranaweera, I.; Shrestha, U.; Tran, T.; et al. Multidrug efflux pumps from Enterobacteriaceae, Vibrio cholerae and Staphylococcus aureus bacterial food pathogens. Int. J. Environ. Res. Public Health 2015, 12, 1487–1547. [Google Scholar] [CrossRef] [Green Version]
  84. Köster, W. ABC transporter-mediated uptake of iron, siderophores, heme and vitamin B12. Res. Microbiol. 2001, 152, 291–301. [Google Scholar] [CrossRef]
  85. Meldrum, E.; Parker, P.J.; Carozzi, A. The PtdIns-PLC superfamily and signal transduction. Biochim. Biophys. Acta 1991, 1092, 49–71. [Google Scholar] [CrossRef]
  86. Titball, R.W. Bacterial phospholipases C. Microbiol. Rev. 1993, 57, 347–366. [Google Scholar] [CrossRef]
  87. Monturiol-Gross, L.; Flores-Díaz, M.; Pineda-Padilla, M.J.; Castro-Castro, A.C.; Alape-Giron, A. Clostridium perfringens phospholipase C induced ROS production and cytotoxicity require PKC, MEK1 and NFκB activation. PLoS ONE 2014, 9, e86475. [Google Scholar] [CrossRef] [Green Version]
  88. Flores-Díaz, M.; Alape-Girón, A. Role of Clostridium perfringens phospholipase C in the pathogenesis of gas gangrene. Toxicon 2003, 42, 979–986. [Google Scholar] [CrossRef]
  89. Kalb, S.R.; Baudys, J.; Egan, C.; Smith, T.J.; Smith, L.A.; Pirkle, J.L.; Barr, J.R. Different substrate recognition requirements for cleavage of synaptobrevin-2 by Clostridium baratii and Clostridium botulinum type F neurotoxins. Appl. Environ. Microbiol. 2011, 77, 1301–1308. [Google Scholar] [CrossRef] [Green Version]
  90. Wang, X.; Maegawa, T.; Karasawa, T.; Kozaki, S.; Tsukamoto, K.; Gyobu, Y.; Yamakawa, K.; Oguma, K.; Sakaguchi, Y.; Nakamura, S. Genetic analysis of type E botulinum toxin-producing Clostridium butyricum strains. Appl. Environ. Microbiol. 2000, 66, 4992–4997. [Google Scholar] [CrossRef] [Green Version]
  91. Schade, R.P.; Van Rijn, M.; Timmers, H.J.; Dofferhoff, A.S.; Klaassen, C.H.; Meis, J.F. Clostridium cadaveris bacteraemia: Two cases and review. Scand. J. Infect. Dis. 2006, 38, 59–62. [Google Scholar] [CrossRef] [PubMed]
  92. Wort, A.J.; Ozere, R.L. Characteristics of a strain of Clostridium carnis causing septicaemia in a young infant. J. Clin. Pathol. 1976, 29, 1011–1013. [Google Scholar] [CrossRef] [Green Version]
  93. Scanlan, D.R.; Smith, M.A.; Isenberg, H.D.; Engrassia, S.; Hilton, E. Clostridium bifermentans bacteremia with metastatic osteomyelitis. J. Clin. Microbiol. 1994, 32, 2867–2868. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  94. Edagiz, S.; Lagace-Wiens, P.; Embil, J.; Karlowsky, J.; Walkty, A. Empyema caused by Clostridium bifermentans: A case report. Can. J. Infect. Dis. Med. Microbiol. 2015, 26, 105–107. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  95. Agergaard, C.N.; Hoegh, S.V.; Holt, H.M.; Justesen, U.S. Two Serious Cases of Infection with Clostridium celatum after 40 Years in Hiding? J. Clin. Microbiol. 2016, 54, 236–238. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  96. Baldassi, L. Clostridial toxins: Potent poisons, potent medicines. J. Venom. Anim. Toxins Incl. Trop. Dis. 2005, 11, 391–411. [Google Scholar] [CrossRef] [Green Version]
  97. Ahmed, U.K.B.; Shadid, T.M.; Larabee, J.L.; Ballard, J.D. Combined and Distinct Roles of Agr Proteins in Clostridioides difficile 630 Sporulation, Motility, and Toxin Production. mBio 2020, 11, e03190-20. [Google Scholar] [CrossRef]
  98. Muñoz, M.; Restrepo-Montoya, D.; Kumar, N.; Iraola, G.; Herrera, G.; Ríos-Chaparro, D.I.; Díaz-Arévalo, D.; Patarroyo, M.A.; Lawley, T.D.; Ramírez, J.D. Comparative genomics identifies potential virulence factors in Clostridium tertium and C. paraputrificum. Virulence 2019, 10, 657–676. [Google Scholar] [CrossRef] [Green Version]
  99. Bayne, J.E.; Edmondson, M.A. Diseases of the gastrointestinal system. Sheep Goat Cervid Med. 2021, 63–96. [Google Scholar] [CrossRef]
  100. Qa’Dan, M.; Spyres, L.M.; Ballard, J.D. pH-enhanced cytopathic effects of Clostridium sordellii lethal toxin. Infect. Immun. 2001, 69, 5487–5493. [Google Scholar] [CrossRef] [Green Version]
  101. Abusnina, W.; Shehata, M.; Karem, E.; Koc, Z.; Khalil, E. Clostridium sporogenes bacteremia in an immunocompetent patient. IDCases 2019, 15, e00481. [Google Scholar] [CrossRef]
Figure 1. MLSA analysis of pathogenic Clostridium strains. Maximum Likelihood phylogenetic tree of pathogenic Clostridium strain based on 15 ribosomal protein data sets.
Figure 1. MLSA analysis of pathogenic Clostridium strains. Maximum Likelihood phylogenetic tree of pathogenic Clostridium strain based on 15 ribosomal protein data sets.
Microorganisms 10 00213 g001
Figure 2. Pangenome of Clostridium baratii strains. (A) Core and accessory genes statistics. (B) Frequency bar graphs of the number of identical genes against the number of genomes. (C) Number of new genes and unique genes in the pangenome as genomes are included. (D) Number of conserved genes and total genes as more genomes are included in the pangenome calculation.
Figure 2. Pangenome of Clostridium baratii strains. (A) Core and accessory genes statistics. (B) Frequency bar graphs of the number of identical genes against the number of genomes. (C) Number of new genes and unique genes in the pangenome as genomes are included. (D) Number of conserved genes and total genes as more genomes are included in the pangenome calculation.
Microorganisms 10 00213 g002
Figure 3. Percent of total clusters of orthologous groups (COGs) in pangenome. Percentage of total Cluster of Orthologous group annotated in the Clostridium baratii core genome and accessory genes.
Figure 3. Percent of total clusters of orthologous groups (COGs) in pangenome. Percentage of total Cluster of Orthologous group annotated in the Clostridium baratii core genome and accessory genes.
Microorganisms 10 00213 g003
Figure 4. Count of Glycosyl Transferases (GH) categories identified in strains. GHs in Clostridium baratii genomes. The x-axis represents the count of the Glycosyl Transferases category, and the y-axis represents the strains.
Figure 4. Count of Glycosyl Transferases (GH) categories identified in strains. GHs in Clostridium baratii genomes. The x-axis represents the count of the Glycosyl Transferases category, and the y-axis represents the strains.
Microorganisms 10 00213 g004
Figure 5. Toxins and Antimicrobial resistance (AMR) in pathogenic Clostridium strains. Presence (in blue) or absence (in white) of toxin and AMR genes in pathogen Clostridium, C. baratii strains. The x-axis represents different toxins, and the y-axis represents the strains. The color on heatmaps represents the toxin group by category.
Figure 5. Toxins and Antimicrobial resistance (AMR) in pathogenic Clostridium strains. Presence (in blue) or absence (in white) of toxin and AMR genes in pathogen Clostridium, C. baratii strains. The x-axis represents different toxins, and the y-axis represents the strains. The color on heatmaps represents the toxin group by category.
Microorganisms 10 00213 g005
Figure 6. Phenotypical analysis of C. baratii C3. Shown are Gram-staining, blood agar analysis, disk diffusion, and API-20A results.
Figure 6. Phenotypical analysis of C. baratii C3. Shown are Gram-staining, blood agar analysis, disk diffusion, and API-20A results.
Microorganisms 10 00213 g006
Figure 7. BoNT toxin cluster. Presence (in blue) or absence (in white) of BoNT toxin cluster genes in C. baratii and C. botulinum strains. The x-axis represents BoNT cluster genes, and the y-axis represents the strains.
Figure 7. BoNT toxin cluster. Presence (in blue) or absence (in white) of BoNT toxin cluster genes in C. baratii and C. botulinum strains. The x-axis represents BoNT cluster genes, and the y-axis represents the strains.
Microorganisms 10 00213 g007
Figure 8. BoNT gene alignment. Alignment of BoNT gene in C. baratii and C. botulinum strains. C. baratii strains do not have 7 amino acids present in the BoNT gene of C. botulinum in position 204, and the active site of the type botulinum toxin is the HEXXH sequence, located in positions 211–215.
Figure 8. BoNT gene alignment. Alignment of BoNT gene in C. baratii and C. botulinum strains. C. baratii strains do not have 7 amino acids present in the BoNT gene of C. botulinum in position 204, and the active site of the type botulinum toxin is the HEXXH sequence, located in positions 211–215.
Microorganisms 10 00213 g008
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Silva-Andrade, C.; Martin, A.J.; Garrido, D. Comparative Genomics of Clostridium baratii Reveals Strain-Level Diversity in Toxin Abundance. Microorganisms 2022, 10, 213. https://doi.org/10.3390/microorganisms10020213

AMA Style

Silva-Andrade C, Martin AJ, Garrido D. Comparative Genomics of Clostridium baratii Reveals Strain-Level Diversity in Toxin Abundance. Microorganisms. 2022; 10(2):213. https://doi.org/10.3390/microorganisms10020213

Chicago/Turabian Style

Silva-Andrade, Claudia, Alberto J. Martin, and Daniel Garrido. 2022. "Comparative Genomics of Clostridium baratii Reveals Strain-Level Diversity in Toxin Abundance" Microorganisms 10, no. 2: 213. https://doi.org/10.3390/microorganisms10020213

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop