Next Article in Journal
Role of PTEN, PI3K, and mTOR in Triple-Negative Breast Cancer
Next Article in Special Issue
An Interplay between Viruses and Bacteria Associated with the White Sea Sponges Revealed by Metagenomics
Previous Article in Journal
Cardiovascular Surgical Emergencies in France, before, during and after the First Lockdown for COVID-19 in 2020: A Comparative Nationwide Retrospective Cohort Study
Previous Article in Special Issue
Diversity and Metabolic Potential of the Terrestrial Mud Volcano Microbial Community with a High Abundance of Archaea Mediating the Anaerobic Oxidation of Methane
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Lacticaseibacillus paracasei: Occurrence in the Human Gut Microbiota and K-Mer-Based Assessment of Intraspecies Diversity

1
Laboratory of Functional Genomics and Cellular Stress, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Institute of Cell Biophysics of the Russian Academy of Sciences, 142290 Pushchino, Russia
2
Centre for Strategic Planning of Federal Medical-Biological Agency of Russia, 119121 Moscow, Russia
3
Laboratory of Anaerobic Microorganisms, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Institute of Biochemistry and Physiology of Microorganisms of the Russian Academy of Sciences, 142290 Pushchino, Russia
4
Institute of Mathematical Problems of Biology RAS—The Branch of Keldysh Institute of Applied Mathematics of the Russian Academy of Sciences, 142290 Pushchino, Russia
*
Authors to whom correspondence should be addressed.
Life 2021, 11(11), 1246; https://doi.org/10.3390/life11111246
Submission received: 20 October 2021 / Revised: 11 November 2021 / Accepted: 13 November 2021 / Published: 17 November 2021
(This article belongs to the Special Issue Metagenomics: New Trends and Solutions)

Abstract

:
Alignment-free approaches employing short k-mers as barcodes for individual genomes have created a new strategy for taxonomic analysis and paved a way for high-resolution phylogeny. Here, we introduce this strategy for the Lacticaseibacillus paracasei species as a taxon requiring barcoding support for precise systematics. Using this approach for phylotyping of L. paracasei VKM B-1144 at the genus level, we identified four L. paracasei phylogroups and found that L. casei 12A belongs to one of them, rather than to the L. casei clade. Therefore, we propose to change the specification of this strain. At the genus level we found only one relative of L. paracasei VKM B-1144 among 221 genomes, complete or available in contigs, and showed that the coding potential of the genome of this “rare” strain allows its consideration as a potential probiotic component. Four sets of published metagenomes were used to assess the dependence of L. paracasei presence in the human gut microbiome on chronic diseases, dietary changes and antibiotic treatment. Only antibiotics significantly affected their presence, and strain-specific barcoding allowed the identification of the main scenarios of the adaptive response. Thus, suggesting bacteria of this species for compensatory therapy, we also propose strain-specific barcoding for selecting optimal strains for target microbiomes.

1. Introduction

The Lactobacillus casei group of bacteria, consisting of the closely related L. casei, L. paracasei, L. rhamnosus and L. zeae are among the most studied lactobacilli due to their commercial, industrial, and applied health potential [1]. They are often used to ferment dairy products conferring improved flavor and texture. It has also been found that bacteria of this group produce a variety of bioactive metabolites that can benefit the host when consumed [2]. Lactic acid bacteria L. paracasei are widely distributed in nature [3] and are found on plant materials, from where they enter the human microbiome, usually being localized in the oral cavity and gastrointestinal tract.
However, the taxonomic history of lactobacilli has been confusing and complex. Overall, by March 2021, the genus Lactobacillus included 261 species that are extremely diverse at the phenotypic, ecological and genotypic levels and perhaps more than other taxons require new strategies for phylotyping. In a study by Zheng et al. [4], the phylogeny of Lactobacillaceae and Leuconostocaceae was investigated based on complete genome sequences. The authors assessed a large set of classification parameters including such criteria as pairwise mean amino acid identity, the presence of clade-specific genes and the phylogeny of the conserved genome core. According to this classification, 13 species were combined into a new genus Lacticaseibacillus, which includes Lacticaseibacillus paracasei. In this paper, we will follow this classification introducing Lacticaseibacillus paracasei subsp. paracasei VKM B-1144 as a newly characterized isolate and evaluate its relatives.
Our previous study [5] showed that k-mer-based phylogenetic analysis, operating with thousands of marker sequences without any limitation from the level of genome functional annotation, could be efficiently used for intraspecific systematics. Such a taxonomy is of obvious importance for species containing pathogenic strains/phylogroups and, vice versa, for potential probiotics. Up to now, thorough intraspecific phylotyping has been done mostly for Escherichia coli [6,7,8,9,10,11,12,13,14,15] and several other species from the human intestinal microbiome [16,17]. Here we employed this novel technique for bacteria with probiotic potential and showed their distribution in human gut microbiota depending on the health status, antibiotic/probiotic treatment or dietary shift.

2. Materials and Methods

2.1. Bacterial Growth

Bacteria were grown on an anaerobic medium of the following composition (g/L): casein peptone, 10.0; meat extract, 10.0; yeast extract, 5.0; glucose, 20.0; Tween 80, 1.0; K2HPO4, 2.0; sodium acetate, 5.0; ammonium citrate, 2.0; MgSO4 × 7 H2O, 0.2; MnSO4 × H2O, 0.05; the pH of the medium was 6.2–6.5. For solidification of the medium, Bacto agar 1.5% (w/v) was added. The medium was prepared using the Hungate anaerobic technique [18]. Cultivation was carried out in 15 ml Hungate tubes or on Petri dishes placed in anaerobic jars (Oxoid Limited, Basingstoke, UK). Bacterial cultures were incubated at 37 °C for 24–48 h until the appearance of individual colonies or the onset of the early stationary growth phase.

2.2. DNA Preparation

Genomic DNA was isolated from bacteria cultured in liquid medium at the stationary growth phase. Since L. paracasei are known to possess a thickened cell wall, we used a specific protocol for its effective disruption [19]. Following centrifugation, the cell pellet was washed with 30 mM NaCl and 2 mM EDTA, pH 8.0. The cells were subsequently incubated in buffer (pH 8.0) containing 2 mM EDTA, 20 mM Tris-HCl, 20 mg/mL lysozyme, and 1% Triton X-100 for 2 h at 37 °C. For more efficient lysis, 10 mg/mL of RNase A and 20 mg/mL of proteinase K were added and incubated for one more hour at 55 °C. Finally, a Genomic DNA Purification Kit (Promega, Madison, WI, USA) was used for DNA clean-up according to the manufacturer’s protocol. The concentration of DNA was measured using a Nanodrop ND-1000 spectrophotometer (Thermo Scientific, Wilmington, DE, USA).

2.3. Genome Sequencing

Genome sequencing of the studied isolate was carried out using the Ion Torrent PGM platform (Thermo Fisher Scientific, Waltham, MA, USA). A library for sequencing was prepared using the NEBNext Fast DNA Fragmentation and Library Prep Set for Ion Torrent (NEB, Ipswich, MA, USA), according to the manufacturer’s instructions. The concentration of the resulting library was evaluated on a Qubit 3 fluorometer (Thermo Fisher Scientific, Waltham, MA, USA) using Qubit dsDNA HS Assay Kit reagents (Thermo Fisher Scientific, Waltham, MA, USA). The library was amplified using an Ion One Touch 2 sample preparation system and Ion PGM Hi-Q View OT2 Kit (Thermo Fisher Scientific, Waltham, MA, USA). Sequencing was performed with an Ion 316 Chip Kit v2 BC (Thermo Fisher Scientific, Waltham, MA, USA). Fastq files were filtered on the Galaxy server [20] using the “Filter by Quality” option (parameters: Q20 and coverage 90%) and ignoring all reads with degenerated nucleotides. The genome was resequenced on a MinION device (Oxford Nanopore Technologies, Oxford, UK) using a SQK-LSK109 kit and MinKNOW software v. 21.02.1, following the Genomic DNA by Ligation protocol.

2.4. Genome Assembly and Annotation

Combined sets of reads obtained from the Ion Torrent PGM and Oxford Nanopore MinION were assembled de novo using facilities of the All Bacterial Bioinformatics Resource Center PATRIC [21]. The resulting genome was annotated using the RAST server [22]. For the analysis of glycoside-active enzymes, we used the data from the Carbohydrate-Active Enzymes database [23]. CAZY proteins were annotated using the dbCAN resource [24]. The analysis of transport systems was carried out using the database TransportDB 2.0 [25].

2.5. Barcoding

In order to barcode genomes by marker k-mers (k = 18), we used a local copy of the NCBI GenBank database as of February 19, 2020, containing 39,822 nucleotide sequences of fully assembled bacterial genomes and plasmids, including 75 genomes of Lacticaseibacillus genus together with strain VKM B-1144. When searching for genomes closest to VKM B-1144 we also analyzed L. paracasei sequences assembled in contigs or scaffolds. For phylogenetic analysis, a set of unique 18-mers was obtained for each genome of the L. paracasei species, residing in its sequence and/or in the sequences of other strains of the genus of Lacticaseibacillus (or species L. paracasei), but absent in the genomes of all other bacteria and plasmids from the database. For taxonomic analysis of the human intestinal microflora, all 18-mers present in the genome of Homo sapiens were also removed from the obtained sets. All the procedures were carried out in the virtual machine with a 64-bit version of UniSeq software on a high-performance server (configuration: 2 Xeon Gold 5218, 64 Gb RAM) [5,26].

2.6. Phylogenetic Analysis

After sets of marker 18-mers present only in L. paracasei strains (species level) or in genomes of L. paracasei and bacteria of closely related species (L. rhamnosus, L. casei and L. manihotivorans) were obtained, their pairwise comparison was done and the Sørensen similarity distances were calculated [27]. Phylogenetic trees were constructed using the neighbor-joining [28] and minimum evolution [29] methods in MEGA X [30].

2.7. Phylogroup-Specific Taxonomic Analysis of Human Metagenomes

Metagenomes were obtained from NCBI Short Read Archive (SRA) and included four sets of metadata [31,32,33,34], listed in the Supplementary Table S1. Each dataset contained whole genome shotgun sequence reads of metagenomes obtained from fecal samples. Two sets allowed the possibility of comparing the presence of L. paracasei strains in the microbiomes of healthy individuals and patients with chronic disease (children with autism spectrum disorders (PRJEB23052) [31] and patients with Crohn’s disease (PRJNA290380) [32]). Two other sets provided the possibility for pairwise comparisons of metagenomes taken from a particular individual before and after treatment. In the first one, fecal samples were collected from overweight/obese persons before and after adherence to Mediterranean dietary restrictions (PRJEB33500) [33], and persons before and after antibiotic treatment either subjected to subsequent probiotic therapy or not (PRJEB28097) [34]. Fastq files obtained from the European Nucleotide Archive (ENA) and NCBI SRA were trimmed for the adaptor sequence and quality controlled with Trim Galore v.0.6.6 [35]. The read length cutoff was set for 20 b and low-quality reads with a Phred less than 20 were excluded.
The metagenomes were transcoded the same way as described in [5], i.e., (A), (C), (G), and (T) were substituted with the numerals (0), (1), (2) and (3), respectively, while line breaks were replaced with the numeral (4). As a result, metagenomes were digitized in one numeric string, which allowed use of the UniTestExpress software [5] to operate them in the same way as the genomes. Comparing genomes and metagenomes, there is the possibility to combine k-mers unique for different genomes into a cumulative set and remove common k-mers from the two metadata to be compared. In this study we used species-specific sets of 18-mers, obtained in each genome of L. paracasei but absent in bacterial genomes of other species or genera. We also used strain-specific 18-mers absent in all the genomes of the local copy of the NCBI GenBank database except for the sequence of the target genome. The option “Clear” was used to remove cross-linking k-mers from the target sets. As in our previous study [5], the number of sequence reads containing marker 18-mers, rather than the number of unique 18-mers per se, was used for the statistical analysis. Since reads are rarely identical, this helps to avoid multiple counting of unique k-mers that make up long tracks, while retaining the possibility of quantitative assessment. Since in some cases the collected data did not show a normal distribution, the non-parametric Mann–Whitney U test was used in all cases to estimate the statistical significance of the differences between two groups.

3. Results

3.1. Characteristics of the L. paracasei VKM B-1144 Strain

Lacticaseibacillus paracasei subsp. paracasei strain VKM B-1144 (other identifiers: ATCC 25303, CCM 1752, NCIMB 700152, NIRD H831 and C6) [36,37,38], originally isolated from the human oral cavity [39,40], was obtained from the All-Russian Collection of Microorganisms (IBPM RAS, Pushchino, Russia). The cells of the strain are Gram-positive rods 2–4 µm in length and 0.8–1.0 µm in width. They are seen as single cells during exponential growth in a liquid medium, but form long chains in the stationary phase. On solid medium, the bacteria form whitish colonies with a diameter of 2–3 mm.

3.2. Genome Sequencing and Assembly

The genome of L. paracasei VKM B-1144 was primarily sequenced using the Ion Torrent PGM platform. A total of 2,621,451 sequence reads with an average length of 245 bases were obtained, which gave 2,041,237 sequences after filtration by quality on the Galaxy server [20]. The 16S rRNA gene sequence of the bacterium was determined with primers 27f (5′-AGAGTTTGATCCTGGCTCAG-3′) and 1492r (5′-TACCTTGTTACGACTT-3′) and phylogenetic analysis allowed identification of L. paracasei strain EG9, having the genome of 2,927,257 b.p. in length, as the closest relative. In view of this, semiconductor sequencing provided an approximately 174-fold genome coverage. In order to improve the draft genome assembly, the strain under study was resequenced using an Oxford Nanopore MinION device, which yielded 90,066 reads, with 15,381 reads filtered out for poor quality by the software. Thus, the resulting nanopore set included 73,806 reads, which varied in length in the range from 112 to 78,113 b.p., achieving a ~105-fold genome coverage. Four scaffolds were obtained, with the longest sized 2,921,318 b.p. Comparison of the shotgun whole genome with the genomes of L. paracasei strains JCM 8130 and EG9, showed the average nucleotide identity (ANI) to be 98.5% and 98.6%, respectively, which confirmed the results of 16S rRNA phylotyping. The genome under study was compared with the two abovementioned genomes to scaffold smaller contigs manually based on the positions of gene sequences, thus providing the second scaffold with a length of 24,996 b.p.

3.3. Genome Annotation and Analysis

Genome annotation was performed using the RAST server [22] with the identification of 3043 sequences of protein coding genes. Figure 1 demonstrates their distribution over the main functional categories.
As expected, a significant part of the genes found in the genome are associated with carbohydrate metabolism (189 genes). Their analysis in the dbCAN2 database [24], providing automated annotation of carbohydrate-active enzymes, showed that the genome of the VKM B-1144 strain encodes a wide range of glycoside-active enzymes, such as glycoside transferases, glycoside hydrolases, as well as polysaccharide lyases and carboxyesterases (GH1, GH2, GH3, GH13, GH20, GH25, GH29, GH31, GH32, GH35, GH36, GH65, GH73, GH88, GH136, GH170). For example, there are genes encoding proteins homologous to glucan 1,6-α-glucosidase, α-phosphotrehalase, oligo-1,6-glucosidase; neopullulanase and 1,4-α-glucan branching enzyme, structurally belonging to the GH13 family.
All of these proteins act on the α-1,4- and α-1,6-glycosidic bonds present in starch, glycogen and related oligosaccharides such as maltose. In addition, glycogen synthase (GT5) and glycogen phosphorylase (GT35) were found among the transport proteins. This indicates that the VKM B-1144 strain can accumulate cellulose on the cell wall surface in the form of an extracellular matrix, which facilitates cell adhesion and biofilm formation protecting cells from extracellular stress factors [41,42,43]. Moreover, lactobacilli often synthesize glycogen as a storage substance and use it to interact with a wide variety of environments [44].
Four large categories in Figure 1, quite expectedly, include genes encoding proteins involved in nucleic acid and protein biosynthesis, while the next largest set of protein-coding genes (56) corresponds to the biogenesis of cofactors, vitamins, prosthetic groups and pigments. In this connection, it is important that the value of probiotic bacteria in the human intestine is largely determined by their ability to synthesize vitamins and cofactors de novo and supply them to the host. The human gut microbiota is capable of synthesizing vitamin K and most of the group B vitamins, such as pyridoxine, folate, riboflavin, cobalamin, and thiamine. Among them, thiamine (vitamin B1) as thiamine pyrophosphate (TPP) plays a critical role in host energy metabolism, since it acts as a cofactor in major metabolic pathways, such as the pentose phosphate pathway, glycolysis and the Krebs cycle. The pentose phosphate pathway is required for the synthesis of steroids, nucleic acids, fatty acids, and the biosynthesis of aromatic amino acids. These products are used as precursors for the biogenesis of various neurotransmitters and other bioactive compounds vital for brain function [45]. Most of them can be produced by the VKM B-1144 strain. Although, the genome of these bacteria does not have a complete set of genes for thiamine biosynthesis, there are genes for the thiamine salvage pathway.
Extracellular proteins with the LPXTG motif deserve particular attention. They can be attached to the cell wall by sortases and play an important role in the mechanisms of probiotic action [46], facilitating cell adhesion and habitat colonization. According to the data of Ghosh et al. [47], the number of proteins with the LPXTG motif differs from 0 to 12 among the studied L. paracasei strains. In the genome of VKM B-1144 we found at least three genes for such proteins: outer membrane protein Lmo 0159, peptidoglycan protein Lmo2821 and von Willebrand factor type A (Lmo 2576), but their number can increase up to nine if protein-coding potency is confirmed for six other genomic loci. The presence of three genes encoding sortase A indicates the ability of VKM B-1144 bacteria to successfully overcome the mucous barrier and colonize the intestinal tract, thus supporting their value as potential probiotics.

3.4. Intrageneric and Intraspecific Phylotyping of L. paracasei Genomes

Using the standard nomenclature [48], adopted after the updated classification of bacteria from the Lacticaseibacillus genus, 76 genomes were taken from the NCBI database. This set included 43 L. paracasei chromosomes and 33 chromosomes of closely related species, of which 26 belonged to L. rhamnosus strains, 6 to L. casei, and 1 to the L. manihotivorans LM010 strain. It turned out that there are three assembly versions (AP011548.1, CP031290.1 and FM179322.1) for the L. rhamnosus GG (or ATCC 53103) genome in the database. Therefore, to avoid duplication, only one genome was taken for phylogenetic analysis (accession number CP031290.1). The remaining 74 genomes of the Lacticaseibacillus strains, along with the VKM B-1144 assembled chromosome, are listed in Table 1 and are characterized in more detail in Supplementary Table S2.
For phylotyping at the intrageneric level, the sets of unique genus-specific 18-mers were obtained for all 75 genomes using the UniSeq software [5]. For this, all 18-mers that belong to chromosomes other than those of the genus Lacticaseibacillus according to the modern classification, as well as to all plasmids, were removed from the database. These 75 individual sets (barcodes) belonging to the genomes of the same genus overlap to various extents in pairwise comparison, which allows us to use them for phylogenetic analysis. Using the numbers of common 18-mers for each pair of 75 genomes as a measure of their similarity, we calculated a pairwise distance matrix [27], as described in Materials and Methods. Phylogenetic trees were constructed using the neighbor-joining and minimum evolution methods. Since they were independent of the method, both for representatives of the genus Lacticaseibacillus and for the species L. paracasei, only the tree obtained by neighbor-joining [28] in MEGA X [30] is shown in Figure 2.
For L. paracasei bacteria this tree has four large clades containing 7–15 species. This intraspecies heterogeneity is in line with the data of Smokvina et al. [49], who characterized the diversity of L. paracasei based on multilocus alignment of 183 protein-coding sequences of the core genome compiled from 34 laboratory strains. The set of genomes analyzed in that paper included only three genomes available in the NCBI Database (Zhang, BL23 and ATCC 334). The same as in our case (Figure 2), they belonged to different clades. Thus, although the search for differences in metabolic traits and evolutionary pathways for different clades requires further analysis, it became clear that k-mer-based intraspecies phylogeny works reproducibly even for a rather small set of compared genomes.
In addition to four large clades, four single branches are clearly separated within the L. paracasei cluster. They are: L. paracasei subsp. tolerans MGB0734, L. paracasei NFFJ04, L. paracasei subsp. paracasei VKM B-1144 and L. paracasei NSMJ15, of which the latter strain outgrouped in relation to all the rest of the strains of L. paracasei. We also found that the chromosomes of strains BL23 and W56 (clade III) and 12A (clade II) (indicated by circles in Figure 2), that were taken from the NCBI GenBank as L. casei, cluster together with L. paracasei.
It turned out that the chromosomes of these three strains had a lower GC content (46.3%–46.4%), which is typical for L. paracasei, and not for L. casei (47.9%–48%). Thus, it is reasonable that in November 2020, the specification of BL23 and W56 in the NCBI GenBank was changed to L. paracasei. However, strain 12A is still referred to as L. casei. According to our data, its classification should also be reconsidered.
Since the strain CECT 9104 formerly annotated as L. casei is currently annotated as L. zeae, only two strains with complete genomes (ATCC 393 and LC5) remained in the L. casei clade. The genomes of L. rhamnosus strains, on the contrary, form a single large cluster in full accordance with the annotation, and the genome of L. manihotivorans LM010, as expected, occupies a branch independent of other species (Figure 2).
The two next steps were implemented to estimate the stability of the tree obtained for L. paracasei genomes. At the first step, we checked the diversity of L. paracasei at the species level. For this purpose, 18-mers present only in the genomes of L. paracasei strains were obtained (Figure 3). The topology of the tree remained virtually unchanged, except that the L. paracasei NSMJ15 strain ceased to be an outgroup in relation to the rest of the clades.
The topology and composition of all four large clades, as well as the presence of all singletons, including L. paracasei VKM B-1144, appeared to be completely reproduced. Thus, at the second step we searched for microorganisms related to our strain among all the genomes in the NCBI GenBank, which are annotated as L. paracasei, although deposited as sets of contigs or scaffolds rather than complete genomes. As of 11 October 2021, there were 175 of them (Supplementary Table S3), and k-mer-based phylogeny allows us to use their species-specific 18-mers together with 47 complete genomes similarly to Figure 3.
The resulting tree constructed for 222 strains is shown in Supplementary Figure S1. The groups I, II and IV are well separated and increased. The group III, which previously included 12 strains, subdivided into three subgroups, of which the first one captured the MGB0734 strain that previously formed an individual branch between groups I and II (marked in gray in Figure 2 and Figure 3). A new phylogroup was identified between groups I and II, which includes the NFFJ04 genome (indicated by a gray circle in Supplementary Figure S1) and seven more genomes assembled in contigs. The NSMJ15 strain shows some homology only with strain SRCM103410, while L. paracasei VKM B-1144 demonstrated a higher homology with only L. paracasei DSM 20207 and these two pairs form clades with the earliest divergence from the main phylogroups. Thus, the next part of this study was aimed at understanding: how evenly different phylogroups of L. paracasei are distributed in the human microbiota, and how much their presence depends on the physiological state of an individual.

3.5. Phylogroup-Dependent Profiling of L. paracasei Presence in Human Intestinal Microbiomes

Four sets of freely available metadata with a total of 117 human gut microflora shotgun metagenomes were used for this analysis (listed in Supplementary Table S1). Two sets allowed a comparison of the presence of L. paracasei strains in the microbiomes of healthy individuals and patients with a chronic disease. Of these, autism spectrum disorder (20 samples) was selected as a target pathology due to the increased attention to the gut microbiota in its treatment [31,50,51], and Crohn’s disease (14 samples) since it is well known to cause intestinal dysbiosis [5,32,52]. In both cases the percentage of sequence reads containing species-specific 18-mers of L. paracasei genomes was averaged over control samples (10 and four metagenomes, respectively) and samples obtained from patients with autism spectrum disorder (ASD) or Crohn’s disease (10 metagenomes in each case).
In the case of samples obtained from overweight/obese persons, 50 metagenomes were available for comparison, 25 of which were obtained from individuals before adherence to Mediterranean dietary restrictions and 25 after the nutritional plan [33]. Samples were taken from 13 men and 12 women aged from 21 to 65 years. The most promising was the last group with 33 fecal samples taken from 11 healthy antibiotic-naive individuals before and after receiving ciprofloxacin and metronidazole for 7 days, followed by 8 weeks of recovery with probiotics treatment or without it (Supplementary Table S1). Among 11 bacterial species, the probiotic cocktail contained L. paracasei bacteria [34].
Metagenomes were prepared as described in Materials and Methods and species-specific sets of marker 18-mers were obtained to assess the level of L. paracasei strains in fecal samples. Their numbers, indicated in columns four and nine of Table 2, ranged from 1,000,877 (TD 062 strain genome) to 1,250,212 (IBB3423). Thus, the percentage of reads found in metagenomes was normalized per one million marker 18-mers.
In natural metagenomes this percentage reflects the presence of close relatives for each genome. The values obtained for 44 genomes were averaged over four phylogroups indicated in Figure 3 and Table 2 (Figure 4). For strain VKM B-1144 they are shown independently (red plots in Figure 4).
It became clear, that L. paracasei make up about 0.05% of the human intestinal microflora. Their presence is fairly consistent across the given datasets. Variability in the range from 0.02% to 0.1% is most likely due to the difference in methods used by the authors for DNA extraction, because peptidoglycan walls of Gram-positive bacteria require more severe treatment for cell disruption. For all that, we removed one metagenome from the selected metadata because of an abnormally high abundance of L. paracasei in the control sample of the first set (about 4%). This means that such variations do happen.
We did not observe any changes in the presence of L. paracasei in the metagenomes of children with ASD syndrome (Figure 4a) and in response to dietary shift (Figure 4c). The level of L. paracasei in the intestinal microflora of patients with Crohn’s disease tended to decrease (Figure 4b), but this effect was not statistically significant. However, in response to antibiotic administration, the intestinal abundance of L. paracasei was significantly reduced and turned out to be more variable than in the control samples (Figure 4d). Although all phylogroups exhibited highly similar changes, the response of bacteria from the second and fourth groups was statistically more significant. Therefore, at the next step, we compared fold ratios in the number of reads with the marker 18-mers for all 47 genomes in the metagenomes of 11 people in response to antibiotic treatment (volcano plot in Figure 5a).
It became clear that bacteria belonging to phylogroups II and IV (green and blue ovals in Figure 5a) are on average more sensitive to antibiotics than bacteria of the two other groups. In line with the phylogenetic trees (Figure 2 and Figure 3), the strain VKM B-1144 also belongs to the same category. The bacteria of phylogroup III demonstrated the highest diversity in terms of antibiotic sensitivity, which might be symptomatic, because this group subdivided into several clades in Supplementary Figure S1. Probably, the most surprising observation in this part of the study is shown in Figure 5b: even after a long recovery period, the abundance of L. paracasei bacteria remains significantly suppressed in the microbiota of people subjected to antibiotic treatment.

3.6. Strain-Specific Characterization of Antibiotic/Probiotic-Mediated Changes in Microbiomes

Hoping to increase the sensitivity of k-mer-based taxonomy, in the next part of the study we used strain-specific 18-mers, which do not overlap with the barcodes of other known genomes of L. paracasei strains. Their number is much lower than that of species-specific 18-mers (Table 2), and in two cases (strains L9 and MGYG-HGUT-02388) removal of 18-mers present in at least one other genome yielded empty sets. This outcome is explained by the presence of closely related species in our reference set (Table 2) and can be avoided by selecting one of them for withdrawal, which is difficult to substantiate. Hence, in this study we removed 16 genomes with the smallest barcode sets from the analysis. The rest of the sets (bolded in Table 2) ranged from 11,840 to 86,180 18-mers in size, with an average number of 36,794. Thus, the percentage of reads with barcodes of 31 genomes found in metagenomes was normalized per 37,000.
As expected, the antibiotic treatment provided the same effect as in Figure 4d (Figure 6a). After 7 days of antibiotic administration, six volunteers were allowed to recover, and metagenomes of their gut were sampled after 56 days. Only bacteria of the third group approached their control level in the human intestine (Figure 6b). Twenty-eight days of probiotic therapy applied to five individuals caused an apparently more pronounced effect compared to the antibiotic-treated samples (Figure 6c). However, the variability of the L. paracasei response to antibiotic/probiotic administration made the observed changes statistically insignificant. Thus, at the next step we visualized these changes individually for all strains in all metagenomes.

3.7. Tracking Individual Changes in the Presence of L. paracasei Strains in Response to Antibiotic Administration

Figure 7a,b shows the pattern of changes in the multiplicity of bacteria containing marker 18-mers of L. paracasei VKM B-1144, NSMJ15, and NFF04 strains in 11 metagenomes. The same as in Figure 6, plots are combined into two groups, corresponding to samples obtained from individuals later either subjected to probiotic therapy (Figure 7a) or not (Figure 7b). In most cases, we observed an expected decrease in the number of strain-specific 18-mers or an absence of significant changes after antibiotic intake. However, barcodes of the L. paracasei NSMJ15 genome demonstrated an unusual increase in multiplicity. Many other examples of antibiotic-mediated stimulation were found for bacteria of all phylogroups (Figure 7b–j). This apparently disagrees with the volcano plot in Figure 5a, indicating a statistically significant decrease in the number of species-specific barcodes of all genomes in antibiotic-treated samples. Therefore, we checked the possibility that this difference is due to the specificity of strain-specific barcodes (Figure 6 and Figure 7). However, visualizing individual changes with species-specific sets of marker 18-mers, we observed the same alterations as in the case with strain-specific barcodes (exemplified in Figure 7a’,b’ for L. paracasei NSMJ15). Thus, it became clear that individual diversity is masked in volcano plots when averaged over 11 metagenomes.
All strains but one (L. paracasei CAUH35, Group IV) showed antibiotic-mediated augmentation in 1–4 of 11 microbiomes, and there is only one metagenome, where no such effect was detected (ERR2750509). Moreover, in six of the remaining 10 metagenomes, exemplifying the consequence of antibiotic administration and containing bacteria positively responding to this treatment, we were unable to detect the presence of lactobacilli with barcodes corresponding to 1–18 other strains. Thus, it is likely that the adaptation of microbiomes to drugs can be carried out at the level of intraspecific diversification, which was not expected a priori.

3.8. Tracking Individual Changes in the Presence of L. paracasei Strains in Response to Probiotic Therapy and Spontaneous Recovery

The average abundance of L. paracasei VKM B-1144 relatives increased after probiotic therapy (Figure 6c), and in one metagenome their presence (0.0051%) was even higher than before the administration of ciprofloxacin and metronidazole (0.0027%), although it decreased to zero after their intake (Figure 7a). A response with suppression and almost full recovery of these bacteria was observed in two other microbiomes (Figure 7a). However, the percentage of L. paracasei VKM B-1144 relatives in the remaining two samples reduced as expected in response to antibiotics, but decreased even more after probiotic therapy (Figure 7a).
The same behavior in one or two metagenomes was demonstrated by relatives of 16 other strains belonging to different phylogroups (Figure 7c,e,g,i). Among spontaneously recovered microbiomes, the number of strains with this type of change turned out to be higher (26), and in four cases (phylogroups I, II, IV), the final reduction reached zero (Figure 7d,f,h,j), which was not observed for the samples collected after probiotic therapy (Figure 7a,c,e,g,i). Almost all strains with a positive response to antibiotics tended to recover to the control level both after probiotic administration and without it. However, strains NJ (Figure 7c), TMW 1.1434 (Figure 7g), JCM 8130 (Figure 7i), ATCC 334 and FAM18149 (Figure 7j) each showed a further increase in a single metagenome. This may indicate antibiotic-mediated rearrangement in the host microbiome. In any case, visualization of individual changes in the percentage of reads containing strain-specific 18-mer codes allowed us to reveal four different scenarios used by microbiomes to withstand antibiotics.

4. Discussion

Among lactic bacteria, L. paracasei and its close relatives are considered as promising components of probiotics. Studies in animals and humans conducted at cell and organismic levels have shown dozens of its strains to possess beneficial properties for the host [53,54,55,56]. For instance, L. paracasei strain L1 turned out to be able to improve the gut microbiota in chickens and promoted their growth [53]; L. paracasei NCC2461 can attenuate antibiotic induced visceral hypersensitivity in mice [54]. Suzuki and co-authors [55] found L. paracasei KW3110 to inhibit NLRP3 inflammasome activation in macrophages from BALB/c mice, and this effect was shown to be IL-10-dependent. The same strain administered per os reduced monosodium urate crystal-induced peritoneal inflammation in C57BL/6 mice in vivo and its continuous intake mitigated insulin resistance caused by a high fat diet. In another work, Chondrou et al. [56] demonstrated immunomodulatory potential of L. paracasei K5, which promoted upregulation of interleukins IL-1α, ΙL-1β, IL-6, tumor necrosis factor-alpha, as well as some other products related to carcinogenesis, in the human cell line Caco-2. There are several other examples in the review by Vuong and Hsiao [50]. Thus, it is reasonable that bacteria of this species are used in probiotic compositions, although not so often as Limosilactobacillus reuteri or Lacticasebacillus rhamnosus. The newly characterized strain of L. paracasei VKM B-1144 may have an advantage as a probiotic due to its potentially high ability to colonize the intestinal epithelium.
Relatively rare use of L. paracasei in probiotic cocktails may partly be due to the not entirely clear phylogeny of Lacticaseibacillus genus bacteria, and one of the main results of this study is the first classification of the L. paracasei species at the intraspecific level. Revealing three stable phylogroups I, II and IV based on the set of complete genomes, we also detected subgroups in the phylogroup III and an additional clade between groups I and II, when the set, extended by yet unassembled genomes, was analyzed. This is important, because based on hundreds of datasets characterizing intestinal microbiomes, it is usually accepted that their complex species/genus composition can quickly adapt in response to dietary shifts or changes in the physiological state of the host. On the other hand, microbiomes can maintain enterotypes that were optimized in individuals during their development. Thus, it is generally believed that the combination of plasticity and stability is realized at the level of species diversity, while pathogenesis is caused by expansion of a certain strain, as was the case with the E. coli serotype O104:H4 [57].
Since the pathogenicity of individual strains largely depends on their evolutionary origin, information on intraspecific diversity is becoming more and more in demand, which requires the development and introduction of special tools into the routine practice of bioinformatics analysis. Here we used a k-mer-based approach, which allows the revelation of bacterial strains in natural biota and performing phylotyping without multiple sequence alignment of conserved orthologous proteins or genes. We demonstrated its heuristic efficiency by identifying three species of L. paracasei previously/recently attributed to L. casei, which are difficult to distinguish based on classical phylotyping by 16S rRNA due to the close relationship of these species.
For taxonomic analysis, we selected four datasets with different probabilities of having changes in the abundance level of L. paracasei. In our previous paper [5] we revealed an increase in the presence of E. coli in the intestinal microbiota of Crohn’s disease patients, and this effect turned out to be phylogroup dependent. Here we did not observe statistically significant changes in samples taken from Crohn’s disease patients, although some suppression of the presence of L. paracasei was detected (Figure 4). The main difference from the data obtained for E. coli lies in an approximately equal distribution of different L. paracasei strains (their relatives) in all the metagenomes. Since we dealt with k-mers absent in all complete bacterial genomes except the genomes of L. paracasei, the individual barcodes overlapped, and similar 18-mers contributed to the calculated percentage, probably masking the difference, but deviation from the control samples was negligible, so we did not continue analysis of this data set.
Two other sets of metadata, reliably show an absence of difference in L. paracasei abundance in the microbiota of healthy individuals and patients with ASD, as well as in response to an 8-week isocaloric dietary intervention with a Mediterranean diet used by obese persons. The fourth set of 33 metagenomes [34], on the contrary, allowed the revelation of a statistically significant and apparently phylogroup-dependent reduction of L. paracasei abundance in the gut microflora in response to antibiotic treatment (Figure 5a).
Probably, the most important and hardly expected observation was a very slow recovery of microbiota after antibiotic treatment (Figure 6b,c). This contradicts the expected plasticity of microbiomes, if it is not only due to the specific selection of antibiotic-naïve volunteers as models. Decreased levels of lactobacilli in their intestine may be a stage of a normal adaptive response, which might be preserved in the future.
In order to characterize the recovery in more detail, for the first time we implemented strain-specific barcodes, but did not observe significant differences between the phylogroups (Figure 6c), all of which showed unreliable changes in response to probiotic therapy. Moreover, only the relatives of L. paracasei strain TD062 from Group IV and ZY-1 from Group I statistically significantly accumulated after probiotic intake (p = 0.008 and 0.032, respectively). The next two effectively recovered bacteria were related to L. paracasei strain JCM 8130 from Group IV and the singleton NSMJ15 (p = 0.056 and 0.095, respectively). Since all these bacteria belong to different phylogroups, it is unlikely that their more stable recovery can be explained by the mere presence of L. paracasei BAA-52 bacteria in the probiotic cocktail [58] (its phylogroup is unknown, because the genome is absent in the NCBI database). However, within the framework of this study, we received a reasonable explanation for the absence of statistically significant changes in the responses of different phylogroups to antibiotics or probiotics. It is based on different scenarios used by bacteria in different metagenomes to adapt to these agents. Thus, in response to antibiotics, L. paracasei bacteria can both increase and decrease their content in metagenomes and usually tend to restore the initial level upon recovery. Hence, in many cases, the changes caused by antibiotics even appeared to strengthen after 56 days of recovery. To our knowledge, this is the first indication of such complex dynamics of antibiotic-induced changes. The fact that relatives of the same strain often behaved differently in different metagenomes, while in one metagenome (ERR2750041) we observed an antibiotic-mediated increase for relatives of many strains, indicates that the implementation of different growth scenarios by bacteria depends on the enterotype/composition of the intestinal microflora, which must be taken into account when carrying out probiotic therapy.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/life11111246/s1: Figure S1. Phylogenetic tree inferred from a pairwise distance matrix for 222 sets of 18-mers unique to the species L. paracasei; Table S1. Metagenomes used for taxonomic analysis.; Table S2. List including 74 complete genomes of Lacticaseibacillus from the NCBI GenBank; Table S3. List of 175 additional L. paracasei genomes from the NCBI GenBank (assembled in contigs and scaffolds) used for intraspecific phylogenetic analysis.

Author Contributions

Conceptualization, O.O.; methodology, O.O.; software, V.P.; validation, M.F.; formal analysis, T.D., N.K., V.T., V.S.; investigation, O.G., O.A., N.M., K.S.; resources, A.Z.; data curation S.K.; writing—original draft preparation, O.O.; writing—review and editing, O.O. and K.S.; visualization, S.K., K.S.; supervision, O.O.; project administration, V.M.; funding acquisition, S.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the State Contract No. 0373100122121000059.

Institutional Review Board Statement

All human data used in this study were obtained from the publicly available database (NCBI SRA). BioProject ID numbers are indicated in Supplementary Table S1.

Informed Consent Statement

Not applicable since all human data are anonymous and were obtained from publicly available database.

Data Availability Statement

Assembled and annotated genome of L. paracasei VKM B-1144 is available in NCBI (BioProject PRJNA779281).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sun, Z.; Harris, H.M.; McCann, A.; Guo, C.; Argimon, S.; Zhang, W.; Yang, X.; Jeery, I.B.; Cooney, J.C.; Kagawa, T.F.; et al. Expanding the biotechnology potential of lactobacilli through comparative genomics of 213 strains and associated genera. Nat. Commun. 2015, 6, 8322. [Google Scholar] [CrossRef]
  2. Dietrich, C.G.; Kottmann, T.; Alavi, M. Commercially available probiotic drinks containing Lactobacillus casei DN-114001 reduce antibiotic-associated diarrhea. World J. Gastroenterol. 2014, 20, 15837. [Google Scholar] [CrossRef]
  3. Collins, M.D.; Phillips, B.A.; Zanoni, P. Deoxyribonucleic acid homology studies of Lactobacillus casei, Lactobacillus paracasei sp. nov., subsp. paracasei and subsp. tolerans, and Lactobacillus rhamnosus sp. nov., comb. nov. Int. J. Syst. Bacteriol. 1989, 39, 105–108. [Google Scholar] [CrossRef] [Green Version]
  4. Zheng, J.; Wittouck, S.; Salvetti, E.; Franz, C.M.A.P.; Harris, H.M.B.; Mattarelli, P.; O’Toole, P.; Pot, B.; Vandamme, P.; Walter, J.; et al. A taxonomic note on the genus Lactobacillus: Description of 23 novel genera, emended description of the genus Lactobacillus Beijerinck 1901, and union of Lactobacillaceae and Leuconostocaceae. Int. J. Syst. Evol. Microbiol. 2020, 70, 2782–2858. [Google Scholar] [CrossRef] [PubMed]
  5. Panyukov, V.V.; Kiselev, S.S.; Ozoline, O.N. Unique k-mers as strain-specific barcodes for phylogenetic analysis and natural microbiome profiling. Int. J. Mol. Sci. 2020, 21, 944. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Clermont, O.; Christenson, J.K.; Denamur, E.; Gordon, D.M. The Clermont Escherichia coli phylo-typing method revisited: Improvement of specificity and detection of new phylo-groups. Environ. Microbiol. Rep. 2013, 5, 58–65. [Google Scholar] [CrossRef] [PubMed]
  7. Clermont, O.; Dixit, O.V.A.; Vangchhia, B.; Condamine, B.; Dion, S.; Bridier-Nahmias, A.; Denamur, E.; Gordon, D. Characterization and rapid identification of phylogroup G in Escherichia coli, a lineage with high virulence and antibiotic resistance potential. Environ. Microbiol. 2019, 21, 3107–3117. [Google Scholar] [CrossRef] [PubMed]
  8. Clermont, O.; Bonacorsi, S.; Bingen, E. Rapid and simple determination of the Escherichia coli phylogenetic group. Appl. Environ. Microbiol. 2000, 66, 4555–4558. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Clermont, O.; Olier, M.; Hoede, C.; Diancourt, L.; Brisse, S.; Keroudean, M.; Glodt, J.; Picard, B.; Oswald, E.; Denamur, E. Animal and human pathogenic Escherichia coli strains share common genetic backgrounds. Infect. Genet. Evol. 2011, 11, 654–662. [Google Scholar] [CrossRef] [PubMed]
  10. Escobar-Paramo, P.; Clermont, O.; Blanc-Potard, A.B.; Bui, H.; Le Bouguenec, C.; Denamur, E. A specific genetic background is required for acquisition and expression of virulence factors in Escherichia coli. Mol. Biol. Evol. 2004, 21, 1085–1094. [Google Scholar] [CrossRef] [Green Version]
  11. Johnson, J.R.; Owens, K.L.; Clabots, C.R.; Weissman, S.J.; Cannon, S.B. Phylogenetic relationships among clonal groups of extraintestinal pathogenic Escherichia coli as assessed by multi-locus sequence analysis. Microbes Infect. 2006, 8, 1702–1713. [Google Scholar] [CrossRef]
  12. Gordon, D.M.; Clermont, O.; Tolley, H.; Denamur, E. Assigning Escherichia coli strains to phylogenetic groups: Multi-locus sequence typing versus the PCR triplex method. Environ. Microbiol. 2008, 10, 2484–2496. [Google Scholar] [CrossRef] [PubMed]
  13. Jaureguy, F.; Landraud, L.; Passet, V.; Diancourt, L.; Frapy, E.; Guigon, G.; Carbonnelle, E.; Lortholary, O.; Clermont, O.; Denamur, E.; et al. Phylogenetic and genomic diversity of human bacteremic Escherichia coli strains. BMC Genom. 2008, 9, 560. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Gonzalez-Alba, J.M.; Baquero, F.; Canton, R.; Galan, J.C. Stratified reconstruction of ancestral Escherichia coli diversification. BMC Genom. 2019, 20, 936. [Google Scholar] [CrossRef] [Green Version]
  15. Abram, K.; Udaondo, Z.; Bleker, C.; Wanchai, V.; Wassenaar, T.M.; Robeson II, M.S.; Ussery, D.W. Mash-based analyses of Escherichia coli genomes reveal 14 distinct phylogroups. Commun. Biol. 2021, 4, 117. [Google Scholar] [CrossRef] [PubMed]
  16. Brulc, J.M.; Yeoman, C.J.; Wilson, M.K.; Berg Miller, M.E.; Jeraldo, P.; Jindou, S.; Goldenfeld, N.; Flint, H.J.; Lamed, R.; Borovok, I.; et al. Cellulosomics, a gene-centric approach to investigating the intraspecific diversity and adaptation of Ruminococcus flavefaciens within the rumen. PLoS ONE 2011, 6, e25329. [Google Scholar] [CrossRef] [Green Version]
  17. Rozman Grinberg, I.; Yin, G.; Borovok, Y.; Berg Miller, M.E.; Yeoman, C.J.; Dassa, B.; Yu, Z.; Mizrahi, I.; Flint, H.J.; Bayer, E.A.; et al. Functional phylotyping approach for assessing intraspecific diversity of Ruminococcus albus within the rumen microbiome. FEMS Microbiol. Lett. 2015, 362, 1–10. [Google Scholar] [CrossRef] [Green Version]
  18. Hungate, R.E. A roll tube method for cultivation of strict anaerobes. Methods Microbiol. 1969, 3, 117–132. [Google Scholar] [CrossRef]
  19. Alimolaei, M.; Golchin, M. An efficient DNA extraction method for Lactobacillus casei, a difficult-to-lyse bacterium. Int. J. Enteric Pathog. 2016, 4, e32472. [Google Scholar] [CrossRef]
  20. Afgan, E.; Baker, D.; Batut, B.; van den Beek, M.; Bouvier, D.; Cech, M.; Chilton, J.; Clements, D.; Coraor, N.; Gruning, B.A.; et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Res. 2016, 44, W3–W10. [Google Scholar] [CrossRef] [Green Version]
  21. Wattam, A.R.; Brettin, T.; Davis, J.J.; Gerdes, S.; Kenyon, R.; Machi, D.; Mao, C.; Olson, R.; Overbeek, R.; Pusch, G.D.; et al. Assembly, annotation, and comparative genomics in PATRIC, the all bacterial Bioinformatics Resource Center. Methods Mol. Biol. 2018, 1704, 79–101. [Google Scholar] [CrossRef] [PubMed]
  22. Aziz, R.K.; Bartels, D.; Best, A.A.; DeJongh, M.; Disz, T.; Edwards, R.A.; Formsma, K.; Gerdes, S.; Glass, E.M.; Kubal, M.; et al. The RAST Server: Rapid Annotations using Subsystems Technology. BMC Genom. 2008, 9, 75. [Google Scholar] [CrossRef] [Green Version]
  23. Lombard, V.; Golaconda Ramulu, H.; Drula, E.; Coutinho, P.M.; Henrissat, B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014, 42, D490–D495. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Zhang, H.; Yohe, T.; Huang, L.; Entwistle, S.; Wu, P.; Yang, Z.; Busk, P.K.; Xu, Y.; Yin, Y. dbCAN2: A meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2018, 46, W95–W101. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Elbourne, L.D.H.; Tetu, S.; Hassan, K.; Paulsen, I.T. TransportDB 2.0: A database for exploring membrane transporters in sequenced genomes from all domains of life. Nucleic Acids Res. 2017, 45, D320–D324. [Google Scholar] [CrossRef] [PubMed]
  26. Panyukov, V.V.; Kiselev, S.S.; Alikina, O.V.; Nazipova, N.N.; Ozoline, O.N. Short unique sequences in bacterial genomes as strain- and species-specific signatures. Math. Biol. Bioinf. 2017, 12, 547–558. [Google Scholar] [CrossRef]
  27. Sørensen, T. A method of establishing groups of equal amplitude in plant sociology based on similarity of species content. K. Dan. Vidensk. Selskab. Biol. Krifter. 1948, 4, 1–34. [Google Scholar]
  28. Saitou, N.; Nei, M. The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 1987, 4, 406–425. [Google Scholar] [CrossRef]
  29. Rzhetsky, A.; Nei, M. Theoretical foundation of the minimum-evolution method of phylogenetic inference. Mol. Biol. Evol. 1993, 10, 1073–1095. [Google Scholar] [CrossRef] [Green Version]
  30. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular Evolutionary Genetics Analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef]
  31. Wang, M.; Wan, J.; Rong, H.; He, F.; Wang, H.; Zhou, J.; Cai, C.; Wang, Y.; Xu, R.; Yin, Z.; et al. Alterations in gut glutamate metabolism associated with changes in gut microbiota composition in children with autism spectrum disorder. mSystems 2019, 4, e00321–e18. [Google Scholar] [CrossRef] [Green Version]
  32. Vaughn, B.P.; Vatanen, T.; Allegretti, J.R.; Bai, A.; Xavier, R.J.; Korzenik, J.; Gevers, D.; Ting, A.; Robson, S.C.; Moss, A.C. Increased intestinal microbial diversity following fecal microbiota transplant for active Crohn’s disease. Inflamm. Bowel Dis. 2016, 22, 2182–2190. [Google Scholar] [CrossRef] [Green Version]
  33. Meslier, V.; Laiola, M.; Roager, H.M.; De Filippis, F.; Roume, H.; Quinquis, B.; Giacco, R.; Mennella, I.; Ferracane, R.; Pons, N.; et al. Mediterranean diet intervention in overweight and obese subjects lowers plasma cholesterol and causes changes in the gut microbiome and metabolome independently of energy intake. Gut 2020, 69, 1258–1268. [Google Scholar] [CrossRef] [Green Version]
  34. Montassier, E.; Valdes-Mas, R.; Batard, E.; Zmora, N.; Dori-Bachash, M.; Suez, J.; Elinav, E. Probiotics impact the antibiotic resistance gene reservoir along the human GI tract in a person-specific and antibiotic-dependent manner. Nat. Microbiol. 2021, 6, 1043–1054. [Google Scholar] [CrossRef] [PubMed]
  35. Krueger, F. Babraham Bioinformatics—Trim Galore! Available online: https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/ (accessed on 23 September 2021).
  36. Briggs, M. The classification of lactobacilli by means of physiological tests. J. Gen. Microbiol. 1953, 9, 234–248. [Google Scholar] [CrossRef] [Green Version]
  37. Sharpe, M.E. A serological classification of lactobacilli. J. Gen. Microbiol. 1955, 12, 107–122. [Google Scholar] [CrossRef] [Green Version]
  38. Sharpe, M.E. 603. Development of serologically identified lactobacilli added to cheese made without starter. J. Dairy Res. 1955, 22, 374–376. [Google Scholar] [CrossRef]
  39. Tilden, E.B.; Svec, M. Further studies of a differential culture technique for estimations of acidogenic bacteria in saliva. II. Species of lactobacilli isolated from saliva and their distribution in a group of children. J. Dent. Res. 1952, 31, 831–838. [Google Scholar] [CrossRef] [PubMed]
  40. Simelyte, E.; Rimpilainen, M.; Lehtonen, L.; Zhang, X.; Toivanen, P. Bacterial cell wall-induced arthritis: Chemical composition and tissue distribution of four lactobacillus strains. Infect. Immun. 2000, 68, 3535–3540. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Ross, P.; Mayer, R.; Benziman, M. Cellulose biosynthesis and function in bacteria. Microbiol. Rev. 1991, 55, 35–58. [Google Scholar] [CrossRef]
  42. Lorca, G.L.; Font de Valdez, G.; Ljungh, A. Characterization of the protein-synthesis dependent adaptive acid tolerance response in Lactobacillus acidophilus. J. Mol. Microbiol. Biotechnol. 2002, 4, 525–532. [Google Scholar]
  43. Jiao, Y.; Cody, G.D.; Harding, A.K.; Wilmes, P.; Schrenk, M.; Wheeler, K.E.; Banfield, J.F.; Thelen, M.P. Characterization of extracellular polymeric substances from acidophilic microbial biofilms. Appl. Environ. Microbiol. 2010, 76, 2916–2922. [Google Scholar] [CrossRef] [Green Version]
  44. Wang, L.; Wise, M.J. Glycogen with short average chain length enhances bacterial durability. Naturwissenschaften 2011, 98, 719–729. [Google Scholar] [CrossRef]
  45. Kerns, J.C.; Arundel, C.; Chawla, L.S. Thiamin deficiency in people with obesity. Adv. Nutr. 2015, 6, 147–153. [Google Scholar] [CrossRef] [Green Version]
  46. Call, E.K.; Goh, Y.J.; Selle, K.; Klaenhammer, T.R.; O’Flaherty, S. Sortase-deficient lactobacilli: Effect on immunomodulation and gut retention. Microbiology 2015, 161, 311–321. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Ghosh, S.; Sarangi, A.N.; Mukherjee, M.; Bhowmick, S.; Tripathy, S. Reanalysis of Lactobacillus paracasei Lbs2 strain and large-scale comparative genomics places many strains into their correct taxonomic position. Microorganisms 2019, 7, 487. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Parte, A.C.; Sarda Carbasse, J.; Meier-Kolthoff, J.P.; Reimer, L.C.; Goker, M. List of Prokaryotic names with Standing in Nomenclature (LPSN) moves to the DSMZ. Int. J. Syst. Evol. Microbiol. 2020, 70, 5607–5612. [Google Scholar] [CrossRef] [PubMed]
  49. Smokvina, T.; Wels, M.; Polka, J.; Chervaux, C.; Brisse, S.; Boekhorst, J.; van Hylckama Vlieg, J.E.T.; Siezen, R.J. Lactobacillus paracasei comparative genomics: Toward species pan-genomic definition and exploitation of diversity. PLoS ONE 2013, 3, e68731. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Vuong, H.E.; Hsiao, E.Y. Emerging roles for the gut microbiome in autism spectrum disorder. Biol. Psychiatry 2017, 81, 411–423. [Google Scholar] [CrossRef] [Green Version]
  51. Lombardi, M.; Troisi, J. Gut reactions: How far are we from understanding and manipulating the microbiota complexity and the interaction with its host? Lessons from autism spectrum disorder studies. Nutrients 2021, 13, 3492. [Google Scholar] [CrossRef]
  52. Morgan, X.C.; Tickle, T.L.; Sokol, H.; Gevers, D.; Devaney, K.L.; Ward, D.V.; Reyes, J.A.; Shah, S.A.; LeLeiko, N.; Snapper, S.B.; et al. Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment. Genome Biol. 2012, 13, R79. [Google Scholar] [CrossRef] [PubMed]
  53. Xu, Y.; Tian, Y.; Cao, Y.; Li, J.; Guo, H.; Su, Y.; Tian, Y.; Wang, C.; Wang, T.; Zhang, L. Probiotic properties of Lactobacillus paracasei subsp. paracasei L1 and its growth performance-promotion in chicken by improving the intestinal microflora. Front. Physiol. 2019, 10, 937. [Google Scholar] [CrossRef] [PubMed]
  54. Verdu, E.F.; Bercik, P.; Verma-Gandhu, M.; Huang, X.X.; Blennerhassett, P.; Jackson, W.; Mao, Y.; Wang, L.; Rochat, F.; Collins, S.M. Specific probiotic therapy attenuates antibiotic induced visceral hypersensitivity in mice. Gut 2006, 55, 182–190. [Google Scholar] [CrossRef] [PubMed]
  55. Suzuki, H.; Yamazaki, T.; Ohshio, K.; Sugamata, M.; Yoshikawa, M.; Kanauchi, O.; Morita, Y. A specific strain of lactic acid bacteria, Lactobacillus paracasei, inhibits inflammasome activation in vitro and prevents inflammation-related disorders. J. Immunol. 2020, 205, 811–821. [Google Scholar] [CrossRef]
  56. Chondrou, P.; Karapetsas, A.; Kiousi, D.E.; Vasileiadis, S.; Ypsilantis, P.; Botaitis, S.; Alexopoulos, A.; Plessas, S.; Bezirtzoglou, E.; Galanis, A. Assessment of the immunomodulatory properties of the probiotic strain Lactobacillus paracasei K5 in vitro and in vivo. Microorganisms 2020, 8, 709. [Google Scholar] [CrossRef] [PubMed]
  57. Buchholz, U.; Bernard, H.; Werber, D.; Bohmer, M.M.; Remschmidt, C.; Wilking, H.; Delere, Y.; an der Heiden, M.; Adlhoch, C.; Dreesman, J.; et al. German outbreak of Escherichia coli O104:H4 associated with sprouts. N. Engl. J. Med. 2011, 365, 1763–1770. [Google Scholar] [CrossRef] [PubMed]
  58. Zmora, N.; Zilberman-Schapira, G.; Suez, J.; Mor, U.; Dori-Bachash, M.; Bashiardes, S.; Kotler, E.; Zur, M.; Regev-Lehavi, D.; Brik, R.B.-Z.; et al. Personalized gut mucosal colonization resistance to empiric probiotics is associated with unique host and microbiome features. Cell 2018, 174, 1388–1405. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Color diagram obtained as a result of genome-wide RAST annotation, showing the distribution profile of the identified coding sequences according to their belonging to different functional categories. Only gene sets with identified subsystems are indicated.
Figure 1. Color diagram obtained as a result of genome-wide RAST annotation, showing the distribution profile of the identified coding sequences according to their belonging to different functional categories. Only gene sets with identified subsystems are indicated.
Life 11 01246 g001
Figure 2. Phylogenetic tree constructed by the neighbor-joining method in MEGA X software [30]. The tree was inferred from the pairwise distance matrix for 75 sets of 18-mers unique to the genus Lacticaseibacillus. Strains BL23 and W56, previously ascribed to the L. casei species, as well as L. casei 12A, are marked by circles. The clades/branches of species L. rhamnosus, L. casei, L. zeae and L. manihotivorans, as well as four main L. paracasei phylogroups, are marked with different colors. Two branches for MGB0734 and NFFJ04 in the common clade of phylogroups I–III are shown in gray; the isolate VKM B-1144 is highlighted in red, and the outbreak from the whole L. paracasei clade (NSMJ15) is in black. The scale bar shows the Sørensen distance as a percentage.
Figure 2. Phylogenetic tree constructed by the neighbor-joining method in MEGA X software [30]. The tree was inferred from the pairwise distance matrix for 75 sets of 18-mers unique to the genus Lacticaseibacillus. Strains BL23 and W56, previously ascribed to the L. casei species, as well as L. casei 12A, are marked by circles. The clades/branches of species L. rhamnosus, L. casei, L. zeae and L. manihotivorans, as well as four main L. paracasei phylogroups, are marked with different colors. Two branches for MGB0734 and NFFJ04 in the common clade of phylogroups I–III are shown in gray; the isolate VKM B-1144 is highlighted in red, and the outbreak from the whole L. paracasei clade (NSMJ15) is in black. The scale bar shows the Sørensen distance as a percentage.
Life 11 01246 g002
Figure 3. Phylogenetic tree constructed by the neighbor-joining method in MEGA X software [30]. The tree was inferred from the pairwise distance matrix for 47 sets of 18-mers unique to the species Lacticaseibacillus paracasei. The color code is the same as in Figure 2. The scale bar shows the Sørensen distance as a percentage.
Figure 3. Phylogenetic tree constructed by the neighbor-joining method in MEGA X software [30]. The tree was inferred from the pairwise distance matrix for 47 sets of 18-mers unique to the species Lacticaseibacillus paracasei. The color code is the same as in Figure 2. The scale bar shows the Sørensen distance as a percentage.
Life 11 01246 g003
Figure 4. Dependence of the presence of L. paracasei bacteria on the health status (a,b) or acquired treatment (c,d). Circles mark the presence of species-specific 18-mers of 45 genomes in different metagenomes. For phylogroups I–IV, they are averaged over 8–16 genomes (Table 2). Box plots reflect their variability. Data from healthy or untreated individuals (control samples) are plotted on the left in each of the grouped pairs. Boxplots were produced in R using the ggplot2 package.
Figure 4. Dependence of the presence of L. paracasei bacteria on the health status (a,b) or acquired treatment (c,d). Circles mark the presence of species-specific 18-mers of 45 genomes in different metagenomes. For phylogroups I–IV, they are averaged over 8–16 genomes (Table 2). Box plots reflect their variability. Data from healthy or untreated individuals (control samples) are plotted on the left in each of the grouped pairs. Boxplots were produced in R using the ggplot2 package.
Life 11 01246 g004
Figure 5. Volcano plots, reflecting fold change ratios in the number of reads with species-specific marker 18-mers from 47 genomes of L. paracasei in 33 metagenomes in response to a 7-day antibiotic treatment of 11 healthy volunteers (a) and the effectiveness of microbiome recovery over 8 weeks (b). The second set of metagenomes was composed of 5 samples taken from persons who received a probiotic cocktail during the first 28 days and from 6 individuals who recovered spontaneously (Supplementary Table S1). The color code is the same as in Figure 2, Figure 3 and Figure 4. Symbols corresponding to NFFJ04 and NSMJ15 are gray; colored ovals highlight close localization of groups I, II and IV symbols on panel (a). The dashed line on panel (b) indicates the cut-off level for statistical significance corresponding to a 0.05 probability of getting a false result.
Figure 5. Volcano plots, reflecting fold change ratios in the number of reads with species-specific marker 18-mers from 47 genomes of L. paracasei in 33 metagenomes in response to a 7-day antibiotic treatment of 11 healthy volunteers (a) and the effectiveness of microbiome recovery over 8 weeks (b). The second set of metagenomes was composed of 5 samples taken from persons who received a probiotic cocktail during the first 28 days and from 6 individuals who recovered spontaneously (Supplementary Table S1). The color code is the same as in Figure 2, Figure 3 and Figure 4. Symbols corresponding to NFFJ04 and NSMJ15 are gray; colored ovals highlight close localization of groups I, II and IV symbols on panel (a). The dashed line on panel (b) indicates the cut-off level for statistical significance corresponding to a 0.05 probability of getting a false result.
Life 11 01246 g005
Figure 6. Dependence of the presence of L. paracasei bacteria in the human intestine on the antibiotic treatment (a), spontaneous recovery (b) and probiotic therapy (c). Circles show the presence of strain-specific 18-mers of 31 genomes in different metagenomes. For phylogroups I-IV, they are averaged over 5–9 genomes (Table 2). Box plots reflect their variability. Data are shown by paired box plots, of which the left in panel (a) corresponds to samples taken from 11 healthy antibiotic-naïve individuals, while panels (b,c) reflect abundance of L. paracasei in metagenomes sampled from the same persons, who received probiotic therapy (5 samples) or not (6 samples). The dashed line shows the approximate level of the medians in the control samples. Boxplots were produced in R using the ggplot2 package.
Figure 6. Dependence of the presence of L. paracasei bacteria in the human intestine on the antibiotic treatment (a), spontaneous recovery (b) and probiotic therapy (c). Circles show the presence of strain-specific 18-mers of 31 genomes in different metagenomes. For phylogroups I-IV, they are averaged over 5–9 genomes (Table 2). Box plots reflect their variability. Data are shown by paired box plots, of which the left in panel (a) corresponds to samples taken from 11 healthy antibiotic-naïve individuals, while panels (b,c) reflect abundance of L. paracasei in metagenomes sampled from the same persons, who received probiotic therapy (5 samples) or not (6 samples). The dashed line shows the approximate level of the medians in the control samples. Boxplots were produced in R using the ggplot2 package.
Life 11 01246 g006
Figure 7. Visualization of individual changes in the abundance of L. paracasei strains in response to antibiotic administration (time point 2) and subsequent recovery after 56 days with (time point 3 in panels (a,a’,c,e,g,i)) or without (time point 4 in panels (b,b’,d,f,h,j)) probiotics. Line plots in panels (a’,b’) show data obtained with species-specific barcodes. All other panels demonstrate the time course of changes in 11 intestinal microbiomes obtained with strain-specific sets of 18-mers. The order of the samples and their specifications are shown in panels (a,b). The color code is the same as in the previous figures.
Figure 7. Visualization of individual changes in the abundance of L. paracasei strains in response to antibiotic administration (time point 2) and subsequent recovery after 56 days with (time point 3 in panels (a,a’,c,e,g,i)) or without (time point 4 in panels (b,b’,d,f,h,j)) probiotics. Line plots in panels (a’,b’) show data obtained with species-specific barcodes. All other panels demonstrate the time course of changes in 11 intestinal microbiomes obtained with strain-specific sets of 18-mers. The order of the samples and their specifications are shown in panels (a,b). The color code is the same as in the previous figures.
Life 11 01246 g007
Table 1. List of Lacticaseibacillus genomes used for intrageneric phylogenetic analysis.
Table 1. List of Lacticaseibacillus genomes used for intrageneric phylogenetic analysis.
NName of StrainPhylo-
Group
Number of Unique 18-Mers Absent in the Genomes of Other Genera and PlasmidsNName of StrainPhylo-
Group
Number of Unique 18-Mers Absent in the Genomes of Other Genera and Plasmids
L. paracasei40EG9IV1,243,018
1MGB0747I1,342,84141N1115IV1,235,840
2MGB0761I1,342,89342KL1IV1,234,653
3L9I1,326,47443JCM 8130IV1,286,351
4MGYG-HGUT-02388I1,326,47444MGB0734-1,327,753
5NJI1,335,03545NFFJ04-1,287,217
6347–16I1,354,65246VKM B-1144-1,274,372
7CACC 566I1,354,71847NSMJ15-1,208,720
8Lp02I1,307,893L. casei
9IJH-SONE68I1,316,00248ATCC 393 1,216,652
10IIAI1,334,70249LC5 1,398,284
1110266I1,306,215L. zeae
12ZY-1I1,354,74550CECT 9104 (formerly L. casei) 1,326,946
13LC355I1,311,398L. manihotivorans
14SRCM103299I1,333,80451LM010 1,381,360
15HDS-01I1,310,186L. rhamnosus
16HD1–7I1,310,389524B15 1,357,067
17ZhangII1,267,65053GG 1,357,107
18LOCK919II1,373,09354BIO6870 1,356,905
19ZFM54II1,310,42655LR-B1 1,357,098
20TK1501II1,289,09756WQ2 1,351,500
2112A (annotated as L. casei)II1,301,29757LRB 1,333,164
228700:2II1,319,01258hsryfm 1301 1,352,189
23Lpc10II1,345,20159DSM 14870 1,346,181
24MGB0245III1,331,01760IDCC 3201 1,323,375
25MGB0625III1,391,07261BFE5264 1,338,512
26TK-P4AIII1,352,170621–0320 1,350,218
27TMW 1–1434III1,323,01063LR5 1,370,793
28IBB3423III1,400,22464LOCK900 1,332,726
29CBA3611III1,316,23765Pen 1,329,920
30LC2WIII1,316,04766BPL5 1,392,367
31AO356 (7112–2)III1,310,30767ATCC 8530 1,362,386
32BL23 (formerly L. casei)III1,337,55168Lc 705 1,377,678
33W56 (formerly L. casei)III1,337,35369ASCC 290 1,357,004
34BD-IIIII1,337,13770SCT-10–10-60 1,377,318
35TCSIII1,337,71771ATCC 11443 1,377,473
36TD 062IV1,129,42572LOCK908 1,377,399
37FAM18149IV1,162,77973NCTC13764 1,377,359
38CAUH35IV1,187,21074BIO5326 1,377,176
39ATCC 334IV1,229,64075NCTC13710 1,377,380
Table 2. List of L. paracasei strains traced in human metagenomes.
Table 2. List of L. paracasei strains traced in human metagenomes.
NStrain 1PG 2Number ofNStrain Number of
Species-
Specific
18-Mers
Strain-Specific
18-Mers
PGSpecies-
Specific
18-Mers
Strain-Specific
18-Mers
1MGB0747I1,201,2608325MGB0245III1,184,70550,602
2MGB0761I1,201,3068126MGB0625III1,228,58648,092
3L9I1,186,502027TK-P4AIII1,212,48386,180
4MGYG-HGUTI1,186,506028TMW 1.1434III1,199,25225,090
5NJI1,188,56617,47029IBB3423III1,250,21238,367
6347–16I1,207,01713030CBA3611III1,188,32989
7CACC 566I1,207,08721631LC2WIII1,188,15238
8Lp02I1,177,63411,84032subsp. 7112–2III1,182,816121
9IJH-SONE68I1,183,89626,67933BL23III1,199,366427
10IIAI1,198,69916,60734W56III1,199,237571
11subsp 10266I1,172,48825,78435BD-IIIII1,199,063144
12ZY-1I1,216,12146,66536TCSIII1,199,506244
13LC355I1,179,98417,90337NFFJ04-1,155,50771,319
14SRCM 103299I1,194,44823,02138NSMJ15-1,060,65177,519
15HD1.7I1,201,2608339VKM B-1144-1,100,02473,388
16HDS-01I1,201,3068140TD 062IV1,000,87722,825
17MGB0734-1,178,02665,52441FAM18149IV1,028,07228,405
18ZhangII1,143,566888542CAUH35IV1,052,57538,375
19LOCK919II1,230,63028,39443ATCC 334IV1,143,68138,932
20ZFM54II1,169,16022,14544EG9IV1,093,79937,831
21TK1501II1,154,09819,87145N1115IV1,105,81725,748
2212AII1,103,71428,75346KL1IV1,165,96620,265
23subsp. 8700:2II1,184,38823,56447JCM 8130IV1,134,44143,983
24Lpc10II1,179,98439,486
1 Strains indicated in bold were used for species-specific and strain-specific analysis. 2 Phylogroup.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Frolova, M.; Yudin, S.; Makarov, V.; Glazunova, O.; Alikina, O.; Markelova, N.; Kolzhetsov, N.; Dzhelyadin, T.; Shcherbakova, V.; Trubitsyn, V.; et al. Lacticaseibacillus paracasei: Occurrence in the Human Gut Microbiota and K-Mer-Based Assessment of Intraspecies Diversity. Life 2021, 11, 1246. https://doi.org/10.3390/life11111246

AMA Style

Frolova M, Yudin S, Makarov V, Glazunova O, Alikina O, Markelova N, Kolzhetsov N, Dzhelyadin T, Shcherbakova V, Trubitsyn V, et al. Lacticaseibacillus paracasei: Occurrence in the Human Gut Microbiota and K-Mer-Based Assessment of Intraspecies Diversity. Life. 2021; 11(11):1246. https://doi.org/10.3390/life11111246

Chicago/Turabian Style

Frolova, Maria, Sergey Yudin, Valentin Makarov, Olga Glazunova, Olga Alikina, Natalia Markelova, Nikolay Kolzhetsov, Timur Dzhelyadin, Viktoria Shcherbakova, Vladimir Trubitsyn, and et al. 2021. "Lacticaseibacillus paracasei: Occurrence in the Human Gut Microbiota and K-Mer-Based Assessment of Intraspecies Diversity" Life 11, no. 11: 1246. https://doi.org/10.3390/life11111246

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop