Identification of Streptococcus thermophilus Genes Specifically Expressed under Simulated Human Digestive Conditions Using R-IVET Technology

Despite promising health effects, the probiotic status of Streptococcus thermophilus, a lactic acid bacterium widely used in dairy industry, requires further documentation of its physiological status during human gastrointestinal passage. This study aimed to apply recombinant-based in vivo technology (R-IVET) to identify genes triggered in a S. thermophilus LMD-9 reference strain under simulated digestive conditions. First, the R-IVET chromosomal cassette and plasmid genomic library were designed to positively select activated genes. Second, recombinant clones were introduced into complementary models mimicking the human gut, the Netherlands Organization for Applied Scientific Research (TNO) gastrointestinal model imitating the human stomach and small intestine, the Caco-2 TC7 cell line as a model of intestinal epithelium, and anaerobic batch cultures of human feces as a colon model. All inserts of activated clones displayed a promoter activity that differed from one digestive condition to another. Our results also showed that S. thermophilus adapted its metabolism to stressful conditions found in the gastric and colonic competitive environment and modified its surface proteins during adhesion to Caco-2 TC7 cells. Activated genes were investigated in a collection of S. thermophilus strains showing various resistance levels to gastrointestinal stresses, a first stage in the identification of gut resistance markers and a key step in probiotic selection.


Introduction
The lactic acid bacterium Streptococcus thermophilus is widely used as a starter for yogurt and cheese production, mainly for its ability to produce lactic acid and secondary fermentation products with aromatic and textural properties [1]. It has a very long history of use in dairy industry without any identified health problem. Furthermore, virulence-related genes are absent from its genome [2]. Hence, it has been assigned the generally recognized as safe (GRAS) and qualified presumption of safety (QPS) status by the American Food and Drug Administration and European Food Safety Authority (EFSA), respectively. In spite of its very large human consumption and the health claim attributed by EFSA to live yogurt cultures (S. thermophilus and Lactobacillus delbrueckii) to improve lactose digestion [3], the probiotic status of S. thermophilus strains is still poorly explored [4,5].
in humans (for a review, see [31]) as a model of bacterial adhesion to human intestinal epithelial cells, and (iii) anaerobic batch cultures of human feces as a simplified model of the human colon.

Bacterial Strains, Culture, and Transformation Conditions
Bacterial strains and plasmids used in the present study are listed in Table 1. S. thermophilus LMD-9 from the American Type Culture Collection (Manassas, VA, USA) was used to implement the R-IVET system. S. thermophilus strains were stored at −80 • C in reconstituted skim milk (10%, w/v). They were grown at 42 • C under anaerobic conditions (AnaeroGen, Oxoid, Basingstoke, UK) in M17 medium [32] supplemented with 2% (w/v) lactose (LM17) or in 10% (w/v) reconstituted milk (powdered semi-skimmed milk, fast dissolution, Régilait). Unless otherwise specified, antibiotics (Sigma, Saint Quentin Fallavier, France) were added at the following concentrations: spectinomycin 300 µg/mL, streptomycin 20 µg/mL, kanamycin 1 mg/mL, or erythromycin 5 µg/mL. S. thermophilus naturally competent cells were prepared in Chemically Defined Medium (CDM) as previously described [33] and transformed according to Junjua et al. [23]. For long-term storage at −80 • C, competent cells were centrifuged, resuspended in 1/10 volume of the initial culture supplemented with glycerol (14%), and frozen in liquid nitrogen. As S. thermophilus develops in the form of coccus chains, samples were systematically vigorously vortexed for 3 min to break the chains before spreading on solid medium. This step was to ensure that each colony observed after incubation at 42 • C probably came from a single cell. LMD-9 derivative with the prom-loxP-specR-loxP-kanR fragment in the STER_0891 locus (STER_RS04415) Present study S. thermophilus TIL1193 LMD-9 feoB::aphA3 [33] E. coli TOP10 One Shot ® TOP10 chemically competent E. coli cells Invitrogen Plasmids pULNcreB pG + host9 TR derivative containing promoterless recombinase cre gene [23] pSET4S Replication function of pG + host3 and pUC19 [35] pULNcreB-plac pULNcreB derivative containing the promoter of the lactose operon plac cloned in the BglII site upstream of cre [23] S. thermophilus strains came either from CNRZ (Centre National de Recherches Zootechniques, INRA, Jouy-en-Josas, France) or ATCC (American Type Culture Collection, Manassas, VA, USA) collections or were isolated in our laboratory from either yogurt or cheese.
Microorganisms 2021, 9,1113 4 of 26 E. coli TOP10 (Invitrogen, Breda, The Netherlands) was used as an intermediate cloning host and was grown in aerobic conditions at 37 • C in Luria-Bertani (LB) medium under shaking at 240 rpm, supplemented or not with erythromycin 300 µg/mL. Chemical transformation of this strain was performed according to manufacturer's instructions.
2.2. DNA Extraction, Digestion, and PCR Amplification S. thermophilus genomic DNA was extracted as previously described by Fischer et al. [36], and restriction digestions were performed following the manufacturer's instructions. Polymerase chain reactions (PCR) were carried out in a Mastercycler pro thermocycler (Eppendorf, Hambourg, Germany). Oligonucleotides used as primers were purchased from Eurogentec (Seraing, Belgium), and their sequences are described in Table S1 (Supplementary Materials). Classic and overlapping PCRs were performed using enzymes and conditions previously described by Junjua et al. [23]. DNA fragments were separated by electrophoresis in 1% agarose gels using 0.5× Tris-Borate EDTA buffer at 100 V [37]. Molecular weight marker 1 kb and 100 bp DNA ladders from Fermentas were used. Gels were stained with ethidium bromide and imaged using a GelDoc-It Imaging System (Bio-Rad, Marne-la-Coquette, France).

Construction of R-IVET Chromosomal Cassette and S. thermophilus Mutant STUL5003
The strategy used to construct the S. thermophilus STUL5003 mutant strain containing the chromosomal prom-loxP-specR-Tlas-loxP-kanR cassette is presented in Figure 1. In a first series of PCR ( Figure 1A-C), the intermediate strain STUL5002 was built from S. thermophilus LMD-9. This mutant strain contained a chromosomal cassette consisting of the kanR gene, encoding a 3 5 -aminoglycoside phosphotransferase of type III (aphA3), separated from its own promoter by a loxP site ( Figure 1C). The presence of loxP did not preclude the expression of the kanR gene. To obtain the cassette, a series of PCRs was carried out as follows: amplification of the UP#1 and DOWN#1 fragments from genomic DNA of S. thermophilus LMD-9, and of the prom (promoter region of the kanR gene) and loxP-kanR fragments ( Figure 1A) from genomic DNA of S. thermophilus TIL1193 (Table 1) [33]. The four fragments UP#1, prom, loxP-kanR, and DOWN#1 were amplified individually using primer couples #1#2, #3#4, #5#6, and #7#8, respectively ( Figure 1A). Then, an overlapping PCR was carried out using primer couples #1#8 and the four fragments UP#1, prom, loxP-kanR, and DOWN#1 mixed together in equimolar concentrations ( Figure 1B). About 30 ng of the resulting amplified fragments were used for transformation of LMD-9 competent cells. A kanamycin-resistant clone was selected and confirmed to have the prom-loxP-kanR fragment at the STER_0891 locus by colony PCR using the primer couple #9#10 ( Figure 1C). The resulting mutant strain was named STUL5002. The prom-loxP-kanR fragment and chromosomal junctional regions were sequenced to check that no mutation occurred during this mutant construction.
In a second series of PCR ( Figure 1D-F), strain STUL5003 containing a prom-loxP-specR-Tlas-loxP-kanR chromosomal cassette was built. The four fragments UP#2, specR, Tlas, and loxP-DOWN#2 were individually amplified using primer couples #11#12, #13#14, #15#16 and #17#6, respectively ( Figure 1D). Genomic DNA of strain STUL5002 was used as template to amplify the UP#2 and loxP-DOWN#2 fragments. The terminator Tlas fragment was amplified from the pULNcreB plasmid [23]. The plasmid pSET4S (Table 1, [35]) was used to amplify the specR gene encoding a spectinomycin adenyltransferase. Once obtained, fragments UP#2, specR, Tlas, and loxP-DOWN#2 were mixed together in equimolar concentrations, and an overlapping PCR was carried out using primer couple #11#6 ( Figure 1E). About 30 ng of the resulting amplified fragments were used for transformation of STUL5002 competent cells ( Figure 1F). Among the spectinomycin-resistant transformants, three were randomly selected, and the construction was sequenced in each to confirm the presence of the expected chromosomal cassette. Hence, one of them was chosen and named STUL5003. This strain displayed a Spec R Kan S phenotype as expected.  Construction of the chromosomal R-IVET cassette. The R-IVET chromosomal construction was cloned into the locus STER 0891 (t0891) of S. thermophilus LMD-9 resulting in strain STUL5003. This strain was constructed in two steps. The first step (1A to 1C) corresponds to the construction of STUL5002 mutant: PCR amplification of individual fragments (A), overlapping PCR (B), and resulting construction on the chromosome of STUL5002 mutant (C). The second step (1D to 1F) illustrates the construction of STUL5003 mutant: PCR amplification of individual fragments (D), overlapping PCR (E), and resulting chromosomal construction of STUL5003 mutant (F). Numerical values indicated in italics in genes (F) correspond to locus tags as described in the annotated sequence of LMD-9 chromosome (NC_008532_1). Oligonucleoti0des used as amplification primers are indicated as black arrows and are labeled with their # numbers (see Table S1, Supplementary Materials). UP and DOWN regions correspond to upstream and downstream sequences flanking the fragments to be cloned, respectively. The specR, kanR, and prom fragments correspond to genes encoding a spectinomycin adenyltransferase conferring resistance to spectinomycin and streptomycin, a 3 5"-aminoglycoside phosphotransferase of type III conferring resistance to kanamycin, and the promoter region of kanR gene, respectively. The gray triangle corresponds to the loxP sequence that is recognized by the specific site recombinase Cre.

R-IVET Genomic Library Construction
Digestion with AluI and SmaI restriction enzymes, ligation with the T 4 DNA ligase, and dephosphorylation using calf intestinal alkaline phosphatase were performed according to the supplier's recommendations (New England Biolabs, Leiden, The Netherlands). The R-IVET library was constructed in the STUL5003 strain. The genomic DNA of S. thermophilus LMD-9 was partially digested with AluI, and restriction fragments were ligated with the pULNcreB plasmid, which was previously linearized by SmaI and dephosphorylated. DNA digestions were checked by agarose gel electrophoresis and were purified using the High Pure DNA purification Kit (Roche Molecular Biochemicals, Mannheim, Germany) according to the manufacturer's recommendations. Chemically competent cells of E. coli TOP10 were transformed with the ligated plasmids. All resulting clones were pooled and their plasmid DNAs extracted using a Miniprep Kit (Fermentas, Villebon sur Yvette, France). Plasmid DNAs were then introduced by natural transformation into S. thermophilus STUL5003. The resulting S. thermophilus clones were selected on LM17 supplemented with streptomycin, pooled, resuspended into LM17 supplemented with 11.6% glycerol, and stored in aliquots at −80 • C to constitute the S. thermophilus R-IVET genomic library.

Optimization of the Counterselection Condition of Activated R-IVET Clones
To efficiently eliminate Spec S Kan R clones obtained during the R-IVET library growth before its introduction into one of the gut models used in this work, two different antibiotic combinations were tested: spectinomycin 300 µg/mL or a mix of spectinomycin 300 µg/mL and streptomycin 20 µg/mL. Thus, after growth in milk (medium used for TIM-1 and fecal batch culture models) or in LM17 (Caco-2 TC7 model), cells of the R-IVET genomic library were exposed for 3 h, 7 h, or 15 h to these antibiotics. Counterselection effectiveness of the Spec S Kan R clones was then evaluated by plating appropriate dilutions of each sample on LM17 agar supplemented with erythromycin (LM17-ery), a mix of spectinomycin and streptomycin (LM17-spec/strep), or a mix of kanamycin and erythromycin (LM17-kan/ery). As the specR gene confers resistance to both spectinomycin and streptomycin, these antibiotics were used alone or together. After 3 h exposure to spectinomycin 300 µg/mL or a mix of spectinomycin 300 µg/mL and streptomycin 20 µg/mL, 0.15% and 0.03% of colonies plated onto LM17-ery were also present on LM17-kan/ery, respectively. The results appeared slightly better since 0.015% of colonies plated onto LM17-ery were also growing on LM17-kan/ery after 7 h exposure to the mix of spectinomycin 300 µg/mL and streptomycin 20 µg/mL vs. 0.08% when using spectinomycin 300 µg/mL. Similar results were obtained after 15 h exposure. Moreover, for each condition of counterselection tested, 10 clones randomly selected from LM17-kan/ery plates, deletion of loxP-specR fragment from the chromosomal cassette, and presence of an insert in pULNcreB plasmid were confirmed. Therefore, it appeared that counterselection using a mix of spectinomycin/streptomycin was the most efficient. On the basis of these results and for further easy handling, it was decided to grow the R-IVET genomic library overnight in milk (TIM-1 and batch cultures) or in LM17 (adhesion) with this mix of antibiotics.

In Vitro Digestions in the TIM-1 System
The TIM-1 system is composed of four successive compartments, namely, the stomach, duodenum, jejunum, and ileum. This computer-controlled dynamic in vitro model accurately reproduces the main physicochemical parameters found in the stomach and small intestine in vivo (no microbiota): body temperature, kinetics of gastric and intestinal pH, gastric, pancreatic, and liver digestive secretions, chyme transit and mixing, and passive absorption of nutrients and water ( Figure 2). TIM-1 was washed with detergent and sterilized by steaming at 105 • C for 35 min before each experiment to avoid any microbial contamination. In the present study, the model was programmed to reproduce digestion of milk by a healthy human adult ( Table 2). min), jejunum (30,60,120, and 180 min), and ileum (60, 120, and 180 min). Ileal effluents were also kept on ice and collected as pools of periods covering 0-60, 60-120, 120-180, and 180-240 min. At the end of digestion, the gastrointestinal residue was recovered. Samples were 10-fold diluted and appropriate dilutions were plated on LM17 agar (viable count) and on LM17 agar supplemented with a mix of kanamycin and erythromycin (selection of activated Spec S Kan R clones). Three independent experiments were performed in the TIM-1 system.   A power exponential equation (f = 1 − 2 −(t/t1/2)β where f represents the fraction of meal delivered, t is the time of delivery, t 1/2 is the half-time of delivery, and β is a coefficient describing the shape of the curve) was used for the computer control of gastric and ileal deliveries.
R-IVET library clones were precultured for 10 h in milk supplemented with erythromycin. Then, the library was cultured overnight in 300 mL of milk supplemented with spectinomycin and streptomycin. Before introduction into the TIM-1 stomach, the fermented milk was homogenized by vortexing at room temperature for 5 min. Samples (1 mL) were collected from the initial fermented milk (T0) and at several time points during in vitro digestion in the stomach (30,60,90, and 120 min), duodenum (30, 60, and 120 min), jejunum (30,60,120, and 180 min), and ileum (60, 120, and 180 min). Ileal effluents were also kept on ice and collected as pools of periods covering 0-60, 60-120, 120-180, and 180-240 min. At the end of digestion, the gastrointestinal residue was recovered. Samples were 10-fold diluted and appropriate dilutions were plated on LM17 agar (viable count) and on LM17 agar supplemented with a mix of kanamycin and erythromycin (selection of activated Spec S Kan R clones). Three independent experiments were performed in the TIM-1 system.

Fecal Batch Cultures
Fresh feces from healthy human volunteers were used to prepare the bacterial inoculum for batch cultures. Stools (~50 g) were mixed with 350 mL of sodium phosphate buffer (200 mM, pH 6.5) and filtered through a double layer of gauze under strictly anaerobic conditions in a vinyl anaerobic chamber (Coy, Grass Lake, MI, USA). Ten milliliters of the fecal suspension was rapidly transferred to 50 mL crimped vials, flushed with CO 2 , and filled with 20 mL of nutritive medium ( Figure 2). The nutritive medium contained various carbohydrate, protein, lipid, mineral, and vitamin sources, as previously described by Thévenot et al. [38]. For each fecal sample, six vials were prepared and sealed: two control vials with 1 mL of acidified milk and four vials containing 1 mL of an R-IVET genomic library culture. The vials were incubated for 24 h (37 • C, 140 rpm), and samples (1 mL) were collected at 0, 2, 4, and 24 h. Appropriate dilutions were plated on LM17 agar supplemented with a mix of spectinomycin and streptomycin (viable count) and on the same medium supplemented with a mix of kanamycin and erythromycin (selection of Spec S Kan R activated clones and plasmid presence through erythromycin resistance). The experiments were performed in duplicate with the feces of two adult volunteers, a male and a female.

In Vitro Adhesion to Caco-2 Cultures
The R-IVET library was precultured overnight in LM17 supplemented with erythromycin and cultured for 12 h in LM17 supplemented with spectinomycin and streptomycin. The Caco-2 TC7 cell line was used for adhesion studies, and cell culture experiments were performed as previously described by Kebouchi et al. [16] with some modifications. Briefly, bacterial cells from 12 h cultures were pelleted, washed, and resuspended at a final concentration of 10 9 CFU/mL (OD 650 nm of about 12) in Dulbecco's modified Eagle's Minimal Essential Medium (DMEM) with 4.5 g/L glucose (DMEM Glutamax, Fisher Scientific) or in LM17, used as a control. Subsequently, samples were taken separately from the two bacterial suspensions corresponding to T0 LM17 and T0 DMEM samples ( Figure 2). Afterward, the bacterial suspension in DMEM was added to a confluent Caco-2 TC7 cell monolayer at a final concentration of 10 9 CFU/mL to reach a bacterial-cell-toepithelial-cell ratio of 1000:1. Bacterial cells were co-incubated with the cell monolayer for 4 h at 37 • C in a humidified atmosphere with 10% CO 2 and processed further according to Kebouchi et al. [16] to recover cells. All cells (adherent bacteria and eukaryotic cells) recovered from two inserts were pooled and then incubated 30 min at room temperature. Samples were taken separately after recovering from triton treatment of bacterial suspension (T0 triton) and after 30 min incubation (T30 triton). Serial dilutions of all samples (T0 LM17, T0 DMEM, T0 triton, and T30 triton) were performed, and appropriate dilutions were plated onto LM17 agar supplemented with either a mix of spectinomycin and streptomycin or a mix of kanamycin and erythromycin (selection of activated Spec S Kan R clones having a recombinant plasmid). Plates were then incubated at 42 • C for 48 h. Three independent R-IVET adhesion experiments were conducted with Caco-2 TC7 cells.

Sequence Analyses
In the three gut models, excision of the chromosomal cassette was firstly checked in Spec S Kan R clones (qualified as an activated R-IVET clone) by PCR using the primer pair #22 and #29 (Table S1, Supplementary Materials). Detection of a 628 bp amplicon confirmed that the loxP-specR fragment of the R-IVET cassette was deleted because of promoter induction in the recombinant plasmid. To determine which genes from S. thermophilus were induced, the plasmid insert of each activated R-IVET clone was amplified using primers #18 and #19, purified, and sequenced using the same primers by Genewiz Company (Takeley, UK). The nucleotide sequences were analyzed by NCBI BLASTn using the annotated genome sequence of wild-type S. thermophilus LMD-9 [34]. A promoter prediction was performed on sequences obtained by using the online promoter prediction tools Softberry BPROM [39] and phiSITE Promoter Hunter [40].

Statistical Analyses
Data were analyzed using a two-way repeated-measures analysis of variance (ANOVA) followed by a Bonferroni test. The statistical analyses were performed using GraphPad Prism software 7.01 (GraphPad Software, Inc., San Diego, CA, USA). Results were expressed as means ± SEM. Differences were considered statistically significant when p < 0.05. Survival kinetics of the R-IVET library in the TIM-1 system were compared to that of a theoretical transit marker provided by the computer. This marker evolves according to the volume of each compartment, the rate of dilution by digestive secretions, and the chyme flow between two successive compartments, and it corresponds to a 100% survival rate. The comparison was independently performed in each compartment and at each time of sampling. For fecal batch cultures, the number of cultivable cells was determined, and a comparison was independently performed between volunteer 1 and 2 at each time of sampling.

Construction and Validation of the R-IVET Positive Screening Tool
The R-IVET positive screen was designed on the basis of the strategy developed for Enterococcus faecalis [40]. The chromosomal cassette consisting of two antibiotic resistance genes was constructed as detailed in Figure 1 and in Section 2.3 and then introduced into the locus STER_0891, encoding a putative glucose uptake permease, of the S. thermophilus LMD-9 genome to obtain the mutant strain STUL5003. This cassette consisted of the promoter (prom) of the kanamycin resistance gene kanR, followed by a loxP site, the spectinomycin resistance gene specR with its own promoter, the Tlas terminator, another loxP site, and the promoterless kanR gene. Thus, the strain STUL5003 was resistant to spectinomycin and sensitive to kanamycin (phenotype Spec R Kan S ). It was expected that, under the expression of the Cre recombinase, the specR gene would be excised and, consequently, the kanR would be correctly positioned to be expressed under the promoter prom ( Figure 3).
The STUL5003 chromosomal cassette was sequenced to ensure that no mutation occurred during the construction. When compared with the expected sequence of the R-IVET cassette, two mutations were detected: replacement of a cytosine by a guanine in the prom region and deletion of a cytosine in the transcription terminator of the specR gene. Regarding the position of these mutations, as well as the phenotype of STUL5003 and STUL5003-plac strain colonies, as presented below, we concluded that these mutations had no significant impact on the functionality of our R-IVET tool.
Microorganisms 2021, 9, x FOR PEER REVIEW 11 of 28 the kanR would be correctly positioned to be expressed under the promoter prom ( Figure  3).

Figure 3.
Schematic representation of positive-selection R-IVET screen in S. thermophilus STUL5003. The R-IVET technology consists of two elements. The first one is the chromosomal cassette (A) with the spectinomycin resistance gene (specR) flanked by two loxP sites followed by the promoterless kanamycin resistance gene (kanR). The second element is the pULNcreB plasmid (B, Table 1) which possesses the promoterless recombinase gene (cre). Cre recombinase recognizes the loxP sites of the cassette. (C) When the DNA fragment cloned upstream of the cre gene is without promoter activity, only the specR gene is expressed and the two terminators, i.e., specR gene and Tlas downstream of the specR gene, prevent expression of the kanR gene. The corresponding clone is spectinomycin-resistant and kanamycin-sensitive (Spec R Kan S ).
(D) When the DNA fragment possesses promoter activity, Cre recombinase is produced and leads to excision of the specR gene; the kanR gene can, therefore, be expressed from its own promoter. Hence, the clone is spectinomycin-sensitive and kanamycin-resistant (Spec S Kan R ).
The STUL5003 chromosomal cassette was sequenced to ensure that no mutation occurred during the construction. When compared with the expected sequence of the R-IVET cassette, two mutations were detected: replacement of a cytosine by a guanine in the prom region and deletion of a cytosine in the transcription terminator of the specR gene. Regarding the position of these mutations, as well as the phenotype of STUL5003 and STUL5003-plac strain colonies, as presented below, we concluded that these mutations had no significant impact on the functionality of our R-IVET tool.
Functionality of the R-IVET positive screen was validated using the plasmid pULN-creB-plac containing the plac promoter of lactose operon ( Table 1). As this promoter is inducible by lactose and less active in the presence of glucose [23], it was expected that, following the expression of Cre recombinase, all STUL5003-plac strain colonies grown in the presence of lactose would lose the specR gene and express the kanR gene. Hence, this plasmid was introduced into STUL5003 strain to obtain STUL5003-plac strain. Sixty randomly chosen colonies from both strains were grown on five different agar media to check their antibiotic resistance phenotype: LM17 medium without antibiotic (control) or supplemented with either spectinomycin or erythromycin (to confirm the presence of pULN-creB-plac plasmid, which has the ermE gene conferring erythromycin resistance), with kanamycin, or with a mix of erythromycin and kanamycin. We observed that all 60 STUL5003- consists of two elements. The first one is the chromosomal cassette (A) with the spectinomycin resistance gene (specR) flanked by two loxP sites followed by the promoterless kanamycin resistance gene (kanR). The second element is the pULNcreB plasmid (B, Table 1) which possesses the promoterless recombinase gene (cre). Cre recombinase recognizes the loxP sites of the cassette. (C) When the DNA fragment cloned upstream of the cre gene is without promoter activity, only the specR gene is expressed and the two terminators, i.e., specR gene and Tlas downstream of the specR gene, prevent expression of the kanR gene. The corresponding clone is spectinomycin-resistant and kanamycin-sensitive (Spec R Kan S ). (D) When the DNA fragment possesses promoter activity, Cre recombinase is produced and leads to excision of the specR gene; the kanR gene can, therefore, be expressed from its own promoter. Hence, the clone is spectinomycin-sensitive and kanamycin-resistant (Spec S Kan R ).
Functionality of the R-IVET positive screen was validated using the plasmid pULNcreBplac containing the plac promoter of lactose operon ( Table 1). As this promoter is inducible by lactose and less active in the presence of glucose [23], it was expected that, following the expression of Cre recombinase, all STUL5003-plac strain colonies grown in the presence of lactose would lose the specR gene and express the kanR gene. Hence, this plasmid was introduced into STUL5003 strain to obtain STUL5003-plac strain. Sixty randomly chosen colonies from both strains were grown on five different agar media to check their antibiotic resistance phenotype: LM17 medium without antibiotic (control) or supplemented with either spectinomycin or erythromycin (to confirm the presence of pULNcreB-plac plasmid, which has the ermE gene conferring erythromycin resistance), with kanamycin, or with a mix of erythromycin and kanamycin. We observed that all 60 STUL5003-plac strain colonies exhibited a Spec S Kan R Ery R phenotype, because they grew without antibiotic, as well as with erythromycin, kanamycin, or both, but not with spectinomycin. As expected, none of the 60 STUL5003 strain colonies was able to grow with erythromycin and/or kanamycin, while they grew with spectinomycin, as well as on control medium (Spec R Kan S Ery S phenotype). The presence/absence of a loxP-specR excisable fragment was checked by PCR in randomly chosen colonies of STUL5003 and STUL5003-plac strains. In each colony (6/6) of STUL5003 strain, an amplicon of 2091 bp was observed as expected since loxP-specR was present, while an amplicon of 628 bp was detected in each colony (3/3) of STUL5003-plac strain, since loxP-specR was absent. Lastly, sequencing of three STUL5003-plac amplicons confirmed that the chromosomal cassette was deleted.
In addition to validation of R-IVET positive screening, these results also attested to a certain stability of the chromosomal cassette, since none of the 60 STUL5003 colonies lost the chromosomal cassette through a recombination between the loxP sites which, in fact, constituted direct repeats. Lastly, it must be underlined that, at this stage, all of the Spec S Kan R Ery R R-IVET clones (almost 2500) analyzed throughout this work displayed a recombinant pULNcreB.

Construction of the R-IVET Genomic Library and Counterselection of Activated R-IVET Clones
The R-IVET genomic library was constructed using Escherichia coli TOP10 as an intermediate cloning host. Genomic DNA of S. thermophilus LMD-9 partially digested with AluI was ligated with SmaI-digested pULCreB plasmid (Table 1). Then, recombinant plasmids were introduced into E. coli TOP10, and 72,000 clones were selected. Plasmids extracted from E. coli TOP10 were introduced into S. thermophilus STUL5003, resulting in approximately 114,600 colonies selected on LM17-ery medium. Hence, PCRs were performed on plasmids from 56 randomly chosen R-IVET recombinant clones to (i) assess the proportion of recombinant clones, and (ii) establish insert sequences to evaluate LMD-9 genome recovery. The results showed that 37.5% (21/56) of R-IVET clones possessed an insert, with an average size of 500 bp. Analyses of insert sequences revealed that 13 corresponded to simple inserts and eight corresponded to multi-inserts. In the rest of this work, about 50% of R-IVET activated clones exhibited a multi-insert. They were systematically excluded from analyzes. This probably resulted from the method used to set-up the R-IVET genomic library, since it was decided to dephosphorylate the SmaI-digested pULNcreB vector to avoid its self-ligation instead of dephosphorylating the inserts. According to the proportion of recombinant clones (37.5%) and of that of multi-inserts (50%), the probability that each part of S. thermophilus LMD-9 genome exists at least once in the form of a simple insert was estimated at 0.997, using the Clarke and Carbon formula [41]. Lastly, the corresponding fragments of simple inserts came from different parts of the S. thermophilus genome, and no specific genomic region appeared to be over-or under-represented.
As R-IVET technology was used to identify promoters specifically induced under simulated human digestive conditions, counterselection of R-IVET recombinant clones containing a promoter induced during their growth before introduction into one of the three gut models used in this work constituted an essential first step. Hence, preselection conditions were optimized to eliminate Spec S Kan R clones (i.e., clones where recombinase was expressed) and select Spec R Kan S clones (i.e., clones keeping an intact original chromosomal cassette since Cre recombinase was not expressed). R-IVET genomic library clones were grown either in milk (growth medium used for the TIM-1 system and batch cultures of human fecal microbiota) or in LM17 (Caco-2 TC7 model) and exposed to antibiotics at different times, as described in Section 2.5. The better selection condition found corresponded to exposition for at least 7 h to a mix of spectinomycin/streptomycin (specR conferring resistance to both antibiotics) but it seemed impossible to eliminate Spec S Kan R clones present in initial cultures. Consequently, the Kan R clones obtained at the beginning of each experiments (T0) were systematically and carefully analyzed in each of the experiments described in this work.

Survival Kinetics of S. thermophilus under Human Simulated Conditions
Before starting selection of activated R-IVET clones in the three complementary gut models, viability of the R-IVET genomic library under digestive conditions was established, given that S. thermophilus survival in the human GI tract is a key parameter to consider for further evaluation as a probiotic, according to the regulatory definition [6]. Fermented milk was chosen as a food carrier for S. thermophilus in TIM-1 and in fecal batch cultures, since it was previously shown to improve bacterial survival in the in vitro stomach compared to milk [18]. Furthermore, S. thermophilus is commonly consumed by humans via ingestion of fermented milks such as yogurt. For the Caco-2 TC7 adhesion model, a culture medium simpler than fermented milk, i.e., LM17 medium was preferred.
In TIM-1, bacterial survival kinetics was not significantly different from that of a theoretical transit marker provided by the in vitro model during the 60 min digestion in the gastric compartment ( Figure 4A). Conversely, significant bacterial mortality (p < 0.0001) was observed from 90 min when pH fell below 1.8, with a loss of 3 log 10 CFU and 7 log 10 CFU compared to the transit marker at 90 and 120 min, respectively. Monitoring of R-IVET library cell numbers in the three compartments of the small intestine showed a negative impact of digestive conditions on survival ability of these clones in post-gastric conditions ( Figure 4A). The number of viable cells showed a significant decrease (p < 0.001) in the small intestinal compartments after 120 min of digestion, with a loss of around 8 log 10 CFU in the duodenum and 6 log 10 CFU in the jejunum and ileum compared to the transit marker. Bacterial cells which survived the whole digestive process were recovered in the TIM-1 ileal effluents. Final counts, i.e., when the numbers of bacterial cells in ileal effluents and gastrointestinal residue were added, reached 8.2 log 10 CFU versus 9.2 log 10 CFU for the wild-type strain, as previously established by Uriot et al. [18]. This could result either from interruption of the STER_0891 locus where the chromosomal cassette was introduced or from a mutation elsewhere in the genome of STUL5003 strain.

Sequence Analyses of Initially Activated (T0) R-IVET Genomic Library Clones
For the reasons exposed before, R-IVET clones displaying a Spec S Kan R phenotype at T0, i.e., before their introduction in one of the three models, were selected and analyzed. From a physiological point of view, they corresponded to clones that were in a stationary growth phase, with a growth medium at about pH 4.6. Sequencing of inserts from a total In batch cultures of human fecal microbiota, survival rates of the R-IVET library remained unchanged during the first 4 h of incubation ( Figure 4B), and no significant difference was observed between the two volunteers. However, after 24 h incubation, the number of viable S. thermophilus R-IVET cells significantly decreased by approximately 2.5 and 3.5 log 10 CFU for volunteers 1 and 2, respectively. The final survival rate was significantly different (p < 0.0001) between the two volunteers. This result suggested that the R-IVET library was not able to colonize the colon, as observed with wild-type strains in animals and humans after yogurt ingestion [17]. In previous studies in germ-free rat models, lactose was shown to significantly improve the survival capacity of S. thermophilus in the GI tract [21,42]. Of note, here, no supplementary nutrient or carbon source that could have been easily used by S. thermophilus was added to batch cultures.
Lastly, the survival ability of the R-IVET library was determined in the medium used for Caco-2 TC7 cell adhesion assays, in the same conditions as described in Figure 2, but without Caco-2 cells. Results showed no significant decrease in clone survival after 4 h incubation in DMEM medium and triton X-100 at 0.1% ( Figure S1, Supplementary Materials).

Sequence Analyses of Initially Activated (T0) R-IVET Genomic Library Clones
For the reasons exposed before, R-IVET clones displaying a Spec S Kan R phenotype at T0, i.e., before their introduction in one of the three models, were selected and analyzed. From a physiological point of view, they corresponded to clones that were in a stationary growth phase, with a growth medium at about pH 4.6. Sequencing of inserts from a total of 963 clones selected revealed that about 50% consisted of multiple inserts, which were eliminated from the study. Sequencing of single inserts revealed that they originated from about 150 different genes, with certain inserts matching with at least two genes because of the presence of repeated sequences inside the inserts (Table S2, Supplementary Materials). This implies that most of the 150 genes were observed several times. Three situations were distinguished, whereby inserts included (i) a gene promoter (84 genes or 56%), (ii) a sequence located inside the promoter of a gene but on the antisense strand (eight genes or 0.05%), and (iii) a sequence located inside a gene, on either the sense or the antisense strand ( Figure 5 and Table S2, Supplementary Materials). Interestingly, among the 49 promoters from T0 clones of TIM-1 and fecal batch culture, 33 were identified in both and probably correspond to genes induced during growth in milk. Lastly, 20 T0 clones were common to the three gut models, which is consistent with them corresponding to genes involved in basic cellular functions, for example, PepS aminopeptidase, subunit III of DNA polymerase III, ATP subunit of Clp protease, DNA/RNA helicase, or DNA polymerase I (Table S2, Supplementary Materials).
Microorganisms 2021, 9, x FOR PEER REVIEW 15 of 28 strand ( Figure 5 and Table S2, Supplementary Materials). Interestingly, among the 49 promoters from T0 clones of TIM-1 and fecal batch culture, 33 were identified in both and probably correspond to genes induced during growth in milk. Lastly, 20 T0 clones were common to the three gut models, which is consistent with them corresponding to genes involved in basic cellular functions, for example, PepS aminopeptidase, subunit III of DNA polymerase III, ATP subunit of Clp protease, DNA/RNA helicase, or DNA polymerase I (Table S2, Supplementary Materials).

Identification of S. thermophilus Functions Induced in GI Models
After transit through TIM-1, fecal batch incubation, or adhesion to Caco-2 TC7 cells, 1461 Spec S Kan R clones were recovered for further analysis. As for T0 samples, single or multiple inserts upstream of cre gene were observed. Once multiple inserts and T0 activated genes were excluded from the analysis, the resting single-insert sequence corresponded to 98 genes. Once again, some of them were found several times. For 44 genes, the insert included the promoter of a gene already annotated on the LMD-9 strain genome Promoter/sense (see Table 2 and Table S2) S. thermophilus LM D-9 CDS

Promoter region
Promoter/antisense (see Table 3 and Table S2) Inside/sense (see Table 3 and Table S2) Inside/antisense (see Table 3 and Table S2) Figure 5. Location, orientation, and denomination of inserts of activated R-IVET clones compared to S. thermophilus LMD-9 genes. CDS: coding DNA sequence.

Identification of S. thermophilus Functions Induced in GI Models
After transit through TIM-1, fecal batch incubation, or adhesion to Caco-2 TC7 cells, 1461 Spec S Kan R clones were recovered for further analysis. As for T0 samples, single or multiple inserts upstream of cre gene were observed. Once multiple inserts and T0 activated genes were excluded from the analysis, the resting single-insert sequence corresponded to 98 genes. Once again, some of them were found several times. For 44 genes, the insert included the promoter of a gene already annotated on the LMD-9 strain genome (Table 3), whereas the insert corresponding to the 54 other genes included an internal gene sequence (on the sense or antisense strand) or sequences located on the antisense strand of a gene promoter region ( Table 4). Analysis of their sequences using promoter prediction software showed that all of them contained at least a strong signal of promoter activity, often well positioned in relation to the gene cre, considering that 45 bp separated the end of the insert from the beginning of gene cre.
Of the 44 gene promoters, 18, 20, and six were identified in TIM-1, Caco-2 TC7 cells, and fecal batch cultures, respectively (Table 3). In TIM-1, the 18 promoters were identified in the stomach compartment, mainly after 30 and 60 min digestion. In fecal batch cultures, two promoters were observed after 2 h and four promoters were observed after 4 h incubation. Interestingly, only one promoter was detected both in TIM-1 and in fecal batch cultures, while others remained gut region-specific. Functional classes of activated genes were also mainly gut model-dependent. For example, seven out of 18 genes activated in TIM-1 have a protein synthesis function and three are involved in nutrient absorption and metabolism. In Caco-2 TC7, three genes encoding cell surface proteins and two encoding competence proteins were specifically activated. Lastly, two out of six genes activated in fecal batch cultures are involved in the stress response. Of note, a few genes encoding regulators (two in TIM-1 and one in Caco-2 TC7) were also detected in R-IVET activated clones. Furthermore, four genes encoding hypothetical proteins were activated in Caco-2 TC7 compared to three in TIM-1 and only one in fecal batch cultures. Interestingly, inserts from activated R-IVET library clones located inside genes or on the antisense strand mainly belonged to genes involved in nutrient absorption and metabolism (15/55), protein synthesis (5/55), and stress response (6/55). On the remaining 23 inserts, 13 belonged to genes involved in various functions such as peptide synthesis, DNA repair, or DNA internalization (Table 4).  A cell in blue, orange or green means that a promoter has been activated specifically in the TIM-1, Caco-2 TC7 or Fecal batch cultures, respectively. a Locus annotation from 28 January 2014; b locus annotation from 30 July 2015 from NCBI; * the insert is correctly positioned to detect the expression of promoter regions from two adjacent genes, with the possibility of an operon. S30, S60, and S90 denote genes identified in the TIM-1 stomach compartment at 30 min, 60 min, and 90 min, respectively. M4 denotes genes identified in fecal batch cultures from the male volunteer at 4 h. W2 and W4 denote genes identified in fecal batch cultures from the female volunteer at 2 h and 4 h, respectively.  A cell in blue, orange or green means that a promoter has been activated specifically in the TIM-1, Caco-2 TC7 or Fecal batch cultures, respectively. a Locus annotation from 28 January 2014; b locus annotation from 30 July 2015 from NCBI; c see Figure 5 for term explanations. S30, S60, and S90 denote genes identified in the TIM-1 stomach compartment at 30 min, 60 min, and 90 min, respectively. M2 and M4 denote genes identified in fecal batch cultures from male volunteer at 2 h and 4 h, respectively. W2 and W4 denote genes identified in fecal batch cultures from female volunteer at 2 h and 4 h, respectively.

Determination of Activated Gene Variability in S. thermophilus Strains Classified According to Their Resistance to GI Stresses and Adhesion Capacity
In a previous study, we determined the capacity of 30 phylogenetically very close strains of S. thermophilus to resist different stresses known to prevail in the human digestive tract (including acid stress) or to adhere to HT29-MTX mucus-producing cells. This led to the identification of six distinct phenotypic classes [15]. Therefore, we investigated the variability of activated genes from the present study in strains belonging to some of these six phenotypic groups. All genes identified in activated R-IVET clones from TIM-1, fecal batch cultures, or Caco-2 TC7 cells were amplified from the genomes of selected strains (CNRZ160, CNRZ21, EBL308, EBL385, EBLST20, PB18O, PB302, PB385, PB5MJ, ST14, and ST88; see Table 1). Sequences obtained were then compared with similar ones in fully sequenced LMD-9 and LMG18311 S. thermophilus strains [2,34].
As shown in Figure 6 (see full data in Table S3, Supplementary Materials), tested strains displayed three resistance levels against acid stress: high level (strains LMD-9, PB18O, PB302, EBLST20, and EBL385), low level (strains EBL308, ST14, and CNRZ21), and intermediate level (others) [15], with the most sensitive being strain CNRZ21. Out of the 24 genes tested and the 22 corresponding proteins, six (four proteins and two genes) were identical in all selected strains. On the contrary, 18 predicted proteins varied, among which 12 are encoded by genes specifically activated in TIM-1, three are encoded by genes specifically activated in fecal batch cultures, and three are encoded by genes activated in both models. They are involved in various functions such as nutrient absorption and metabolism, regulation of transcription, or stress response. Some of them correspond to hypothetical proteins (Table 3). Interestingly, strains displaying a high resistance level to acidity, especially LMD-9, PB18O, PB302, and EBLST20, shared greater similarity in allele profile than the most sensitive ones (EBL308, ST14, and CNRZ21). A protein signature could be deduced from a comparison of acid-resistant strains, which seemed less obvious for sensitive strains where more variability was observed ( Figure 6 and Table S3, Supplementary Materials).
According to Junjua et al. [15], selected strains displayed also different adhesion capacities to HT29-MTX cells: high capacity for LMD-9, ST88, EBLST20, and LMG18311, intermediate capacity for PB18O and EBL385, and low capacity (unable to adhere) for CNRZ21 and PB5MJ. No clear difference was observed between strains depending on their adhesion capacity (Table S4, Supplementary Materials). Patterns obtained for the very adherent strains LMD-9, ST88, and EBLST20 were very similar, as well as similar to that of weakly adherent strains such as PB5MJ. Furthermore, the pattern obtained for the very adherent strain LMG18311 was very similar to that of the weakly adherent strain CNRZ21. This absence of variability might result from differences in cell models used in our work and Junjua et al. [15].   [15]; d LMD-9 used as a reference strain for comparison; e types of proteins found in strains exhibiting a high level of resistance against GI stress; f types of proteins found in strains exhibiting a low level of resistance against GI stress.

Discussion
The aim of this work was to better understand the physiological status of S. thermophilus during its passage through the human digestive environment. To reproduce the different niches found in the human GI tract, three complementary in vitro models were used, namely, the TIM-1 (stomach and small intestine), fecal batch cultures (colon), and the Caco-2 TC7 cells (interaction with intestinal epithelial cells). Among available approaches that allow the following of bacterial gene expression, we decided to use the R-IVET technology, which is the only one among IVET technologies that functions as a genetic screen [25]. The main advantage of this technology is that it makes it possible to identify genes specifically expressed in complex media, such as the human digestive environment, including transiently and locally expressed genes, at the level of the individual bacterium [25]. R-IVET also bypasses the difficult acquisition step of high-quality RNAs from complex environments required in other techniques (especially in environments LMD-9 d PB18O PB302 EBLST20 LMG18311 CNRZ160 PB385 EBL308 ST14 CNRZ21 EBL305 [15] Table S3 (Supplementary Materials). Ø No PCR amplicon; a locus annotation from 28 January 2014; b locus annotation from 30 July 2015 from NCBI; c according to Junjua et al. [15]; d LMD-9 used as a reference strain for comparison; e types of proteins found in strains exhibiting a high level of resistance against GI stress; f types of proteins found in strains exhibiting a low level of resistance against GI stress.

Discussion
The aim of this work was to better understand the physiological status of S. thermophilus during its passage through the human digestive environment. To reproduce the different niches found in the human GI tract, three complementary in vitro models were used, namely, the TIM-1 (stomach and small intestine), fecal batch cultures (colon), and the Caco-2 TC7 cells (interaction with intestinal epithelial cells). Among available approaches that allow the following of bacterial gene expression, we decided to use the R-IVET technology, which is the only one among IVET technologies that functions as a genetic screen [25]. The main advantage of this technology is that it makes it possible to identify genes specifically expressed in complex media, such as the human digestive environment, including transiently and locally expressed genes, at the level of the individual bacterium [25]. R-IVET also bypasses the difficult acquisition step of high-quality RNAs from complex environments required in other techniques (especially in environments with a complex microbial background as found in the human colon). First, we significantly improved the R-IVET tool applied to S. thermophilus LMD-9, through the construction and validation of a positive screening of activated clones. The LMD-9 reference strain was chosen for three main reasons: (i) availability of its genome sequence, (ii) ability to resist various stresses prevailing in the GI tract, and (iii) adherence to intestinal epithelial cells [15,16,18,34]. Then, we identified specifically activated genes in each digestive condition and observed that some of them varied across S. thermophilus strains showing different levels of resistance against GI stresses.
The chromosomal cassette we constructed allows the identification of activated genes through excision of the first reporter gene specR, leading to the activation of the second reporter gene kanR. Furthermore, by adding erythromycin to the selection medium already containing kanamycin, the activated Spec S Kan R clones could be selected while ensuring the stability of the R-IVET plasmid. During this work, about 2500 clones displaying the Spec S Kan R Ery R phenotype were analyzed. In each of them, a precise excision of the specR gene, through recombination between the lox sites flanking it, was observed, and an insert located upstream the cre gene was present, even though our genomic R-IVET library contained only 37.5% recombinant clones. This result provides a strong argument for the high stability of the constructed chromosomal cassette and the efficiency of designed screening approach. Lastly, the probability that genomic R-IVET library covered the entire LMD-9 genome was very high (0.997).
Identification of genes specifically activated under human simulated digestive conditions using R-IVET technology implies the elimination of any recombinant clone containing a promoter induced during its growth culture (i.e., before introduction in the tested environment). Despite our efforts, it was not possible to counterselect all Spec S Kan R clones before introduction of the R-IVET library in each GI model, leading us to systematically analyze all these clones. This might result from the limited number of antibiotic resistance genes that can be efficiently used in S. thermophilus and constitutes a limitation of our R-IVET tool. Nevertheless, as shown in Table S2 (Supplementary Materials), inserts of T0 activated R-IVET clones often correspond to basic cellular functions expected to be activated during library growth, as bacteria have to display an active metabolism to multiply. Hence, promoters of the corresponding genes cloned upstream the cre gene are obviously activated.
Throughout in vitro experiments, among the 2424 Spec S Kan R clones analyzed, 128 promoters were detected, including 44 specifically activated in the GI models. All of these 44 promoters, expect one corresponding to a gene encoding a hypothetical protein of the phasin superfamily, appeared to be GI model-specific, strengthening the argument that they do not correspond to false positives. Furthermore, in each in vitro model, these promoters were represented several times among activated inserts. Thus, the first conclusion is that S. thermophilus LMD-9 is metabolically active and can finely adapt its metabolism to each of the tested digestive conditions. All of the 18 promoters identified in TIM-1 were isolated from the gastric compartment, and no promoter was identified in the small intestine. This could be linked to a survival loss due to acidic pH in the stomach and high bile salt concentrations in the proximal small intestine. as the number of activated R-IVET clones that can be recovered is already weak, the significant drop in genomic library survival in the gastric and duodenal compartments might have contributed to the non-selection of activated clones. In the gastric compartment, S. thermophilus activated a higher number of genes involved in nutrient absorption and metabolism and in protein synthesis (10/18) compared to Caco-2 TC7 cells (3/20) and the colon model (2/6). Interestingly, while the gastric compartment was expected to be very stressful because of acidic conditions, only one gene (STER_0303) involved in stress response was identified (Figure 7). This gene might be involved in protecting cells against mutagenesis in relation to the HAM1 domain found in the last 200 amino-acid residues of the protein [43]. Detection of such a low number of stress-related genes could also result from adaptation to acidic conditions of the R-IVET library during its growth in milk, since, at the end of fermentation, pH was still low (pH 4.6). Therefore, genes known to be involved in acidic stress resistance such as GroEL, GroES, or Hsp [44,45], activated from pH 5, were likely expressed during the growth phase, and the corresponding clones were eliminated during the counterselection step applied before TIM-1 inoculation. Of interest, an insert belonging to the groES gene was detected in an activated R-IVET clone; however, as it was a multi-insert clone, it was not included in the study. Even if there was only one stress protein induced, analysis of other genes activated during the TIM-1 gastric phase suggests an adaptation of the bacterium to very stressful conditions (Figure 7). As an example, the gene STER_1048 encodes a protein that displays a domain belonging to the superfamily of phasins, which are involved in many bacteria during formation and intracellular accumulation of polyhydroxyalkanoates under adverse conditions, which allows cells increasing their fitness and resistance to stresses [46].
Genes induced during adhesion assays suggest that S. thermophilus modulated its surface properties to adhere to Caco-2 TC7 cells (Figure 7). Three genes encoding cell surface proteins were induced during cell adhesion, two of them having an unknown function. The third was a cell membrane division protein designated as rod shape-determining protein RodA (STER_1197). This protein contains the FtsW domain [47], which appears to be required in crossing peptidoglycan precursors outside the cell membrane and for peptidoglycan biosynthesis, thus being involved in maintaining cell shape and modulating cell-wall morphology [48]. Furthermore, it has been shown that the RodA protein is essential for the cell viability of Bacillus subtilis and to maintain cell surface shape [49]. Three genes involved in nutrient absorption and metabolism were also induced; among them, one (STER_1494) encodes an ABC transporter [50]. Even if its role has to be further documented, it must be noted that a significant decrease in adhesion of B. bifidum to Caco-2 TC7 cells has been observed after deletion of ABC transporters encoding genes located at the outer cell surface [51].
Only six promoters were identified in batch cultures of human fecal microbiota. Two of them correspond to genes involved in protein synthesis, two encode stress proteins (STER_0572 and STER_1347 genes encoding two ABC permeases possibly involved in antimicrobial peptide transport; see [52]), and two encode proteins of the CRISPR immune system, one being an hypothetical protein while the other is the CRISPR-associated endoribonuclease Cas6. The activation of stress proteins and proteins from the CRISPR system involved in cell protection against foreign DNA, such as plasmid or virus DNA [53], suggests that S. thermophilus reacts to an environment where competition with other gut microorganisms is very hard (Figure 7). Moreover, 54 unique inserts allowing the expression of the cre gene but not corresponding to an already annotated promoter region were revealed in gut models. There were three different cases for the location of insert sequence allowing Cre recombinase expression: (i) inside a gene and in the sense orientation, (ii) inside a gene but in the antisense orientation, and (iii) in the promoter region of a gene but on the antisense strand. All these inserts ere carefully analyzed using several predictive promoter software. In any case, there was at least one part of the sequence, located near the insert extremity and, hence, correctly positioned, that could serve as a promoter. Furthermore, analysis of the sequence located downstream revealed, in certain cases, the presence of small ORFs. The S. thermophilus genome is known to contain small ORFs encoding peptides involved in the regulation of response such as quorum sensing [54,55]. Thus, if their promoter activity is confirmed, these inserts could belong to regions containing small ORFs, not yet considered during genome annotation. Furthermore, sequences located inside a gene but in the antisense orientation could correspond to promoters of potential noncoding antisense RNA, as previously suggested in other studies using R-IVET technology, e.g., in L. plantarum [25] or Enterococcus faecalis [56,57]. As mentioned before, no Spec S Kan R clone obtained throughout this work was devoid of an insert. However, we cannot formally exclude that at least certain inserts corresponded to false positives as proposed by Koguchi et al. [27]. If we rule out the false positive hypothesis, then these data suggest the R-IVET technology as helpful for a better understanding of gene regulation, with potential use in genome annotation of a targeted strain such as S. thermophilus LMD-9.
Microorganisms 2021, 9, x FOR PEER REVIEW 25 of 28 as helpful for a better understanding of gene regulation, with potential use in genome annotation of a targeted strain such as S. thermophilus LMD-9. Lastly, we established the allelic form of the 44 genes revealed in GI models and deduced corresponding proteins in a collection of phylogenetically very close S. thermophilus strains [15]. These strains were previously described as displaying different survival capacities to known GI stresses (such as acidic pH, bile salts, and oxidation stress) and varied adhesion capacity, although, in this last case, adhesion was determined using HT-29 MTX and not Caco-2 TC7 cells. For the 24 genes activated in TIM-1 and fecal batch cultures, comparison of the corresponding proteins between strains having a high level and a low level of resistance to GI stresses allowed us to draw a protein signature for the two categories. The next step will be to determine their role in S. thermophilus resistance to GI conditions by making mutants and performing further experiments in TIM-1 and batch culture models. The ultimate goal of this study is to apply R-IVET technology to identify selection markers of GI resistance that will be helpful in the screening of S. thermophilus strains with potential probiotic properties.
Supplementary Materials: The following are available online at www.mdpi.com/xxx/s1: Figure S1. Survival of the R-IVET library in adhesion experiment conditions; Table S1. Oligonucleotides used for the constructions, verification of chromosomal R-IVET cassette, sequencing of inserts from activated clones, and amplification of activated genes in various S. thermophilus strains; Table S2. Inserts from activated R-IVET clones identified at T0 in the different gut models; Table S3. Allele variability of R-IVET activated genes identified in TIM-1 and fecal batch cultures; Table S4. Allele variability of R-IVET activated genes identified in Caco-2 TC7 cells.   Lastly, we established the allelic form of the 44 genes revealed in GI models and deduced corresponding proteins in a collection of phylogenetically very close S. thermophilus strains [15]. These strains were previously described as displaying different survival capacities to known GI stresses (such as acidic pH, bile salts, and oxidation stress) and varied adhesion capacity, although, in this last case, adhesion was determined using HT-29 MTX and not Caco-2 TC7 cells. For the 24 genes activated in TIM-1 and fecal batch cultures, comparison of the corresponding proteins between strains having a high level and a low level of resistance to GI stresses allowed us to draw a protein signature for the two categories. The next step will be to determine their role in S. thermophilus resistance to GI conditions by making mutants and performing further experiments in TIM-1 and batch culture models. The ultimate goal of this study is to apply R-IVET technology to identify selection markers of GI resistance that will be helpful in the screening of S. thermophilus strains with potential probiotic properties.
Supplementary Materials: The following are available online at https://www.mdpi.com/article/10 .3390/microorganisms9061113/s1: Figure S1. Survival of the R-IVET library in adhesion experiment conditions; Table S1. Oligonucleotides used for the constructions, verification of chromosomal R-IVET cassette, sequencing of inserts from activated clones, and amplification of activated genes in various S. thermophilus strains; Table S2. Inserts from activated R-IVET clones identified at T0 in the different gut models; Table S3. Allele variability of R-IVET activated genes identified in TIM-1 and fecal batch cultures; Table S4. Allele variability of R-IVET activated genes identified in Caco-2 TC7 cells.
Institutional Review Board Statement: Not applicable. This study being a non-interventional study with no addition to usual clinical care, according to the French Health Public Law (CSP Art L 1121-1.1), the protocol does not require approval from an ethic committee.