Arginine Deiminase and Biotin Metabolism Signaling Pathways Play an Important Role in Human-Derived Serotype V, ST1 Streptococcus agalactiae Virulent Strain upon Infected Tilapia

Simple Summary Patients who were infected with Streptococcus agalactiae (ST1) were mainly associated with asymptomatic carriage. However, the invasive diseases in non-pregnant adults caused by S. agalactiae (serotype V, ST1) have increased recently. We have previously reported that human-derived S. agalactiae (serotype V, ST1) could infect tilapia with virulence and pathologic characteristics similar to highly virulent tilapia-derived S. agalactiae (ST7) strains. The potential risk of cross-species infection cannot be ignored. Therefore, our research provided a multi-omics analysis of the human-derived serotype V ST1 S. agalactiae strains, which were virulent and non-virulent to tilapia and provided a more comprehensive understanding of the virulence mechanism. Abstract Our previous study showed that human-derived Streptococcus agalactiae (serotype V) could infect tilapia, but the mechanism underlying the cross-species infection remains unrecognized. In this study, a multi-omics analysis was performed on human-derived S. agalactiae strain NNA048 (virulent to tilapia, serotype V, ST1) and human-derived S. agalactiae strain NNA038 (non-virulent to tilapia, serotype V, ST1). The results showed that 907 genes (504 up/403 down) and 89 proteins (51 up/38 down) were differentially expressed (p < 0.05) between NNA038 and NNA048. Among them, 56 genes (proteins) were altered with similar trends at both mRNA and protein levels. Functional annotation of them showed that the main differences were enriched in the arginine deiminase system signaling pathway and biotin metabolism signaling pathway: gdhA, glnA, ASL, ADI, OTC, arcC, FabF, FabG, FabZ, BioB and BirA genes may have been important factors leading to the pathogenicity differences between NNA038 and NNA048. We aimed to provide a comprehensive analysis of the human-derived serotype V ST1 S. agalactiae strains, which were virulent and non-virulent to tilapia, and provide a more comprehensive understanding of the virulence mechanism.


Introduction
Streptococcus agalactiae not only causes pneumonia and meningitis in humans, but also causes streptococcosis in farmed tilapia [1][2][3]. However, some zoonotic/multi-host lineages have emerged

Data Analysis of the Transcripts
After removing the unqualified reads from the raw data, clean reads were obtained and mapped to the reference genome (NCBI, Accession NO.NZ_CP011325.1) by using STAR (v2.5.1b). The reads numbers of each gene were calculated by HTSeq v0.6.1 [31], and measured with FPKM numbers [32]. DESeq R package (1. 18.0) provided a differential expression analysis of the two groups, and genes with an adjusted p-value < 0.05 were assigned as differentially expressed (adjusted with Benjamini and Hochberg's approach) [33]. The statistical DEGs were annotated and enriched by Gene Ontology and KEGG databases, which were performed by GOseq [34] and KOBAS v2.0 [35], respectively.

Protein Extraction and Peptide Preparation
The strain samples were lysed in the glass homogenizer individually with 1 mL of lysis buffer (50mM Tris buffer, 8M Urea, 1%SDS, pH = 8). The homogenate was disrupted by ultrasonic, then centrifuged at 12,000 g for 15 min at 4 • C. The supernatant was mixed with four volumes of 10 mM DTT in cold acetone (10%), after determining the protein concentration (Bradford assay). Static settled at −20 • C for 2 h. The sediments were collected after centrifugation, washed twice with cold acetone and dissolved in the buffer (50 mM Tris buffer, 8 M Urea, pH = 8).
After trypsin digestion, the peptide was desalted with a C18 cartridge to remove the high urea, and the desalted peptides were dried by vacuum centrifugation.

The Identification and Quantitation of Protein
Proteome Discoverer2.2 software was used to search the resulting spectra of each fraction from the "Run2_Streptococcus_agalactiae_GCF_001190805.1_ASM119080v1_protein.fasta" database (mass tolerance: precursor ion scans, 10 ppm; product ion scans, 0.02 Da). Carbamidomethyl was used as the fixed modification, while oxidation of methionine, acetylation of the N-terminus and iTRAQ 8-plex of tyrosine and lysine were used as variable modifications (miscleavage sites ≤ 2).
The raw data of mass spectrometry used dataset identifier < PXD011206> (ProteomeXchange Datasets).

Data Analysis of Proteome
InterProScan 5 was used to classify the functions of the DEPs by searching the GO, InterPro, COG and KEGG databases [36]. STRING-db was used to predict the interactions of the proteins [37]. The enrichments were performed by pipelines [38].

Real-Time Quantitative PCR (RT-qPCR)
The target genes were predicted to be associated with virulence. RT-qPCR was used to compare the expression levels of the target genes between NNA038 and NNA048. Primer Premier (Ver. 5.0) was used to design the specific primers of the target genes, and recA [39] gene was used as the reference gene. The total RNA was extracted from the culture strains by RNAiso Plus (Takara Bio, Beijing), then the reverse transcription and Real-time qPCR were performed by PrimeScript™ RT reagent Kit (Takara Bio, Beijing, China). The expression level was determined as a relative expression to recA using the 2(-∆∆C(T)) method.

Virulence Test
The results showed that NNA048 was virulent and NNA038 was avirulent to tilapia, which was consistent with our previous studies [11].

Statistic Analysis of the Multi-Omics Profiles
In RNA-seq profiles, a total of 47,985,674 raw reads (10,999,064 and 12,442,108 for NNA038 as well as 12,287,016 and 12,257,486 for NNA048, respectively) were generated. After removing the unqualified reads, 47,435,932 clean reads were obtained. Approximately 92.15% (89.61%-96.70%) of the mapped reads were acquired from the RNA-seq experiment, of which 90.76% (86.88%-95.45%) were mapped to unique genomic locations ( Figure 1). A total of 907 genes (504 up/403 down) ( Figure 2A) and 89 proteins (51 up/38 down) ( Figure 2B) were significantly altered (p < 0.05) in the NNA038 vs NNA048 group. Among them, 56 proteins were significantly altered with similar trends at both mRNA and protein levels ( Figure 2C). Functional annotation of them showed that they were mainly enriched in the biosynthesis of amino acids, fatty acid metabolism and pyrimidine metabolism pathways ( Figure 2D).

Functional Annotation of DEGs and DEPs
The differentially expressed genes and proteins between NNA038 and NNA048 were annotated and enriched by GO and KEGG databases. By GO analysis, DEGs were mainly enriched in the organonitrogen compound metabolic process, the small molecule metabolic process and the organonitrogen compound metabolic process. The DEPs were mainly enriched in nitrogen compound transport, isomerase activity and catalytic activity. By KEGG pathway analysis, the DEGs were mainly enriched in oxidative phosphorylation, a biosynthesis of antibiotics and fatty acid metabolism. The DEPs were mainly     Among the enriched signaling pathways, the arginine biosynthesis and biotin metabolism pathways were significantly enriched, and considered to be an important reason for the significant difference between NNA038 and NNA048 in the pathogenicity of tilapia ( Figure 3A). organonitrogen compound metabolic process, the small molecule metabolic process and the organonitrogen compound metabolic process. The DEPs were mainly enriched in nitrogen compound transport, isomerase activity and catalytic activity. By KEGG pathway analysis, the DEGs were mainly enriched in oxidative phosphorylation, a biosynthesis of antibiotics and fatty acid metabolism. The DEPs were mainly enriched in a biosynthesis of amino acids, fructose and mannose metabolism, arginine biosynthesis and biotin metabolism.

Functional Annotation of DEGs and DEPs
Among the enriched signaling pathways, the arginine biosynthesis and biotin metabolism pathways were significantly enriched, and considered to be an important reason for the significant difference between NNA038 and NNA048 in the pathogenicity of tilapia ( Figure 3A).

RT-qPCR
RT-qPCR was used to verify the expression levels of the target genes between NNA038 and NNA048. The target genes in arginine biosynthesis and biotin metabolism signaling pathways were verified in Figures 3 and 4, respectively. The relative expression of the target genes were significantly different between NNA038 and NNA048, which has similar alteration trends to the multi-omics results.

RT-qPCR
RT-qPCR was used to verify the expression levels of the target genes between NNA038 and NNA048. The target genes in arginine biosynthesis and biotin metabolism signaling pathways were verified in Figures 3 and 4, respectively. The relative expression of the target genes were significantly different between NNA038 and NNA048, which has similar alteration trends to the multi-omics results.

Discussion
As a multi-host infectious pathogen, the risk of cross-host infection of GBS has been widely concerning [5,[40][41][42][43][44][45]. It had been reported that human-derived S.agalactiae (ST23, ST7, ST19 and ST17) could infect fish, causing invasive disease and even death [4,27,28,46,47]. In addition, the consumption of raw fish has caused invasive streptococcosis in humans [6]. The streptococcosis could outbreak in a short time by spreading through water and densely farmed fish. At present, the treatment for streptococcosis in aquaculture is mainly dependent on antibiotics. However, this causes water pollution and resistant strains, which leads to a vicious circle. The cross-species infection between humans and fish poses a serious threat to human health. Hetron et al. have provided an overview of the progress and prospects of functional genomics based on conventional single gene sequencing, cloning and characterization together with HTS in the prevention and control of fish diseases in aquaculture, which showed that genomics studies made a great contribution in the prevention and control of fish diseases in aquaculture [26]. It has been reported that the proportion of infection caused by serotype V, ST1 S.agalactiae in non-pregnant adults has increased, which may cause infant death [7][8][9][10]48,49]. Therefore, the highly virulent strains which could spread across species need to be a higher concern. Human-derived S.agalactiae NNA048 and NNA038 (Serotype V, ST1) could both infect tilapia, but the pathogenicity was quite different. Genome study showed that there were major differences in phage sequences, NNA048 specificcally possesses an intact phage sequence which is encoded with 68 proteins [11]. In this study, 56 proteins were significantly differentially expressed between NNA048 and NNA038 with similar alteration trends at both mRNA and protein levels by multi-omics analysis. These DEGs and DEPs may be closely associated with the pathogenicity of the strains. Functional analysis of them will provide a comprehensive understanding of the virulence mechanism.

Discussion
As a multi-host infectious pathogen, the risk of cross-host infection of GBS has been widely concerning [5,[40][41][42][43][44][45]. It had been reported that human-derived S. agalactiae (ST23, ST7, ST19 and ST17) could infect fish, causing invasive disease and even death [4,27,28,46,47]. In addition, the consumption of raw fish has caused invasive streptococcosis in humans [6]. The streptococcosis could outbreak in a short time by spreading through water and densely farmed fish. At present, the treatment for streptococcosis in aquaculture is mainly dependent on antibiotics. However, this causes water pollution and resistant strains, which leads to a vicious circle. The cross-species infection between humans and fish poses a serious threat to human health. Hetron et al. have provided an overview of the progress and prospects of functional genomics based on conventional single gene sequencing, cloning and characterization together with HTS in the prevention and control of fish diseases in aquaculture, which showed that genomics studies made a great contribution in the prevention and control of fish diseases in aquaculture [26]. It has been reported that the proportion of infection caused by serotype V, ST1 S. agalactiae in non-pregnant adults has increased, which may cause infant death [7][8][9][10]48,49]. Therefore, the highly virulent strains which could spread across species need to be a higher concern. Human-derived S. agalactiae NNA048 and NNA038 (Serotype V, ST1) could both infect tilapia, but the pathogenicity was quite different. Genome study showed that there were major differences in phage sequences, NNA048 specificcally possesses an intact phage sequence which is encoded with 68 proteins [11]. In this study, 56 proteins were significantly differentially expressed between NNA048 and NNA038 with similar alteration trends at both mRNA and protein levels by multi-omics analysis. These DEGs and DEPs may be closely associated with the pathogenicity of the strains. Functional analysis of them will provide a comprehensive understanding of the virulence mechanism. The gastrointestinal tract is the main route for a S. agalactiae infection. When a pathogen invades the host through the gastrointestinal tract, it must first adapt to the acidic conditions and survive in the stomach to start its infectious cycle [50,51]. Acid resistance is an important factor affecting bacterial pathogenicity. The arginine deiminase system (ADS) of the bacteria allows the bacteria to survive potentially lethal acidification through production of ammonia to raise the environmental pH value [51]. Curran et al. found that the ADS in mutans streptococci could protect it against acid damage [52]. Xiong et al. also found that ADS could help Laribacter hongkongensis survive in the hostile acidic environment of the stomach [53]. ADS consists of three enzymes: arginine deiminase, ornithine carbamoyltransferase and carbamate kinase [52]. The ADS-encoded genes are commonly organized as an operon, a lot of genes have been found associated with the ADS functions, but the arrangement and components differ among species [54][55][56][57][58]. In this study, we have identified some over-expressed genes which are homologous to the published S. agalactiae ADS-encoding genes. These genes were predicted to participate in ADS functions. The results showed that the ASL, ADI, gdhA, glnA and arcC genes were significantly down-regulated in NNA048, while OTC encoding genes were up-regulated in NNA048, which leads to ornithine carbamoyltransferase and carbamate kinase having significantly higher expression in NNA048. Therefore, NNA048 could provide more ATP for growth and proliferation derived from catabolism of arginine to ornithine, CO 2 , and NH 3 . In addition, the high concentration of ammonia is toxic to most fish because exposure of the brain to elevated ammonia concentrations leads to a wide range of neuro-cognitive deficits, intellectual disabilities, coma and death [49], which are the typical symptoms of GBS diseases [59]. The differences in ADS pathways may be the key cause of the difference between the pathogenicity of NNA038 and NNA048.
The synthesis of fatty acids is necessary because the bacterial fatty acids are important cellular components, such as the cell membrane [60]. Fatty acid biosynthesis in bacteria is carried out by the type II fatty acid synthase (FAS) system, which consists of four basic reactions. Each of the reactions is performed by discrete enzymes, which are highly specific to different species of bacteria [61]. We have shown that FabF, FabG and FabZ homologous genes in fatty acid metabolic signaling pathways are differentially expressed in NNA038 and NNA048. The enzymes encoded by these genes catalyze one of the reactions in the type II FAS system [62][63][64]. The differential expression of these coding genes leads to significant down-regulation of FabF and FabG proteins, and significant up-regulation of FabZ protein in NNA048. These changes affect the downstream biotin metabolic signaling pathway because the biotin synthase BioB and biotin operon repressor BirA genes are significantly down-regulated in NNA038, which affects the synthesis of biotin [65,66]. Biotin, a water-soluble vitamin of the B complex, functions as a cofactor of carboxylases that catalyzes an indispensable cellular metabolism. Studies have shown that biotin deficiency up-regulates TNF-alpha production in murine macrophages, which affects the cytoactive of macrophages [67]. It has been shown that GBS can survive for a long time after being phagocytosed by phagocytic cells such as macrophages and neutrophils [67][68][69], thereby escaping the killing effect of active antibacterial molecules in the blood [12,50,69]. Therefore, the differences in fatty acid metabolic pathways lead to differences in the growth and proliferation abilities of NNA038 and NNA048, which ultimately leads to differences in pathogenicity to tilapia. FabF, FabG, FabZ, BioB and BirA genes might be the important genomics-driven targets for antibacterial drug discovery.

Conclusions
In this study, a multi-omics analysis was performed on a highly virulent human-derived S. agalactiae strain NNA048 (serotype V, ST1) and avirulent strain NNA038 (serotype V, ST1). We have elucidated the differences between the two strains at both the mRNA and protein levels. The differences in fatty acid metabolic pathways (FabF, FabG, FabZ, BioB and BirA) and the arginine deiminase system (gdhA, glnA, ASL, ADI, OTC and arcC) may lead to the differences in pathogenicity to tilapia among the two strains. This study provided a more comprehensive understanding of the virulence mechanism of the human-derived serotype V, ST1 S. agalactiae strains, and some potential targets for antibacterial drug discovery were identified.