Construction and Application of a Plasmid-Based Signal Peptide Library for Improved Secretion of Recombinant Proteins with Priestia megaterium

The secretion of recombinant proteins plays an important role in their economic production and purification. The secretion efficiency depends on the responsible signal peptide (SP) in combination with the target protein and the given host and cannot be predicted so far. Due to its high plasmid stability, the lack of alkaline extracellular proteases and only few contaminating extracellular host proteins, Priestia megaterium provides a promising alternative to common Bacillus species. For the development of an easy and fast cloning and screening system to identify the SP best suited to a distinct protein, a plasmid-based SP library containing all predicted 182 Sec-dependent SPs from P. megaterium was established. The splitting of the SPs into 10 groups of individual multi-SP plasmids (pMSPs) allows their grouped amplification and application in screening approaches. The functionality of the whole library was demonstrated by enhancing the amount of the already well-secreted α-amylase AmyE by 1.6-fold. The secretion of a novel penicillin G acylase, which remained as insoluble protein inside the cells, as its native SP is unsuitable for secretion in P. megaterium, could be enhanced even up to 29-fold. Overall, only around 170 recombinant P. megaterium clones based on 50 inserted SPs had to be screened to achieve sufficient amounts for further enzyme characterizations. Thus, this newly developed plasmid-based genetic tool applicable for P. megaterium and also other Bacillus species facilitates the identification of suitable SPs for secretion of recombinant proteins.


Introduction
For decades, the biotechnological production of recombinant proteins has played an important role in industry [1]. The emergence of recombinant DNA technology in the 1970s led to an enormous increase of potential applications in the area of biopharmaceuticals and industrial enzymes [2,3]. The secretion of recombinant proteins to the extracellular environment has advantages over their intracellular production. The downstream processing is easier, non-denaturing, and cost-effective due to the purification of correctly folded proteins from the cell-free culture broth without cell disruption [4,5]. Furthermore, the secretion of recombinant proteins can facilitate continuous cultivation [6]. Due to the architecture of their cell wall, Gram-positive bacteria are especially well suited to the secretion of proteins. They are characterized by an intrinsic capacity to secrete proteins directly into the surrounding medium because of the lack of an outer membrane [4,7].

Bacterial Strains and Construction of Expression Vectors
The cloning was carried out in Escherichia coli strain DH10B (Life Technologies, Carlsbad, USA). For screening experiments and protein production, the protease-deficient P. megaterium strain MS941 was used [27].
The plasmid p3STOP1623hp [28] (Table S1) served as the starting point for the construction of expression vectors. In one cloning step, its multiple cloning site (MCS) was extended (restriction sites for KasI, MluI, AvrII, NotI, NheI, and AgeI) and the coding sequence of a His6-tag was integrated via oligonucleotides (Table S2) using the restriction site AgeI, resulting in plasmid pTKSP0. Next, the amyE gene of B. subtilis 168 (GenBank: CAB12098.2) encoding the α-amylase AmyE was amplified by PCR from genomic DNA without the coding sequence of its native signal peptide (SP) and integrated into pTKSP0 via the restriction sites NgoMIV and AgeI, resulting in the plasmid pTKSPamyE0. As a control, the coding sequence of the previously used SP of lipase A (splipA [32]) from P. megaterium was amplified by PCR with primers (Table S2) from pEJBmD1.3scFv [33] and inserted into pTKSPamyE0 via the restriction sites BsrGI and NgoMIV, yielding pTKSPamyElipA. Furthermore, the bmaspga gene was amplified by PCR from vector pRBBm317 [30] without its SP coding sequence and integrated into pTKSP0 via the restriction sites NgoMIV and NheI, resulting in the plasmid pJMBm75. The success of the cloning procedures was verified by DNA sequencing (Microsynth Seqlab GmbH, Göttingen, Germany).

Construction of Multisignal Peptide Plasmids (pMSP)
Sec-dependent SPs of P. megaterium DSM 319 were identified with the online tool SignalP 4.0 [15]. The SP coding sequences were sorted by size and distributed across ten multi-signal peptide plasmids (pMSPs, Tables S3-S12). The coding sequences of all SPs, in groups of 18 or 20 SPs, were arranged in alternating orientations and flanked by the restriction sites BsrGI (5′) and NgoMIV (3′). All pMSPs were synthesized by General Biosystems (Morrisville, NC, USA).

Cloning of SP Coding Sequences into Expression Vectors
For cloning of SP coding sequences from pMSPs into the obtained plasmids, the multi-SP plasmids (pMSP1-10) and the expression vector pTKSPamyE0 or pJMBm75 were digested with BsrGI and NgoMIV. To prevent ligation of more than one SP coding sequence into one vector, the ends of the obtained SP coding sequences were dephosphorylated with shrimp alkaline phosphatase (rSAP). Subsequently, the DNA fragments were separated by agarose gel electrophoresis, and the bands corresponding to the expression vector and the SP mix were excised and purified. The digested vectors were purified using the NucleoSpin PCR and Gel Clean-Up Kit (Macherey-Nagel, Düren, Germany), whereas the SP mix was purified using the QIAEX II Gel Extraction Kit (QIAGEN, Hilden, Germany). Four µL (50 ng) of purified vector fragment and 4.5 µL (200 ng) of SP mix were ligated using T4 DNA ligase. Next, E. coli DH10B cells were transformed with the ligation preparations to generate a high number of isolated colonies representing clones with varying SPs. All clones were suspended together in LB medium (10 g L⁻¹ tryptone, 5 g L⁻¹ yeast extract, 5 g L⁻¹ NaCl) and the plasmid mix was purified in one step using the NucleoSpin Plasmid Kit (Macherey-Nagel, Düren, Germany). Finally, P. megaterium MS941 was transformed with the plasmid mix by protoplast transformation [34]. Isolated single colonies were used for the screening approach.

Cultivation in a Microbioreactor System
For the screening of different SPs, the cultivation of P. megaterium MS941 containing plasmids with differing SPs was carried out in 48-well flower plates in a BioLector system (m2p-labs, Baesweiler, Germany) at 37 °C and 1400 rpm for 24 h. One mL of LB medium supplemented with 10 mg L⁻¹ tetracycline, 0.5% (w/v) xylose and, for PGA secretion, 2.5 mM CaCl2 was used per well. Each well was inoculated from the corresponding overnight pre-culture to an initial optical density at 578 nm (OD578) of 0.055. During cultivation, growth was monitored by scattered light at 620 nm. After 24 h, the flower plate was centrifuged to harvest the cells (3300× g, 20 min, 4 °C) and the cell-free supernatant was used for activity determination.
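The inoculation to a defined starting OD578 follows the usual dilution relation (OD_pre × V_inoculum = OD_target × V_final). The following is a minimal sketch of that arithmetic; the function name and the example pre-culture OD of 5.5 are illustrative assumptions and are not taken from the study, and the final volume of 1 mL is assumed to include the inoculum.

```python
def inoculum_volume_ul(preculture_od, target_od=0.055, final_volume_ul=1000.0):
    """Volume of pre-culture (in µL) needed to reach target_od in final_volume_ul.

    Uses the dilution relation OD_pre * V_inoculum = OD_target * V_final.
    Assumption: the final volume (1 mL per well) already includes the inoculum.
    """
    if preculture_od <= target_od:
        raise ValueError("pre-culture must be denser than the target OD")
    return target_od * final_volume_ul / preculture_od


# Hypothetical overnight pre-culture with OD578 = 5.5 (illustrative value only)
volume = inoculum_volume_ul(5.5)
print(f"Inoculate with about {volume:.1f} µL of pre-culture per well")
```

In practice the pre-culture OD would be measured for each clone, so the inoculum volume differs from well to well.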

Cultivation in Shake Flasks and Protein Analyses
For shake flask cultivations, overnight pre-cultures of the corresponding P. megaterium MS941 plasmid strains were used to inoculate 50 mL LB medium supplemented with 10 mg L⁻¹ tetracycline and 2.5 mM CaCl2. The cultures were incubated at 37 °C and 200 rpm. At an OD578 of 0.3 to 0.5, recombinant gene expression was induced by adding 0.5% (w/v) xylose. After further cultivation at 37 °C and 200 rpm for 22 h, cells were separated from the protein-containing supernatant by centrifugation.
For the analysis of extracellular proteins, these were precipitated from the cell-free supernatant by addition of ammonium sulfate. For this, 1.5 mL of cell-free supernatant was mixed with 660 mg ammonium sulfate and shaken at 1000 rpm for 2 h at 4 °C. After centrifugation at 17,000× g and 4 °C for 30 min, the precipitated proteins were suspended in 20 µL ddH2O and analyzed by SDS-PAGE.
For the analysis of intracellular proteins, cell sediments of two OD equivalents were suspended in 20 µL digestion buffer (100 mM Na3PO4, 5 mg mL⁻¹ lysozyme, 2 µL mL⁻¹ benzonase, pH 6.5) and incubated for 30 min at 37 °C and 1000 rpm. After centrifugation (17,000× g, 4 °C, 15 min), the supernatant was used to analyze soluble intracellular proteins by SDS-PAGE. The sediment obtained after centrifugation was suspended in 20 µL 8 M urea. After centrifugation (17,000× g, 4 °C, 10 min), the supernatant was used to analyze the insoluble intracellular proteins by SDS-PAGE.

Amylase Activity Assay
The Phadebas® Amylase test (Kristianstad, Sweden) was used to determine the enzyme activity of α-amylase in the cell-free supernatant. When α-amylase degrades the starch, the embedded dye is released. The measured absorbance at 620 nm is proportional to the converted substrate and thus to the amount of secreted α-amylase. One hundred microliters of cell-free culture medium obtained after centrifugation were mixed with 800 µL Phadebas® Amylase test solution and incubated at 37 °C with shaking at 1400 rpm for 15 min. Subsequently, the reaction was stopped by the addition of 600 µL of 500 mM NaOH. The mixture was centrifuged (21,000× g, 10 min) and the supernatant was analyzed at 620 nm.

Colony PCR for Identification of Signal Peptides
For the identification of the corresponding SPs, colony PCR was performed and the resulting fragments were sequenced. P. megaterium colonies were suspended in lysis buffer containing 150 mM Tris-HCl (pH 8), 1 mg mL⁻¹ lysozyme, and 100 µg mL⁻¹ proteinase K and incubated for 20 min at 37 °C and 400 rpm, followed by another 20 min at 55 °C and 400 rpm. Subsequently, the enzymes were heat-inactivated at 95 °C and the samples were centrifuged at 21,000× g for 10 min. The supernatant was used as template for the PCR reaction. For colony PCR, Taq polymerase and primers SeqpXylA_fw and SeqpMM1520_rv were used (Table S2). The resulting fragments were purified and the coding sequences of the SPs were identified by DNA sequence analysis. Alternatively, P. megaterium colonies were directly submitted for sequencing using Ecoli NightSeq (Microsynth Seqlab GmbH, Göttingen, Germany), which had previously been shown to work for the Gram-positive P. megaterium.

Results
Secretion of recombinant proteins, in contrast to intracellular accumulation of proteins, offers advantages such as simplified product recovery or the possibility of continuous cultivation [34]. For this purpose, the Gram-positive soil bacterium P. megaterium is of biotechnological relevance as it secretes recombinant proteins directly into the surrounding medium with only low amounts of host proteins compared to other secreting bacteria such as B. subtilis [30,36].
The amount of a secreted protein depends on the combination of the N-terminal signal peptide (SP), responsible for translocation, and the protein itself. However, as bioinformatics predictions of efficiency are not yet possible, for each protein of interest the most effective SP has to be found experimentally [5].

Construction of a Signal Peptide Test System in P. megaterium
First, we developed a plasmid setup that allows a simple and standardized exchange of both the coding sequence of the SP and the corresponding gene of the protein to be secreted. Thereby, optimal combinations of SP and protein can be identified while the plasmid scaffold remains unchanged. Consequently, the secretion efficiency is affected only by the SP, and the amount of secreted protein should directly reflect the functionality and secretion efficiency of the SP. The new plasmid system is based on the plasmid p3STOP1623hp, an established shuttle vector for cloning in E. coli and xylose-dependent recombinant protein production in P. megaterium [28]. Downstream of the optimized xylose-inducible promoter PxylAopt, a multiple cloning site (MCS) is located [37], which was extended by five recognition sites near the 3′ end in this study. In addition, the coding sequence of a His6-tag followed by a stop codon was introduced downstream of the MCS to allow affinity chromatographic purification of the target protein, resulting in the new plasmid pTKSP0 (Figure 1). Due to the large selection of restriction sites, pTKSP0 allows easy cloning of the SP coding sequence in the 5′ part of the MCS and of the target gene in the 3′ part.
For cloning of the SP coding sequence at the 5′ end of the MCS, the restriction enzyme BsrGI can be chosen, whose recognition site is located between the ribosome binding site (RBS) and the start codon of the following coding sequence, so that the original 5′ end remains intact, resulting in the native N-terminus of the SP. The fusion of the SP coding sequence and the target gene results in the insertion of two amino acids at the protein level between SP and target protein (Figure 1). These two residues occupy the so-called +1 and +2 positions relative to the signal peptidase I (SPase I) cleavage site and can affect the efficiency of SPase I. Cleavage efficiency depends on the N-terminal amino acid of the protein (+1) and on the C-terminal residues of the SP itself at positions −3, −2, and −1 [10]; the latter remain unchanged during the cloning procedure. Furthermore, the two additional +1 and +2 amino acids may influence the properties of the protein, since they remain at the N-terminus of the secreted protein after cleavage of the SP. Hence, the restriction enzyme NgoMIV was chosen, whose in-frame recognition sequence gccggc results in the insertion of the small and uncharged residues alanine and glycine, which are expected to have the least effect on the secreted protein through steric hindrance or other interactions. In addition, both amino acids have been described as not negatively influencing the cleavage of the SP by SPase I [10,38]. At the 3′ end, the in-frame recognition site of AgeI results in the fewest additional amino acid residues between the protein of interest and the His6-tag, but all other restriction sites of the MCS can also be used, especially for cloning without His6-tag fusion.
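The effect of the in-frame NgoMIV site on the protein sequence can be checked with a short translation sketch. The codon table below is the standard genetic code; the SP and target coding sequences are short made-up placeholders (not sequences from the library), and the helper names are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

NGOMIV_SITE = "gccggc"  # in-frame recognition sequence named in the text
AGEI_SITE = "accggt"    # AgeI recognition sequence (A^CCGGT)


def translate(dna):
    """Translate an in-frame DNA sequence into one-letter amino acid codes."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def fuse(sp_cds, target_cds):
    """Join an SP coding sequence and a target gene via the NgoMIV site."""
    return sp_cds + NGOMIV_SITE + target_cds


# Hypothetical placeholder sequences, for illustration only.
sp_cds = "atgaaaaaagca"   # encodes M-K-K-A
target_cds = "gataaa"     # encodes D-K

print(translate(NGOMIV_SITE))               # residues inserted at +1 and +2
print(translate(fuse(sp_cds, target_cds)))  # SP, Ala-Gly linker, target
```

Translating the AgeI site in the same way (`translate(AGEI_SITE)`) gives threonine and glycine, the two residues that would sit between the protein of interest and the His6-tag in this reading frame.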

Figure 1.
Schematic representation of the molecular components of the plasmid-based SP test system in P. megaterium based on the plasmid pTKSP0. The SP coding sequences including their native start codon can be cloned in the 5′ part of the MCS, ideally using the BsrGI (5′) and NgoMIV (3′) sites. The target gene can be cloned in the 3′ part of the MCS using the corresponding recognition sites, optionally as a fusion to the His6-tag coding sequence. On the protein level, the inserted NgoMIV restriction site leads to the indicated additional alanine and glycine residues between SP and target protein, which now represent the +1 and +2 positions relative to the signal peptidase I (SPase I) cleavage site. The SPase I cleavage site is defined by positions −3, −2, −1 (last residues of the SP) and position +1 of the target protein. PxylAopt: optimized xylose-inducible promoter; RBS: ribosome binding site.

Identification of Sec-Dependent Signal Peptides from P. megaterium DSM 319
To identify Sec-dependent SPs used for protein secretion in P. megaterium, all annotated open reading frames (ORFs) of P. megaterium strain DSM 319 were translated into protein sequences. Possible N-terminally located Sec-dependent SPs were predicted using the web-based program SignalP 4.0 [15], based on a calculated discrimination score (D-score), a measure to distinguish SPs from non-SPs. Amino acid sequences with a D-score greater than 0.45 were considered Sec-dependent SPs of P. megaterium. According to this procedure, 182 SPs with lengths between 10 and 56 amino acids were identified in the P. megaterium DSM 319 genome (Tables S3-S12).
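The selection step with a discrimination score cutoff of 0.45 can be sketched as a simple filter over prediction results. This sketch does not call SignalP itself; it assumes the predictions have already been parsed into records, and the record layout, identifiers, and example values are hypothetical.

```python
from typing import NamedTuple


class Prediction(NamedTuple):
    """One parsed prediction record (hypothetical layout, not SignalP's format)."""
    protein_id: str
    d_score: float
    sp_length: int  # predicted SP length in amino acids


D_SCORE_CUTOFF = 0.45  # cutoff stated in the text


def select_signal_peptides(predictions, cutoff=D_SCORE_CUTOFF):
    """Keep only proteins whose discrimination score is strictly above the cutoff."""
    return [p for p in predictions if p.d_score > cutoff]


# Made-up example records for illustration only.
example = [
    Prediction("orf_0001", 0.82, 27),
    Prediction("orf_0002", 0.45, 31),  # exactly at the cutoff: not selected
    Prediction("orf_0003", 0.12, 0),
    Prediction("orf_0004", 0.46, 22),
]

selected = select_signal_peptides(example)
print([p.protein_id for p in selected])
```

The comparison is strict ("greater than 0.45"), so a record that sits exactly on the cutoff is excluded.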

Construction of Multisignal Peptide Plasmids (pMSPs) for P. megaterium
In previous publications, the coding sequences of the SPs were often individually amplified by PCR [17,19], which allows for individual mixing of different SPs but also implies a higher workload and higher cost. Here we present a new approach in which the coding sequences of the SPs are located on plasmids and separated by restriction sites, so that their fast and easy amplification, restriction, and cloning is possible. For this, the coding sequences of the 182 predicted SPs of P. megaterium were arranged on ten multi-signal peptide plasmids (pMSPs) (Figure 2). pMSP1 to pMSP9 each contain 18 and pMSP10 contains 20 coding sequences for SPs. The sequences were sorted by size, so that the longest SP sequences were grouped on pMSP1 and the shortest on pMSP10 (Tables S3-S12), to prevent possible preferential incorporation of long or short sequences into the target vector. The SP coding sequences were assembled as cassettes on the plasmids and arranged in alternating orientation, so that BsrGI is at the 5′ end and NgoMIV at the 3′ end for ligation in the correct orientation into the target vector. The plasmids were synthesized by General Biosystems (Morrisville, NC, USA) (Figures 1 and 2).
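The grouping of 182 size-sorted SPs into nine plasmids of 18 and one plasmid of 20 sequences can be illustrated with a small partitioning sketch. The function name and the dummy length values are assumptions made for illustration; the actual assignment is listed in Tables S3-S12.

```python
def group_signal_peptides(sp_lengths, n_plasmids=10, base_size=18):
    """Sort SP lengths from longest to shortest and split them into groups.

    The first n_plasmids - 1 groups receive base_size entries each; the last
    group takes the remainder (20 entries for 182 SPs and the default values).
    """
    ordered = sorted(sp_lengths, reverse=True)
    cut = (n_plasmids - 1) * base_size
    groups = [ordered[i:i + base_size] for i in range(0, cut, base_size)]
    groups.append(ordered[cut:])
    return groups


# Dummy lengths between 10 and 56 amino acids (illustrative, not the real data).
dummy_lengths = [10 + (i % 47) for i in range(182)]
groups = group_signal_peptides(dummy_lengths)

for number, group in enumerate(groups, start=1):
    print(f"pMSP{number}: {len(group)} SPs, lengths {max(group)} to {min(group)}")
```

Because the input is sorted before splitting, each group spans only a narrow length range, which is the property the text relies on to avoid preferential incorporation of long or short fragments.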

Figure 2.
Map of a multi-signal peptide plasmid (pMSP). All pMSPs carry a resistance gene (bla) for carbenicillin resistance (striped), the ori of pBR322 for E. coli (black) and a DNA cassette encoding the individual SPs (gray). This DNA cassette is shown in more detail in the image below. It contains coding sequences of 18-20 SPs from P. megaterium DSM 319 flanked by BsrGI (upstream) and NgoMIV (downstream) recognition sites. The black arrows indicate the respective orientation of coding sequences of the SPs.
To investigate the functionality of the developed plasmid-based SP test system, a screening system in P. megaterium was established (Figure 3). First, the target gene has to be cloned into pTKSP0 using NgoMIV as the 5′ recognition site and one of the following 3′ sites of the MCS (Figure 1). Subsequently, both the constructed target vector and one or more pMSPs have to be digested using BsrGI and NgoMIV. The fragments representing the SP coding sequences need to be dephosphorylated to prevent incorporation of multiple sequences into one target vector. After ligation of the SP mix with the target vector, E. coli has to be transformed with the corresponding ligation reaction, which allows a rapid amplification of the plasmid mix. A direct transformation of P. megaterium is not possible due to its low transformation efficiency [39]. After isolation of the amplified plasmid mix from all E. coli clones, P. megaterium can be transformed with the mixture and individual clones can be screened for their secretion capability. An individual activity assay has to be established for each applied protein in order to use activity as a measure of the amount of protein secreted.
Microorganisms 2022, 10, x FOR PEER REVIEW 8 of 20

Screening of Signal Peptides of All pMSPs Regarding Secretion of Model Protein α-Amylase
To validate the new system, the α-amylase AmyE of B. subtilis was chosen as a model protein to observe, compare, and evaluate the secretion success achieved by different SPs. AmyE is characterized by its high stability, does not inhibit growth, and has no toxic effects on the host organism [40,41]. Furthermore, the secretion of amylases is widely used as a reporter system, as it allows screening in liquid cultures as well as on starch-containing LB agar by a colorimetric assay based on its enzymatic reaction, in which α-1,4-glycosidic bonds of polysaccharides are endohydrolytically cleaved [40].
The amyE gene and subsequently the SP coding sequences from pMSP1 to 10 were introduced into pTKSP0. After amplification in E. coli, P. megaterium was transformed with the obtained plasmid mixes, yielding 10 libraries of P. megaterium clones containing plasmids with amyE and SPs from one pMSP each. After cultivation of 45 individual clones per pMSP in a microbioreactor system, the amylase activities were determined in the cell-free supernatant. They were compared to the activity resulting from recombinant AmyE secretion with the SP of lipase A from P. megaterium ATCC 9885 (pTKSPamyElipA), for which previous work has already demonstrated a high secretion efficiency of recombinant proteins with P. megaterium [22]. As a negative control, recombinant AmyE production without SP (pTKSPamyE0) was chosen. The amylase activity in the cell-free supernatant of the 450 cultivations was related to the scattered light at 620 nm of the culture as a measure of the formed biomass (Figure 4).
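The normalization described above, activity related to scattered light and expressed relative to the lipase A control, can be sketched as follows. The function names and all numbers are illustrative assumptions; in particular, the sketch assumes no blank subtraction, which the text does not specify.

```python
def relative_activity(a620, scattered_light, control_a620, control_scattered_light):
    """Biomass-related activity as a percentage of the positive control.

    Each assay reading (absorbance at 620 nm) is divided by the scattered-light
    signal of its culture; the ratio of the two quotients is given in percent.
    Assumption: readings are used as measured, without blank subtraction.
    """
    specific = a620 / scattered_light
    control = control_a620 / control_scattered_light
    return 100.0 * specific / control


def summarise(percentages, low_cutoff=10.0):
    """Count clones below the low cutoff and clones above the control (100%)."""
    return {
        "n": len(percentages),
        "below_cutoff": sum(v < low_cutoff for v in percentages),
        "above_control": sum(v > 100.0 for v in percentages),
    }


# Made-up readings for illustration only.
print(round(relative_activity(0.8, 100.0, 0.5, 100.0), 1))
print(summarise([0.0, 5.0, 50.0, 161.0, 120.0]))
```

Relating activity to scattered light keeps a well-growing clone from appearing to be a better secretor than a slower-growing one with the same secretion per unit of biomass.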


Figure 4.
Activity of α-amylase AmyE in the cell-free supernatant recombinantly secreted by P. megaterium MS941 with SPs from pMSP1 to 10. The measured enzyme activity was related to the scattered light at 620 nm of the culture. Enzyme activity is given as a percentage and refers to the positive control (100%), in which the α-amylase is recombinantly secreted by the SP from lipase A of P. megaterium ATCC 9885. Cultivation was performed in a microbioreactor system with 1 mL of LB medium supplemented with tetracycline (10 µg mL⁻¹) and 0.5% (w/v) xylose at 37 °C and 1400 rpm. After 24 h, α-amylase activity in the cell-free culture supernatant was determined photometrically (OD620).
The photometric screening of the SP-AmyE library with respect to the amount of secreted α-amylase in the culture supernatant revealed strong differences, ranging from no (0%) to 161% secreted amylase compared to the control. While around 84% of the clones showed no or less than 10% amylase activity in the supernatant compared to the control, a higher amylase activity than the control was detected for 18 clones (4%), caused by SPs from pMSP2-4 and pMSP7-10. DNA sequencing revealed that enhanced AmyE secretion was caused by SPs originating from conserved hypothetical proteins with unknown function (nine of the 18 SPs) and also from proteins with functions ranging from flagellum and peptidoglycan biosynthesis (three) to hydrolases (four) and transferases (two) (Table 1).
The length of SPs leading to increased AmyE secretion varied between 21 and 31 amino acids in our study, while SPs ranging from 10 to 56 amino acids were predicted and tested (Table 1 and Tables S3-S12). However, 75% of the P. megaterium SPs are in the range of 21 to 31 amino acids, while only 7% of the SPs are shorter and 18% are longer.
The results of the first screening clearly showed the influence of SPs on protein secretion. With the system established here, several SPs could be identified, which led to a very good secretion efficiency for amylase. Although the secretion efficiency for α-amylase AmyE with the SP of lipase A used as control was already high, a further 1.6-fold increase in secretion efficiency could be achieved using the constructed SP library for P. megaterium. In a next step, it was of great interest to apply this new system to other proteins with previously poor secretory properties.

Application of the SP Library for Improved Secretion of Penicillin G Acylase
Next, our SP library was applied to the secretion of industrially relevant penicillin G acylase (PGA), a heterodimeric enzyme with a molecular weight of around 90,000, which is used for the production of semisynthetic β-lactam antibiotics [30,42,43]. Recently, newly identified PGAs from different Bacillus species were found to be recombinantly secretable by P. megaterium via their Sec-dependent native SPs [30]. However, the amount of recombinant PGA in the cell-free supernatant differed significantly. While the PGAs from Bacillus thermotolerans (BtPGA) and B. sp. FJAT-27231 (FJAT-PGA) guided by their native SPs were secreted in similar amounts as the industrially used PGA from P. megaterium (BmPGA), allowing their purification and characterization, hardly any secretion was observed for the PGA from B. massiliogorillae (BmasPGA) even 22 h after induction of the recombinant protein production ( Figure 5A) [30].
lyzed. In the patterns of the intracellular soluble proteins, no differences between th strain containing the pga encoding plasmid and a negative control could be detected (Fig  ure 5B). When insoluble proteins were analyzed, additional bands representing protein with a relative molecular weight of 80,000 to 90,000 were observed 3 h after induction o recombinant protein production, which were missing in the negative control. When ex ported via the Sec-dependent pathway, PGAs are initially produced as a preproprotei with SP, α-subunit, linker, and β-subunit. Only after export, the SP and linker are cleave and the PGA is folded into its active conformation [44]. Consequently, the observed band in the patterns of the insoluble proteins could be the not yet or incorrectly folded insolub preproPGAs. As obviously the export of recombinant BmasPGA with its native SP repre sents the main bottleneck, it should be improved by screening SPs from P. megaterium. After constructing the new target vector (Figure 1) carrying the pga gene from B. ma siliogorillae (pJMBm75), SPs from pMSP3 to 5 were inserted as described above. SPs from pMSP3 and 4 were chosen as they caused the most robust results for the secretion of re combinant AmyE. In contrast, the functionality of SPs from pMSP5 should be tested fo BmasPGA as they showed poor secretion for amylase ( Figure 4). Single colonies of 17 recombinant P. megaterium clones were cultivated in a microbioreactor system for 22 h an the volumetric PGA activity in the cell-free supernatant was determined, serving as a indirect measure of the amount of secreted BmasPGA. By using alternative SPs from P megaterium DSM 319, the secretion of recombinant BmasPGA with P. megaterium could b significantly increased ( Figure 6). To exclude ineffective production of BmasPGA in general, the intracellular proteins of plasmid-carrying P. megaterium recombinantly producing different PGAs were analyzed. 
In the patterns of the intracellular soluble proteins, no differences between the strain containing the pga encoding plasmid and a negative control could be detected (Figure 5B). When insoluble proteins were analyzed, additional bands representing proteins with a relative molecular weight of 80,000 to 90,000 were observed 3 h after induction of recombinant protein production, which were missing in the negative control. When exported via the Sec-dependent pathway, PGAs are initially produced as a preproprotein with SP, α-subunit, linker, and β-subunit. Only after export are the SP and linker cleaved and the PGA folded into its active conformation [44]. Consequently, the observed bands in the patterns of the insoluble proteins could be the not yet or incorrectly folded insoluble preproPGAs. Because the export of recombinant BmasPGA with its native SP evidently represents the main bottleneck, it was to be improved by screening SPs from P. megaterium.
After constructing the new target vector (Figure 1) carrying the pga gene from B. massiliogorillae (pJMBm75), SPs from pMSP3 to 5 were inserted as described above. SPs from pMSP3 and 4 were chosen as they gave the most robust results for the secretion of recombinant AmyE. In contrast, SPs from pMSP5 were included to test their functionality for BmasPGA, as they had shown poor secretion of the amylase (Figure 4). Single colonies of 171 recombinant P. megaterium clones were cultivated in a microbioreactor system for 22 h and the volumetric PGA activity in the cell-free supernatant was determined, serving as an indirect measure of the amount of secreted BmasPGA. By using alternative SPs from P. megaterium DSM 319, the secretion of recombinant BmasPGA with P. megaterium could be significantly increased (Figure 6). When cell-free supernatants of 86 clones with SPs from pMSP3 were analyzed, 93% of the clones secreted higher amounts of BmasPGA compared to the secretion with its native SP. Eighty-five percent of the 46 BmasPGA clones with SPs from pMSP4 showed higher PGA activity in the supernatant than the reference strain. In contrast, when SPs from pMSP5 were used, only 28% of the clones resulted in increased activity in the supernatant, while the others resulted in even less secretion of BmasPGA compared to the secretion with the native Bmas-SP (Figure 6). The highest enzyme activities in the supernatant were obtained with SPs from pMSP3 (29-fold increase) followed by pMSP4 (21-fold), while with SPs from pMSP5 a 15-fold higher activity compared to the reference strain was still observed.
Next, SPs were identified by DNA sequencing. From pMSP3 and 4, SP-coding sequences from all clones were sequenced while for pMSP5 only SP-sequences from clones secreting BmasPGA were investigated. Interestingly, the individual SPs occurred with very different frequencies. For pMSP3, 14 of 18 possible SPs were found in the clones examined. The most frequently found SP was SP 6, which originated from the conserved hypothetical protein BMD_0560 from P. megaterium DSM 319, with 38% of all sequenced SPs (Table 2). For clones with SPs from pMSP4, 11 of 18 possible SPs were found but only half as many clones were studied as for pMSP3. The most frequent SP 9 from a protein of the ErfK/YbiS/YcfS/YnhG family occurred in 25% of the clones. In clones with SPs from pMSP5, seven of 18 possible SPs were detected in 12 sequenced clones. The most frequent SP 15 from a conserved hypothetical protein from P. megaterium DSM 319 occurred four times. In eight clones across all pMSPs, no SP could be found. This also correlates with almost no measured activity. In six clones, multiple SPs were detected (Table 2). Contrary to these results, it had been assumed that by amplifying the entire plasmid, all corresponding genes of the SPs would be equally abundant and accordingly equally distributed among the vectors with the bmaspga gene. However, this was not the case. It is possible that some SP genes formed secondary structures due to complementary segments, which altered the insertion rate. Furthermore, several SP genes might have paired with each other or attached to the vector via complementary stretches, so that fewer of these SP genes were freely available and they were less likely to ligate into the target vector.
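The equal-abundance expectation can be made concrete with a short calculation. The following Python sketch is an illustration and not part of the original analysis: it assumes that every sequenced clone carries exactly one of 18 equally likely SPs (ignoring the clones with no or multiple SPs) and uses the clone numbers stated in the text. The function name `expected_distinct` is hypothetical.

```python
def expected_distinct(n_types: int, n_clones: int) -> float:
    """Expected number of distinct SPs among n_clones clones,
    assuming each of n_types SPs is inserted with equal probability."""
    # Probability that one particular SP is never drawn in n_clones picks
    p_missed = (1.0 - 1.0 / n_types) ** n_clones
    return n_types * (1.0 - p_missed)


# (plasmid, clones sequenced, distinct SPs observed) as reported in the text;
# treating every clone as one independent draw is an assumption of this sketch.
observations = [("pMSP3", 86, 14), ("pMSP4", 46, 11), ("pMSP5", 12, 7)]

for plasmid, clones, observed in observations:
    expected = expected_distinct(18, clones)
    print(f"{plasmid}: observed {observed} of 18 SPs, "
          f"expected about {expected:.1f} under equal abundance")
```

Under this idealized model, roughly 17.9, 16.7, and 8.9 distinct SPs would be expected for 86, 46, and 12 clones, respectively. The observed numbers (14, 11, and 7) are lower in every case, which is consistent with an uneven insertion rate of the individual SPs.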
Next, it was analyzed whether the same SPs in different clones of the screening resulted in similar activity in the supernatant. As only a few clones of SPs of pMSP5 showed activity, the analysis was omitted. It was found that the same SPs resulted in similar activities in the supernatant with only a few outliers such as one clone with SP 9 from pMSP4. This may have resulted from a poorly grown culture, as all other 10 clones with this SP showed a narrow distribution of activity (Figure 7). In summary, all SPs except SP 17 from pMSP4 resulted in higher activity in the supernatant compared with secretion with the native SP (Figures 6 and 7, Table 2). The highest amount of secreted BmasPGA was achieved by the SP of a β-amylase from P. megaterium DSM 319 (SP 2 on pMSP3) with an activity of 0.37 to 0.43 U mL⁻¹ and the SP of an extracellular ribonuclease (SP 5 of pMSP3, 0.25 U mL⁻¹). In addition, the SPs from a D-alanyl-D-alanine carboxypeptidase (SP 3 of pMSP3), an alkaline phosphatase (SP 7 of pMSP3), an ErfK/YbiS/YcfS/YnhG family protein (SP 9 of pMSP4), two conserved hypothetical proteins, BMD_0470 (ADF37360.1, SP 11 of pMSP4) and BMD_3362 (SP 2 of pMSP5), a hypothetical protein BMD_2542 (SP 1 of pMSP5), and L-asparaginase II (SP 9 of pMSP5) led to a high amount of secreted BmasPGA of up to 0.43 U mL⁻¹.

Scale-Up of Penicillin G Acylase Secretion for Verification of the Screening Results
To verify these promising screening results, the best BmasPGA secreting strains were cultivated at 50 mL scale in shake flasks and the cell-free supernatant was analyzed 22 h after induction by measuring enzyme activity and SDS-PAGE analyses of secreted proteins (Figures 8 and S1). All investigated clones showed PGA activities in the supernatant which were between 7.1- and 16.5-fold higher compared to the reference strain with the native SP (Figure 8, Table 3). Additionally, all identified SPs led to a large amount of secreted BmasPGA visible as 25 kDa α-subunit and 60 kDa β-subunit, while for the native SP hardly any secretion was detectable (Figure S1). The highest protein amounts also correlated to the highest measured PGA activities (Figures 8 and S1). Both in screening as well as upscaling, the activity and amount of secreted BmasPGA was the highest with the clones containing the SP of β-amylase (pMSP3-SP 2). Thus, although the increase was slightly smaller than in the screening, the improved secretion based on the alternative SPs could be reproduced on a larger scale. The highest activity of heterologous BmasPGA found (0.36 U mL⁻¹) was in the range of native BmPGA (0.5 U mL⁻¹) [30] in the cell-free supernatant.

Figure 8. [...] Figure 6) and upscaling (black). The clones with the native SP as control and the new SPs were cultured at 37 °C in shake flasks (50 mL culture volume). Twenty-two hours after induction of recombinant PGA secretion, the PGA activity was determined in cell-free supernatant by NIPAB assay in triplicates at 37 °C. pMSP3 to 5 indicate the corresponding multi-SP plasmids; SP X indicates the corresponding SP of the given plasmid as presented in Table 2.
When considering the original proteins guided by the SPs enhancing secretion of BmasPGA, five belong to the enzyme class of hydrolases like the PGA, one belongs to the transferases, three are hypothetical proteins, and one protein is utilized in the biosynthesis of peptidoglycan (Table 3). As the majority of well-suited SPs originate from proteins belonging to the same enzyme class of hydrolases as the PGA, there could be a correlation.
A comparison with the secretion of AmyE, also a hydrolase, shows that some SPs, such as the one from the ErfK/YbiS/YcfS/YnhG family protein, led to high secretion of both AmyE and BmasPGA, whereas other SPs, such as those from levansucrase (like AmyE a saccharide-degrading enzyme) or from the conserved hypothetical protein BMD_3012, were among the best SPs for AmyE secretion but below average for BmasPGA secretion.

Discussion
Due to the lack of the outer membrane, Gram-positive bacteria are well suited to the secretion of recombinant proteins directly to the surrounding growth medium, simplifying protein purification [4,5]. N-terminal SPs mediate the translocation across the cytoplasmic membrane, and varying numbers of SPs have been bioinformatically predicted and identified for a range of Gram-positive bacteria. As early as 2000, Tjalsma et al. predicted 180 potential SPs for B. subtilis strain 168, of which 14 were hypothetically assigned to the Tat-dependent and 166 to the Sec-dependent pathway with lengths between 19 and 44 amino acids [38]. Later, 173 of these identified SPs were attributed to the Sec-dependent pathway and tested according to their secretion of different target proteins [17,19]. Additionally, 220 SPs from Bacillus licheniformis DSM 13 together with the 173 SPs of B. subtilis were applied to optimize the secretion of a protease in both organisms [45]. For Lactobacillus plantarum WCFS1, 76 SPs with lengths ranging from 24 to 57 amino acid residues were predicted and applied to improve the secretion of a nuclease [21]. For Corynebacterium glutamicum, 405 SP candidates were predicted, of which 90 SPs with lengths between 21 and 53 amino acids were proven to be Sec-dependent [46]. In our study, 182 Sec-dependent SPs from P. megaterium DSM 319 were predicted, which is within the range reported for Bacillus species. The lengths of the identified P. megaterium SPs range from ten to 56 amino acids. In 2012, Payne and coworkers described the minimal length of SPs for different species to be 10 amino acids, divided into two, five, and three residues for the n-, h-, and c-domains of the SP, respectively [13]. Hence, although the minimal length of ten amino acids is shorter than the minimal lengths of the SPs described above, these short SPs were included in our screenings to test their functionality. However, most of these short SPs did not show high secretion capacity for our model enzyme α-amylase (Figure 4), so they were not analyzed further in the subsequent screening of BmasPGA.
The identification and screening of SP libraries for many industrially relevant production hosts mentioned above underlines the importance of improving secretion of recombinant proteins. For this purpose, P. megaterium shows lower protease activity in the surrounding medium compared to B. subtilis [31] and the introduced recombinant plasmids show high stability [24]. In addition, P. megaterium is particularly advantageous because it secretes a remarkably low amount of potentially contaminating host proteins to the surrounding medium so that downstream processing of a recombinant target protein is additionally favored [30] (Figure S1). Thus, with the identified P. megaterium SPs, a method to improve protein secretion for P. megaterium by SP screening using a plasmid-based SP library was established for the first time. Although the prediction of SPs using different tools has been possible for many years [14-16], so far it has not been possible to predict the efficiency of an SP for a given protein [19,47], resulting in laborious screening approaches [5]. In previous screenings with all B. subtilis SPs, each of the 173 SPs was amplified individually by PCR and B. subtilis was transformed individually with the constructed plasmids. This great effort for individual amplification, cloning, and transformation ensured that all SPs were tested [17]. Another possibility is the individual amplification of all 173 B. subtilis SPs, and their ligation as a mix into vectors followed by the transformation of B. subtilis with the ligation mix [19].
However, the major advantage of our new plasmid-based SP test system is a greatly reduced workload due to the amplification of the coding sequences of the SPs in E. coli instead of a PCR amplification before ligation to the target gene. As P. megaterium has a low transformation efficiency [39], the transformation of E. coli with the ligation mixture was necessary to achieve a high concentration of the ligation products. This step could provide a helpful strategy also for other hosts with low transformation rates such as undomesticated isolates of B. subtilis [48], Lactobacillus plantarum, or Lactobacillus buchneri [49]. For analyzing a small number of specific SPs of P. megaterium, individual PCR amplification can be easily used [28]. Nevertheless, the division of SPs into size-related groups of 18 to 20 SPs, each group on one pMSP, reduces the screening effort by organization into individual screening experiments.
The sequencing of SP-coding sequences of our clones from the plasmid-based SP test system revealed that the individual SPs were represented with different frequencies, which seems to depend on the given recombinant protein (our unpublished data). Fu and coworkers (2018) screened more than 1500 clones for α-amylase secretion based on the 173 SPs of B. subtilis, which corresponded to a more than 8-fold oversampling. The SPs of the best 100 clones were sequenced and only 15 different SPs were identified, resulting in up to a 1.7-fold increase in secretion compared to the native SP [19]. In our study, the screening of 86 colonies of a cloning round with SPs from one pMSP represented 78% (14 SPs) of all possible 18 SPs, resulting in a more than 16.5-fold increase of BmasPGA secretion, while even the screening of only 12 clones already represents around 40% of the 18 SPs of another pMSP with an 8.5-fold increase. To cover more or even all SPs, the number of clones to be screened has to be increased accordingly. Nevertheless, as shown here, testing lower numbers of SPs can be sufficient to identify a suitable SP for secretion.
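The coverage and oversampling figures quoted above follow from simple ratios, which the following Python sketch reproduces. It is an illustration only; the helper names are hypothetical, and the last function additionally assumes equally abundant SPs (a classical coupon-collector estimate), which, as the sequencing results indicate, is an idealization that the actual library does not meet.

```python
def coverage(found: int, possible: int) -> float:
    """Fraction of the possible SPs that was recovered in a screening."""
    return found / possible


def oversampling(clones: int, library_size: int) -> float:
    """Number of screened clones per SP in the library."""
    return clones / library_size


def expected_clones_for_full_coverage(n_sps: int) -> float:
    """Expected clones needed to see every SP at least once,
    ASSUMING equal abundance of all SPs (coupon-collector estimate)."""
    return n_sps * sum(1.0 / k for k in range(1, n_sps + 1))


print(f"pMSP3 coverage: {coverage(14, 18):.0%}")            # 14 of 18 SPs
print(f"pMSP5 coverage: {coverage(7, 18):.0%}")             # 7 of 18 SPs
print(f"Oversampling in [19]: {oversampling(1500, 173):.1f}-fold")
print(f"Clones for all 18 SPs (ideal case): "
      f"{expected_clones_for_full_coverage(18):.0f}")
```

Because the insertion frequencies were found to be uneven, the idealized estimate of about 63 clones per pMSP should be read as a lower bound rather than as a screening target.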
During the screening and sequencing of SPs applied for α-amylase secretion, it was observed that a high d score, a measure for identification of an SP, does not correlate with strong secretion and vice versa (Table 1), as described previously [17]. This was also observed for the SPs leading to highly enhanced secretion of BmasPGA. Their d scores ranged from only slightly above the threshold value of 0.45 to high values of 0.88 (Table 3). Interestingly, the native SP of BmasPGA also shows a high d score of 0.77 although it hardly leads to recombinant secretion by P. megaterium. Consequently, d scores seem to be useful in predicting SPs but cannot be used to evaluate them, mainly because the secretion efficiency is determined by the combination of protein and SP and additionally depends on the host organism [5,15]. This could also be shown for the B. subtilis SP library, which was applied successfully in C. glutamicum, whereas the secretion efficiency of a recombinant protein based on a given SP differed significantly between the two organisms [47].

Conclusions
The plasmid-based SP library used here is the first SP screening system for the Gram-positive production host P. megaterium. Our results, as well as previous ones, confirm that it is still not possible to predict optimally suited SPs for the secretion of recombinant target proteins [19,47]. Therefore, the best-suited SPs need to be identified using a rapid and simple cloning and screening system. In our work, several suitable SPs were identified for the secretion of a novel penicillin G acylase from B. massiliogorillae (BmasPGA) that resulted in up to a 29-fold increase in the amount of protein secreted compared to secretion via the native SP. This drastic increase in production and secretion yields amounts high enough to allow purification from the cell-free supernatant and subsequent characterization of BmasPGA, and the same should be possible for other novel recombinant proteins using P. megaterium. Hence, our plasmid-based, easily amplifiable SP library will further widen the application of the interesting production host P. megaterium for the secretion of recombinant proteins and additionally could be applied for other hosts due to the functionality of our plasmids in a broad range of Bacillus species.
Supplementary Materials: The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/microorganisms10040777/s1, Table S1: List of plasmids, Table S2: List of primers, Table S3: List of signal peptides available at pMSP1, Table S4: List of  signal peptides available at pMSP2, Table S5: List of signal peptides available at pMSP3, Table S6: List of signal peptides available at pMSP4, Table S7: List of signal peptides available at pMSP5, Table S8: List of signal peptides available at pMSP6, Table S9: List of signal peptides available at pMSP7, Table S10: List of signal peptides available at pMSP8, Table S11: List of signal peptides available at pMSP9, Table S12: List of signal peptides available at pMSP10, Figure S1: Comparison of secreted amount of BmasPGA of selected clones in upscaling. Reference [50] is cited in the supplementary materials.

Data Availability Statement:
The data presented in this study are available within the article.