1. Introduction
Celiac disease (CD) is a hereditary autoimmune disorder characterized by chronic inflammation of the small intestinal mucosa accompanied by malabsorption due to persistent intolerance to the proline/glutamine(PQ)-rich plant storage proteins (prolamins) of wheat (gluten), rye and barley seeds [
1]. This disease is not associated with age, can occur at any time and is diagnosed in 1–2% of the world’s population. Over time, there is a high risk of developing oncological and autoimmune diseases and infertility, as well as nervous disorders. There is currently no treatment for CD, and the only effective way for sufferers to stay healthy is to follow a strict gluten-free diet, which is difficult to adhere to due to the serious restrictions it imposes, leading to asocialization and depressive states. Patients on this diet are deficient in vitamins and minerals and are prone to anemia and osteoporosis. In addition, 0.5–13% of the world’s population is diagnosed with gluten intolerance without celiac disease (non-celiac gluten sensitivity, or NCGS) [
2]. After eating foods with gluten, NCGS patients face the same problems as those with CD [
3]. In this regard, various therapeutic strategies are being developed to combat gluten intolerance.
One particularly significant approach is enzyme therapy, that is, the intake with food of peptidase preparations that can cleave hardly hydrolyzable prolamin peptides. This approach eliminates the cause of the disease—undercleaved PQ-rich peptides (PQ content in prolamins represents ~65–80% of the total number of amino acids). Currently, a significant number of glutenases of various origins (bacteria, fungi, plants, insects) and families (serine, cysteine, metallopeptidases) capable of hydrolyzing toxic immunogenic prolamin peptides have been identified [
4,
5]. It has been found that proline-specific glutenases are not as effective as post-glutamine cleaving peptidases, and a combination of two different activities is much more attractive. However, at present, the range of such proposed preparations is very limited and consists of enzymes from different sources, including pathogenic bacteria, that are not adapted to each other, which limits the possibility of their use.
It is not only safety issues that hinder the promotion of enzyme preparations in the pharmaceutical market. A serious disadvantage of glutenases as possible therapeutic agents is their rapid inactivation and instability in the conditions of the human stomach, i.e., at low pH [
4]. Attempts to stabilize enzymes through pharmaceutical modification, such as PEGylations and microencapsulation, have so far been very limited [
6]. There are two principal strategies to obtain pH-stable glutenase: (1) take another pH-stable peptidase and switch its specificity to prolamins, or (2) take an effective glutenase and make it pH stable.
The first strategy is exemplified by the Kumamax oral protein therapeutic [
7,
8], which has recently passed through phase 1 clinical trials [
9]. In this contribution, a protein design approach mediated by the Rosetta software suite was used to redesign the active site of the kumamolisin-As, known to be active at low pH. The engineered enzyme exhibited 116-fold greater proteolytic activity for a model gluten tetrapeptide than the native template enzyme, as well as an over 800-fold switch in substrate specificity toward immunogenic portions of gluten peptides.
In this work, we implement the second strategy, based on conferring pH stability to an enzyme that already has sufficient glutenase activity. Earlier, we have shown that the main digestive cathepsin L from the insect pest 
Tribolium castaneum (TcCathL1, Uniprot D6X519, NCBI NP_001164001) has high post-glutamine cleaving activity and is able to degrade 33-, 26-, 10- and 8-mer immunogenic prolamin peptides and their fluorogenic analogs [
10,
11]. Successful experiments were carried out both with the native enzyme isolated from the 
T. castaneum midgut extract [
10] and with the recombinant preparation obtained as a zymogen in the 
Pichia pastoris expression system followed by autoprocessing to the mature active enzyme [
11]. However, the disadvantage of the obtained preparations of cathepsin L was their low stability at acidic pH values (2–3) corresponding to the physiological conditions in the human stomach [
12]. 
The aim of this work was to create, based on molecular dynamics simulations and site-specific mutagenesis, recombinant mutant preparations of T. castaneum cathepsin L with post-glutamine cleaving activity that are stable in an acidic environment. This peptidase(s) can be further suggested as a lead enzyme for the development of oral medical preparations that fight CD and gluten intolerance in susceptible people.
  2. Results
Prior to attempting to design a pH-stable version of 
Tribolium castaneum cathepsin L, it would be valuable to uncover the reason for the enzyme’s instability at pH 2. Although molecular dynamics (MD) has proven to be a powerful method for the examination of protein conformational dynamics, it is, unfortunately, unable to explicitly model pH—since the simulated systems’ volumes typically do not exceed much 10 × 10 × 10 nm
3, there would not be a single free proton (H
+ or H
3O
+) for a whole pH range 2–7. While classical force fields are unable to explicitly model ionization (formation/breaking bonds), a constant-pH MD (CpHMD) approach [
13], based on a λ-dynamics formalism [
14], is being developed to dynamically assign the charge states of the titratable groups. However, here we used a simpler and widely accepted alternative: the assignment (at the molecular topology level) of the charge state of the ionizable groups, which corresponds to the chosen pH: e.g., at pH 7 acidic residues’ (glutamic and aspartic acids) side chains are negatively charged (deprotonated), while at pH 2 they accept H
+ and become non-charged. Conversely, histidine is neutral at pH 7 and positively charged (protonated) at pH 2.
While considering other probable causes of activity loss at pH 2 that are beyond the reach of MD (e.g., protein denaturation and proteolysis), we set a goal of revealing in the MD simulations 
conformational effects that protonated cathepsin L side chains (at pH 2) cause to the enzyme structure considering its mature form without propeptide (
Figure 1 and 
Figure 2A). To achieve this, we modeled cathepsin L at different pH values as follows:
- pH 7: all aspartic (14 per protein) and glutamic (10 per protein) acid residue side chains are deprotonated and negatively charged; histidines (2 per protein, including catalytic His 275) are also deprotonated and neutral; lysine (4 per protein) and arginine (5 per protein) are positively charged at pH 7 and 2; protein overall charge is −15. 
- pH “2 (all)”: all mentioned Asp/Glu/His are protonated; Asp and Glu are neutral and His are positively charged; protein overall charge is +11. 
- pH “2 (his)”: only catalytic His 275 is protonated; all other residues have their “pH 7” states; protein charge is −14. This “virtual” variant is created to reveal if it’s possible that His 275 pronation itself can cause conformational deviations at acidic pH. 
There is one aspect of cysteine proteases that we do not take into account in this research: experiments show that catalytic Cys 138 residue may be deprotonated at neutral pH due to His 275 proximity, which shifts cysteine’s p
Ka by as much as 5, down to 2.5–3.3 for ficin, caricain and papain [
15,
16], leading to the formation of the ionic pair Cys-S
−—His-Imidazole
+. However, the anionic form of cysteine is not implemented in most molecular modeling and dynamics software packages, so proceeding with this variant would lead to separate research focused on the force field parameterization and technical aspects of charged states modeling [
17] rather than biochemical work. Due to this reason, in this study, we consider Cys 138 protonated and uncharged.
Ionizable residues and amino acid sequence alignment for TcCathL modeling (detailed in the 
Materials and Methods Section) are highlighted in 
Figure 1. For all three aforementioned system types, we performed MD calculations for 100 ns, each in three replicates (
Table 1, “WT” row, and 
Materials and Methods). Below, we describe the analysis of these calculations and the subsequent work in detail.
  2.1. Catalytic Histidine 275 Changes Its Conformation at Acidic pH
We performed MD simulations of the cathepsin L molecule at two pH values: 7 and 2, with two types of simulation at the latter pH: pH “2 (all)” and pH “2 (his)” (see text above for explanations of these labels), each type in triplicate, yielding nine trajectories (
Materials and Methods Section and 
Table 1 (row “WT”)). The goal was to uncover if there are specific conformational effects to the enzyme’s structure that are caused by protonation of ASP/GLU+HIS residues. We performed a series of analyses that gradually characterized probable effects on the protein structure:
- Classical root-mean-square deviation (RMSD) analysis (see  Figure S1A- ) revealed that the overall protein structure remains stable at both pH values: we do not see any signs of cathepsin L acid-mediated denaturation. Although, focusing on the active site (residues Cys 138, His 275 and Asn 295 [ 18- ];  Figure S1B- ) reveals a slight instability in most of the pH 2 (both “all” and “his”) trajectories, suggesting that acidification causes not global but local effects in our systems. Furthermore, focusing on His 275 itself ( Figure S1C- ) additionally highlights this effect and reveals a wider RMSD distribution, suggesting multiple rotary transitions. 
- A root-mean-square-fluctuation (RMSF) plot ( Figure S2- ) is designed to uncover protein parts that might become excessively flexible upon acidification. Unexpectedly, almost no residues showed a significant change in RMSF, all being remote from the active site and thus, apparently unable to affect it and reduce the enzyme’s activity. 
- The secondary structure also did not deteriorate much at pH 2, notably, even becoming slightly more stable, although non-significantly ( Figure S3- ). 
- Finally, we focused on the finest parameters of the protein structure, residues’ rotameric states, which are confidently presented by χ 1-  torsion angle distributions. A change in His 275 χ 1-  torsion angle correlates with His 275—Asn 295 distance (not shown) and can serve as a reasonable criterion for determining whether His 275 is located inside the active site or outside it. This analysis revealed that acidification affects the structure of the cathepsin L active site, with the His 275 side chain turning away from the active site in the protonated state ( Figure 2- B), thus, violating the active site catalytic conformation and reducing the enzyme’s activity. 
We characterized each cathepsin residue by the distribution of its χ
1 torsion angle from each single MD trajectory: either pH 7 (three replicates) or pH “2 all” (another three replicates). For numerical comparison of two distributions, we introduce an integral of non-overlapping (
SNO), which is zero for two identical distributions and unity for two non-overlapping functions (
pink shading in 
Figure 2C, illustrating the most unlike His 275 conformations for a pair of pH 7/2 trajectories). Comparing pH 7 and pH “2 all” replicates in pairs, we identified a set of 17 residues with 
SNO > 0.5 in two or three MD replicate pairs (see 
Figure 2A and 
Table S1 for a comprehensive list). In-depth analysis revealed that most of these 17 residues are either small or remote from the active site and solvent-exposed, so their rotamer state change upon acidification will have little effect.
The only exceptions to the above are Trp 297, located just “above” the active site, and catalytic His 275, which clearly changes its rotameric state in acidic pH (
Figure 2). At pH 7, it is stably (in all three replicates) located inside the active site with χ
1 = 190° ± 9° (see also 
Figure 3A); conversely, at pH 2 it becomes excessively flexible and often exits the active site (e.g., χ
1 = 285° ± 12°), which probably leaves the enzyme temporarily dysfunctional (
Figure 2B). It should be noted that for ease of calculations, we stacked two χ
1 diapasons to overcome the jump from +180° to −180°. Of course, this conformational preference is unable to irreversibly damage the enzyme; however, at the same time, it is one of the probable mechanisms of activity loss when pH is reduced. 
Figure 2C gives an example of a comparison of His 275 χ
1 distributions at neutral and acidic pH values, revealing 
SNO values as high as 0.89. In some other trajectories, His may be still preferentially located in the active site or be occasionally changing its location (
Figure 3A), resulting in lower 
SNO values.
  
    
  
  
    Figure 2.
      Catalytic histidine 275 changes conformation at acidic pH. (
A). 
The overall structure of Tribolium castaneum 
cathepsin L model with residues that “feel” the pH change from 7 to 2 in MD. The propeptide that keeps an immature enzyme inactive is shown as a semi-transparent gray cartoon and was removed prior to any modeling (Materials and Methods). Disulfide bridges are colored 
yellow. Active site residues (Cys 138, His 275 and Asn 295 [
18]) are shown with 
blue sticks. Residues that considerably change χ
1 distribution at pH 2 compared to pH 7 (
SNO > 0.5; see panel (
C)) in at least 2/3 MD replicates are shown with red lines (see 
Table S1 for a comprehensive list). (
B). 
In-depth analysis reveals that only catalytic His 275 changes may play a role in the enzyme’s activity loss upon acidification. All other residues from (
A), except Trp 297, are either small or remote from the active site and solvent-exposed, so their rotation has little effect. In contrast, His 275 keeps a steady in-site position in all pH 7 simulations (blue; see also panel C and 
Figure 3A, up), while at pH 2 it turns away from the site (red; see also panel C and 
Figure 3A, down), which may cause activity loss. 
C. His 275 χ1 torsion angle distribution in indicative pH 7 (blue
) and pH 2 (red
) MD trajectories. To measure the variance between two distributions, we introduce the integral of non-overlapping parameter (
SNO), here equal to 0.89 and represented by pink shading. A full set of such distributions is provided in 
Figure 3A.
  
 
 
   Figure 2.
      Catalytic histidine 275 changes conformation at acidic pH. (
A). 
The overall structure of Tribolium castaneum 
cathepsin L model with residues that “feel” the pH change from 7 to 2 in MD. The propeptide that keeps an immature enzyme inactive is shown as a semi-transparent gray cartoon and was removed prior to any modeling (Materials and Methods). Disulfide bridges are colored 
yellow. Active site residues (Cys 138, His 275 and Asn 295 [
18]) are shown with 
blue sticks. Residues that considerably change χ
1 distribution at pH 2 compared to pH 7 (
SNO > 0.5; see panel (
C)) in at least 2/3 MD replicates are shown with red lines (see 
Table S1 for a comprehensive list). (
B). 
In-depth analysis reveals that only catalytic His 275 changes may play a role in the enzyme’s activity loss upon acidification. All other residues from (
A), except Trp 297, are either small or remote from the active site and solvent-exposed, so their rotation has little effect. In contrast, His 275 keeps a steady in-site position in all pH 7 simulations (blue; see also panel C and 
Figure 3A, up), while at pH 2 it turns away from the site (red; see also panel C and 
Figure 3A, down), which may cause activity loss. 
C. His 275 χ1 torsion angle distribution in indicative pH 7 (blue
) and pH 2 (red
) MD trajectories. To measure the variance between two distributions, we introduce the integral of non-overlapping parameter (
SNO), here equal to 0.89 and represented by pink shading. A full set of such distributions is provided in 
Figure 3A.
 
  ![Ijms 24 12369 g002 Ijms 24 12369 g002]()
 
An in-depth comparison of all pH 7 and pH 2 trajectories (
Figure 3A) shows that in an acidic environment, His 275 becomes unstable and often leaves the site, probably leading to a loss in activity. In these distributions of χ
1 torsion angle, 
blue shading indicates native (in-site) His 275 orientation, while a red color pinpoints when it turns away from the site and cannot take part in the catalysis. In all three pH 7 replicates, histidine remains stable (
Figure 3A, up), while six pH 2 replicates (3 × “all” and 3 × “his”; 
Figure 3A, down) reveal a high frequency of conformational violation (additionally shown by 
red arrows). It is worth noting that in the so-called pH “2 his” replicates we see the same effect, suggesting that the protonation of catalytic His 275 itself changes its behavior and forces its side chain to leave the site.
While aware that an increase in His 275 flexibility cannot be the sole cause of the loss of cathepsin L activity upon acidification, nevertheless, we exploit this observation as a working hypothesis in an attempt to undo this malfunction by “fixing” the catalytic His residue in the “right” position by point mutagenesis of the residues in the vicinity of the active site.
  2.2. His 275 “Fixing” as a Way to Stabilize Cathepsin L
Based upon the aforementioned finding that it is the internal flexibility of His 275 that may be the cause of the loss in cathepsin L activity upon acidification, we decided to design several point mutants that in some sense “fix” this residue inside the active site in the catalytically competent conformation. Close examination of the enzyme structure permitted the identification of position 277 as the most promising position for such an intervention: this residue (valine in wild-type) is next to His 275 in the β-strand (see 
Figure 3A, 
inset) and certainly influences histidine conformation, while not taking part in the catalysis. The first idea we explored was to put a hydrogen-bonding residue in this position (Asp/Asn/Gln/His/Thr/Tyr) capable of fixing His 275 in the active site by this interaction. The wide range of residue side chain volumes should provide flexibility for the accommodation of such a mutation in the active site vicinity without disturbing the activity. One more possibility explored was A251Y mutation (see 
Figure 3A, inset), which assumes that the introduction of an aromatic and hydrogen-bonding moiety just after the β-sheet ends may somehow fix His 275 in the site. As a negative control, the V277A variant was added to this list, as according to our initial hypothesis, this mutation should have no effect.
While the preliminary in silico analysis indeed suggested that the V277T and V277D mutations should prevent His 275 from rotating and causing a loss in cathepsin L activity (see 
Figure S4), in practice, 
only the “neutral” V277A mutation proved to be acid resistant in the experimental assessment (see further sections for biochemical results), exhibiting steady and significant enzymatic activity at pH values as low as 3.
  
    
  
  
    Figure 3.
      H275 changes conformation at the acidic pH in cathepsin L wild type but maintains conformation in the V277A mutant. The four panels show His 275 χ
1 torsion angle distributions for wild-type (
A) and V277A mutant (
B) enzymes at pH 7 (upper panels; blue outline) or pH 2 (both “2 all” (red outline) and “2 his” (pink outline) replicates; lower panels). Blue shading indicates the “native” His 275 rotameric state while red shading indicates its violation in both directions—these are additionally marked with red arrows (broken outline in A; the lower panel corresponds to the red distribution from 
Figure 2C). Inset in A, upper: cathepsin L active site structure with two His 275 rotamers (analogous to 
Figure 2A) along with V277 and A251 residues (orange). Inset in B, lower: visualization of V277A mutation, which provides some vacant space inside the active site and thus, may stabilize His 275 in its “native” rotameric state. Note that the V277A mutation dramatically reduces the frequency of rotameric violations for His 275 (one vs. six red arrows).
  
 
 
   Figure 3.
      H275 changes conformation at the acidic pH in cathepsin L wild type but maintains conformation in the V277A mutant. The four panels show His 275 χ
1 torsion angle distributions for wild-type (
A) and V277A mutant (
B) enzymes at pH 7 (upper panels; blue outline) or pH 2 (both “2 all” (red outline) and “2 his” (pink outline) replicates; lower panels). Blue shading indicates the “native” His 275 rotameric state while red shading indicates its violation in both directions—these are additionally marked with red arrows (broken outline in A; the lower panel corresponds to the red distribution from 
Figure 2C). Inset in A, upper: cathepsin L active site structure with two His 275 rotamers (analogous to 
Figure 2A) along with V277 and A251 residues (orange). Inset in B, lower: visualization of V277A mutation, which provides some vacant space inside the active site and thus, may stabilize His 275 in its “native” rotameric state. Note that the V277A mutation dramatically reduces the frequency of rotameric violations for His 275 (one vs. six red arrows).
 
  ![Ijms 24 12369 g003 Ijms 24 12369 g003]()
 
Figure 3, in addition to an in-depth analysis of the wild-type cathepsin MD (panel A), contains a description of the ability of the V277A mutant to resist an acidic environment (panel B). This figure analyzes His 275 rotameric states in neutral (top) and acidic (bottom) conditions. While both versions of the enzyme behave similarly at pH 7 (His 275 exhibits “native” rotameric states, indicated by blue distributions shading; and only minor violation is observed in V277A (red shading and red arrow) [
Figure 3, top panels]), at pH 2, a difference emerges: in the wild type, His 275 frequently jumps out of the active site, which probably leads to a loss in activity (red peaks and red arrows in 
Figure 3A, bottom); conversely, in the V277A mutant, the His 275 side chain remains relatively stable, violating the native state only once (one red arrow in 
Figure 3B, 
bottom right panel versus six in the bottom left panel).
 So why did the so-called “neutral” variant work, while the others did not? Most probably, this is not because of His 275 conformational “fixing” by the hydrogen bond with polar side chains, which may not in fact occur, but because of protein volume reduction in position 277 (V → A), which provides the required room for the His 275 side chain to be accommodated inside the active site at pH 2—a pH at which it gains a charge of +1 and may repel from other charged groups in the vicinity of the active site.
  2.3. The Probable Role of V277A Substitution in Cathepsin L Stabilization
To demonstrate the significance of the volume of residue 277 for His 275 rotameric stability, we performed additional modeling of V277G (the smallest possible side chain volume) and V277L (a volume increase compared to the WT) mutants at the two pH values and in triplicate (
Table 1). Four proteins were compared with a gradually increasing volume of the 277 residue: V277G, V277A, WT (V277) and V277L. The effect of the identity (and volume) of residue 277 on the stabilization of the His 275 rotameric state is summarized by the 
SNO dependence (
Figure 4 and 
Figure S5). Interestingly, the dependence is not monotonic with residue 277 volume: the lowest 
SNO is observed for the V277A variant (0.38 ± 0.32), while both an increase (Val or Leu) and a decrease (Gly) in the side chain volume leads to higher 
SNO values. We conclude that the V277A mutation is optimal for cathepsin L pH stability, which is assessed further experimentally.
  2.4. Biochemical Evaluation of the Effect of pH on the Activity and Stability of Cathepsin L Mutant Forms
Tribolium castaneum cathepsin L variants of the wild type and with the suggested mutations V277T, V277D, V277I, V277A and V277Y were obtained in the form of recombinant proenzymes in the yeast expression system of 
Pichia pastoris (rpTcCathL1 WT, rpTcCathL1 V277T, rpTcCathL1 V277D, rpTcCathL1 V277I, rpTcCathL1 V277A and rpTcCathL1 V277Y). To obtain active enzymes of mutant forms of cathepsin L, autocatalytic processing was carried out according to the method developed for wild-type cathepsin, which resulted in an active enzyme rTcCathL1 WT [
11]. However, in practice, we were able to obtain mature forms of recombinant enzymes for only three mutant proteins: rTcCathL1 V277I, rTcCathL1 V277T and rTcCathL1 V277A. Mutant proenzyme rpTcCathL1 V277D was not processed autocatalytically, and no active form was obtained, while rpTcCathL1 V277Y autodegraded instead of processing (that is, the V277D and V277Y mutations interfered with autoprocessing of the cathepsin). Thus, further studies were performed on rTcCathL1 V277I, rTcCathL1 V277T and rTcCathL1 V277A mutant forms in comparison with rTcCathL1 WT.
 The effect of the pH was studied in two ways: determination of the optimum pH in terms of activity and the pH stability of the enzymes (
Figure 5). All mutant proteins, together with the wild-type enzyme, showed maximum activity at neutral pH 7 (
Figure 5A), and the highest activity under these conditions was found in mutant enzyme rTcCathL1 V277I. Activity at pH 2 was very low in all forms; however, at pH 3, the activity was already noticeable and amounted to 34% for rTcCathL1 WT and 36% for rTcCathL1 V277A of the maximum activity at pH 7. However, for rTcCathL1 V277A at pH 3, the absolute value of the activity was only one-third of that of rTcCathL1 WT. However, the study of the stability of proteins across two hours of incubation at 37 °C at different pH values showed that rTcCathL1 V277A had the maximum stability at pH 3, at 78% of the initial activity (
Figure 5B). The wild-type cathepsin retained only 28% of its activity, the rTcCathL1 V277T mutant form only 17% and the rTcCathL1 V277I form only about 2%. The diagram showing the absolute values of residual activity after incubation of preparations with the same initial activity (
Figure S6) shows that despite the lower initial activity at pH 3 (
Figure 5A), the mutant form rTcCathL1 V277A after a two-hour incubation at human body temperature has almost twice the activity of rTcCathL1 WT. In the human stomach, pH is normally between 1.5 and 3.5, and food digestion lasts 1–4 h [
12]; therefore, pH 3 is quite suitable for effective gliadin hydrolysis by rTcCathL1 V277A of at least 2 h. We suggest this protein as a leading candidate glutenase for the development of oral medical preparations that fight CD and gluten intolerance in susceptible people (patent application No. 2023115446 “Obtaining Recombinant Acid-Stable Cathepsin L (Mutant Form) and Methods of Its Application”, Federal Service for Intellectual Property, Russia). 
It should be noted that maximum pH stability for all other forms was observed at pH 6, which is consistent with the physiological conditions for the functioning of digestive cathepsin L in the 
T. castaneum midgut at pH 5.6. Long-term incubation for 2 h at pH 6 did not affect the activity of rTcCathL1 WT and rTcCathL1 V277I, since the residual activity under these conditions was close to 100%. Only in rTcCathL1 V277T did the activity drop to 62% (
Figure 5A).
  3. Discussion
One of the main reasons for the decrease in the activity of peptidases in an acidic environment is structural changes in the enzyme. We employed MD simulations to uncover probable conformational violations that accompany acidification and revealed that the proposed computational protocol is quite reliable for this purpose. Thus, at the timescale of the modeling, there were no signs of the enzyme unfolding. The main result we discovered was that catalytic His 275—the core amino acid residue of the active site of the cysteine proteinases—constantly left the active site in acidic conditions, thereby apparently making catalysis impossible. The only way that classical MD can account for changes due to pH is by varying the charge state of the ionizable residues—which was implemented in our “pH 7” and “pH 2” trajectories. From 
Figure 3A, one can note that at pH 2, the protonated His 275 is unstable and often leaves the active site (
red distributions). It is important to note that even in the “pH 2 (his)” trajectories, when the only protonated residue in the structure is His 275, the effect persists, probably indicating repulsion of the histidine side chain (which gained a positive charge at this pH) from the vicinity of the active site. While aware that His 275 flexibility increase cannot be the sole cause of cathepsin L activity loss upon acidification, nevertheless, we suggest that the conformation of its side chain may be used as a signature of enzyme instability at low pH values.
Given this, we tried to engineer point amino acid substitutions in the vicinity of the active site that might stabilize the His 275 conformation. The most suitable substituted residue for this intervention was found to be the Val 277 (
Figure 3B, inset). Unexpectedly, the most promising candidates (e.g., V277T (
Figure S4), which was suggested to form an H-bond with His 275 and “hold” it in place) did not work (
Figure 5B). Conversely, the proposed mutation V277A, which is “neutral” in terms of its effect on the H-bonding pattern of His 275, leads to a freeing up of space at the site that can also affect the behavior of the catalytic residue. Subsequent experimental testing revealed that the mutant V277A became pH stable while still possessing reasonable activity after exposure to the acidic milieu (Figs. 3B and 5B). Compared to two other mutations tested in silico, V277G (with the shortest possible side chain) and V277L (with increased volume relative to the wild type), the V277A mutant displayed the lowest 
SNO parameter for His 275 (
Figure 4), suggesting that this mutant had the most stable His 275 conformation. One probable cause for this is simply the volume of the 277 residue, which is optimal in the case of V277A for providing the needed room for the accommodation of His 275 in the active site in both neutral and acidic environments.
The unexpected results obtained in this study show that the rational design of the enzymes still represents an area of uncertainty and requires many trial-and-error cycles incorporating both design attempts and experimental validation. Anyhow, the in silico design employed here made it possible to quickly evaluate different scenarios of rearrangements at the enzyme active site at different pH values. As a result, only a very limited number of residues—candidates for point mutagenesis and experimental testing of activity—were proposed. Most significantly, one of these variants—V277A—displayed the desired increase in enzyme stability at low pH. This is important from both the fundamental and practical points of view. In the case of the former, this provides an additional approach to solving a very complex problem in the computational design of enzymes with modified pH sensitivity. In the case of the latter, this leads to a real protein that can be further employed to combat CD and gluten intolerance in susceptible people.
  5. Conclusions
In this work, we made an advance in a quest for oral enzyme therapeutics that may be administered to people susceptible to CD and NCGS to allow them to digest immunogenic prolamin peptides. We proposed that digestive cathepsin L from the insect pest Tribolium castaneum, which has high post-glutamine cleaving activity, may serve as a basis for this quest, and computer modeling of the enzyme dynamics may provide a clue for mutagenesis conferring pH stability on the engineered protein. From computational simulations, we discovered that, at first glance, insignificant change—the V277A mutation—somehow stabilizes the active site of the enzyme during a change in pH from 7 to 2, probably by providing the needed room for the catalytic His 275 residue to remain within the active site. A subsequent biochemical evaluation revealed that, although losing some activity, cathepsin L V277A indeed shows increased stability at the acidic pH 3 compared to the wild type, and thus, is a leading candidate enzyme for use in the treatment of CD.