2.1. Review and Initial Analysis of the Previously Published Data
The first step of this work was to select the crystal structure, which was then used to study the catalytic mechanism of loganic acid methyltransferase. Two LAMT structures from
Catharanthus roseus (PDB codes 6C8S and 6C8R) were found in the RCSB PDB crystal structures database [
11]. Both structures were determined by X-ray diffraction and contain LAMT methyltransferase homodimer. Both complexes additionally include S-adenosylhomocysteine, an analog of the S-adenosylmethionine cofactor, which is a methyl group donor in the reaction catalyzed by LAMT. The 6C8S structure was solved with a resolution of 2.20 Å and 6C8R-1.95 Å. The 6C8R structure, unlike the 6C8S, additionally contained loganic acid, forming a complex with LAMT (
Figure 3).
In the further stages of the work, solely the 6C8R structure was used because it is characterized by better resolution than the 6C8S structure and also contains both SAH ligands and loganic acid.
Two main conclusions can be drawn from the analysis of the crystal structure of 6C8R. First, the distance between one of the oxygens of the carboxyl group of loganic acid and the sulfur atom of the SAH ligand is quite small (3.3 Å), which is related to the structure of the ligands of SAH, unlike the cofactor SAM, which does not contain a reactive methyl group on the sulfur atom. In the LAMT complex with SAM and loganic acid, steric effects associated with the presence of the methyl group of the cofactor is likely to cause the distance between the cofactor and the substrate to be greater than that in the crystal with SAH. For this reason, it can also be suspected that the conformation of the active site in the crystal structure may deviate from the conformation with the bound cofactor. This hypothesis was tested in this study, as described in the subsequent paragraph. The second observation is related to the way loganic acid is bound by the enzyme. In the SABATH family of methyltransferases, carboxylic acid substrates are bound by a conserved glutamine residue. This residue is also present in the active site of loganic acid methyltransferase but does not form hydrogen bonds with the substrate. However, this interaction is crucial for the course of the reaction because the mutation of this glutamine to alanine leads to a significant reduction in enzyme activity. On this basis, it can be concluded that glutamine 38 is involved in interactions with loganic acid, although this is not observed in the crystal structure of the protein. Such a hypothesis was also suggested in a previous paper [
11].
The two above observations lead to the conclusion that the structure of the active center of loganic acid methyltransferase in the 6C8R crystal may significantly differ from that occurring physiologically. In computational studies focused on the analysis of the catalytic mechanism of proteins, application of quantum mechanical calculations is a standard method [
14]. However, in the case of loganic acid methyltransferase, performing QM calculations based on the conformation of the protein from the crystal structure may lead to incorrect conclusions. For this reason, an approach combining molecular dynamics and quantum mechanical computation was used in this work. This approach has bee successfully used to study the catalytic mechanism of TrmD methyltransferase, for which the Mg
2+ cation binding site was unknown [
15]. This methodology was also used to characterize the mechanism of NEP1 methyltransferase, for which calculations solely using the crystal structure led to non-physiological reaction barriers [
16].
2.2. Crystal Structure Analysis and Preparation of MD Simulations
Energy optimization as described in the Materials and Methods section was the first part of this study. The minimized system was then used as the starting geometry for further steps, leading to three independent molecular dynamics trajectories. These trajectories were prepared in a uniform manner, although they differed in the initial velocities of the atoms. This approach enabled us to test whether the properties of the system are reproducible.
In the next step of preparing each of the three trajectories (
1,
2 and
3), the system was simulated under NVT conditions for 100 ps in order to achieve thermal stabilization. In the case of LAMT methyltransferase, it was shown that the enzyme is active at 303 K [
11]; therefore, the target temperature for each simulation was also 303 K. During the simulation, the heavy atoms of the protein complex were frozen; their position could not change, and water molecules and ions could move freely. The advantage of this approach was that the solvent could fill the available spaces inside the complex while maintaining a structure of the protein similar to that in the crystal. During each of the three NVT simulations, the temperature of the system quickly reached values close to the target temperature of 303 K (
Figure S1). After stabilizing the temperature of the system, it was also necessary to stabilize its pressure and, thus, the density of the system. For this reason, simulations were carried out under constant pressure and temperature conditions, with a preserved particle number (NPT) and the heavy atoms of the complex frozen. For each of the three simulations, the system was pressurized to 1 bar (
Figure S2).
For most systems, stabilizing the temperature and pressure is sufficient to perform a proper molecular dynamics simulation. In the case of the loganic acid methyltransferase complex studied in this work, additional relaxation was performed under the NPT conditions, with frozen heavy atoms of the main chain of the protein, SAM cofactor and loganic acid, leaving only the side chains relaxed. Analysis of the LAMT crystal structure shows that the interactions between loganic acid and LAMT observed in the crystal may differ from those under physiological conditions. For this reason, additional relaxation, without constraints imposed on the positions of the atoms of the side chains of amino acid residues, allowed for the reorganization of the structure and the formation of additional bonds between the substrate and the protein. This was important because during production simulations, these interactions may be necessary to stabilize the complex. Therefore, subsequent simulations were carried out under NPT conditions, with the bonds removed from the side chains of amino acid residues. The system was simulated for 1000 ps. In all three trajectories, the formation of a hydrogen bond between loganic acid and glutamine 38 was observed, for which a role in substrate binding was postulated [
11].
In trajectory
1, this bond was formed between the oxygen atom of the iridoid ring and the amino group of the glutamine residue. This observation is interesting because in other methyltransferases from the SABATH family, this residue is responsible for binding the carboxyl group of the substrate. Nevertheless, it is possible that glutamine 38 has a different function in loganic acid methyltransferase (
Figure 4).
In trajectory
2 (
Figure 5), however, we observed the formation of a hydrogen bond between the carboxyl group of the loganic acid and the glutamine residue. This way of binding loganic acid is similar to the way that substrates are bound by other methyltransferases in this family.
A similar conclusion can be drawn from the final structure of trajectory
3. In this trajectory, the carboxyl group of loganic acid also interacts with the side chain of glutamine (
Figure 6).
After stabilizing the temperature and pressure of the system, as well as relaxing the side chains of amino acid residues, the next three (production) independent molecular dynamics simulations were performed. The final structures of each of the three trajectories, which are shown in the figures above (
Figure 4,
Figure 5 and
Figure 6), were used as initial structures for production simulations. Although the means of interaction between loganic acid and glutamine 38 in the last step of trajectory
1 are different than in the other two trajectories (
2 and
3), a production simulation was performed using this structure in order to check whether this bond would be preserved after removing the constrains from the remaining atoms of the complex.
2.3. Analysis of Production MD Simulations
After proper preparation of the systems and final snapshots from trajectories 1, 2 and 3, production molecular dynamics simulations were performed in order to reproduce the native conformations of the LAMT complex with the cofactor and loganic acid. Three independent trajectories, each with a length of 20 ns, were obtained, which made it possible to study the conformational dynamics of the studied complex. Simulations were carried out in an isobaric–isothermal ensemble, and the previously set constrains on the protein, cofactor and substrate heavy atoms were removed. Then, the following properties of the system were characterized:
Stability of the LAMT complex with the SAM cofactor and loganic acid;
Interactions that stabilize the binding of loganic acid;
The distance between the carboxyl group of loganic acid and the methyl group of the SAM cofactor.
One of the approaches used to determine the stability of the complex during the molecular dynamics simulations is RMSD analysis, which allows provides information on how much the conformation of the complex in a given step of the simulation deviates from the reference structure. The structure from the first step of the simulation was used as a reference. For each of the three trajectories, the RMSD values of Cα carbon atoms of amino acid residues were calculated. For each simulation, the RMSD values remained relatively constant and did not exceed 2.5 Å. There were also no spikes in RMSD values. Relatively constant and low RMSD values indicate that the complex was stable during the molecular dynamics simulation; therefore, the obtained trajectories can be used for further analysis (
Figure 7).
In the next step, interactions between loganic acid and loganic acid methyltransferase were analyzed. Hydrogen bonds were determined using the VMD 1.9.3 program. The cutoff values above which atoms were considered not to form hydrogen bonds were 3.2 Å for the donor–acceptor distance and 50° for the angle between the donor, hydrogen and acceptor. Such values allowed for the analysis of strong and medium-strength hydrogen bonds. The results are summarized in
Table 1 and shown in
Figure 8.
Several key hydrogen bonds between loganic acid and protein were observed in the molecular dynamics simulations. In all three trajectories, histidine 162 formed a hydrogen bond with the carboxyl group of the substrate. This bond was present for most of the simulation time, which proves a significant role of this residue in the binding of loganic acid. The tryptophan 163 residue also participated in interactions with the carboxyl group of the substrate. However, the frequency of hydrogen bonding between this residue and loganic acid depends, to a large extent, on the trajectory of molecular dynamics. Only in trajectory 2 did this interaction occur during practically the entire simulation, and in trajectories 1 and 3 it was present only in about 5% of the frames. However, the distance between the nitrogen atom in the tryptophan aromatic ring and the oxygen atom of the substrate carboxyl group is relatively short, averaging 4.19 ± 0.42 Å in trajectory 1 and 4.18 ± 0.44 Å in trajectory 3. A possible explanation for this observation is the presence of weak hydrogen bonds between tryptophan 163 and loganic acid or the occurrence of a hydrogen bond in which the substrate interacts with W163 indirectly with the participation of a water molecule. Water bridge formation was observed in some structures during molecular dynamics simulations.
The role of histidine 162 and tryptophan 163 in substrate binding is also confirmed by the crystal structure of LAMT, in which both residues formed hydrogen bonds with loganic acid. In addition, the H162A and W163F mutants showed negligible catalytic activity, which also suggests an important role of these amino acid residues in interactions with the carboxyl group of the substrate [
11].
In the molecular dynamics simulations, glutamine 38 residue was also found to be involved in the bond with the carboxyl group of loganic acid. The interaction involving Q38 occurred all three molecular dynamics trajectories and was present for most of the simulation time. In the crystal structure of LAMT, this bond was absent, but the involvement of Q38 in substrate binding was previously postulated by Petronikolou based on mutational analysis and the conservation of this residue in the SABATH family of methyltransferases [
11]. Based on molecular dynamics simulations, it can be concluded that amino acid residues H162, W163 and Q38 participate in the recognition and binding of the carboxyl group of loganic acid and are responsible for the proper orientation of the substrate in order to carry out the methylation reaction. This conclusion is additionally confirmed by previously published experimental studies [
11].
In the active site of the enzyme, there are also interactions with the sugar moiety of loganic acid. Among others, alanine 241 participates in these interactions, forming a hydrogen bond with the oxygen atom of the glucose hydroxyl group. This binding occurred in each of the performed simulations for about 50% of their duration. Tyrosine 37 was also involved in interactions with the rest of the glucose. However, this binding occurred relatively rarely and was present in only 16.78% of the first trajectory and practically unobserved in the remaining two. For this reason, it can be concluded that the key interaction for the recognition of the glucose ring is the interaction involving alanine 241, while tyrosine 37 plays only a secondary role.
Both hydrogen bonds were present in the LAMT crystal structure; however, only one of was present for a significant part of the simulation time. Thus, it seems that in the case of Y37 interactions with loganic acid, the hydrophobic interactions between the aromatic ring and the glucose ring play a key role and not hydrogen bonds. The loganic acid methyltransferase mutant Y37A does not show catalytic activity, which further supports the conclusion about the role of tyrosine 37 residue in interactions with the substrate.
Interactions with the loganic acid iridoid ring also occurred during the molecular dynamics simulation. Glutamine residue 273 participated in them, forming a hydrogen bond with the hydroxyl group of the iridoid ring. Due to the presence of an amide group in the side chain of glutamine 273, this residue can be both a donor and an acceptor of a hydrogen bond; however, in the simulations, the main observed interaction involved the NH
2 Q273 group. The role of this residue in the recognition of the iridoid ring was previously postulated by Petronikolou [
11] based on crystal analysis and mutational studies, further supporting the conclusions of molecular dynamics simulations.
In the crystal structure, a hydrogen bond was also present between the hydroxyl group in the iridoid ring and the H245 residue, but formation of this bond was not observed in the current simulations. Interestingly, hydrogen bonding involving glutamine 273 occurred most often in trajectory 2 (80.97%) and much less often in trajectories 1 and 3, as is the case of bonding with tryptophan residue 163. This observation can be explained if we take into account the possibility of formation of hydrogen bonding involving a water molecule between W163 and the substrate.
The presence of a water bridge was observed in trajectories 1 and 3, in which Q273 rarely forms a hydrogen bond with the hydroxyl group of the substrate. This is probably because the water molecule forming the water bridge interacts with both the oxygen atom of the carboxyl group (O2) and the oxygen atom of the hydroxyl group of the loganic acid (O9). The steric and electrostatic effects associated with the presence of a water molecule near the hydroxyl group make the hydrogen bond with glutamine 273 weaker and less frequent. The opposite situation can be observed in trajectory 2, in which direct hydrogen bonding between tryptophan 163 and the substrate is mainly encountered. Due to the lack of a water molecules in the vicinity of O9, the interaction between Q273 and the hydroxyl group occurs more often.
Based on the analysis of hydrogen bonds between loganic acid and the enzyme, it can be concluded that there are two possible patterns of substrate–protein interaction. The first possibility is direct hydrogen bonding between the tryptophan 163 NH group of the indole ring and loganic acid’s carboxylic group, together with strong interaction between glutamine 273 and the iridoid ring. In the second scheme of interactions between W163 and the substrate, there is an indirect hydrogen bond involving a water molecule, and the Q273 residue plays an additional role in the binding of the substrate.
For further characterization of the LAMT complex with loganic acid and the cofactor, the distance between the carbon atom of the reactive methyl group of S-adenosylmethionine and the methylated oxygen atom of the substrate was calculated. This distance is of great importance from the point of view of the reaction catalyzed by the methyltransferase because it corresponds approximately to the reaction coordinate. For large distances, the methyl group must first approach one of the oxygen atoms of the carboxyl group of the loganic acid, and only then can the methyl group be transferred. For this reason, the reaction barrier is associated not only with the breaking and formation of chemical bonds but also with conformational changes within S-adenosylmethionine. For this reason, short distances between the methyl group and its acceptor are more conducive to the methylation reaction. During each trajectory, the distance between the carbon atom of the methyl group of SAM and the nearest oxygen atom of the substrate carboxyl group is stable and rarely exceeds 3.4 Å (
Figure 9). This observation additionally supports the conclusion about the stability of the simulations of molecular dynamics and the SAM–loganic acid interactions occurring therein.
2.5. Catalytic Mechanism Analysis
A quantum chemical cluster approach was used to analyze the mechanism of the reaction catalyzed by loganic acid methyltransferase. In this method, a relatively small but well-selected part of the enzyme is cut out from the structure of the studied complex and is treated at the quantum chemical level. This approach was successfully used in previous studies devoted to the analysis of mechanisms of reactions catalyzed by enzymes [
15,
17,
18,
19].
In the first stage of quantum mechanical calculations, two models (
model 1 and
model 2) of the active center were prepared using previously selected structures from molecular dynamics simulations. In both models, amino acid residues located in the immediate vicinity of the methyl group of SAM and the carboxyl group of loganic acid, including residues forming key interactions, were taken into account. Then, the atoms at the truncated point were fixed to prevent their unnatural movement during the optimization of the geometry. These models contained the same amino acid residues, loganic acid and S-adenosylmethionine. Exactly the same atoms were fixed in both models. The total number of atoms in
model 1 was 192, and that in
model 2 was 189. The main difference between the models, the influence of which was examined during the course of the reaction, was the presence of a water molecule in
model 1, resulting in the number of atoms in this model increasing by three compared with
model 2. This water molecule participated in the hydrogen bonding between W163 and the substrate (
Figure S3). In
model 2, this binding was direct (
Figure S4).
After building the models, it was necessary to optimize their geometry in order to find the energy minimum corresponding to the system before the methylation reaction. The DFT method was used for the calculations, which allows for an accurate description of the energy at a relatively low computational cost. Correlation-exchange functional B3LYP [
20,
21,
22,
23,
24] was used, which is widely applied in studies of the mechanisms of enzymatic reactions using the cluster approach [
25]. However, this functional is not able to satisfactorily describe dispersion interactions that are crucial for the intermolecular interaction; therefore, empirical correction GD3BJ for dispersion forces was included in the calculations [
26]. The Def2-SVP basis set was used for geometry optimization, which is recommended for calculations of similar systems using DFT methods [
27]. The charge of both systems was set to 0, and the spin was set to the singlet state. In order to confirm whether the identified geometries corresponded to the actual energy minimum, hessians were computed, showing only a few imaginary frequencies associated with fixed atoms, which were of the order < 30i cm
−1.
After optimizing the geometry of the systems, we checked whether the obtained structures differed significantly from those obtained before energy minimization. For this purpose, the structures from the first and last optimization step were superimposed. No major changes in the conformations of either
model 1 or
model 2 were observed, which means that the applied atom constraint scheme allowed for the reproduction of steric effects of the part of the complex not included in the model (
Figures S5 and S6).
The optimized structures for
models 1 and
2 were used to find the transition state for the methylation reaction of loganic acid involving S-adenosylmethionine. In order to find the approximate structure of the transition state, the relaxed potential energy surface scan method was used. This method makes it possible to determine changes in potential energy during the change of one of the geometrical parameters. In both models, the distance between the carbon atom of the methyl group of SAM and the oxygen of the carboxyl group of loganic acid was scanned. During each individual calculation, all degrees of freedom were optimized, except for the selected distance, which remained constant. The reaction coordinate was scanned every 0.2 Å. For both models, the maximum electron energy corresponds to distance of 2.1 Å between the methyl group of the cofactor and its acceptor (
Table 2).
The most distant energy structures (distance 2.1 Å) from the scans were used to find the transition state for the methyl group transfer reaction. First, the hessians were calculated to confirm that they were close to the transition state. Both structures showed one imaginary frequency with a high value, which corresponded to the vibrations associated with the methyl group. The presence of the imaginary vibration frequency confirmed that the geometry of these structures is close to the geometry of the transition state, so they can be used as a starting point for its optimization. The transition states were optimized using the analytical hessian calculated in the previous step. Then, it was confirmed that the obtained structures corresponded to the saddle point by calculating the eigenvalues.
In the transition states for both models, the methyl group is located between the sulfur atom of S-adenosylmethionine and the oxygen of loganic acid, which is its acceptor. The S-CH
3-O angle is 174° in
model 1 and 170° in
model 2, which means that the methyl group approaches the acceptor in the plane defined by the carboxyl group. The S-CH
3 distance in both models is 2.3 Å, and the CH
3-O distance is slightly shorter in
model 1 than in
model 2 (2.1 Å vs. 2.2 Å). The asymmetric methyl transfer transition state is characteristic for many methyltransferases, including salicylic acid methyltransferase, which, like LAMT, belong to the SABATH family of methyltransferases. The corresponding distances for the transition state of the reaction catalyzed by SAMT are 2.4 Å and 2.0 Å [
28]. In the transition state for both models, the methyl group adopts a flat conformation, and the reaction itself proceeds with an inversion of configuration on the carbon atom. In both transition states, the interactions present in the system before the reaction are preserved. The carboxyl group of the substrate is oriented to accept the methyl group through interactions with glutamine 38, histidine 162 and tryptophan 163 (
Figures S7 and S8).
After modeling both the initial structures and the transition state structures, the structures of the final state were determined. Optimized conformations from a scan for a CH3-O distance of 1.9 Å were used for this purpose. The constraints imposed on the distance between the methyl group and the acceptor were removed, and the geometry of the system was optimized. Then, it was verified whether the system was an energy minimum by counting the eigenvalues. In both models, as a result of a chemical reaction, the methyl group of the cofactor was transferred to the carboxyl group of loganic acid, resulting in the formation of a methyl ester–loganin complex.
The hydrogen bonds involving the loganin ester group differ between models. In model 1, the same interactions that were responsible for the orientation of the carboxyl group of loganic acid are preserved. The ester group is bound with the participation of glutamine 38, which interacts with the oxygen atom substituted with a methyl group. The loganin carbonyl group forms a direct hydrogen bond with histidine 162 and a hydrogen bond with tryptophan 163 involving a water molecule. In model 2, the direct interactions between the carbonyl group of loganin with tryptophan 163 and histidine 162 are preserved. As a result of the transfer of the methyl group, the hydrogen bond involving glutamine 38 is broken. During the methyl group transfer reaction, the carboxyl group of loganic acid loses its negative charge, which may significantly weaken the hydrogen bond with Q38. In addition, the presence of a bulky group near the oxygen atom may further weaken this bond due to steric hindrance. The combination of electrostatic and steric effects can lead to a strong weakening of the hydrogen bond between loganin and glutamine 38, resulting in its breaking. The lack of interaction between Q38 and the reaction product may be the basis for the mechanism of loganin dissociation from the active site of loganic acid methyltransferase in order to start a new catalytic cycle.
After determining the structure of stationary points, energy corrections were calculated for both models. In order to obtain more accurate values of the electronic energy, single-point calculations were performed using the Def2-QZVP basis set. In the case of DFT methods, the use of a larger basis set than Def2-QZVP does not lead to a further increase in the accuracy of the results due to the basis set limit [
27]. The solvation energy was also calculated using the CPCM implicit model with two different dielectric constants (4 and 80) [
29,
30]. System vibrations at 0 K were taken into account by computing zero-point energy (ZPE). The final energy is the sum of the electronic energy calculated using the Def2-QZVP basis set, the solvation energy (ε = 4 or 80) and the zero-point vibrational energy.
In both models, increasing the basis set from Def2-SVP to Def2-QZVP leads to an increase in the relative electron energy of the transition state and the product by several kcal/mol, which means that some aspects of the electron density are poorly described in calculations using a smaller basis set. In both
model 1 and
model 2, the absolute value of the solvation energy decreases in the following order: initial state, transition state, final state. During the chemical reaction, the positively charged S-adenosylmethionine and the negatively charged loganic acid are converted into neutral products. Due to the fact that ionic substances are more strongly solvated, a decrease in solvation energy is observed during the reaction. The calculated zero-point vibrational energy represents only a small contribution to the final energy of the system: of the order of 1 kcal/mol. In both models, this correction increases the final state energy, and in
model 2, the zero-point vibrational energy also lowers the reaction barrier (
Table 3).
Corrections to the electronic energy allowed for the determination of the energy profile of the reaction (
Figure 12 and
Figure 13). In both models, the reaction barrier is related to the transfer of the methyl group from S-adenosylmethionine to the carboxyl group of loganic acid. The reaction barriers for both models are similar. In the case of
model 1, the activation energy is 18.7 kcal/mol, and in
model 2, it is 17.6 kcal/mol. In both models, the reaction is exoenergetic.
In order to check the influence of the choice of dielectric constant on the reaction barrier, the solvation energy was calculated using two diverse dielectric constants (ε = 4 and 80). Changing the value of the dielectric constant from 4 to 80 increased both the reaction barrier and the final state energy. However, these differences are so small that the choice of dielectric constant does not significantly affect the interpretation of the obtained energy profiles. This means that most of the significant electrostatic interactions were taken into account during the construction of both models.
Analysis of the reaction pathways using quantum chemistry methods and a cluster approach enabled the characterization of two probable reaction mechanisms. In both mechanisms, the S methyl group of adenosylmethionine is transferred to the carboxyl group of loganic acid, resulting in S-adenosylhomocysteine and loganin. This process is the rate-limiting stage of the reaction. The difference between these mechanisms is how the tryptophan 163 residue interacts with loganic acid. In the first mechanism, the hydrogen bond between the W163 residue and the substrate occurs with the participation of a water molecule, while in the second mechanism, the bond is direct. Reaction barriers for these two possibilities are close to each other and equal 18.7 and 17.6 kcal/mol, respectively, which are adequate values for reactions catalyzed in living organisms. This means that both active site configurations of loganic acid methyltransferase are catalytically active.
The obtained results also allow for a deeper understanding of some experimental facts. Earlier experimental studies showed that the turnover number (k
cat) for LAMT is 0.31 ± 0.01 s
−1, which corresponds to an energy barrier of 18.5 kcal/mol [
11]. This barrier is related to the slowest step after binding of substrates, which may be product release, conformational change or chemical reaction. This value is very close to the activation energy for the reaction mechanisms characterized in this work (18.7 kcal/mol for
model 1 and 17.6 kcal/mol for
model 2). This allows us to interpret the k
cat constant as the rate constant of the reaction of the methyl group transfer from S-adenosylmethionine to loganic acid and the reaction itself as the limiting step in the entire catalytic cycle. The agreement between the experimental results and the quantum mechanical calculations is additionally confirmed by the catalytic mechanism characterized in this work.