Next Article in Journal
UPLC-ESI-MS/MS Profiling of Secondary Metabolites from Methanol Extracts of In Vivo and In Vitro Tissues of Daucus capillifolius Gilli (A Comparative Study)
Next Article in Special Issue
The Effect of β-Sheet Secondary Structure on All-β Proteins by Molecular Dynamics Simulations
Previous Article in Journal
Reduced Graphene Oxide-Supported SrV4O9 Microflowers with Enhanced Electrochemical Performance for Sodium-Ion Batteries
Previous Article in Special Issue
Molecular Dynamics Simulations of the Short-Chain Fluorocarbon Surfactant PFHXA and the Anionic Surfactant SDS at the Air/Water Interface
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Electronic and Nuclear Quantum Effects on Proton Transfer Reactions of Guanine–Thymine (G-T) Mispairs Using Combined Quantum Mechanical/Molecular Mechanical and Machine Learning Potentials

Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
*
Author to whom correspondence should be addressed.
Current address: Center for Integrative Proteomics Research, 174 Frelinghuysen Road, Piscataway, NJ 08854-8076, USA.
Molecules 2024, 29(11), 2703; https://doi.org/10.3390/molecules29112703
Submission received: 1 May 2024 / Revised: 30 May 2024 / Accepted: 4 June 2024 / Published: 6 June 2024
(This article belongs to the Special Issue Advances in Computational and Theoretical Chemistry)

Abstract

:
Rare tautomeric forms of nucleobases can lead to Watson–Crick-like (WC-like) mispairs in DNA, but the process of proton transfer is fast and difficult to detect experimentally. NMR studies show evidence for the existence of short-time WC-like guanine–thymine (G-T) mispairs; however, the mechanism of proton transfer and the degree to which nuclear quantum effects play a role are unclear. We use a B-DNA helix exhibiting a wGT mispair as a model system to study tautomerization reactions. We perform ab initio (PBE0/6-31G*) quantum mechanical/molecular mechanical (QM/MM) simulations to examine the free energy surface for tautomerization. We demonstrate that while the ab initio QM/MM simulations are accurate, considerable sampling is required to achieve high precision in the free energy barriers. To address this problem, we develop a QM/MM machine learning potential correction (QM/MM- Δ MLP) that is able to improve the computational efficiency, greatly extend the accessible time scales of the simulations, and enable practical application of path integral molecular dynamics to examine nuclear quantum effects. We find that the inclusion of nuclear quantum effects has only a modest effect on the mechanistic pathway but leads to a considerable lowering of the free energy barrier for the GT*⇌G*T equilibrium. Our results enable a rationalization of observed experimental data and the prediction of populations of rare tautomeric forms of nucleobases and rates of their interconversion in B-DNA.

1. Introduction

With five natural nucleobase monomers (G, C, A, and T/U), DNA and RNA are able to store and transfer a rich density of genetic information with high fidelity [1,2]. Nonetheless, the diversity of biological populations arises from processes of natural selection and genetic variation [3,4], the latter of which is influenced by mutations that occur during DNA replication [5,6]. Even with the precise replication process [7], including proofreading [8] and DNA repair [8], the mutation rate still typically reaches 10 9 10 12 per base pair synthesized [9]. While mutations are a vital component in the creation of genetic diversity, some mutations in eukaryotes lead to diseases, including many forms of cancer [10,11]. Mutations can arise through mispairing of the Watson–Crick G-C and A-T hydrogen bonding bases in the DNA helix. In the Watson–Crick (WC) model, nucleobase pairs are in their “keto” form [12] rather than their “imino” or “enol” forms. Mispairs are believed to commonly arise from tautomeric shifts in the nucleobases [5,6] that cause a change in the hydrogen bond pattern at the Watson–Crick edge without changing the charge [2]. These rare nucleobase tautomers are able to form WC-like mispairs that can sometimes circumvent proofreading and repair machinery and lead to mutations and misincorporation in DNA replication [7,9] and translation [13]. Hence, the study of how protons transfer between different tautomeric forms in DNA mispairs is of fundamental importance.
Despite their importance, the experimental detection of tautomeric nucleobases is challenging due to their short lifetimes [2,14]. Nonetheless, much insight has been gained by previous experimental studies. Allawi et al. investigated the stability of 39 oligonucleotides with internal G-T mismatch using UV absorbance versus temperature profiles and NMR [15]. Waters et al. reported the kinetics of removal of thymine DNA glycosylase from DNA containing a G-T mismatch [16]. Fox et al. reported a G-T mismatch mechanism by using base analogues of G and T [17] to explore the functional groups on the mismatch pair which are recognized by the enzyme. Osakada et al. studied the kinetics of charge transfer for the detection of single-base mismatch in a DNA molecule [18]. Koag et al. studied mismatch discrimination mechanisms in human DNA polymerase β crystal structures and proposed a two-stage mechanism where in the closed conformation state, pol β allows only a Watson–Crick-like conformation [19].
In pioneering NMR work, Al-Hashimi and coworkers reported the existence of transient WC-like G-T mispairs, providing insight into their populations and rates and establishing a dynamical basis for misincorporation via tautomerization and, more rarely, ionization [20,21,22]. In a follow-up computational study [23] focused on calculation of the relative tautomer free energies, it was shown that the tautomer equilibrium was sensitive to the B-DNA environment. This equilibrium was predicted to be shifted to become more favorable when bound to DNA polymerase λ , as was proposed in earlier work [24].
Other computational studies have examined tautomerization in DNA [5] and G-U wobbles in RNA [25]. Very recently, tautomerization in GT wobble pairs has been studied using a coupled first-principles quantum chemistry and open quantum system master equation approach [26]. Results illustrate that quantum tunneling plays an important role in these reactions and has significant influence on rates and transcription error frequency.
In the present work, we examine the mechanism of tautomerization of G-T mispairs in B-DNA, with emphasis on the reaction paths and transition states, sensitivity of the free energy profiles and barriers to sampling, and inclusion of nuclear quantum effects. As a model system, we use the B-DNA crystal structure [27] with two G-T wobble pair mismatches, illustrated in Figure 1. We perform a series of combined quantum mechanical/molecular mechanical (QM/MM) simulations using semiempirical, ab initio, and machine learning models to study the mechanism of the tautomerization reaction. Path integral molecular dynamics (PIMD) simulations are performed to deduce the role of nuclear quantum effects on the calculated barriers in order to help rationalize the observed experimental data. The remainder of the manuscript is outlined as follows. The Section 3 provides a description of the computational details, including the QM/MM and QM/MM- Δ MLP models, construction of the free energy profiles, and PIMD methods. The Section 2 compares free energy profiles using different QM/MM and QM/MM- Δ MLP models, examines the sensitivity of the profiles to degree of sampling, and predicts the influence of nuclear quantum effects on the barriers. Results are discussed in the context of experiments and other related computational work reported in the literature. This paper concludes with a summary of key take-home points.

2. Results and Discussion

Figure 1 shows the B-DNA sequence and structure studied in the current work. The figure highlights the G-T wobble (wGT), and it illustrates the general process of tautomerization. The formation of rare GT* and G*T states, where G* and T* indicate the higher-energy enol tautomeric form of G and T, respectively, leads to Watson–Crick-like mispairs containing three hydrogen bonds, similar to a GC base pair. The wGT mispairs have been proposed to lead to misincorporation errors in DNA replication [21,23]. Tautomerization in GT mispairs have been studied previously using ab initio QM/MM simulations [23], and it has been shown that the tautomer equilibrium is sensitive to the environment. Specifically, Li et al. reported an insightful study of the reaction in aqueous solution in canonical A-form and B-form DNA helices in solution, as well as B-DNA bound to DNA polymerase λ . The main goal of the work was to compute the relative free energies of the wGT, GT*, and G*T tautomers, with less emphasis on the reaction paths and barriers as is the focus of the current work. Results of ab initio QM/MM simulations predicted that the wGT→GT* tautomerization is endoergic in B-DNA in aqueous solution but exoergic in the polymerase environment [23].
The goal of the present work is to use a B-DNA helix exhibiting a wGT mispair as a model system to study the tautomerization reactions in greater detail with regard to the reaction paths and transition states, sensitivity of the free energy profiles and barriers to sampling, and inclusion of nuclear quantum effects. The following briefly highlights the main methodological differences in the classical ab initio QM/MM simulations used in the current work with respect to that of Li et al. [23]. The latter study used the ω B97X-D3 functional [28,29] and 6-311G** basis set [30,31] and performed string calculations using 25 images with 0.1 ps sampling per image. Simulations were performed under aperiodic (frozen) boundary conditions outside of an 18 Å distance. In the current work, the PBE0/6-31G* ab initio QM/MM model was used under full periodic boundary conditions (no frozen boundary) with rigorous electrostatics [32], and string simulations were conducted with 32 images, 2 ps sampling per image, and 40 ps of production sampling (four independent 10 ps trials) along the final path in order to accurately obtain barriers.
In order to ascertain the contribution of nuclear quantum effects, path integral simulations were performed with a recently developed software interface [33,34] between Amber [35] and i-PI [36]. PIMD simulations are roughly an order of magnitude more computationally costly than the classical MD simulations and preclude the practical use of the ab initio QM/MM model. To address this barrier, we developed a QM/MM- Δ MLP model based on a fast, semiempirical QM/MM model that was more than 100 times faster than the ab initio QM/MM model but with comparable accuracy.
The next section compares free energy profiles for tautomerization reactions from classical MD using semiempirical and ab initio QM/MM models as well as the QM/MM- Δ MLP developed in this work. The following section then applies the QM/MM- Δ MLP model to ascertain the effect of inclusion of nuclear quantum effects on the reaction barriers. The final section ties the results together and compares the various models with other computational results reported in the literature as well as available experiments.

2.1. Free Energy Profiles of G-T Mispair from Classical Molecular Dynamics

Figure 2 compares the free energy profiles from classical MD simulations for the ab initio QM/MM (PBE0/6-31G*), semiempirical QM/MM (AM1/d), and QM/MM- Δ MLP models. Table 1 lists the relative free energy values for key stationary points along the various profiles. The reactions are broken down into competing wGT→GT* and wGT→G*T pathways and, separately, the interconversion GT*→G*T. These reaction steps are shown schematically in Figure 3, and representative structures for key stationary points along the reaction paths are shown in Figure 4. The 1D profiles shown are the result of the minimum free energy path derived from the finite temperature string method [37,38], followed by additional production path refinement sampling (see Section 3).
The ab initio QM/MM profiles have a single transition state with the exception of the wGT→G*T profile. The wGT→GT* profile passes through a single transition state (TSa) that involves a partially ionized state with formal charges at G:O6 (protonated) and T:O4 positions (Figure 3a) that then undergoes a conformational shearing of the base pair concerted with proton transfer to form GT*. The wGT→GT* profile follows an initial path (Figure 3b) to a partially ionized transition state (TSb1) similar to that of TSa, but then follows a stepwise path that first involves conformational shearing of the ionized nucleobases to form a metastable intermediate (Ib) that then undergoes a proton transfer from G:N1→T:N3 passing through transition state (TSb2) to form the GT* product. The GT*→G*T involves a concerted dual proton transfer (Figure 3b) that passes through a low-barrier charge-separated transition state (TSc).
The AM1/d QM/MM method is a fast, semiempirical model that was first tested to ascertain the degree to which it could reliably model tautomerization reactions such that it might be applied with the more computationally intensive PIMD simulations. The AM1/d method considerably overestimates the free energy barriers relative to the ab initio QM/MM model (Figure 2). Additionally, the AM1/d model produces wGT→GT* and wGT→G*T reaction profiles that are more endoergic. This is likely due in part to the known problem that AM1 tends to underpredict the strength of hydrogen bonds. As the formation of the high-energy enol tautomer states (G*T and GT*) is accompanied by an increase in the number of hydrogen bonds, an underprediction of the strength of the hydrogen bonds will lead to a higher-energy Watson–Crick-like hydrogen bonded state. Perhaps more concerning is that the AM1/d profile produces multiple transition states for each profile separated by, in some cases, artificial intermediates not observed in the ab initio QM/MM profiles.
The developed QM/MM- Δ MLP model, on the other hand, is observed to closely mimic the ab initio QM/MM profiles in Figure 2. The transition state barriers ( Δ A ) are all within approximately 1 kcal/mol of the PBE0/6-31G* QM/MM values (Table 1), and the reaction free energy values ( Δ A ) are less than 0.5 kcal/mol. This provides support for the use of the QM/MM- Δ MLP model in the PIMD simulations described in the Section 2.3.

2.2. Sensitivity of Free Energy Profiles to Sampling

The quantitative determination of free energy surfaces and minimum free energy paths for complex systems requires intensive sampling efforts. Oftentimes, even a minimal level of sampling is precluded by the computational cost of ab initio QM/MM simulations, particularly when the QM region becomes large. In this section, we illustrate the degree to which sampling can affect predictions obtained from the free energy profiles examined in this study.
In the current work, we performed finite temperature string method optimization using five generalized coordinates to determine the minimum free energy path for the reaction. At the ab initio QM/MM level, we then refined the final path with four independent trials, each of which were conducted for 10 ps for each of the 32 string images (windows). The cumulative sampling from the string simulations and final path refinement were used to construct the plots shown in Figure 2. For the more efficient QM/MM- Δ MLP, the refinement was conducted with four independent trials, and each of the 32 images was sampled for 25 ps/image/trial. This is considerably more sampling than what is often encountered in other studies. For example, a previous study of the same reaction used approximately 2.5 ps/image of cumulative sampling to construct free energy profiles [23].
In order to make an estimate of the expected variance of the free energy profiles derived from more modest sampling, we divided the 40 ps of sampling per string image into 16 nonoverlapping 2.5 ps blocks. We then reanalyzed the free energy profiles using only the 2.5 ps of sampling of a single block in order to come up with 16 profiles, which are shown in Figure 5. While the overall shape of the profiles is qualitatively similar, the range of values for the rate-controlling transition states is observed to vary between 2.0 and 3.4 kcal/mol, particularly for the GT*→GT tautomerization. One should bear in mind that this demonstration is a best-case scenario in the sense that it examines the refinement of a final converged path. It does not consider variations in the path that could result from independent string simulations that also used less sampling to obtain the final path. This demonstration highlights the need for sampling in order to obtain sufficiently precise quantitative estimates of the free energy profiles and barriers. In turn, this stresses the importance of developing accurate QM/MM- Δ MLP models that are able to access time scales well beyond that of conventional ab initio QM/MM methods. This becomes even more critical when one considers more computationally intensive PIMD simulations in the Section 2.3.

2.3. Free Energy Profiles of G-T Mispair from Path Integral Molecular Dynamics

Figure 6 compares the classical and PIMD free energy profiles using the QM/MM- Δ MLP model. The differences in free energy values between classical MD and PIMD, as well as approximate changes in rates and populations, are summarized in the supporting information (Table S1). For the wGT→GT* and wGT→G*T reactions, the profiles and barriers are similar. This is due to the nature of the rate-controlling transition state that involves a conformational event (base shearing, as illustrated in Figure 3 and Figure 4, TSa and TSb1), as opposed to a proton transfer event. Alternatively, in the case of the GT*→G*T, it is the proton transfer itself (TSc in Figure 3 and Figure 4) that characterizes the rate-controlling transition state, and the barrier is predicted to be significantly lower with PIMD.
Generally, the inclusion of nuclear quantum effects does not significantly alter the reaction free energies for the systems examined here. This implies that the populations of the tautomer states are predicted by classical and PIMD simulations to be very similar. The most change in populations is for GT*→G*T, where the Δ A value for the reaction changes from −0.43 to 0.26 kcal/mol. The concerted proton transfer involved in the equilibrium between GT* and G*T is predicted to be more rapid when nuclear quantum effects are included. A rough estimate based only on the Δ Δ A values suggests that PIMD would increase the forward rate by a factor of 123 and the reverse rate by a factor of 395.
The evolution of the key distances as a function of the reaction path progress variable [ 0 , 1 ] , as well as the proton ring polymer degree of expansion ( Δ | r H | , Equation (1)) that is a measure of the delocalization of the proton wave packet [40], is shown in Figure 7. In each case, pronounced delocalization of the proton wave packet occurs at the crossing point of covalent bond formation and cleavage distances involved in the proton transfer. In the wGT→GT* and wGT→G*T reactions, the proton transfers are distinctly separated, whereas in the GT*→G*T reaction, they are concerted and appear in close proximity in terms of the path progress variable. Similar behavior has been observed for proton transfer reactions elsewhere [40].

2.4. Comparison of Various Models with Experiment

Table 1 compares the free energy values for each of the models with values reported by Li et al. [23] and from the experiment. Differences in free energies with respect to the experiment are listed as Δ values. Overall, the free energy differences Δ A between the GT wobble pair and the GT* and G*T states are predicted from calculations to be too endoergic. The AM1/d model performs the worst, with Δ A values exceeding 8 kcal/mol, whereas the PBE0/6-31G* QM/MM and QM/MM- Δ MLP models have errors below 4 kcal/mol. For the wGT→GT* reaction, Li et al. [23] reported a value of 6.00 kcal/mol, which is in good agreement with the experimentally derived value of 4.43 kcal/mol (estimated from the measured rate using classical transition state theory with a unit transmission coefficient). The free energy barriers for the wGT→GT* reaction are overestimated using PBE0/6-31G* QM/MM and QM/MM- Δ MLP, while they appear underestimated for GT*→G*T.
The general picture that emerges from the current work together with previous computational [23,34] and experimental [20,21,22] work is that for the B-DNA system, there is a relatively high barrier for the tautomerization of the wGT wobble pair to form either GT* or G*T tautomer states. The barrier for interconversion between GT*→G*T is much lower, and this barrier is predicted to be considerably influenced by nuclear quantum effects. Likely, this implies that there is a rapid equilibrium between GT* and G*T states that occurs on a time scale much faster than the rate of transition from wGT.
In a broader context, as alternative tautomeric forms of DNA bases have been implicated in DNA replication, it is of interest to consider how tautomerization is affected by the environment of a DNA polymerase. In the study of Li et al. [23], the GT*→G*T tautomerization was studied in B-DNA and also in a DNA polymerase λ variant. The study found that the environment of the polymerase had a significant influence on the thermodynamics, kinetics, and mechanism. Further, Slocombe et al. examined quantum tunneling effects for tautomerization in GT wobble pairs by coupling first-principles quantum chemistry calculations with an open quantum system master equation [26]. Results predicted the existence of a transient “tunneling-ready” state in the polymerase active site that led to a 100-fold increase in rate. In the present work, simulations were not performed in the polymerase environment. Nonetheless, given that the environment alters the population of G-T mispairs and the transition state ensemble for tautomerization [23], together with evidence for a “tunneling-ready” state in the polymerase [26], it is likely that polymerase binding would impact the nuclear quantum effects. The degree to which nuclear quantum effects are influenced by different environments is a subject of continued interest and future work.

2.5. Benefits and Pitfalls of the QM/MM+ Δ MLP Approach

The approach taken in the current work is to train a QM/MM+ Δ MLP model to closely reproduce ab initio QM/MM simulation data at the PBE0/6-31G* level in order to accelerate path sampling convergence and enable more practical PIMD simulations. The advantage of such an approach is that once developed, the QM/MM+ Δ MLP model affords a factor of roughly 300-fold speedup relative to (PBE0/6-31G*)/MM simulations with an accuracy of free energy values within 1 kcal/mol (Table 1). The disadvantage of this approach is that one must train the QM/MM+ Δ MLP model itself, and this will depend on the training data. In the present case, we train against (PBE0/6-31G*)/MM energy and force data. The reliability of these data will depend not only on the level of quantum theory (e.g., the chosen exchange–correlation functional and basis set) but also on the QM/MM interaction parameters (i.e., the MM charges and Lennard–Jones parameters). Recently, the ω B97M-D3BJ/def2-TZVPPD level of theory has been demonstrated to be highly reliable and has been used to create databases for intermolecular interactions [41] that in turn have been used to create machine learning potentials [42] for drug discovery. Such an approach, however, becomes more difficult when transferred to the study of complex chemical reactions that occur in heterogeneous condensed-phase environments such as those considered in the current work. An additional concern when applying a QM/MM+ Δ MLP model is that it may not reliably represent the target ab initio QM/MM model outside the scope of the training data ensemble. One possible mechanism to perform a consistency check is to use a weighted thermodynamic perturbation approach [43] or generalized multireference variant [44]. These methods potentially can correct the approximate QM/MM+ Δ MLP free energy surface to the target ab initio QM/MM level, provided that there is sufficient phase space overlap as indicated by an analysis of the reweighting entropies.
Table 1. Free energy values along the reaction path for tautomerization reactions using different QM models and methods a.
Table 1. Free energy values along the reaction path for tautomerization reactions using different QM models and methods a.
ReactionMethod Δ A ( σ err ) Δ Δ A f ( σ err ) Δ Δ A r ( σ err ) Δ
wGT→GT*Expt. [21]4.43 16.88 12.45
PBE0/6-31G*3.97 (0.04)−0.4620.41 (0.05)3.5316.45 (0.04)4.00
AM1/d9.54 (0.04)5.1123.51 (0.06)6.6313.98 (0.05)1.53
QM/MM- Δ MLP3.6 7(0.07)−0.7621.01 (0.07)4.1317.34 (0.07)4.89
PIMD3.48 (0.03)−0.9520.65 (0.04)3.7717.16 (0.03)4.71
Li et al. [23]6.001.5715.70−1.189.70−2.75
wGT→G*TExpt. [21]3.82
PBE0/6-31G*3.06 (0.04)−0.7620.08 (0.05) 17.02 (0.05)
AM1/d8.19 (0.04)4.3723.55 (0.04) 15.36 (0.04)
QM/MM- Δ MLP3.24 (0.07)−0.5821.02 (0.07) 17.78(0.07)
PIMD3.75 (0.04)−0.0720.63 (0.04) 16.88 (0.04)
Li et al. [23]N/A N/A N/A
GT*→G*TExpt. [21]−0.62 9.21 9.83
PBE0/6-31G*−0.91 (0.03)−0.296.72 (0.05)−2.497.63 (0.05)−2.20
AM1/d−1.36 (0.03)−0.7410.26 (0.03)1.0511.62 (0.04)1.79
QM/MM- Δ MLP−0.43 (0.07)0.197.06 (0.07)−2.157.49 (0.07)−2.34
PIMD0.26 (0.03)0.884.21 (0.03)−5.003.95 (0.04)−5.88
Li et al. [23]−0.200.425.70−3.515.90−3.93
a  σ err refers to the standard error of the mean for 4 trial runs; Δ refers to the difference between the current method and experimental results. Experimental results are obtained from Ref. [21]; Li et al. results were obtained in Ref. [23], which were calculated with ω B97X-D3/6-311G** and 13 reaction coordinates for a B-DNA system built by NAB program in AmberTools [45].

3. Materials and Methods

3.1. Background

There have been a number of quantum mechanical studies that have investigated the tautomerization of nucleobases in DNA and RNA [2,6,25,46,47,48,49,50,51] as well as extended synthetic genetic alphabets [2] that have great promise in the development of new technology. There have been fewer computational studies that utilize QM/MM simulations [23,52] to characterize the tautomerization reaction free energy path in complex condensed-phase environments. Further, there have been relatively few studies that have explored the contribution of nuclear quantum effects on the proton transfers involved in the tautomerization of nucleic acids [26,52,53,54].
One strategy that has been used to study nuclear quantum effects (within the Born–Oppenheimer approximation) in condensed-phase simulations of biological systems has been to use path integral molecular dynamics (PIMD) simulation methods [55,56,57,58,59]. Kosugi et al. [40] calculated among the first multidimensional quantum free energy surfaces using PIMD and a DFTB QM/MM model of a proton transfer reaction in a 2,4-dichlorophenol-trimethylamine complex. The QM/MM model’s empirical repulsive potential was parametrized to reproduce high-level QM reference data. The calculations used the multidimensional blue moon ensemble method to achieve a converged free energy surface in a tour de force benchmark demonstration. Sauceda et al. [60] more recently provided evidence that nuclear quantum effects enhance electronic interactions, leading to strengthening of covalent and noncovalent molecular interactions at finite temperature. The study used the symmetric gradient-domain machine learning (sGDML) framework [61,62,63] trained against high-level data sets [64,65]. The inclusion of nuclear quantum effects with PIMD can become cost-prohibitive at the ab initio QM/MM level when the quantum region becomes too large and/or has too intensive sampling requirements [66,67]. An alternative strategy is to use a fast, approximate, semiempirical [68] or density-functional tight-binding [69] QM model within a QM/MM or so-called quantum mechanical force field [66,67,70] framework. These models can be parameterized to improve their accuracy, for example, using force matching to higher levels [71,72,73]. Machine learning potentials (MLPs) have shown particular promise in enhancing the accuracy and performance of condensed-phase simulations of chemical reactions [33,74,75,76,77,78]. Of particular relevance to the current work is the development of QM/MM- Δ MLP models, whereby the energies and forces of a fast, approximate QM model are corrected with a machine learning potential [76,79,80,81,82,83,84,85]. These are described in more detail in the supporting information.

3.2. Free Energy Profiles of Tautomer Reactions from Classical Molecular Dynamics

The model system was prepared from the crystal structure of B-DNA with G-T wobble pair mismatches (PDB ID:113D) [27]. Experimental NMR data for the G-T mispair dynamics were previously reported for this system [20,21,22]. The model contains 762 solute atoms (including hydrogen atoms). We solvated the system with 5151 water molecules in a truncated octahedron with 59.3 Å real-space lattice vector lengths. A total of 13 Cl and 35 Na+ ions were added to neutralize the charge and produce a 0.14 M ion concentration. The system was equilibrated with an MM potential following the procedure described in Ref. [86]. The procedure includes geometry optimization, heating, solvent annealing, and equilibration of the system density while restraining solute heavy atoms. The solute restraints are gradually reduced over the course of the equilibration procedure. This procedure involved 6.2 ns and 2.6 ns of simulation in the N V T and N P T ensembles, respectively, using a 1 fs time step. The system was then equilibrated for an additional 100 ns in the N P T ensemble. A Langevin thermostat [87] was used at a 5.0 ps−1 collision frequency to maintain the temperature at 298 K, and the density was equilibrated at 1 atm using a Berendsen isotropic barostat.
The DNA was modeled with the OL15 force field [88]. The SPC/Fw water model [89] and the Li et al. ion parameters [90] were used to represent the solvent environment. The particle mesh Ewald method [91,92] was used to calculate electrostatic interactions with a 10 Å real-space cutoff, a 1 Å reciprocal space grid spacing, and tinfoil boundary conditions.
The tautomeric reactions were decomposed 3 steps. Each step was a path that connected 2 tautomeric forms. The free energy profiles of each step were calculated from QM/MM umbrella sampling. The “wGT” state corresponded to the G-T wobble base pair in which the T21:N3 position was protonated. The “GT*” state was a tautomer in which the T21:H3 proton was covalent-bonded to the T21:O4 position. Similarly, the “G*T” state was a tautomer in which the H3 proton was bound to the G4:O6. The 3 steps involved proton displacement between these states: wGT→GT*, wGT→G*T, GT*→G*T.
The free energy profiles were calculated with several QM/MM potentials. The QM/MM methods compared in this work are as follows: PBE0/6-31G*, AM1/d [93,94,95], and the QM/MM- Δ MLP method described above. All QM/MM simulations were performed with rigorous long-range electrostatic interactions under periodic boundary conditions using linear-scaling QM/MM Ewald methods [32,96]. The QM region consisted of 51 atoms (the nucleobase and sugar of G4 and T21), and it had a net neutral charge. The minimum free energy path of each step was optimized in the space of 5 reaction coordinates with the finite temperature string method [38] for every QM/MM model. The 5 reaction coordinates were distance differences meant to represent the transfer of H3 and H1 protons and the relative displacement of the hydrogen bond pattern: ξ 1 = R N 3 H 3 R O 6 H 3 , ξ 2 = R O 6 H 3 R O 4 H 3 , ξ 3 = R N 1 H 1 R N 3 H 1 , ξ 4 = R N 1 O 2 R N 2 O 2 , ξ 5 = R N 2 O 2 R N 1 N 3 . The umbrella biasing potentials used 200 kcal mol−1 Å−2 force constants. The string method began from a linear interpolation of the reaction coordinates between the step’s reactant and product states. Each string was sampled with 32 umbrella windows, and the initial configurations were prepared in sequence starting from the reactant state. The QM/MM structure of each window was geometry-optimized for 1000 steps using the conjugate gradient method. The temperature was raised from 0 K to 298 K over the course of 40 ps, and 10 ps of QM/MM equilibration was performed in the N V T ensemble. The string method was performed for 50 iterations, and sampling occurred at 4 ps/window/iteration using 1 fs time step. Four sets of production simulations were performed along the optimized path using different thermostat random number seeds. Each production simulation included 25 ps/window of sampling, and the reaction coordinate values were recorded every 10 fs. The aggregate sampling from all 3 reaction steps was analyzed with the multistate Bennett’s acceptance ratio (MBAR) method [97], as implemented in the ndfes module in FE-ToolKit software [98,99], distributed within AmberTools [100], to produce 5-dimensional free energy surfaces. The profiles presented in this manuscript are the free energy values within the 5-dimensional surface evaluated along the minimum free energy path. PBE0/6-31G* simulations are very expensive; for this particular case, only 10 iterations of the string method were performed, starting from the optimized AM1/d path. The PBE0/6-31G* sampling was limited to 2 ps/window/iteration, and 4 sets of production simulations were run for 10 ps/window/trial with a 1 fs time step. The PBE0/6-31G* QM/MM electrostatics were calculated with the ambient-potential composite Ewald method [32] using a 10 Å real-space cutoff, 1 Å reciprocal space grid spacing, and tinfoil boundary conditions.

3.3. Free Energy Profiles of Tautomer Reactions from PIMD

We performed PIMD simulations using i-PI 2.0 software [36] to treat nuclear quantum effects. i-PI software [36] is a standalone molecular dynamics program that supports state-of-the-art path integral sampling [101], including the PIGLET thermostat, which has been shown to reduce the expense of computing quantum kinetic energy [102,103]. The path integral dynamics was performed with a ring polymer Hamiltonian consisting of several replicas (beads) that were harmonically restrained in series. At each time step, the potential energy of each bead must be computed; however, these calculations are independent and can be performed in parallel. We modified the Sander program to act as a driver program [34], whereby multiple instances of Sander can be launched, and communication between i-Pi and the pool of driver programs occurred through a network socket interface.
We recalculated the free energy profiles from PIMD umbrella sampling. The profiles generated from classical and path integral dynamics were compared to explore how nuclear quantum effects change the free energy predictions. The PIMD dynamics was propagated with 6 beads (replicas) and a 0.25 fs time step, and the temperature was maintained at 298 K using the PIGLET quantum thermostat [102,104], whose parameters were taken from the GLE4MD website [105,106]. The parameters were chosen to reproduce the quantum fluctuations at 298 K and span a range of frequencies up to 4142 cm−1. The PIMD umbrella sampling was performed with 32 windows using 200 kcal mol−1 Å−2 force constants. The windows uniformly discretized the optimized path obtained from classical dynamics. The PIMD restraint potentials were applied to the centroid positions by making use of the i-PI interface with PLUMED [107,108]. Each window was sampled for 2.5 ps, and the sampling was repeated 4 times with different thermostat random number seeds. The reaction coordinate values were calculated from the centroid positions, and the free energy surface was generated from MBAR analysis of the aggregate sampling obtained from the 4 trials of the 3 reaction steps.
Following Ref. [40], the interpretation of the PIMD results was aided by tracking the “degree of expansion” of the transferring proton ring polymer, Δ | r H | .
Δ | r H | = 1 P n = 1 P | r H , n u H , c |
where r H , n is the position vector of bead n, u H , c is the centroid position vector, and P is the number of beads. The degree of expansion is simply the average bead–centroid distance for the proton involved in the tautomeric reaction.

4. Conclusions

This work presents free energy simulation results of GT mispair tautomerization reactions in B-DNA using semiempirical QM/MM, ab initio QM/MM, and a new QM/MM- Δ MLP model that was specifically trained for this application. We demonstrate that short (2.5 ps) sampling of the reaction path discretized into 32 string images/umbrella windows leads to large deviations (up to 3.4 kcal/mol) in the calculated transition state barriers. The QM/MM- Δ MLP model enables greater sampling, which in turn leads to higher-precision free energy estimates, as well as offering the capability to include nuclear quantum effects through path integral molecular dynamics simulation. PIMD simulations produced barriers for the GT*→G*T interconversion that were significantly lower than those calculated from classical MD. Collectively, the results suggest that there is a rapid equilibrium for GT*⇌G*T but a relatively large barrier for the tautomerization of the wGT wobble pair to form either GT* or G*T tautomer states.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/molecules29112703/s1, References [33,34,67,74,75,76,77,78,79,80,100,109,110,111,112,113] are cited in Supplementary Materials file.

Author Contributions

Conceptualization, D.M.Y.; methodology, Y.T. and T.J.G.; software, T.J.G.; validation, Y.T.; formal analysis, Y.T.; investigation, Y.T.; resources, D.M.Y.; data curation, Y.T.; writing—original draft preparation, Y.T. and D.M.Y.; writing—review and editing, D.M.Y., Y.T. and T.J.G.; visualization, Y.T.; supervision, D.M.Y.; project administration, D.M.Y.; funding acquisition, D.M.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Institutes of Health (No. GM62248 to D.M.Y.).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article/supplementary material; further inquiries can be directed to the corresponding author/s.

Acknowledgments

Computational resources were provided by the Office of Advanced Research Computing (OARC) at Rutgers, The State University of New Jersey. This work used the supercomputer Frontera [114] at the Texas Advanced Computing Center (TACC) through allocation CHE20002, and the supercomputer Expanse at the San Diego Supercomputer Center (SDSC) through allocation CHE190067 from the Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support (ACCESS) program [115], which is supported by National Science Foundation grants #2138259, #2138286, #2138307, #2137603, and #2138296.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
QMQuantum mechanics
MMMolecular mechanics
MLPMachine learning potential correction
PIMDPath integral molecular dynamics

References

  1. Serrano-Andrés, L.; Merchán, M. Are the five natural DNA/RNA base monomers a good choice from natural selection? J. Photochem. Photobiol. C 2009, 10, 21–32. [Google Scholar] [CrossRef]
  2. Eberlein, L.; Beierlein, F.R.; van Eikema Hommes, N.J.R.; Radadiya, A.; Heil, J.; Benner, S.A.; Clark, T.; Kast, S.M.; Richards, N.G.J. Tautomeric Equilibria of Nucleobases in the Hachimoji Expanded Genetic Alphabet. J. Chem. Theory Comput. 2020, 16, 2766–2777. [Google Scholar] [CrossRef] [PubMed]
  3. Eichler, E.E. Genetic Variation, Comparative Genomics, and the Diagnosis of Disease. N. Engl. J. Med. 2019, 381, 64–74. [Google Scholar] [CrossRef] [PubMed]
  4. Gregory, T.R. Understanding Natural Selection: Essential Concepts and Common Misconceptions. Evo Edu Outreach 2009, 2, 156–175. [Google Scholar] [CrossRef]
  5. Gheorghiu, A.; Coveney, P.V.; Arabi, A.A. The influence of base pair tautomerism on single point mutations in aqueous DNA. Interface Focus 2020, 10, 20190120. [Google Scholar] [CrossRef] [PubMed]
  6. Srivastava, R. The Role of Proton Transfer on Mutations. Front. Chem. 2019, 7, 536. [Google Scholar] [CrossRef] [PubMed]
  7. Bebenek, K.; Pedersen, L.C.; Kunkel, T.A. Replication infidelity via a mismatch with Watson–Crick geometry. Proc. Natl. Acad. Sci. USA 2011, 108, 1862–1867. [Google Scholar] [CrossRef] [PubMed]
  8. Perera, R.L.; Torella, R.; Klinge, S.; Kilkenny, M.L.; Maman, J.D.; Pellegrini, L. Mechanism for priming DNA synthesis by yeast DNA Polymerase α. elife 2013, 2, 00482. [Google Scholar] [CrossRef]
  9. Topal, M.D.; Fresco, J.R. Complementary base pairing and the origin of substitution mutations. Nature 1976, 263, 285–289. [Google Scholar] [CrossRef]
  10. Tomasetti, C.; Li, L.; Vogelstein, B. Stem cell divisions, somatic mutations, cancer etiology, and cancer prevention. Science 2017, 355, 1330–1334. [Google Scholar] [CrossRef]
  11. Mendiratta, G.; Ke, E.; Aziz, M.; Liarakos, D.; Tong, M.; Stites, E.C. Cancer gene mutation frequencies for the U.S. population. Nat. Commun. 2021, 12, 5961. [Google Scholar] [CrossRef]
  12. Watson, J.D.; Crick, F.H.C. Molecular structure of nucleic acids. Nature 1953, 171, 737–738. [Google Scholar] [CrossRef]
  13. Rozov, A.; Demeshkina, N.; Westhof, E.; Yusupov, M.; Yusupova, G. New Structural Insights into Translational Miscoding. Trends Biochem. Sci. 2016, 41, 798–814. [Google Scholar] [CrossRef]
  14. de Vries, M.S. Gas-Phase IR Spectroscopy of Nucleobases. Top. Curr. Chem. 2014, 364, 271–297. [Google Scholar]
  15. Allawi, H.T.; SantaLucia, J., Jr. Thermodynamics and NMR of Internal G·T Mismatches in DNA. Biochemistry 1997, 36, 10581–10594. [Google Scholar] [CrossRef] [PubMed]
  16. Waters, T.R.; Swann, P.F. Kinetics of the Action of Thymine DNA Glycosylase. J. Biol. Chem. 1998, 273, 20007–20014. [Google Scholar] [CrossRef]
  17. Fox, K.R.; Allinson, S.L.; Sahagun-Krause, H.; Brown, T. Recognition of GT mismatches by Vsr mismatch endonuclease. Nucleic Acids Res. 2000, 28, 2535–2540. [Google Scholar] [CrossRef] [PubMed]
  18. Osakada, Y.; Kawai, K.; Fujitsuka, M.; Majima, T. Kinetics of charge transfer in DNA containing a mismatch. Nucleic Acids Res. 2008, 36, 5562–5570. [Google Scholar] [CrossRef]
  19. Koag, M.C.; Nam, K.; Lee, S. The spontaneous replication error and the mismatch discrimination mechanisms of human DNA polymerase β. Nucleic Acids Res. 2014, 42, 11233–11245. [Google Scholar] [CrossRef] [PubMed]
  20. Kimsey, I.J.; Petzold, K.; Sathyamoorthy, B.; Stein, Z.W.; Al-Hashimi, H.M. Visualizing transient Watson–Crick-like mispairs in DNA and RNA duplexes. Nature 2015, 519, 315–320. [Google Scholar] [CrossRef]
  21. Kimsey, I.J.; Szymanski, E.S.; Zahurancik, W.J.; Shakya, A.; Xue, Y.; Chu, C.C.; Sathyamoorthy, B.J.; Suo, Z.; Al-Hashimi, H.M. Dynamic basis for dG.dT misincorporation via tautomerization and ionization. Nature 2018, 554, 195–201. [Google Scholar] [CrossRef]
  22. Szymanski, E.S.; Kimsey, I.J.; Al-Hashimi, H.M. Direct NMR Evidence that Transient Tautomeric and Anionic States in dG·dT Form Watson–Crick-like Base Pairs. J. Am. Chem. Soc. 2017, 139, 4326–4329. [Google Scholar] [CrossRef] [PubMed]
  23. Li, P.; Rangadurai, A.; Al-Hashimi, H.M.; Hammes-Schiffer, S. Environmental Effects on Guanine-Thymine Mispair Tautomerization Explored with Quantum Mechanical/Molecular Mechanical Free Energy Simulations. J. Am. Chem. Soc. 2020, 142, 11183–11191. [Google Scholar] [CrossRef] [PubMed]
  24. Maximoff, S.N.; Kamerlin, S.C.L.; Florián, J. DNA Polymerase lambda Active Site Favors a Mutagenic Mispair between the Enol Form of Deoxyguanosine Triphosphate Substrate and the Keto Form of Thymidine Template: A Free Energy Perturbation Study. J. Phys. Chem. B 2017, 121, 7813–7822. [Google Scholar] [CrossRef]
  25. Chandorkar, S.; Raghunathan, S.; Jaganade, T.; Priyakumar, U.D. Multiscale Modeling of Wobble to Watson–Crick-Like Guanine–Uracil Tautomerization Pathways in RNA. Int. J. Mol. Sci. 2021, 22, 5411. [Google Scholar] [CrossRef] [PubMed]
  26. Slocombe, L.; Winokan, M.; Al-Khalili, J.; Sacchi, M. Quantum Tunnelling Effects in the Guanine-Thymine Wobble Misincorporation via Tautomerism. J. Phys. Chem. Lett. 2023, 14, 9–15. [Google Scholar] [CrossRef] [PubMed]
  27. Hunter, W.N.; Brown, T.; Kneale, G.; Anand, N.N.; Rabinovich, D.; Kennard, O. The Structure of Guanosine-Thymidine Mismatches in B-DNA at 2.5 Angstroms Resolution. J. Biol. Chem. 1993, 262, 9962–9970. [Google Scholar] [CrossRef]
  28. Chai, J.D.; Head-Gordon, M. Systematic optimization of long-range corrected hybrid density functionals. J. Chem. Phys. 2008, 128, 084106. [Google Scholar] [CrossRef]
  29. Lin, Y.S.; Li, G.D.; Mao, S.P.; Chai, J.D. Long-Range Corrected Hybrid Density Functionals with Improved Dispersion Corrections. J. Chem. Theory Comput. 2013, 9, 263–272. [Google Scholar] [CrossRef]
  30. Krishnan, R.; Binkley, J.S.; Seeger, R.; Pople, J.A. Self-consistent molecular orbital methods. XX. A basis set for correlated wave functions. J. Chem. Phys. 1980, 72, 650–654. [Google Scholar] [CrossRef]
  31. McLean, A.D.; Chandler, G.S. Contracted Gaussian basis sets for molecular calculations. I. Second row atoms, Z=11-18. J. Chem. Phys. 1980, 72, 5639–5648. [Google Scholar] [CrossRef]
  32. Giese, T.J.; York, D.M. Ambient-Potential Composite Ewald Method for ab Initio Quantum Mechanical/Molecular Mechanical Molecular Dynamics Simulation. J. Chem. Theory Comput. 2016, 12, 2611–2632. [Google Scholar] [CrossRef] [PubMed]
  33. Giese, T.J.; Zeng, J.; Ekesan, Ş.; York, D.M. Combined QM/MM, Machine Learning Path Integral Approach to Compute Free Energy Profiles and Kinetic Isotope Effects in RNA Cleavage Reactions. J. Chem. Theory Comput. 2022, 18, 4304–4317. [Google Scholar] [CrossRef] [PubMed]
  34. Tao, Y.; Giese, T.J.; Ekesan, Ş.; Zeng, J.; Aradi, B.; Hourahine, B.; Aktulga, H.M.; Götz, A.W.; Merz, K.M., Jr.; York, D.M. Amber free energy tools: Interoperable software for free energy simulations using generalized quantum mechanical/molecular mechanical and machine learning potentials. J. Chem. Phys. 2024; in press. [Google Scholar]
  35. Case, D.A.; Aktulga, H.M.; Belfon, K.; Ben-Shalom, I.Y.; Berryman, J.T.; Brozell, S.R.; Cerutti, D.S.; Cheatham, T.E., III; Cisneros, G.A.; Cruzeiro, V.W.D.; et al. Amber 2023; University of California: San Francisco, CA, USA, 2023. [Google Scholar]
  36. Kapil, V.; Rossi, M.; Marsalek, O.; Petraglia, R.; Litman, Y.; Spura, T.; Cheng, B.; Cuzzocrea, A.; Meißner, R.H.; Wilkins, D.M.; et al. i-PI 2.0: A universal force engine for advanced molecular simulations. Comput. Phys. Commun. 2019, 236, 214–223. [Google Scholar] [CrossRef]
  37. Rosta, E.; Nowotny, M.; Yang, W.; Hummer, G. Catalytic Mechanism of RNA Backbone Cleavage by Ribonuclease H from Quantum Mechanics/Molecular mechanics simulations. J. Am. Chem. Soc. 2011, 133, 8934–8941. [Google Scholar] [CrossRef] [PubMed]
  38. Giese, T.J.; Ekesan, Ş.; McCarthy, E.; Tao, Y.; York, D.M. Surface-Accelerated String Method for Locating Minimum Free Energy Paths. J. Chem. Theory Comput. 2024, 20, 2058–2073. [Google Scholar] [CrossRef] [PubMed]
  39. Dickerson, R. Definitions and nomenclature of nucleic acid structure components. Nucleic Acids Res. 1989, 17, 1797–1803. [Google Scholar] [CrossRef] [PubMed]
  40. Kosugi, K.; Nakano, H.; Sato, H. SCC-DFTB-PIMD Method To Evaluate a Multidimensional Quantum Free-Energy Surface for a Proton-Transfer Reaction. J. Chem. Theory Comput. 2019, 15, 4965–4973. [Google Scholar] [CrossRef] [PubMed]
  41. Eastman, P.; Behara, P.K.; Dotson, D.L.; Galvelis, R.; Herr, J.E.; Horton, J.T.; Mao, Y.; Chodera, J.D.; Pritchard, B.P.; Wang, Y.; et al. SPICE, A Dataset of Drug-like Molecules and Peptides for Training Machine Learning Potentials. Sci. Data 2023, 10, 11. [Google Scholar] [CrossRef]
  42. Giese, T.J.; Zeng, J.; Lerew, L.; McCarthy, E.; Tao, Y.; Ekesan, Ş.; York, D.M. Software infrastructure for next-generation QM/MM–ΔMLP force fields. J. Phys. Chem. 2024; in press. [Google Scholar]
  43. Li, P.; Jia, X.; Pan, X.; Shao, Y.; Mei, Y. Accelerated Computation of Free Energy Profile at ab Initio Quantum Mechanical/Molecular Mechanics Accuracy via a Semi-Empirical Reference Potential. I. Weighted Thermodynamics Perturbation. J. Chem. Theory Comput. 2018, 14, 5583–5596. [Google Scholar] [CrossRef]
  44. Giese, T.J.; Zeng, J.; York, D.M. Multireference Generalization of the Weighted Thermodynamic Perturbation Method. J. Phys. Chem. A 2022, 126, 8519–8533. [Google Scholar] [CrossRef] [PubMed]
  45. Case, D.A.; Cheatham, T.E., III; Darden, T.; Gohlke, H.; Luo, R.; Merz, K.M.; Onufriev, A.; Simmerling, C.; Wang, B.; Woods, R.J. The Amber biomolecular simulation programs. J. Comput. Chem. 2005, 26, 1668–1688. [Google Scholar] [CrossRef]
  46. Colominas, C.; Luque, F.J.; Orozco, M. Tautomerism and protonation of guanine and cytosine. Implications in the formation of hydrogen-bonded complexes. J. Am. Chem. Soc. 1996, 118, 6811–6821. [Google Scholar] [CrossRef]
  47. Leszczynski, J. The Potential Energy Surface of Guanine Is Not Flat: An ab Initio Study with Large Basis Sets and Higher Order Electron Correlation Contributions. J. Phys. Chem. A 1998, 102, 2357–2362. [Google Scholar] [CrossRef]
  48. Zhanpeisov, N.U.; Šponer, J.; Leszczynski, J. Reverse Watson-Crick isocytosine-cytosine and guanine-cytosine base pairs stabilized by the formation of the minor tautomers of bases. An ab initio study in the gas phase and in a water cluster. J. Phys. Chem. A 1998, 102, 10374–10379. [Google Scholar] [CrossRef]
  49. Šponer, J.; Šponer, J.E.; Gorb, L.; Leszczynski, J.; Lippert, B. Metal-stabilized rare tautomers and mispairs of DNA bases: N6-metalated adenine and N4-metalated cytosine, theoretical and experimental views. J. Phys. Chem. A 1999, 103, 11406–11413. [Google Scholar] [CrossRef]
  50. Bezbaruah, B.; Medhi, C. Quantum Mechanical Study on the Proton Transfer Mechanism within Adenine-thymine and Guanine-cytosine Base Pairs of DNA Nucleobase. Indian J. Adv. Chem. Sci. 2016, 4, 314–320. [Google Scholar]
  51. Dwi Hartono, Y.; Pabon-Martinez, Y.V.; Uyar, A.; Wengel, J.; Lundin, K.E.; Zain, R.; Smith, C.I.E.; Nilsson, L.; Villa, A. Role of Pseudoisocytidine Tautomerization in Triplex-Forming Oligonucleotides: In Silico and in Vitro Studies. ACS Omega 2017, 2, 2165–2177. [Google Scholar] [CrossRef]
  52. Slocombe, L.; Al-Khalili, J.S.; Sacchi, M. Quantum and classical effects in DNA point mutations: Watson–Crick tautomerism in AT and GC base pairs. Phys. Chem. Chem. Phys. 2021, 23, 4141–4150. [Google Scholar] [CrossRef]
  53. Fang, W.; Chen, J.; Rossi, M.; Feng, Y.; Li, X.Z.; Michaelides, A. Inverse Temperature Dependence of Nuclear Quantum Effects in DNA Base Pairs. J. Phys. Chem. Lett. 2016, 7, 2125–2131. [Google Scholar] [CrossRef]
  54. Pohl, R.; Socha, O.; Slavíček, P.; Šála, M.; Hodgkinson, P.; Dračínský, M. Proton transfer in guanine–cytosine base pair analogues studied by NMR spectroscopy and PIMD simulations. Faraday Discuss. 2018, 212, 331–344. [Google Scholar] [CrossRef]
  55. Voth, G.A. Feynman path integral formulation of quantum mechanical transition-state theory. J. Phys. Chem. 1993, 97, 8365–8377. [Google Scholar] [CrossRef]
  56. Hwang, J.K.; Warshel, A. How important are quantum mechanical nuclear motions in enzyme catalysis? J. Am. Chem. Soc. 1996, 118, 11745–11751. [Google Scholar] [CrossRef]
  57. Wang, L.; Fried, S.D.; Boxer, S.G.; Markland, T.E. Quantum delocalization of protons in the hydrogen-bond network of an enzyme active site. Proc. Natl. Acad. Sci. USA 2014, 111, 18454–18459. [Google Scholar] [CrossRef]
  58. Vardi-Kilshtain, A.; Nitoker, N.; Major, D.T. Nuclear quantum effects and kinetic isotope effects in enzyme reactions. Arch. Biochem. Biophys. 2015, 582, 18–27. [Google Scholar] [CrossRef]
  59. Gao, J. Chapter Fourteen—Enzymatic Kinetic Isotope Effects from Path-Integral Free Energy Perturbation Theory. Methods Enzymol. 2016, 577, 359–388. [Google Scholar]
  60. Sauceda, H.E.; Vassilev-Galindo, V.; Chmiela, S.; Müller, K.R.; Tkatchenko, A. Dynamical strengthening of covalent and non-covalent molecular interactions by nuclear quantum effects at finite temperature. Nat. Commun. 2021, 12, 442. [Google Scholar] [CrossRef]
  61. Chmiela, S.; Tkatchenko, A.; Sauceda, H.E.; Poltavsky, I.; Schütt, K.T.; Müller, K.R. Machine learning of accurate energy-conserving molecular force fields. Sci. Adv. 2017, 3, 1603015. [Google Scholar] [CrossRef]
  62. Chmiela, S.; Sauceda, H.E.; Müller, K.R.; Tkatchenko, A. Towards exact molecular dynamics simulations with machine-learned force fields. Nat. Commun. 2018, 9, 3887. [Google Scholar] [CrossRef]
  63. Chmiela, S.; Sauceda, H.E.; Tkatchenko, A.; Müller, K.R. Accurate Molecular Dynamics Enabled by Efficient Physically Constrained Machine Learning Approaches. In Machine Learning Meets Quantum Physics; Schütt, K.T., Chmiela, S., von Lilienfeld, O.A., Tkatchenko, A., Tsuda, K., Müller, K.R., Eds.; Springer International Publishing: Cham, Switzerland, 2020; Volume 968, pp. 129–154. [Google Scholar]
  64. Rupp, M.; Tkatchenko, A.; Müller, K.R.; von Lilienfeld, O. Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning. Phys. Rev. Lett. 2012, 108, 058301. [Google Scholar] [CrossRef]
  65. Blum, L.C.; Reymond, J.L. 970 Million Druglike Small Molecules for Virtual Screening in the Chemical Universe Database GDB-13. J. Am. Chem. Soc. 2009, 131, 8732–8733. [Google Scholar] [CrossRef] [PubMed]
  66. Giese, T.J.; Huang, M.; Chen, H.; York, D.M. Recent Advances toward a General Purpose Linear-Scaling Quantum Force Field. Acc. Chem. Res. 2014, 47, 2812–2820. [Google Scholar] [CrossRef] [PubMed]
  67. Giese, T.J.; York, D.M. Quantum mechanical force fields for condensed phase molecular simulations. J. Phys. Condens. Matter 2017, 29, 383002. [Google Scholar] [CrossRef] [PubMed]
  68. Margraf, J.T.; Hennemann, M.; Clark, T. EMPIRE: A highly parallel semiempirical molecular orbital program: 3: Born-Oppenheimer molecular dynamics. J. Mol. Model. 2020, 26, 43. [Google Scholar] [CrossRef] [PubMed]
  69. Hourahine, B.; Aradi, B.; Blum, V.; Bonafe, F.; Buccheri, A.; Camacho, C.; Cevallos, C.; Deshaye, M.Y.; Dumitrica, T.; Dominguez, A.; et al. DFTB+, a software package for efficient approximate density functional theory based atomistic simulations. J. Chem. Phys. 2020, 152, 124101. [Google Scholar] [CrossRef] [PubMed]
  70. Gao, J.; Truhlar, D.G.; Wang, Y.; Mazack, M.J.; Löffler, P.; Provorse, M.R.; Rehak, P. Explicit polarization: A quantum mechanical framework for developing next generation force fields. Acc. Chem. Res. 2014, 47, 2837–2845. [Google Scholar] [CrossRef]
  71. Lindsey, R.K.; Fried, L.E.; Goldman, N. ChIMES: A Force Matched Potential with Explicit Three-Body Interactions for Molten Carbon. J. Chem. Theory Comput. 2017, 13, 6222–6229. [Google Scholar] [CrossRef] [PubMed]
  72. Lindsey, R.K.; Kroonblawd, M.P.; Fried, L.E.; Goldman, N. Computational Approaches for Chemistry Under Extreme Conditions; Springer International Publishing: Cham, Switzerland, 2019; Chapter Force Matching Approaches to Extend Density Functional Theory to Large Time and Length Scales; pp. 71–93. [Google Scholar] [CrossRef]
  73. Goldman, N.; Kweon, K.E.; Sadigh, B.; Heo, T.W.; Lindsey, R.K.; Pham, C.H.; Fried, L.E.; Aradi, B.; Holliday, K.; Jeffries, J.R.; et al. Semi-Automated Creation of Density Functional Tight Binding Models through Leveraging Chebyshev Polynomial-Based Force Fields. J. Chem. Theory Comput. 2021, 17, 4435–4448. [Google Scholar] [CrossRef]
  74. Behler, J. First Principles Neural Network Potentials for Reactive Simulations of Large Molecular and Condensed Systems. Angew. Chem. Engl. 2017, 56, 12828–12840. [Google Scholar] [CrossRef]
  75. Meuwly, M. Machine Learning for Chemical Reactions. Chem. Rev. 2021, 121, 10218–10239. [Google Scholar] [CrossRef]
  76. Zeng, J.; Giese, T.J.; Ekesan, Ş.; York, D.M. Development of Range-Corrected Deep Learning Potentials for Fast, Accurate Quantum Mechanical/Molecular Mechanical Simulations of Chemical Reactions in Solution. J. Chem. Theory Comput. 2021, 17, 6993–7009. [Google Scholar] [CrossRef]
  77. Pan, X.; Yang, J.; Van, R.; Epifanovsky, E.; Ho, J.; Huang, J.; Pu, J.; Mei, Y.; Nam, K.; Shao, Y. Machine-Learning-Assisted Free Energy Simulation of Solution-Phase and Enzyme Reactions. J. Chem. Theory Comput. 2021, 17, 5745–5758. [Google Scholar] [CrossRef]
  78. Snyder, R.; Kim, B.; Pan, X.; Shao, Y.; Pu, J. Bridging semiempirical and ab initio QM/MM potentials by Gaussian process regression and its sparse variants for free energy simulation. J. Chem. Phys. 2023, 159, 054107. [Google Scholar] [CrossRef]
  79. Zeng, J.; Tao, Y.; Giese, T.J.; York, D.M. QDπ: A Quantum Deep Potential Interaction Model for Drug Discovery. J. Chem. Theory Comput. 2023, 19, 1261–1275. [Google Scholar] [CrossRef]
  80. Zeng, J.; Tao, Y.; Giese, T.J.; York, D.M. Modern semiempirical electronic structure methods and machine learning potentials for drug discovery: Conformers, tautomers, and protonation states. J. Chem. Phys. 2023, 158, 124110. [Google Scholar] [CrossRef]
  81. Han, Y.; Wang, Z.; Wei, Z.; Liu, J.; Li, J. Machine learning builds full-QM precision protein force fields in seconds. Brief. Bioinform. 2021, 22, bbab158. [Google Scholar] [CrossRef]
  82. Zhang, Y.J.; Khorshidi, A.; Kastlunger, G.; Peterson, A.A. The potential for machine learning in hybrid QM/MM calculations. J. Chem. Phys. 2018, 148, 241740. [Google Scholar] [CrossRef]
  83. Böselt, L.; Thürlemann, M.; Riniker, S. Machine Learning in QM/MM Molecular Dynamics Simulations of Condensed- Phase Systems. J. Chem. Theory Comput. 2021, 17, 2641–2658. [Google Scholar] [CrossRef] [PubMed]
  84. Shen, L.; Wu, J.; Yang, W. Multiscale Quantum Mechanics/Molecular Mechanics Simulations with Neural Networks. J. Chem. Theory Comput. 2016, 12, 4934–4946. [Google Scholar] [CrossRef]
  85. Shen, L.; Yang, W. Molecular Dynamics Simulations with Quantum Mechanics/Molecular Mechanics and Adaptive Neural Networks. J. Chem. Theory Comput. 2018, 14, 1442–1455. [Google Scholar] [CrossRef]
  86. Ekesan, Ş.; York, D.M. Dynamical ensemble of the active state and transition state mimic for the RNA-cleaving 8-17 DNAzyme in solution. Nucleic Acids Res. 2019, 47, 10282–10295. [Google Scholar] [CrossRef]
  87. Loncharich, R.J.; Brooks, B.R.; Pastor, R.W. Langevin dynamics of peptides: The frictional dependence of isomerization rates of N-acetylalanyl-N’-methylamide. Biopolymers 1992, 32, 523–535. [Google Scholar] [CrossRef] [PubMed]
  88. Galindo-Murillo, R.; Robertson, J.C.; Zgarbova, M.; Sponer, J.; Otyepka, M.; Jurecka, P.; Cheatham, T.E., III. Assessing the current state of AMBER force field modifications for DNA. J. Chem. Theory Comput. 2016, 12, 4114–4127. [Google Scholar] [CrossRef] [PubMed]
  89. Wu, Y.; Tepper, H.L.; Voth, G.A. Flexible simple point-charge water model with improved liquid-state properties. J. Chem. Phys. 2006, 124, 024503. [Google Scholar] [CrossRef] [PubMed]
  90. Li, P.; Roberts, B.P.; Chakravorty, D.K.; Merz, K.M., Jr. Rational design of Particle Mesh Ewald compatible Lennard-Jones parameters for +2 metal cations in explicit solvent. J. Chem. Theory Comput. 2013, 9, 2733–2748. [Google Scholar] [CrossRef] [PubMed]
  91. Darden, T.; York, D.; Pedersen, L. Particle mesh Ewald: An N log(N) method for Ewald sums in large systems. J. Chem. Phys. 1993, 98, 10089–10092. [Google Scholar] [CrossRef]
  92. Essmann, U.; Perera, L.; Berkowitz, M.L.; Darden, T.; Lee, H.; Pedersen, L.G. A smooth particle mesh Ewald method. J. Chem. Phys. 1995, 103, 8577–8593. [Google Scholar] [CrossRef]
  93. Dewar, M.J.S.; Zoebisch, E.; Healy, E.F.; Stewart, J.J.P. Development and use of quantum mechanical molecular models. 76. AM1: A new general purpose quantum mechanical molecular model. J. Am. Chem. Soc. 1985, 107, 3902–3909. [Google Scholar] [CrossRef]
  94. Lopez, X.; York, D.M. Parameterization of semiempirical methods to treat nucleophilic attacks to biological phosphates: AM1/d parameters for phosphorus. Theor. Chem. Acc. 2003, 109, 149–159. [Google Scholar] [CrossRef]
  95. Nam, K.; Cui, Q.; Gao, J.; York, D.M. Specific reaction parametrization of the AM1/d Hamiltonian for phosphoryl transfer reactions: H, O, and P atoms. J. Chem. Theory Comput. 2007, 3, 486–504. [Google Scholar] [CrossRef]
  96. Nam, K.; Gao, J.; York, D.M. An efficient linear-scaling Ewald method for long-range electrostatic interactions in combined QM/MM calculations. J. Chem. Theory Comput. 2005, 1, 2–13. [Google Scholar] [CrossRef]
  97. Shirts, M.R.; Chodera, J.D. Statistically optimal analysis of samples from multiple equilibrium states. J. Chem. Phys. 2008, 129, 124105. [Google Scholar] [CrossRef] [PubMed]
  98. Giese, T.J.; Ekesan, Ş.; York, D.M. Extension of the Variational Free Energy Profile and Multistate Bennett Acceptance Ratio Methods for High-Dimensional Potential of Mean Force Profile Analysis. J. Phys. Chem. A 2021, 125, 4216–4232. [Google Scholar] [CrossRef]
  99. Giese, T.J.; York, D.M. FE-ToolKit: The Free Energy Analysis Toolkit. Available online: https://gitlab.com/RutgersLBSR/fe-toolkit (accessed on 3 March 2024).
  100. Case, D.A.; Aktulga, H.M.; Belfon, K.; Cerutti, D.S.; Cisneros, G.A.; Cruzeiro, V.W.D.; Forouzesh, N.; Giese, T.J.; Götz, A.W.; Gohlke, H.; et al. AmberTools. J. Chem. Inf. Model. 2023, 63, 6183–6191. [Google Scholar] [CrossRef]
  101. Ceriotti, M.; More, J.; Manolopoulos, D.E. i-PI: A Python interface for ab initio path integral molecular dynamics simulations. Comput. Phys. Commun. 2014, 185, 1019–1026. [Google Scholar] [CrossRef]
  102. Ceriotti, M.; Manolopoulos, D.E. Efficient first-principles calculation of the quantum kinetic energy and momentum distribution of nuclei. Phys. Rev. Lett. 2012, 109, 100604. [Google Scholar] [CrossRef]
  103. Ceriotti, M.; Markland, T.E. Efficient methods and practical guidelines for simulating isotope effects. J. Chem. Phys. 2013, 138, 014112. [Google Scholar] [CrossRef]
  104. Ceriotti, M.; Manolopoulos, D.E.; Parrinello, M. Accelerating the convergence of path integral dynamics with a generalized Langevin equation. J. Chem. Phys. 2011, 134, 084104. [Google Scholar] [CrossRef]
  105. Ceriotti, M.; Bussi, G.; Parrinello, M. Colored-Noise Thermostats à la Carte. J. Chem. Theory Comput. 2010, 6, 1170–1180. [Google Scholar] [CrossRef]
  106. Ceriotti, M.; Bussi, G.; Parrinello, M. Langevin Equation with Colored Noise for Constant-Temperature Molecular Dynamics Simulations. Phys. Rev. Lett. 2009, 102, 020601. [Google Scholar] [CrossRef]
  107. Bonomi, M.; Branduardi, D.; Bussi, G.; Camilloni, C.; Provasi, D.; Raiteri, P.; Donadio, D.; Marinelli, F.; Pietrucci, F.; Broglia, R.; et al. PLUMED: A portable plugin for free energy calculations with molecular dynamics. Comput. Phys. Commun. 2009, 180, 1961–1972. [Google Scholar] [CrossRef]
  108. Tribello, G.A.; Bonomi, M.; Branduardi, D.; Camilloni, C.; Bussi, G. PLUMED2: New feathers for an old bird. Comput. Phys. Commun. 2014, 185, 604–613. [Google Scholar] [CrossRef]
  109. McCarthy, E.; Ekesan, Ş.; Giese, T.J.; Wilson, T.J.; Deng, J.; Huang, L.; Lilley, D.M.J.; York, D.M. Catalytic mechanism and pH dependence of a methyltransferase ribozyme (MTR1) from computational enzymology. Nucleic Acids Res. 2023, 51, 4508–4518. [Google Scholar] [CrossRef] [PubMed]
  110. Chipot, C.; Pohorille, A. (Eds.) Free Energy Calculations: Theory and Applications in Chemistry and Biology; Springer Series in Chemical Physics; Springer: New York, NY, USA, 2007; Volume 86. [Google Scholar]
  111. Zeng, J.; Zhang, D.; Lu, D.; Mo, P.; Li, Z.; Chen, Y.; Rynik, M.; Huang, L.; Li, Z.; Shi, S.; et al. DeePMD-kit v2: A software package for deep potential models. J. Chem. Phys. 2023, 159, 054801. [Google Scholar] [CrossRef]
  112. Zhang, Y.; Wang, H.; Chen, W.; Zeng, J.; Zhang, L.; Han, W.; E, W. DP-GEN: A concurrent learning platform for the generation of reliable deep learning based potential energy models. Comput. Phys. Commun. 2020, 253, 107206. [Google Scholar] [CrossRef]
  113. Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2017, arXiv:1412.6980. [Google Scholar]
  114. Stanzione, D.; West, J.; Evans, R.T.; Minyard, T.; Ghattas, O.; Panda, D.K. Frontera: The Evolution of Leadership Computing at the National Science Foundation. In PEARC’20: Practice and Experience in Advanced Research Computing; Association for Computing Machinery: New York, NY, USA, 2020; pp. 106–111. [Google Scholar]
  115. Boerner, T.J.; Deems, S.; Furlani, T.R.; Knuth, S.L.; Towns, J. ACCESS: Advancing Innovation: NSF’s Advanced Cyberinfrastructure Coordination Ecosystem: Services & Supports. In PEARC’23: Practice and Experience in Advanced Research Computing; Romanella, R.S.A., Knuth, S., Hackworth, K., Pummill, J., Eds.; Association for Computing Machinery: New York, NY, USA, 2023; pp. 173–176. [Google Scholar]
Figure 1. (a) B-DNA secondary structure with 2 G-T wobble mispairs highlighted in green; (b) B-DNA structure examined in this work prepared from the crystal structure [27] (PDB ID: 113D); (c) structure of the G-T wobble mispair residue; (d) schematic of the general reaction.
Figure 1. (a) B-DNA secondary structure with 2 G-T wobble mispairs highlighted in green; (b) B-DNA structure examined in this work prepared from the crystal structure [27] (PDB ID: 113D); (c) structure of the G-T wobble mispair residue; (d) schematic of the general reaction.
Molecules 29 02703 g001
Figure 2. Classical free energy profiles for guanine–thymine mispair tautomerization reactions using different ab initio and semiempirical QM/MM and QM/MM- Δ MLP models described in the text: (a) wGT→GT*; (b) wGT→G*T; (c) GT*→G*T. Here “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard (enol) tautomer states of G and T, respectively, as indicated in the figure and described in the text.
Figure 2. Classical free energy profiles for guanine–thymine mispair tautomerization reactions using different ab initio and semiempirical QM/MM and QM/MM- Δ MLP models described in the text: (a) wGT→GT*; (b) wGT→G*T; (c) GT*→G*T. Here “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard (enol) tautomer states of G and T, respectively, as indicated in the figure and described in the text.
Molecules 29 02703 g002
Figure 3. Schematic of guanine–thymine mispair tautomerization reaction mechanisms using QM/MM- Δ MLP models described in the text: (a) wGT→GT*; (b) wGT→G*T; (c) GT*→G*T. Here, “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard (enol) tautomer states of G and T, respectively. Blue arrows represent electron transfer direction, and red motion arrows represent shearing direction of the base pairs [39].
Figure 3. Schematic of guanine–thymine mispair tautomerization reaction mechanisms using QM/MM- Δ MLP models described in the text: (a) wGT→GT*; (b) wGT→G*T; (c) GT*→G*T. Here, “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard (enol) tautomer states of G and T, respectively. Blue arrows represent electron transfer direction, and red motion arrows represent shearing direction of the base pairs [39].
Molecules 29 02703 g003
Figure 4. Representative nucleobase structures in guanine–thymine mispair tautomerization reactions using QM/MM- Δ MLP models described in the text and illustrated schematically in Figure 3. Here, “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard (enol) tautomer states of G and T, respectively. TSa, TSb1, TSb2, and TSc are the transition states, and Ib is an intermediate state presented in Figure 2 and Figure 3. The structures were obtained by averaging the trajectory by Cpptraj [35], and the unit of distances shown here is Å. Other parts of the system, including sugar and phosphate groups, are not shown.
Figure 4. Representative nucleobase structures in guanine–thymine mispair tautomerization reactions using QM/MM- Δ MLP models described in the text and illustrated schematically in Figure 3. Here, “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard (enol) tautomer states of G and T, respectively. TSa, TSb1, TSb2, and TSc are the transition states, and Ib is an intermediate state presented in Figure 2 and Figure 3. The structures were obtained by averaging the trajectory by Cpptraj [35], and the unit of distances shown here is Å. Other parts of the system, including sugar and phosphate groups, are not shown.
Molecules 29 02703 g004
Figure 5. Classical free energy profiles for guanine–thymine mispair tautomerization reactions using ab initio QM/MM plotted with independent trials and analyzed with different 2.5 ps time blocks: (a) wGT→GT*; (b) wGT→G*T; (c) GT*→G*T. Here, “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard (enol) tautomer states of G and T, respectively, as indicated in the figure and described in the text. Nonoverlapping 2.5 ps time blocks were constructed by uniformly dividing the 4 independent trials of 10 ps final path refinement sampling (40 ps cumulatively) into 16 2.5-ps blocks that were each analyzed independently. Different colors in the plots refer to different trials, and different line styles refer to different blocks. The ranges of rate-controlling transition state (TS) free energy barriers are given below the TS label.
Figure 5. Classical free energy profiles for guanine–thymine mispair tautomerization reactions using ab initio QM/MM plotted with independent trials and analyzed with different 2.5 ps time blocks: (a) wGT→GT*; (b) wGT→G*T; (c) GT*→G*T. Here, “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard (enol) tautomer states of G and T, respectively, as indicated in the figure and described in the text. Nonoverlapping 2.5 ps time blocks were constructed by uniformly dividing the 4 independent trials of 10 ps final path refinement sampling (40 ps cumulatively) into 16 2.5-ps blocks that were each analyzed independently. Different colors in the plots refer to different trials, and different line styles refer to different blocks. The ranges of rate-controlling transition state (TS) free energy barriers are given below the TS label.
Molecules 29 02703 g005
Figure 6. Classical and path integral molecular dynamics free energy profiles for guanine–thymine mispair tautomerization reactions using QM/MM- Δ MLP models described in the text: (a) wGT→GT*; (b) wGT→G*T; (c) GT*→G*T. Here, “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard (enol) tautomer states of G and T, respectively. Profiles from classical and PIMD are shown as black and red lines, respectively.
Figure 6. Classical and path integral molecular dynamics free energy profiles for guanine–thymine mispair tautomerization reactions using QM/MM- Δ MLP models described in the text: (a) wGT→GT*; (b) wGT→G*T; (c) GT*→G*T. Here, “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard (enol) tautomer states of G and T, respectively. Profiles from classical and PIMD are shown as black and red lines, respectively.
Molecules 29 02703 g006
Figure 7. The atom distances and the proton ring polymer degree of expansion ( Δ | r H | , Equation (1)) for guanine–thymine mispair tautomerization reactions using QM/MM- Δ MLP models described in the text: (a) wGT→GT* (a,d); (b) wGT→G*T (b,e); (c) GT*→G*T (c,f). Here, “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard tautomer states of G and T, respectively. The proton ring polymer degree of expansion and the atom distances are shown as top and bottom panels, respectively. The notations of atoms are the same as in Figure 3.
Figure 7. The atom distances and the proton ring polymer degree of expansion ( Δ | r H | , Equation (1)) for guanine–thymine mispair tautomerization reactions using QM/MM- Δ MLP models described in the text: (a) wGT→GT* (a,d); (b) wGT→G*T (b,e); (c) GT*→G*T (c,f). Here, “wGT” indicates a G-T wobble pair, and G* and T* are nonstandard tautomer states of G and T, respectively. The proton ring polymer degree of expansion and the atom distances are shown as top and bottom panels, respectively. The notations of atoms are the same as in Figure 3.
Molecules 29 02703 g007
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tao, Y.; Giese, T.J.; York, D.M. Electronic and Nuclear Quantum Effects on Proton Transfer Reactions of Guanine–Thymine (G-T) Mispairs Using Combined Quantum Mechanical/Molecular Mechanical and Machine Learning Potentials. Molecules 2024, 29, 2703. https://doi.org/10.3390/molecules29112703

AMA Style

Tao Y, Giese TJ, York DM. Electronic and Nuclear Quantum Effects on Proton Transfer Reactions of Guanine–Thymine (G-T) Mispairs Using Combined Quantum Mechanical/Molecular Mechanical and Machine Learning Potentials. Molecules. 2024; 29(11):2703. https://doi.org/10.3390/molecules29112703

Chicago/Turabian Style

Tao, Yujun, Timothy J. Giese, and Darrin M. York. 2024. "Electronic and Nuclear Quantum Effects on Proton Transfer Reactions of Guanine–Thymine (G-T) Mispairs Using Combined Quantum Mechanical/Molecular Mechanical and Machine Learning Potentials" Molecules 29, no. 11: 2703. https://doi.org/10.3390/molecules29112703

APA Style

Tao, Y., Giese, T. J., & York, D. M. (2024). Electronic and Nuclear Quantum Effects on Proton Transfer Reactions of Guanine–Thymine (G-T) Mispairs Using Combined Quantum Mechanical/Molecular Mechanical and Machine Learning Potentials. Molecules, 29(11), 2703. https://doi.org/10.3390/molecules29112703

Article Metrics

Back to TopTop