Next Article in Journal
Structure of the Lipooligosaccharide from the Deep-Sea Marine Bacterium Idiomarina zobellii KMM 231T, Isolated at a Depth of 4000 Meters
Previous Article in Journal
Noonindoles A–F: Rare Indole Diterpene Amino Acid Conjugates from a Marine-Derived Fungus, Aspergillus noonimiae CMB-M0339
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

May the Force (Field) Be with You: On the Importance of Conformational Searches in the Prediction of NMR Chemical Shifts

by
Cristina Cuadrado
1,
Antonio Hernández Daranas
1,* and
Ariel M. Sarotti
2,*
1
Instituto de Productos Naturales y Agrobiología, Consejo Superior de Investigaciones Científicas (IPNA-CSIC), 38206 La Laguna, Tenerife, Spain
2
Instituto de Química Rosario (CONICET), Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Suipacha 531, Rosario 2000, Argentina
*
Authors to whom correspondence should be addressed.
Mar. Drugs 2022, 20(11), 699; https://doi.org/10.3390/md20110699
Submission received: 18 October 2022 / Revised: 29 October 2022 / Accepted: 4 November 2022 / Published: 8 November 2022
(This article belongs to the Section Structural Studies on Marine Natural Products)

Abstract

:
NMR data prediction is increasingly important in structure elucidation. The impact of force field selection was assessed, along with geometry and energy cutoffs. Based on the conclusions, we propose a new approach named mix-J-DP4, which provides a remarkable increase in the confidence level of complex stereochemical assignments—100% in our molecular test set—with a very modest increment in computational cost.

1. Introduction

Nuclear magnetic resonance (NMR) is the most commonly used spectroscopic technique for elucidating the structure of small organic compounds [1]. However, the intrinsic complexity of some molecular architectures makes this task problematic [2,3]. Over the past decade, the increasing ease of use and success of computational methods have popularized their use in structural elucidation tasks, extending their use to increasingly challenging molecules [4,5,6,7]. These methods generally rely on the use of NMR data calculated at DFT levels, followed by refined strategies to correlate them with experimental information [8,9,10]. Among them, Bayesian analysis (DP4, DP4+, and J-DP4) [11,12,13], parametric corrections (DU8+) [14], or conformational selection (CASE-3D) [15] methods stand out. More recently, new approaches based on machine learning have been developed to speed up the NMR calculation stage, including IMPRESSION, CASCADE, and ML-J-DP4 [16,17,18].
A common, and usually overlooked, step in all of these methods comprises the molecular mechanics (MM) conformational search that generates conformer ensembles for each candidate structure. MM-generated structures can be either used directly or after a DFT optimization step as inputs for NMR chemical shifts and J couplings calculations. However, it is important to note that optimization at high levels of theory (DFT) implies large computational times for complex structures with a high number of conformations or configurations [6,7,19]. Molecular mechanics force fields (FF) have undergone a significant evolution over the years, including different parametrizations of the explicit functions used to define the global energy of the system [20]. The MMFF force field has been proposed for performing the conformational searches in some of the most common metrics used for structure prediction, including DP4 and J-DP4 [11,13]. However, in a substantial number of publications, the conformational sampling has been carried out with other FFs or is simply not described [21,22,23,24,25,26,27]. This suggests that many users (and reviewers) do not give enough importance to this key step. The most obvious consequence of using a different FF is the conformational landscape that results from the different number and types of conformations found within a given cutoff value. This can be critical because missing important conformations may lead to erroneous results. In addition, the effect of subtle geometric differences in the predicted NMR parameters is unclear but potentially very important for the quality of the predictions.

2. Results and Discussion

In this report, we evaluated the impact of FF selection in the prediction of NMR data for structural elucidation within the DP4 [11] and J-DP4 [13] formalisms, using a test set of 33 challenging examples (see Supplementary Material). They showed a wide diversity of structures, conformational freedom, and many possible stereoisomers (Figure 1). The choice to confine the analysis to DP4/J-DP4 was based on the fact that both use molecular coordinates taken directly from the FF-optimized geometries for the subsequent DFT computations of NMR parameters and energies, and, therefore, a significant dependence on the FF selection should be expected. Other popular metrics that require prior geometry optimizations at DFT levels (for example, DP4+) are much less sensitive to the force field employed during the conformational sampling and were not considered in this study. Another relevant issue is that DP4 delivers the probability of a candidate structure being correct based on the analysis of errors of calculated versus experimental chemical shifts. The J-DP4 approach adds geometric (angular) information, obtained from measurements of 3JHH values, into the DP4 formalism. 3JHH values can be used either directly in dJ-DP4 by including an additional probability term into a DP4-like equation or indirectly in the iJ-DP4 approach, using 3JHH to restrict the results of the conformational search. However, their combination (iJ/dJ-DP4) gives the best results.
In this study, we chose five representative and commonly used FFs: AMBER (A) and MM3 (B) [28,29], as they are good examples of “classic” FFs; MMFF (C), because DP4 and J-DP4 were developed using it [30]; and OPLS2005 (D) and OPLS4 (E), as they are two representative examples of modern FFs [31,32]. We also studied the effect of two important yet underestimated parameters that have a strong influence on the conformers ensemble composition, namely, the energy cutoff value and the geometric criterion to eliminate redundant conformers. Regarding the first, a value of 10 kJ/mol has been recommended as a tradeoff between CPU time and the risk of missing important conformations due to the inaccurate prediction of energies [11,12]. However, not only are a wide range of values reported (between 5 and 40 kJ/mol), but in a large number of papers, that important criterion is not described [23,24,33,34]. This trend is accentuated with the second parameter, as the criterion for defining duplicates is mostly unreported but deeply impacts the number of final conformations used for the DFT analysis. The most common ones are RMSD (root mean square deviations) and MAD (maximum atom deviation).
To understand the effect of FF selection on the classification outcome, we first evaluated DP4 and J-DP4, keeping all the resulting conformations within a 21 kJ/mol energy threshold with a maximum atom deviation (MAD) of 0.5 Å as represented in Figure 2. To simplify further discussions, this selection will be referred to as 21 kJ/MAD 05. Applying DP4 probability, which only uses information from NMR chemical shifts, MMFF showed the best results, with a 74.4% overall success, while, surprisingly, the modern OPLS4 force field showed the worst results, with a 68.4% overall success. These outcomes were improved by including 3JHH information in the dJ-DP4 approach, which showed the best results for MM3, with 88.9%. OPLS4 followed, with 79.9%. Finally, as expected, incorporating 3JHH values as described for the iJ/dJ-DP4 formalism yielded the best results, with AMBER and OPLS4 affording the best (95.0%) and worst (87.6%) performances, respectively.
Although the averaged performances between the different FFs are rather similar, it is important to highlight that the results obtained for certain molecules strongly depend on the FF employed. Two different behaviors are noticed regarding the influence of the FF on the result, namely, robustness and sensitiveness. The former comprises those molecular systems which lead to consistent DP4 results, regardless of the FF used (for example, compounds 13, 5, and 1921). The second type includes structures showing significant differences in the fitness of the assignment, according to the FF (for example, compounds 6, 8, 11, and 16). In analyzing the structures within the test set, it is not easy to anticipate which ones will be more or less sensitive. This can be clarified by comparing the dissimilar performances of diastereoisomeric pairs. For example, compounds 7 and 15 are much less sensitive toward the FF than their corresponding diastereoisomers 6 and 16, respectively. This suggests that the sensitivity cannot be linked to any structural property. Moreover, in the case of sensitive systems, it is also difficult to anticipate which FF will be optimal. For example, at level D, compound 6 is correctly elucidated, whereas compound 16 is misassigned, but the opposite trend is observed when using the geometries optimized at level E. Considering that both DP4 and J-DP4 feed on chemical shifts calculated from FF-minimized structures, the previous results indicate that subtle structural differences (consequences of the parametrization of each FF) have an impact on the calculated chemical shifts and, hence, on the resulting DP4-like probabilities.
Being aware of this uncertainty, we tried to dissect the effects arising from geometric structural differences from those derived from inaccuracies in energy calculations. For the first issue, we chose strychnine as a representative example of a rigid molecule where the results are independent of the energy factor (see below), in addition to the fact that it has a wide variety of chemical environments. Figure 3 displays the overlayed global minima of the correct isomer of the strychnine optimized at the five FF studied herein. All structures show a close similarity, though some small differences may be encountered, mainly in the torsion angles and the out-of-plane torsion of the amide group. However, important differences were found in the corresponding isotropic shielding values (σ) calculated at the B3LYP/6-31G** level. The difference between the maximum and minimum 13C σ (MaxΔσ) ranged from 1.5 ppm to 10.6 ppm (average 4 ppm), depending on the nucleus, whereas, for 1H, the MaxΔσ ranged from 0.25 to 0.77 ppm (average 0.54 ppm). Such large variations affect the scaled chemical shifts, with MaxΔδ values ranging from 0.8 to 8.4 ppm, with an average of 3.2 ppm, and from 0.07 to 0.42 ppm, with an average of 0.18 ppm, for the 13C and 1H data, respectively. In an effort to understand the origin of this observation, we analyzed all bond distances in strychnine isomers.
For each bond, we found that the maximum differences between the geometries provided by the FFs under study ranged between 0.006 and 0.06 Å (average 0.02 Å). Similar observations were made for other systems under study, indicating that small variations in the interatomic distances and angles have a great impact on the calculated isotropic shielding values and therefore on the scaled chemical shifts. Naturally, this affects the errors (differences in the calculated and experimental data) used to calculate the DP4 and J-DP4 probabilities. For instance, the correct isomer of strychnine showed 1H CMAE and MaxErr values ranging from 0.08 and 0.19 (OPLS4) to 0.14 and 0.36 ppm (OPLS2005), along with 13C CMAE and MaxErr values from 1.22 and 3.88 (MM3) to 1.86 and 8.43 ppm (OPLS4). In the particular case of strychnine, this heterogeneity of NMR predictions does not affect the overall evidence for the assignment (both DP4 and J-DP4 values are >99.9% for all FFs). However, it is clear that, in other less straightforward systems, wider variations in the DP4 values should be expected.
We next explored the influence of the FFs geometries on the relative B3LYP/6-31G** SCF energies used to calculate the Boltzmann amplitudes in flexible systems. We demonstrated that the conformational landscapes can be strongly influenced by the FF employed in the conformational search. This finding can be illustrated with two case studies (compounds 9 and 27). For all possible isomers (eight in each case), a conformational sampling was performed at the MMFF level (21 kJ/MAD 05). To keep the number and type of conformations constant, the full set of conformers found (6301 for 9 and 6863 for 27) were re-optimized at the MM3, AMBER, OPLS2005, and OPLS4 levels. The DFT energies of all molecule shapes were calculated at the B3LYP/6-31G** level, and the Boltzmann amplitudes (wi) were estimated at 25 °C. As shown in the SI, a remarkable discrepancy was observed for the contributing structures. The R2 values obtained by correlating the five sets of wi values ranged between 0.12 and 0.57 and between <0.01 and 0.30 for compounds 9 and 27, respectively. This becomes evident in comparing the Boltzmann’s amplitudes of the global minima found for 27 at each level of theory with the values computed for those shapes at the remaining four levels (Figure 4). For example, the most stable conformer located at the B3LYP/6-31G**//AMBER level contributes less than 0.1% according to the other levels of theory.
According to the previous results, it is clear that subtle geometric differences in the MM-optimized geometries have a strong impact on both the NMR shielding constants and the Boltzmann amplitudes, with unpredictable effects on the calculated chemical shifts. The averaged CMAE calculated for the right isomers of the test set ranged from 0.12 ppm (MMFF) to 0.15 ppm (AMBER) for the 1H data and from 1.4 ppm (OPLS2005) to 1.7 ppm (MM3) for the 13C data, indicating that, on average, all FFs show a similar behavior in terms of accuracy (see SI). However, after analyzing each system individually, notable differences appear. For example, in the case of 17, the 1H CMAE calculated for OPLS2005 was 0.53 ppm, which is 4.6 times larger than that computed for AMBER (0.12 ppm), but for compound 7, OPLS2005 gave the best 1H prediction (CMAE 0.07 vs. 0.11–0.17 ppm computed for other levels). The same occurred in analyzing carbon chemical shifts. With AMBER, compound 14 shows a clearly smaller 13C MAE (up to 3.5 times) than the other FFs, but for the other 13 systems, the former FF was the least accurate. The difficulty in anticipating the optimal FF in terms of NMR accuracy for a given system is complemented by the relative nature of DP4, meaning that a proper assignment will be made as long as the errors computed for the right isomer are smaller than those of the other candidates. Despite the FF affording a good NMR prediction for the right isomer, it cannot be ruled out that the errors are even smaller for other candidates, hence leading to a wrong assignment.
The previous results also illustrate how another two factors clearly influence the performance of the calculations: (i) the selected FF energy cutoff and (ii) the geometric method used to eliminate the redundant conformers obtained in the search. The first parameter may have a deep impact on the results, since the final outcomes can be correct only if the appropriate conformations are included in the pool used for the DFT calculations [35]. This suggests that using high values could be the harmless option to avoid incorrect energy evaluation by the selected FF and the subsequent erroneous trimming of feasible conformations. On the other hand, the lower the energy cutoff, the smaller the number of structures used in the calculations and the required computational time (the average number was reduced by 50% by passing from 21 kJ/mol to a 12 kJ/mol cutoff). Another important issue that is often overlooked is the method used to eliminate redundant conformer geometries that otherwise would undergo DFT calculations. Therefore, we compared the use of RMSD versus MAD algorithms using two different cutoffs (0.5 Å and 1.0 Å), which, coupled with the two energy cutoffs (12 kJ/mol and 21 kJ/mol), yielded six combinations (Figure 5). In each case, the best results were obtained with iJ/dJ-DP4, suggesting that the superior performance of the last is independent of the conformational search setup. No important differences in the effect of the energy cutoff values were apparent under the tested conditions. Using MAD-0.5 Å and a 21 kJ/mol cutoff, the average performances (among the five FFs evaluated) were 73%, 87%, and 92% for DP4, dJ-DP4, and iJ/dJ-DP4, respectively. These are similar to the values of 72%, 88%, and 91%, respectively, obtained with a 12 kJ/mol cutoff. A similar trend was observed using RMSD-0.5, with 66%, 84%, and 87% scores for 21 kJ/mol and 65%, 85%, and 89% scores for 12 kJ/mol. However, it is worth noting that, although both energy cutoffs provided statistically equivalent performances, there were a few intriguing examples where the performance significantly changed (see SI), making it difficult to draw a general conclusion. It was also evident that each FF generates a different number of conformations for the study. As a general trend, AMBER returned the largest number of structures, and OPLS2005 returned the least. The MAD method provided consistently better results when selected for eliminating redundant conformers than those using RMSD. The best overall results were obtained with MAD 0.5 Å (91% average for five FFs), whereas RMSD 1 Å gave the poorest (82% average for 5 FFs). We consider this a consequence of the loss of structural diversity exerted by RSMD. By imposing more stringent criteria to declare non-equivalency, it causes the loss of similar (yet not identical) structures. This could mask subtle conformational differences that negatively impact the quality of the results. Thus, based on the results, and considering both the classification performance and the computational cost, for practical applications, we recommend using MAD-05 Å and 12 kJ/mol cutoffs under iJ/dJ-DP4 formalism.
Within these boundaries, AMBER, MMFF, and MM3 performed better (95%, 93%, and 92%, respectively) than the modern OPLS2005 (89%) and OPLS4 (88%). Although no single FF afforded optimal classification performance, it was interesting to note that the misassigned cases were very sensitive to the FF employed. This led us to propose a new method based on the combined results from several FFs. Figure 6a shows the workflow of the so-called mix-J-DP4 approach. It involves the parallel computing of iJ/dJ-DP4 probabilities with AMBER, MMFF, and MM3 geometries, using the optimal selection criteria discussed above. The three independent results are then averaged into a single probability. This is in line with recent recommendations of not relying on a single metric but on the results of different calculations in order to reinforce the confidence in the assignment [36]. Although this methodology increases the overall computational cost (relative to a single iJ/dJ-DP4 calculation), it is still faster than a standard DP4 procedure. In addition, it provides both an excellent classification performance (100% in our test set, Figure 6b) and a strong backup to the results.
The impact of the previous approach can be illustrated in the stereochemical assignment of zoarenone (34) and zoaramine (35) [37], two complex metabolites (Figure 7) belonging to the family of norzoanthamine, [38] a group of bioactive alkaloids isolated from the marine cnidiaria Zoanthus pulchelus [39,40].
Zoarenone (34) is a rigid tetracyclic compound comprising seven asymmetric carbons. Following the mix-J-DP4 procedure, the corresponding 64 diastereoisomers were built in silico, and the conformational searches and energy minimizations were performed with the AMBER, MM3, and MMFF force fields using MAD-05 Å and 12 kJ/mol cutoffs. After 3JHH filtering using the H13-H18 coupling, the number of suitable structures was reduced to almost half. The three independent J-DP4 calculations strongly supported the right isomer in high probability (>99%), despite the C9 epimer (9R*) showing the best probability using only H data (>90%). This reinforces the general recommendation that, whenever possible, all types of data should be employed to obtain meaningful results. In addition, the fact that the three FFs gave similar J-DP4 results grants a much stronger confidence in the assignment than what would have been achieved through the traditional procedure of using only one FF. As shown in Figure 8, a great similarity is observed in the global minima structures found for each force field, which would explain the homogeneity of the results found in this case study.
However, in the case of zoaramine (35), divergent J-DP4 results were obtained when using different force fields. Considering the structural similarity between 35 and 34 (the former incorporates a flexible seven-membered ring into the backbone of 34), this emphasizes the unpredictability of the optimal force field for a given molecule. On this occasion, AMBER failed, giving a <0.1% probability to the right isomer, while MM3 and MMFF succeeded, obtaining a >99.9% probability for it. Therefore, combining the three results, a mean value of 66.6% probability for the right isomer would be obtained, giving more confidence to the assignment. This time, MM3 gave the right assignment for all calculated parameters (H-DP4: 99%, C-DP4: 78%, and J-DP4: 86%), while MMFF failed in its carbon prediction (H-DP4: >99.9%, C-DP4: 2%, and J-DP4: 99%). The underperformance of MMFF in C-DP4 is associated with the poorest prediction of 13C chemical shifts (CMAE of 3.99 ppm vs. 2.88 ppm for MMFF and MM3 geometries, respectively). On the other hand, using AMBER geometries, the C2 epimer of 35 was identified as the most likely structure, both according to H-DP4 and C-DP4, which is a consequence of the best overall fitting between the experimental and calculated data (C-CMAE 2.44 ppm; H-CMAE 0.17 ppm). Interestingly, in accordance with our previous observations for compounds 9 and 27 (Figure 4), the energy landscapes computed for 35 show substantial differences for each FF (Figure 9). Thus, while B3LYP/6-31G**//MMFF selected conformers c3 and c4 with 69% and 31% Boltzmann amplitudes, the B3LYP/6-31G**//MM3 level used four conformers (c10: 42%, c14: 21%, c11: 17%, and c17: 10%) to describe the conformational behavior of the right isomer. On the contrary, B3LYP/6-31G**//AMBER chose different conformations (c13: 85% and c5: 11%) for the right isomer and c144: 64% and c145: 36% for the incorrectly selected 2S* epimer. This could be the main reason behind AMBER´s poor performance in this particular case.

3. Materials and Methods

Conformational searches were carried out using the Conformational Search protocol in the gas phase using the MMFF, and the found conformers were re-optimized with the AMBER, MM3, OPLS2005, and OPLS4 force fields, with 21 kJ/mol as the energy cutoff and a MAD of 0.5 Å. For each FF, an energy cutoff was carried out with 21 and 12 kJ/mol, using MAD (0.5 and 1 Å) or RMSD (0.5 and 1 Å). These conformers were subjected to gauge including atomic orbitals (GIAO) NMR calculations at DFT levels (B3LYP/6-31G** levels of theory as recommended for DP4 and J-DP4) using Gaussian 09. The J calculations were carried out at the same level of theory considering only the Fermi contact term and were scaled as recommended in J-DP4. For iJ-DP4 calculations, the constrained search was performed by keeping only the suitable conformations (that is, those conformations showing dihedral angles in agreement with the experimental 3JHH values).
Unscaled chemical shifts (δu) were calculated using TMS as a reference standard according to the following expression: δu = σ0 − σx, where σx is the Boltzmann-averaged shielding tensor (over all significantly populated conformations) and σ0 is the shielding tensor of the TMS computed at the same level of theory used to calculate σx. Boltzmann averaging was performed according to Equation (1):
σ x = i σ i x e E i RT i e E i RT
where σix is the shielding constant for the nucleus x in the conformer i, R is the molar gas constant (8.3145 J/K mol), T is the temperature used for the calculations (298 K), and Ei is the relative energy of the conformer i (to the lowest energy conformer) obtained from a single-point NMR calculation at the corresponding level of theory. The scaled chemical shifts (δs) were computed as δs = (δu − b)/m, where m and b are the slope and intercept, respectively, resulting from a linear regression calculation on a plot of δu against δexp. The J-DP4 calculations were carried out using a home-made Matlab script using the statistical parameters reported in the original paper.

4. Conclusions

In summary, by combining the iJ/dJ-DP4 results obtained with three different force fields (AMBER, MMFF, and MM3), we have developed a new methodology that allows for the assignment of relative configurations in complex molecules with a high certainty. The modest increment in the overall computational cost is fully justified by the remarkable increase in the confidence in the assignments. The mix-J-DP4 can be easily computed using the Excel spreadsheet previously reported for J-DP4.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/md20110699/s1, Computational details, structures and cartesian coordinates, calculated NMR data, and results for DP4 and J-DP4 methods (PDF).

Author Contributions

C.C.: methodology, validation, data analysis. A.H.D.: conceptualization, writing, data analysis. A.M.S.: conceptualization, writing, data analysis. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by UNR (BIO 500 and 567), ANPCyT (PICT-2016-0116, PICT-2017-1524, and PICT-2019-4052), CONICET (PIP 11220200102205CO), MICINN (PID2019-109476RB-C21), and ACIISI-FEDER (ProID2021010118). C. Cuadrado thanks ACIISI and FSE (Programa Operativo Integrado de Canarias 2014–2020, Eje 3, Tema Prioritario 74–85%) for a predoctoral fellowship.

Data Availability Statement

Not applicable.

Acknowledgments

This study made use of Galicia Supercomputing Center (CESGA) facilities as provided by CSIC. The text was revised by G. Jones, who thanks TF INNOVA 2016-2021 for the funding. We thank ALGF and DCG for the assistance with the graphical abstract.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of the data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Modern NMR Approaches to the Structure Elucidation of Natural Products; Rovnyak, D.; Williams, A.J.; Martin, G.E. (Eds.) Royal Society of Chemistry: Cambridge, UK, 2017. [Google Scholar] [CrossRef]
  2. Chhetri, B.K.; Lavoie, S.; Sweeney-Jones, A.M.; Kubanek, J. Recent trends in the structural revision of natural products. Nat. Prod. Rep. 2018, 35, 514–531. [Google Scholar] [CrossRef] [PubMed]
  3. Kutateladze, A.G.; Holt, T. Structure Validation of Complex Natural Products: Time to Change the Paradigm. What did Synthesis of Alstofolinine A Prove? J. Org. Chem. 2019, 84, 8297–8299. [Google Scholar] [CrossRef] [PubMed]
  4. Domínguez, H.J.; Napolitano, J.G.; Fernández-Sánchez, M.T.; Cabrera-García, D.; Novelli, A.; Norte, M.; Fernández, J.J.; Daranas, A.H. Belizentrin, a Highly Bioactive Macrocycle from the Dinoflagellate Prorocentrum belizeanum. Org. Lett. 2014, 16, 4546–4549. [Google Scholar] [CrossRef] [PubMed]
  5. Anderl, F.; Größl, S.; Wirtz, C.; Fürstner, A. Total Synthesis of Belizentrin Methyl Ester: Report on a Likely Conquest. Angew. Chem. Int. Ed. 2018, 57, 10712–10717. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Daranas, A.H.; Sarotti, A.M. Are Computational Methods Useful for Structure Elucidation of Large and Flexible Molecules? Belizentrin as a Case Study. Org. Lett. 2021, 23, 503–507. [Google Scholar] [CrossRef] [PubMed]
  7. Domínguez, H.J.; Cabrera-García, D.; Cuadrado, C.; Novelli, A.; Fernández-Sánchez, M.T.; Fernández, J.J.; Daranas, A.H. Prorocentroic Acid, a Neuroactive Super-Carbon-Chain Compound from the Dinoflagellate Prorocentrum hoffmannianum. Org. Lett. 2021, 23, 13–18. [Google Scholar] [CrossRef]
  8. Bifulco, G.; Dambruoso, P.; Gomez-Paloma, L.; Riccio, R. Determination of Relative Configuration in Organic Compounds by NMR Spectroscopy and Computational Methods. Chem. Rev. 2007, 107, 3744–3779. [Google Scholar] [CrossRef]
  9. Lodewyk, M.W.; Siebert, M.R.; Tantillo, D.J. Computational Prediction of 1H and 13C Chemical Shifts: A Useful Tool for Natural Product, Mechanistic, and Synthetic Organic Chemistry. Chem. Rev. 2012, 112, 1839–1862. [Google Scholar] [CrossRef]
  10. Marcarino, M.O.; Zanardi, M.M.; Cicetti, S.; Sarotti, A.M. NMR Calculations with Quantum Methods: Development of New Tools for Structural Elucidation and Beyond. Acc. Chem. Res. 2020, 53, 1922–1932. [Google Scholar] [CrossRef]
  11. Smith, S.G.; Goodman, J.M. Assigning Stereochemistry to Single Diastereoisomers by GIAO NMR Calculation: The DP4 Probability. J. Am. Chem. Soc. 2010, 132, 12946–12959. [Google Scholar] [CrossRef]
  12. Grimblat, N.; Zanardi, M.M.; Sarotti, A.M. Beyond DP4: An Improved Probability for the Stereochemical Assignment of Isomeric Compounds using Quantum Chemical Calculations of NMR Shifts. J. Org. Chem. 2015, 80, 12526–12534. [Google Scholar] [CrossRef]
  13. Grimblat, N.; Gavín, J.A.; Daranas, A.H.; Sarotti, A.M. Combining the Power of J Coupling and DP4 Analysis on Stereochemical Assignments: The J-DP4 Methods. Org. Lett. 2019, 21, 4003–4007. [Google Scholar] [CrossRef]
  14. Kutateladze, A.G.; Mukhina, O.A. Relativistic Force Field: Parametric Computations of Proton–Proton Coupling Constants in 1H NMR Spectra. J. Org. Chem. 2014, 79, 8397–8406. [Google Scholar] [CrossRef]
  15. Navarro-Vázquez, A.; Gil, R.R.; Blinov, K. Computer-Assisted 3D Structure Elucidation (CASE-3D) of Natural Products Combining Isotropic and Anisotropic NMR Parameters. J. Nat. Prod. 2018, 81, 203–210. [Google Scholar] [CrossRef]
  16. Gerrard, W.; Bratholm, L.A.; Packer, M.J.; Mulholland, A.J.; Glowacki, D.R.; Butts, C.P. IMPRESSION—Prediction of NMR parameters for 3-dimensional chemical structures using machine learning with near quantum chemical accuracy. Chem. Sci. 2019, 11, 508–515. [Google Scholar] [CrossRef] [Green Version]
  17. Guan, Y.; Sowndarya, S.V.S.; Gallegos, L.C.; John, P.C.S.; Paton, R.S. Real-time prediction of 1H and 13C chemical shifts with DFT accuracy using a 3D graph neural network. Chem. Sci. 2021, 12, 12012–12026. [Google Scholar] [CrossRef]
  18. Tsai, Y.-H.; Amichetti, M.; Zanardi, M.M.; Grimson, R.; Daranas, A.H.; Sarotti, A.M. ML-J-DP4: An Integrated Quantum Mechanics-Machine Learning Approach for Ultrafast NMR Structural Elucidation. Org. Lett. 2022, 24, 7487–7491. [Google Scholar] [CrossRef]
  19. Napolitano, J.G.; Norte, M.; Padrón, J.M.; Fernández, J.J.; Daranas, A.H. Belizeanolide, a Cytotoxic Macrolide from the DinoflagellateProrocentrum belizeanum. Angew. Chem. Int. Ed. 2009, 48, 796–799. [Google Scholar] [CrossRef]
  20. Lewis-Atwell, T.; Townsend, P.A.; Grayson, M.N. Comparisons of different force fields in conformational analysis and searching of organic molecules: A review. Tetrahedron 2021, 79, 131865. [Google Scholar] [CrossRef]
  21. Boudermine, S.; Parisi, V.; Lemoui, R.; Boudiar, T.; Chini, M.G.; Franceschelli, S.; Pecoraro, M.; Pascale, M.; Bifulco, G.; Braca, A.; et al. Cytotoxic Sesquiterpenoids from Ammoides atlantica Aerial Parts. J. Nat. Prod. 2022, 85, 647–656. [Google Scholar] [CrossRef]
  22. Li, J.; Li, C.; Riccio, R.; Lauro, G.; Bifulco, G.; Li, T.-J.; Tang, H.; Zhuang, C.-L.; Ma, H.; Sun, P.; et al. Chemistry and Selective Tumor Cell Growth Inhibitory Activity of Polyketides from the South China Sea Sponge Plakortis sp. Mar. Drugs 2017, 15, 129. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Jones, T.H.; Harrison, D.P.; Menegatti, C.; Mevers, E.; Knott, K.; Marek, P.; Hennen, D.A.; Kasson, M.T.; Macias, A.M.; Lovett, B.; et al. Deoxybuzonamine Isomers from the Millipede Brachycybe lecontii (Platydesmida: Andrognathidae). J. Nat. Prod. 2022, 85, 1134–1140. [Google Scholar] [CrossRef] [PubMed]
  24. Sierra, S.; Gomez, M.V.; Jiménez, A.I.; Pop, A.; Silvestru, C.; Marín, M.L.; Boscá, F.; Sastre, G.; Gómez-Bengoa, E.; Urriolabeitia, E.P. Stereoselective, Ruthenium-Photocatalyzed Synthesis of 1,2-Diaminotruxinic Bis-amino Acids from 4-Arylidene-5(4H)-oxazolones. J. Org. Chem. 2022, 87, 3529–3545. [Google Scholar] [CrossRef] [PubMed]
  25. de Oliveira, M.T.; Alves, J.M.A.; Braga, A.A.C.; Wilson, D.J.D.; Barboza, C.A. Do Double-Hybrid Exchange–Correlation Functionals Provide Accurate Chemical Shifts? A Benchmark Assessment for Proton NMR. J. Chem. Theory Comput. 2021, 17, 6876–6885. [Google Scholar] [CrossRef] [PubMed]
  26. Han, B.Y.; Lam, N.Y.S.; MacGregor, C.I.; Goodman, J.M.; Paterson, I. A synthesis-enabled relative stereochemical assignment of the C1–C28 region of hemicalide. Chem. Commun. 2018, 54, 3247–3250. [Google Scholar] [CrossRef] [Green Version]
  27. Kouamé, T.; Bernadat, G.; Turpin, V.; Litaudon, M.; Okpekon, A.T.; Gallard, J.-F.; Leblanc, K.; Rharrabti, S.; Champy, P.; Poupon, E.; et al. Structure Reassignment of Melonine and Quantum-Chemical Calculations-Based Assessment of Biosynthetic Scenarios Leading to Its Revised and Original Structures. Org. Lett. 2021, 23, 5964–5968. [Google Scholar] [CrossRef]
  28. Allinger, N.L.; Yuh, Y.H.; Lii, J.H. Molecular mechanics. The MM3 force field for hydrocarbons. 1. J. Am. Chem. Soc. 1989, 111, 8551–8566. [Google Scholar] [CrossRef]
  29. Cornell, W.D.; Cieplak, P.; Bayly, C.I.; Gould, I.R.; Merz, K.M.; Ferguson, D.M.; Spellmeyer, D.C.; Fox, T.; Caldwell, J.W.; Kollman, P.A. A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules. J. Am. Chem. Soc. 1995, 117, 5179–5197. [Google Scholar] [CrossRef] [Green Version]
  30. Halgren, T.A. Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94. J. Comput. Chem. 1996, 17, 490–519. [Google Scholar] [CrossRef]
  31. Jorgensen, W.L.; Maxwell, D.S.; Tirado-Rives, J. Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids. J. Am. Chem. Soc. 1996, 118, 11225–11236. [Google Scholar] [CrossRef]
  32. Lu, C.; Wu, C.; Ghoreishi, D.; Chen, W.; Wang, L.; Damm, W.; Ross, G.A.; Dahlgren, M.K.; Russell, E.; Von Bargen, C.D.; et al. OPLS4: Improving Force Field Accuracy on Challenging Regimes of Chemical Space. J. Chem. Theory Comput. 2021, 17, 4291–4300. [Google Scholar] [CrossRef]
  33. Yu, J.S.; Park, M.; Pang, C.; Rashan, L.; Jung, W.H.; Kim, K.H. Antifungal Phenols from Woodfordia uniflora Collected in Oman. J. Nat. Prod. 2020, 83, 2261–2268. [Google Scholar] [CrossRef]
  34. Tan, X.; Han, X.; Teng, H.; Li, Q.; Chen, Y.; Lei, X.; Yang, G. Structural Elucidation of Garcipaucinones A and B From Garcinia paucinervis Using Quantum Chemical Calculations. J. Nat. Prod. 2021, 84, 972–978. [Google Scholar] [CrossRef]
  35. Li, S.-W.; Cuadrado, C.; Yao, L.-G.; Daranas, A.H.; Guo, Y.-W. Quantum Mechanical–NMR-Aided Configuration and Conformation of Two Unreported Macrocycles Isolated from the Soft Coral Lobophytum sp.: Energy Calculations versus Coupling Constants. Org. Lett. 2020, 22, 4093–4096. [Google Scholar] [CrossRef]
  36. Sarotti, A.M. In Silico Reassignment of (+)-Diplopyrone by NMR Calculations: Use of a DP4/J-DP4/DP4+/DIP Tandem to Revise Both Relative and Absolute Configuration. J. Org. Chem. 2020, 85, 11566–11570. [Google Scholar] [CrossRef]
  37. Cen-Pacheco, F.; Norte, M.; Fernández, J.J.; Daranas, A.H. Zoaramine, a Zoanthamine-like Alkaloid with a New Skeleton. Org. Lett. 2014, 16, 2880–2883. [Google Scholar] [CrossRef]
  38. Daranas, A.H.; Fernández, J.; Gavín, J.; Norte, M. New alkaloids from a marine zoanthid. Tetrahedron 1999, 55, 5539–5546. [Google Scholar] [CrossRef]
  39. Villar, R.M.; Gil-Longo, J.; Daranas, A.H.; Souto, M.L.; Fernández, J.J.; Peixinho, S.; Barral, M.A.; Santafé, G.; Rodriguez, J.; Jiménez, C. Evaluation of the effects of several zoanthamine-type alkaloids on the aggregation of human platelets. Bioorg. Med. Chem. 2003, 11, 2301–2306. [Google Scholar] [CrossRef]
  40. García-García, P.; Reyes, R.; Évora, C.; Delgado, A.; Fernández, J.J.; Daranas, A.H. Osteoprotective effect of the marine alkaloid norzoanthamine on an osteoporosis model in ovariectomized rat. Biomed. Pharmacother. 2022, 147, 112631. [Google Scholar] [CrossRef]
Figure 1. Test set of molecules used in this work. The number of possible candidate structures generated for each one depends on the number of carbon atoms marked with an asterisk, where configurations were changed.
Figure 1. Test set of molecules used in this work. The number of possible candidate structures generated for each one depends on the number of carbon atoms marked with an asterisk, where configurations were changed.
Marinedrugs 20 00699 g001
Figure 2. Performance of DP4 (top), dJ-DP4 (middle), and iJ/dJ-DP4 (bottom) with the test set at the five FF studied (A: MM3, B: AMBER, C: MMFF, D: OPLS2005, E: OPLS4), using a 21 kJ/MAD 05 conformational criterion.
Figure 2. Performance of DP4 (top), dJ-DP4 (middle), and iJ/dJ-DP4 (bottom) with the test set at the five FF studied (A: MM3, B: AMBER, C: MMFF, D: OPLS2005, E: OPLS4), using a 21 kJ/MAD 05 conformational criterion.
Marinedrugs 20 00699 g002
Figure 3. (a) Overlay of the global minima structures of strychnine (1) obtained for each FF. (b) 13CMaxΔσ values (red) computed for strychnine at the B3LYP/6-31G** level, using the five geometries shown in (a) as inputs. The minimum (italics) and maximum (plane) σ values are shown for some representative nuclei, with the corresponding FF in parenthesis.
Figure 3. (a) Overlay of the global minima structures of strychnine (1) obtained for each FF. (b) 13CMaxΔσ values (red) computed for strychnine at the B3LYP/6-31G** level, using the five geometries shown in (a) as inputs. The minimum (italics) and maximum (plane) σ values are shown for some representative nuclei, with the corresponding FF in parenthesis.
Marinedrugs 20 00699 g003
Figure 4. Global minima conformation of 27 found at the B3LYP/6-31G** level, using the structures optimized at the five FF shown. Boltzmann´s amplitudes of the global minima for each FF are shown in red, and the contributions of those shapes at the remaining levels are shown in black.
Figure 4. Global minima conformation of 27 found at the B3LYP/6-31G** level, using the structures optimized at the five FF shown. Boltzmann´s amplitudes of the global minima for each FF are shown in red, and the contributions of those shapes at the remaining levels are shown in black.
Marinedrugs 20 00699 g004
Figure 5. Performances of DP4 (top), dJ-DP4 (middle), and iJ/dJ-DP4 (bottom) with the test set in the five FF studied (A: MM3, B: AMBER, C: MMFF, D: OPLS2005, E: OPLS4), after evaluating different strategies to remove duplicate conformations and energy thresholds.
Figure 5. Performances of DP4 (top), dJ-DP4 (middle), and iJ/dJ-DP4 (bottom) with the test set in the five FF studied (A: MM3, B: AMBER, C: MMFF, D: OPLS2005, E: OPLS4), after evaluating different strategies to remove duplicate conformations and energy thresholds.
Marinedrugs 20 00699 g005
Figure 6. (a) Workflow of mix-J-DP4, (b) iJ/dJ-DP4 results obtained with AMBER, MM3, and MMFF using 12 kJ/MAD 05, and averaged values (mix).
Figure 6. (a) Workflow of mix-J-DP4, (b) iJ/dJ-DP4 results obtained with AMBER, MM3, and MMFF using 12 kJ/MAD 05, and averaged values (mix).
Marinedrugs 20 00699 g006
Figure 7. Structures of zoarenone (34) and zoaramine (35).
Figure 7. Structures of zoarenone (34) and zoaramine (35).
Marinedrugs 20 00699 g007
Figure 8. Overlay of the global minima structures of the correct structure of zoarenone (34) obtained for each FF (AMBER: orange, MM3: grey, and MMFF: blue).
Figure 8. Overlay of the global minima structures of the correct structure of zoarenone (34) obtained for each FF (AMBER: orange, MM3: grey, and MMFF: blue).
Marinedrugs 20 00699 g008
Figure 9. Overlay of the conformers with >10% Boltzmann contribution for the correct structure of zoaramine (35) obtained for each FF (AMBER: orange, MM3: grey, and MMFF: blue).
Figure 9. Overlay of the conformers with >10% Boltzmann contribution for the correct structure of zoaramine (35) obtained for each FF (AMBER: orange, MM3: grey, and MMFF: blue).
Marinedrugs 20 00699 g009
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Cuadrado, C.; Daranas, A.H.; Sarotti, A.M. May the Force (Field) Be with You: On the Importance of Conformational Searches in the Prediction of NMR Chemical Shifts. Mar. Drugs 2022, 20, 699. https://doi.org/10.3390/md20110699

AMA Style

Cuadrado C, Daranas AH, Sarotti AM. May the Force (Field) Be with You: On the Importance of Conformational Searches in the Prediction of NMR Chemical Shifts. Marine Drugs. 2022; 20(11):699. https://doi.org/10.3390/md20110699

Chicago/Turabian Style

Cuadrado, Cristina, Antonio Hernández Daranas, and Ariel M. Sarotti. 2022. "May the Force (Field) Be with You: On the Importance of Conformational Searches in the Prediction of NMR Chemical Shifts" Marine Drugs 20, no. 11: 699. https://doi.org/10.3390/md20110699

APA Style

Cuadrado, C., Daranas, A. H., & Sarotti, A. M. (2022). May the Force (Field) Be with You: On the Importance of Conformational Searches in the Prediction of NMR Chemical Shifts. Marine Drugs, 20(11), 699. https://doi.org/10.3390/md20110699

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop