Methods in PES-Learn: Direct-Fit Machine Learning of Born–Oppenheimer Potential Energy Surfaces
Abstract
1. Introduction
2. Discussion
2.1. Dataset Generation and Sampling
2.2. Methods and Models
3. Results
3.1. Examples Revisited
3.2. Benzene
3.3. Ethanol
3.4. Gradients
3.5. Future Directions
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| PES | potential energy surface |
| ML | machine learning |
| GP | Gaussian process |
| NN | neural network |
| KRR | kernel ridge regression |
| RMSE | root mean squared error |
| DFT | density funcitonal theory |
| API | application program interface |
| EST | electronic structure theory |
| PIP | permutationally invariant polynomial |
| FI | fundamental invariant |
| RS | reference sample |
| SB | structure-based |
| SR | smart random |
| MAE | mean absolute error |
References
- Truhlar, D. Potential Energy Surfaces and Dynamics Calculations: For Chemical Reactions and Molecular Energy Transfer; Springer: Berlin/Heidelberg, Germany, 1981. [Google Scholar]
- Truhlar, D.G.; Steckler, R.; Gordon, M.S. Potential energy surfaces for polyatomic reaction dynamics. Chem. Rev. 1987, 87, 217–236. [Google Scholar] [CrossRef]
- Schlegel, H.B. Exploring potential energy surfaces for chemical reactions: An overview of some practical methods. J. Comput. Chem. 2003, 24, 1514–1527. [Google Scholar] [CrossRef] [PubMed]
- Cooper, A.M.; Hallmen, P.P.; Kästner, J. Potential energy surface interpolation with neural networks for instanton rate calculations. J. Chem. Phys. 2018, 148, 094106. [Google Scholar] [CrossRef]
- Schatz, G.C. The analytical representation of electronic potential-energy surfaces. Rev. Mod. Phys. 1989, 61, 669–688. [Google Scholar] [CrossRef]
- Dawes, R.; Quintas-Sánchez, E. The Construction of AB Initio-Based Potential Energy Surfaces. In Reviews in Computational Chemistry, Volume 31; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2018; Chapter 5; pp. 199–263. [Google Scholar] [CrossRef]
- Schmitz, G.; Godtliebsen, I.H.; Christiansen, O. Machine learning for potential energy surfaces: An extensive database and assessment of methods. J. Chem. Phys. 2019, 150, 244113. [Google Scholar] [CrossRef]
- Rupp, M. Machine learning for quantum mechanics in a nutshell. Int. J. Quant. Chem. 2015, 115, 1058–1073. [Google Scholar] [CrossRef]
- Meuwly, M. Machine Learning for Chemical Reactions. Chem. Rev. 2021, 121, 10218–10239. [Google Scholar] [CrossRef]
- Kwon, H.Y.; Morrow, Z.; Kelley, C.T.; Jakubikova, E. Interpolation Methods for Molecular Potential Energy Surface Construction. J. Phys. Chem. A 2021, 125, 9725–9735. [Google Scholar] [CrossRef]
- Aerts, A.; Schäfer, M.R.; Brown, A. Adaptive fitting of potential energy surfaces of small to medium-sized molecules in sum-of-product form: Application to vibrational spectroscopy. J. Chem. Phys. 2022, 156, 164106. [Google Scholar] [CrossRef]
- Sugisawa, H.; Ida, T.; Krems, R.V. Gaussian process model of 51-dimensional potential energy surface for protonated imidazole dimer. J. Chem. Phys. 2020, 153, 114101. [Google Scholar] [CrossRef]
- Kamath, A.; Vargas-Hernández, R.A.; Krems, R.V.; Carrington; Tucker, J.; Manzhos, S. Neural networks vs Gaussian process regression for representing potential energy surfaces: A comparative study of fit quality and vibrational spectrum accuracy. J. Chem. Phys. 2018, 148, 241702. [Google Scholar] [CrossRef] [PubMed]
- Ho, T.; Rabitz, H. A general method for constructing multidimensional molecular potential energy surfaces from ab initio calculations. J. Chem. Phys. 1996, 104, 2584–2597. [Google Scholar] [CrossRef]
- Keith, J.A.; Vassilev-Galindo, V.; Cheng, B.; Chmiela, S.; Gastegger, M.; Müller, K.R.; Tkatchenko, A. Combining Machine Learning and Computational Chemistry for Predictive Insights Into Chemical Systems. Chem. Rev. 2021, 121, 9816–9872. [Google Scholar] [CrossRef] [PubMed]
- Kuntz, D.; Wilson, A.K. Machine learning, artificial intelligence, and chemistry: How smart algorithms are reshaping simulation and the laboratory. Pure Appl. Chem. 2022, 94, 1019–1054. [Google Scholar] [CrossRef]
- Smith, J.S.; Isayev, O.; Roitberg, A.E. ANI-1: An extensible neural network potential with DFT accuracy at force field computational cost. Chem. Sci. 2017, 8, 3192–3203. [Google Scholar] [CrossRef]
- Kolb, B.; Lentz, L.C.; Kolpak, A.M. Discovering charge desnity functionals and structure-property relationships with PROPhet: A general framework for coupling machine learning and first-principles methods. Sci. Rep. 2017, 7, 1192. [Google Scholar] [CrossRef]
- Schütt, K.T.; Kessel, P.; Gastegger, M.; Nicoli, K.A.; Tkatchenko, A.; Müller, K.R. SchNetPack: A Deep Learning Toolbox For Atomistic Systems. J. Chem. Theory Comput. 2019, 15, 448–455. [Google Scholar] [CrossRef]
- Shao, Y.; Hellström, M.; Mitev, P.D.; Knijff, L.; Zhang, C. PiNN: A Python Library for Building Atomic Neural Networks of Molecules and Materials. J. Chem. Inf. Model. 2020, 60, 1184–1193. [Google Scholar] [CrossRef]
- Quintas-Sánchez, E.; Dawes, R. AUTOSURF: A Freely Available Program To Construct Potential Energy Surfaces. J. Chem. Inf. Model. 2019, 59, 262–271. [Google Scholar] [CrossRef]
- Dral, P.O. MLatom: A program package for quantum chemical research assisted by machine learning. J. Comp. Chem. 2019, 40, 2339–2347. [Google Scholar] [CrossRef]
- Haghighatlari, M.; Li, J.; Guan, X.; Zhang, O.; Das, A.; Stein, C.J.; Heidar-Zadeh, F.; Liu, M.; Head-Gordon, M.; Bertels, L.; et al. NewtonNet: A Newtonian message passing network for deep learning of interatomic potentials and forces. Digit. Discov. 2022, 1, 333–343. [Google Scholar] [CrossRef] [PubMed]
- Gyori, T.; Czakó, G. Automating the Development of High-Dimensional Reactive Potential Energy Surfaces with the robosurfer Program System. J. Chem. Theory Comput. 2020, 16, 51–66. [Google Scholar] [CrossRef] [PubMed]
- Pellegrini, F.; Lot, R.; Shaidu, Y.; Küçükbenli, E. PANNA 2.0: Efficient neural network interatomic potentials and new architectures. J. Chem. Phys. 2023, 159, 084117. [Google Scholar] [CrossRef] [PubMed]
- Podryabinkin, E.; Garifullin, K.; Shapeev, A.; Novikov, I. MLIP-3: Active learning on atomic environments with moment tensor potentials. J. Chem. Phys. 2023, 159, 084112. [Google Scholar] [CrossRef]
- Zeng, J.; Zhang, D.; Lu, D.; Mo, P.; Li, Z.; Chen, Y.; Rynik, M.; Huang, L.; Li, Z.; Shi, S.; et al. DeePMD-kit v2: A software package for deep potential models. J. Chem. Phys. 2023, 159, 054801. [Google Scholar] [CrossRef]
- Behler, J.; Parrinello, M. Generalized Neural-Network Representation of High-Dimensional Potential-Energy Surfaces. Phys. Rev. Lett. 2007, 98, 146401. [Google Scholar] [CrossRef]
- Anstine, D.M.; Zubatyuk, R.; Isayev, O. AIMNet2: A neural network potential to meet your neutral, charged, organic, and elemental-organic needs. Chem. Sci. 2025, 16, 10228–10244. [Google Scholar] [CrossRef]
- Kovács, D.P.; Moore, J.H.; Browning, N.J.; Batatia, I.; Horton, J.T.; Pu, Y.; Kapil, V.; Witt, W.C.; Magdău, I.B.; Cole, D.J.; et al. MACE-OFF: Short-Range Transferable Machine Learning Force Fields for Organic Molecules. J. Am. Chem. Soc. 2025, 147, 17598–17611. [Google Scholar] [CrossRef]
- Behler, J. Atom-centered symmetry functions for constructing high-dimensional neural network potentials. J. Chem. Phys. 2011, 134, 074106. [Google Scholar] [CrossRef]
- Behler, J.; Martoňák, R.; Donadio, D.; Parrinello, M. Pressure-induced phase transitions in silicon studied by neural network-based metadynamics simulations. Phys. Status Solidi B 2008, 245, 2618–2629. [Google Scholar] [CrossRef]
- Li, J.; Song, K.; Behler, J. A critical comparison of neural network potentials for molecular reaction dynamics with exact permutation symmetry. Phys. Chem. Chem. Phys. 2019, 21, 9672–9682. [Google Scholar] [CrossRef] [PubMed]
- Konings, M.; Harvey, J.N.; Loreau, J. Machine Learning Representations of the Three Lowest Adiabatic Electronic Potential Energy Surfaces for the ArH2+ Reactive System. J. Phys. Chem. A 2023, 127, 8083–8094. [Google Scholar] [CrossRef] [PubMed]
- Yang, N.; Hill, S.; Manzhos, S.; Carrington, T. A local Gaussian Processes method for fitting potential surfaces that obviates the need to invert large matrices. J. Mol. Spectrosc. 2023, 393, 111774. [Google Scholar] [CrossRef]
- Krondorfer, J.K.; Binder, C.W.; Hauser, A.W. Symmetry- and gradient-enhanced Gaussian process regression for the active learning of potential energy surfaces in porous materials. J. Chem. Phys. 2023, 159, 014115. [Google Scholar] [CrossRef]
- Kushwaha, A.; Ritika; Chahal, P.; Dhilip Kumar, T.J. Rotational Excitation of NCCN by p-H2(jc = 0) at Low Temperatures. ACS Earth Space Chem. 2023, 7, 515–522. [Google Scholar] [CrossRef]
- Manzhos, S.; Carrington, T.J. Neural Network Potential Energy Surfaces for Small Molecules and Reactions. Chem. Rev. 2021, 121, 10187–10217. [Google Scholar] [CrossRef]
- Manzhos, S.; Ihara, M. Neural Network with Optimal Neuron Activation Functions Based on Additive Gaussian Process Regression. J. Phys. Chem. A 2023, 127, 7823–7835. [Google Scholar] [CrossRef]
- Arab, F.; Nazari, F.; Illas, F. Artificial Neural Network-Derived Unified Six-Dimensional Potential Energy Surface for Tetra Atomic Isomers of the Biogenic [H, C, N, O] System. J. Chem. Theory Comput. 2023, 19, 1186–1196. [Google Scholar] [CrossRef]
- Behler, J. Neural network potential-energy surfaces in chemistry: A tool for large-scale simulations. Phys. Chem. Chem. Phys. 2011, 13, 17930–17955. [Google Scholar] [CrossRef]
- Feng, Y.; Yng, Z.; Chen, H.; Mao, Y.; Chen, M. A globally accurate neural network potential energy surface and quantum dynamics study of Mg+(2S)+H2→MgH+ + H reaction. Chem. Phys. Lett. 2024, 842, 141223. [Google Scholar] [CrossRef]
- Zuo, J.; Zhang, D.; Truhlar, D.G.; Guo, H. Global Potential Energy Surfaces by Compressed-State Multistate Pair-Density Functional Theory: The Lowest Doublet States Responsible for the N(4Su) + C2(a3Πu) → CN(X2Σ+) + C(3Pg) Reaction. J. Chem. Theory Comput. 2022, 18, 7121–7131. [Google Scholar] [CrossRef] [PubMed]
- Dral, P.O.; Owens, A.; Yurchenko, S.N.; Thiel, W. Structure-based sampling and self-correcting machine learning for accurate calculations of potential energy surfaces and vibrational levels. J. Chem. Phys. 2017, 146, 244108. [Google Scholar] [CrossRef] [PubMed]
- Dral, P.O.; Owens, A.; Dral, A.; Csányi, G. Hierarchical machine learning of potential energy surfaces. J. Chem. Phys. 2020, 152, 204110. [Google Scholar] [CrossRef] [PubMed]
- Ji, H.; Rágyanszki, A.; Fournier, R.A. Machine learning estimation of reaction energy barriers. Comput. Theor. Chem. 2023, 1229, 114332. [Google Scholar] [CrossRef]
- Chmiela, S.; Tkatchenko, A.; Sauceda, H.E.; Poltavsky, I.; Schütt, K.T.; Müller, K.R. Machine learning of accurate energy-conserving molecular force fields. Sci. Adv. 2017, 3, e1603015. [Google Scholar] [CrossRef]
- Castro-Palacio, J.C.; Nagy, T.; Bemish, R.J.; Meuwly, M. Computational study of collisions between O(3P) and NO(2Π) at temperatures relevant to the hypersonic flight regime. J. Chem. Phys. 2014, 141, 164319. [Google Scholar] [CrossRef]
- Abbot, A.S.; Turney, J.M.; Zhang, B.; Smith, D.G.A.; Altarawy, D.; Schaefer, H.F. PES-Learn: An Open-Source Software Package for the Automated Generation of Machine Learning Models of Molecular Potential Energy Surfaces. J. Chem. Theory Comput. 2019, 15, 4386–4398. [Google Scholar] [CrossRef]
- Unke, O.T.; Castro-Palacio, J.C.; Bemish, R.J.; Meuwly, M. Collision-induced rotational excitation in –Ar: Comparison of computations and experiment. J. Chem. Phys. 2016, 144, 224307. [Google Scholar] [CrossRef]
- Ho, T.; Hollebeek, T.; Rabitz, H.; Harding, L.B.; Schatz, G.C. A global H2O potential energy surface for the reaction O(1D)+H2→OH+H. J. Chem. Phys. 1996, 105, 10472–10486. [Google Scholar] [CrossRef]
- Langer, M.F.; Goeßmann, A.; Rupp, M. Representations of molecules and materials for interpolation of quantum-mechanical simulations via machine learning. npj Comput. Mater. 2022, 8, 41. [Google Scholar] [CrossRef]
- Snyder, J.C.; Rupp, M.; Hansen, K.; Müller, K.R.; Burke, K. Finding Density Functionals with Machine Learning. Phys. Rev. Lett. 2012, 108, 253002. [Google Scholar] [CrossRef]
- Vu, K.; Snyder, J.C.; Li, L.; Rupp, M.; Chen, B.F.; Khelif, T.; Müller, K.R.; Burke, K. Understanding kernel ridge regression: Common behaviors from simple functions to density functionals. Int. J. Quant. Chem. 2015, 115, 1115–1128. [Google Scholar] [CrossRef]
- Unke, O.T.; Chmiela, S.; Sauceda, H.E.; Gastegger, M.; Poltavsky, I.; Schütt, K.T.; Tkatchenko, A.; Müller, K.R. Machine Learning Force Fields. Chem. Rev. 2021, 121, 10142–10186. [Google Scholar] [CrossRef] [PubMed]
- Hansen, K.; Montavon, G.; Biegler, F.; Fazli, S.; Rupp, M.; Scheffler, M.; von Lilienfeld, O.A.; Tkatchenko, A.; Müller, K.R. Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies. J. Chem. Theory Comput. 2013, 9, 3404–3419. [Google Scholar] [CrossRef] [PubMed]
- Rauer, C.; Bereau, T. Hydration free energies from kernel-based machine learning: Compound-database bias. J. Chem. Phys. 2020, 153, 014101. [Google Scholar] [CrossRef]
- Khan, D.; Heinen, S.; von Lilienfeld, O.A. Kernel based quantum machine learning at record rate: Many-body distribution functionals as compact representations. J. Chem. Phys. 2023, 159, 034106. [Google Scholar] [CrossRef]
- Wu, Y.; Prezhdo, N.; Chu, W. Increasing Efficiency of Nonadiabatic Molecular Dynamics by Hamiltonian Interpolation with Kernel Ridge Regression. J. Phys. Chem. A 2021, 125, 9191–9200. [Google Scholar] [CrossRef]
- Smith, D.G.A.; Altarawy, D.; Burns, L.A.; Welborn, M.; Naden, L.N.; Ward, L.; Ellis, S.; Pritchard, B.P.; Crawford, T.D. The MolSSI QCArchive project: An open-source platform to compute, organize, and share quantum chemistry data. Wiley Interdiscip. Rev. Comput. Mol. Sci. 2021, 11, e1491. [Google Scholar] [CrossRef]
- Schneider, M.; Born, D.; Kästner, J.; Rauhut, G. Positioning of grid points for spanning potential energy surfaces—How much effort is really needed? J. Chem. Phys. 2023, 158, 144118. [Google Scholar] [CrossRef]
- Ziegler, B.; Rauhut, G. Efficient generation of sum-of-products representations of high-dimensional potential energy surfaces based on multimode expansions. J. Chem. Phys. 2016, 144, 114114. [Google Scholar] [CrossRef]
- Braams, B.J.; Bowman, J.M. Permutationally invariant potential energy surfaces in high dimensionality. Int. Rev. Phys. Chem. 2009, 28, 577–606. [Google Scholar] [CrossRef]
- Xie, Z.; Bowman, J.M. Permutationally Invariant Polynomial Basis for Molecular Energy Surface Fitting via Monomial Symmetrization. J. Chem. Theory Comput. 2010, 6, 26–34. [Google Scholar] [CrossRef] [PubMed]
- King, S.A. Minimal generating sets of non-modular invariant rings of finite groups. J. Symb. Comput. 2013, 48, 101–109. [Google Scholar] [CrossRef]
- Derksen, H.; Kemper, G. Computational Invariant Theory; Encyclopaedia of Mathematical Sciences; Springer: Berlin/Heidelberg, Germany, 2015. [Google Scholar]
- Houston, P.L.; Qu, C.; Yu, Q.; Conte, R.; Nandi, A.; Li, J.K.; Bowman, J.M. PESPIP: Software to fit complex molecular and many-body potential energy surfaces with permutationally invariant polynomials. J. Chem. Phys. 2023, 158, 044109. [Google Scholar] [CrossRef] [PubMed]
- Yang, K.R.; Varga, Z.; Parker, K.A.; Shu, Y.; Truhlar, D.G. PIPFit 2022. Available online: https://comp.chem.umn.edu/pipfit/ (accessed on 8 July 2024).
- Houston, P.L.; Qu, C.; Yu, Q.; Pandey, P.; Conte, R.; Nandi, A.; Bowman, J.M. No Headache for PIPs: A PIP Potential for Aspirin Runs Much Faster and with Similar Precision Than Other Machine-Learned Potentials. J. Chem. Theory Comput. 2024, 20, 3008–3018. [Google Scholar] [CrossRef]
- Fu, B.; Zhang, D.H. Accurate fundamental invariant-neural network representation of ab initio potential energy surfaces. Natl. Sci. Rev. 2023, 10, nwad321. [Google Scholar] [CrossRef]
- Fu, B.; Zhang, D.H. Ab Initio Potential Energy Surfaces and Quantum Dynamics for Polyatomic Bimolecular Reactions. J. Chem. Theory Comput. 2018, 14, 2289–2303. [Google Scholar] [CrossRef]
- Jiang, B.; Li, J.; Guo, H. High-Fidelity Potential Energy Surfaces for Gas-Phase and Gas–Surface Scattering Processes from Machine Learning. J. Phys. Chem. Lett. 2020, 11, 5120–5131. [Google Scholar] [CrossRef]
- Pearson, K.X. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Lond. Edinb. Philos. Mag. J. Sci. 1900, 50, 157–175. [Google Scholar] [CrossRef]
- Sobol’, I.M. On the distribution of points in a cube and the approximate evaluation of integrals. USSR Comput. Math. Math. Phys. 1967, 7, 86–112. [Google Scholar] [CrossRef]
- Manzhos, S.; Carrington, T. Using an internal coordinate Gaussian basis and a space-fixed Cartesian coordinate kinetic energy operator to compute a vibrational spectrum with rectangular collocation. J. Chem. Phys. 2016, 145, 224110. [Google Scholar] [CrossRef] [PubMed]
- Unke, O.T.; Meuwly, M. Toolkit for the Construction of Reproducing Kernel-Based Representations of Data: Application to Multidimensional Potential Energy Surfaces. J. Chem. Inf. Model. 2017, 57, 1923–1931. [Google Scholar] [CrossRef] [PubMed]
- Schütt, K.T.; Chmiela, S.; Lilienfeld, O.A.V.; Tkatchenko, A.; Tsuda, K.; Müller, K.R. Machine Learning Meets Quantum Physics; Springer: Cham, Switzerland, 2020; Chapter 3; pp. 25–35. [Google Scholar] [CrossRef]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Wang, W.; Jing, B.Y. Gaussian process regression: Optimality, robustness, and relationship with kernel ridge regression. J. Mach. Learn. Res. 2022, 23, 1–67. [Google Scholar]
- Paszke, A.; Gross, S.; Chintala, S.; Chanan, G.; Yang, E.; DeVito, Z.; Lin, Z.; Desmaison, A.; Antiga, L.; Lerer, A. Automatic differentiation in PyTorch. In Proceedings of the NIPS 2017 Autodiff Workshop, Long Beach, CA, USA, 8 December 2017. [Google Scholar]
- Feynman, R.P. Forces in Molecules. Phys. Rev. 1939, 56, 340. [Google Scholar] [CrossRef]
- Qu, C.; Yu, Q.; Van Hoozen, B.L.J.; Bowman, J.M.; Vargas-Hernández, R.A. Assessing Gaussian Process Regression and Permutationally Invariant Polynomial Approaches To Represent High-Dimensional Potential Energy Surfaces. J. Chem. Theory Comput. 2018, 14, 3381–3396. [Google Scholar] [CrossRef]
- Yu, Q.; Bowman, J.M. Ab Initio Potential for H3O+ → H+ + H2O: A Step to a Many-Body Representation of the Hydrated Proton? J. Chem. Theory Comput. 2016, 12, 5284–5292. [Google Scholar] [CrossRef]
- Fortenberry, R.C.; Yu, Q.; Mancini, J.S.; Bowman, J.M.; Lee, T.J.; Crawford, T.D.; Klemperer, W.F.; Francisco, J.S. Communication: Spectroscopic consequences of proton delocalization in OCHCO+. J. Chem. Phys. 2015, 143, 071102. [Google Scholar] [CrossRef]
- Christensen, A.S.; von Lilienfeld, O.A. On the role of gradients for machine learning of molecular energies and forces. Mach. Learn. Sci. Technol. 2020, 1, 045018. [Google Scholar] [CrossRef]
- Bowman, J.M.; Qu, C.; Conte, R.; Nandi, A.; Houston, P.L.; Yu, Q. The MD17 datasets from the perspective of datasets for gas-phase “small” molecule potentials. J. Chem. Phys. 2022, 156, 240901. [Google Scholar] [CrossRef]
- Pinheiro, M.; Ge, F.; Ferré, N.; Dral, P.O.; Barbatti, M. Choosing the right molecular machine learning potential. Chem. Sci. 2021, 12, 14396–14413. [Google Scholar] [CrossRef]
- Perdew, J.P.; Burke, K.; Ernzerhof, M. Generalized Gradient Approximation Made Simple. Phys. Rev. Lett. 1996, 77, 3865–3868. [Google Scholar] [CrossRef] [PubMed]
- Weigend, F.; Ahlrichs, R. Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for H to Rn: Design and assessment of accuracy. Phys. Chem. Chem. Phys. 2005, 7, 3297–3305. [Google Scholar] [CrossRef] [PubMed]
- Hyndman, R.J.; Koehler, A.B. Another look at measures of forecast accuracy. Int. J. Forecast. 2006, 22, 679–688. [Google Scholar] [CrossRef]
- Houston, P.L.; Qu, C.; Nandi, A.; Conte, R.; Yu, Q.; Bowman, J.M. Permutationally invariant polynomial regression for energies and gradients, using reverse differentiation, achieves orders of magnitude speed-up with high precision compared to other machine learning methods. J. Chem. Phys. 2022, 156, 044120. [Google Scholar] [CrossRef]
- Waldher, B.; Kuta, J.; Chen, S.; Henson, N.; Clark, A.E. ForceFit: A code to fit classical force fields to quantum mechanical potential energy surfaces. J. Comp. Chem. 2010, 31, 2307–2316. [Google Scholar] [CrossRef]
- Qu, C.; Yu, Q.; Conte, R.; Houston, P.L.; Nandi, A.; Bowman, J.M. A Δ-machine learning approach for force fields, illustrated by a CCSD(T) 4-body correction to the MB-pol water potential. Digit. Discov. 2022, 1, 658–664. [Google Scholar] [CrossRef]
- Bowman, J.M.; Qu, C.; Conte, R.; Nandi, A.; Houston, P.L.; Yu, Q. Δ-Machine Learned Potential Energy Surfaces and Force Fields. J. Chem. Theory Comput. 2023, 19, 1–17. [Google Scholar] [CrossRef]
- Rodriguez, A.; Smith, J.S.; Mendoza-Cortes, J.L. Does Hessian Data Improve the Performance of Machine Learning Potentials? J. Chem. Theory Comput. 2025, 21, 6698–6710. [Google Scholar] [CrossRef]
- Shu, Y.; Truhlar, D.G. Diabatization by Machine Intelligence. J. Chem. Theory Comput. 2020, 16, 6456–6464. [Google Scholar] [CrossRef]
- Goodlett, S.M.; Turney, J.M.; Schaefer, H.F. Comparison of multifidelity machine learning models for potential energy surfaces. J. Chem. Phys. 2023, 159, 044111. [Google Scholar] [CrossRef] [PubMed]






| PIP-LS a | GP a | PES-Learn KRR b | PES-Learn GP c | PES-Learn NN c | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RS | RS | RS | SB | SR | RS | SB | SR | RS | SB | SR | ||
| H3O+ | 500 | 116.99 | 238.74 | 85.01 | 51.22 | 91.68 | 19.97 | 15.24 | 22.59 | 22.53 | 15.27 | 34.76 |
| 1000 | 39.20 | 125.76 | 40.40 | 24.38 | 42.14 | 10.71 | 6.09 | 10.22 | 13.99 | 11.77 | 27.78 | |
| 2000 | 28.13 | 36.33 | 23.59 | 17.38 | 22.52 | 5.88 | 2.94 | 6.36 | 11.35 | 6.99 | 15.31 | |
| 5000 | 11.81 | 11.17 | ||||||||||
| 10,000 | 9.99 | 5.62 | ||||||||||
| 32,141 | 7.09 | |||||||||||
| OCHCO+ | 520 | 259.18 | 32.09 | 53.88 | 25.82 | 58.59 | 23.77 | 8.70 | 19.62 | 38.84 | 14.33 | 46.79 |
| 780 | 293.53 | 26.62 | 37.34 | 21.86 | 49.79 | 23.03 | 4.06 | 17.87 | 29.60 | 5.92 | 34.96 | |
| 1560 | 128.65 | 15.78 | 27.25 | 2.82 | 19.83 | 10.58 | 1.03 | 8.03 | 11.99 | 2.33 | 10.30 | |
| 2600 | 97.40 | 15.55 | ||||||||||
| H2CO | 5104 | 432 | 238 | 228.28 | 260.50 | 358.82 | 191.15 | 402.84 | 375.27 | 118.53 | 124.09 | 189.14 |
| 8703 | 332 | 229 | 234.38 | 201.00 | 290.50 | 121.07 | 91.36 | 146.29 | ||||
| KRR | GP | NN | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SB | Sobol’ | SR | SB | Sobol’ | SR | SB | Sobol’ | SR | ||
| RMSE (kcal mol−1) | 50 | 1.99 | 1.86 | 1.78 | 2.41 | 2.42 | 2.30 | 41.18 | 2.39 | 2.67 |
| 100 | 1.25 | 1.43 | 1.36 | 1.91 | 1.32 | 1.61 | 2.31 | 2.01 | 2.14 | |
| 150 | 1.61 | 1.63 | 1.31 | 1.11 | 0.99 | 0.84 | 1.60 | 1.92 | 1.82 | |
| 200 | 1.08 | 1.53 | 1.45 | 0.65 | 0.64 | 0.80 | 1.30 | 1.19 | 1.58 | |
| 400 | 0.57 | 0.44 | 0.50 | 0.24 | 0.20 | 0.19 | 0.98 | 0.67 | 0.75 | |
| 600 | 0.33 | 0.26 | 0.25 | 0.14 | 0.13 | 0.13 | 0.44 | 0.55 | 0.30 | |
| 800 | 0.24 | 0.20 | 0.21 | 0.12 | 0.11 | 0.11 | 0.33 | 0.24 | 0.26 | |
| 1000 | 0.20 | 0.16 | 0.16 | 0.10 | 0.10 | 0.10 | 0.25 | 0.19 | 0.19 | |
| Runtime (min) | 50 | 3.72 | 3.70 | 4.56 | 1.86 | 2.19 | 2.18 | 5.60 | 9.17 | 9.96 |
| 100 | 5.87 | 6.14 | 6.16 | 2.88 | 3.93 | 3.52 | 10.55 | 9.76 | 6.88 | |
| 150 | 4.61 | 4.59 | 8.62 | 4.77 | 4.93 | 5.42 | 12.72 | 11.95 | 25.60 | |
| 200 | 7.98 | 5.83 | 6.60 | 7.46 | 7.95 | 9.79 | 7.87 | 11.49 | 17.23 | |
| 400 | 10.54 | 11.27 | 11.35 | 37.94 | 33.38 | 34.50 | 23.28 | 15.63 | 15.45 | |
| 600 | 25.57 | 25.31 | 27.90 | 78.68 | 97.50 | 75.94 | 11.03 | 20.04 | 34.00 | |
| 800 | 34.36 | 35.71 | 34.75 | 122.51 | 125.31 | 123.77 | 27.10 | 34.13 | 60.67 | |
| 1000 | 40.29 | 41.88 | 43.46 | 158.49 | 232.81 | 222.74 | 14.46 | 29.03 | 36.38 | |
| KRR | GP | NN | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SB | Sobol’ | SR | SB | Sobol’ | SR | SB | Sobol’ | SR | ||
| RMSE (kcal mol−1) | 50 | 4.29 | 2.99 | 3.05 | 4.29 | 4.33 | 3.76 | 7.47 | 4.95 | 9.27 |
| 100 | 4.29 | 3.12 | 3.09 | 4.28 | 3.01 | 3.25 | 5.73 | 4.23 | 4.82 | |
| 150 | 4.29 | 2.90 | 2.90 | 3.49 | 2.33 | 2.44 | 4.36 | 3.70 | 3.51 | |
| 200 | 3.59 | 2.80 | 2.86 | 2.47 | 2.11 | 2.07 | 4.19 | 3.24 | 3.28 | |
| 400 | 2.40 | 1.86 | 2.08 | 2.34 | 1.38 | 1.46 | 3.00 | 1.94 | 2.14 | |
| 600 | 1.62 | 1.37 | 1.34 | 1.53 | 1.23 | 1.28 | 1.93 | 1.49 | 1.61 | |
| 800 | 1.23 | 1.12 | 1.06 | 1.27 | 0.98 | 0.99 | 1.54 | 1.27 | 1.31 | |
| 1000 | 1.08 | 0.95 | 0.97 | 1.03 | 0.82 | 0.79 | 1.50 | 1.08 | 1.06 | |
| Runtime (min) | 50 | 3.19 | 2.99 | 3.22 | 1.66 | 1.74 | 1.87 | 7.14 | 4.30 | 8.39 |
| 100 | 5.78 | 4.72 | 5.32 | 1.93 | 3.59 | 2.24 | 9.11 | 7.30 | 7.51 | |
| 150 | 8.05 | 6.96 | 6.98 | 3.85 | 5.57 | 5.72 | 13.00 | 6.16 | 16.27 | |
| 200 | 8.67 | 7.91 | 8.87 | 6.60 | 6.77 | 7.51 | 9.27 | 13.92 | 22.21 | |
| 400 | 17.75 | 16.37 | 17.27 | 18.17 | 20.80 | 18.03 | 11.84 | 34.59 | 18.03 | |
| 600 | 25.21 | 25.52 | 27.54 | 45.76 | 43.13 | 41.66 | 28.81 | 23.09 | 15.42 | |
| 800 | 31.85 | 32.88 | 35.62 | 81.92 | 69.55 | 74.74 | 15.59 | 18.34 | 13.28 | |
| 1000 | 42.03 | 41.21 | 45.42 | 107.33 | 82.34 | 70.86 | 9.69 | 9.72 | 38.47 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Beck, I.T.; Turney, J.M.; Schaefer, H.F., III. Methods in PES-Learn: Direct-Fit Machine Learning of Born–Oppenheimer Potential Energy Surfaces. Molecules 2026, 31, 100. https://doi.org/10.3390/molecules31010100
Beck IT, Turney JM, Schaefer HF III. Methods in PES-Learn: Direct-Fit Machine Learning of Born–Oppenheimer Potential Energy Surfaces. Molecules. 2026; 31(1):100. https://doi.org/10.3390/molecules31010100
Chicago/Turabian StyleBeck, Ian T., Justin M. Turney, and Henry F. Schaefer, III. 2026. "Methods in PES-Learn: Direct-Fit Machine Learning of Born–Oppenheimer Potential Energy Surfaces" Molecules 31, no. 1: 100. https://doi.org/10.3390/molecules31010100
APA StyleBeck, I. T., Turney, J. M., & Schaefer, H. F., III. (2026). Methods in PES-Learn: Direct-Fit Machine Learning of Born–Oppenheimer Potential Energy Surfaces. Molecules, 31(1), 100. https://doi.org/10.3390/molecules31010100

