Review on the QM/MM Methodologies and Their Application to Metalloproteins

Tzeliou, Christina Eleftheria; Mermigki, Markella Aliki; Tzeli, Demeter

doi:10.3390/molecules27092660


by Christina Eleftheria Tzeliou ¹, Markella Aliki Mermigki ¹ and Demeter Tzeli ¹,²,*

¹ Laboratory of Physical Chemistry, Department of Chemistry, National and Kapodistrian University of Athens, Panepistimiopolis Zografou, 157 71 Athens, Greece

² Theoretical and Physical Chemistry Institute, National Hellenic Research Foundation, 48 Vassileos Constantinou Ave., 116 35 Athens, Greece

* Author to whom correspondence should be addressed.

Molecules 2022, 27(9), 2660; https://doi.org/10.3390/molecules27092660

Submission received: 28 March 2022 / Revised: 12 April 2022 / Accepted: 18 April 2022 / Published: 20 April 2022

(This article belongs to the Special Issue Advances in Modeling of Chemical Reactions by QM/MM Calculations)


Abstract

The multiscale quantum mechanics/molecular mechanics (QM/MM) approach was introduced in 1976, although it became widely accepted only in the 1990s. The combination of the QM/MM approach with molecular dynamics (MD) simulation, known as the QM/MM/MD approach, is a powerful and promising tool for investigating the reaction mechanisms of complex molecular systems, drug delivery, the properties of molecular devices, organic electronics, etc. In the present review, the main methodologies used in multiscale approaches, i.e., density functional theory (DFT), semiempirical (SE) methodologies, MD simulations, and MM, are briefly discussed together with their recent advances. Then, calculations and reactions on metalloproteins are reviewed, with particular attention to nitrogenase, which catalyzes the conversion of atmospheric nitrogen molecules N₂ into NH₃ through the process known as nitrogen fixation, and to its FeMo-cofactor.

Keywords:

multiscale calculations; QM/MM; DFT; semi-empirical; molecular dynamics; molecular mechanics; metalloproteins; chemical reactions; nitrogenase; FeMoco


1. Introduction

Warshel and Levitt introduced the multiscale quantum mechanics/molecular mechanics approach, i.e., QM/MM, for the investigation of complex molecular systems in 1976 [1]. This methodology was first applied to an enzymatic reaction. The method became widely accepted in the 1990s, following a study in which the coupling of SE methods with a molecular force field was fully described and the accuracy and efficiency of the QM/MM treatment were assessed against ab initio and experimental data [2]. In the last few decades, many simulations of biomolecular systems have been carried out using QM/MM approaches. Moreover, many reviews have evaluated these methods and the improvements introduced over the years. Additionally, QM/MM is combined with other methods, including free-energy and reaction path methods and methods that consider the quantum nature of atomic motion, to obtain more accurate results in studies of complex systems, especially enzymatic reactions [3]. Generally, the QM/MM approach is well established for modeling complex biomolecular systems, inorganic, organometallic, and solid-state systems, as well as for the study of processes that take place in explicit solvent [3].

In 2013, the Nobel Prize in Chemistry was awarded jointly to Martin Karplus, Michael Levitt and Arieh Warshel “for the development of multiscale models for complex chemical systems”, in recognition of their significant contribution to computational chemistry. Theoretical calculations based on these models can predict chemical processes and explain and interpret experimental data [4]. They also complement experimental information by adding detail. The work of Karplus, Levitt and Warshel is revolutionary because it combined the classical description of matter with quantum physics and chemistry. Until then, one type of methodology had to be chosen, i.e., classical or quantum. Classical physics treats large molecules in a simple way, which is an advantage in terms of computational cost, but it cannot simulate chemical reactions. In contrast, the quantum treatment demands enormous computing power and can therefore be applied only to small systems. The QM/MM approach resolves this dilemma by combining both descriptions for a more accurate simulation [4].

In the present review, new advances in the main methodologies that are combined in the QM/MM and QM/MM/MD approaches, i.e., DFT, SE methods, MD simulations, and MM, are briefly discussed. Then, reactions on metalloproteins are reviewed, with emphasis on nitrogenase.

2. Methodologies

2.1. Density Functional Theory

DFT was introduced in 1964 by Hohenberg and Kohn [5]. It predicts molecular properties from the electron density of a molecule, which is one of its physical properties. In contrast to Hartree-Fock (HF) theory, where the full N-electron wavefunction is calculated, DFT aims at calculating the total electronic energy by considering only the total electron density distribution. The inhomogeneous electron gas model proposed by Hohenberg and Kohn showed that the ground state energy of a system, E, is determined by its electron density, ρ(r). Specifically, the energy functional is written as follows:

E[ρ(r)] = \int V_{e/ext}(r) ρ(r) dr + F[ρ(r)]

(1)

where \int V_{e/ext}(r) ρ(r) dr corresponds to the interaction of the electrons with an external potential (e.g., the Coulomb interactions with the nuclei), while F[ρ(r)] corresponds to the kinetic energy and the contributions from interelectronic interactions. In 1965, Kohn and Sham [6] considered F[ρ(r)] to be a sum of three terms:

F[ρ(r)] = E_{KE}[ρ(r)] + E_{H}[ρ(r)] + E_{XC}[ρ(r)]

(2)

where E_{KE}[ρ(r)] is the kinetic energy of a system of non-interacting electrons with the same electron density as the real one, E_{H}[ρ(r)] is the Coulomb energy of the electrons, and E_{XC}[ρ(r)] is a term that contains the contributions from the exchange and correlation energies and also accounts for the corrections to the kinetic energy that arise from the electron-electron interaction. In particular, the exchange energy is a stabilization energy that arises from the tendency of same-spin electrons to avoid each other. It has no classical analogue and comes from the Pauli principle. It is a stabilization energy because the real Coulomb repulsions are lower when same-spin electrons avoid each other.

The major advantage of DFT is the inclusion of correlation energy. In the HF method, electron i is considered to move in an average potential generated by all other electrons j, with i \neq j. However, the motion of the electrons is instantaneously correlated: they avoid each other more dynamically than an average potential can describe, so the real Coulomb repulsions between them are lower. Hence, correlation energy is also a stabilization energy, and it is included in the DFT method.

Thus, the full expression of the energy would be:

E[ρ(r)] = \sum_{i=1}^{N} \int ψ_{i}^{*}(r) (-\frac{\nabla^{2}}{2}) ψ_{i}(r) dr + \frac{1}{2} \iint \frac{ρ(r_{1}) ρ(r_{2})}{|r_{1} - r_{2}|} dr_{1} dr_{2} + E_{XC}[ρ(r)] - \sum_{A=1}^{M} Z_{A} \int \frac{ρ(r)}{|r - R_{A}|} dr

(3)

where the first term is the kinetic energy of the system of non-interacting electrons, the second corresponds to the interelectronic repulsions, the third is the exchange-correlation energy, and the last is the Coulomb attraction between electrons and nuclei. Kohn and Sham considered the electron density to be the sum of the square moduli of N one-electron orbitals:

ρ (r) = \sum_{i = 1}^{N} | ψ_{i} (r) |^{2}

(4)
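Equation (4) can be illustrated numerically. The following is a minimal sketch, assuming one-dimensional particle-in-a-box orbitals as stand-ins for Kohn-Sham orbitals and one electron per orbital; the function names and the box model are illustrative assumptions and are not taken from the reviewed literature. It builds the density as a sum of squared orbitals and checks that the density integrates to the number of electrons.

```python
import math

def box_orbital(n, x, length=1.0):
    """Normalized particle-in-a-box orbital psi_n(x); a model orbital, not a real Kohn-Sham orbital."""
    return math.sqrt(2.0 / length) * math.sin(n * math.pi * x / length)

def density(x, n_occ, length=1.0):
    """rho(x) = sum_i |psi_i(x)|^2 over the occupied orbitals, as in Equation (4)."""
    return sum(box_orbital(n, x, length) ** 2 for n in range(1, n_occ + 1))

def integrate(f, a, b, steps=2000):
    """Composite trapezoidal rule on [a, b]."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for k in range(1, steps):
        total += f(a + k * h)
    return total * h

n_electrons = 4  # hypothetical occupation: one electron in each of the four lowest orbitals
n_integrated = integrate(lambda x: density(x, n_electrons), 0.0, 1.0)
print(n_integrated)  # should be very close to the number of electrons
```

Because each orbital is normalized, the integral of the density recovers the electron count, which is the basic consistency condition any density built from Equation (4) must satisfy.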

The challenge of DFT is to find an appropriate functional to describe the exchange-correlation energy [7,8,9]. For this matter, several approximations have been proposed, leading to a plethora of functionals. Generally, there are four categories of approximations.

The simplest approximation for the exchange-correlation functional is the Local Density Approximation (LDA). It is based on the uniform electron gas and assumes that the density can be treated locally as uniform throughout the system. The local spin-density approximation (LSDA) is a generalization of LDA in which the electron spin is included. Some of the most popular LDA functionals are the Vosko-Wilk-Nusair (VWN) [10] and the Perdew-Wang (PW92) [11] functionals. However, this approximation is not appropriate for molecules, in which the electron density is clearly nonuniform, whereas it works well for calculating the electronic band structures of solids.
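As an illustration of what "local" means in the LDA, the exchange part for a spin-unpolarized density has the well-known Dirac form, E_x = -(3/4)(3/π)^{1/3} \int ρ(r)^{4/3} dr in atomic units. The sketch below evaluates this sum on a uniform grid; the grid, the density values, and the function name are hypothetical, and the correlation part (e.g., VWN or PW92) is deliberately left out.

```python
import math

# Dirac exchange constant in atomic units (spin-unpolarized case)
C_X = 0.75 * (3.0 / math.pi) ** (1.0 / 3.0)

def lda_exchange_energy(rho_values, cell_volume):
    """E_x = -C_X * sum_k rho_k^(4/3) * dV on a uniform grid of equal-volume cells."""
    return -C_X * sum(rho ** (4.0 / 3.0) for rho in rho_values) * cell_volume

# Hypothetical densities on 1000 cells of volume 0.001 bohr^3 (total volume 1 bohr^3)
uniform = [0.5] * 1000
nonuniform = [0.9] * 500 + [0.1] * 500  # same number of electrons, uneven distribution

e_x_uniform = lda_exchange_energy(uniform, 0.001)
e_x_nonuniform = lda_exchange_energy(nonuniform, 0.001)
print(e_x_uniform, e_x_nonuniform)
```

Each grid cell contributes according to its own density only, which is why the approximation performs best when the density varies slowly in space.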

The second category contains functionals that include a gradient correction; this category is a significant improvement on the LDA approach. The gradient accounts for the non-uniformity of the electron density, so that ρ(r) is no longer considered constant. These functionals are known as gradient-corrected (GC), generalized gradient approximation (GGA), or non-local functionals. Typically, the gradient corrections are divided into separate exchange and correlation functionals, such as the Becke exchange functional B88 [12] and the Lee-Yang-Parr correlation functional, LYP [13]. Their combination led to the widely used BLYP GGA functional. An improvement on the GGA is the meta-GGA approach, where the functionals depend on the density, on its gradient, and on its second derivatives, e.g., M06-L [14].

The third category includes the hybrid functionals, which combine elements from ab initio methodologies with improvements via DFT mathematical formulas. A percentage of exact HF exchange is included (i.e., ab initio exchange without any parametrization). This approach yields the hybrid GGA functionals, among which is B3LYP [15] (which contains 20% exact HF exchange). Note that hybrid functionals, for instance B3LYP, are widely preferred for computational chemistry calculations. Moreover, there are the hybrid meta-GGA functionals. One of them is the M06 functional, which was proposed by Y. Zhao and D.G. Truhlar in 2006 and contains 27% HF exchange [15]. In addition, double hybrid functionals have been developed, such as B2PLYP, which combine exact HF exchange with a perturbative second-order correlation part obtained from the DFT orbitals and eigenvalues [16]. Finally, the recent range-separated functionals, e.g., HSE06, LC-wPBE, and RS-DDH, belong to this category [17,18].

Finally, the last category includes combinations of DFT with other ab initio methodologies, such as multiconfiguration pair DFT (MC-PDFT) [19], the multireference DFT (MRDFT) method [20], and dynamical DFT (DDFT), which is an extension of DFT to nonequilibrium systems [21]. However, all these methods are more time-consuming than traditional DFT methods.

The development of different types of functionals has been useful in describing a variety of systems and applications. It is important to mention here the functionals that include long-range corrections. Generally, the non-Coulomb part of the exchange-correlation functionals decays quickly and is not accurate at large distances, making such functionals unsuitable for the study of electron excitations to high orbitals and of non-covalent interactions, including the van der Waals interactions that are typically found in biological systems. Various schemes have been constructed to handle such cases. Commonly used functionals for these cases are LC-wHPBE [22], CAM-B3LYP [23], wB97XD [24], MN15 [25], etc.

Finally, time-dependent DFT (TD-DFT) has been developed for computing the excited states of a molecular system. TD-DFT shares the same philosophy as DFT, but it considers a time-dependent problem [26]. According to the Runge-Gross theorem [27], for a given initial state the time-dependent electron density uniquely determines the time-dependent external potential, up to an additive purely time-dependent function. The Hamiltonian of the system takes the following form:

\hat{H}(t) = \hat{T} + \hat{V}_{e/ext}(t) + \hat{W}

(5)

Here \hat{T} is the kinetic energy operator, \hat{V}_{e/ext}(t) is the operator of the time-dependent external potential, and \hat{W} is the operator of the electron-electron interactions. Within TD-DFT, a Kohn-Sham scheme is employed, in a similar fashion to ground-state DFT. Thus, a non-interacting system is considered that yields the same time-dependent electron density as the real, interacting system. A Hamiltonian for the non-interacting system is constructed:

\hat{H}(t) = \hat{T} + \hat{V}_{KS}(r, t)

(6)

where \hat{V}_{KS}(r, t) stands for the Kohn-Sham potential, which acts on the non-interacting wavefunction Φ(r, t).

In summary, DFT is a computationally cheap methodology compared to ab initio methods such as multireference and coupled cluster approaches. It recovers part of the electron correlation energy, which is defined as the energy of the exact solution of the Schrödinger equation minus the HF energy. However, contrary to other methods, a decision must be made as to which functional will be used for a specific application. For instance, the TPSSh functional [28] is a very good choice for molecules containing transition metals, but not for organic molecules; B3LYP is a good, “standard” choice for projects involving relatively small closed-shell molecules; MPW1K [29] is exceptionally good for modeling the kinetics of reactions through the determination of transition states, etc. Generally, although DFT is an exact reformulation of quantum theory, approximations are needed for the exchange-correlation energy functional. Most of the deficiencies of approximate functionals stem from two main errors of standard density functionals: the delocalization error and the static correlation error [30]. Nevertheless, DFT can be used for systems of up to a few hundred atoms, its accuracy can be comparable to that of ab initio methods, and efforts are being made to derive functionals suitable for many types of applications [25].

2.2. Semiempirical Methods

Semiempirical methods bridge ab initio and empirical approaches for calculations on the large molecules of biological systems. These methods are built on the HF formalism, with many differences that arise from approximations and empirical data. They have been characterized as the new generation of SCF methods [31]. The main concept of SE methods is that some complicated integrals are not calculated explicitly; instead, they are parameterized and replaced by approximations, and many unimportant terms are neglected. The errors resulting from this process can be corrected by incorporating empirical parameters into the formalism and adjusting them against accurate experimental or calculated reference values. The SE representation tries to retain the essential physics of the studied system, while the parameterization accounts for all other effects in an average sense; the result is then evaluated in terms of numerical accuracy. These methods are often not very accurate, but they are efficient [32].

The SE approaches are categorized into two major groups depending on how they treat the system. The first is Hückel’s π-electron method, where the MOs are generated essentially from the connectivity matrix of the molecule. It is mainly used for the calculation of the excited states of polyenes and other unsaturated molecules, leading to important qualitative physical insights concerning structure, stability, and spectroscopy [33]. An advance on this theory is Hoffmann’s extended Hückel theory, where all valence electrons are included and which is robust for inorganic and organometallic compounds [29]. Below, various SE methods are reported, which have qualitatively improved MO theory and are used to understand chemical phenomena in terms of orbital interactions [34].
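To make the role of the connectivity matrix concrete, the sketch below sets up a simple Hückel Hamiltonian for the π system of butadiene and diagonalizes it. It is an illustrative sketch only: the parameter values α = 0 and β = -1 (energies in units of |β|), the helper names, and the small cyclic Jacobi eigenvalue routine (used here to avoid external libraries) are assumptions, not part of the reviewed methods.

```python
import math

def huckel_matrix(n_atoms, bonds, alpha=0.0, beta=-1.0):
    """Hueckel Hamiltonian: alpha on the diagonal, beta for every bonded pair (i, j)."""
    h = [[0.0] * n_atoms for _ in range(n_atoms)]
    for i in range(n_atoms):
        h[i][i] = alpha
    for i, j in bonds:
        h[i][j] = h[j][i] = beta
    return h

def jacobi_eigenvalues(matrix, tol=1e-20, max_sweeps=100):
    """Eigenvalues of a real symmetric matrix by cyclic Jacobi rotations."""
    a = [row[:] for row in matrix]
    n = len(a)
    for _ in range(max_sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if abs(a[p][q]) < 1e-15:
                    continue
                # Rotation angle that zeroes a[p][q], kept within |theta| <= pi/4
                if a[p][p] == a[q][q]:
                    theta = math.pi / 4.0
                else:
                    theta = 0.5 * math.atan(2.0 * a[p][q] / (a[q][q] - a[p][p]))
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # update columns p and q (A <- A J)
                    akp, akq = a[k][p], a[k][q]
                    a[k][p] = c * akp - s * akq
                    a[k][q] = s * akp + c * akq
                for k in range(n):  # update rows p and q (A <- J^T A)
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k] = c * apk - s * aqk
                    a[q][k] = s * apk + c * aqk
    return sorted(a[i][i] for i in range(n))

# Butadiene pi system: four carbon atoms in a chain, bonds 0-1, 1-2, 2-3
energies = jacobi_eigenvalues(huckel_matrix(4, [(0, 1), (1, 2), (2, 3)]))
print(energies)
```

For a linear chain of n atoms the Hückel orbital energies are known analytically, E_k = α + 2β cos(kπ/(n+1)), which provides a convenient check on the numerical result.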

In the first group, only the one-electron integrals are included and the methods are therefore noniterative, whereas two-electron integrals are treated in the semiempirical SCF methods. For π-electrons, the Pariser–Parr–Pople method is a successful approach for the electronic spectra of unsaturated molecules [35,36]. In the second group, Pople proposed a generalization to all valence electrons and established integral approximations that satisfy rotational invariance and other consistency criteria. The CNDO, INDO and NDDO methods belong to this group; the acronyms stand for complete neglect of differential overlap (CNDO), intermediate neglect of differential overlap (INDO), and neglect of diatomic differential overlap (NDDO) [37]. Dewar recommended calibration against experimental reference data, mostly for organic molecules in their ground states. Three new models were developed: the MINDO/3 method, which is INDO-based, and the MNDO and AM1 methods, which are NDDO-based. Next, PM3 was created through a new parameterization of the MNDO model. Formally, AM1 and PM3 differ from MNDO only in the choice of the empirical core repulsion function, and they partly serve to explore the limits of parameterization of the MNDO electronic structure model [38,39,40,41,42,43]. The MNDO model has been generalized from an sp basis to an spd basis, which gives reliable results for heavier elements, particularly hypervalent main-group elements. Extended parameterizations based on an spd basis led to the PM6 and PM7 methods, which are widely used, especially PM6, for calculations on metalloproteins [44,45]. Lastly, a series of orthogonalization models, namely OM1, OM2, and OM3, have been proposed, in which orthogonalization corrections are incorporated in the one-electron terms of NDDO [46].

The development of each methodology is based on different integral approximations and on the character of the interactions that are incorporated. For instance, MNDO and OMx treat the valence electrons via SCF-MO theory using a minimal basis set. The core electrons are accounted for through a reduced nuclear charge, and electron correlation is treated explicitly only if it is required for a zero-order description. Dynamic correlation effects are included in an average way through the two-electron integrals and the overall parameterization [32]. Integral approximations based on the neglect of all three-center and four-center two-electron integrals simplify the standard SCF-MO equations. These approximations are included in the CNDO, INDO, and NDDO methods. Additionally, the MNDO-type methods, which are the ones most used in calculations on metalloproteins, use Slater-type atomic orbitals as basis functions. In the Fock matrix, the one-center integrals are derived, after some adjustments, from available atomic spectroscopic data. Further parameterization is applied to the one-center and two-center two-electron integrals, with the latter following classical electrostatics at large distances. The parameterization of the original MNDO method emphasized ground-state properties, mostly geometries and heats of formation, with ionization potentials and dipole moments used as supplementary reference information [32].

The AM1 and PM3 methods, which are used in calculations on metalloproteins, are based on the same model as MNDO and differ from it only in the effective atom-pair potential of the core–core repulsion function. More adjustable parameters are included, making the function more flexible. The extra Gaussian terms are empirical and have no theoretical foundation [41,42,43]. The parameterization of AM1, PM3 and MNDO follows the same philosophy, but the number of parameters optimized per element was increased to 18 in PM3, whereas it was 5 to 7 in MNDO. The three methods use an sp basis without d orbitals and therefore cannot be applied directly to most transition metal compounds, such as the metal sites of many metalloproteins. This issue is overcome when the two-center two-electron integrals are parameterized for an spd basis, which is an extension of the original point-charge model for an sp basis. This is done in the MNDO/d extension, which can be applied in conjunction with any MNDO-type approach, as in the PM6 and PM7 methods. These methods are widely applicable since they have been parametrized for many elements, e.g., the MNDO/d parameters are available for second-row elements, halogens, and zinc group elements, while PM6 parameters have been established for about 70 elements of the Periodic Table so far [32].

To conclude, SE methods are valuable tools for studying electronic effects in large molecules, and they can be applied successfully to complex systems. More details on the above formalisms can be found in the original publications and in several comprehensive review articles [47,48,49,50,51,52,53,54].

2.3. Molecular Mechanics (MM)

The investigation of processes of biological importance often requires the modelling of large systems, consisting of hundreds or thousands of atoms. The number of electrons in such systems is too large for rigorous quantum chemical calculations, even with current computational capacity. Thus, the empirical MM methodology is employed, in which the energy of the system is treated with classical mechanics. Note that the Born-Oppenheimer approximation is also assumed. The potential energy of the system is determined as a function of the nuclear coordinates with the use of molecular force fields (FF), while the motions of the electrons are not considered. The energy of the system is described by its Hamiltonian within a classical (Newtonian) framework, which includes a kinetic energy term K and a potential energy term V: ℋ = K + V. The definition of the potential energy V requires special attention. Molecular force fields can be regarded as a rather simple, four-component picture of the intra- and inter-molecular forces inside a system. Specifically, the potential energy of the system [55,56,57] can be written as

V(r^{N}) = \sum_{bonds} \frac{k_{i}}{2} (b_{i} - b_{o})^{2} + \sum_{angles} \frac{k_{i}}{2} (θ_{i} - θ_{o})^{2} + \sum_{torsions} \frac{V_{n}}{2} (1 + \cos(n ω - γ)) + \sum_{i=1}^{N} \sum_{j=i+1}^{N} (4 ε_{ij} [(\frac{σ_{ij}}{r_{ij}})^{12} - (\frac{σ_{ij}}{r_{ij}})^{6}] + \frac{q_{i} q_{j}}{4 π ε_{o} r_{ij}})

(7)

where the first term is a summation over all the bonds, the second a summation over all the angles, the third a summation over all the torsions, and the fourth includes all non-bonded interactions, i.e., van der Waals (vdW) and Coulomb contributions.

Bond terms: Molecules undergo vibrational motion, which is modelled with a harmonic potential according to Hooke’s law [57]:

v(b) = -\int F(b) db = \int k (b - b_{o}) db = \frac{k}{2} (b - b_{o})^{2}

where k represents the force constant, k = ω^{2} μ, ω is the vibrational frequency of the bond, μ the reduced mass and b_{o} the equilibrium value around which the bond oscillates. Both k and b_{o} are parametrized for the types of atoms that participate in the bond. In most force fields, an atom type carries additional information about the hybridization state and even the local environment [53]. Although modelling a bond using Hooke’s law allows for some vibrational deviation from the equilibrium bond length, true bond stretching is not harmonic. Because of this anharmonicity, the average bond length deviates from the equilibrium value, and at high energies the bond dissociates. Hooke’s law is therefore a reasonable approximation only near the equilibrium bond distances of ground-state molecules; a more accurate description is given by the Morse potential:

v(b) = D_{e} (1 - e^{-α (b - b_{o})})^{2}

where D_{e} is the depth of the potential energy minimum, α = ω \sqrt{\frac{μ}{2 D_{e}}}, μ is the reduced mass and ω is the vibrational frequency of the bond. Although it describes the bond more accurately, the Morse potential is not usually used in MM force fields, since it requires three parameters to be specified for each bond. The inability to model bond breaking (and bond formation) is among the most important limitations of the MM methodology. Thus, the QM methodology must be employed to examine and interpret these phenomena.
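The difference between the two bond models can be seen in a short numerical comparison. This is a minimal sketch with hypothetical parameter values in arbitrary but consistent units; the relation α = \sqrt{k / (2 D_{e})} used in the code follows from α = ω \sqrt{μ / (2 D_{e})} together with k = ω^{2} μ.

```python
import math

def harmonic(b, k, b0):
    """Hooke's law bond energy: k/2 * (b - b0)^2."""
    return 0.5 * k * (b - b0) ** 2

def morse(b, d_e, alpha, b0):
    """Morse bond energy: D_e * (1 - exp(-alpha * (b - b0)))^2."""
    return d_e * (1.0 - math.exp(-alpha * (b - b0))) ** 2

# Hypothetical parameters (arbitrary consistent units)
k, b0, d_e = 500.0, 1.0, 100.0
alpha = math.sqrt(k / (2.0 * d_e))  # matches the harmonic curvature at b0

for b in (1.01, 1.5, 3.0, 10.0):
    print(b, harmonic(b, k, b0), morse(b, d_e, alpha, b0))
```

Close to b_{o} the two curves nearly coincide, whereas at large separations the Morse energy levels off at D_{e} (dissociation) while the harmonic energy keeps growing, which is why a harmonic force field cannot describe bond breaking.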

Angle terms: The deviation of angles from their reference values is described using a harmonic potential. The second term of the potential energy V,

\sum_{angles} \frac{k_{i}}{2} (θ_{i} - θ_{o})^{2}

describes bond bending, which is also characterized by a force constant k_{i} and an equilibrium angle θ_{o}. Both depend on the types of the atoms involved and on characteristics such as their hybridization.

Torsion terms: They are included to describe the steric barrier between atoms separated by three covalent bonds. The motion associated with this steric effect is the rotation about the bond connecting the two middle atoms, described by a dihedral angle. Unlike bond stretches and bends, which require quite substantial energies to cause significant deformations from their reference values, torsional rotations are energetically less expensive and account for most of the variations in the structure and relative energies of a molecule. The third term of the potential is a periodic one,

\sum_{torsions} \frac{V_{n}}{2} (1 + \cos(n ω - γ))

and it represents the rotational degrees of freedom of the molecule, where V_{n} is a constant corresponding to the barrier height of the rotation, n is the multiplicity, i.e., the number of minima of the function as the bond rotates over 360°, ω is the dihedral angle, and γ is the phase factor, which determines where the torsional energy passes through its minimum.
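The meaning of the multiplicity n and the phase γ can be checked with a small sketch of a single torsion term. The parameter values below are hypothetical, and the minima are counted on a one-degree grid purely for illustration.

```python
import math

def torsion_energy(omega_deg, v_n, n, gamma_deg=0.0):
    """Single torsion term: V_n/2 * (1 + cos(n*omega - gamma)), angles given in degrees."""
    omega = math.radians(omega_deg)
    gamma = math.radians(gamma_deg)
    return 0.5 * v_n * (1.0 + math.cos(n * omega - gamma))

def count_minima(v_n, n, gamma_deg=0.0):
    """Count local minima over one full rotation, sampled every degree with periodic wrap-around."""
    energies = [torsion_energy(a, v_n, n, gamma_deg) for a in range(360)]
    m = len(energies)
    return sum(1 for i in range(m)
               if energies[i] < energies[i - 1] and energies[i] < energies[(i + 1) % m])

# Hypothetical threefold torsion (as for rotation about an sp3-sp3 bond), V_n = 2.0
print(count_minima(2.0, 3))          # number of minima over 360 degrees
print(torsion_energy(0.0, 2.0, 3))   # barrier top for gamma = 0
print(torsion_energy(60.0, 2.0, 3))  # one of the minima for gamma = 0
```

With γ = 0 the maxima lie at ω = 0°, 120°, 240° and the minima at 60°, 180°, 300°; changing γ shifts these positions without changing their number or the barrier height V_{n}.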

Non-Bonded Terms: They consist of the vdW and Coulomb contributions. In most force fields, the vdW contributions to the potential energy are described by the Lennard-Jones potential [58]:

V_{vdW} = \sum_{i=1}^{N} \sum_{j=i+1}^{N} 4 ε_{ij} [(\frac{σ_{ij}}{r_{ij}})^{12} - (\frac{σ_{ij}}{r_{ij}})^{6}]

where r_{ij} is the distance between two particles, ε_{ij} is the depth of the energy well and σ_{ij} is the interatomic distance at which the energy becomes zero. The distance σ_{ij} effectively represents the closest distance at which two particles can approach each other, because for r_{ij} < σ_{ij} the potential energy rises steeply and tends to infinity as r_{ij} → 0, while at long distances the potential energy tends to zero, i.e., the particles do not interact. Finally, the Coulomb interactions are described by Coulomb’s law, \frac{q_{i} q_{j}}{4 π ε_{o} r_{ij}}.
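The non-bonded double sum can be sketched as follows. This is an illustrative sketch under simplifying assumptions: a single ε and σ are used for all pairs (real force fields use pair-specific values, usually obtained from combining rules), reduced units are used so that 1/(4πε_{o}) = 1, and the function names and coordinates are hypothetical.

```python
import math

def lennard_jones(r, epsilon, sigma):
    """4*eps*[(sigma/r)^12 - (sigma/r)^6] for one pair."""
    sr6 = (sigma / r) ** 6
    return 4.0 * epsilon * (sr6 * sr6 - sr6)

def coulomb(r, q_i, q_j, k_e=1.0):
    """Coulomb energy of one pair; k_e stands for 1/(4*pi*eps_0), set to 1 in reduced units."""
    return k_e * q_i * q_j / r

def nonbonded_energy(coords, charges, epsilon, sigma, k_e=1.0):
    """Sum over unique pairs i < j, mirroring the double sum of the non-bonded term."""
    total = 0.0
    n = len(coords)
    for i in range(n - 1):
        for j in range(i + 1, n):
            r = math.dist(coords[i], coords[j])
            total += lennard_jones(r, epsilon, sigma)
            total += coulomb(r, charges[i], charges[j], k_e)
    return total

# Hypothetical three-particle system in reduced units
coords = [(0.0, 0.0, 0.0), (1.2, 0.0, 0.0), (0.0, 1.5, 0.0)]
charges = [0.4, -0.4, 0.0]
print(nonbonded_energy(coords, charges, epsilon=1.0, sigma=1.0))
```

The pair function reproduces the properties described above: the Lennard-Jones energy is zero at r = σ, reaches its minimum of -ε at r = 2^{1/6} σ, and decays to zero at long range.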

MM uses atom types to assign the functional forms and the parameters that make up the FF. As mentioned above, an element may be described by various MM atom types, depending on several characteristics and conditions, for instance hybridization and chemical environment. Some examples of MM force fields are UFF, Dreiding, MM2, MM3, MM4, MMFF, AMBER, CHARMM, OPLS, and ECEPP. UFF considers the type of element, its hybridization, and its connectivity; it can also be employed in MD simulations. Dreiding employs general force constants along with geometry parameters, while hybridization is considered. MM2 is used mainly for simple organic molecules, e.g., ethers, ketones, aromatic compounds, etc.; in MM2, the anharmonicity of bond stretching is included via additional terms. MM3 is an improved form of MM2 whose potential functions include corrections and/or modifications, e.g., correction of the high rotational barriers in congested hydrocarbons, alterations in the vdW parameters, etc. MM4 incorporates some additional interactions, such as torsion–bend and bend–torsion–bend interactions, resulting in a better calculation of vibrational frequencies. MMFF (Merck Molecular FF) is built on a broad range of high-quality data and is used for MM and MD simulations. AMBER stands for Assisted Model Building with Energy Refinement; it is appropriate for the modeling of both small molecules and polymers. CHARMM stands for Chemistry at HARvard Macromolecular Mechanics; it is appropriate for applications such as conformational analysis, energy minimization, and free energy calculations, and it is applied in the study of biomolecules, i.e., peptides, nucleic acids, proteins, lipids, and carbohydrates. OPLS stands for optimized potentials for liquid simulations; it employs additional functions that describe H-bonding. Finally, ECEPP, an empirical conformational energy program for peptides, uses experimental data that are continuously updated, and it employs a series of parameters for the definition of the geometry of amino acid residues and of the interatomic interactions. To sum up, there is a series of molecular force fields suitable for various applications, which make the investigation of chemical processes involving large systems feasible at a good level of accuracy.

2.4. Molecular Dynamics Simulations

MD simulations are very useful for understanding the physical basis of the structure and function of many biological macromolecules. They were first developed in the late 1970s and have evolved over the decades from simulations of systems with hundreds of atoms to macromolecules of biological interest such as nucleosomes, ribosomes, and the macromolecules of interest here, metalloproteins. The number of atoms in the simulated systems ranges from 50,000 up to 500,000. The largest systems require appropriate computer facilities, which are provided by high-performance computing (HPC). For metalloproteins, a dynamic model is built in which both the internal motions and the resulting conformational changes are important for function. This contrasts with the older view that proteins have a rather rigid geometrical structure [59].

MD simulations follow and predict the trajectories of the particles of a studied system. To this end, a simple algorithm is used, which calculates the trajectories through the force field approach. It begins with the calculation of the potential energy E_pot({x_i}) as a function of the particle coordinates, and it continues with the calculation of the force acting on each particle, F_i = −∂E_pot/∂x_i, and of its acceleration, a_i = F_i/m_i. It ends with the calculation of the velocity, v_i(t + dt) = v_i(t) + a_i dt, and of the particle coordinate, x_i(t + dt) = x_i(t) + v_i dt. Repeating these steps yields the complete trajectory of each particle. For a system of N particles, the algorithm propagates 3N coordinates in total [59].
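The loop described above can be sketched for a single particle in one dimension. This is a minimal sketch, assuming a harmonic model potential E_pot = k x^2 / 2 in reduced units; the function names and parameter values are hypothetical. The position update uses the freshly updated velocity, which makes the scheme a semi-implicit Euler integrator; production MD codes typically use Verlet-type integrators instead.

```python
def force(x, k=1.0):
    """F = -dE_pot/dx for the model potential E_pot = k * x^2 / 2."""
    return -k * x

def total_energy(x, v, mass, k=1.0):
    """Kinetic plus potential energy of the model system."""
    return 0.5 * mass * v * v + 0.5 * k * x * x

def run_md(x0, v0, mass, dt, n_steps, k=1.0):
    """Propagate one particle and return the trajectory as (time, x, v) tuples."""
    x, v = x0, v0
    trajectory = [(0.0, x, v)]
    for step in range(1, n_steps + 1):
        a = force(x, k) / mass   # a = F / m
        v = v + a * dt           # v(t + dt) = v(t) + a dt
        x = x + v * dt           # x(t + dt) = x(t) + v dt, using the updated velocity
        trajectory.append((step * dt, x, v))
    return trajectory

# Hypothetical run: unit mass and force constant, time step 0.01, 10,000 steps
trajectory = run_md(x0=1.0, v0=0.0, mass=1.0, dt=0.01, n_steps=10000)
e_start = total_energy(1.0, 0.0, 1.0)
e_end = total_energy(trajectory[-1][1], trajectory[-1][2], 1.0)
print(e_start, e_end)
```

For this scheme the total energy is not exactly conserved but oscillates within a narrow band around its initial value when the time step is small, which is a useful sanity check on any MD integrator.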

The representations are based on different levels of detail. The metalloprotein can be initially modeled using structures found experimentally or taken from other modeling data. The atomistic representation is not commonly used for large systems such as metalloproteins, even though it gives the best reproduction of a system. The most suitable model is the coarse-grained (CG) one, where molecules are represented by “pseudo-atoms” that approximate specific groups of atoms; for example, a pseudo-atom may represent a whole amino acid residue, with individual atoms not considered [60]. At first, CG models were developed on the basis of classical statistical mechanics, but over time CG methods that consider quantum Boltzmann statistics were established [61]. Specifically, information from the fine-grained atomistic level is transferred to the CG level in order to describe the studied system better on the basis of physical laws. A bottom-up CG theory in quantum Boltzmann statistics, based on the Multiscale CG methodology, has already been developed and describes biomolecular systems more accurately [61]. Additionally, other bottom-up methodologies have been developed that are based on inverse Monte Carlo simulation, Boltzmann inversion, iterative variation, and the multiscale CG methodology in general [62,63,64,65]. The solvent can be simulated together with the metalloprotein, and its representation is a crucial matter for the system under investigation. The explicit addition of solvent molecules is the most effective approach, although it increases the size of the system. Explicit solvent can recover most of the solvation effects, including those that arise from entropy, such as the hydrophobic effect. Once the components of the system and their representations have been defined, their interactions are described through force fields. All of the above lead to the calculation of the potential energy of the system under study [59].

The force-field representation involves the solution of complex equations, which is readily carried out with the assistance of computers. The bond lengths and the angles are represented by springs. Periodic functions are used for bond rotations, while Lennard–Jones potentials and Coulomb’s law are used for vdW and electrostatic interactions, respectively. In this way, energy and force calculations, even for large systems, are exceptionally rapid. The FFs that are used in atomistic molecular simulations are parameterized in different ways. This is a consequence of the different types of FFs, namely general FFs that are used widely for various chemical compounds and dedicated ones for specific types of systems [66]. For a suitable selection of an FF for a studied system, the desired results need to be taken into consideration, i.e., FFs based on spectroscopic, structural, and thermodynamic data are chosen for calculations of the analogous properties. Combining an accurate reproduction of the desired structure with the right FF leads to a correct description of the studied system [66]. As far as the parameter fitting is concerned, a careful selection of the set of potential energy functions, of the reference data set, and of a methodology for quantitatively correlating experimental and calculated structural parameters is essential for a successful computational representation. Most parameter fitting is carried out manually, but automated optimization procedures are being developed. Lastly, the parameters may originate from both experimental and theoretical data. The most common and most suitable parameters are obtained from structural data from X-ray experiments, but in cases where there are no experimental data, structural parameters are extracted from DFT calculations [66]. 
Simulations in which modern FFs are used commonly give equivalent results, but the parameters used in classical FF representations are not necessarily interchangeable, and not all FFs permit the representation of all molecule types [67,68]. A representative case is the different conclusions on the helicity of protein structures that are obtained from the classic set, Amber ff14SB in conjunction with the TIP3P three-point water model and standard ions, compared to the modern set, Amber ff19SB with the OPC four-point water model and 12-6-4 ion parameters. The classic set results in an inherent underestimation of helicity in a protein structure, in contrast to the modern set, which appears to have better predictive power not only for the basic protein structure, but also for protein mutations, sequence-specific behavior, and rational protein design [69]. Despite all these, when studying a reacting system, such as reactions of metalloproteins, a reactive FF needs to be used for the best description of the system. Therefore, many reactive FFs have been developed. Specifically, ReaxFF is the most common one, where Coulomb and Morse (van der Waals) potentials participate in the calculation of the nonbonded interactions between all atoms of the reactants [70]. Its parameters need to be derived from already verified computational methods, such as calculations on bond dissociation and reactions of small molecules, combined with heats of formation and structural data [70]. In addition, another widespread reactive FF is the Empirical Valence Bond (EVB) model, with which chemical reactions that are carried out by enzymes or in condensed phases are reliably studied in different environments [71]. Moving on, the laws of motion from classical physics are employed for the calculation of accelerations and velocities and for the update of the atom positions in all FF types. 
The use of a time step shorter than the fastest motion in the molecule is essential to avoid instability when the equations of motion are integrated numerically. One of the most significant obstacles in this simulation procedure is the fact that this time step usually ranges between 1 and 2 fs in atomistic simulations. Microsecond-long simulations of biological processes therefore demand about 10⁹ repetitions of this calculation cycle. This constitutes one of the strengths of the coarse-grained approaches: the time length of the simulations is extended when the system is represented in a simpler manner, since more time steps can be carried out. To increase the performance of MD simulations, algorithmic advances are used, i.e., parallel execution, graphics processing units (GPUs), and fine-tuning of the energy calculations [59].
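The functional forms mentioned above (springs for bonds and angles, periodic functions for bond rotations, Lennard–Jones and Coulomb terms for the nonbonded interactions) can be written down compactly. The following Python sketch is illustrative only: the function and parameter names are hypothetical, the prefactor convention of the periodic term differs between force fields, and the default Coulomb constant (about 332.06 kcal Å mol⁻¹ e⁻²) is an approximate value assumed here for distances in Å and charges in units of e.

```python
import math


def harmonic_energy(value, equilibrium, force_constant):
    """Spring term for a bond length or, with angles in radians, a bond angle."""
    return 0.5 * force_constant * (value - equilibrium) ** 2


def torsion_energy(phi, barrier, periodicity, phase=0.0):
    """Periodic term for rotation about a bond (one common convention, assumed here)."""
    return barrier * (1.0 + math.cos(periodicity * phi - phase))


def lennard_jones(r, epsilon, sigma):
    """12-6 Lennard-Jones potential used for vdW interactions."""
    sr6 = (sigma / r) ** 6
    return 4.0 * epsilon * (sr6 * sr6 - sr6)


def coulomb(r, q1, q2, ke=332.06):
    """Coulomb's law; ke is an approximate constant for kcal/mol, Angstrom, and e units."""
    return ke * q1 * q2 / r
```

Because each term is a closed-form function of a few coordinates, the total energy is a simple sum over bonds, angles, torsions, and atom pairs, which is why such evaluations are rapid even for large systems.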

Nowadays, new-generation computers are equipped with accelerators and support parallelism, both of which are suitable for speeding up the simulation. Specifically, the most commonly used simulation codes, AMBER [72], CHARMM [73], GROMACS [74], and NAMD [75], run in parallel via the message passing interface (MPI). MPI is very suitable for reducing the computation time when many computer cores are used at the same time. With the aim of exploiting the locality of interactions, the system is distributed among the processors. This scheme is termed spatial decomposition, and each processor handles the simulation of only a small part of the system. Each processor is responsible for simulating a region of space, independently of the total number of particles, which leads to the most profitable partition when the simulation is divided according to the position in space of the particles of the studied system. Additionally, the processors do not share information with each other, except when they simulate neighboring regions of the system [76]. A breakthrough in simulation codes is the use of accelerators such as GPUs, which represent a great technological advance in performing atomistic MD calculations. So far, most of the important MD codes have been adapted to GPUs, while some MD codes have been constructed especially for use on GPUs (ACEMD [77]). In order to achieve high performance, MD simulations are run on GPUs, sometimes combined with MPI. In closing, the use of high-performance computing (HPC) in the natural and life sciences is continuously growing. The increasing power and sophistication of GPUs improve the performance of the simulations and lead to more accurate results.
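The idea behind spatial decomposition can be illustrated with a small Python sketch. It is a conceptual toy, not the implementation used by AMBER, CHARMM, GROMACS, or NAMD: a cubic periodic box is assumed, the box is cut into equal cubic cells, each cell stands for the region owned by one processor, and the function names are hypothetical.

```python
from itertools import product


def assign_to_domains(positions, box_length, cells_per_side):
    """Map each particle index to the cubic cell (one per processor) that contains it."""
    cell_size = box_length / cells_per_side
    domains = {}
    for index, position in enumerate(positions):
        cell = tuple(
            min(int((coord % box_length) / cell_size), cells_per_side - 1)
            for coord in position
        )
        domains.setdefault(cell, []).append(index)
    return domains


def neighbor_domains(cell, cells_per_side):
    """Cells adjacent to `cell` under periodic boundaries; only these exchange data with it."""
    neighbors = set()
    for shift in product((-1, 0, 1), repeat=3):
        neighbors.add(tuple((c + s) % cells_per_side for c, s in zip(cell, shift)))
    neighbors.discard(cell)
    return neighbors
```

In this picture, the work per processor depends on how many particles fall in its cell rather than on the total number of particles, and communication is limited to the neighboring cells.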

2.5. QM/MM and QM/MM/MD Approaches

During a chemical process, the electronic structures of the involved species can change; for instance, bonds are broken or formed. As a result, the electronic motion has to be included, i.e., a quantum mechanical description is required. For instance, for the study of a chemical reaction in a solvent, a QM methodology has been proposed for the molecular species, while the solvent is included via a dielectric constant, i.e., it is modeled as a homogeneous polarizable medium [78,79,80,81]. However, as the size of the employed QM system is increased, the selection of the dielectric constant becomes less significant, while the choice of the correct dielectric constant is not trivial [80,81]. Note that all important energetic components, i.e., solvation and dispersion, and all structural components that are involved in a direct or indirect way need to be included [82]. Finally, it is known that some of the solvent molecules that interact with the studied molecular system must be treated explicitly, i.e., they should be included in the QM calculation [83].

Nonetheless, this approach is attainable only for systems with no more than several hundred atoms. For larger systems, the only solution is a multiscaling approach, i.e., a combined QM/MM approach, where QM is used to treat the main part of the system, while classical force-field methods are usually employed for the rest [84,85,86,87,88]. Here, the most crucial task is to have an efficient interface between QM and MM, where four important features must be taken into consideration: (i) the partitioning of the system into QM and MM parts, (ii) how the interaction between MM and QM is dealt with, (iii) how the covalent bonds between atoms at the QM/MM boundary are treated, and (iv) how the total energy is computed [82,89]. Finally, it is important to incorporate the dynamics of the system. For instance, it can be included via an MD approach, which calculates time averages of equilibrium properties. Note that simulations are usually at least 10 times longer than the slowest studied natural process [89,90,91,92,93]. Additionally, two other crucial aspects arise, specifically the simulation protocol and the splitting of the system into MM and QM regions, which is kept fixed during the simulation. Below, the four essential aspects that must be considered for an efficient boundary between the two types of regions are analyzed.

Partitioning: For the study of a chemical reaction in solution, the partitioning of the system into QM and MM parts and its drawbacks are explained here. For the QM system, there are two alternatives: (i) only the solute molecules are included (this approach has been discussed above), or (ii) the solutes and the nearest solvent molecules are included. As mentioned, the latter choice is better, but it raises some issues: the solvent molecules that neighbor the solute and are treated at the QM level at the start of the simulation are replaced by MM solvent molecules during the MD simulation [94], and the solute–solvent interactions are then not described accurately. Thus, the QM solvent molecules need to be kept near the solute. A simple treatment of this solvent exchange issue is to update which solvent molecules are treated as QM or MM according to their position relative to the solute molecules. However, this treatment results in spatial and temporal discontinuities. The former result from the artificial boundary between the two regions, QM and MM. Note that the solvent properties differ at the QM and MM levels, and this results in instability of the MD simulation. Thus, two different approaches are employed to solve these issues, namely, the constrained and the adaptive QM/MM [82,94]. The constrained QM/MM approaches, i.e., BCC, BEST, and FIRES, derive from a single QM/MM partitioning scheme [95,96,97]. The boundary between the two regions is closed and the QM solvent molecules stay within the QM region during the simulations, but the resulting dynamics are not realistic. Thus, the constrained approaches are used only to reproduce equilibrium properties and cannot be employed for the study of reactions or diffusion dynamics. On the contrary, adaptive QM/MM approaches are open-boundary, i.e., they permit the smooth exchange of solvent molecules between the two regions depending on their distance from the solutes. 
Thus, they can be used to study both equilibrium properties and dynamics [98,99,100,101]. Finally, it should be noted that the adaptive QM/MM methodologies are usually formulated on multi-partitioning schemes, i.e., different partitioning schemes are considered [100,101].
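The distance-based assignment of solvent molecules that underlies the simple update scheme and the adaptive approaches can be sketched as follows. This Python fragment is a hypothetical illustration and does not reproduce any of the cited methods: the radii r_qm and r_buffer, the intermediate "buffer" label (a transition zone of the kind adaptive schemes use to smooth the exchange), and the function name are assumptions made here.

```python
import math


def classify_solvent(solute_center, solvent_centers, r_qm, r_buffer):
    """Label each solvent molecule by its distance from the solute center."""
    if r_buffer < r_qm:
        raise ValueError("r_buffer must be at least r_qm")
    labels = []
    for center in solvent_centers:
        distance = math.dist(solute_center, center)
        if distance <= r_qm:
            labels.append("QM")        # treated quantum-mechanically
        elif distance <= r_buffer:
            labels.append("buffer")    # transition zone between the two descriptions
        else:
            labels.append("MM")        # treated with the force field
    return labels
```

Re-evaluating these labels at every step with a sharp cutoff (r_buffer equal to r_qm) corresponds to the abrupt update that causes the discontinuities described above; a finite buffer is one way to make the exchange gradual.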

Interaction between QM and MM regions: There are three ways to approach the electrostatic interactions between these two regions: (i) electrostatic embedding, which allows the polarization of the QM region; (ii) mechanical embedding, which is less accurate than the first one and treats the atoms in the QM region as point charges, bond dipoles, or higher multipoles; and (iii) polarizable embedding, which considers the polarization of the MM part in response to the charge distribution [102].

Crossing of the covalent bonds connecting atoms at QM/MM boundaries: There are various opinions and options. For a covalent bond that crosses the boundary between the two regions, link atoms, pseudoatoms, or localized orbitals are introduced [1,102,103,104].

Total energy: It can be calculated using an additive or a subtractive QM/MM coupling [105,106]. (i) Additive QM/MM approach: The QM system is embedded within the MM one. The total energy E is E = E_QM + E_MM + E_QM/MM. Here, E_QM refers to the energy of the QM region, E_MM to the energy of the MM region, while E_QM/MM corresponds to the interaction between the QM and MM subsystems and contains the bonded interactions (QM/MM coupling terms). (ii) Subtractive QM/MM approach: Here, an extrapolation from the QM part to the whole system is carried out. The total energy E is E = E_QM^(QM) + E_QM/MM^(MM) − E_QM^(MM), where E_QM^(QM) is the energy of the QM region computed at the QM level, E_QM^(MM) is the energy of the QM region computed at the MM level, and E_QM/MM^(MM) is the MM energy of the whole system.
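The two coupling schemes differ only in how already-computed energies are combined, which the following minimal Python sketch makes explicit. The function and argument names are hypothetical, and the component energies are assumed to come from separate QM and MM calculations in consistent units.

```python
def additive_energy(e_qm, e_mm, e_qm_mm):
    """Additive scheme: E = E_QM + E_MM + E_QM/MM."""
    return e_qm + e_mm + e_qm_mm


def subtractive_energy(e_qm_region_at_qm, e_whole_system_at_mm, e_qm_region_at_mm):
    """Subtractive scheme: QM energy of the QM region plus MM energy of the
    whole system, minus the MM energy of the QM region (to avoid double counting)."""
    return e_qm_region_at_qm + e_whole_system_at_mm - e_qm_region_at_mm
```

In the subtractive form, no explicit QM/MM coupling term appears: the interaction between the two regions enters only through the MM energy of the whole system.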

In summary, the combination of the QM/MM methodology with direct MD simulation is a robust tool for studying drug delivery, the mechanisms of chemical reactions in a complex environment, properties of molecular devices, organic electronics, etc. [59,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114]. A detailed computational protocol for the multiscaling modeling of enzymes is given in [82]. Finally, it should be mentioned that the growing computing power and capacity allow the study of the properties of very large and complicated systems and the explanation of many complicated processes.

2.6. Computational Times of Methodologies

Generally, the QM methodologies can be very accurate for small systems. However, they are computationally expensive, and as the size of the system increases, their computational time increases sharply. On the contrary, the MM methods are much faster, but they suffer from several limitations, such as their requirement for extensive parameterization and the fact that the calculated energies are not very accurate. The QM/MM approach constitutes a new class of efficient methodologies that combines the good points of both, i.e., the accuracy of QM and the speed of MM calculations. The most important advantage of the hybrid QM/MM method is its speed. The cost of classical MM in the most straightforward case scales as O(N²), where N is the number of atoms in the system, meaning that a system with twice as many atoms would take four times as much computing power. This is mainly due to the electrostatic interaction term. Moreover, via the use of a cutoff radius, periodic pair-list updates, and variations of the Particle Mesh Ewald (PME) method, the computational time ranges from O(N) to O(N²). On the other hand, the simplest ab initio calculations typically scale as O(N³), while the accurate coupled cluster singles + doubles + perturbative triples, RCCSD(T), methodology scales up to O(N⁷) [115,116]. To overcome this limit, a small part of the system is treated quantum-mechanically using a cheap QM methodology such as DFT, and the remaining system is treated classically. In more sophisticated implementations, QM/MM methods treat quantum-mechanically both the electronic states and the light nuclei that are susceptible to quantum effects (such as hydrogens). This allows the generation of hydrogen wave functions. This methodology has been useful in investigating phenomena such as hydrogen tunneling [116].
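The practical meaning of these scaling laws is easiest to see by asking how much more expensive a calculation becomes when the system grows by a given factor. The short Python sketch below does this arithmetic for the formal exponents quoted above; it deliberately ignores prefactors and implementation details, and the names are illustrative only.

```python
# Formal scaling exponents quoted in the text (prefactors are ignored).
SCALING_EXPONENTS = {
    "MM, straightforward": 2,
    "simplest ab initio": 3,
    "RCCSD(T)": 7,
}


def cost_ratio(size_factor, exponent):
    """Relative cost when the number of atoms is multiplied by size_factor."""
    return size_factor ** exponent


def doubling_costs():
    """Cost increase of each method when the system size is doubled."""
    return {name: cost_ratio(2, p) for name, p in SCALING_EXPONENTS.items()}
```

Doubling the system thus multiplies the formal cost by 4 for straightforward MM, by 8 for an O(N³) method, and by 128 for an O(N⁷) method, which is why only a small region is usually treated at the QM level.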

3. Metalloproteins

Metalloproteins are proteins that have a metal ion cofactor [117]. They can be found in many living species. It is estimated that half of all known proteins contain a metal, while the metal plays a decisive role in the function of some of them [118,119]. Metalloproteins have a variety of functions, e.g., they store and transport elements that are significant for the life of a cell, or they transport even larger molecules. One of their most important functions is the catalysis of various chemical reactions that occur in the cellular environment [117,120]. The most common metal elements found in the metalloproteins of the human body are Fe, Mn, Zn, Co, Ni, and Cu, and they are considered to be of vital importance. However, the metals are not always part of the active center of the protein, nor do they always assist the processes in which the protein is involved; in such cases, they are simply carried and transported by the protein [121].

There are two major groups of reactions related to metalloproteins. First, there are the reactions that lead to the formation of metalloproteins. This group seems almost too complicated to be studied by an MM/MD simulation, and there is limited literature on this topic. The second group, which is more often studied via MM/MD simulations, includes reactions in which the metalloprotein acts as a reactant or a catalyst. The increase in computational capacity and the theoretical development of simulation approaches, in conjunction with experimental data (crystallography), have further clarified the way in which metal clusters are assembled or inserted into target proteins. Additionally, the catalytic pathways of a range of complex chemical reactions catalyzed by metalloproteins have been clarified and explained [122].

The computational characterization of metalloproteins can be an exceptionally difficult task. The presence of metal cations is responsible for strong Coulomb forces that act on the charged amino acids and the rest of the molecule. Proteins respond dramatically to the insertion or extraction of metal cations: significant conformational modifications are observed, and even aggregation occurs. Metals with partially occupied d atomic orbitals favor specific coordination geometries. The geometry of the whole molecule and the dynamics of the surroundings may or may not favor these coordination modes. Deviations from the desired geometry decrease the protein-metal binding affinity. Note that the electronic structure of a metal is directly affected by its surroundings; the electronic configuration of the metal depends on its ligands. Thus, the electronic structure of the metal and the geometry of the molecule are strongly related to each other, and any modification of one causes changes in the other [123,124,125,126,127,128,129,130,131,132].

In this review, we focus on important computational studies, using different approaches, that have been conducted for some vital metalloproteins, while particular attention is given to nitrogenase and its FeMo cofactor.

3.1. Reactions of Metalloproteins

The DFT approach is usually employed for a quantitative estimation of the complexation energies of several transition metal cations. The selectivity of metal-binding sites is investigated by calculating the interaction energies between the cations and their environment. Simple molecules with the general formula [MX_n]^a+ (where the X_i are simplified ligands representing the protein environment) are studied, and the complexation energies of the transition metal ions are evaluated. When small and large representations of the metal-binding sites, i.e., small and large ligands, are compatible with each other, useful information on reactions in even bigger systems is provided; see, for instance, [124,125].

The effect of specific groups or bonds on the properties and functions of proteins is also studied by using simplified ligands. For example, in the case of oxymyoglobin, which is a single-chain globular protein, the effect of hydrogen bonding on the Mössbauer spectroscopic properties has been studied for various active site models [125]. A porphyrin is used to represent the heme group, and it is found that the H-bond between a His residue and the diatomic O₂ enhances the binding of oxygen in the active center of the protein [125].

It should be noted that the metal centers of metalloproteins present versatile chemical reactivity. Single-molecule atomic force microscopy (AFM) induces partial unfolding and exposes the metal centers. Rubredoxin is the first metalloprotein that has been studied in detail via single-molecule AFM. QM/MD calculations on rubredoxin described in detail its unfolding and the breaking mechanism of the ferric–thiolate bonds in different solvent conditions [132].

The QM/DMD (discrete MD) approach works by iterating between QM and DMD [126,127]. DMD is a simplified MD, where discrete step-function potentials are employed in place of the continuous potentials that are employed in common MD. Thus, the ballistic equations of motion are solved only for the species participating in a collision. Overall, QM/DMD predicts the structures of metalloproteins in agreement with X-ray experiments, as well as specific structural details, such as the lengths of weak hydrogen bonds and their variations upon mutations in the protein. The method can also return the protein structure to equilibrium after a mild distortion, because the combined potential energy function reaches its minimum at the native structure [123]. Up to now, it has been successfully used for the study of the function of the ARD (acireductone dioxygenase) enzyme, which catalyzes two different oxidation reactions, depending only on which ion is bound to the protein, Fe²⁺ or Ni²⁺. The interconversion between Fe²⁺-ARD and Ni²⁺-ARD is simple. The two forms of ARD were found to have different functions, and the QM/DMD approach was an ideal methodology for the study of this interconversion [127]. Additionally, it has also been successfully used in the modeling of the ion exchange, Ca²⁺ versus Mg²⁺, in the catechol-O-methyl transferase (COMT) enzyme, and in the modeling of the Fe-S electron-transporting protein rubredoxin and several of its mutants [123].
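To illustrate what a discrete step-function potential looks like in contrast to a continuous one, a square-well pair potential can be written in a few lines of Python. This is a generic textbook form chosen here for illustration, with hypothetical parameter names; it is not the potential set used in the cited QM/DMD work.

```python
import math


def square_well(r, hard_core, well_edge, depth):
    """Step-function pair potential: infinite wall, flat attractive well, then zero."""
    if r < hard_core:
        return math.inf    # particles cannot overlap
    if r < well_edge:
        return -depth      # constant attraction inside the well
    return 0.0             # no interaction beyond the well
```

Because the potential is piecewise constant, no force acts between the steps and particles move ballistically; velocities change only when a pair reaches one of the step distances, which is the sense in which the equations of motion are solved only at collisions.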

Furthermore, in some proteins, the metal replacement can result in large-scale changes in geometry, protein motions, and repacking, as in the case of the COMT enzyme. COMT is an enzyme involved in the physiology of pain. It has a Mg²⁺ cation, which can be exchanged with a variety of cations. This replacement results in significant alterations in the structure and the activity of the enzyme: it influences the catalytic function, suppresses it, or even leads to inhibition of the enzyme. The inhibition was found to be a simple geometric effect. Multiscaling calculations explain all mechanistic paths [127].

The metal-MFCC approach, namely metal molecular fractionation with conjugate caps, has been developed for the efficient linear-scaling QM calculation of the potential energy and the atomic forces of metalloproteins. The potential energy of the protein is computed as a linear combination of (i) the potential energies of the neighboring residues, (ii) the two-body interaction energies between non-adjacent residues that are closely located, and (iii) the potential energy of the metal-binding group. Each individual fragment in metal-MFCC can be calculated independently, which makes the approach suitable for massively parallel computations. As the size of the studied system is increased, the computational cost of the QM calculation for the whole system increases rapidly, whereas the computational cost of the metal-MFCC method increases almost linearly. It has been found that metal-MFCC is in good agreement with the full QM approach [128].

Recently, in 2021, multiscale quantum refinement methods, which combine several multiscale computational schemes with experimental data obtained from X-ray diffraction (XRD), were developed for metalloproteins. Different ONIOM combinations of QM, SE, and MM methodologies were used to check the performance and reliability of the refined local structure in two specific metalloproteins. It was found that the ONIOM (QM/SE/MM) approach gave good results at low computational cost compared to the more expensive QM/SE approach [129]. This approach takes advantage of different flexible ONIOM schemes and experimental (XRD) information: the demanding transition-metal binding site is described with an efficient and accurate QM method, while the remaining system and its interactions are approximated by much faster low-level computational methods. Thus, the QM/SE/MM approach was proposed as a very good choice for the highly efficient computation of the metal-binding site(s) in metalloproteins.

The gallium cation, Ga³⁺, can mimic the ferric ion, Fe³⁺, and as a result it interferes with some processes in which ferric cofactors are required. Thus, Ga³⁺ salts are used to fight various types of cancer and infectious and inflammatory diseases. However, the two cations present some differences; for instance, the Ga³⁺ ion cannot participate in redox reactions, and it has a different ability to deprotonate the bound water in aqua complexes. In summary, Ga³⁺ and Fe³⁺ are distinguishable in some biological processes. The interactions of the cations with protein ligands play a key role in their competition. These systems have been calculated via DFT, while the surroundings were represented by an effective dielectric constant. The DFT results explain and confirm the experimental findings, and they lead to significant conclusions regarding the dependence of the binding affinity of the cations on the pH and the environment [130].

The electron-transfer rates and the electronic-coupling interactions in proteins have been calculated for a series of ruthenated azurins and compared with the available experimental data [131]. The DFT data are in good agreement with the experimental ones. The conformers with the strongest electronic coupling dominate the electron-transfer rate, while the averaging over all thermally accessible conformers of the protein and of the redox cofactors is crucial. It is concluded that the calculated electronic coupling values reproduce the coupling-limited experimental rates when the rates are averaged over ligand-field states and thermally accessible geometries [131].

Many studies on the use of MD and QM/MM in metalloproteins have been conducted. For instance, a combination of docking, QM/MM methods, and MD simulation has been used for the estimation of the binding affinity of metalloprotein ligands [133]. Additionally, heme-containing proteins, due to their physiological importance, have been extensively characterized by computational methods and were the first protein class to be studied by MD simulations, with Karplus’s work on myoglobin [134]. QM/MM calculations with DFT have been carried out to account for protein effects on the EPR and optical spectra of metalloproteins, with plastocyanin as a case study [135]. The QM/MM method has also been used to study human deacetylases, metalloproteins that are targets for a variety of medical conditions, including neurodegenerative diseases and HIV infection. The method has also proved capable of describing the kinetic differences associated with replacing Zn²⁺ with other metal cofactors [136]. In another case, the key step in the reaction mechanism of multicopper oxidases, the cleavage of the O–O bond in O₂, has been investigated using QM/MM methods [137]. In general, enzymatic reactions have been the primary target of QM/MM studies, and the examples of chorismate mutase and cytochrome P450 have been highlighted. Chorismate mutase catalyzes the Claisen rearrangement of chorismate to prephenate, a key step of the shikimate pathway for the synthesis of aromatic amino acids in plants, fungi, and bacteria. On the other hand, cytochrome P450 enzymes are monooxygenases that perform a variety of essential functions, such as detoxification and biosynthesis, in nearly all living species, and they catalyze many types of reactions [138]. QM/MM reaction pathway analysis has provided detailed insight into the chemistry of glutathione S-transferase and can be used to obtain mechanistic insight into the effects of specific mutations on this catalytic process [139]. 
A QM/MM modification of the Linear Response method was developed and used to distinguish ligand affinities for closely related metalloproteins. The level of precision achieved makes the approach a useful tool for the design of ligands that are selective among similar targets, as the results can be extrapolated to maximize selectivity [140]. A QM/MM study of the formation of the elusive active species Compound I of nitric oxide synthase from the oxyferrous intermediate showed that two protons should be provided to produce a reaction that is reasonably exothermic and that leads to the appearance of a radical on the tetrahydrobiopterin cofactor [141]. QM/MM calculations have also been employed to investigate the role of hydrogen bonding and π-stacking in single- and double-stranded DNA oligonucleotides [142]. MD simulations of metalloproteins were also carried out in a folding study of rubredoxin from Pyrococcus furiosus [143].

3.2. Nitrogenase and FeMo Cofactor

3.2.1. General about Nitrogenase—Structure

Nitrogenase is one of the most fascinating natural metalloenzymes. It is produced by certain prokaryotes, such as cyanobacteria, and it is essential for all living beings. Nitrogenase catalyzes an essential step in nitrogen fixation, where the reduction of N₂ to NH₃ occurs through a complex, multistage reaction [144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162,163,164,165,166,167,168,169,170,171,172,173,174,175,176,177,178]. Ammonia is vital for all species because of its essential role in the synthesis of biomolecules such as nucleotides and amino acids. Although N₂ is abundant in the earth’s atmosphere, it is essentially inert at room temperature without a suitable catalyst, which explains the vital role of nitrogenase. As a result, the scientific community is highly interested in studying this reaction thoroughly, both through experiments and simulations. It is known that the Nif genes or their homologs carry the information for the correct formation of nitrogenase [144,145].

Regarding the structure of this molecular system, see Figure 1, it contains two metalloproteins. The first is the homodimeric iron (Fe-) protein, which is a strong reductase and is responsible for the supply of electrons. It is a dimer of two identical subunits, which are connected through two covalent bonds with one [Fe₄S₄] cluster [146]. The (Fe-) protein is responsible for the electron transfer from a reducing agent, such as ferredoxin or flavodoxin, to the nitrogenase (MoFe-) protein. This transfer demands an input of chemical energy, which is provided by the binding and hydrolysis of ATP. The hydrolysis of ATP causes a conformational change within the whole complex, which brings the two main metalloproteins closer together so that the electron transfer occurs more easily [147].

The second part of the nitrogenase complex is the heterotetrameric α₂β₂, or heterodimeric (αβ)₂, molybdenum-iron (MoFe-) protein, where electrons are used for the conversion of N₂ to NH₃. It consists of two α and two β subunits [146]. The (MoFe-) protein contains two identical iron-sulfur [8Fe-7S] clusters, namely the P-clusters. They are located at the interface between the α and β subunits, in contrast to the other characteristic clusters, the two FeMo cofactors (FeMoco), which are found within the α subunits. Both subunits are of similar size and are encoded by the nifD and nifK genes [148]. The Mo cation is considered to be Mo(III), contrary to the Mo(V) assignment that prevailed earlier [149]. The [Fe₈S₇] core of the P-cluster consists of two [Fe₄S₃] cubes linked by a central sulfur atom. The two P-clusters are connected via covalent bonds with the rest of the (MoFe-) protein through bridges that consist of six cysteine residues. Moving on to the two identical FeMo cofactors [MoFe₇S₉C], each contains two different clusters, i.e., [Fe₄S₃] and [MoFe₃S₃], which are linked by three sulfide ions. One cysteine and one histidine residue connect each FeMo cofactor with the α subunit through covalent bonds. Regarding the role of every part of the nitrogenase complex, the (Fe-) protein provides electrons that enter the P-clusters of the (MoFe-) protein. Then, they are transferred from the P-clusters to the FeMo cofactors, where the nitrogen fixation occurs and the dinitrogen binds in the central cavity of the FeMoco [144].

Some variations of this complex appear in nature. Two such alternative nitrogenases have been confirmed: the vanadium-iron type (VFe; Vnf) and the iron-iron type (FeFe; Anf), in which the (MoFe-) protein is replaced. They have two α, two β, and two δ or γ subunits instead of the (αβ)₂ of the usual complex [150,151]. Nevertheless, molybdenum nitrogenase is the one that has been studied most extensively, because of its abundance relative to the others, and it is thus the best characterized [144].

Figure 1. Structure of Nitrogenase complex [152].

3.2.2. General Mechanism

As mentioned before, the reduction of N₂ to NH₃ requires a catalytic route because of the inertness of N₂. The required activation energy for the reduction is large (E_a = 230–420 kJ mol⁻¹), but the enthalpy is negative (ΔH° = −45.2 kJ mol⁻¹), which means that the overall reaction is thermodynamically favorable [153]. Both features are reflected in the industrial fixation of N₂ by the Haber-Bosch process, where this reduction takes place at temperatures ranging from 300 to 500 °C and pressures above 300 atm, and the presence of Fe-based catalysts is necessary [145].
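To give a feeling for what a barrier of 230–420 kJ mol⁻¹ implies, the following minimal Python sketch evaluates the Arrhenius exponential exp(−E_a/RT) and the rate gain obtained on heating. It is only an illustration under stated assumptions: a simple Arrhenius law with a temperature-independent pre-exponential factor, an arbitrarily chosen pair of temperatures (25 °C and 400 °C), and no modeling of the barrier lowering caused by a catalyst. The function names are hypothetical.

```python
import math

R_KJ = 8.314462618e-3  # molar gas constant in kJ mol^-1 K^-1

def arrhenius_exponential(ea_kj_mol: float, temp_k: float) -> float:
    """Return exp(-Ea/RT), the factor that scales the Arrhenius rate constant."""
    return math.exp(-ea_kj_mol / (R_KJ * temp_k))

def log10_rate_gain(ea_kj_mol: float, t_low_k: float, t_high_k: float) -> float:
    """log10 of k(T_high)/k(T_low), assuming an unchanged pre-exponential factor."""
    exponent = (ea_kj_mol / R_KJ) * (1.0 / t_low_k - 1.0 / t_high_k)
    return exponent / math.log(10.0)

for ea in (230.0, 420.0):  # barrier range quoted in the text, in kJ/mol
    # 298.15 K versus 673.15 K is an illustrative choice, not a value from the text
    gain = log10_rate_gain(ea, 298.15, 673.15)
    print(f"Ea = {ea:.0f} kJ/mol: rate grows by about 10^{gain:.0f} on heating")
```

Even with such a large gain on heating, the exponential at room temperature is vanishingly small, which is consistent with the need for both elevated temperatures and a catalyst in the industrial process.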

Turning to the reduction of the substrate by nitrogenase, three basic electron-transfer steps occur. First, the (Fe-) protein is reduced: electrons are transferred to it from electron carriers such as ferredoxin or flavodoxin in vivo, or dithionite in vitro. In the second step, single electrons are transferred from the (Fe-) protein to the (MoFe-) protein in an MgATP-dependent process, in which a minimum of two MgATP are hydrolyzed per electron. The last e⁻ transfer is to the substrate, which is almost certainly bound to the active site of the (MoFe-) protein [144]. The overall stoichiometry of N₂ reduction by nitrogenase has been established as [145]:

N₂ + 8 H⁺ + 16 MgATP + 8 e⁻ → 2 NH₃ + H₂ + 16 MgADP + 16 P_i

As the overall equation shows, nitrogenase also catalyzes the reduction of H⁺ to H₂ (which is necessary for the formation of NH₃) along with the reduction of dinitrogen to ammonia. Additionally, it catalyzes the reduction of other small unsaturated molecules such as azide, cyanide, and acetylene [154].
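The bookkeeping behind the overall equation can be checked with a short Python sketch. It verifies the atom and charge balance of the redox part, N₂ + 8 H⁺ + 8 e⁻ → 2 NH₃ + H₂, and recovers the 16 MgATP from the minimum of two MgATP per electron quoted above. The hydrolysis of MgATP itself (including water) is deliberately left out of the balance, and all names in the code are hypothetical.

```python
from collections import Counter

# Composition and charge of each species in the redox part of the equation
SPECIES = {
    "N2": (Counter(N=2), 0),
    "H+": (Counter(H=1), +1),
    "e-": (Counter(), -1),
    "NH3": (Counter(N=1, H=3), 0),
    "H2": (Counter(H=2), 0),
}

REACTANTS = {"N2": 1, "H+": 8, "e-": 8}
PRODUCTS = {"NH3": 2, "H2": 1}
ATP_PER_ELECTRON = 2  # minimal stoichiometry stated in the text

def totals(side):
    """Sum the atoms and the net charge of one side of the equation."""
    atoms, charge = Counter(), 0
    for name, coeff in side.items():
        composition, q = SPECIES[name]
        for element, n in composition.items():
            atoms[element] += coeff * n
        charge += coeff * q
    return atoms, charge

def is_balanced(reactants, products):
    """True if both sides carry the same atoms and the same net charge."""
    return totals(reactants) == totals(products)

def atp_required(n_electrons):
    """MgATP hydrolyzed for a given number of transferred electrons."""
    return ATP_PER_ELECTRON * n_electrons

print(is_balanced(REACTANTS, PRODUCTS), atp_required(REACTANTS["e-"]))
```

Of the eight electrons, six end up in the two NH₃ molecules and two in H₂, which is why the obligatory H₂ appears on the product side.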

The Lowe-Thorneley (LT) kinetic model is the one that has been established for the whole process; it was developed experimentally, see Figure 2. Eight H⁺ and eight e⁻ are transferred during the reaction [145,146,155,156]. Each intermediate stage is represented as E_n, n = 0–8, where n is the number of equivalents that have been transferred. N₂ binds to the complex at the stage E₄, where four equivalents have already been transferred [146]. However, N₂ sometimes binds to nitrogenase at the stage E₃.

This model was based on spectroscopic data that were collected throughout the process. The clarification of the mechanism is still an active area of research and a matter of debate in the scientific community. The E₀ state is the initial one, where the enzyme rests at equilibrium before the catalysis begins [157]. The reductions begin at the E₁ state, where an e⁻ is transferred to the (Fe-) protein, accompanied by a proton (H⁺). In the intermediate state E₂, the metal cluster is in its resting oxidation state, the two added e⁻ are stored in a bridging hydride, and the additional H⁺ is bonded to a sulfur atom. The E₃ state, the last one before the binding of dinitrogen to the complex, corresponds to the singly reduced FeMo cofactor with one bridging hydride and one H⁺. The E₄ state is considered to be a critical stage and lies in the middle of the catalytic cycle. It appears after the accumulation of four pairs of electrons and protons, and it is named the Janus intermediate because of its dynamic nature: the system can either decay back to E₀, discarding the pairs that were collected, or proceed with nitrogen binding and complete the catalytic cycle. In this state, the FeMo cofactor appears to be in its resting oxidation state with two bridging hydrides and two sulfur-bonded H⁺ [145].

Based on the above intermediate states, a dynamic equilibrium is proposed for the oxidation states of the metal cluster, in particular between its initial oxidation state and a singly reduced one, with the additional electrons stored in hydrides. Alternatively, it is considered that a hydride is formed in each step and that the metal cluster exists between the initial oxidation state and the singly oxidized one [145].

Moving on to the production of ammonia, two basic hypotheses exist for the pathway in the second half of the mechanism: the “distal” and the “alternating” pathway, cf. Figure 3. In the “distal” route, the dinitrogen is first hydrogenated on one nitrogen atom, leading to the release of ammonia, and then the second nitrogen, which is directly bound to the metal, is hydrogenated. In the “alternating” route, the two nitrogen atoms are hydrogenated alternately, and this pattern goes on until NH₃ has been released from both nitrogen atoms [144,158]. It has not yet been clarified which pathway actually operates. The answer lies in the isolation of the characteristic intermediates, such as the nitrido in the “distal” route and the diazene and hydrazine in the “alternating” route; however, this raises many further problems. The use of model complexes helps the isolation of intermediates, but the outcome depends on the metal center. When molybdenum model complexes are studied, the distal pathway predominates, whereas for iron model complexes the alternating pathway is preferred [145].
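The difference between the two routes can be made concrete by simply counting hydrogen atoms on the two nitrogen atoms. The Python sketch below is a bookkeeping illustration only, not a mechanistic model: it assumes six successive hydrogenation steps, that a nitrogen leaves as NH₃ as soon as it carries three H atoms, and that the alternating route starts on the distal nitrogen. The function name and the (distal, proximal) state tuples are hypothetical.

```python
def pathway(route):
    """Return (states, release_steps) for the 'distal' or 'alternating' route.

    Each state is a tuple (H on distal N, H on metal-bound proximal N).
    release_steps lists the hydrogenation steps after which an NH3 is complete.
    """
    h = {"distal": 0, "proximal": 0}
    states = [(0, 0)]
    release_steps = []
    for step in range(6):
        if route == "distal":
            # finish the distal nitrogen first, then the metal-bound one
            target = "distal" if h["distal"] < 3 else "proximal"
        elif route == "alternating":
            # hydrogenate the two nitrogen atoms in turn
            target = "distal" if step % 2 == 0 else "proximal"
        else:
            raise ValueError(f"unknown route: {route!r}")
        h[target] += 1
        states.append((h["distal"], h["proximal"]))
        if h[target] == 3:
            release_steps.append(step + 1)
    return states, release_steps

for route in ("distal", "alternating"):
    states, released = pathway(route)
    print(route, states, "NH3 complete after steps", released)
```

In this counting, the state (3, 0) of the distal route corresponds to the point where the first NH₃ leaves and a nitrido remains on the metal, whereas the states (1, 1) and (2, 2) of the alternating route correspond to the diazene and hydrazine levels, respectively.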

3.2.3. Calculations

Many calculations have been performed over the years on this complex system, with attention given to its catalytic role in the nitrogen fixation process. The constituent clusters and the cofactor have been studied and characterized independently, and there are also studies of the whole metalloenzyme complex. Here, a review of the calculations on the E_n states involved in the proposed mechanism is presented.

DFT calculations have been carried out for the FeMo cofactor [MoFe₇S₉C], including the 35 possible broken-symmetry (BS) states in the resting state, a reduced state, and a protonated state of the cofactor. The results show that the relative energies of the calculated states depend on their geometry, on the environment, i.e., the surrounding protein, and on the choice of methodology, i.e., DFT functional and basis set. Specifically, the basis set changes the relative energies of the states by up to 11 kJ/mol. The structure of the surrounding protein results in energy differences of up to 7 and 10 kJ/mol for the vdW and the electrostatic energy, respectively [159].

Single-point energy calculations on experimental geometries give energies similar to those calculated after geometry optimization, but for some BS states the two differ by up to 37 kJ/mol. Changing the functional from the pure TPSS to the hybrid B3LYP changes the energies by up to 58 kJ/mol, and the correlation between the two sets of results is weak (R² = 0.57–0.72). Nevertheless, both DFT functionals agree regarding the ground spin state and the reduced one. These results on the most stable states of the structure are useful for further, more accurate calculations on the mechanism of the catalysis [159].

Furthermore, in the above study, QM/MM calculations were carried out using the Amber ff14SB force field, with TIP3P for the water molecules, while geometry optimizations were performed with the TPSS-D3 method and the def2-SV(P) basis set. It was concluded that four of the Fe ions need to have the dominant α spin and three the opposite β spin in order to reach the experimentally observed quartet state of the cofactor, and in the asymmetric protein environment there are 35 different ways in which this can occur. Last but not least, it was found that the 3 to 6 BS states of the same C3-symmetry type have similar energies, which indicates that the protein has little influence on the relative energies of the BS states that are related by the approximate three-fold symmetry of the FeMo cofactor [159].
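The number 35 follows directly from counting the ways of choosing which three of the seven Fe ions carry the minority β spin. A minimal Python sketch of this enumeration is given below; the Fe indices 1–7 are generic labels introduced here for illustration and are not meant to reproduce the crystallographic numbering or the BS-state nomenclature of the cited study.

```python
from itertools import combinations

N_FE = 7     # Fe ions in the [MoFe7S9C] cofactor
N_BETA = 3   # Fe ions carrying the minority (beta) spin

def broken_symmetry_states(n_fe=N_FE, n_beta=N_BETA):
    """List every choice of beta-spin Fe sites as a tuple of 1-based indices."""
    return list(combinations(range(1, n_fe + 1), n_beta))

def spin_pattern(beta_sites, n_fe=N_FE):
    """Render one state as a string, 'a' for alpha and 'b' for beta spin."""
    return "".join("b" if i in beta_sites else "a" for i in range(1, n_fe + 1))

states = broken_symmetry_states()
print(len(states))              # number of distinct spin arrangements
print(spin_pattern(states[0]))  # first arrangement, beta spin on Fe 1-3
```

In an ideal three-fold symmetric cofactor several of these arrangements would be equivalent; they all become distinct only because the surrounding protein is asymmetric, as noted above.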

B. Benediktsson and R. Bjornsson carried out a series of calculations [160] in which the protein environment was taken into account. QM/MM methods were employed to study the MoFe protein and the FeMo cofactor. They concluded that [MoFe₇S₉C]¹⁻ is the only possible charge for the resting state. This charge of −1 is in complete agreement with recent spectroscopic [161] and other computational studies [162]. Among the different spin isomers considered, the one that agrees with the crystallographic Fe−Fe and Mo−Fe distances has Fe cations with spin directions that lead to a rare case of spin-coupling phenomena. According to this study, a proton is present on the alkoxide group of the Mo-bound homocitrate under resting-state conditions. This proton affects the nature of the redox states of FeMoco and also some substrate reduction mechanisms [160].

Regarding the mechanism and the reaction states, the combination of theoretical and experimental data indicates that E₁ is formed via an Fe-centered reduction combined with the protonation of a sulfide of the cluster [163]. An interesting feature of the works of Thorhallsson and Bjornsson [163,164] is that the theoretical approaches used for the subsequent E_n states (n = 1–8) are the same as those used for the E₀ state (the CHARMM36 force field for the MM level of theory and the TPSSh hybrid density functional for the QM level). Moving on with the mechanism, only the E₀ and E₁ states can be selectively populated under conditions in which H₂ production from the E₂ state is faster than the formation of E₂. Additionally, E₁ models with a protonated bridging sulfide are in full agreement with the EXAFS data and are therefore the most likely candidates to describe the E₁ state. Last but not least, only minor changes in the Mo-O, Mo-Fe, and Fe-Fe distances occur upon the first reduction, from the E₀ to the E₁ state [163].

A systematic theoretical study of the relative energies of the possible protonation states of the FeMo cluster in nitrogenase in the E₀–E₄ states has been performed via a QM/MM approach [165]. The resting state and the states with 1–4 electrons and protons added before N₂ binding were studied. In these calculations, the complete solvated heterotetrameric enzyme was included for more accurate results. Two dispersion-corrected functionals, B3LYP-D3 and TPSS-D3, were used with two basis sets, def2-SV(P) and def2-TZVPD; they led to different results for the E₂–E₄ states, in contrast to the E₀ and E₁ states. Specifically, TPSS-D3 favors hydride ions that bind to the Fe ions in the E₂–E₄ states and bridge two Fe ions. B3LYP-D3, on the other hand, predicts that one to three H⁺ cations are bound to the central carbide ion and that the most stable structures of the E₂, E₃ and E₄ states have the carbide ion doubly or triply protonated. Lastly, the most favorable protonation site in the E₁ state was found to be S2B [166].

The redistribution of electrons within the active site of the FeMo-co during the reductive elimination of H₂ that activates N₂ has also been calculated via QM/MM MD simulations. The nitrogen fixation process starts with the binding of N₂ to E₄ combined with the elimination of H₂ [166]. This loss cannot start in the absence of N₂ from the E₄(4H) state, despite the fact that it interconverts with E₄(H₂,2H), because the resulting high-energy E₄(2H)* state causes H₂ to rebind [166]. Additionally, it was observed that the Mo site does not participate in the electron redistribution as the reaction with N₂ begins, and it was found that a change in the valence electrons of Mo is unlikely to occur throughout the nitrogenase cycle. Finally, it was shown that the electron redistribution that accompanies hydride elimination and removal of H₂, on going from E₄(4H) to E₄(2H)*, activates one or both Fe cations to bind N₂ in the catalytically central H₂ complex, E₄(H₂,2H). Thus, the coupled removal of H₂ and reduction of N₂ is initiated [166].

The E₄ state, in particular, has attracted research interest. Possible models for this state of nitrogenase, and the ways in which N₂ can bind to some of these models, were calculated via QM/MM approaches. Calculations using the CHARMM36 force field for the MM part, combined with a recent ENDOR study, result in the most favored structure of the FeMo cofactor at the E₄ state, see Figure 4. However, further QM calculations using hybrid functionals (B3LYP, TPSSh, M06-2X and HF exchange) give higher energies for this structure than for all open-sulfide-bridge models, and this model has not been found to bind N₂, which remains an open question [164]. Thorhallsson et al. proposed a mechanism for the E₄ state, in which the function of various components of the nitrogenase cofactor is described. The size of the cofactor and the nature of the Fe–S bonds play a primary role. Moreover, the sulfide bridge between the cubanes increases the stability of the hydride. The molybdenum ion is likely to affect the redox potential of the cofactor, and it could be vital for further stabilization of the N₂-bound Fe(I) ion in E₄-l-N₂, which is formed after the reductive step, so that the N₂ ligand finds available e⁻ to assist its activation. Finally, it has been proposed that the H⁺ on the Mo-bound alcohol group of homocitrate is in the best position to protonate the N₂ ligand [164].

In 2020, Cao and Ryde [167] carried out a QM/MM study on the N₂-bound state of nitrogenase, assuming that N₂ is immediately protonated to a N₂H₂ state, so that the issue of locating the H⁺ cations in the cluster is avoided. The Amber ff14SB force field was used for the protein in the MM part, and the TIP3P model was chosen to describe the water molecules of the environment. The charges were obtained at the TPSS/def2-SV(P) level of theory, and the metal sites were treated with a non-bonded model. Studying both pathways, the alternating and the distal one (HNNH and NNH₂, respectively), it was found that the binding of N₂H₂ is mainly determined by interactions and steric clashes with the protein and not by the intrinsic preferences of the ligand and of the cluster. Regarding the energies of the calculated states, noticeable differences in the relative energies of the low-lying structures are observed when different functionals are used [167].

To conclude, many questions are still open, such as the exact way in which ligands are activated for protonation. Any additional experimental data on the E_n states would be very useful for comparison with the calculated data, to further restrict the mechanistic possibilities of the FeMo cofactor. Nonetheless, the published studies propose a pathway towards clarifying the mechanism of the catalytic role of nitrogenase [167,168,169,170,171,172,173,174,175,176,177,178].

4. Discussion and Conclusions

Multiscaling methodologies that combine the quantum mechanical description of specific interactions, for instance metal-ligand ones, with classical sampling of the entire system, for instance the protein structure, are promising and powerful tools for computational chemistry. The studied system is split into regions. The most important region, i.e., the one where the chemical process occurs, is calculated via a QM methodology, e.g., DFT or SE; the surroundings are treated with a less accurate method, e.g., SE or MM; and the environment is treated with an MM approach, via a dielectric constant for the solvent, or via MD simulations. In the last case, the trajectories of the particles of the studied system are predicted.
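How the energies of the regions are combined depends on the chosen coupling scheme. As one possible illustration, the Python sketch below assumes a two-region subtractive scheme, E = E_MM(whole system) + E_QM(inner region) − E_MM(inner region); this choice is an assumption made here and is not prescribed by the paragraph above. The energy functions are toy placeholders standing in for real QM and MM engines, and link atoms and electrostatic embedding are ignored. All names are hypothetical.

```python
from typing import Callable, Sequence, Tuple

Atom = Tuple[str, float, float, float]           # (element, x, y, z)
EnergyFn = Callable[[Sequence[Atom]], float]     # any routine returning an energy

def subtractive_qmmm_energy(system: Sequence[Atom],
                            qm_indices: Sequence[int],
                            e_qm: EnergyFn,
                            e_mm: EnergyFn) -> float:
    """E = E_MM(all atoms) + E_QM(inner region) - E_MM(inner region)."""
    inner = [system[i] for i in sorted(set(qm_indices))]
    return e_mm(system) + e_qm(inner) - e_mm(inner)

# Toy placeholder energies: a constant contribution per atom (illustration only)
def toy_mm(atoms: Sequence[Atom]) -> float:
    return -1.0 * len(atoms)

def toy_qm(atoms: Sequence[Atom]) -> float:
    return -1.5 * len(atoms)

system = [("Fe", 0.0, 0.0, 0.0), ("S", 2.3, 0.0, 0.0),
          ("C", 5.0, 0.0, 0.0), ("N", 6.5, 0.0, 0.0), ("O", 8.0, 0.0, 0.0)]
print(subtractive_qmmm_energy(system, qm_indices=[0, 1], e_qm=toy_qm, e_mm=toy_mm))
```

The MM description of the inner region is subtracted so that this region is not counted twice; in a production calculation the two callables would wrap actual electronic-structure and force-field codes.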

The QM methodology commonly used in the QM/MM and QM/MM/MD approaches is DFT, which can be applied to systems of up to a few hundred atoms. DFT is computationally cheap compared to ab initio methods, such as multi-reference and coupled-cluster approaches, while its accuracy is comparable to theirs, especially when the optimal functional has been used for a particular application [12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,178]. B3LYP is a commonly used functional that generally works well in many applications. For more demanding applications, there is a plethora of functionals as well as many published studies that can assist in the choice of the appropriate functional. Finally, efforts are being made to develop functionals that will be suitable for a wide range of applications [25,168]. When DFT is difficult to apply, SE methods are used. They are built on the HF formalism, but with various approximations and with the use of empirical data. They are valuable for studying electronic effects in large molecules of biological systems, and they can be applied successfully to complex systems [47,48,49,50,51,52,53,54]. When the surroundings consist of hundreds to thousands of atoms, QM (DFT or SE) calculations are not feasible; thus, the potential energy of the system is defined using a force field, where the electronic motions are ignored and the energy of the system is calculated as a function of the nuclear positions only. Finally, MD simulations are employed to simulate systems ranging from hundreds of atoms to macromolecules of biological interest such as ribosomes, nucleosomes, metalloproteins, etc., with up to 500,000 atoms. A dynamic model is built, for instance for proteins, where the internal motions and the subsequent conformational changes significantly affect their function [59].
Algorithms are developed to calculate the trajectories through a force field approach. There are two main approaches for MD simulations: (i) the atomistic representation, used for small systems, and (ii) the coarse-grained (CG) method, where molecules are represented by “pseudo-atoms” that approximate groups of atoms. While the first approach is more accurate, the second one is used for metalloproteins due to the size of the studied system. However, when the system is too large, e.g., liposomes whose radius is effectively infinite on the Å scale, planar bilayers can be used instead, and thus the system can be studied via atomistic MD simulations. Small liposomes, in contrast, can be fully treated with atomistic MD. Nevertheless, liposomes are generally better studied using CG models [110].
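As a minimal illustration of how a trajectory is obtained from a force field, the Python sketch below propagates the separation of a one-dimensional dimer with the velocity Verlet integrator and a Lennard-Jones pair potential. Everything here is an assumption chosen for brevity: reduced units (ε = σ = m = 1), a single degree of freedom, no thermostat, and hypothetical function names. Production MD codes treat many atoms in three dimensions with bonded and electrostatic terms as well.

```python
def lj_energy_force(r, epsilon=1.0, sigma=1.0):
    """Lennard-Jones pair energy U(r) and force -dU/dr in reduced units."""
    sr6 = (sigma / r) ** 6
    energy = 4.0 * epsilon * (sr6 * sr6 - sr6)
    force = 24.0 * epsilon * (2.0 * sr6 * sr6 - sr6) / r
    return energy, force

def velocity_verlet_dimer(r0, v0, dt, n_steps, mass=1.0):
    """Propagate separation r and velocity v; return the list of (r, v) pairs."""
    r, v = r0, v0
    _, f = lj_energy_force(r)
    trajectory = [(r, v)]
    for _ in range(n_steps):
        v_half = v + 0.5 * dt * f / mass   # half-step velocity update
        r = r + dt * v_half                # full-step position update
        _, f = lj_energy_force(r)          # force at the new position
        v = v_half + 0.5 * dt * f / mass   # complete the velocity update
        trajectory.append((r, v))
    return trajectory

def total_energy(r, v, mass=1.0):
    """Potential plus kinetic energy of the dimer coordinate."""
    potential, _ = lj_energy_force(r)
    return potential + 0.5 * mass * v * v

traj = velocity_verlet_dimer(r0=1.2, v0=0.0, dt=0.001, n_steps=5000)
drift = max(abs(total_energy(r, v) - total_energy(*traj[0])) for r, v in traj)
print(f"largest deviation of the total energy: {drift:.2e}")
```

The energy depends only on the nuclear coordinate r, as described above for force fields, and the small deviation of the total energy along the trajectory is the usual check that the time step is adequate.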

The computational study of metalloproteins and of the reactions in which they are involved can be a very difficult and demanding task. The presence of metal cations, which have different coordination numbers, empty or half-occupied d orbitals, and low-lying atomic excited states, further complicates the calculations. As a result, the insertion or removal of metal cations affects proteins: large conformational changes are caused, and even aggregates are formed. Thus, the study of chemical reactions of proteins, and specifically (i) the exact reaction mechanism/pathway and (ii) the evaluation of the properties of catalytic intermediates, are very active research topics.

5. Future Directions

The rapid increase in computer capabilities and storage, in conjunction with the development of theory and algorithms, increases the size of the molecular systems that can be calculated via multiscaling approaches. The largest ones require high-performance computing facilities and employ QM/MM and QM/MM/MD approaches, which have been developed not only for biomolecular systems but also for modeling a variety of complex systems, e.g., inorganic/organometallic, liquids, solid-state, etc., see for instance [179,180,181,182,183,184,185,186,187,188,189,190,191].

The forthcoming step in multiscaling approaches that will lead to an increase in the size of the studied molecular systems is the use of machine learning, a type of artificial intelligence in which computers learn without being explicitly programmed. It focuses on the development of codes that can change when exposed to new data. Over the decades, many simulations of biomolecular systems have been performed with the QM/MM approach. All these data can be used to train computers to learn the underlying patterns.

Finally, this review has highlighted some of the recent computational studies regarding metalloproteins, their reactions, and the interpretation of the mechanistic steps involved in the nitrogenase complex [117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162,163,164,165,166,167,168,169,170,171,172,173,174,175,176,177]. Up to now, significant progress has been made, and details of the mechanism have been provided; however, new data on each intermediate stage of the mechanism and on the excited states of the involved complexes are needed to further restrict the mechanistic possibilities of the FeMo cofactor. Additionally, many questions are still unanswered, such as the exact way in which N₂ is activated for protonation. Future progress promises to address many of these questions regarding the reaction mechanisms of metalloproteins.

Author Contributions

C.E.T. and M.A.M. have contributed equally to the present review. Investigation, C.E.T., M.A.M. and D.T.; Writing—Original Draft Preparation, C.E.T., M.A.M. and D.T.; Review and Editing, D.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

M.A.M. acknowledges the Bodossaki Institution for financial support. D.T. acknowledges the National and Kapodistrian University of Athens, Special Accounts for Research Grants, for supporting this research through the project “SONFM” (KE 17034).

Conflicts of Interest

The authors declare no conflict of interest.

References

Warshel, A.; Levitt, M. Theoretical studies of enzymic reactions: Dielectric, electrostatic and steric stabilization of the carbonium ion in the reaction of lysozyme. J. Mol. Biol. 1976, 103, 227–249. [Google Scholar] [CrossRef]
Field, M.J.; Bash, P.A.; Karplus, M. A combined quantum mechanical and molecular mechanical potential for molecular dynamics simulations. J. Comput. Chem. 1990, 11, 700–733. [Google Scholar] [CrossRef]
Senn, H.M.; Thiel, W. QM/MM methods for biomolecular systems. Angew. Chem. Int. Ed. Engl. 2009, 48, 1198–1229. [Google Scholar] [CrossRef]
Noorden, R.V. Modellers react to chemistry award. Nature. 2013, 502, 280. [Google Scholar] [CrossRef] [PubMed][Green Version]
Hohenberg, P.; Kohn, W. Inhomogeneous Electron Gas. Phys. Rev. 1964, 136, B864–B871. [Google Scholar] [CrossRef]
Kohn, W.; Sham, L.J. Self-Consistent Equations Including Exchange and Correlation Effects. Phys. Rev. 1965, 140, A1133–A1138. [Google Scholar] [CrossRef]
Kristyán, S.; Csonka, G.I. New development in RECEP (rapid estimation of correlation energy from partial charges) method. Chem. Phys. Lett. 1999, 307, 469–478. [Google Scholar] [CrossRef]
Kristyán, S. Immediate estimation of correlation energy for molecular systems from the partial charges on atoms in the molecule. Chem. Phys. 1997, 224, 33–51. [Google Scholar] [CrossRef]
Kristyán, S. Theory of variational calculation with a scaling correct moment functional to solve the electronic schrödinger equation directly for ground state one-electron density and electronic energy. Int. J. Quantum Chem. 2013, 113, 1479–1492. [Google Scholar] [CrossRef]
Vosko, S.H.; Wilk, L.; Nusair, M. Accurate spin-dependent electron liquid correlation energies for local spin density calculations: A critical analysis. Can. J. Phys. 1980, 58, 1200–1211. [Google Scholar] [CrossRef]
Perdew, J.P.; Wang, Y. Accurate and simple analytic representation of the electron-gas correlation energy. Phys. Rev. B 1992, 45, 13244–13249. [Google Scholar] [CrossRef] [PubMed]
Becke, A.D. Density-functional exchange-energy approximation with correct asymptotic behavior. Phys. Rev. A 1988, 38, 3098–3100. [Google Scholar] [CrossRef] [PubMed]
Lee, C.; Yang, W.; Parr, R.G. Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density. Phys. Rev. B 1988, 37, 785–789. [Google Scholar] [CrossRef]
Zhao, Y.; Truhlar, D.G. The M06 suite of density functionals for main group thermochemistry, thermochemical kinetics, noncovalent interactions, excited states, and transition elements: Two new functionals and systematic testing of four M06-class functionals and 12 other functionals. Theor. Chem. Acc. 2008, 120, 215–241. [Google Scholar] [CrossRef]
Becke, A.D. A new mixing of Hartree–Fock and local density-functional theories. J. Chem. Phys. 1993, 98, 1372–1377. [Google Scholar] [CrossRef]
Grimme, S.; Neese, F. Double-hybrid density functional theory for excited electronic states of molecules. J. Chem. Phys. 2007, 127, 154116. [Google Scholar] [CrossRef]
Vydrov, O.A.; Scuseria, G.E. Assessment of a long-range corrected hybrid functional. J. Chem. Phys. 2006, 125, 234109. [Google Scholar] [CrossRef]
Skone, J.H.; Govoni, M.; Galli, G. Nonempirical range-separated hybrid functionals for solids and molecules. Phys. Rev. B 2016, 93, 235106. [Google Scholar] [CrossRef]
Paier, J.; Janesko, B.G.; Henderson, T.M.; Scuseria, G.E.; Grüneis, A.; Kresse, G. Hybrid functionals including random phase approximation correlation and second-order screened exchange. J. Chem. Phys. 2010, 132, 094103. [Google Scholar] [CrossRef]
Zhou, C.; Zhang, Y.; Gong, X.; Ying, F.; Su, P.; Wu, W. Hamiltonian Matrix Correction Based Density Functional Valence Bond Method. J. Chem. Theory Comput. 2017, 13, 627–634. [Google Scholar] [CrossRef]
Te Vrugt, M.; Löwen, H.; Wittkowski, R. Classical dynamical density functional theory: From fundamentals to applications. Adv. Phys. 2020, 69, 121–247. [Google Scholar] [CrossRef]
Henderson, T.M.; Izmaylov, A.F.; Scalmani, G.; Scuseria, G.E. Can short-range hybrids describe long-range-dependent properties? J. Chem. Phys. 2009, 131, 044108. [Google Scholar] [CrossRef]
Yanai, T.; Tew, D.; Handy, N. A new hybrid exchange-correlation functional using the Coulomb-attenuating method (CAM-B3LYP). Chem. Phys. Lett. 2004, 393, 51–57. [Google Scholar] [CrossRef]
Chai, J.-D.; Head-Gordon, M. Long-range corrected hybrid density functionals with damped atom-atom dispersion corrections. Phys. Chem. Chem. Phys. 2008, 10, 6615–6620. [Google Scholar] [CrossRef]
Yu, H.S.; He, X.; Li, S.L.; Truhlar, D.G. MN15: A Kohn-Sham Global-Hybrid Exchange-Correlation Density Functional with Broad Accuracy for Multi-Reference and Single-Reference Systems and Noncovalent Interactions. Chem. Sci. 2016, 7, 5032–5051. [Google Scholar] [CrossRef]
Elliott, P.; Furche, F.; Burke, K. Excited States from Time-Dependent Density Functional Theory. Rev. Comp. Chem. 2008, 26, 91–165. [Google Scholar] [CrossRef]
Runge, E.; Gross, E.K.U. Density-Functional Theory for Time-Dependent Systems. Phys. Rev. Lett. 1984, 52, 997–1000. [Google Scholar] [CrossRef]
Tao, J.M.; Perdew, J.P.; Staroverov, V.N.; Scuseria, G.E. Climbing the density functional ladder: Nonempirical meta-generalized gradient approximation designed for molecules and solids. Phys. Rev. Lett. 2003, 91, 146401. [Google Scholar] [CrossRef]
Lingwood, M.; Hammond, J.R.; Hrovat, D.A.; Mayer, J.M.; Thatcher Borden, W. MPW1K Performs Much Better than B3LYP in DFT Calculations on Reactions that Proceed by Proton-Coupled Electron Transfer (PCET). J. Chem. Theory Comput. 2006, 2, 740–745. [Google Scholar] [CrossRef]
Cohen, A.J.; Mori-Sánchez, P.; Yang, W. Insights into Current Limitations of Density Functional Theory. Science 2008, 321, 792–794. [Google Scholar] [CrossRef]
Carpentieri, M.; Porro, L.; Del Re, G. Numerical studies for a theoretical analysis of semiempirical LCAO–CI methods. Int. J. Quantum Chem. 1968, 2, 807–824. [Google Scholar] [CrossRef]
Thiel, W. Semiempirical quantum–chemical methods. WIREs Comput. Mol. Sci. 2014, 4, 145–157. [Google Scholar] [CrossRef]
Hückel, E. Quantum contributions to the benzene problem. Z Phys. 1931, 70, 204–286. [Google Scholar] [CrossRef]
Hoffmann, R. An extended Hückel theory. I. Hydrocarbons. J. Chem. Phys. 1963, 39, 1397–1412. [Google Scholar] [CrossRef]
Pariser, R.; Parr, R.G. A semi-empirical theory of the electronic spectra and electronic structure of complex unsaturated molecules. J. Chem. Phys. 1953, 21, 466–471. [Google Scholar] [CrossRef]
Pople, J.A. Electron interaction in unsaturated hydrocarbons. Trans. Farad. Soc. 1953, 49, 1375–1385. [Google Scholar] [CrossRef]
Pople, J.A.; Santry, D.P.; Segal, G.A. Approximate Self-Consistent Molecular Orbital Theory. I. Invariant procedures. J. Chem. Phys. 1965, 43, S129–S135. [Google Scholar] [CrossRef]
Bingham, R.C.; Dewar, M.J.S.; Lo, D.H. Ground states of molecules. XXV. MINDO/3. Improved version of the MINDO semiempirical SCF-MO method. J. Am. Chem. Soc. 1975, 97, 1285–1293. [Google Scholar] [CrossRef]
Dewar, M.J.S.; Thiel, W. Ground states of molecules. 38. The MNDO method. Approximations and parameters. J. Am. Chem. Soc. 1977, 99, 4899–4907. [Google Scholar] [CrossRef]
Dewar, M.J.S.; Thiel, W. Ground states of molecules. 39. MNDO results for molecules containing hydrogen, carbon, nitrogen and oxygen. J. Am. Chem. Soc. 1977, 99, 4907–4917. [Google Scholar] [CrossRef]
Dewar, M.J.S.; Zoebisch, E.; Healy, E.F.; Stewart, J.J.P. Development and use of quantum mechanical molecular models. AM1: A new general purpose quantum mechanical molecular model. J. Am. Chem. Soc. 1985, 107, 3902–3909. [Google Scholar] [CrossRef]
Stewart, J.J.P. Optimization of parameters for semiempirical methods I. Method. J. Comput. Chem. 1989, 10, 209–220. [Google Scholar] [CrossRef]
Stewart, J.J.P. Optimization of parameters for semiempirical methods II. Applications. J. Comput. Chem. 1989, 10, 221–264. [Google Scholar] [CrossRef]
Stewart, J.J.P. Optimization of parameters for semiempirical methods V: Modification of NDDO approximations and application to 70 elements. J. Mol. Model. 2007, 13, 1173–1213. [Google Scholar] [CrossRef]
Stewart, J.J.P. Optimization of parameters for semiempirical methods VI: More modifications to the NDDO approximations and re-optimization of parameters. J. Mol. Model. 2013, 19, 1–32. [Google Scholar] [CrossRef]
Weber, W.; Thiel, W. Orthogonalization corrections for semiempirical methods. Theor. Chem. Acc. 2000, 103, 495–506. [Google Scholar] [CrossRef]
Thiel, W. Semiempirical methods: Current status and perspectives. Tetrahedron 1988, 44, 7393–7408. [Google Scholar] [CrossRef]
Stewart, J.J.P. Semiempirical Molecular orbital methods. Rev. Comput. Chem. 1990, 1, 45–81. [Google Scholar] [CrossRef]
Stewart, J.J.P. MOPAC: A semiempirical molecular orbital program. J. Comp-Aided Mol. Des. 1990, 4, 1–103. [Google Scholar] [CrossRef]
Thiel, W. Perspectives on semiempirical molecular orbital theory. Adv. Chem. Phys. 1996, 93, 703–757. [Google Scholar] [CrossRef]
Clark, T. Quo vadis semiempirical MO theory. J. Mol. Struct. (THEOCHEM) 2000, 530, 1–10. [Google Scholar] [CrossRef]
Thiel, W. Semiempirical methods. In Modern Methods and Algorithms of Quantum Chemistry; Grotendorst, J., Ed.; John von Neumann Institute for Computing: Jülich, Germany, 2000; Volume 3, pp. 261–283. ISBN 3-00-005834-6. [Google Scholar]
Bredow, T.; Jug, K. Theory and range of modern semiempirical molecular orbital methods. Theor. Chem. Acc. 2005, 113, 1–14. [Google Scholar] [CrossRef]
Thiel, W. Semiempirical quantum-chemical methods in computational chemistry. In Theory and Applications of Computational Chemistry: The First 40 Years; Dykstra, C.E., Kim, K.S., Frenking, G., Scuseria, G.E., Eds.; Elsevier B.V.: Amsterdam, The Netherlands, 2005; pp. 559–580. ISBN 9780080456249. [Google Scholar]
Leach, A.R. Molecular Modelling: Principles and Applications; Pearson Education: London, UK, 2001; ISBN 0-582-38210-6. [Google Scholar]
Cramer, C.J. Essentials of Computational Chemistry: Theories and Models; Wiley: Louisville, KY, USA, 2013; ISBN 978-0-470-09182-1. [Google Scholar]
Jensen, F. Introduction to Computational Chemistry, 3rd ed.; John Wiley & Sons: Louisville, KY, USA, 2006; ISBN 978-1-118-82599-0. [Google Scholar]
Jones, J.E.; Chapman, S. On the determination of molecular fields. From the variation of the viscosity of a gas with temperature. Proc. R. Soc. A 1924, 106, 441–462. [Google Scholar] [CrossRef]
Hospital, A.; Goñi, J.R.; Orozco, M.; Gelpí, J.L. Molecular dynamics simulations: Advances and applications. Adv. Appl. Bioinform. Chem. 2015, 8, 37–47. [Google Scholar] [CrossRef] [PubMed]
Kmiecik, S.; Gront, D.; Kolinski, M.; Wieteska, L.; Dawid, A.E.; Kolinski, A. Coarse-Grained Protein Models and Their Applications. Chem. Rev. 2016, 116, 7898–7936. [Google Scholar] [CrossRef]
Han, Y.; Jin, J.; Wagner, J.W.; Voth, G.A. Quantum theory of multiscale coarse-graining. J. Chem. Phys. 2018, 148, 102335. [Google Scholar] [CrossRef]
Tschöp, W.; Kremer, K.; Batoulis, J.; Bürger, T.; Hahn, O. Simulation of polymer melts. I. Coarse-graining procedure for polycarbonates. Acta Polym. 1998, 49, 61–74. [Google Scholar] [CrossRef]
Reith, D.; Pütz, M.; Müller-Plathe, F. Deriving effective mesoscale potentials from atomistic simulations. J. Comput. Chem. 2003, 24, 1624. [Google Scholar] [CrossRef]
Murtola, T.; Falck, E.; Patra, M.; Karttunen, M.; Vattulainen, I. Coarse-grained model for phospholipid/cholesterol bilayer. J. Chem. Phys. 2004, 121, 9156–9165. [Google Scholar] [CrossRef]
Izvekov, S.; Voth, G.A. A Multiscale Coarse-Graining Method for Biomolecular Systems. J. Phys. Chem. B 2005, 109, 2469–2473. [Google Scholar] [CrossRef] [PubMed]
Comba, P.; Remenyi, R. Inorganic and bioinorganic molecular mechanics modeling—The problem of the force field parameterization. Coord. Chem. Rev. 2003, 238–239, 9–20. [Google Scholar] [CrossRef]
Rueda, M.; Ferrer-Costa, C.; Meyer, T.; Pérez, A.; Camps, J.; Hospital, A.; Gelpí, J.L.; Orozco, M. A consensus view of protein dynamics. Proc. Natl. Acad. Sci. USA 2007, 104, 796–801. [Google Scholar] [CrossRef] [PubMed]
Perez, A.; Lankas, F.; Luque, F.J.; Orozco, M. Towards a molecular dynamics consensus view of B-DNA flexibility. Nucleic Acids Res. 2008, 36, 2379–2394. [Google Scholar] [CrossRef]
Tian, C. Improving the Accuracy of Amber Force Field for Biomolecular Simulation. Ph.D. Thesis, Stony Brook University, New York, NY, USA, November 2019. [Google Scholar]
Van Duin, A.C.T.; Dasgupta, S.; Lorant, F.; Goddard III, W.A. ReaxFF: A Reactive Force Field for Hydrocarbons. J. Phys. Chem. A 2001, 105, 9396–9409. [Google Scholar] [CrossRef]
Warshel, A.; Weiss, R.M. An Empirical Valence Bond Approach for Comparing Reactions in Solutions and in Enzymes. J. Am. Chem. Soc. 1980, 102, 6218–6226. [Google Scholar] [CrossRef]
Case, D.A.; Darden, T.A.; Cheatham, T.E., III; Simmerling, C.; Wang, J. AMBER 12; University of California: San Francisco, CA, USA, 2012. [Google Scholar]
Brooks, B.R.; Brooks, C.L., III; Mackerell, A.D., Jr.; Nilsson, L.; Petrella, R.J.; Roux, B.; Won, Y.; Archontis, G.; Bartels, C.; Boresch, S. CHARMM: The biomolecular simulation program. J. Comput. Chem. 2009, 30, 1545–1614. [Google Scholar] [CrossRef]
Hess, B.; Kutzner, C.; van der Spoel, D.; Lindahl, E. GROMACS 4: Algorithms for highly efficient, load-balanced, and scalable molecular simulation. J. Chem. Theory Comput. 2008, 4, 435–447. [Google Scholar] [CrossRef]
Nelson, M.T.; Humphrey, W.; Gursoy, A.; Dalke, A.; Kalé, L.V.; Skeel, R.D.; Schulten, K. NAMD: A parallel, object-oriented molecular dynamics program. Int. J. Supercomput. Appl. High Perform. Comput. 1996, 10, 251–268. [Google Scholar] [CrossRef]
Larsson, P.; Hess, B.; Lindahl, E. Algorithm improvements for molecular dynamics simulations. Wiley Interdiscip. Rev. Comput. Mol. Sci. 2011, 1, 93–108. [Google Scholar] [CrossRef]
Harvey, M.J.; Giupponi, G.; De Fabritiis, G. ACEMD: Accelerating biomolecular dynamics in the microsecond time scale. J. Chem. Theory Comput. 2009, 5, 1632–1639. [Google Scholar] [CrossRef] [PubMed]
Blomberg, M.R.A.; Borowski, T.; Himo, F.; Liao, R.-Z.; Siegbahn, P.E.M. Quantum chemical studies of mechanisms for metalloenzymes. Chem. Rev. 2014, 114, 3601–3658. [Google Scholar] [CrossRef] [PubMed]
Georgieva, P.; Himo, F. Quantum chemical modeling of enzymatic reactions: The case of histone lysine methyltransferase. J. Comput. Chem. 2010, 31, 1707–1714. [Google Scholar] [CrossRef] [PubMed]
Siegbahn, P.E.M.; Himo, F. The quantum chemical cluster approach for modeling enzyme reactions. WIREs Comput. Mol. Sci. 2011, 1, 323–336. [Google Scholar] [CrossRef]
Ramos, M.J.; Fernandes, P.A. Computational Enzymatic Catalysis. Acc. Chem. Res. 2008, 41, 689–698. [Google Scholar] [CrossRef] [PubMed]
Ahmadi, S.; Barrios Herrera, L.; Chehelamirani, M.; Hostaš, J.; Jalife, S.; Salahub, D.R. Multiscale modeling of enzymes: QM-cluster, QM/MM, and QM/MM/MD: A tutorial review. Int. J. Quantum Chem. 2018, 118, e25558. [Google Scholar] [CrossRef]
Tzeli, D.; Tsoungas, P.G.; Petsalakis, I.D.; Kozielewicz, P.; Zloh, M. Intramolecular Cyclization of β-Nitroso-o-Quinone Methides. A Theoretical Endoscopy of a Potentially Useful Innate “Reclusive” Reaction. Tetrahedron 2015, 71, 359–369. [Google Scholar] [CrossRef]
Dapprich, S.; Komaromi, I.; Byun, K.S.; Morokuma, K.; Frisch, M.J. A new ONIOM implementation in Gaussian98. Part I. The calculation of energies, gradients, vibrational frequencies and electric field derivatives. J. Mol. Struct. 1999, 461, 1–21. [Google Scholar] [CrossRef]
Vreven, T.; Morokuma, K. Chapter 3: Hybrid methods: ONIOM(QM:MM) and QM/MM. Annu. Rep. Comput. Chem. 2006, 2, 35–51. [Google Scholar] [CrossRef]
Vreven, T.; Morokuma, K.; Farkas, Ö.; Schlegel, H.B.; Frisch, M.J. Geometry optimization with QM/MM, ONIOM, and other combined methods. I. Microiterations and constraints. J. Comput. Chem. 2003, 24, 760–769. [Google Scholar] [CrossRef]
Tzeli, D.; Theodorakopoulos, G.; Petsalakis, I.D.; Ajami, D.; Rebek, J. Theoretical study of hydrogen bonding in homodimers and heterodimers of amide, boronic acid and carboxylic acid, free and in encapsulation complexes. J. Am. Chem. Soc. 2011, 133, 16977. [Google Scholar] [CrossRef] [PubMed]
Tzeli, D.; Theodorakopoulos, G.; Petsalakis, I.D.; Ajami, D.; Rebek, J., Jr. Conformations and Fluorescence of Encapsulated Stilbene. J. Am. Chem. Soc. 2012, 134, 4346–4354. [Google Scholar] [CrossRef] [PubMed]
Rahman, A. Correlations in the Motion of Atoms in Liquid Argon. Phys. Rev. 1964, 136, A405–A411. [Google Scholar] [CrossRef]
Howard, J. Mechanics of Motor Proteins and the Cytoskeleton; Sinauer Associates, Inc.: Oxford, UK, 2001; ISBN 0-87893-334-4. [Google Scholar]
Koehl, P.; Levitt, M. A brighter future for protein structure prediction. Nat. Struct. Biol. 1999, 6, 108–111. [Google Scholar] [CrossRef]
Zhou, Y.; Wang, S.; Li, Y.; Zhang, Y. Born–Oppenheimer Ab Initio QM/MM Molecular Dynamics Simulations of Enzyme Reactions. Methods Enzymol. 2016, 577, 105–118. [Google Scholar] [CrossRef]
Zuckerman, D.M. Equilibrium Sampling in Biomolecular Simulations. Annu. Rev. Biophys. 2011, 40, 41–62. [Google Scholar] [CrossRef]
Watanabe, H.C.; Cui, Q. Quantitative Analysis of QM/MM Boundary Artifacts and Correction in Adaptive QM/MM Simulations. J. Chem. Theory Comput. 2019, 15, 3917–3928. [Google Scholar] [CrossRef]
Shiga, M.; Masia, M. Boundary based on exchange symmetry theory for multilevel simulations. I. Basic theory. J. Chem. Phys. 2013, 139, 044120. [Google Scholar] [CrossRef]
Takahashi, H.; Kambe, H.; Morita, A. A simple and effective solution to the constrained QM/MM simulations. J. Chem. Phys. 2018, 148, 134119. [Google Scholar] [CrossRef]
Rowley, C.N.; Roux, B. The Solvation Structure of Na⁺ and K⁺ in Liquid Water Determined from High Level ab Initio Molecular Dynamics Simulations. J. Chem. Theory Comput. 2012, 8, 3526–3535. [Google Scholar] [CrossRef]
Heyden, A.; Lin, H.; Truhlar, D.G. Adaptive partitioning in combined quantum mechanical and molecular mechanical calculations of potential energy functions for multiscale simulations. J. Phys. Chem. B 2007, 111, 2231–2241. [Google Scholar] [CrossRef] [PubMed]
Takenaka, N.; Kitamura, Y.; Koyano, Y.; Nagaoka, M. The number-adaptive multiscale QM/MM molecular dynamics simulation: Application to liquid water. Chem. Phys. Lett. 2012, 524, 56–61. [Google Scholar] [CrossRef]
Watanabe, H.C.; Kubar, T.; Elstner, M. Size-Consistent Multipartitioning QM/MM: A Stable and Efficient Adaptive QM/MM Method. J. Chem. Theory Comput. 2014, 10, 4242–4252. [Google Scholar] [CrossRef]
Bernstein, N.; Varnai, C.; Solt, I.; Winfield, S.A.; Payne, M.C.; Simon, I.; Fuxreiter, M.; Csanyi, G. QM/MM simulation of liquid water with an adaptive quantum region. Phys. Chem. Chem. Phys. 2012, 14, 646–656. [Google Scholar] [CrossRef] [PubMed]
Zhang, R.; Lev, B.; Cuervo, J.E.; Noskov, S.Y.; Salahub, D.R. A guide to QM/MM methodology and applications. Adv. Quantum Chem. 2010, 59, 353–400. [Google Scholar] [CrossRef]
Cerqueira, N.M.F.S.A.; Moorthy, H.; Fernandes, P.A.; Ramos, M.J. The mechanism of the Ser-(cis)Ser-Lys catalytic triad of peptide amidases. Phys. Chem. Chem. Phys. 2017, 19, 12343–12354. [Google Scholar] [CrossRef]
Zhang, Y. Pseudobond ab initio QM/MM approach and its applications to enzyme reactions. Theor. Chem. Acc. 2006, 116, 43–50. [Google Scholar] [CrossRef]
Groenhof, G. Introduction to QM/MM Simulations. Methods Mol. Biol. 2013, 924, 43–46. [Google Scholar] [CrossRef]
Chung, L.W.; Hirao, H.; Li, X.; Morokuma, K. The ONIOM method: Its foundation and applications to metalloenzymes and photobiology. WIREs Comput. Mol. Sci. 2012, 2, 327–350. [Google Scholar] [CrossRef]
Villalobos, R.; Garcia, E.; Quintanar, D.; Young, P. Drug release from inert spherical matrix systems using Monte Carlo simulations. Curr. Drug Deliv. 2017, 14, 65–72. [Google Scholar] [CrossRef]
Ryde, U. QM/MM Calculations on Proteins. Methods Enzymol. 2016, 577, 119–158. [Google Scholar] [CrossRef] [PubMed]
Lopes, D.; Jakobtorweihen, S.; Nunes, C.; Sarmento, B.; Reis, S. Shedding light on the puzzle of drug-membrane interactions: Experimental techniques and molecular dynamics simulations. Prog. Lipid Res. 2017, 65, 24–44. [Google Scholar] [CrossRef] [PubMed]
Albano, J.M.R.; de Paula, E.; Pickholz, M. Molecular dynamics simulations to study drug delivery systems. In Molecular Dynamics; Vakhrushev, A., Ed.; IntechOpen: London, UK, 2018; p. 73. [Google Scholar] [CrossRef]
Sousa, S.F.; Ribeiro, A.J.M.; Neves, R.P.P.; Brás, N.F.; Cerqueira, N.M.F.S.A.; Fernandes, P.A.; Ramos, M.J. Application of quantum mechanics/molecular mechanics methods in the study of enzymatic reaction mechanisms. WIREs Comput. Mol. Sci. 2017, 7, e1281. [Google Scholar] [CrossRef]
Difley, S.; Wang, L.-P.; Yeganeh, S.; Yost, S.R.; Van Voorhis, T. Electronic Properties of Disordered Organic Semiconductors via QM/MM Simulations. Acc. Chem. Res. 2010, 43, 995–1004. [Google Scholar] [CrossRef]
Shen, L.; Yang, W. Molecular Dynamics Simulations with Quantum Mechanics/Molecular Mechanics and Adaptive Neural Networks. J. Chem. Theory Comput. 2018, 14, 1442–1455. [Google Scholar] [CrossRef]
Pokorná, P.; Kruse, H.; Krepl, M.; Šponer, J. QM/MM Calculations on Protein-RNA Complexes: Understanding Limitations of Classical MD Simulations and Search for Reliable Cost-Effective QM Methods. J. Chem. Theory Comput. 2018, 14, 5419–5433. [Google Scholar] [CrossRef]
Altoè, P.; Stenta, M.; Bottoni, A.; Garavelli, M. A tunable QM/MM approach to chemical reactivity, structure and physico-chemical properties prediction. Theor. Chem. Acc. 2007, 118, 219–240. [Google Scholar] [CrossRef]
Small, D.W. Remarkable Accuracy of an O(N⁶) Perturbative Correction to Opposite-Spin CCSD: Are Triples Necessary for Chemical Accuracy in Coupled Cluster? J. Chem. Theory Comput. 2020, 16, 4014–4020. [Google Scholar] [CrossRef]
Banci, L.; Sigel, A.; Sigel, H.; Sigel, R.K. (Eds.) Metallomics and the Cell; Metal Ions in Life Sciences; Springer: Berlin/Heidelberg, Germany, 2013; Volume 12, pp. 1–13. ISBN 978-94-007-5561-1. [Google Scholar] [CrossRef]
Thomson, A.J.; Gray, H.B. Bioinorganic chemistry. Curr. Opin. Chem. Biol. 1998, 2, 155–158. [Google Scholar] [CrossRef]
Waldron, K.J.; Robinson, N.J. How do bacterial cells ensure that metalloproteins get the correct metal? Nat. Rev. Microbiol. 2009, 7, 25–35. [Google Scholar] [CrossRef]
Carver, P.L. Metal Ions and Infectious Diseases. An Overview from the Clinic. In Interrelations between Essential Metal Ions and Human Diseases; Sigel, A., Sigel, H., Sigel, R.K., Eds.; Metal Ions in Life Sciences; Springer: Berlin/Heidelberg, Germany, 2013; Volume 13, pp. 1–28. ISBN 978-94-007-7499-5. [Google Scholar] [CrossRef]
Maret, W. Metalloproteomics, metalloproteomes, and the annotation of metalloproteins. Metallomics 2010, 2, 117–125. [Google Scholar] [CrossRef] [PubMed]
Finkelstein, J. Metalloproteins. Nature 2009, 460, 813. [Google Scholar] [CrossRef] [PubMed]
Sparta, M.; Shirvanyants, D.; Ding, F.; Dokholyan, N.V.; Alexandrova, A.N. Hybrid Dynamics Simulation Engine for Metalloproteins. Biophys. J. 2012, 103, 767–776. [Google Scholar] [CrossRef] [PubMed]
Rulíšek, L.; Havlas, Z. Using DFT Methods for the Prediction of the Structure and Energetics of Metal-Binding Sites in Metalloproteins. Int. J. Quantum Chem. 2003, 91, 504–510. [Google Scholar] [CrossRef]
Ling, Y.; Zhang, Y. Deciphering Structural Fingerprints for Metalloproteins with Quantum Chemical Calculations. Annu. Rep. Comput. Chem. 2010, 6, 65–77. [Google Scholar] [CrossRef]
Shirvanyants, D.; Ding, F.; Tsao, D.; Ramachandran, S.; Dokholyan, N.V. Discrete molecular dynamics: An efficient and versatile simulation method for fine protein characterization. J. Phys. Chem. B 2012, 116, 8375–8382. [Google Scholar] [CrossRef]
Nechay, M.R.; Valdez, C.E.; Alexandrova, A.N. Computational Treatment of Metalloproteins. J. Phys. Chem. B 2015, 119, 5945–5956. [Google Scholar] [CrossRef]
Xu, M.; He, X.; Zhu, T.; Zhang, J.Z.H. A Fragment Quantum Mechanical Method for Metalloproteins. J. Chem. Theory Comput. 2019, 15, 1430–1439. [Google Scholar] [CrossRef]
Yan, Z.; Li, X.; Chung, L.W. Multiscale Quantum Refinement Approaches for Metalloproteins. J. Chem. Theory Comput. 2021, 17, 3783–3796. [Google Scholar] [CrossRef]
Nikolova, V.; Angelova, S.E.; Markova, N.; Dudev, T. Gallium as a Therapeutic Agent: A Thermodynamic Evaluation of the Competition between Ga³⁺ and Fe³⁺ Ions in Metalloproteins. J. Phys. Chem. B 2016, 120, 2241–2248. [Google Scholar] [CrossRef]
Prytkova, T.R.; Kurnikov, I.V.; Beratan, D.N. Ab Initio Based Calculations of Electron-Transfer Rates in Metalloproteins. J. Phys. Chem. B 2005, 109, 1618–1625. [Google Scholar] [CrossRef] [PubMed]
Zheng, P.; Arantes, G.M.; Field, M.J.; Li, H. Force-induced chemical reactions on the metal centre in a single metalloprotein molecule. Nat. Commun. 2015, 6, 7569. [Google Scholar] [CrossRef] [PubMed]
Khandelwal, A.; Lukacova, V.; Comez, D.; Kroll, D.M.; Raha, S.; Balaz, S. A Combination of Docking, QM/MM Methods, and MD Simulation for Binding Affinity Estimation of Metalloprotein Ligands. J. Med. Chem. 2005, 48, 5437–5447. [Google Scholar] [CrossRef]
Banci, L. Molecular dynamics simulations of metalloproteins. Curr. Opin. Chem. Biol. 2003, 7, 143–149. [Google Scholar] [CrossRef]
Sinnecker, S.; Neese, F. QM/MM calculations with DFT for taking into account protein effects on the EPR and optical spectra of metalloproteins. Plastocyanin as a case study. J. Comput. Chem. 2006, 27, 1463–1475. [Google Scholar] [CrossRef] [PubMed]
Gleeson, D.; Gleeson, M.P. Application of QM/MM and QM methods to investigate histone deacetylase 8. MedChemComm 2015, 6, 477–485. [Google Scholar] [CrossRef]
Srnec, M.; Ryde, U.; Rulíšek, L. Reductive cleavage of the O–O bond in multicopper oxidases: A QM/MM and QM study. Faraday Discuss. 2011, 148, 41–53. [Google Scholar] [CrossRef] [PubMed]
Senn, H.M.; Thiel, W. QM/MM studies of enzymes. Curr. Opin. Chem. Biol. 2007, 11, 182–187. [Google Scholar] [CrossRef] [PubMed]
Bowman, A.L.; Ridder, L.; Rietjens, I.M.C.M.; Vervoort, J.; Mulholland, A.J. Molecular Determinants of Xenobiotic Metabolism: QM/MM Simulation of the Conversion of 1-Chloro-2,4-dinitrobenzene Catalyzed by M1-1 Glutathione S-Transferase. Biochemistry 2007, 46, 6353–6363. [Google Scholar] [CrossRef]
Khandelwal, A.; Balaz, S. QM/MM linear response method distinguishes ligand affinities for closely related metalloproteins. Proteins: Struct. Funct. Bioinform. 2007, 69, 326–339. [Google Scholar] [CrossRef]
Cho, K.-B.; Derat, E.; Shaik, S. Compound I of Nitric Oxide Synthase: The Active Site Protonation State. J. Am. Chem. Soc. 2007, 129, 3182–3188. [Google Scholar] [CrossRef] [PubMed]
Robertazzi, A.; Platts, J.A. Gas-Phase DNA Oligonucleotide Structures. A QM/MM and Atoms in Molecules Study. J. Phys. Chem. A 2006, 110, 3992–4000. [Google Scholar] [CrossRef] [PubMed]
Sala, D.; Giachetti, A.; Rosato, A. Molecular dynamics simulations of metalloproteins: A folding study of rubredoxin from Pyrococcus furiosus. AIMS Biophys. 2018, 5, 77–96. [Google Scholar] [CrossRef]
Kim, J.; Rees, D.C. Nitrogenase and Biological Nitrogen Fixation. Biochemistry 1994, 33, 389–397. [Google Scholar] [CrossRef]
Hoffman, B.M.; Lukoyanov, D.; Yang, Z.Y.; Dean, D.R.; Seefeldt, L.C. Mechanism of nitrogen fixation by nitrogenase: The next stage. Chem. Rev. 2014, 114, 4041–4062. [Google Scholar] [CrossRef] [PubMed]
Burgess, B.K.; Lowe, D.J. Mechanism of Molybdenum Nitrogenase. Chem. Rev. 1996, 96, 2983–3011. [Google Scholar] [CrossRef]
Lawson, D.M.; Smith, B.E. Molybdenum nitrogenases: A crystallographic and mechanistic view. In Metal Ions in Biological Systems; Sigel, A., Sigel, H., Eds.; CRC Press: Boca Raton, FL, USA, 2002; Volume 39, pp. 75–119. [Google Scholar]
Brigle, K.E.; Newton, W.E.; Dean, D.R. Complete nucleotide sequence of the Azotobacter vinelandii nitrogenase structural gene cluster. Gene 1985, 37, 37–44. [Google Scholar] [CrossRef]
Bjornsson, R.; Delgado-Jaime, M.U.; Lima, F.A.; Sippel, D.; Schlesier, J.; Weyhermüller, T.; Einsle, O.; Neese, F.; DeBeer, S. Molybdenum L-Edge XAS Spectra of MoFe Nitrogenase. Z. Anorg. Allg. Chem. 2015, 641, 65–71. [Google Scholar] [CrossRef]
Hales, B.J. Vanadium Nitrogenase. Catalysts for Nitrogen Fixation: Nitrogenases, Relevant Chemical Models and Commercial Processes; Springer: Berlin/Heidelberg, Germany, 2004; pp. 255–279. ISBN 978-1-4020-3611-8. [Google Scholar] [CrossRef]
Schneider, K.; Mueller, A. Iron-Only Nitrogenase: Exceptional Catalytic, Structural and Spectroscopic Features. Catalysts for Nitrogen Fixation: Nitrogenases, Relevant Chemical Models and Commercial Processes; Springer: Berlin/Heidelberg, Germany, 2004; pp. 281–307. ISBN 978-1-4020-3611-8. [Google Scholar] [CrossRef]
Igarashi, R.Y.; Seefeldt, L.C. Nitrogen Fixation: The Mechanism of the Mo-Dependent Nitrogenase. Crit. Rev. Biochem. Mol. Biol. 2003, 38, 351–384. [Google Scholar] [CrossRef]
Modak, J.M. Haber Process for Ammonia Synthesis. Resonance 2002, 7, 69–77. [Google Scholar] [CrossRef]
Burgess, B.K. Molybdenum Enzymes (Metal Ions in Biology Series); Spiro, T.G., Ed.; Wiley-Interscience: Hoboken, NJ, USA, 1985; ISBN 978-0471885429. [Google Scholar]
Simpson, F.B.; Burris, R.H. A nitrogen pressure of 50 atmospheres does not prevent evolution of hydrogen by nitrogenase. Science 1984, 224, 1095–1097. [Google Scholar] [CrossRef] [PubMed]
Yang, Z.Y.; Danyal, K.; Seefeldt, L.C. Mechanism of Mo-Dependent Nitrogenase. Nitrogen Fixation. In Methods in Molecular Biology (Methods and Protocols); Ribbe, M., Ed.; Humana Press: Totowa, NJ, USA, 2011; Volume 766. [Google Scholar] [CrossRef]
Barney, B.M.; Lee, H.I.; Dos Santos, P.C.; Hoffman, B.M.; Dean, D.R.; Seefeldt, L.C. Breaking the N₂ triple bond: Insights into the nitrogenase mechanism. Dalton Trans. 2006, 19, 2277–2284. [Google Scholar] [CrossRef]
Neese, F. The Yandulov/Schrock cycle and the nitrogenase reaction: Pathways of nitrogen fixation studied by density functional theory. Ang. Chem. 2005, 45, 196–199. [Google Scholar] [CrossRef] [PubMed]
Cao, L.; Ryde, U. Influence of the protein and DFT method on the broken-symmetry and spin states in nitrogenase. Int. J. Quantum Chem. 2018, 118, e25627. [Google Scholar] [CrossRef]
Benediktsson, B.; Bjornsson, R. QM/MM Study of the Nitrogenase MoFe Protein Resting State: Broken-Symmetry States, Protonation States, and QM Region Convergence in the FeMoco Active Site. Inorg. Chem. 2017, 56, 13417–13429. [Google Scholar] [CrossRef]
Spatzal, T.; Aksoyoglu, M.; Zhang, L.; Andrade, S.L.A.; Schleicher, E.; Weber, S.; Rees, D.C.; Einsle, O. Evidence for Interstitial Carbon in Nitrogenase FeMo Cofactor. Science 2011, 334, 940. [Google Scholar] [CrossRef]
Best, R.B.; Zhu, X.; Shim, J.; Lopes, P.E.; Mittal, J.; Feig, M.; Mackerell, A.D. Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone φ, ψ and side-chain χ(1) and χ(2) dihedral angles. J. Chem. Theory Comput. 2012, 8, 3257–3273. [Google Scholar] [CrossRef] [PubMed]
Van Stappen, C.; Thorhallsson, A.T.; Decamps, L.; Bjornsson, R.; DeBeer, S. Resolving the structure of the E1 state of Mo nitrogenase through Mo and Fe K-edge EXAFS and QM/MM calculations. Chem. Sci. 2019, 10, 9807–9821. [Google Scholar] [CrossRef]
Thorhallsson, A.T.; Benediktsson, B.; Bjornsson, R. A model for dinitrogen binding in the E4 state of nitrogenase. Chem. Sci. 2019, 10, 11110–11124. [Google Scholar] [CrossRef]
Cao, L.; Caldararu, O.; Ryde, U. Protonation and Reduction of the FeMo Cluster in Nitrogenase Studied by Quantum Mechanics/Molecular Mechanics (QM/MM) Calculations. J. Chem. Theory Comput. 2018, 14, 6653–6678. [Google Scholar] [CrossRef] [PubMed]
Lukoyanov, D.A.; Yang, Z.-Y.; Dean, D.R.; Seefeldt, L.C.; Raugei, S.; Hoffman, B.M. Electron Redistribution within the Nitrogenase Active Site FeMo-Cofactor During Reductive Elimination of H₂ to Achieve N≡N Triple-Bond Activation. J. Am. Chem. Soc. 2020, 142, 21679–21690. [Google Scholar] [CrossRef] [PubMed]
Cao, L.; Ryde, U. N₂H₂ binding to the nitrogenase FeMo cluster studied by QM/MM methods. J. Biol. Inorg. Chem. 2020, 25, 521–540. [Google Scholar] [CrossRef] [PubMed]
Seefeldt, L.C.; Yang, Z.-Y.; Lukoyanov, D.A.; Harris, D.F.; Dean, D.R.; Raugei, S.; Hoffman, B.M. Reduction of Substrates by Nitrogenases. Chem. Rev. 2020, 120, 5082–5106. [Google Scholar] [CrossRef]
Hoffman, B.M.; Lukoyanov, D.; Dean, D.R.; Seefeldt, L.C. Nitrogenase: A draft mechanism. Acc. Chem. Res. 2013, 46, 587–595. [Google Scholar] [CrossRef]
Sgrignani, J.; Franco, D.; Magistrato, A. Theoretical Studies of Homogeneous Catalysts Mimicking Nitrogenase. Molecules 2011, 16, 442–465. [Google Scholar] [CrossRef]
Lukoyanov, D.; Khadka, N.; Yang, Z.Y.; Dean, D.R.; Seefeldt, L.C.; Hoffman, B.M. Reversible Photoinduced Reductive Elimination of H₂ from the Nitrogenase Dihydride State, the E4(4H) Janus Intermediate. J. Am. Chem. Soc. 2016, 138, 1320–1327. [Google Scholar] [CrossRef]
Lukoyanov, D.A.; Krzyaniak, M.D.; Dean, D.R.; Wasielewski, M.R.; Seefeldt, L.C.; Hoffman, B.M. Time-Resolved EPR Study of H₂ Reductive Elimination from the Photoexcited Nitrogenase Janus E4(4H) Intermediate. J. Phys. Chem. B 2019, 123, 8823–8828. [Google Scholar] [CrossRef]
Lukoyanov, D.; Khadka, N.; Dean, D.R.; Raugei, S.; Seefeldt, L.C.; Hoffman, B.M. Photoinduced Reductive Elimination of H₂ from the Nitrogenase Dihydride (Janus) State Involves a FeMo-cofactor-H₂ Intermediate. Inorg. Chem. 2017, 56, 2233–2240. [Google Scholar] [CrossRef]
Raugei, S.; Seefeldt, L.C.; Hoffman, B.M. Critical computational analysis illuminates the reductive-elimination mechanism that activates nitrogenase for N₂ reduction. Proc. Natl. Acad. Sci. USA 2018, 115, E10521. [Google Scholar] [CrossRef]
Tzeli, D.; Raugei, S.; Xantheas, S.S. Quantitative Account of the Bonding Properties of a Rubredoxin Model Complex [Fe(SCH₃)₄]^q, q = −2, −1, +2, +3. J. Chem. Theory Comput. 2021, 17, 6080–6091. [Google Scholar] [CrossRef] [PubMed]
Mejuto-Zaera, C.; Tzeli, D.; Williams-Young, D.; Tubman, N.M.; Matoušek, M.; Brabec, J.; Veis, L.; Xantheas, S.S.; de Jong, W.A. The Effect of Geometry, Spin and Orbital Optimization in Achieving Accurate, Correlated Results for Iron-Sulfur Cubanes. J. Chem. Theory Comput. 2022. accepted. [Google Scholar] [CrossRef] [PubMed]
Elghobashi-Meinhardt, N.; Tombolelli, D.; Mroginski, M.A. Electronic and Structural Properties of the Double Cubane Iron-Sulfur Cluster. Catalysts 2021, 11, 245. [Google Scholar] [CrossRef]
Bartlett, R.J. Adventures in DFT by a wavefunction theorist. J. Chem. Phys. 2019, 151, 160901. [Google Scholar] [CrossRef]
Church, J.R.; Olsen, J.M.H.; Schapiro, I. The Impact of Retinal Configuration on the Protein–Chromophore Interactions in Bistable Jumping Spider Rhodopsin-1. Molecules 2022, 27, 71. [Google Scholar] [CrossRef]
Chontzopoulou, E.; Papaemmanouil, C.; Chatziathanasiadou, M.V.; Kolokouris, D.; Kiriakidi, S.; Konstantinidi, A.; Gerogianni, I.; Tselios, T.; Kostakis, I.K.; Chrysina, E.D.; et al. Artificial and natural sweeteners as potential anti-inflammatory agents. J. Biomol. Struct. Dyn. 2021, 9, 1–13. [Google Scholar] [CrossRef]
Tolbatov, I.; Marrone, A.; Coletti, C.; Re, N. Computational Studies of Au(I) and Au(III) Anticancer Metallodrugs: A Survey. Molecules 2021, 26, 7600. [Google Scholar] [CrossRef]
Skoko, S.; Ambrosetti, M.; Giovannini, T.; Cappelli, C. Simulating Absorption Spectra of Flavonoids in Aqueous Solution: A Polarizable QM/MM Study. Molecules 2020, 25, 5853. [Google Scholar] [CrossRef]
Spinello, A.; Ritacco, I.; Magistrato, A. The Catalytic Mechanism of Steroidogenic Cytochromes P450 from All-Atom Simulations: Entwinement with Membrane Environment, Redox Partners, and Post-Transcriptional Regulation. Catalysts 2019, 9, 81. [Google Scholar] [CrossRef]
Krivitskaya, A.V.; Khrenova, M.G.; Nemukhin, A.V. Two Sides of Quantum-Based Modeling of Enzyme-Catalyzed Reactions: Mechanistic and Electronic Structure Aspects of the Hydrolysis by Glutamate Carboxypeptidase. Molecules 2021, 26, 6280. [Google Scholar] [CrossRef]
Yu, M.; Liu, Y. A QM/MM Study on the Initiation Reaction of Firefly Bioluminescence- Enzymatic Oxidation of Luciferin. Molecules 2021, 26, 4222. [Google Scholar] [CrossRef] [PubMed]
Georgiou, N.; Gouleni, N.; Chontzopoulou, E.; Skoufas, G.S.; Gkionis, A.; Tzeli, D.; Vassiliou, S.; Mavromoustakos, T. Structure assignment, conformational properties and discovery of potential targets of the Ugi cinnamic adduct NGI25. J. Biomol. Struct. Dyn. 2021. [Google Scholar] [CrossRef] [PubMed]
Zlobin, A.; Diankin, I.; Pushkarev, S.; Golovin, A. Probing the Suitability of Different Ca²⁺ Parameters for Long Simulations of Diisopropyl Fluorophosphatase. Molecules 2021, 26, 5839. [Google Scholar] [CrossRef] [PubMed]
Landi, A.; Capobianco, A.; Peluso, A. The Time Scale of Electronic Resonance in Oxidized DNA as Modulated by Solvent Response: An MD/QM-MM Study. Molecules 2021, 26, 5497. [Google Scholar] [CrossRef] [PubMed]
Bouback, T.A.; Pokhrel, S.; Albeshri, A.; Aljohani, A.M.; Samad, A.; Alam, R.; Hossen, M.S.; Al-Ghamdi, K.; Talukder, M.E.K.; Ahammad, F.; et al. Pharmacophore-Based Virtual Screening, Quantum Mechanics Calculations, and Molecular Dynamics Simulation Approaches Identified Potential Natural Antiviral Drug Candidates against MERS-CoV S1-NTD. Molecules 2021, 26, 4961. [Google Scholar] [CrossRef]
Breijyeh, Z.; Karaman, R. Enzyme Models-From Catalysis to Prodrugs. Molecules 2021, 26, 3248. [Google Scholar] [CrossRef]
Khrenova, M.G.; Bulavko, E.S.; Mulashkin, F.D.; Nemukhin, A.V. Mechanism of Guanosine Triphosphate Hydrolysis by the Visual Proteins Arl3-RP2: Free Energy Reaction Profiles Computed with Ab Initio Type QM/MM Potentials. Molecules 2021, 26, 3998. [Google Scholar] [CrossRef]

Figure 2. Lowe-Thorneley kinetic model [156].

Figure 3. Nitrogen Fixation Mechanism [145].

Figure 4. Structure of FeMo cofactor at the E₄ state [164].

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

