<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">ijms</journal-id>
<journal-title>International Journal of Molecular Sciences</journal-title>
<abbrev-journal-title>Int. J. Mol. Sci.</abbrev-journal-title>
<issn pub-type="epub">1422-0067</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/ijms131114451</article-id>
<article-id pub-id-type="publisher-id">ijms-13-14451</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>A Generic Force Field for Protein Coarse-Grained Molecular Dynamics Simulation</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Gu</surname><given-names>Junfeng</given-names></name><xref ref-type="aff" rid="af1-ijms-13-14451">1</xref></contrib>
<contrib contrib-type="author">
<name><surname>Bai</surname><given-names>Fang</given-names></name><xref ref-type="aff" rid="af2-ijms-13-14451">2</xref></contrib>
<contrib contrib-type="author">
<name><surname>Li</surname><given-names>Honglin</given-names></name><xref ref-type="aff" rid="af3-ijms-13-14451">3</xref></contrib>
<contrib contrib-type="author">
<name><surname>Wang</surname><given-names>Xicheng</given-names></name><xref ref-type="aff" rid="af1-ijms-13-14451">1</xref><xref ref-type="corresp" rid="c1-ijms-13-14451">*</xref></contrib></contrib-group>
<aff id="af1-ijms-13-14451">
<label>1</label>State Key Laboratory of Structural Analysis for Industrial Equipment, Department of Engineering Mechanics, Dalian University of Technology, Dalian 116023, China; E-Mail: <email>jfgu@dlut.edu.cn</email></aff>
<aff id="af2-ijms-13-14451">
<label>2</label>Faculty of Chemical, Environmental and Biological Science and Technology, Dalian University of Technology, Dalian 116023, China; E-Mail: <email>fangbai@yahoo.com.cn</email></aff>
<aff id="af3-ijms-13-14451">
<label>3</label>School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China; E-Mail: <email>hlli@ecust.edu.cn</email></aff>
<author-notes>
<corresp id="c1-ijms-13-14451">
<label>*</label>Author to whom correspondence should be addressed; E-Mail: <email>guixum@dlut.edu.cn</email>; Tel.: +86-411-84706223; Fax: +86-411-84708393.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2012</year></pub-date>
<pub-date pub-type="epub">
<day>08</day>
<month>11</month>
<year>2012</year></pub-date>
<volume>13</volume>
<issue>11</issue>
<fpage>14451</fpage>
<lpage>14469</lpage>
<history>
<date date-type="received">
<day>03</day>
<month>09</month>
<year>2012</year></date>
<date date-type="rev-recd">
<day>26</day>
<month>10</month>
<year>2012</year></date>
<date date-type="accepted">
<day>26</day>
<month>10</month>
<year>2012</year></date></history>
<permissions>
<copyright-statement>© 2012 by the authors; licensee Molecular Diversity Preservation International, Basel, Switzerland.</copyright-statement>
<copyright-year>2012</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0">
<p>This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>Coarse-grained (CG) force fields have become promising tools for studying protein behavior, but balancing speed and accuracy remains a challenge in protein coarse-graining methodology. In this work, 20 CG beads were designed based on the structures of amino acid residues, so that an amino acid is represented by one or two beads, and a CG solvent model with five water molecules was adopted to ensure consistency with the protein CG beads. The internal interactions in a protein were classified according to the types of the interacting CG beads, and appropriate potential functions were chosen and systematically parameterized to fit the energy distributions. The proposed CG force field was tested on eight proteins, each simulated for 1000 ns. Even without any extra structural knowledge of the simulated proteins, the Cα root mean square deviations (RMSDs) with respect to their experimental structures are close to those of relatively short all-atom molecular dynamics simulations. However, our coarse-grained force field will require further refinement to improve agreement with, and persistence of, native-like structures. In addition, the root mean square fluctuations (RMSFs) relative to the average structures derived from the simulations show that the conformational fluctuations of the proteins can be sampled.</p></abstract>
<kwd-group>
<kwd>coarse-grained</kwd>
<kwd>force field</kwd>
<kwd>molecular dynamics</kwd>
<kwd>protein</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<title>1. Introduction</title>
<p>Over the last 30 years, the Molecular Dynamics (MD) method has played an increasingly important role in simulating the dynamic behavior of biomolecules at the atomic level [<xref ref-type="bibr" rid="b1-ijms-13-14451">1</xref>]. In numerous application areas, such as structural biology, biophysics, biochemistry, enzymology, molecular biology and medicinal chemistry, MD has become a routine research tool. MD simulation allows biomolecular structure, kinetics, and thermodynamics to be investigated, for example, macromolecular stability, conformational and allosteric properties, the role of dynamics in enzyme activity, molecular recognition and the properties of complexes, ion and small molecule transport, protein association, protein folding, and protein hydration [<xref ref-type="bibr" rid="b2-ijms-13-14451">2</xref>]. However, All-Atom Molecular Dynamics (AA-MD) is severely restricted by available computational capabilities because it requires a large amount of computing resources. In the 1970s, a small protein (bovine pancreatic trypsin inhibitor, composed of about 500 atoms) was first simulated with AA-MD, and the simulation lasted only about 10 picoseconds, limited by the computing power of that time [<xref ref-type="bibr" rid="b3-ijms-13-14451">3</xref>]. With the development of modern computer technology, high-performance computing and MD methods, the application of AA-MD has made great progress in both spatial and time scales. Nowadays, AA-MD can simulate biomolecular systems containing up to millions of atoms, with simulation times beyond the microsecond level [<xref ref-type="bibr" rid="b4-ijms-13-14451">4</xref>,<xref ref-type="bibr" rid="b5-ijms-13-14451">5</xref>]. Despite this, AA-MD still cannot meet all the needs of biomolecular research.
Most dynamics and interactions within cells (e.g., protein-protein docking, rearrangement upon ligand binding, folding) occur on microsecond or even millisecond timescales and usually involve large macromolecular aggregates. The simulation time of these processes is at least four to six orders of magnitude longer than the time feasible with AA-MD simulation, which poses a major barrier to biomolecular simulation research.</p>
<p>In the past few years, Coarse-Grained Molecular Dynamics (CG-MD) methods for biomolecules have gained increasing attention [<xref ref-type="bibr" rid="b6-ijms-13-14451">6</xref>–<xref ref-type="bibr" rid="b11-ijms-13-14451">11</xref>]. The basic idea of CG-MD is to treat several or more atoms as one virtual particle (<italic>i.e.</italic>, so-called coarse-graining), which reduces the huge number of degrees of freedom within complex biomolecules, especially proteins, as well as the complexity of the corresponding force field, and therefore dramatically decreases the computational cost of MD simulation. Various kinds of protein CG models and force fields have been introduced. With reference to the amino acid residue, protein CG models can be broadly classified as multiple-point models [<xref ref-type="bibr" rid="b12-ijms-13-14451">12</xref>–<xref ref-type="bibr" rid="b20-ijms-13-14451">20</xref>], two-point models [<xref ref-type="bibr" rid="b21-ijms-13-14451">21</xref>,<xref ref-type="bibr" rid="b22-ijms-13-14451">22</xref>], one-point models [<xref ref-type="bibr" rid="b23-ijms-13-14451">23</xref>–<xref ref-type="bibr" rid="b27-ijms-13-14451">27</xref>] and much coarser multiple-residue models [<xref ref-type="bibr" rid="b28-ijms-13-14451">28</xref>–<xref ref-type="bibr" rid="b30-ijms-13-14451">30</xref>], and CG force fields vary from simple harmonic potentials to more realistic molecular force fields.
CG-MD has produced many research results and has been applied to areas such as membranes [<xref ref-type="bibr" rid="b10-ijms-13-14451">10</xref>,<xref ref-type="bibr" rid="b31-ijms-13-14451">31</xref>,<xref ref-type="bibr" rid="b32-ijms-13-14451">32</xref>], ion channels [<xref ref-type="bibr" rid="b11-ijms-13-14451">11</xref>,<xref ref-type="bibr" rid="b33-ijms-13-14451">33</xref>], protein folding [<xref ref-type="bibr" rid="b34-ijms-13-14451">34</xref>,<xref ref-type="bibr" rid="b35-ijms-13-14451">35</xref>], and protein-protein interactions [<xref ref-type="bibr" rid="b36-ijms-13-14451">36</xref>]. However, owing to their limited speedup and reliability, the main available methods have so far been difficult to apply widely to the simulation of large-scale biological systems, and the further development of CG-MD remains challenging work for researchers. CG models need to be as simplified as possible in order to simulate more complicated biomolecules, while CG force fields need to be as realistic as possible so that the kinetic behavior under AA-MD can be accurately reproduced. Current coarse-graining methodologies are still not as predictive as AA-MD because of the intrinsic difficulty in modeling the complex and diverse intra-molecular interactions with few parameters. Developing CG models and accurate force fields for proteins has become of great importance for studying large biological systems on both time and spatial scales.</p>
<p>As a representative CG force field, MARTINI has gained the most attention and has been successfully applied to simulations of protein and membrane systems. However, MARTINI still needs secondary structure restraints to maintain the stability of the native structure during the simulation, and the parameterization of a CG force field is complicated and demands considerable experience and effort. Therefore, simpler and more efficient methods are continuously being researched. Here we report our recent work on improving the CG-MD methodology. Novel CG models for protein simulation are designed in which a residue is composed of only one or two beads, so the computational efficiency of MD can be improved significantly. A force field based on these models is developed from known protein structures and AA-MD simulation results. The protein CG models and the force field are then applied in MD simulations of eight small to medium size proteins. Finally, the simulation results are given and compared with those of AA-MD simulations and experimental values, indicating the effectiveness of the proposed CG models and force field.</p></sec>
<sec sec-type="results">
<title>2. Results</title>
<sec sec-type="results">
<title>2.1. Results of Bonded Potential Parameterization</title>
<p>In the bonded interactions, all the backbone beads are assumed to be the same, so the bond types can be classified as <italic>B</italic>–<italic>B</italic> and <italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic>. The statistical bond length distributions are shown in <xref ref-type="fig" rid="f1-ijms-13-14451">Figure 1</xref>. <xref ref-type="fig" rid="f1-ijms-13-14451">Figure 1A</xref> shows that the <italic>B</italic>–<italic>B</italic> bond length is distributed in a narrow range from 3.6 Å to 3.9 Å and centered on 3.8 Å, so 3.8 Å is adopted as the equilibrium stretching length <italic>L</italic><italic><sub>bond</sub></italic> in <xref rid="FD2" ref-type="disp-formula">Equation 2</xref> for <italic>B</italic>–<italic>B</italic>. <xref ref-type="fig" rid="f1-ijms-13-14451">Figure 1B</xref> shows the distance distributions between the 10 types of <italic>S</italic><italic><sub>i</sub></italic> beads and their backbone beads. Each distribution is similar in character to that of <italic>B</italic>–<italic>B</italic>, but the equilibrium length of the <italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic> bond depends on the <italic>S</italic><italic><sub>i</sub></italic> bead. <xref ref-type="table" rid="t1-ijms-13-14451">Table 1</xref> summarizes the <italic>L</italic><italic><sub>bond</sub></italic> of each <italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic> bond adopted in our force field. The stretching energy profile of a bond is extracted from the bond length distribution with the Boltzmann conversion method and fitted with <xref rid="FD2" ref-type="disp-formula">Equation 2</xref> to obtain the force constant.
The <italic>B</italic>–<italic>B</italic> force constant adopts an approximate value of 100,000 kJ nm<sup>−2</sup> mol<sup>−1</sup>, and the <italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic> force constants adopt a mean value of 5,000 kJ nm<sup>−2</sup> mol<sup>−1</sup> in our force field.</p>
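<p>The Boltzmann conversion and the subsequent fit can be sketched as follows. This is a minimal illustration under stated assumptions, not the parameterization script used in this work: it assumes that Equation 2 has the harmonic form U = 0.5 K (L − L_bond)^2 up to a constant offset, that the distribution P(L) is converted with U = −k_B T ln P at 300 K, and that L_bond has already been fixed at the peak of the distribution. The function names boltzmann_invert and fit_harmonic are hypothetical. Lengths are in nm and energies in kJ mol^-1, so K is returned in kJ nm^-2 mol^-1.</p>
<preformat>
```python
import math

KB = 0.0083144626  # Boltzmann constant in kJ mol^-1 K^-1


def boltzmann_invert(probabilities, temperature=300.0):
    """Convert a probability distribution into energies U = -kB*T*ln(P).

    Empty bins (P = 0) are returned as None because U is undefined there.
    """
    kt = KB * temperature
    return [(-kt * math.log(p) if p > 0.0 else None) for p in probabilities]


def fit_harmonic(lengths, energies, l_bond):
    """Least-squares fit of U = 0.5*k*(L - l_bond)**2 + c; returns (k, c).

    Assumed harmonic form; l_bond is held fixed, so the fit reduces to a
    straight-line regression of U against z = (L - l_bond)**2.
    """
    points = [((x - l_bond) ** 2, u)
              for x, u in zip(lengths, energies) if u is not None]
    n = len(points)
    mean_z = sum(z for z, _ in points) / n
    mean_u = sum(u for _, u in points) / n
    szz = sum((z - mean_z) ** 2 for z, _ in points)
    szu = sum((z - mean_z) * (u - mean_u) for z, u in points)
    slope = szu / szz  # slope equals k / 2
    return 2.0 * slope, mean_u - slope * mean_z
```
</preformat>
<p>Applied to a noise-free distribution generated from a harmonic well, the fit returns the force constant that generated it. Real bond length histograms are noisy, so the fitted value depends on the binning and on the range of lengths included.</p>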
<p>The angles in the CG protein system can be classified into three types: <italic>B</italic>–<italic>B</italic>–<italic>B</italic>, <italic>B</italic>–<italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic> and <italic>S</italic><italic><sub>i</sub></italic>–<italic>B</italic>–<italic>B</italic>. The angle bending energy profiles calculated from the probability distributions of these angles are shown in <xref ref-type="fig" rid="f2-ijms-13-14451">Figure 2</xref>, in which distinct colors and patterns are used to distinguish different <italic>S</italic><italic><sub>i</sub></italic>. Two minima at about 90 and 120 degrees can be found in the energy profile of the <italic>B</italic>–<italic>B</italic>–<italic>B</italic> angle, which correspond to the α-helix and β-sheet secondary structures. A similar pattern of energy profiles is observed in <xref ref-type="fig" rid="f2-ijms-13-14451">Figure 2B,C</xref>, and only one set of parameters is used for the <italic>B</italic>–<italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic> (or <italic>S</italic><italic><sub>i</sub></italic>–<italic>B</italic>–<italic>B</italic>) bending potential function. Because of the coarse-graining, we have to neglect some specific features of the structure or energy distribution and focus on the common features behind the details. For fitting with <xref rid="FD3" ref-type="disp-formula">Equation 3</xref>, mean-value smoothing is adopted to handle the different profiles in <italic>B</italic>–<italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic> (or <italic>S</italic><italic><sub>i</sub></italic>–<italic>B</italic>–<italic>B</italic>), and the fitted potential functions are shown as solid curves in <xref ref-type="fig" rid="f2-ijms-13-14451">Figure 2</xref>. The Gaussian parameters in <xref rid="FD3" ref-type="disp-formula">Equation 3</xref> obtained from the fitting process are given in the Supplementary Materials.</p>
<p>Similarly, the dihedrals can be classified into four types: <italic>S</italic><italic><sub>i</sub></italic>–<italic>B</italic>–<italic>B</italic>–<italic>S</italic><italic><sub>j</sub></italic>, <italic>S</italic><italic><sub>i</sub></italic>–<italic>B</italic>–<italic>B</italic>–<italic>B</italic>, <italic>B</italic>–<italic>B</italic>–<italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic> and <italic>B</italic>–<italic>B</italic>–<italic>B</italic>–<italic>B</italic>. <xref ref-type="fig" rid="f3-ijms-13-14451">Figure 3</xref> gives the pseudo-dihedral torsion energy profiles of each type; for example, <xref ref-type="fig" rid="f3-ijms-13-14451">Figure 3A</xref> shows the 100 energy profiles of <italic>S</italic><italic><sub>i</sub></italic>–<italic>B</italic>–<italic>B</italic>–<italic>S</italic><italic><sub>j</sub></italic>. Each type is fitted with <xref rid="FD3" ref-type="disp-formula">Equation 3</xref>, and the fitting results are shown as solid curves in <xref ref-type="fig" rid="f3-ijms-13-14451">Figure 3</xref>. The Gaussian parameters for the torsion potential are also given in the Supplementary Materials.</p></sec>
<sec sec-type="results">
<title>2.2. Results of Non-Bonded Potential Parameterization</title>
<p>It is important to describe the non-bonded interactions of the 20 CG beads accurately in order to study protein folding and protein-protein interactions. Umbrella sampling (US), a technique for improving sampling, was used to obtain the potential of mean force (PMF) between two CG beads of the same type, and the PMF curve was fitted to <xref rid="FD5" ref-type="disp-formula">Equation 5</xref> to extract the best van der Waals interaction potential parameters. <xref ref-type="fig" rid="f4-ijms-13-14451">Figure 4A</xref> gives the histograms of the configurations within the umbrella sampling windows, which indicate that there is sufficient overlap between adjacent windows. <xref ref-type="fig" rid="f4-ijms-13-14451">Figure 4B</xref> gives the PMF against the distance between the geometric centers of two <italic>B</italic><italic><sub>ALA</sub></italic> beads, which has a minimum around 0.45 nm. However, when we statistically analyzed the distance distributions between two ALA amino acids in the above-mentioned protein structure database, the probability peak corresponding to the energy minimum was found around 0.55 nm. The reason for this inconsistency is that a CG bead is constrained by the surrounding beads when it is part of a protein, whereas it is unrestricted in the US simulation. Most CG beads in a protein cannot approach each other as closely as in the US simulations, so the short-range part of the PMF curve may not be appropriate for modeling the non-bonded interactions in a protein. However, the relatively long-distance interactions between CG beads are rarely affected by the protein environment and can still be described by the PMF curves. Therefore, we statistically analyzed the distances for all 20 pairs of same-type CG beads to determine the parameter <italic>c</italic><italic><sub>ij</sub></italic> in <xref rid="FD5" ref-type="disp-formula">Equation 5</xref>, at which the van der Waals potential is equal to zero, as listed in <xref ref-type="table" rid="t2-ijms-13-14451">Table 2</xref>.
<xref rid="FD5" ref-type="disp-formula">Equation 5</xref> was then fitted to the PMF curve, with <italic>c</italic><italic><sub>ij</sub></italic> fixed, to determine the van der Waals well depth parameter. <xref ref-type="fig" rid="f5-ijms-13-14451">Figure 5</xref> gives the fitted results for CG beads <italic>B</italic><italic><sub>GLY</sub></italic>, <italic>B</italic><italic><sub>SER</sub></italic>, <italic>S</italic><italic><sub>GLU</sub></italic> and <italic>S</italic><italic><sub>ILE</sub></italic>. Because in most cases the position of the energy minimum determined by the statistical <italic>c</italic><italic><sub>ij</sub></italic> is farther than that of the corresponding PMF curve, the fit matches the PMF only in its long-range part, as shown in <xref ref-type="fig" rid="f5-ijms-13-14451">Figure 5</xref> (<italic>B</italic><italic><sub>SER</sub></italic>, <italic>S</italic><italic><sub>GLU</sub></italic> and <italic>S</italic><italic><sub>ILE</sub></italic>).</p></sec>
<sec>
<title>2.3. Verification of the Force Field</title>
<p>To verify our force field, several proteins solvated in water were coarse-grained and simulated for a relatively long time. The maintenance of the experimental structures and other thermodynamic properties during the simulations is taken as an indication of the feasibility of the force field for protein simulation.</p>
<p>The test protein group is composed of eight small to medium size proteins that are not included in the protein set used for bonded potential parameterization. These proteins have recently been used to examine the performance of a modified version of the CHARMM force fields [<xref ref-type="bibr" rid="b37-ijms-13-14451">37</xref>], and some of them have been used to test the PACE CG force field [<xref ref-type="bibr" rid="b14-ijms-13-14451">14</xref>], so they were chosen to verify our force field for convenient comparison. All the CG-MD simulations were performed with the GROMACS 4.0.5 package [<xref ref-type="bibr" rid="b38-ijms-13-14451">38</xref>]. First, the protein was coarse-grained based on the proposed CG model, and topology files were generated with our own scripts. Then the CG protein was solvated in CG water molecules, and the system was energy minimized with the proposed CG force field. It is worth mentioning that GROMACS does not provide a Gaussian function type in its topology files, so user-supplied tabulated functions were used to calculate the energies of angle bending and dihedral torsion. After the energy minimization, the CG system was equilibrated for 200 ps and then submitted to a 1000 ns simulation in the isothermal-isobaric (NPT) ensemble at 300 K and 1 bar, and detailed information on the eight simulated protein systems is listed in <xref ref-type="table" rid="t3-ijms-13-14451">Table 3</xref>.</p>
<p><xref ref-type="table" rid="t4-ijms-13-14451">Table 4</xref> gives the simulation times and Cα RMSDs of the eight proteins from their experimental structures, derived from CG-MD simulations <italic>versus</italic> all-atom simulations. With the CG-MD methodology, all eight proteins were simulated for 1000 ns; the average Cα RMSDs vary from 0.316 to 0.415 nm, and the final RMSDs are between 0.323 and 0.431 nm. With the all-atom simulations [<xref ref-type="bibr" rid="b37-ijms-13-14451">37</xref>], the eight proteins were simulated for 22–148 ns, and the average and final RMSDs vary from 0.106 to 0.358 nm and from 0.121 to 0.477 nm, respectively. In general, the RMSDs with CG-MD are larger than those with AA-MD owing to the longer simulation time and the roughness of our CG force field, but the values are comparable, and the final RMSD of 1FKS is even lower. Thus, the experimental structures of the proteins can be considered to be maintained with our CG force field over long MD simulations. <xref ref-type="fig" rid="f6-ijms-13-14451">Figure 6</xref> gives the full trajectories of the Cα RMSDs of the eight proteins. Most of the proteins reach stable conformations within the first 100 ns, and their Cα RMSDs remain around 0.4 nm. The structure of 3GB1 is more stable, with Cα RMSDs maintained around 0.32 nm, mainly because the native structure of this protein contains few long loops. It is noteworthy that the Cα RMSD trajectory of protein 2AAS has two distinct increases at around 460 and 800 ns. To analyze the reason and investigate the conformational changes during the simulation, the conformations of 2AAS at 0 ns, 250 ns, 450 ns, 480 ns, 750 ns and 1000 ns were sampled and plotted in <xref ref-type="fig" rid="f7-ijms-13-14451">Figure 7</xref>.
From the conformations at these six time points, the skeleton structure of 2AAS remains stable during the 1000 ns simulation, which also indicates that the native structure can be maintained with our CG force field. Comparing the conformations at 450 ns and 480 ns, loop1, which is composed of residues 20–25, underwent a large conformational change as indicated in <xref ref-type="fig" rid="f7-ijms-13-14451">Figure 7</xref>, which corresponds to the distinct increase of the Cα RMSD trajectory around 460 ns. The conformational change of residues 58–61 (labeled as loop2 in <xref ref-type="fig" rid="f7-ijms-13-14451">Figure 7</xref>) between 750 and 1000 ns corresponds to the RMSD change around 800 ns. Both loop1 and loop2 are flexible loop regions located at the solvent-exposed surface of the protein, so they are less stable than the secondary structure and the hydrophobic core of the protein during the simulation.</p>
<p>Another question of interest is whether the conformational fluctuations of a protein can be reasonably simulated with our CG model and force field. In the PDB file of an NMR model, the B-factor column for each atom contains a measure of how much that atom position varies throughout the models in the ensemble, which provides an experimentally detectable measure of equilibrium dynamics. <xref ref-type="fig" rid="f8-ijms-13-14451">Figure 8</xref> gives the Root Mean Square Fluctuations (RMSFs) relative to the averaged structures for proteins 1BTA, 1D3Z, 1FKS and 3GB1, whose NMR structures provide B-factors. The RMSFs simulated with CG-MD are compared with the B-factors via the conversion equation RMSF<sup>2</sup> = 3B/(8π<sup>2</sup>), where B is the B-factor, which indicates the degree of conformational stability. As shown in <xref ref-type="fig" rid="f8-ijms-13-14451">Figure 8</xref>, the RMSFs of proteins 1D3Z and 3GB1 are globally consistent with the experimental values. However, at some locations of 1BTA and 1FKS there are obvious inconsistencies between the simulated RMSFs and the experimental values: at residues 7–13, 15–20, 25–26 and 35–36 of protein 1BTA, the RMSFs are higher than the experimental values, while at residues 33–34, 40–44 and 84–91 of protein 1FKS, the situation is reversed. Analysis of the protein structures and simulation trajectories shows that the above-mentioned locations of 1BTA are either loops with lower curvature or ends of α-helices, while the locations of 1FKS are loops with higher curvature. The main reason for these discrepancies is that loop structures are mainly stabilized by the bonded interactions, while the bonded potentials adopted in our CG force field are fits to statistical average values owing to the simplification. Therefore, loops with higher curvature are constrained by the bonded potential more strictly than they should be, while the opposite holds for loops with lower curvature.</p></sec>
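<p>The conversion between B-factors and RMSFs used for this comparison, RMSF^2 = 3B/(8 pi^2), can be written as a pair of small helper functions. This is a minimal sketch; the function names are hypothetical, and the only assumption about units is that the RMSF is expressed in the length unit whose square is the unit of B (for example, B in square angstroms gives the RMSF in angstroms).</p>
<preformat>
```python
import math


def bfactor_to_rmsf(b_factor):
    """Return the RMSF implied by a B-factor via RMSF^2 = 3*B / (8*pi^2)."""
    return math.sqrt(3.0 * b_factor / (8.0 * math.pi ** 2))


def rmsf_to_bfactor(rmsf):
    """Inverse conversion: B = 8*pi^2 * RMSF^2 / 3."""
    return 8.0 * math.pi ** 2 * rmsf ** 2 / 3.0
```
</preformat>
<p>Converting the per-residue experimental B-factors with the first function puts them on the same scale as the simulated RMSFs, so that the two curves can be compared directly.</p>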
<sec>
<title>2.4. Efficiency of the Force Field</title>
<p>The main goal of coarse-graining is to improve the computational efficiency. To compare the computational efficiency, the above-mentioned eight test proteins were each simulated for another 10 ns with the proposed CG-MD methodology, AA-MD and MARTINI. All the simulations were performed in serial on an Intel Xeon processor (2.4 GHz). Each protein was first centered in a box whose edges are 1 nm from the molecule and then solvated with water. In the all-atom simulations, the GROMOS87 force field and the SPC water model were adopted, and a 2 fs time-step was used. In our and the MARTINI CG-MD simulations, the corresponding CG water models and a 16 fs time-step were used. All the systems were first energy minimized, then equilibrated for 200 ps and submitted to a 10 ns simulation in the isothermal-isobaric (NPT) ensemble at 300 K and 1 bar. The simulation times are listed in <xref ref-type="table" rid="t5-ijms-13-14451">Table 5</xref>. With each simulation method, the simulation time is proportional to the protein size (as listed in <xref ref-type="table" rid="t3-ijms-13-14451">Table 3</xref>). The simulation time with our coarse-graining methodology is slightly less than that with MARTINI, mainly because of the coarser protein and water models. Compared with AA-MD simulation, MARTINI and our CG-MD method achieve a speedup of about 75 to 100 times. When a more complicated solvent model, such as TIP3P, is adopted in AA-MD, the speedup will be more obvious. The larger time-step adopted in CG-MD appears to be a direct factor behind the speedup, but the underlying reason is that an appropriate coarse-graining model can maintain the structural stability of the protein in a CG-MD simulation with a larger time-step.</p>
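<p>As a rough back-of-the-envelope illustration (not part of the original analysis), the overall speedup can be separated into the part due to the larger time-step and the part due to cheaper individual steps. The sketch below assumes that both runs cover the same simulated time, so that the wall-clock speedup is the time-step ratio multiplied by the ratio of per-step costs; the function names are hypothetical.</p>
<preformat>
```python
def step_count(simulated_ns, timestep_fs):
    """Number of integration steps needed to cover simulated_ns nanoseconds."""
    return round(simulated_ns * 1.0e6 / timestep_fs)


def per_step_cost_ratio(total_speedup, aa_timestep_fs=2.0, cg_timestep_fs=16.0):
    """Part of a wall-clock speedup not explained by the larger time-step.

    Assumes total_speedup = (cg_timestep / aa_timestep) * per_step_cost_ratio,
    i.e., both runs cover the same simulated time.
    """
    return total_speedup * aa_timestep_fs / cg_timestep_fs
```
</preformat>
<p>For a 10 ns run, a 2 fs time-step requires 5,000,000 steps and a 16 fs time-step requires 625,000, a factor of 8. Under the stated assumption, an overall speedup of 75 to 100 times therefore corresponds to each CG step being roughly 9 to 12.5 times cheaper than an all-atom step.</p>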
<p>The average Cα RMSDs of all the simulations are also given in <xref ref-type="table" rid="t5-ijms-13-14451">Table 5</xref>. The AA-MD values here differ slightly from those provided in <xref ref-type="table" rid="t4-ijms-13-14451">Table 4</xref>, which were obtained with a modified CHARMM force field and over a longer simulation period. In our simulations, the average RMSDs of AA-MD range from 0.128 to 0.259 nm, which indicates that AA-MD is the most stable of the three simulation methodologies. The RMSDs with the proposed CG-MD methodology are lower than those with MARTINI for all eight proteins. Although 10 ns is a relatively short simulation period, the results show that the proposed CG-MD methodology has a comparable or even better ability to maintain the native structure than the popular MARTINI CG force field. It should be noted that secondary structure information is required in the simulations with MARTINI but not with the proposed method. This indicates that the interactions in the CG protein structure can be balanced even without any extra structural restraints, which makes the proposed model more suitable for simulating random or extended structures.</p>
<p>In addition, to evaluate the role of the CG solvent in the simulations, the test proteins were also simulated in vacuum, and the average Cα RMSDs of these simulations are shown in <xref ref-type="table" rid="t5-ijms-13-14451">Table 5</xref>. The RMSDs in vacuum range from 0.416 to 0.689 nm and are significantly higher than those of the simulations with CG solvent. The reason for this difference is that the initial structures of the simulations are the native structures of the proteins, which are maintained in a solvent environment. The maintenance of the native structure is determined largely by the balance of the interactions of the amino acid residues with each other and with the aqueous solution surrounding the protein, and the solvent influences the conformation by competing with intramolecular interactions. The bonded potential in this work is derived from a statistical analysis of a representative protein set, so the solvent effect is incorporated implicitly. However, the van der Waals interactions are determined via simulations in vacuum, so the solvent effect is not incorporated in that potential. Therefore, the RMSD values relative to the initial structures are larger when the CG solvent is absent.</p></sec></sec>
<sec sec-type="materials|methods">
<title>3. Materials and Methods</title>
<sec>
<title>3.1. The Coarse-Grained Protein Models</title>
<p>In our CG protein models, each amino acid is modeled by one or two beads according to its size. In total, 20 types of CG beads were designed for the 20 amino acids, as shown in <xref ref-type="fig" rid="f9-ijms-13-14451">Figure 9</xref>, and they can be classified into two broad categories: backbone beads and side-chain beads. The backbone and side-chain beads are denoted as <italic>B</italic><italic><sub>i</sub></italic> (<italic>i</italic> = ALA, ASN, ASP, CYS, GLY, LEU, PRO, SER, THR, VAL) and <italic>S</italic><italic><sub>i</sub></italic> (<italic>i</italic> = ARG, GLN, GLU, HIS, ILE, LYS, MET, PHE, TRP, TYR), respectively. Some amino acids are modeled by only one backbone bead because of their small side-chains, while the others are modeled by one uniform backbone bead (the glycine bead <italic>B</italic><italic><sub>GLY</sub></italic>) and one distinct side-chain bead. All the CG beads are idealized as spheres; the center of a backbone bead is located at the alpha-carbon atom, while the center of a side-chain bead is located at the geometric center of all its heavy atoms.</p></sec>
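<p>The mapping from an all-atom residue to its CG beads can be sketched as follows. This is a hedged illustration rather than the script used in this work: the function name coarse_grain_residue and the bead labels are hypothetical, the input is assumed to contain heavy atoms only, keyed by PDB atom name, and the side-chain heavy atoms are assumed to be all heavy atoms other than the backbone atoms N, CA, C and O.</p>
<preformat>
```python
# Residues represented by a single backbone bead B_i (list taken from the text).
ONE_BEAD = {"ALA", "ASN", "ASP", "CYS", "GLY", "LEU", "PRO", "SER", "THR", "VAL"}
# Residues represented by a B_GLY backbone bead plus a side-chain bead S_i.
TWO_BEAD = {"ARG", "GLN", "GLU", "HIS", "ILE", "LYS", "MET", "PHE", "TRP", "TYR"}
# Assumption: atoms with these names are backbone, everything else is side chain.
BACKBONE_ATOMS = {"N", "CA", "C", "O"}


def coarse_grain_residue(resname, atoms):
    """Map one residue to a list of (bead_type, (x, y, z)) tuples.

    atoms is a dict of heavy-atom name to (x, y, z) coordinates.
    """
    ca = atoms["CA"]  # backbone bead is centered on the alpha-carbon
    if resname in ONE_BEAD:
        return [("B_" + resname, ca)]
    if resname in TWO_BEAD:
        side = [xyz for name, xyz in atoms.items() if name not in BACKBONE_ATOMS]
        n = len(side)
        # side-chain bead sits at the geometric center of the side-chain atoms
        center = tuple(sum(xyz[i] for xyz in side) / n for i in range(3))
        return [("B_GLY", ca), ("S_" + resname, center)]
    raise ValueError("unknown residue: " + resname)
```
</preformat>
<p>Calling the function for every residue of a structure and concatenating the results gives the bead list from which a CG topology could be built.</p>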
<sec>
<title>3.2. The Coarse-Grained Force Field</title>
<p>With the above-mentioned protein CG models, the structure and internal interactions of a protein can be simplified as shown in <xref ref-type="fig" rid="f10-ijms-13-14451">Figure 10</xref>. The CG force field can be formulated as <xref rid="FD1" ref-type="disp-formula">Equation 1</xref>:</p>
<disp-formula id="FD1">
<label>(1)</label>
<mml:math id="mm1" display="block">
<mml:semantics id="sm1">
<mml:mrow>
<mml:mi>U</mml:mi>
<mml:mo>=</mml:mo>
<mml:munder accentunder="true">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>d</mml:mi></mml:mrow></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>e</mml:mi></mml:mrow></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mo stretchy="true">_</mml:mo></mml:munder>
<mml:mo>+</mml:mo>
<mml:munder accentunder="true">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mi>d</mml:mi>
<mml:mi>w</mml:mi></mml:mrow></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:mrow>
<mml:mo stretchy="true">_</mml:mo></mml:munder></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>U</italic><italic><sub>bond</sub></italic>, <italic>U</italic><italic><sub>angle</sub></italic> and <italic>U</italic><italic><sub>torsion</sub></italic> are the stretching potential energy of a virtual bond, the potential energy of a virtual angle bending and the potential function of a dihedral angle about a rotating bond, respectively, which describe the bonded interactions between CG beads. <italic>U</italic><italic><sub>vdw</sub></italic> and <italic>U</italic><italic><sub>elec</sub></italic> describe the non-bonded interactions, which are the energy of van der Waals interactions and electrostatic interactions respectively.</p></sec>
<sec>
<title>3.3. The Bonded Potential and Parameterization</title>
<p>The virtual stretching interaction between two bonded CG beads can be described as a harmonic potential:</p>
<disp-formula id="FD2">
<label>(2)</label>
<mml:math id="mm2" display="block">
<mml:semantics id="sm2">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>d</mml:mi></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:mfrac>
<mml:msub>
<mml:mrow>
<mml:mi>K</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>d</mml:mi></mml:mrow></mml:msub>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>l</mml:mi>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>d</mml:mi></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>K</italic><italic><sub>bond</sub></italic> and <italic>L</italic><italic><sub>bond</sub></italic> are the force constant and the equilibrium length of a bond, respectively, which are determined by fitting the energy distribution of the virtual bond. Because of the coarse-graining, the <italic>U</italic><italic><sub>angle</sub></italic> and <italic>U</italic><italic><sub>torsion</sub></italic> curves become more complex and irregular than those of an AA force field, so they are described with a sum of Gaussian functions:</p>
<disp-formula id="FD3">
<label>(3)</label>
<mml:math id="mm3" display="block">
<mml:semantics id="sm3">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>a</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>g</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>e</mml:mi>
<mml:mo>/</mml:mo>
<mml:mi>t</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>n</mml:mi></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mi>N</mml:mi></mml:munderover>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mi> </mml:mi>
<mml:mtext>exp</mml:mtext>
<mml:mo stretchy="false">[</mml:mo>
<mml:mo>-</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>b</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo stretchy="false">]</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>N</italic> is the number of Gaussian terms and <italic>a</italic><italic><sub>i</sub></italic>, <italic>b</italic><italic><sub>i</sub></italic> and <italic>c</italic><italic><sub>i</sub></italic> are the Gaussian parameters, all of which need to be determined in the parameterization process.</p>
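<p>As a minimal illustration of how <xref rid="FD2" ref-type="disp-formula">Equation 2</xref> and <xref rid="FD3" ref-type="disp-formula">Equation 3</xref> can be evaluated, the following Python sketch computes the energy of a single virtual bond and of a single virtual angle or torsion. The function names and all numerical values are hypothetical placeholders chosen for illustration; they are not the fitted parameters of the proposed force field.</p>
<preformat>
```python
import math


def harmonic_bond_energy(length, k_bond, l_bond):
    """Equation 2 for one virtual bond: 0.5 * K_bond * (l - L_bond)**2."""
    return 0.5 * k_bond * (length - l_bond) ** 2


def gaussian_sum_energy(x, terms):
    """Equation 3: sum over i of a_i * exp(-((x - b_i) / c_i)**2).

    `terms` is a sequence of (a_i, b_i, c_i) tuples; its length plays
    the role of N in Equation 3.
    """
    return sum(a * math.exp(-(((x - b) / c) ** 2)) for a, b, c in terms)


# Hypothetical parameters, used only to exercise the two functions.
example_bond = {"k_bond": 200.0, "l_bond": 0.38}
example_angle_terms = [(-5.0, 95.0, 10.0), (-3.0, 130.0, 15.0)]

bond_energy = harmonic_bond_energy(0.40, **example_bond)
angle_energy = gaussian_sum_energy(95.0, example_angle_terms)
```
</preformat>
<p>In this sketch, the total <italic>U</italic><italic><sub>bond</sub></italic> of <xref rid="FD2" ref-type="disp-formula">Equation 2</xref> would be obtained by summing the first function over all virtual bonds, each with the parameters of its own bond type.</p>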
<p>To parameterize the bonded potential correctly, we adopted a reduced and non-redundant set of protein structures used for fold recognition [<xref ref-type="bibr" rid="b39-ijms-13-14451">39</xref>]. This set includes about 3600 structures chosen from the Protein Data Bank (PDB) [<xref ref-type="bibr" rid="b40-ijms-13-14451">40</xref>], and the Root Mean Square Deviation (RMSD) between each structure and every other structure in the set is at least 6 Å, which avoids structural redundancy. Statistical analyses were performed on this protein set, and the resulting probability distributions were used to calculate the Potential of Mean Force (PMF) via the Boltzmann conversion method [<xref ref-type="bibr" rid="b13-ijms-13-14451">13</xref>,<xref ref-type="bibr" rid="b41-ijms-13-14451">41</xref>,<xref ref-type="bibr" rid="b42-ijms-13-14451">42</xref>]:</p>
<disp-formula id="FD4">
<label>(4)</label>
<mml:math id="mm4" display="block">
<mml:semantics id="sm4">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>k</mml:mi></mml:mrow>
<mml:mi>B</mml:mi></mml:msub>
<mml:mi>T</mml:mi>
<mml:mi> </mml:mi>
<mml:mtext>ln</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>k</italic><italic><sub>B</sub></italic> is the Boltzmann constant, <italic>T</italic> is the temperature, and <italic>P</italic><italic><sub>i</sub></italic> = <italic>n</italic><italic><sub>i</sub></italic>/<italic>n</italic><italic><sub>ref</sub></italic> is the probability of a property taking the value <italic>i</italic>, in which the reference number <italic>n</italic><italic><sub>ref</sub></italic> is the total number of observations of the investigated internal coordinate obtained from the statistics of the above-mentioned protein set.</p></sec>
<sec>
<title>3.4. The Non-Bonded Potential and Parameterization</title>
<p>Modeling the non-bonded potential is a key problem in constructing a CG-MD force field. As in a classical AA force field, we assume that the non-bonded interactions can be subdivided into two categories, <italic>i.e.</italic>, van der Waals interactions and electrostatic interactions. They can be formulated as sums of pairwise potential energies:</p>
<disp-formula id="FD5">
<label>(5)</label>
<mml:math id="mm5" display="block">
<mml:semantics id="sm5">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mi>d</mml:mi>
<mml:mi>w</mml:mi></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&lt;</mml:mo>
<mml:mi>j</mml:mi></mml:mrow></mml:munder>
<mml:mrow>
<mml:mn>4</mml:mn>
<mml:msub>
<mml:mrow>
<mml:mi>ɛ</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi></mml:mrow></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow></mml:msubsup></mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>r</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfrac>
<mml:mo>-</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi></mml:mrow>
<mml:mn>6</mml:mn></mml:msubsup></mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>r</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi></mml:mrow>
<mml:mn>6</mml:mn></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<disp-formula id="FD6">
<label>(6)</label>
<mml:math id="mm6" display="block">
<mml:semantics id="sm6">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>U</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>e</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&lt;</mml:mo>
<mml:mi>j</mml:mi></mml:mrow></mml:munder>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>Q</mml:mi></mml:mrow>
<mml:mi>i</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>Q</mml:mi></mml:mrow>
<mml:mi>j</mml:mi></mml:msub></mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
<mml:mi>π</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>ɛ</mml:mi></mml:mrow>
<mml:mn>0</mml:mn></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>ɛ</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi>r</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>
<p>where <italic>c</italic><italic><sub>ij</sub></italic> is the van der Waals interaction parameter, <italic>r</italic><italic><sub>ij</sub></italic> is the distance between CG beads <italic>i</italic> and <italic>j</italic>, <italic>Q</italic><italic><sub>i</sub></italic> and <italic>Q</italic><italic><sub>j</sub></italic> are the charges of beads <italic>i</italic> and <italic>j</italic>, and <italic>ɛ</italic><italic><sub>0</sub></italic> is the vacuum permittivity. The strength of the van der Waals interaction is determined by the well depth <italic>ɛ</italic><italic><sub>ij</sub></italic>, which depends on the types of the interacting CG beads and is determined in the force field parameterization process for all 20 types of CG beads. In the proposed force field, the electrostatic interaction is taken into account through distributed point charges, and four CG beads are treated as charged: the backbone bead <italic>B</italic><italic><sub>ASP</sub></italic> and the side-chain bead <italic>S</italic><italic><sub>GLU</sub></italic> each carry one unit of negative charge, and the side-chain beads <italic>S</italic><italic><sub>ARG</sub></italic> and <italic>S</italic><italic><sub>LYS</sub></italic> each carry one unit of positive charge. The electrostatic interaction between charged beads is calculated via <xref rid="FD6" ref-type="disp-formula">Equation 6</xref> with the relative dielectric constant <italic>ɛ</italic><italic><sub>r</sub></italic> = 1.</p>
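<p>The pairwise sums in <xref rid="FD5" ref-type="disp-formula">Equation 5</xref> and <xref rid="FD6" ref-type="disp-formula">Equation 6</xref> can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the implementation used in this work: the function names are hypothetical, distances are assumed to be in nm and charges in units of the elementary charge, the Coulomb prefactor is an approximate value for these units, and no exclusions or cutoffs are applied because the text does not specify them.</p>
<preformat>
```python
import itertools
import math

# Approximate value of 1 / (4 * pi * eps_0) in kJ mol^-1 nm e^-2.
# The choice of units is an assumption of this sketch.
COULOMB_CONSTANT = 138.935


def lj_pair(r, eps_ij, c_ij):
    """One term of Equation 5: 4 * eps_ij * (c_ij**12 / r**12 - c_ij**6 / r**6)."""
    ratio6 = (c_ij / r) ** 6
    return 4.0 * eps_ij * (ratio6 * ratio6 - ratio6)


def coulomb_pair(r, q_i, q_j, eps_r=1.0):
    """One term of Equation 6, with eps_r = 1 by default as stated in the text."""
    return COULOMB_CONSTANT * q_i * q_j / (eps_r * r)


def nonbonded_energy(positions, charges, eps, c):
    """Sum Equations 5 and 6 over each unordered pair of beads.

    positions: list of (x, y, z) coordinates; charges: list of bead charges;
    eps[i][j] and c[i][j]: pair parameter tables indexed by bead number.
    """
    u_vdw = 0.0
    u_elec = 0.0
    for i, j in itertools.combinations(range(len(positions)), 2):
        r = math.dist(positions[i], positions[j])
        u_vdw += lj_pair(r, eps[i][j], c[i][j])
        u_elec += coulomb_pair(r, charges[i], charges[j])
    return u_vdw, u_elec
```
</preformat>
<p>With this functional form, the pair energy is zero at <italic>r</italic><italic><sub>ij</sub></italic> = <italic>c</italic><italic><sub>ij</sub></italic> and reaches its minimum value of −<italic>ɛ</italic><italic><sub>ij</sub></italic> at <italic>r</italic><italic><sub>ij</sub></italic> = 2<sup>1/6</sup><italic>c</italic><italic><sub>ij</sub></italic>.</p>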
<p>In the coarse-graining, a group of atoms is treated as a single bead and the relative positions of these atoms are fixed, whereas in reality their relative positions vary continuously. The PMF is defined as the potential that gives the average force over all the configurations of a given system, and it is used here to characterize the non-bonded interactions between CG beads. The Umbrella Sampling (US) method [<xref ref-type="bibr" rid="b43-ijms-13-14451">43</xref>] based on AA-MD was applied to the AA molecules corresponding to the 20 CG beads to obtain the van der Waals well depth parameter for the interaction of each bead type with itself, and Lorentz-Berthelot mixing rules were applied to obtain the interaction parameters between different CG beads:</p>
<disp-formula id="FD7">
<label>(7)</label>
<mml:math id="mm7" display="block">
<mml:semantics id="sm7">
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:msubsup>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi></mml:mrow>
<mml:mn>6</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>i</mml:mi></mml:mrow>
<mml:mn>6</mml:mn></mml:msubsup>
<mml:msubsup>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mi>j</mml:mi></mml:mrow>
<mml:mn>6</mml:mn></mml:msubsup>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msubsup>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>i</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow></mml:msubsup>
<mml:msubsup>
<mml:mrow>
<mml:mi>c</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mi>j</mml:mi></mml:mrow>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow></mml:msubsup>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mtd></mml:mtr></mml:mtable></mml:semantics></mml:math></disp-formula>
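<p>A short Python sketch of how <xref rid="FD7" ref-type="disp-formula">Equation 7</xref> could be used to build a table of pair parameters from the self-interaction parameters is given below. As written, both lines of <xref rid="FD7" ref-type="disp-formula">Equation 7</xref> amount to taking the geometric mean of <italic>c</italic><italic><sub>ii</sub></italic> and <italic>c</italic><italic><sub>jj</sub></italic>. The combination of the well depths is not given by <xref rid="FD7" ref-type="disp-formula">Equation 7</xref>, so the geometric mean used for it here is an assumption of this sketch, and the bead names, function names and numerical values are hypothetical.</p>
<preformat>
```python
import itertools
import math


def mix_c6_c12(c_ii, c_jj):
    """Equation 7: geometric means of the 6th and 12th powers of c."""
    c6 = math.sqrt(c_ii ** 6 * c_jj ** 6)
    c12 = math.sqrt(c_ii ** 12 * c_jj ** 12)
    return c6, c12


def mix_well_depth(eps_ii, eps_jj):
    """ASSUMPTION: geometric mean of the well depths (not stated in Equation 7)."""
    return math.sqrt(eps_ii * eps_jj)


def build_pair_table(self_params):
    """Map each unordered pair of bead names to its mixed parameters.

    self_params maps a bead name to its (c_ii, eps_ii) self-interaction values.
    """
    table = {}
    names = sorted(self_params)
    for a, b in itertools.combinations_with_replacement(names, 2):
        c_a, eps_a = self_params[a]
        c_b, eps_b = self_params[b]
        c6, c12 = mix_c6_c12(c_a, c_b)
        table[(a, b)] = {"c6": c6, "c12": c12, "eps": mix_well_depth(eps_a, eps_b)}
    return table


# Hypothetical self-interaction parameters, for illustration only.
pair_table = build_pair_table({"bead_A": (0.4, 4.0), "bead_B": (0.6, 9.0)})
```
</preformat>
<p>For identical beads the mixed values reduce to the self-interaction parameters, so the diagonal entries of the table reproduce the values obtained from the US simulations.</p>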
<p>To perform the US simulations, backbone beads were simulated with the corresponding AA amino acids, and side-chain beads were replaced by analogous compounds, as listed in <xref ref-type="table" rid="t6-ijms-13-14451">Table 6</xref>. The simulations were performed with the GROMACS 4.0.5 package, using fully flexible molecules in vacuum and the canonical NVT ensemble, and with vanishing charges in order to capture the purely non-electrostatic interaction. The GROMOS87 force field was applied to the molecules, and the temperature was kept at 300 K by coupling to a Berendsen thermostat. Two identical AA molecules of a CG bead were placed together and equilibrated for 2000 ps, and then the two molecules were pulled from their equilibrium position to a separation of 15 angstroms along a reaction coordinate via umbrella pulling at a constant pulling rate of 0.001 nm ps<sup>−1</sup>. Snapshots were saved every 1 ps, and the pulling distance was divided into subspaces of 0.5 angstrom each. Finally, a US simulation was run in every subspace for 10 ns, and the Weighted Histogram Analysis Method (WHAM) [<xref ref-type="bibr" rid="b44-ijms-13-14451">44</xref>] was applied to obtain the PMF of the non-bonded interaction between the two identical AA molecules.</p></sec>
<sec>
<title>3.5. Coarse-Grained Water Model and Parameterization</title>
<p>In protein simulations, most of the computational cost is often spent on calculating water-water intermolecular interactions rather than solute-water or solute-solute interactions, so coarse-graining the water can markedly improve the efficiency of MD simulation. An appropriate water model should be constructed for the CG protein-solvent system, and the CG water model should be consistent with the protein CG models in both volume and mass, so that the interactions between protein and solvent can be accurately reproduced. To determine an appropriate coarse-graining methodology for water molecules, we ran a simulation of pure water with the GROMACS 4.0.5 package and the TIP3P water model, using the GROMOS87 force field and the isothermal-isobaric (NPT) ensemble. The system was coupled to a temperature bath at 300 K and a barostat at a pressure of 1 bar, and was simulated for 1 ns. Statistical analysis of the simulation results showed that the average volume of five water molecules is about 140 Å<sup>3</sup> and their mass is 90 amu, which is comparable to the average volume of 120 Å<sup>3</sup> and the average mass of 95 amu of the proposed CG protein beads. Therefore, a CG solvent model composed of five water molecules is adopted. The CG water bead is treated as neutral according to its total charge, so the interactions between CG water beads and other CG beads are mainly through van der Waals forces. In order to determine the parameters of the van der Waals function for the CG water bead, every five nearest water molecules were clustered into a group with the K-means algorithm, and the nearest distances between a group and the adjacent groups were calculated. According to the resulting probability distribution, 0.51 nm is adopted for the parameter <italic>c</italic><italic><sub>ij</sub></italic> in <xref rid="FD5" ref-type="disp-formula">Equation 5</xref>.
Using the same settings as in the previous AA-MD water simulation, the CG water system was simulated with different values of <italic>ɛ</italic><italic><sub>ij</sub></italic>. To determine the best well depth parameter, the bulk density of the CG water system as a function of time was calculated and compared with the density variation of the AA-MD simulation. According to the comparison, <italic>ɛ</italic><italic><sub>ij</sub></italic> = 6 kJ mol<sup>−1</sup> is the best value for reproducing the bulk phase density of water, and it is adopted in our CG force field.</p></sec></sec>
<sec sec-type="conclusions">
<title>4. Conclusions</title>
<p>In this work, 20 CG beads for proteins were constructed according to the characteristics of the 20 amino acids, so that a residue is composed of only one or two CG beads. Correspondingly, with the K-means method, a five-water coarse-grained solvent model was adopted to suit the CG protein model. A force field was developed for the CG protein and solvent model. For all the bonded interactions in proteins, CG beads are divided into two types. All the combinations of these two types in the bonded interactions were analyzed on a subset of known protein structures, and the resulting energy distributions were fitted to various potential functions to formulate the bonded interactions in CG-MD. The umbrella sampling method was applied to the AA molecules of the CG beads to obtain the PMF of the non-bonded interactions between CG beads, and the PMF was fitted to a Lennard-Jones potential function to describe the non-bonded interactions in CG-MD.</p>
<p>The CG model and force field were tested on eight small to medium size proteins. Analysis of the simulation results shows that, without any extra information about the simulated protein structure, the skeleton structure of the protein can be maintained during a long equilibrium dynamics simulation with the proposed coarse-graining methodology. Comparison of the efficiency shows that the proposed CG-MD achieves a 75- to 100-fold computing speedup relative to AA-MD, which is also higher than that of the popular MARTINI model. Meanwhile, the native structure of the proteins is well preserved during the simulations. In addition, the RMSFs of Cα atoms during the simulations show that our CG-MD method can reasonably sample the conformational fluctuations within a protein from a global perspective. However, the simulation results also indicate that the fluctuation of loop structures with a low curvature may be overestimated in some proteins compared with experimental values, and that the reverse holds for loop structures with a high curvature, because of the simplification in the CG-MD. Further work is needed to investigate and carefully treat the loop structures in the protein coarse-graining methodology.</p></sec>
<sec sec-type="supplementary-material">
<title>Supplementary Information</title>
<supplementary-material id="s1-ijms-13-14451" content-type="local-data">
<media xlink:href="ijms-13-14451-s001.pdf" mimetype="application" mime-subtype="pdf"/></supplementary-material></sec></body>
<back>
<ack>
<title>Acknowledgments</title>
<p>The authors gratefully acknowledge financial support for this work from the National Program on Key Basic Research Project (No. 2009CB918501), the Fundamental Research Funds for the Central Universities, and the National Natural Science Funds of China (No. 11202049 and 11072048).</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-ijms-13-14451"><label>1</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Gunsteren</surname><given-names>W.F.</given-names></name><name><surname>Dolenc</surname><given-names>J.</given-names></name></person-group><article-title>Biomolecular simulation: Historical picture and future perspectives</article-title><source>Biochem. Soc. Trans</source><year>2008</year><volume>36</volume><fpage>11</fpage><lpage>15</lpage><pub-id pub-id-type="doi">10.1042/BST0360011</pub-id><pub-id pub-id-type="pmid">18208376</pub-id></citation></ref>
<ref id="b2-ijms-13-14451"><label>2</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Adcock</surname><given-names>S.A.</given-names></name><name><surname>McCammon</surname><given-names>J.A.</given-names></name></person-group><article-title>Molecular dynamics: Survey of methods for simulating the activity of proteins</article-title><source>Chem. Rev</source><year>2006</year><volume>106</volume><fpage>1589</fpage><lpage>1615</lpage><pub-id pub-id-type="doi">10.1021/cr040426m</pub-id><pub-id pub-id-type="pmid">16683746</pub-id></citation></ref>
<ref id="b3-ijms-13-14451"><label>3</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCammon</surname><given-names>J.A.</given-names></name><name><surname>Gelin</surname><given-names>B.R.</given-names></name><name><surname>Karplus</surname><given-names>M.</given-names></name></person-group><article-title>Dynamics of folded proteins</article-title><source>Nature</source><year>1977</year><volume>267</volume><fpage>585</fpage><lpage>590</lpage><pub-id pub-id-type="doi">10.1038/267585a0</pub-id><pub-id pub-id-type="pmid">301613</pub-id></citation></ref>
<ref id="b4-ijms-13-14451"><label>4</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klepeis</surname><given-names>J.L.</given-names></name><name><surname>Lindorff-Larsen</surname><given-names>K.</given-names></name><name><surname>Dror</surname><given-names>R.O.</given-names></name><name><surname>Shaw</surname><given-names>D.E.</given-names></name></person-group><article-title>Long-timescale molecular dynamics simulations of protein structure and function</article-title><source>Curr. Opin. Struct. Biol</source><year>2009</year><volume>19</volume><fpage>120</fpage><lpage>127</lpage><pub-id pub-id-type="doi">10.1016/j.sbi.2009.03.004</pub-id><pub-id pub-id-type="pmid">19361980</pub-id></citation></ref>
<ref id="b5-ijms-13-14451"><label>5</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sanbonmatsu</surname><given-names>K.Y.</given-names></name><name><surname>Tung</surname><given-names>C.-S.</given-names></name></person-group><article-title>High performance computing in biology: Multimillion atom simulations of nanoscale systems</article-title><source>J. Struct. Biol</source><year>2007</year><volume>157</volume><fpage>470</fpage><lpage>480</lpage><pub-id pub-id-type="doi">10.1016/j.jsb.2006.10.023</pub-id><pub-id pub-id-type="pmid">17187988</pub-id></citation></ref>
<ref id="b6-ijms-13-14451"><label>6</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Muller-Plathe</surname><given-names>F.</given-names></name></person-group><article-title>Coarse-graining in polymer simulation: From the atomistic to the mesoscopic scale and back</article-title><source>ChemPhysChem</source><year>2002</year><volume>3</volume><fpage>754</fpage><lpage>769</lpage><pub-id pub-id-type="doi">10.1002/1439-7641(20020916)3:9&lt;754::AID-CPHC754&gt;3.0.CO;2-U</pub-id></citation></ref>
<ref id="b7-ijms-13-14451"><label>7</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kolinski</surname><given-names>A.</given-names></name><name><surname>Skolnick</surname><given-names>J.</given-names></name></person-group><article-title>Reduced models of proteins and their applications</article-title><source>Polymer</source><year>2004</year><volume>45</volume><fpage>511</fpage><lpage>524</lpage><pub-id pub-id-type="doi">10.1016/j.polymer.2003.10.064</pub-id></citation></ref>
<ref id="b8-ijms-13-14451"><label>8</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tozzini</surname><given-names>V.</given-names></name></person-group><article-title>Coarse-grained models for proteins</article-title><source>Curr. Opin. Struct. Biol</source><year>2005</year><volume>15</volume><fpage>144</fpage><lpage>150</lpage><pub-id pub-id-type="doi">10.1016/j.sbi.2005.02.005</pub-id><pub-id pub-id-type="pmid">15837171</pub-id></citation></ref>
<ref id="b9-ijms-13-14451"><label>9</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clementi</surname><given-names>C.</given-names></name></person-group><article-title>Coarse-grained models of protein folding: Toy models or predictive tools</article-title><source>Curr. Opin. Struct. Biol</source><year>2008</year><volume>18</volume><fpage>10</fpage><lpage>15</lpage><pub-id pub-id-type="doi">10.1016/j.sbi.2007.10.005</pub-id><pub-id pub-id-type="pmid">18160277</pub-id></citation></ref>
<ref id="b10-ijms-13-14451"><label>10</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindahl</surname><given-names>E.</given-names></name><name><surname>Sansom</surname><given-names>M.S.</given-names></name></person-group><article-title>Membrane proteins: Molecular dynamics simulation</article-title><source>Curr. Opin. Struct. Biol</source><year>2008</year><volume>18</volume><fpage>425</fpage><lpage>431</lpage><pub-id pub-id-type="doi">10.1016/j.sbi.2008.02.003</pub-id><pub-id pub-id-type="pmid">18406600</pub-id></citation></ref>
<ref id="b11-ijms-13-14451"><label>11</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khalili-Araghi</surname><given-names>F.</given-names></name><name><surname>Gumbart</surname><given-names>J.</given-names></name><name><surname>Wen</surname><given-names>P.</given-names></name><name><surname>Sotomayor</surname><given-names>M.</given-names></name><name><surname>Tajkhorshid</surname><given-names>E.</given-names></name><name><surname>Schulten</surname><given-names>K.</given-names></name></person-group><article-title>Molecular dynamics simulations of membrane channels and transporters</article-title><source>Curr. Opin. Struct. Biol</source><year>2009</year><volume>19</volume><fpage>128</fpage><lpage>137</lpage><pub-id pub-id-type="doi">10.1016/j.sbi.2009.02.011</pub-id><pub-id pub-id-type="pmid">19345092</pub-id></citation></ref>
<ref id="b12-ijms-13-14451"><label>12</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Monticelli</surname><given-names>L.</given-names></name><name><surname>Kandasamy</surname><given-names>S.K.</given-names></name><name><surname>Periole</surname><given-names>X.</given-names></name><name><surname>Larson</surname><given-names>R.G.</given-names></name><name><surname>Tieleman</surname><given-names>D.P.</given-names></name><name><surname>Marrink</surname><given-names>S.</given-names></name></person-group><article-title>The MARTINI coarse-grained force field: Extension to proteins</article-title><source>J. Chem. Theory Comput</source><year>2008</year><volume>4</volume><fpage>819</fpage><lpage>834</lpage><pub-id pub-id-type="doi">10.1021/ct700324x</pub-id></citation></ref>
<ref id="b13-ijms-13-14451"><label>13</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ha-Duong</surname><given-names>T.</given-names></name></person-group><article-title>Protein backbone dynamics simulations using coarse-grained bonded potentials and simplified hydrogen bonds</article-title><source>J. Chem. Theory Comput</source><year>2010</year><volume>6</volume><fpage>761</fpage><lpage>773</lpage><pub-id pub-id-type="doi">10.1021/ct900408s</pub-id></citation></ref>
<ref id="b14-ijms-13-14451"><label>14</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Han</surname><given-names>W.</given-names></name><name><surname>Wan</surname><given-names>C.K.</given-names></name><name><surname>Jiang</surname><given-names>F.</given-names></name><name><surname>Wu</surname><given-names>Y.D.</given-names></name></person-group><article-title>PACE force field for protein simulations. 1. Full parameterization of version1 and verification</article-title><source>J. Chem. Theory Comput</source><year>2010</year><volume>6</volume><fpage>3373</fpage><lpage>3389</lpage><pub-id pub-id-type="doi">10.1021/ct1003127</pub-id></citation></ref>
<ref id="b15-ijms-13-14451"><label>15</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Han</surname><given-names>W.</given-names></name><name><surname>Wan</surname><given-names>C.K.</given-names></name><name><surname>Jiang</surname><given-names>F.</given-names></name><name><surname>Wu</surname><given-names>Y.D.</given-names></name></person-group><article-title>PACE force field for protein simulations. 2. Folding simulations of peptides</article-title><source>J. Chem. Theory Comput</source><year>2010</year><volume>6</volume><fpage>3390</fpage><lpage>3402</lpage><pub-id pub-id-type="doi">10.1021/ct100313a</pub-id></citation></ref>
<ref id="b16-ijms-13-14451"><label>16</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Basdevant</surname><given-names>N.</given-names></name><name><surname>Borgis</surname><given-names>D.</given-names></name><name><surname>Ha-Duong</surname><given-names>T.</given-names></name></person-group><article-title>A coarse-grained protein-protein potential derived from an all-atom force field</article-title><source>J. Phys. Chem. B</source><year>2007</year><volume>111</volume><fpage>9390</fpage><lpage>9399</lpage><pub-id pub-id-type="doi">10.1021/jp0727190</pub-id><pub-id pub-id-type="pmid">17616119</pub-id></citation></ref>
<ref id="b17-ijms-13-14451"><label>17</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Han</surname><given-names>W.</given-names></name><name><surname>Wu</surname><given-names>Y.D.</given-names></name></person-group><article-title>Coarse-grained protein model coupled with a coarse-grained water model: Molecular dynamics study of polyalanine-based peptides</article-title><source>J. Chem. Theory Comput</source><year>2007</year><volume>3</volume><fpage>2146</fpage><lpage>2161</lpage><pub-id pub-id-type="doi">10.1021/ct700151x</pub-id></citation></ref>
<ref id="b18-ijms-13-14451"><label>18</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bereau</surname><given-names>T.</given-names></name><name><surname>Deserno</surname><given-names>M.</given-names></name></person-group><article-title>Generic coarse-grained model for protein folding and aggregation</article-title><source>J. Chem. Phys</source><year>2009</year><volume>130</volume><fpage>235106</fpage><pub-id pub-id-type="doi">10.1063/1.3152842</pub-id><pub-id pub-id-type="pmid">19548767</pub-id></citation></ref>
<ref id="b19-ijms-13-14451"><label>19</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Han</surname><given-names>W.</given-names></name><name><surname>Wan</surname><given-names>C.K.</given-names></name><name><surname>Wu</surname><given-names>Y.D.</given-names></name></person-group><article-title>Toward a coarse-grained protein model coupled with a coarse-grained solvent model: Solvation free energies of amino acid side chains</article-title><source>J. Chem. Theory Comput</source><year>2008</year><volume>4</volume><fpage>1891</fpage><lpage>1901</lpage><pub-id pub-id-type="doi">10.1021/ct800184c</pub-id></citation></ref>
<ref id="b20-ijms-13-14451"><label>20</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>DeVane</surname><given-names>R.</given-names></name><name><surname>Shinoda</surname><given-names>W.</given-names></name><name><surname>Moore</surname><given-names>P.B.</given-names></name><name><surname>Klein</surname><given-names>M.L.</given-names></name></person-group><article-title>Transferable coarse grain nonbonded interaction model for amino acids</article-title><source>J. Chem. Theory Comput</source><year>2009</year><volume>5</volume><fpage>2115</fpage><lpage>2124</lpage><pub-id pub-id-type="doi">10.1021/ct800441u</pub-id><pub-id pub-id-type="pmid">20161179</pub-id></citation></ref>
<ref id="b21-ijms-13-14451"><label>21</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shih</surname><given-names>A.Y.</given-names></name><name><surname>Arkhipov</surname><given-names>A.</given-names></name><name><surname>Freddolino</surname><given-names>P.L.</given-names></name><name><surname>Schulten</surname><given-names>K.</given-names></name></person-group><article-title>A coarse grained protein-lipid model with application to lipoprotein particles</article-title><source>J. Phys. Chem. B</source><year>2006</year><volume>110</volume><fpage>3674</fpage><lpage>3684</lpage><pub-id pub-id-type="doi">10.1021/jp0550816</pub-id><pub-id pub-id-type="pmid">16494423</pub-id></citation></ref>
<ref id="b22-ijms-13-14451"><label>22</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname><given-names>J.</given-names></name><name><surname>Thorpe</surname><given-names>L.F.</given-names></name><name><surname>Izvekov</surname><given-names>S.</given-names></name><name><surname>Voth</surname><given-names>G.A.</given-names></name></person-group><article-title>Coarse-grained peptide modeling using a systematic multiscale approach</article-title><source>Biophys. J</source><year>2007</year><volume>92</volume><fpage>4289</fpage><lpage>4303</lpage><pub-id pub-id-type="doi">10.1529/biophysj.106.094425</pub-id><pub-id pub-id-type="pmid">17400700</pub-id></citation></ref>
<ref id="b23-ijms-13-14451"><label>23</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Korkut</surname><given-names>A.</given-names></name><name><surname>Hendrickson</surname><given-names>W.A.</given-names></name></person-group><article-title>A force field for virtual atom molecular mechanics of proteins</article-title><source>Proc. Natl. Acad. Sci. USA</source><year>2009</year><volume>106</volume><fpage>15667</fpage><lpage>15672</lpage><pub-id pub-id-type="doi">10.1073/pnas.0907674106</pub-id><pub-id pub-id-type="pmid">19717427</pub-id></citation></ref>
<ref id="b24-ijms-13-14451"><label>24</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tozzini</surname><given-names>V.</given-names></name><name><surname>Rocchia</surname><given-names>W.</given-names></name><name><surname>McCammon</surname><given-names>J.A.</given-names></name></person-group><article-title>Mapping all-atom models onto one-bead coarse grained models: General properties and applications to a minimal polypeptide model</article-title><source>J. Chem. Theory Comput</source><year>2006</year><volume>2</volume><fpage>667</fpage><lpage>673</lpage><pub-id pub-id-type="doi">10.1021/ct050294k</pub-id><pub-id pub-id-type="pmid">19461947</pub-id></citation></ref>
<ref id="b25-ijms-13-14451"><label>25</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chang</surname><given-names>C.A.</given-names></name><name><surname>Trylska</surname><given-names>J.</given-names></name><name><surname>Tozzini</surname><given-names>V.</given-names></name><name><surname>McCammon</surname><given-names>J.A.</given-names></name></person-group><article-title>Binding pathways of ligands to HIV-1 protease: Coarse-grained and atomistic simulations</article-title><source>Chem. Biol. Drug Des</source><year>2007</year><volume>69</volume><fpage>5</fpage><lpage>13</lpage><pub-id pub-id-type="doi">10.1111/j.1747-0285.2007.00464.x</pub-id><pub-id pub-id-type="pmid">17313452</pub-id></citation></ref>
<ref id="b26-ijms-13-14451"><label>26</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Korkut</surname><given-names>A.</given-names></name><name><surname>Hendrickson</surname><given-names>W.A.</given-names></name></person-group><article-title>Computation of conformational transitions in proteins by virtual atom molecular mechanics as validated in application to adenylate kinase</article-title><source>Proc. Natl. Acad. Sci. USA</source><year>2009</year><volume>106</volume><fpage>15673</fpage><lpage>15678</lpage><pub-id pub-id-type="doi">10.1073/pnas.0907684106</pub-id><pub-id pub-id-type="pmid">19706894</pub-id></citation></ref>
<ref id="b27-ijms-13-14451"><label>27</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alemani</surname><given-names>D.</given-names></name><name><surname>Collu</surname><given-names>F.</given-names></name><name><surname>Cascella</surname><given-names>M.</given-names></name><name><surname>Peraro</surname><given-names>M.D.</given-names></name></person-group><article-title>A nonradial coarse-grained potential for proteins produces naturally stable secondary structure elements</article-title><source>J. Chem. Theory Comput</source><year>2010</year><volume>6</volume><fpage>315</fpage><lpage>324</lpage><pub-id pub-id-type="doi">10.1021/ct900457z</pub-id></citation></ref>
<ref id="b28-ijms-13-14451"><label>28</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arkhipov</surname><given-names>A.</given-names></name><name><surname>Freddolino</surname><given-names>P.L.</given-names></name><name><surname>Schulten</surname><given-names>K.</given-names></name></person-group><article-title>Stability and dynamics of virus capsids described by coarse-grained modeling</article-title><source>Structure</source><year>2006</year><volume>14</volume><fpage>1767</fpage><lpage>1777</lpage><pub-id pub-id-type="doi">10.1016/j.str.2006.10.003</pub-id><pub-id pub-id-type="pmid">17161367</pub-id></citation></ref>
<ref id="b29-ijms-13-14451"><label>29</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arkhipov</surname><given-names>A.</given-names></name><name><surname>Yin</surname><given-names>Y.</given-names></name><name><surname>Schulten</surname><given-names>K.</given-names></name></person-group><article-title>Four-scale description of membrane sculpting by BAR domains</article-title><source>Biophys. J</source><year>2008</year><volume>95</volume><fpage>2806</fpage><lpage>2861</lpage><pub-id pub-id-type="doi">10.1529/biophysj.108.132563</pub-id><pub-id pub-id-type="pmid">18515394</pub-id></citation></ref>
<ref id="b30-ijms-13-14451"><label>30</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arkhipov</surname><given-names>A.</given-names></name><name><surname>Freddolino</surname><given-names>P.L.</given-names></name><name><surname>Imada</surname><given-names>K.</given-names></name><name><surname>Namba</surname><given-names>K.</given-names></name><name><surname>Schulten</surname><given-names>K.</given-names></name></person-group><article-title>Coarse-grained molecular dynamics simulations of a rotating bacterial flagellum</article-title><source>Biophys. J</source><year>2006</year><volume>91</volume><fpage>4589</fpage><lpage>4597</lpage><pub-id pub-id-type="doi">10.1529/biophysj.106.093443</pub-id><pub-id pub-id-type="pmid">16997871</pub-id></citation></ref>
<ref id="b31-ijms-13-14451"><label>31</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>West</surname><given-names>B.</given-names></name><name><surname>Brown</surname><given-names>F.L.H.</given-names></name><name><surname>Schmid</surname><given-names>B.</given-names></name></person-group><article-title>Membrane-protein interactions in a generic coarse-grained model for lipid bilayers</article-title><source>Biophys. J</source><year>2009</year><volume>96</volume><fpage>101</fpage><lpage>115</lpage><pub-id pub-id-type="doi">10.1529/biophysj.108.138677</pub-id><pub-id pub-id-type="pmid">18835907</pub-id></citation></ref>
<ref id="b32-ijms-13-14451"><label>32</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spijker</surname><given-names>P.</given-names></name><name><surname>Hoof</surname><given-names>B.V.</given-names></name><name><surname>Debertrand</surname><given-names>M.</given-names></name><name><surname>Markvoort</surname><given-names>A.J.</given-names></name><name><surname>Vaidehi</surname><given-names>N.</given-names></name><name><surname>Hilbers</surname><given-names>P.A.J.</given-names></name></person-group><article-title>Coarse grained molecular dynamics simulations of transmembrane protein-lipid systems</article-title><source>Int. J. Mol. Sci</source><year>2010</year><volume>11</volume><fpage>2393</fpage><lpage>2420</lpage><pub-id pub-id-type="doi">10.3390/ijms11062393</pub-id><pub-id pub-id-type="pmid">20640160</pub-id></citation></ref>
<ref id="b33-ijms-13-14451"><label>33</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Treptow</surname><given-names>W.</given-names></name><name><surname>Marrink</surname><given-names>S.</given-names></name><name><surname>Tarek</surname><given-names>M.</given-names></name></person-group><article-title>Gating motions in voltage-gated potassium channels revealed by coarse-grained molecular dynamics simulations</article-title><source>J. Phys. Chem. B</source><year>2008</year><volume>112</volume><fpage>3277</fpage><lpage>3282</lpage><pub-id pub-id-type="doi">10.1021/jp709675e</pub-id><pub-id pub-id-type="pmid">18293960</pub-id></citation></ref>
<ref id="b34-ijms-13-14451"><label>34</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guardiani</surname><given-names>C.</given-names></name><name><surname>Livi</surname><given-names>R.</given-names></name><name><surname>Cecconi</surname><given-names>F.</given-names></name></person-group><article-title>Coarse grained modeling and approaches to protein folding</article-title><source>Curr. Bioinforma</source><year>2010</year><volume>5</volume><fpage>217</fpage><lpage>240</lpage><pub-id pub-id-type="doi">10.2174/157489310792006729</pub-id></citation></ref>
<ref id="b35-ijms-13-14451"><label>35</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liwo</surname><given-names>A.</given-names></name><name><surname>Khalili</surname><given-names>M.</given-names></name><name><surname>Scheraga</surname><given-names>H.A.</given-names></name></person-group><article-title>Ab initio simulations of protein-folding pathways by molecular dynamics with the united-residue model of polypeptide chains</article-title><source>Proc. Natl. Acad. Sci. USA</source><year>2005</year><volume>102</volume><fpage>2362</fpage><lpage>2367</lpage><pub-id pub-id-type="doi">10.1073/pnas.0408885102</pub-id><pub-id pub-id-type="pmid">15677316</pub-id></citation></ref>
<ref id="b36-ijms-13-14451"><label>36</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hall</surname><given-names>B.</given-names></name><name><surname>Sansom</surname><given-names>M.S.P.</given-names></name></person-group><article-title>Coarse-grained MD simulations and protein-protein interactions: The cohesin-dockerin system</article-title><source>J. Chem. Theory Comput</source><year>2009</year><volume>5</volume><fpage>2465</fpage><lpage>2471</lpage><pub-id pub-id-type="doi">10.1021/ct900140w</pub-id></citation></ref>
<ref id="b37-ijms-13-14451"><label>37</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Feig</surname><given-names>M.</given-names></name></person-group><article-title>Is alanine dipeptide a good model for representing the torsional preferences of protein backbone?</article-title><source>J. Chem. Theory Comput</source><year>2008</year><volume>4</volume><fpage>1555</fpage><lpage>1564</lpage><pub-id pub-id-type="doi">10.1021/ct800153n</pub-id></citation></ref>
<ref id="b38-ijms-13-14451"><label>38</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hess</surname><given-names>B.</given-names></name><name><surname>Kutzner</surname><given-names>C.</given-names></name><name><surname>Spoel</surname><given-names>D.</given-names></name><name><surname>Lindahl</surname><given-names>E.</given-names></name></person-group><article-title>GROMACS 4: Algorithms for highly efficient, load-balanced, and scalable molecular simulation</article-title><source>J. Chem. Theory Comput</source><year>2008</year><volume>4</volume><fpage>435</fpage><lpage>447</lpage><pub-id pub-id-type="doi">10.1021/ct700301q</pub-id></citation></ref>
<ref id="b39-ijms-13-14451"><label>39</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meyerguz</surname><given-names>L.</given-names></name><name><surname>Grasso</surname><given-names>C.</given-names></name><name><surname>Kleinberg</surname><given-names>J.</given-names></name><name><surname>Elber</surname><given-names>R.</given-names></name></person-group><article-title>Computational analysis of sequence selection mechanisms</article-title><source>Structure</source><year>2004</year><volume>12</volume><fpage>547</fpage><lpage>557</lpage><pub-id pub-id-type="doi">10.1016/j.str.2004.02.018</pub-id><pub-id pub-id-type="pmid">15062078</pub-id></citation></ref>
<ref id="b40-ijms-13-14451"><label>40</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berman</surname><given-names>H.M.</given-names></name><name><surname>Westbrook</surname><given-names>J.</given-names></name><name><surname>Feng</surname><given-names>Z.</given-names></name><name><surname>Gilliland</surname><given-names>G.</given-names></name><name><surname>Bhat</surname><given-names>T.N.</given-names></name><name><surname>Weissig</surname><given-names>H.</given-names></name><name><surname>Shindyalov</surname><given-names>I.N.</given-names></name><name><surname>Bourne</surname><given-names>P.E.</given-names></name></person-group><article-title>The Protein Data Bank</article-title><source>Nucl. Acids Res</source><year>2000</year><volume>28</volume><fpage>235</fpage><lpage>242</lpage><pub-id pub-id-type="doi">10.1093/nar/28.1.235</pub-id><pub-id pub-id-type="pmid">10592235</pub-id></citation></ref>
<ref id="b41-ijms-13-14451"><label>41</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reith</surname><given-names>D.</given-names></name><name><surname>Pütz</surname><given-names>M.</given-names></name><name><surname>Müller-Plathe</surname><given-names>F.</given-names></name></person-group><article-title>Deriving effective mesoscale potentials from atomistic simulations</article-title><source>J. Comput. Chem</source><year>1997</year><volume>29</volume><fpage>292</fpage><lpage>308</lpage></citation></ref>
<ref id="b42-ijms-13-14451"><label>42</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tozzini</surname><given-names>V.</given-names></name><name><surname>McCammon</surname><given-names>J.A.</given-names></name></person-group><article-title>A coarse grained model for the dynamics of flap opening in HIV-1 protease</article-title><source>Chem. Phys. Lett</source><year>2005</year><volume>413</volume><fpage>123</fpage><lpage>128</lpage><pub-id pub-id-type="doi">10.1016/j.cplett.2005.07.075</pub-id></citation></ref>
<ref id="b43-ijms-13-14451"><label>43</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Torrie</surname><given-names>G.M.</given-names></name><name><surname>Valleau</surname><given-names>J.P.</given-names></name></person-group><article-title>Nonphysical sampling distributions in Monte Carlo free-energy estimation: Umbrella sampling</article-title><source>J. Comput. Phys</source><year>1977</year><volume>23</volume><fpage>187</fpage><lpage>199</lpage><pub-id pub-id-type="doi">10.1016/0021-9991(77)90121-8</pub-id></citation></ref>
<ref id="b44-ijms-13-14451"><label>44</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname><given-names>S.</given-names></name><name><surname>Bouzida</surname><given-names>D.</given-names></name><name><surname>Swendsen</surname><given-names>R.H.</given-names></name><name><surname>Kollman</surname><given-names>P.A.</given-names></name><name><surname>Rosenberg</surname><given-names>J.M.</given-names></name></person-group><article-title>The weighted histogram analysis method for free-energy calculations on biomolecules. I. The method</article-title><source>J. Comput. Chem</source><year>1992</year><volume>13</volume><fpage>1011</fpage><lpage>1021</lpage><pub-id pub-id-type="doi">10.1002/jcc.540130812</pub-id></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-ijms-13-14451" position="float">
<label>Figure 1</label>
<caption>
<p>The bond length distributions of the <italic>B</italic>–<italic>B</italic> and <italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic> bonds. <italic>B</italic> denotes the backbone bead, and <italic>S</italic><italic><sub>i</sub></italic> denotes the side-chain beads, which are shown in distinct patterns.</p></caption>
<graphic xlink:href="ijms-13-14451f1.gif"/></fig>
<fig id="f2-ijms-13-14451" position="float">
<label>Figure 2</label>
<caption>
<p>The angle bending energy profiles of <italic>B</italic>–<italic>B</italic>–<italic>B</italic>, <italic>B</italic>–<italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic> and <italic>S</italic><italic><sub>i</sub></italic>–<italic>B</italic>–<italic>B</italic> and the fitted potential function curves (black curves). <italic>B</italic> denotes the backbone bead, and <italic>S</italic><italic><sub>i</sub></italic> denotes the side-chain beads shown in distinct patterns and colors.</p></caption>
<graphic xlink:href="ijms-13-14451f2a.gif"/>
<graphic xlink:href="ijms-13-14451f2b.gif"/></fig>
<fig id="f3-ijms-13-14451" position="float">
<label>Figure 3</label>
<caption>
<p>The dihedral torsion energy profiles of (<bold>A</bold>) <italic>S</italic><italic><sub>i</sub></italic>–<italic>B</italic>–<italic>B</italic>–<italic>S</italic><italic><sub>j</sub></italic>, (<bold>B</bold>) <italic>S</italic><italic><sub>i</sub></italic>–<italic>B</italic>–<italic>B</italic>–<italic>B</italic>, (<bold>C</bold>) <italic>B</italic>–<italic>B</italic>–<italic>B</italic>–<italic>S</italic><italic><sub>i</sub></italic> and (<bold>D</bold>) <italic>B</italic>–<italic>B</italic>–<italic>B</italic>–<italic>B</italic> and the fitted potential function curves (black curves). <italic>B</italic> denotes the backbone bead, and <italic>S</italic><italic><sub>i</sub></italic>/<italic>S</italic><italic><sub>j</sub></italic> denotes the side-chain beads shown in distinct patterns and colors.</p></caption>
<graphic xlink:href="ijms-13-14451f3.gif"/></fig>
<fig id="f4-ijms-13-14451" position="float">
<label>Figure 4</label>
<caption>
<p>The histograms of the configurations within the umbrella sampling windows (<bold>A</bold>) and the potential of mean force as a function of the distance between two ALA molecules (<bold>B</bold>).</p></caption>
<graphic xlink:href="ijms-13-14451f4.gif"/></fig>
<fig id="f5-ijms-13-14451" position="float">
<label>Figure 5</label>
<caption>
<p>The potential of mean force between non-bonded homo pairs of coarse-grained (CG) beads (<italic>B</italic><italic><sub>GLY</sub></italic>, <italic>B</italic><italic><sub>SER</sub></italic>, <italic>S</italic><italic><sub>GLU</sub></italic> and <italic>S</italic><italic><sub>ILE</sub></italic>) as a function of their distance, derived by the umbrella sampling method from all-atom simulations (solid curves), and the van der Waals potential obtained by fitting the potential of mean force with the Lennard-Jones function (dashed curves).</p></caption>
<graphic xlink:href="ijms-13-14451f5.gif"/></fig>
<fig id="f6-ijms-13-14451" position="float">
<label>Figure 6</label>
<caption>
<p>Root mean square deviation profiles of the Cα atoms for the eight proteins.</p></caption>
<graphic xlink:href="ijms-13-14451f6.gif"/></fig>
<fig id="f7-ijms-13-14451" position="float">
<label>Figure 7</label>
<caption>
<p>Snapshots of the 1000 ns coarse-grained molecular dynamics simulation for protein 2AAS at 0 ns (<bold>A</bold>), 250 ns (<bold>B</bold>), 450 ns (<bold>C</bold>), 480 ns (<bold>D</bold>), 750 ns (<bold>E</bold>) and 1000 ns (<bold>F</bold>).</p></caption>
<graphic xlink:href="ijms-13-14451f7.gif"/></fig>
<fig id="f8-ijms-13-14451" position="float">
<label>Figure 8</label>
<caption>
<p>Residue root mean square fluctuation profiles (dashed curves) relative to the averaged conformations, compared with NMR experiments (solid curves), for proteins 1BTA, 1D3Z, 1FKS and 3GB1.</p></caption>
<graphic xlink:href="ijms-13-14451f8.gif"/></fig>
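<p>For reference, the residue fluctuation plotted in Figure 8 can be written explicitly. The expression below is a sketch that assumes the conventional definition of the root mean square fluctuation about the time-averaged structure; the symbols <italic>T</italic> (number of trajectory frames), <bold>r</bold><italic><sub>i</sub></italic>(<italic>t</italic>) (position of the bead representing residue <italic>i</italic> in frame <italic>t</italic>, after superposition on a reference structure) and the angle brackets (average over the trajectory) are notation introduced here for illustration and are not taken from the main text.
<disp-formula><tex-math>\mathrm{RMSF}_{i} = \sqrt{\frac{1}{T}\sum_{t=1}^{T}\left\lVert \mathbf{r}_{i}(t) - \langle \mathbf{r}_{i} \rangle \right\rVert^{2}}, \qquad \langle \mathbf{r}_{i} \rangle = \frac{1}{T}\sum_{t=1}^{T}\mathbf{r}_{i}(t)</tex-math></disp-formula>
Under this definition, a residue whose position stays close to its average over the whole trajectory has a small fluctuation value, whereas flexible loops and termini typically give larger values.</p>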
<fig id="f9-ijms-13-14451" position="float">
<label>Figure 9</label>
<caption>
<p>The coarse-grained models of 20 protein amino acids.</p></caption>
<graphic xlink:href="ijms-13-14451f9.gif"/></fig>
<fig id="f10-ijms-13-14451" position="float">
<label>Figure 10</label>
<caption>
<p>The coarse-grained protein model: I and II denote the backbone-backbone bead and backbone-side-chain bead bond stretching interactions, respectively, θ denotes the virtual angle, and τ is the virtual dihedral angle.</p></caption>
<graphic xlink:href="ijms-13-14451f10.gif"/></fig>
<table-wrap id="t1-ijms-13-14451" position="float">
<label>Table 1</label>
<caption>
<p>The equilibrium bond lengths between ten side-chain beads and their backbone beads.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">Bond</th>
<th align="center" valign="bottom">Length (nm)</th>
<th align="center" valign="bottom">Bond</th>
<th align="center" valign="bottom">Length (nm)</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top"><italic>B</italic>–<italic>S</italic><italic><sub>ARG</sub></italic></td>
<td align="center" valign="top">0.406</td>
<td align="center" valign="top"><italic>B</italic>–<italic>S</italic><italic><sub>LYS</sub></italic></td>
<td align="center" valign="top">0.344</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic>–<italic>S</italic><italic><sub>GLN</sub></italic></td>
<td align="center" valign="top">0.301</td>
<td align="center" valign="top"><italic>B</italic>–<italic>S</italic><italic><sub>MET</sub></italic></td>
<td align="center" valign="top">0.287</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic>–<italic>S</italic><italic><sub>GLU</sub></italic></td>
<td align="center" valign="top">0.295</td>
<td align="center" valign="top"><italic>B</italic>–<italic>S</italic><italic><sub>PHE</sub></italic></td>
<td align="center" valign="top">0.333</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic>–<italic>S</italic><italic><sub>HIS</sub></italic></td>
<td align="center" valign="top">0.307</td>
<td align="center" valign="top"><italic>B</italic>–<italic>S</italic><italic><sub>TRP</sub></italic></td>
<td align="center" valign="top">0.381</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic>–<italic>S</italic><italic><sub>ILE</sub></italic></td>
<td align="center" valign="top">0.226</td>
<td align="center" valign="top"><italic>B</italic>–<italic>S</italic><italic><sub>TYR</sub></italic></td>
<td align="center" valign="top">0.371</td></tr></tbody></table></table-wrap>
<table-wrap id="t2-ijms-13-14451" position="float">
<label>Table 2</label>
<caption>
<p>The finite distance <italic>c</italic><italic><sub>ij</sub></italic> at which the van der Waals potential between two interacting beads is equal to zero.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">Interacting beads</th>
<th align="center" valign="bottom">Distance <italic>c</italic><italic><sub>ij</sub></italic> (nm)</th>
<th align="center" valign="bottom">Interacting beads</th>
<th align="center" valign="bottom">Distance <italic>c</italic><italic><sub>ij</sub></italic> (nm)</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top"><italic>B</italic><italic><sub>ALA</sub></italic>–<italic>B</italic><italic><sub>ALA</sub></italic></td>
<td align="center" valign="top">0.50</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>ARG</sub></italic>–<italic>S</italic><italic><sub>ARG</sub></italic></td>
<td align="center" valign="top">0.60</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic><italic><sub>ASN</sub></italic>–<italic>B</italic><italic><sub>ASN</sub></italic></td>
<td align="center" valign="top">0.60</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>GLN</sub></italic>–<italic>S</italic><italic><sub>GLN</sub></italic></td>
<td align="center" valign="top">0.45</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic><italic><sub>ASP</sub></italic>–<italic>B</italic><italic><sub>ASP</sub></italic></td>
<td align="center" valign="top">0.55</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>GLU</sub></italic>–<italic>S</italic><italic><sub>GLU</sub></italic></td>
<td align="center" valign="top">0.45</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic><italic><sub>CYS</sub></italic>–<italic>B</italic><italic><sub>CYS</sub></italic></td>
<td align="center" valign="top">0.50</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>HIS</sub></italic>–<italic>S</italic><italic><sub>HIS</sub></italic></td>
<td align="center" valign="top">0.45</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic><italic><sub>GLY</sub></italic>–<italic>B</italic><italic><sub>GLY</sub></italic></td>
<td align="center" valign="top">0.40</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>ILE</sub></italic>–<italic>S</italic><italic><sub>ILE</sub></italic></td>
<td align="center" valign="top">0.50</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic><italic><sub>LEU</sub></italic>–<italic>B</italic><italic><sub>LEU</sub></italic></td>
<td align="center" valign="top">0.55</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>LYS</sub></italic>–<italic>S</italic><italic><sub>LYS</sub></italic></td>
<td align="center" valign="top">0.45</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic><italic><sub>PRO</sub></italic>–<italic>B</italic><italic><sub>PRO</sub></italic></td>
<td align="center" valign="top">0.65</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>MET</sub></italic>–<italic>S</italic><italic><sub>MET</sub></italic></td>
<td align="center" valign="top">0.45</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic><italic><sub>SER</sub></italic>–<italic>B</italic><italic><sub>SER</sub></italic></td>
<td align="center" valign="top">0.50</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>PHE</sub></italic>–<italic>S</italic><italic><sub>PHE</sub></italic></td>
<td align="center" valign="top">0.45</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic><italic><sub>THR</sub></italic>–<italic>B</italic><italic><sub>THR</sub></italic></td>
<td align="center" valign="top">0.50</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>TRP</sub></italic>–<italic>S</italic><italic><sub>TRP</sub></italic></td>
<td align="center" valign="top">0.65</td></tr>
<tr>
<td align="center" valign="top"><italic>B</italic><italic><sub>VAL</sub></italic>–<italic>B</italic><italic><sub>VAL</sub></italic></td>
<td align="center" valign="top">0.50</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>TYR</sub></italic>–<italic>S</italic><italic><sub>TYR</sub></italic></td>
<td align="center" valign="top">0.55</td></tr></tbody></table></table-wrap>
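<p>The role of <italic>c</italic><italic><sub>ij</sub></italic> in Table 2 can be illustrated with a short worked relation. This is a sketch that assumes the van der Waals term has the standard 12-6 Lennard-Jones form; the well depth ε<italic><sub>ij</sub></italic> and the symbol <italic>r</italic><sub>min</sub> are notation assumed here for illustration. Under that assumption the potential vanishes at <italic>r</italic><italic><sub>ij</sub></italic> = <italic>c</italic><italic><sub>ij</sub></italic>, and its minimum lies slightly further out:
<disp-formula><tex-math>U_{\mathrm{vdW}}(r_{ij}) = 4\varepsilon_{ij}\left[\left(\frac{c_{ij}}{r_{ij}}\right)^{12} - \left(\frac{c_{ij}}{r_{ij}}\right)^{6}\right], \qquad U_{\mathrm{vdW}}(c_{ij}) = 0, \qquad r_{\min} = 2^{1/6}\,c_{ij} \approx 1.122\,c_{ij}</tex-math></disp-formula>
For example, with <italic>c</italic><italic><sub>ij</sub></italic> = 0.40 nm for the <italic>B</italic><italic><sub>GLY</sub></italic>–<italic>B</italic><italic><sub>GLY</sub></italic> pair, the minimum of such a potential would fall near 0.449 nm.</p>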
<table-wrap id="t3-ijms-13-14451" position="float">
<label>Table 3</label>
<caption>
<p>The simulation information of eight protein systems.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">System</th>
<th align="center" valign="bottom">PDB ID</th>
<th align="center" valign="bottom">Number of residues</th>
<th align="center" valign="bottom">Number of CG waters</th>
<th align="center" valign="bottom">Number of CG beads</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">Barstar</td>
<td align="center" valign="top">1BTA</td>
<td align="center" valign="top">89</td>
<td align="center" valign="top">939</td>
<td align="center" valign="top">1069</td></tr>
<tr>
<td align="center" valign="top">CheY</td>
<td align="center" valign="top">1CYE</td>
<td align="center" valign="top">129</td>
<td align="center" valign="top">1196</td>
<td align="center" valign="top">1375</td></tr>
<tr>
<td align="center" valign="top">Ubiquitin</td>
<td align="center" valign="top">1D3Z</td>
<td align="center" valign="top">76</td>
<td align="center" valign="top">1013</td>
<td align="center" valign="top">1124</td></tr>
<tr>
<td align="center" valign="top">FKBP12</td>
<td align="center" valign="top">1FKS</td>
<td align="center" valign="top">107</td>
<td align="center" valign="top">1264</td>
<td align="center" valign="top">1417</td></tr>
<tr>
<td align="center" valign="top">Barnase</td>
<td align="center" valign="top">1FW7</td>
<td align="center" valign="top">110</td>
<td align="center" valign="top">1157</td>
<td align="center" valign="top">1312</td></tr>
<tr>
<td align="center" valign="top">RNase H</td>
<td align="center" valign="top">1RCH</td>
<td align="center" valign="top">155</td>
<td align="center" valign="top">1982</td>
<td align="center" valign="top">2207</td></tr>
<tr>
<td align="center" valign="top">RNase A</td>
<td align="center" valign="top">2AAS</td>
<td align="center" valign="top">124</td>
<td align="center" valign="top">1126</td>
<td align="center" valign="top">1296</td></tr>
<tr>
<td align="center" valign="top">Protein G</td>
<td align="center" valign="top">3GB1</td>
<td align="center" valign="top">56</td>
<td align="center" valign="top">887</td>
<td align="center" valign="top">963</td></tr></tbody></table></table-wrap>
<table-wrap id="t4-ijms-13-14451" position="float">
<label>Table 4</label>
<caption>
<p>Resulting root mean square deviations from experimental structures of eight proteins during coarse-grained simulations compared with all-atom simulations (standard deviations are given in parentheses).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="middle" rowspan="3">PDB</th>
<th colspan="3" align="center" valign="top">CG-MD</th>
<th colspan="3" align="center" valign="top">AA-MD <xref ref-type="table-fn" rid="tfn1-ijms-13-14451">*</xref></th></tr>
<tr>
<th colspan="3" align="left" valign="top">
<hr/></th>
<th colspan="3" align="left" valign="top">
<hr/></th></tr>
<tr>
<th align="center" valign="top">Simulation length (ns)</th>
<th align="center" valign="top">Avg. Cα RMSD (nm)</th>
<th align="center" valign="top">Final Cα RMSD (nm)</th>
<th align="center" valign="top">Simulation length (ns)</th>
<th align="center" valign="top">Avg. Cα RMSD (nm)</th>
<th align="center" valign="top">Final Cα RMSD (nm)</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">1bta</td>
<td align="center" valign="top">1000</td>
<td align="center" valign="top">0.393(0.010)</td>
<td align="center" valign="top">0.396</td>
<td align="center" valign="top">142.9</td>
<td align="center" valign="top">0.134(0.016)</td>
<td align="center" valign="top">0.121</td></tr>
<tr>
<td align="center" valign="top">1cye</td>
<td align="center" valign="top">1000</td>
<td align="center" valign="top">0.389(0.036)</td>
<td align="center" valign="top">0.422</td>
<td align="center" valign="top">124.7</td>
<td align="center" valign="top">0.143(0.020)</td>
<td align="center" valign="top">0.170</td></tr>
<tr>
<td align="center" valign="top">1d3z</td>
<td align="center" valign="top">1000</td>
<td align="center" valign="top">0.394(0.020)</td>
<td align="center" valign="top">0.395</td>
<td align="center" valign="top">22.0</td>
<td align="center" valign="top">0.141(0.021)</td>
<td align="center" valign="top">0.128</td></tr>
<tr>
<td align="center" valign="top">1fks</td>
<td align="center" valign="top">1000</td>
<td align="center" valign="top">0.379(0.021)</td>
<td align="center" valign="top">0.415</td>
<td align="center" valign="top">143.5</td>
<td align="center" valign="top">0.358(0.074)</td>
<td align="center" valign="top">0.477</td></tr>
<tr>
<td align="center" valign="top">1fw7</td>
<td align="center" valign="top">1000</td>
<td align="center" valign="top">0.391(0.033)</td>
<td align="center" valign="top">0.408</td>
<td align="center" valign="top">148.0</td>
<td align="center" valign="top">0.171(0.015)</td>
<td align="center" valign="top">0.167</td></tr>
<tr>
<td align="center" valign="top">1rch</td>
<td align="center" valign="top">1000</td>
<td align="center" valign="top">0.415(0.025)</td>
<td align="center" valign="top">0.431</td>
<td align="center" valign="top">121.5</td>
<td align="center" valign="top">0.278(0.017)</td>
<td align="center" valign="top">0.289</td></tr>
<tr>
<td align="center" valign="top">2aas</td>
<td align="center" valign="top">1000</td>
<td align="center" valign="top">0.364(0.034)</td>
<td align="center" valign="top">0.400</td>
<td align="center" valign="top">148.3</td>
<td align="center" valign="top">0.249(0.043)</td>
<td align="center" valign="top">0.321</td></tr>
<tr>
<td align="center" valign="top">3gb1</td>
<td align="center" valign="top">1000</td>
<td align="center" valign="top">0.316(0.015)</td>
<td align="center" valign="top">0.323</td>
<td align="center" valign="top">50.0</td>
<td align="center" valign="top">0.106(0.020)</td>
<td align="center" valign="top">0.143</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn1-ijms-13-14451">
<label>*</label>
<p>The AA-MD values are taken from reference <xref ref-type="bibr" rid="b37-ijms-13-14451">37</xref>.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t5-ijms-13-14451" position="float">
<label>Table 5</label>
<caption>
<p>Efficiency of 10 ns simulations of eight proteins using three different simulation methods.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="middle" rowspan="3">PDB</th>
<th colspan="3" align="center" valign="top">The proposed CG-MD</th>
<th colspan="2" align="center" valign="top">MARTINI</th>
<th colspan="2" align="center" valign="top">AA-MD</th></tr>
<tr>
<th colspan="3" align="left" valign="top">
<hr/></th>
<th colspan="2" align="left" valign="top">
<hr/></th>
<th colspan="2" align="left" valign="top">
<hr/></th></tr>
<tr>
<th align="center" valign="top">Simulation time (s)</th>
<th align="center" valign="top">Avg. Cα RMSD (nm)</th>
<th align="center" valign="top">Avg. Cα RMSD in vacuum (nm)</th>
<th align="center" valign="top">Simulation time (s)</th>
<th align="center" valign="top">Avg. Cα RMSD (nm)</th>
<th align="center" valign="top">Simulation time (s)</th>
<th align="center" valign="top">Avg. Cα RMSD (nm)</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">1bta</td>
<td align="center" valign="top">3501</td>
<td align="center" valign="top">0.210</td>
<td align="center" valign="top">0.637</td>
<td align="center" valign="top">4002</td>
<td align="center" valign="top">0.341</td>
<td align="center" valign="top">313062</td>
<td align="center" valign="top">0.148</td></tr>
<tr>
<td align="center" valign="top">1cye</td>
<td align="center" valign="top">4309</td>
<td align="center" valign="top">0.292</td>
<td align="center" valign="top">0.440</td>
<td align="center" valign="top">4972</td>
<td align="center" valign="top">0.503</td>
<td align="center" valign="top">398432</td>
<td align="center" valign="top">0.148</td></tr>
<tr>
<td align="center" valign="top">1d3z</td>
<td align="center" valign="top">4032</td>
<td align="center" valign="top">0.283</td>
<td align="center" valign="top">0.416</td>
<td align="center" valign="top">4261</td>
<td align="center" valign="top">0.426</td>
<td align="center" valign="top">334203</td>
<td align="center" valign="top">0.185</td></tr>
<tr>
<td align="center" valign="top">1fks</td>
<td align="center" valign="top">4484</td>
<td align="center" valign="top">0.324</td>
<td align="center" valign="top">0.505</td>
<td align="center" valign="top">5242</td>
<td align="center" valign="top">0.378</td>
<td align="center" valign="top">436792</td>
<td align="center" valign="top">0.220</td></tr>
<tr>
<td align="center" valign="top">1fw7</td>
<td align="center" valign="top">4391</td>
<td align="center" valign="top">0.247</td>
<td align="center" valign="top">0.574</td>
<td align="center" valign="top">4902</td>
<td align="center" valign="top">0.400</td>
<td align="center" valign="top">388712</td>
<td align="center" valign="top">0.171</td></tr>
<tr>
<td align="center" valign="top">1rch</td>
<td align="center" valign="top">7330</td>
<td align="center" valign="top">0.337</td>
<td align="center" valign="top">0.681</td>
<td align="center" valign="top">7845</td>
<td align="center" valign="top">0.357</td>
<td align="center" valign="top">650507</td>
<td align="center" valign="top">0.234</td></tr>
<tr>
<td align="center" valign="top">2aas</td>
<td align="center" valign="top">4424</td>
<td align="center" valign="top">0.284</td>
<td align="center" valign="top">0.689</td>
<td align="center" valign="top">4801</td>
<td align="center" valign="top">0.421</td>
<td align="center" valign="top">387623</td>
<td align="center" valign="top">0.259</td></tr>
<tr>
<td align="center" valign="top">3gb1</td>
<td align="center" valign="top">3432</td>
<td align="center" valign="top">0.275</td>
<td align="center" valign="top">0.501</td>
<td align="center" valign="top">3721</td>
<td align="center" valign="top">0.339</td>
<td align="center" valign="top">274854</td>
<td align="center" valign="top">0.128</td></tr></tbody></table></table-wrap>
<table-wrap id="t6-ijms-13-14451" position="float">
<label>Table 6</label>
<caption>
<p>Analogous compounds corresponding to the side-chain beads.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">Side-chain bead</th>
<th align="center" valign="bottom">Analogous compound</th>
<th align="center" valign="bottom">Side-chain bead</th>
<th align="center" valign="bottom">Analogous compound</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top"><italic>S</italic><italic><sub>ARG</sub></italic></td>
<td align="center" valign="top"><italic>n</italic>-propylguanidine</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>LYS</sub></italic></td>
<td align="center" valign="top"><italic>n</italic>-butylamine</td></tr>
<tr>
<td align="center" valign="top"><italic>S</italic><italic><sub>GLN</sub></italic></td>
<td align="center" valign="top">propionamide</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>MET</sub></italic></td>
<td align="center" valign="top">methyl propyl sulfide</td></tr>
<tr>
<td align="center" valign="top"><italic>S</italic><italic><sub>GLU</sub></italic></td>
<td align="center" valign="top">propionic acid</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>PHE</sub></italic></td>
<td align="center" valign="top">toluene</td></tr>
<tr>
<td align="center" valign="top"><italic>S</italic><italic><sub>HIS</sub></italic></td>
<td align="center" valign="top">4-methylimidazole</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>TRP</sub></italic></td>
<td align="center" valign="top">3-methylindole</td></tr>
<tr>
<td align="center" valign="top"><italic>S</italic><italic><sub>ILE</sub></italic></td>
<td align="center" valign="top"><italic>n</italic>-butane</td>
<td align="center" valign="top"><italic>S</italic><italic><sub>TYR</sub></italic></td>
<td align="center" valign="top"><italic>p</italic>-cresol</td></tr></tbody></table></table-wrap></sec></back></article>
