Article

ezAlign: A Tool for Converting Coarse-Grained Molecular Dynamics Structures to Atomistic Resolution for Multiscale Modeling

1
Lawrence Livermore National Laboratory, Livermore, CA 94550, USA
2
Procter and Gamble, Reading RG2 0RX, UK
3
Procter and Gamble, Mason, OH 45040, USA
4
Pacific Northwest National Laboratory, Richland, WA 99352, USA
*
Author to whom correspondence should be addressed.
Molecules 2024, 29(15), 3557; https://doi.org/10.3390/molecules29153557
Submission received: 18 June 2024 / Revised: 22 July 2024 / Accepted: 25 July 2024 / Published: 28 July 2024
(This article belongs to the Special Issue Feature Papers in Computational and Theoretical Chemistry)

Abstract

Soft condensed matter is challenging to study due to the vast time and length scales that are necessary to accurately represent complex systems and capture their underlying physics. Multiscale simulations are necessary to study processes that have disparate time and/or length scales, which abound throughout biology and other complex systems. Herein we present ezAlign, open-source software for converting coarse-grained molecular dynamics structures to atomistic representation, allowing multiscale modeling of biomolecular systems. The ezAlign v1.1 software package is publicly available for download at github.com/LLNL/ezAlign. Its underlying methodology is based on a simple alignment of an atomistic template molecule, followed by position-restrained energy minimization, which forces the atomistic molecule to adopt a conformation consistent with the coarse-grained molecule. The molecules are then combined, solvated, minimized, and equilibrated with position restraints. Validation of the process was conducted on a pure POPC membrane and compared with other popular methods to construct atomistic membranes. Additional examples, including surfactant self-assembly, membrane proteins, and more complex bacterial and human plasma membrane models, are also presented. By providing these examples, parameter files, code, and an easy-to-follow recipe to add new molecules, this work will aid future multiscale modeling efforts.

1. Introduction

There is great interest in using multiscale modeling to study molecular self-assembly, such as the interactions between membranes and amphiphiles. This interest is particularly motivated by the large range of time and length scales that are important for characterizing these systems. For example, individual hydrogen bonds and other fluctuations that are crucial for some bilayer properties occur on sub-nanometer and sub-nanosecond scales [1], while at the other extreme, bilayer bending can span hundreds of nanometers and lipid flip-flop can take hours to days [2]. To this end, lipid systems are studied with a variety of computational methods, from continuum methods for macroscopic simulations to quantum calculations on extremely small scales. In between these extremes are atomistic (AA) and coarse-grained (CG) molecular dynamics (MD) simulations. CG simulations lack atomistic information but retain sub-molecular resolution and can reach length and time scales orders of magnitude larger than AA simulations.
Converting from a CG structure to an AA representation is a non-trivial task, as one CG particle represents multiple atoms. Therefore, it is not straightforward to place all the atoms inside the CG bead. Figure 1 illustrates the two resolutions, showing a single CG lipid, its corresponding AA lipid, as well as a complex membrane system for each resolution. Multiscale modeling, where both AA and CG models are used in conjunction with one another, offers a great opportunity for obtaining the best of both methods, bridging the micro- and macro-scales. To aid these studies, a number of tools have been built that convert CG to AA models for MD simulations, including earlier tools [3,4] and a number of more recent methods [5,6,7]. Backward is one of the most widely used CG-to-AA tools [5]. This method uses pre-defined geometrical relationships for all the atoms in a molecule to place them relative to the CG beads. This is followed by a series of position-restrained energy minimizations and a short MD simulation to relax the system. With this method and any other fragment-based/geometrical method, care must be taken, particularly with regard to unphysically stretched bonds and the chirality of molecules. After back mapping, some bonds within a lipid molecule may be stretched significantly, which can result in AA simulation systems that are difficult to equilibrate. In such cases, a lipid tail can pierce through an aromatic ring belonging to a membrane protein (called ring penetration in CHARMM-GUI). Our method includes a post-conversion protocol consisting of position-restrained energy minimization and molecular dynamics steps, so that the resulting AA models can be easily equilibrated and simulated. Another important problem is that some molecules can have incorrect stereochemistry after back mapping. Other fragment-based tools include CG2AT [6] and CG2AT2 [7].
Additionally, for fragment-based/geometrical methods, adding new molecules can require careful testing, additional parameterization, and specific chemical knowledge.
Aiming for simplicity and automatability, we implemented ezAlign, a simple, template-based back-mapping tool that requires minimal human time and intervention. This method uses GROMACS [8] and MDAnalysis [9]. Similarly to CGTools, developed by Schulten and coworkers in NAMD [4], the initial back mapping of a CG molecule is based on an alignment between a subset of individual atoms and their corresponding CG beads. In Figure 1, we summarize the method using a membrane system. First, a template AA molecule is fitted onto each CG lipid by minimizing the RMSD between the CG beads and the mapped atoms, utilizing the MDAnalysis Python package [9]. A position restraint is then placed on each mapped atom, with the reference position set by the corresponding CG bead’s position. Each lipid is then energy-minimized without intermolecular interactions, followed by a short stochastic MD simulation. Figure 1 shows a single POPC molecule after using ezAlign, where the conformation of the AA lipid closely matches that of the CG lipid. Each CG water bead is then replaced with four AA waters, and each CG ion bead with one AA ion plus four waters, which is similar to the Backward approach for water [5]. The entire system is then assembled and energy-minimized, and a short, restrained MD simulation is conducted, resulting in a full AA system that is ready for subsequent MD simulations. We assess ezAlign’s performance on a simple POPC bilayer compared with Backward and CHARMM-GUI’s Martini to All Atom Converter, followed by presenting several example applications of more complex systems.
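The position-restraint step described above can be illustrated with a short Python sketch that writes a GROMACS `[ position_restraints ]` topology section for the mapped atoms. The function name, the atom indices, and the force constant of 1000 kJ mol^-1 nm^-2 are illustrative assumptions and are not taken from the ezAlign source. In GROMACS, the reference positions themselves are supplied separately through the `-r` option of `gmx grompp`; in this workflow they would correspond to the CG bead coordinates.

```python
def position_restraints_section(mapped_atoms, force_constant=1000.0):
    """Render a GROMACS [ position_restraints ] block as text.

    mapped_atoms: 1-based atom indices (within the molecule) of the AA atoms
                  that are mapped to CG beads (hypothetical values below).
    force_constant: harmonic force constant in kJ mol^-1 nm^-2 (assumed value).
    """
    lines = ["[ position_restraints ]", "; ai  funct  fcx  fcy  fcz"]
    for atom_index in sorted(mapped_atoms):
        # funct 1 selects the harmonic position restraint
        lines.append(
            f"{atom_index:5d}  1  {force_constant:g}  {force_constant:g}  {force_constant:g}"
        )
    return "\n".join(lines) + "\n"


# Hypothetical mapped-atom indices for a three-bead molecule
section = position_restraints_section([20, 1, 35])
print(section)
```

Only the mapped atoms are restrained in this sketch, so the remaining atoms are free to relax around them during minimization.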

2. Results

2.1. POPC Lipid Bilayer

Initial testing was performed with a pure POPC lipid bilayer system, which was built with the insane bilayer builder [10], as explained in Section 4.4. A 200 ns CG MD simulation with the Martini 2.0 force field was then conducted, and ezAlign was used to convert the final simulation frame to an AA representation using the CHARMM36 force field [11]. Figure 2 shows examples of three lipids during the ezAlign procedure. After the initial placement, each atomistic POPC lipid has the same conformation. Energy minimization with position restraints results in the lipids adopting conformations with the mapped atoms closely overlapping the CG model. An additional step of stochastic dynamics with position restraints results in more relaxed conformations.
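How closely the mapped atoms overlap the CG beads can be quantified with a root-mean-square deviation. The following is a minimal sketch with made-up coordinates; the function name is hypothetical and this is not the metric reported in the paper.

```python
import math


def mapped_rmsd(mapped_atoms, cg_beads):
    """RMSD (same units as the input) between mapped AA atoms and their CG beads."""
    if len(mapped_atoms) != len(cg_beads) or not cg_beads:
        raise ValueError("need equally sized, non-empty coordinate lists")
    squared = sum(math.dist(a, b) ** 2 for a, b in zip(mapped_atoms, cg_beads))
    return math.sqrt(squared / len(cg_beads))


# Made-up coordinates in nm: two mapped atoms and their two CG beads
atoms = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
beads = [(0.0, 0.0, 0.3), (1.0, 0.4, 0.0)]
rmsd = mapped_rmsd(atoms, beads)
print(f"{rmsd:.4f} nm")
```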

2.2. Comparison to Backward and CHARMM-GUI

To validate ezAlign, we compared it with AA simulation setups from CHARMM-GUI’s membrane builder [12] and with CG-to-AA mapped simulations using Backward [5]. For both ezAlign and Backward, the system was initiated from a CG POPC lipid bilayer built with insane [10]. Figure 3A plots the area per lipid (APL) for the system at the start of the simulation. All three methods start within the expected range of APL fluctuations, which are close to the CG APL. In Figure 3B, we also compare partial density curves for POPC after conversion to atomistic detail and 5 ns of simulation. These data show that all three methods can produce equilibrated AA starting structures for simple membrane systems.
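For a flat bilayer that spans the simulation box, the APL is commonly estimated as the lateral box area divided by the number of lipids in one leaflet. The sketch below assumes a symmetric bilayer and uses hypothetical box dimensions and lipid counts; it is not necessarily how the values in Figure 3A were computed.

```python
def area_per_lipid(box_x_nm, box_y_nm, lipids_per_leaflet):
    """Area per lipid in nm^2 for a flat, box-spanning bilayer."""
    if lipids_per_leaflet <= 0:
        raise ValueError("lipids_per_leaflet must be positive")
    return box_x_nm * box_y_nm / lipids_per_leaflet


# Hypothetical frame: 8 nm x 8 nm lateral box, 100 lipids in each leaflet
apl = area_per_lipid(8.0, 8.0, 100)
print(f"APL = {apl:.2f} nm^2")
```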
It has been noted previously that Backward occasionally produces molecules that have a different chirality than expected. We found one such example for POPC, shown in Figure 3C, where the chirality of the glycerol backbone of the POPC lipid is opposite to the expected state shown for the same lipid converted with ezAlign. We note that this behavior in Backward is rare, and additional restraints or other geometric rules could be added to ensure chirality.
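One simple way to flag such inversions is to compare the sign of a scalar triple product around the stereocenter in a reference structure and in the back-mapped structure, since a mirror image flips that sign. The sketch below uses made-up coordinates and hypothetical function names; which atoms to pass (for example, the central glycerol carbon and three of its neighbors in a fixed order) is an assumption of this example.

```python
def signed_volume(center, n1, n2, n3):
    """Scalar triple product (n1-c) . ((n2-c) x (n3-c)) around a stereocenter."""
    v1 = [a - c for a, c in zip(n1, center)]
    v2 = [a - c for a, c in zip(n2, center)]
    v3 = [a - c for a, c in zip(n3, center)]
    cross = (
        v2[1] * v3[2] - v2[2] * v3[1],
        v2[2] * v3[0] - v2[0] * v3[2],
        v2[0] * v3[1] - v2[1] * v3[0],
    )
    return sum(a * b for a, b in zip(v1, cross))


def same_chirality(reference, candidate):
    """True if both (center, n1, n2, n3) tuples have the same handedness."""
    return signed_volume(*reference) * signed_volume(*candidate) > 0


# Made-up coordinates: a reference center and its mirror image (z -> -z)
reference = ((0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1))
mirrored = ((0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, -1))
print(same_chirality(reference, reference), same_chirality(reference, mirrored))
```

The neighbor atoms must be passed in the same order for both structures, otherwise the sign comparison is meaningless.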

2.3. Self-Assembly

The self-assembly of amphiphilic molecules is important for many diverse applications, from biotechnology (drug delivery) to chemical engineering (soap formulations) [13]. Due to the necessarily large length and time scales, simulating self-assembly with AA models is challenging. ezAlign is not restricted to membrane systems and is applicable to self-assembly systems. Figure 4A shows an example system of cetyl-betaine (CTBE) self-assembled into small spherical micelles in long-timescale CG simulations and back mapped with ezAlign. The micelles interact with a model E. coli inner membrane on a long timescale, with contacts lasting microseconds of simulation time. Over the long-timescale CG simulations, molecular rearrangements are possible, such as monomeric CTBE molecules moving from micelles to the E. coli membrane. Figure 4B illustrates a large system that was first run with CG and then back mapped to AA with ezAlign. Large molecular rearrangements are observed, and the collective behavior can then be assessed at both AA and CG levels of detail. In the case of large systems, ezAlign back mapping can take a significant amount of time, but it is still orders of magnitude shorter than standard atomistic production simulations.

2.4. Heterogeneous Membranes

We provide several systems for future use that we have mapped, including AA and CG parameters and starting configurations. Cholesterol is an important eukaryotic lipid that has diverse roles in biology and has been studied extensively in membrane simulations [14,15]. Recently, complex models for a human plasma membrane were compared with models with a smaller number of lipid types, resulting in a simplified plasma membrane model consisting of eight lipid types asymmetrically distributed across the leaflets (Mix8) [16]. Figure 4B presents Mix8, showing the complex mixture of lipids that are present in this system, including bilayer asymmetry. Bacterial membranes may also be modeled, such as the E. coli cytoplasmic membranes shown in Figure 4A.

2.5. Transmembrane Proteins

Transmembrane proteins are another important and well-studied biological system [17]. Simulation systems composed of transmembrane proteins can be quite large and require significant sampling times to adequately establish the lipid/protein interaction ensemble; such sampling times are currently inaccessible to AA simulation strategies. With ezAlign, ensemble equilibration can first be performed in CG and the result then back mapped to AA, yielding significantly faster lipid/protein interaction equilibration while recovering AA resolution. Figure 4C shows an example where GPR40 is first run in the CG model with position restraints to allow the lipids to equilibrate. The 200 ns CG frame was then converted to AA using ezAlign. We highlight that with ezAlign, the lipids are packed around the protein, as expected from the CG model.
In addition to GPR40, ezAlign has been successfully tested on three other transmembrane protein systems. Figure 5 shows the CG and back-mapped AA representations of hERG [18], GABAA [19], and a RAS/RAF complex [20]. All systems were simulated for 200 ns in CG and then converted to AA resolution with ezAlign. All systems exhibited proper lipid packing around the back-mapped proteins. The RAS/RAF system features a large, complex, heterogeneous human plasma membrane model. This system contains approximately one million atoms and was readily back mapped with ezAlign in a little less than an hour using 36 CPUs with MPI parallelization.

3. Discussion

CG structures are readily converted to AA resolution for multiscale simulations with the new tool ezAlign, publicly available at github.com/LLNL/ezAlign. The ezAlign program is easy to use and allows for the accurate transformation of CG systems to AA detail, where each AA molecule matches its corresponding CG molecule’s conformation. Currently, ezAlign is readily capable of converting several standard molecules with no user modifications, including biologically relevant systems such as lipid membranes, proteins, and small molecules. Adding new molecules is a trivial task due to the use of an initial template molecule. A similar method was previously implemented in NAMD and shown to effectively convert the CG structure to AA representation [4] and applied to a number of interesting problems [21,22] and extensions to polymers [23]. Our tool is implemented for use with GROMACS [8], has a wealth of pre-built systems, and is easily extendable to other systems. Our code uses MDAnalysis [9] for the molecule transformations, so adapting to another MD code or other force fields is, in principle, straightforward.
The ezAlign program reproduces the structure of a lipid bilayer with accuracy and efficiency similar to those of the popular Backward tool [5]. One advantage of ezAlign is that it is very straightforward to add new molecules. Additionally, Backward and other geometry-based tools can result in molecules with the wrong chirality. Due to the form of the AA MD potential energy function, these states are permissible but change the molecule’s chemistry, possibly in important ways. Backward has additional parameters that can be added to prevent improper placement or dihedral potentials enforcing a specific tautomer, but these methods require prior chemical knowledge. Other fragment- or geometric rule-based methods, such as CG2AT2 [7], will also likely suffer from this deficiency. Recently, machine learning tools for CG-to-AA transformations have been developed [24]. These tools require extensive training data, which takes significant work to produce for each new molecule and system. The transferability of the ML model to other situations is also a potential problem. For example, a model trained to back map a molecule in water is likely not suitable if the molecule is in a lipid membrane.
Systems including a protein complex may be modeled with ezAlign, including transmembrane protein systems. Particularly for membrane proteins that are either known or expected to deform the bilayer morphology, an MD simulation system at CG resolution is often easier to build and equilibrate, even if the simulations themselves are to be carried out at AA resolution. Once the membrane solvation around the protein is equilibrated well in a CG MD, the system of interest can then be converted to AA representation using ezAlign with ease and efficiency. Apart from alleviating issues regarding the building and equilibrating of complex membrane protein simulation systems, CG simulations of membrane proteins can efficiently achieve an equilibrated distribution of different lipid species within a complex membrane. After a complex protein-lipid system equilibrates the long-timescale protein/lipid interactions with CG resolution, ezAlign can be used to recover AA resolution through back mapping. In addition to its ease of use, ezAlign is easy to modify for specific purposes. For example, the incorporation of nonstandard amino acids is straightforward, requiring no direct modification of the core ezAlign code (see Section 4.2).
There are many avenues for future improvements to ezAlign. The ezAlign program is designed so it can readily be adapted to take advantage of future improvements to hardware and software for MD simulation speeds. There is considerable room for optimizing the speed and computational cost of the program. Accommodating more sophisticated mapping strategies could prove useful; as it stands, ezAlign must map each CG bead to a single AA atom. Finally, expanding to other types of molecules, such as DNA and RNA, can be achieved in future versions. We also plan to expand our list of molecules and pre-equilibrated membranes.
As simulation capabilities expand, the need for easy access to model systems and multiscale software will expand as well. We provide several diverse applications and parameters for community use with ezAlign. Systems include basic bilayer systems, complex mixtures, bacterial inner membranes, human plasma membranes, and surfactant self-assembly. Multiscale workflows allow for complex mixtures and molecular rearrangements that are difficult to produce with tools such as CHARMM-GUI alone [25].

4. Methods

4.1. ezAlign Protocol

Figure 1 illustrates the steps to convert a pure POPC lipid bilayer system from the CG structure to the AA model. In Step 1, each CG lipid is aligned to a single AA lipid of the same type. A predefined mapping of each bead to a single AA atom is used to define the position restraints for the AA lipid. In Step 2, each lipid is then subjected to a short energy minimization with the AA-mapped atoms restrained to the position of the respective CG bead. After minimization, a short stochastic dynamics simulation is carried out for each lipid in vacuum. If a protein is included, additional protein minimization and relaxation simulations are performed in vacuum (see Section 4.2). In Step 3, the system is constructed by merging all the lipids, proteins, and other molecules. In Step 4, water (four waters per bead) and ions (one ion and four waters to mimic the ‘solvated’ ion paradigm in Martini) are added to the system. In Step 5, the final system is then energy-minimized and equilibrated with position restraints to generate an output AA structure ready for simulation.
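Step 1 amounts to a rigid-body least-squares fit of the template’s mapped atoms onto the CG beads, after which the same transformation is applied to every template atom. ezAlign performs this fit with MDAnalysis; the dependency-free sketch below is only a stand-in that uses Horn’s quaternion method with a shifted power iteration, and all function names and coordinates are hypothetical.

```python
import math


def centroid(points):
    """Arithmetic mean position of a list of (x, y, z) points."""
    n = len(points)
    return tuple(sum(p[k] for p in points) / n for k in range(3))


def best_rotation(mobile, target, iterations=500):
    """Rotation matrix that best superimposes centered `mobile` onto centered `target`."""
    # Cross-covariance sums s[j][k] = sum_i mobile_i[j] * target_i[k]
    s = [[sum(a[j] * b[k] for a, b in zip(mobile, target)) for k in range(3)]
         for j in range(3)]
    (sxx, sxy, sxz), (syx, syy, syz), (szx, szy, szz) = s
    n_mat = [
        [sxx + syy + szz, syz - szy, szx - sxz, sxy - syx],
        [syz - szy, sxx - syy - szz, sxy + syx, szx + sxz],
        [szx - sxz, sxy + syx, -sxx + syy - szz, syz + szy],
        [sxy - syx, szx + sxz, syz + szy, -sxx - syy + szz],
    ]
    # Shift makes every eigenvalue non-negative, so power iteration converges
    # to the eigenvector of the largest eigenvalue (the optimal quaternion).
    shift = max(sum(abs(v) for v in row) for row in n_mat)
    q = [1.0, 0.5, 0.25, 0.125]
    for _ in range(iterations):
        q = [sum(n_mat[r][c] * q[c] for c in range(4)) + shift * q[r] for r in range(4)]
        norm = math.sqrt(sum(v * v for v in q))
        q = [v / norm for v in q]
    w, x, y, z = q
    return [
        [1 - 2 * (y * y + z * z), 2 * (x * y - w * z), 2 * (x * z + w * y)],
        [2 * (x * y + w * z), 1 - 2 * (x * x + z * z), 2 * (y * z - w * x)],
        [2 * (x * z - w * y), 2 * (y * z + w * x), 1 - 2 * (x * x + y * y)],
    ]


def align_template(template, mapped_indices, cg_beads):
    """Place all template atoms so that the mapped atoms best overlap the CG beads."""
    mapped = [template[i] for i in mapped_indices]
    c_map, c_cg = centroid(mapped), centroid(cg_beads)
    rot = best_rotation(
        [[p[k] - c_map[k] for k in range(3)] for p in mapped],
        [[b[k] - c_cg[k] for k in range(3)] for b in cg_beads],
    )
    aligned = []
    for p in template:
        d = [p[k] - c_map[k] for k in range(3)]
        aligned.append(tuple(sum(rot[r][k] * d[k] for k in range(3)) + c_cg[r]
                             for r in range(3)))
    return aligned


# Hypothetical template (5 atoms, first 4 mapped) and CG beads that are the
# mapped atoms rotated by 90 degrees about z and shifted by (5, 5, 5)
template = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (0.0, 0.0, 3.0), (1.0, 1.0, 1.0)]
cg_beads = [(5.0, 5.0, 5.0), (5.0, 6.0, 5.0), (3.0, 5.0, 5.0), (5.0, 5.0, 8.0)]
aligned = align_template(template, [0, 1, 2, 3], cg_beads)
```

A fixed iteration count keeps the sketch short; a production implementation would use a proper eigensolver or an SVD-based fit and handle degenerate inputs such as collinear beads.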

4.2. Protein Minimization and Relaxation

Most Martini CG protein simulations involve a tight elastic network to maintain secondary and tertiary structure. However, there will be some conformational flexibility as well as sidechain motion. Additionally, future Martini CG protein simulations look to do away with the elastic networks, increasing flexibility and the ability to model conformational dynamics with Go-like models [26]. With ezAlign, each amino acid is mapped according to “amino_map.py” in the “files” directory. This file can be easily modified to permit the incorporation of nonstandard amino acids without any modification of the core ezAlign.py code. Using this by-residue mapping strategy, the protein is minimized and relaxed in vacuum, in an analogous manner to the lipid protocol, such that the relaxed AA protein adopts the same conformation as the CG input system (see “em1_prot.mdp” and “md1_prot.mdp” in the “files” directory for specific parameters). Multiple independent protein complexes can be provided for simultaneous mapping using this protocol.

4.3. File Structures

There are two files that currently must be supplied by the user for each new molecule type that is not already included in the ezAlign “files” subdirectory. The atomistic force-field files (in the itp format) must be present, as must the energy-minimized AA PDB files for each molecule type in the simulated system. The file names should match the name of the molecule, with Figure 6 illustrating the file structures, names, and an example mapping for a small three-bead benzyl alcohol molecule.
When running ezAlign, an initial CG structure for a large system with many molecules must be provided as a PDB file with a corresponding CG topology file, which maps the number of molecules of each type. The “residues.map” file in the “files” subdirectory must be modified for the inclusion of new molecules, using the format illustrated in Figure 6.

4.4. Coarse-Grained MD Simulations

A pure POPC lipid membrane was built with the insane bilayer builder [10] and solvated with 0.15 M salt solution. The Martini v2.0 force field [27] was used for the bonded and non-bonded interactions in the initial CG setup and runs. Ten percent of the water beads were modeled as anti-freeze water beads (WF), and the rest were modeled as regular water beads (W). MD simulations were run with a 20 fs time step in GROMACS 2018.3 and 5.1.4 [8].
Temperature was maintained at 313 K using the V-rescale method [28], and semi-isotropic pressure coupling was applied with the Parrinello-Rahman method [29] at 1 bar. Non-bonded interactions were cut off at 1.2 nm. For electrostatic interactions, a relative dielectric constant of 15 was used for implicit charge screening, and the potential was shifted from 0 nm to 1.2 nm. Lennard-Jones interactions were shifted from 0.9 nm to 1.2 nm.

4.5. Atomistic MD Simulations

Atomistic simulations were run with GROMACS 2018.3 and GROMACS 2023.2 [8]. Note that the current version of ezAlign requires a GROMACS version later than 2022, due to the utilization of the Gapsys et al. soft-core potential [30]. A time step of 2 fs was used with LINCS constraints on the hydrogen bonds and angles [31,32]. Lennard-Jones interactions were cut off at 1.0 nm, and long-range electrostatic interactions were computed using the particle mesh Ewald method [33,34]. Semi-isotropic pressure coupling was used with the Parrinello-Rahman [29] barostat with a reference pressure of 1 bar. Temperature was maintained at 313 K using the Nose-Hoover method [35].
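The CG and AA settings stated in Sections 4.4 and 4.5 can be collected as GROMACS .mdp keys, as in the sketch below. The mapping of the text onto these particular keys (for example, the Martini-style `shift` modifiers for the CG model) is our reading of the description rather than a copy of the files distributed with ezAlign, and values that the text does not state (coupling time constants, compressibility, temperature-coupling groups, constraint type) are deliberately left out and would have to be supplied for a real run.

```python
# Parameters named in the text for the CG (Martini) simulations
CG_MDP = {
    "dt": 0.020,                  # 20 fs time step (ps)
    "tcoupl": "v-rescale",
    "ref_t": 313,                 # K
    "pcoupl": "parrinello-rahman",
    "pcoupltype": "semiisotropic",
    "ref_p": "1.0 1.0",           # bar (lateral, normal)
    "coulombtype": "shift",
    "rcoulomb_switch": 0.0,       # nm
    "rcoulomb": 1.2,              # nm
    "epsilon_r": 15,
    "vdwtype": "shift",
    "rvdw_switch": 0.9,           # nm
    "rvdw": 1.2,                  # nm
}

# Parameters named in the text for the AA simulations
AA_MDP = {
    "dt": 0.002,                  # 2 fs time step (ps)
    "constraint_algorithm": "lincs",
    "tcoupl": "nose-hoover",
    "ref_t": 313,                 # K
    "pcoupl": "parrinello-rahman",
    "pcoupltype": "semiisotropic",
    "ref_p": "1.0 1.0",           # bar (lateral, normal)
    "coulombtype": "PME",
    "rvdw": 1.0,                  # nm
}


def render_mdp(params):
    """Format a parameter dictionary as 'key = value' lines of an .mdp file."""
    width = max(len(key) for key in params)
    return "\n".join(f"{key:<{width}} = {value}" for key, value in params.items()) + "\n"


print(render_mdp(CG_MDP))
print(render_mdp(AA_MDP))
```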

4.6. Transmembrane Protein Simulations

We also simulated the transmembrane protein GPR40 (PDB: 4EJ4), starting from the X-ray structure [36]. For Martini, the v2.2 model was used for the protein [37] and v2.0 for the lipids [27], with the same simulation parameters as for the bilayer-only models. The system was then converted to AA with ezAlign, using the AMBER99 force field [38] for the protein and the AMBER21 lipids [39]. This system was then run for 50 ns to monitor stability. The lipid surface density was calculated and plotted using VMD [40].
Transmembrane proteins hERG (PDB: 7CN1) [18], GABAA (PDB: 8SI9) [19], and a RAS/RAF complex (PDB: 6XI7) [20] were also simulated. The same protocol was used as for GPR40, except that the CHARMM36 [11] force field was used instead of AMBER.

5. Conclusions

The program ezAlign, a new tool for CG-to-AA resolution transformations, is presented for future use by the scientific community. We validated ezAlign against other methods for converting lipid membrane systems from CG to AA resolution. One significant advantage of ezAlign is its ease of use, where adding new molecules is a trivial and automatable task. Additionally, ezAlign does not require training data or human knowledge of chemistry and can back map complex membrane-protein systems. The ezAlign program is provided with files for several important biological systems, including proteins, lipids, and surfactants, for easy adoption by new users.

Author Contributions

Conceptualization, W.F.D.B., A.B., S.J.F., D.S. and C.M.M.; methodology, W.F.D.B. and A.B.; software, W.F.D.B. and A.B.; validation, W.F.D.B., A.B., T.N.O., H.I.I. and D.S.; writing—original draft, W.F.D.B.; writing—review and editing, W.F.D.B., A.B., T.N.O., H.I.I., S.J.F., D.S. and C.M.M. All authors have read and agreed to the published version of the manuscript.

Funding

Funding was provided by Procter and Gamble through CRADA TC02339 and by the Joint Science and Technology Office (JSTO) under Award CB100035, the NCI-DOE Collaboration established by the US DOE and the NCI of the National Institutes of Health, LDRD 24-ERD-027, and the Livermore Institutional Grand Challenge for computing time. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of DTRA. This work was performed under the auspices of the U.S. DOE by the Lawrence Livermore National Laboratory under contract DE-AC52-07NA27344, release number LLNL-JRNL-839423.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The ezAlign source code, examples, and documentation may be found at https://github.com/LLNL/ezAlign (accessed on 27 July 2024).

Acknowledgments

We thank the Livermore Institutional Grand Challenge for the computing time provided for this work.

Conflicts of Interest

Authors Stephen J. Fox and C. Mark Maupin were employed by the company Procter and Gamble. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Tieleman, D.P.; Marrink, S.J.; Berendsen, H.J. A computer perspective of membranes: Molecular dynamics studies of lipid bilayer systems. Biochim. Biophys. Acta 1997, 1331, 235–270.
  2. Gurtovenko, A.A.; Anwar, J.; Vattulainen, I. Defect-Mediated trafficking across cell membranes: Insights from in silico modeling. Chem. Rev. 2010, 110, 6077–6103.
  3. Rzepiela, A.J.; Schafer, L.V.; Goga, N.; Risselada, H.J.; de Vries, A.H.; Marrink, S.J. Reconstruction of atomistic details from coarse-grained structures. J. Comput. Chem. 2010, 31, 1333–1343.
  4. Shih, A.Y.; Freddolino, P.L.; Sligar, S.G.; Schulten, K. Disassembly of nanodiscs with cholate. Nano. Lett. 2007, 7, 1692–1696.
  5. Wassenaar, T.A.; Pluhackova, K.; Bockmann, R.A.; Marrink, S.J.; Tieleman, D.P. Going Backward: A Flexible Geometric Approach to Reverse Transformation from Coarse Grained to Atomistic Models. J. Chem. Theory Comput. 2014, 10, 676–690.
  6. Stansfeld, P.J.; Sansom, M.S. From Coarse Grained to Atomistic: A Serial Multiscale Approach to Membrane Protein Simulations. J. Chem. Theory Comput. 2011, 7, 1157–1166.
  7. Vickery, O.N.; Stansfeld, P.J. CG2AT2: An Enhanced Fragment-Based Approach for Serial Multi-Scale Molecular Dynamics Simulations. J. Chem. Theory Comput. 2021, 17, 6472–6482.
  8. Abraham, M.J.; Murtola, T.; Schulz, R.; Páll, S.; Smith, J.C.; Hess, B.; Lindahl, E. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 2015, 1, 19–25.
  9. Michaud-Agrawal, N.; Denning, E.J.; Woolf, T.B.; Beckstein, O. MDAnalysis: A toolkit for the analysis of molecular dynamics simulations. J. Comput. Chem. 2011, 32, 2319–2327.
  10. Wassenaar, T.A.; Ingolfsson, H.I.; Bockmann, R.A.; Tieleman, D.P.; Marrink, S.J. Computational Lipidomics with Insane: A Versatile Tool for Generating Custom Membranes for Molecular Simulations. J. Chem. Theory Comput. 2015, 11, 2144–2155.
  11. Klauda, J.B.; Venable, R.M.; Freites, J.A.; O’Connor, J.W.; Tobias, D.J.; Mondragon-Ramirez, C.; Vorobyov, I.; MacKerell, A.D., Jr.; Pastor, R.W. Update of the CHARMM all-atom additive force field for lipids: Validation on six lipid types. J. Phys. Chem. B 2010, 114, 7830–7843.
  12. Jo, S.; Kim, T.; Iyer, V.G.; Im, W. CHARMM-GUI: A web-based graphical user interface for CHARMM. J. Comput. Chem. 2008, 29, 1859–1865.
  13. Stephanopoulos, N.; Ortony, J.H.; Stupp, S.I. Self-Assembly for the Synthesis of Functional Biomaterials. Acta Mater. 2013, 61, 912–930.
  14. Bennett, W.F.; Tieleman, D.P. Computer simulations of lipid membrane domains. Biochim. Biophys. Acta 2013, 1828, 1765–1776.
  15. Grouleff, J.; Irudayam, S.J.; Skeby, K.K.; Schiøtt, B. The influence of cholesterol on membrane protein structure, function, and dynamics studied by molecular dynamics simulations. Biochim. Biophys. Acta 2015, 1848, 1783–1795.
  16. Ingólfsson, H.I.; Bhatia, H.; Zeppelin, T.; Bennett, W.F.D.; Carpenter, K.A.; Carpenter, K.A.; Dharuman, G.; Bremer, P.T.; Schiøtt, B.; Lightstone, F.C.; et al. Capturing biologically complex tissue-specific membranes at different levels of compositional complexity. J. Phys. Chem. B 2020, 124, 7819–7829.
  17. Enkavi, G.; Javanainen, M.; Kulig, W.; Róg, T.; Vattulainen, I. Multiscale Simulations of Biological Membranes: The Challenge To Understand Biological Phenomena in a Living Substance. Chem. Rev. 2019, 119, 5607–5774.
  18. Asai, T.; Adachi, N.; Moriya, T.; Oki, H.; Maru, T.; Kawasaki, M.; Suzuki, K.; Chen, S.; Ishii, R.; Yonemori, K.; et al. Cryo-EM Structure of K+-Bound hERG Channel Complexed with the Blocker Astemizole. Structure 2021, 29, 203–212.e4.
  19. Legesse, D.H.; Fan, C.; Teng, J.; Zhuang, Y.; Howard, R.J.; Noviello, C.M.; Lindahl, E.; Hibbs, R.E. Structural insights into opposing actions of neurosteroids on GABAA receptors. Nat. Commun. 2023, 14, 5091.
  20. Tran, T.H.; Chan, A.H.; Young, L.C.; Bindu, L.; Neale, C.; Messing, S.; Dharmaiah, S.; Taylor, T.; Denson, J.-P.; Esposito, D.; et al. KRAS interaction with RAF1 RAS-Binding domain and cysteine-rich domain provides insights into RAS-Mediated RAF activation. Nat. Commun. 2021, 12, 1176.
  21. Thøgersen, L.; Schiøtt, B.; Vosegaard, T.; Nielsen, N.C.; Tajkhorshid, E. Peptide aggregation and pore formation in a lipid bilayer: A combined coarse-grained and all atom molecular dynamics study. Biophys. J. 2008, 95, 4337–4347.
  22. Perlmutter, J.D.; Sachs, J.N. Experimental verification of lipid bilayer structure through multi-scale modeling. Biochim. Biophys. Acta 2009, 1788, 2284–2290.
  23. Perlmutter, J.D.; Drasler, W.J., II; Xie, W.; Gao, J.; Popot, J.-L.; Sachs, J.N. All-Atom and coarse-grained molecular dynamics simulations of a membrane protein stabilizing polymer. Langmuir 2011, 27, 10523–10537.
  24. Louison, K.A.; Dryden, I.L.; Laughton, C.A. GLIMPS: A Machine Learning Approach to Resolution Transformation for Multiscale Modeling. J. Chem. Theory Comput. 2021, 17, 7930–7937.
  25. Maffeo, C.; Bhattacharya, S.; Yoo, J.; Wells, D.; Aksimentiev, A. Modeling and simulation of ion channels. Chem. Rev. 2012, 112, 6250–6284.
  26. Pedersen, K.B.; Borges-Araújo, L.; Stange, A.D.; Souza, P.C.T.; Marrink, S.-J.; Schiøtt, B. OLIVES: A Go-like Model for Stabilizing Protein Structure via Hydrogen Bonding Native Contacts in the Martini 3 Coarse-Grained Force Field. arXiv 2023.
  27. Marrink, S.J.; Risselada, H.J.; Yefimov, S.; Tieleman, D.P.; de Vries, A.H. The MARTINI force field: Coarse grained model for biomolecular simulations. J. Phys. Chem. B 2007, 111, 7812–7824.
  28. Bussi, G.; Donadio, D.; Parrinello, M. Canonical sampling through velocity rescaling. J. Chem. Phys. 2007, 126, 014101.
  29. Parrinello, M.; Rahman, A. Polymorphic Transitions in Single-Crystals-a New Molecular-Dynamics Method. J. Appl. Phys. 1981, 52, 7182–7190.
  30. Gapsys, V.; Seeliger, D.; de Groot, B.L. New Soft-Core Potential Function for Molecular Dynamics Based Alchemical Free Energy Calculations. J. Chem. Theory Comput. 2012, 8, 2373–2382.
  31. Hess, B.; Bekker, H.; Berendsen, H.J.C.; Fraaije, J.G.E.M. LINCS: A linear constraint solver for molecular simulations. J. Comput. Chem. 1997, 18, 1463–1472.
  32. Hess, B. P-LINCS: A Parallel Linear Constraint Solver for Molecular Simulation. J. Chem. Theory Comput. 2008, 4, 116–122.
  33. Essmann, U.; Perera, L.; Berkowitz, M.L.; Darden, T.; Lee, H.; Pedersen, L.G. A Smooth Particle Mesh Ewald Method. J. Chem. Phys. 1995, 103, 8577–8593.
  34. Darden, T.; York, D.; Pedersen, L. Particle Mesh Ewald-an N.Log(N) Method for Ewald Sums in Large Systems. J. Chem. Phys. 1993, 98, 10089–10092.
  35. Martyna, G.J.; Klein, M.L.; Tuckerman, M. Nose-Hoover Chains-the Canonical Ensemble via Continuous Dynamics. J. Chem. Phys. 1992, 97, 2635–2643.
  36. Srivastava, A.; Yano, J.; Hirozane, Y.; Kefala, G.; Gruswitz, F.; Snell, G.; Lane, W.; Ivetac, A.; Aertgeerts, K.; Nguyen, J.; et al. High-Resolution structure of the human GPR40 receptor bound to allosteric agonist TAK-875. Nature 2014, 513, 124–127.
  37. de Jong, D.H.; Singh, G.; Bennett, W.F.; Arnarez, C.; Wassenaar, T.A.; Schäfer, L.V.; Periole, X.; Tieleman, D.P.; Marrink, S.J. Improved Parameters for the Martini Coarse-Grained Protein Force Field. J. Chem. Theory Comput. 2013, 9, 687–697.
  38. Hornak, V.; Abel, R.; Okur, A.; Strockbine, B.; Roitberg, A.; Simmerling, C. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins 2006, 65, 712–725.
  39. Dickson, C.J.; Walker, R.C.; Gould, I.R. Lipid21: Complex Lipid Membrane Simulations with AMBER. J. Chem. Theory Comput. 2022, 18, 1726–1736. [Google Scholar] [CrossRef]
  40. Humphrey, W.; Dalke, A.; Schulten, K. VMD: Visual molecular dynamics. J. Mol. Graph. 1996, 14, 33–38. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Schematic showing the ezAlign protocol. Starting on the left panel with a CG system for back mapping, each AA molecule is independently aligned to its CG counterpart (Step 1). The CG beads are used for the reference positions for position restraints of the mapped atoms, and a series of energy minimization and stochastic dynamics are run to allow each AA molecule to adopt a conformation consistent with the CG molecule (Step 2). Lipids and small molecules are then combined and relaxed through interactions (Step 3). Finally, water and ions are placed according to the 4–1 mapping of AA-CG waters and four waters in each ion’s solvation shell (Step 4). Short minimization and equilibration steps are used to relax the system and release the position restraints (Step 5).
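As an illustration of Step 1 in Figure 1, the rigid fit of an AA template onto its CG beads can be sketched as a least-squares superposition. The Kabsch algorithm used here is an assumption made for this example, since the caption does not name the fitting method, and the function names `kabsch_fit` and `align_template` are hypothetical rather than part of ezAlign.

```python
import numpy as np


def kabsch_fit(mobile, target):
    """Least-squares rigid fit of `mobile` onto `target` (both N x 3 arrays).

    Returns a proper rotation matrix R and a translation t such that
    mobile @ R.T + t is as close as possible to target.
    """
    mobile = np.asarray(mobile, dtype=float)
    target = np.asarray(target, dtype=float)
    mob_center = mobile.mean(axis=0)
    tgt_center = target.mean(axis=0)

    # Covariance between the centered point sets.
    H = (mobile - mob_center).T @ (target - tgt_center)
    U, _, Vt = np.linalg.svd(H)

    # Force det(R) = +1 so the fit never mirrors the molecule.
    d = 1.0 if np.linalg.det(Vt.T @ U.T) > 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = tgt_center - R @ mob_center
    return R, t


def align_template(template_coords, mapped_idx, cg_coords):
    """Rigidly move a whole AA template so its mapped atoms sit near the CG beads.

    mapped_idx holds 0-based indices of the template atoms that correspond,
    in order, to the rows of cg_coords (illustrative convention only).
    """
    template_coords = np.asarray(template_coords, dtype=float)
    R, t = kabsch_fit(template_coords[list(mapped_idx)], cg_coords)
    return template_coords @ R.T + t
```

Restricting the fit to proper rotations (determinant +1) means the template is only rotated and translated, never reflected, so its stereochemistry is carried over unchanged into the aligned structure.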
Figure 2. Single POPC lipid conformations during the ezAlign procedure. Starting on the left, a single atomistic lipid conformation (i.e., the same for each row) is fit to the respective CG lipid. After energy minimization, the mapped AA atoms overlap with their CG beads, and a short position-restrained stochastic dynamics simulation improves the lipid conformations. The final column on the right overlays the CG lipid surface (blue) with the AA lipid surface (red) after ezAlign.
Figure 3. Comparing methods to set up AA MD simulations of a POPC lipid bilayer. (A) Area per lipid (nm²) following the conversion from CG to AA for Backward and ezAlign and following equilibration for the CHARMM-GUI setup and CG system. (B) Density profiles for POPC lipids in the bilayer averaged over 5 ns of simulation time. (C) A single POPC lipid that was back mapped with Backward and ezAlign. For this single lipid, the Backward method flipped the chirality of the glycerol backbone (encircled above), while ezAlign maintains the correct chirality for POPC.
Figure 4. (A) CG (left) and ezAlign AA (right) amphiphilic molecular self-assembly and interaction with an E. coli inner membrane model. The CTBE tails are orange, and head groups are blue. The POPE and POPG lipids are colored by atom type. The atomistic system contains over 600,000 atoms. (B) Complex model of a human plasma membrane run first with CG (left panel) and converted to AA with ezAlign. Water and ions are not shown for clarity. (C) GPR40 protein in a POPC lipid bilayer converted from CG to AA using ezAlign, showing a top-view (left) and side-view (right). The AA protein is colored magenta, with the CG backbone beads as pink spheres. The CG POPC lipid bilayer is also represented with spheres, colored by bead type. The AA lipids are shown as an orange, semi-transparent surface. Water is not shown for clarity.
Figure 5. CG (left) and AA (right) representations of a hERG ion channel in POPC (top), a GABA-A receptor in POPC (middle), and a RAS/RAF complex in a human plasma membrane model (bottom). AA systems are back mapped from CG with ezAlign. Solvating water and ions are omitted for visual clarity.
Figure 6. An example CG-to-AA mapping of a benzyl-alcohol molecule. The “residues.map” file contains the mapping of CG beads to each atom. In this example, CG beads 1, 2, and 3 map to AA atoms 7, 5, and 2, respectively. The “CG_BZA.top” file is the CG GROMACS molecular topology.
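The bead-to-atom pairs quoted in the Figure 6 caption can be used to sketch how CG bead positions might serve as reference positions for restraining the mapped atoms. The snippet below is a minimal illustration and not the ezAlign implementation: it does not parse the actual “residues.map” format, the function names are hypothetical, the harmonic form 0.5·k·|r − r_ref|² is the usual position-restraint potential, and the force constant `k` is an assumed value.

```python
import numpy as np

# Bead-to-atom pairs quoted in the Figure 6 caption (1-based indices):
# CG beads 1, 2, and 3 map to AA atoms 7, 5, and 2.
BEAD_TO_ATOM = {1: 7, 2: 5, 3: 2}


def restraint_references(cg_coords, bead_to_atom):
    """Return {atom index: reference position}, taking each reference
    position from the CG bead that the atom is mapped to."""
    return {
        atom: np.asarray(cg_coords[bead - 1], dtype=float)
        for bead, atom in bead_to_atom.items()
    }


def restraint_energy(aa_coords, references, k=1000.0):
    """Harmonic restraint energy summed over the mapped atoms.

    k is a hypothetical force constant (kJ mol^-1 nm^-2 if the
    coordinates are in nm); it is not taken from the paper.
    """
    energy = 0.0
    for atom, ref in references.items():
        delta = np.asarray(aa_coords[atom - 1], dtype=float) - ref
        energy += 0.5 * k * float(delta @ delta)
    return energy
```

Only the mapped atoms contribute to this energy, so the remaining atoms stay free to relax around them while the mapped atoms are pulled toward their beads.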


Bennett, W.F.D.; Bernardi, A.; Ozturk, T.N.; Ingólfsson, H.I.; Fox, S.J.; Sun, D.; Maupin, C.M. ezAlign: A Tool for Converting Coarse-Grained Molecular Dynamics Structures to Atomistic Resolution for Multiscale Modeling. Molecules 2024, 29, 3557. https://doi.org/10.3390/molecules29153557

