Computational Modeling of DNA 3D Structures: From Dynamics and Mechanics to Folding

Mu, Zi-Chun; Tan, Ya-Lan; Liu, Jie; Zhang, Ben-Gong; Shi, Ya-Zhou

doi:10.3390/molecules28124833

Open Access Review

by Zi-Chun Mu ^1,2, Ya-Lan Tan ^1, Jie Liu ^1, Ben-Gong Zhang ^1,* and Ya-Zhou Shi ^1,*

^1 Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan 430073, China

^2 School of Computer Science and Artificial Intelligence, Wuhan Textile University, Wuhan 430073, China

^* Authors to whom correspondence should be addressed.

Molecules 2023, 28(12), 4833; https://doi.org/10.3390/molecules28124833

Submission received: 9 May 2023 / Revised: 11 June 2023 / Accepted: 14 June 2023 / Published: 17 June 2023

(This article belongs to the Special Issue Recent Progress for Structure and Function Prediction of Protein and RNA)

Abstract

DNA carries the genetic information required for the synthesis of RNA and proteins and plays an important role in many processes of biological development. Understanding the three-dimensional (3D) structures and dynamics of DNA is crucial for understanding its biological functions and guiding the development of novel materials. In this review, we discuss recent advances in computational methods for studying DNA 3D structures. These include molecular dynamics simulations used to analyze DNA dynamics, flexibility, and ion binding. We also explore various coarse-grained models used for DNA structure prediction or folding, along with fragment assembly methods for constructing DNA 3D structures. Finally, we discuss the advantages and disadvantages of these methods and highlight their differences.

Keywords:

DNA 3D structures; computational modeling; molecular dynamics simulations; coarse-grained models; structure fragment assembly

1. Introduction

Since the genetic information encoded by DNA forms the basis of life [1,2], the exploration of its structure and stability is a thriving field. For instance, the number of known functional DNA structures in the Protein Data Bank (PDB) continues to increase each year (see Figure 1a). In terms of stability, most DNA exhibits a right-handed double helix structure (B-form, see Figure 1b). This structure follows the Watson–Crick–Franklin law of A-T and G-C base-pairing, serving as a carrier for storing and transmitting genetic information in living organisms [3]. However, recent research has indicated that roughly 13% of human genes can adopt non-right-handed double helix structures (non-B-form) [4,5,6,7,8,9,10], such as hairpins [4], Z-DNA [5], triplexes [7], G-quadruplexes [8,9,10], and i-motifs [4,5] (see Figure 1c–f). These structures have been observed to play significant roles in various cellular processes, including gene expression regulation and cancer development [4,5,6]. For example, DNA triplexes are generally involved in mutagenesis, genetic instability, and DNA repair or recombination, and mutations in helicases that act on G-quadruplex structures could lead to DNA damage or replication errors [7,8,9]. In addition, DNA nanostructures and devices (e.g., interlocks, walkers, tweezers, motors, shuttles, logic circuits, and origami) have immense potential for applications in various fields such as biosensing, food safety, and cancer therapy [11,12,13].

The function of DNA often depends on its 3D structure [3,4,7,8,9]. For example, the dynamically interchangeable G-quadruplex structures in HIV-1 can be stabilized by ligand binding, resulting in decreased viral production [14]. Therefore, understanding the 3D structures and properties of DNA (e.g., dynamics, thermodynamics, and mechanics) is useful in understanding its biological functions and designing DNA nanomaterials [2,3,4,5,6,14]. However, the flexibility and polymorphism of DNA present challenges for current experimental techniques, such as cryo-electron microscopy, X-ray crystallography, NMR spectroscopy, and single-molecule techniques (e.g., optical/magnetic tweezers and atomic force microscopy) [15,16,17,18]. These experimental methods face difficulties in elucidating the underlying aspects of DNA folding, hybridization, and stability. Since these methods are also often time-consuming and costly, the number of known DNA structures in the database is still very limited (see Figure 1a).

The field of computer simulation is advancing quickly, providing more precise insights into essential aspects of DNA biophysics compared to traditional experimental approaches [19,20,21,22]. Molecular dynamics (MD) simulations, for instance, can generally reproduce the behavior of molecules in a computer, providing detailed structural and dynamical insights that enhance our comprehension of relevant experimental data. In recent years, MD simulations using classical force fields such as AMBER [23,24] and CHARMM [25] have provided highly detailed and flexible descriptions of DNA dynamics, including structural transformations, stability of non-canonical conformations, salt ion cohesion effects, twist-stretch coupling of stress, flexibility under methylation modifications, and interactions with other macromolecules. It is always fascinating to obtain microscopic insights into DNA dynamics through MD simulations. However, the innumerable degrees of freedom, interconnected in complex ways, can make it practically impossible to capture DNA dynamics on biologically relevant time scales and length scales using currently available computer hardware [26].

In contrast to all-atom models, continuous DNA models such as the worm-like chain (WLC) model effectively describe the mechanical behavior of double-stranded DNA (dsDNA) on larger length scales. This model considers the double helix as an elastic rod with torsional and bending stiffness (i.e., the predefined angle between neighbor beads and persistence length of the chain) [27,28,29]. Similarly, the nearest neighbor model can predict the secondary structure and melting profiles (such as free energy and melting temperature) of single-stranded DNA (ssDNA) and dsDNA. This model assumes that the free energy of DNA is the sum of the free energy of each base stack, which has been determined through thermodynamic experiments [30,31]. However, these basic models are unable to provide insights into the three-dimensional (3D) structures of DNA.
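As a minimal sketch of the two models just described, the Python snippet below evaluates the widely used Marko–Siggia interpolation formula for WLC force-extension, the exponential decay of tangent correlations with persistence length, and a nearest-neighbor free energy computed as a sum over base stacks. The function names, the default persistence length of 50 nm (a commonly quoted value for dsDNA), the thermal energy constant, and the caller-supplied stack table are assumptions made for illustration; they are not taken from the works reviewed here.

```python
import math

KB_T_PN_NM = 4.11  # approximate thermal energy k_B*T near 298 K, in pN*nm


def wlc_force(extension_nm, contour_nm, persistence_nm=50.0, kbt=KB_T_PN_NM):
    """Marko-Siggia interpolation: force (pN) holding a WLC at a given extension."""
    x = extension_nm / contour_nm  # relative extension
    if not 0.0 <= x < 1.0:
        raise ValueError("relative extension must lie in [0, 1)")
    return (kbt / persistence_nm) * (0.25 / (1.0 - x) ** 2 - 0.25 + x)


def tangent_correlation(separation_nm, persistence_nm=50.0):
    """<t(0).t(s)> = exp(-s / Lp) for an ideal worm-like chain in 3D."""
    return math.exp(-separation_nm / persistence_nm)


def nearest_neighbor_dg(sequence, stack_dg, init_dg=0.0):
    """Sum stack free energies over consecutive dinucleotide steps (5'->3').

    stack_dg maps a two-letter step (e.g. "AT") to its free energy; the
    values must come from a published thermodynamic table (assumption:
    the caller supplies them, none are hard-coded here).
    """
    total = init_dg
    for i in range(len(sequence) - 1):
        total += stack_dg[sequence[i:i + 2]]
    return total
```

Neither function returns 3D coordinates: the first two describe chain-level mechanics and the third a sequence-level free energy, which is the limitation noted above.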

Meanwhile, coarse-grained (CG) models, which combine highly correlated atoms in the DNA nucleotide into a few interacting sites, can play a crucial role in describing complex biological macromolecular systems (e.g., DNA–protein complexes and DNA nanostructures) at larger length/time scales [32,33]. Compared to all-atom models with large numbers of particles, CG models have a small number of degrees of freedom due to the reduced resolution. CG models are generally effective in studying DNA 3D structures, dynamics, flexibility, and interactions with other biological macromolecules (such as RNA and protein) [34,35,36]. However, all-atom MD simulations for DNA generally require known 3D structures as input. Although many models have been developed for RNA 3D structure prediction [37,38,39,40], there are few methods that can be directly used to predict DNA 3D structures, especially from the sequence. Recently, Xiao et al. provided a fragment assembly method (3dDNA) to automatically predict 3D structures for small DNAs (<100 nt) with very high precision [41,42,43,44]. Since the method depends on limited templates and known secondary structures of the DNA, more DNA 3D structure prediction models, especially ab initio ones, are still needed.

In this work, we provide a comprehensive review of computer modeling techniques used for studying DNA 3D structures. Our goal is to provide an in-depth understanding of the current state-of-the-art research, as well as to discuss the challenges and future developments in the field of DNA 3D structure modeling. First, we review the powerful and versatile all-atom MD simulations, including progress and limitations in capturing DNA dynamics, flexibility, and ionic interactions. Then, we highlight representative DNA CG models that exhibit excellent performance in DNA folding or in simulations of large DNAs beyond the capabilities of all-atom MD simulations. Finally, since MD simulations and several CG models generally require known 3D structures, we provide a brief overview of current structure assembly methods that can construct 3D structures of DNA from their sequences or secondary structures.

2. Molecular Dynamics Simulations for DNAs

MD simulations of DNA systems are typically performed by calculating the force on each atom as a function of the atomic positions using all-atom force fields (such as AMBER, CHARMM, GROMOS, and OPLS) (Figure 2a). These force fields are parameterized using experiments or quantum chemistry calculations of small systems [23,24,25,45,46,47]. CHARMM36 [25] and AMBER ff99bsc1 [47], which have been validated and improved through multiple revisions, are commonly used for DNA simulations. Although these force fields have limitations, such as AMBER potentially overestimating base stacking effects and CHARMM weakening base pairing [48,49], they have been successfully employed to simulate DNA systems, providing atomistic resolution and establishing quantitative relationships between structure and conformational energy [50,51].
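The basic loop of an MD simulation (compute forces from positions, then advance positions and velocities) can be sketched as follows. This is an illustrative toy, not a DNA force field: it assumes a plain Lennard-Jones pair potential in reduced units, no cutoff, no thermostat, and a single mass for all particles. The function names are hypothetical.

```python
def lj_forces(pos, epsilon=1.0, sigma=1.0):
    """Return (forces, potential energy) for Lennard-Jones particles.

    pos is a list of [x, y, z] coordinates in reduced units.
    """
    n = len(pos)
    forces = [[0.0, 0.0, 0.0] for _ in range(n)]
    energy = 0.0
    for i in range(n - 1):
        for j in range(i + 1, n):
            rij = [pos[i][k] - pos[j][k] for k in range(3)]
            r2 = sum(c * c for c in rij)
            sr6 = (sigma * sigma / r2) ** 3
            energy += 4.0 * epsilon * (sr6 * sr6 - sr6)
            # F_i = -dU/dr_i = 24*eps*(2*(s/r)^12 - (s/r)^6) / r^2 * r_ij
            scale = 24.0 * epsilon * (2.0 * sr6 * sr6 - sr6) / r2
            for k in range(3):
                forces[i][k] += scale * rij[k]
                forces[j][k] -= scale * rij[k]
    return forces, energy


def kinetic_energy(vel, mass):
    """Total kinetic energy for particles of equal mass."""
    return 0.5 * mass * sum(c * c for v in vel for c in v)


def velocity_verlet(pos, vel, mass, dt, n_steps, force_fn=lj_forces):
    """Integrate Newton's equations with the velocity Verlet scheme."""
    pos = [list(p) for p in pos]
    vel = [list(v) for v in vel]
    forces, _ = force_fn(pos)
    for _ in range(n_steps):
        for i in range(len(pos)):
            for k in range(3):
                vel[i][k] += 0.5 * dt * forces[i][k] / mass  # half kick
                pos[i][k] += dt * vel[i][k]                  # drift
        forces, _ = force_fn(pos)                            # forces at new positions
        for i in range(len(pos)):
            for k in range(3):
                vel[i][k] += 0.5 * dt * forces[i][k] / mass  # second half kick
    return pos, vel
```

Production packages such as those named above add bonded terms, long-range electrostatics, constraints, and temperature or pressure control on top of this loop.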

2.1. Structural Dynamics

MD simulations have been effective in accurately probing the atomic motions and structural dynamics of DNAs [52,53,54,55,56,57,58,59], helping us to understand DNA functions. To address the question of how long an MD simulation of a B-DNA helix needs to be to sample the dominant structural and dynamical features, Galindo-Murillo et al. presented an extensive analysis using multiple μs-length MD simulations of a dsDNA (d(GCACGAACGAACGAACGC)) with Amber 14 and the ff99SB parmbsc0 or CHARMM C36 force field on multiple computer architectures (including Anton, CPU, and GPU). The results showed that despite the underlying differences in hardware, the simulations performed on different architectures exhibited minimal structural variation with respect to one another. These MD simulations, including the longest one at ~44 μs, also suggested that the structure and dynamics of the DNA helix, excluding the terminal base pairs, reach near-full convergence on the ~1–5 μs timescale. This indicates that the current force fields are reasonably robust. However, the convergence of the terminal base pair opening events occurs on time scales significantly longer than 10 μs and cannot be fully captured through ensembles of shorter and independent MD simulations [60,61]. In a separate study, Yang et al. performed umbrella sampling MD simulations of A-T-rich B-DNA using the Amber force field and reproduced the experimental conformational transition path from Watson–Crick to Hoogsteen base pairs observed in NMR relaxation dispersion spectroscopy [62]. This indicates that, with an enhanced-sampling approach, MD simulations can describe large-scale structural dynamics within short simulation times [63].

In addition, MD simulations can also provide detailed insight into DNA structure dynamics. For example, Chakraborty et al. employed the AMBER12 package and Joung/Cheatham ion parameters to explore the transition between B- and Z-dsDNAs. Their study found that the free energy landscape exhibits two distinct funnels, leading to the B-DNA and Z-DNA conformations. This suggests that the reversal of chirality is caused by the stretched DNA structure or mutual competition at the B–Z junction [64].

2.2. Structural Flexibility

In recent years, MD simulations have been widely used to study the flexibility of DNAs, as DNA structural flexibility is closely associated with many biological processes involving the storage or encoding of genetic information [65,66]. Although many results from single-molecule experiments can be well-described by the commonly accepted WLC models [27,28,29], atomistic MD simulations are extensively used to obtain microscopic descriptions of DNA flexibility, such as the width and depth of the major/minor grooves and the distances/twist angles between neighboring base pairs [67,68,69]. For example, to explain the experimental results that short DNAs consisting of tens of base pairs (bp) may have seemingly higher flexibility than those of kilobase pairs, Wu et al. performed MD simulations for short dsDNAs with finite lengths of 5–50 bp using the Amber parmbsc0 force field. Their microscopic analyses (the calculation of stretching and bending at the base-pair level) revealed that the apparent high flexibility of short dsDNAs arises from significantly strong bending and stretching flexibilities at each helix end, consisting of ∼6 bp [70]. In addition to the length-dependent flexibility of DNA, Marin-Gonzalez et al. performed constant-force MD simulations, each over 1 μs long, of 18 bp dsDNAs (CGCG(NN)₅CGCG, with NN being AA, AC, AG, AT, CG, or GG). They found that the DNA crookedness (a sequence-dependent deformation of DNA that consists of periodic bends in the chain of base pair centers) and its associated flexibility can regulate DNA mechanical properties at short length scales. This unveiled a one-to-one relation between DNA structure and dynamics [71]. To understand the distinct differences in the flexibility of dsRNA and dsDNA helices, Liebl et al. performed unrestrained/restrained MD simulations for a 16 bp dsDNA or dsRNA using the AMBER12 package with the parmbsc0 force field.
Their detailed analysis of helical deformations, coupled with twist, indicated that the interplay of helical rise, base pair inclination, and displacement from the helix axis during twist changes is responsible for the different twist–stretch correlations [72]. Similarly, Marin-Gonzalez et al. investigated the difference between dsDNA and dsRNA (16 bp) using microsecond-long MD simulations under constant stretching forces within the range of 1–20 pN. They showed that the opposite twist–stretch coupling of both molecules is due to the markedly different evolution of inter-strand distance with the stretching force, which is directly correlated with the slide base-pair parameter and sugar pucker angle [73]. Recently, Bao et al. also conducted extensive MD simulations for larger dsDNA and dsRNA (40 bp) without applying a stretching force, using the AMBER ff99bsc0 force field. Their work provides a more quantitative understanding of the relative flexibility of dsRNA and dsDNA in terms of both stretching and twist–stretch coupling. They noted that the striking difference in twist–stretch coupling between dsRNA and dsDNA is attributed to the apparently stronger base-pair inclination in dsRNA compared to dsDNA (Figure 2b) [74].

In addition, MD simulations can be used to reproduce the effect of base modifications or base-pair mutations on DNA flexibility [75,76]. For example, Aksimentiev et al. combined MD simulations (using the NAMD program with a CHARMM36 force field) with a single-molecule cyclization assay to study how different cytosine modifications influence the physical properties of dsDNA (70 bp). They elucidated the microscopic mechanisms behind the changes in DNA flexibility induced by cytosine modifications: these modifications can promote or dampen structural fluctuations through the competing effects of base polarity and steric hindrance [77]. Given that the appearance of mismatched base pairs (MMs) can result in the development of inherited genetic diseases, cancer, and aging, Rossetti et al. presented the first comprehensive study on the structure of MM-containing DNA duplexes (12 MMs, including A·A, A·C, A·G, C·A, C·C, C·T, G·A, G·G, G·T, T·C, T·G, and T·T, placed in the center of 13 bp duplexes, e.g., d(CCATACXATACGG)). They employed MD simulations (Gromacs v.4.5.5 program with parmbsc0 force field) and NMR spectroscopy and found that the presence of mismatches produced significant local structural alterations due to the flexible MMs (especially in the case of purine transversions). These alterations could be propagated far from the mismatch site, influencing the global structures of DNA [78]. On the other hand, Bouchal et al. also employed MD simulations (Amber 16 package with parmbsc1 force field) to calculate the thermodynamic stabilities of MMs in similar dsDNAs (e.g., d(GGTTAAXTTAACC) with anti/anti, anti/syn, and syn/anti MM combinations) as a function of two geometry parameters of the base pair (opening and shear). However, their detailed analysis showed that there was no clear distinction between the canonical and mismatched base pairs [79].
This discrepancy suggests that MD simulations may be less reliable in capturing the local sequence effects on DNA flexibility [74] due to the empirical nature of the force fields.

2.3. DNA–Ion Interaction

Since DNA is an anionic polyelectrolyte, the solvent environment plays a significant role in DNA structures [80,81,82,83,84]. Pasi et al. performed microsecond MD simulations for 39 dsDNAs (with a length of 18 bp and different sequences) under physiological salt conditions using the parmbsc0 force field with Dang parameters for the ions. They provided a comprehensive state-of-the-art perspective on sequence-dependent potassium ion populations. For example, they observed that potassium ions within the grooves are more likely to accumulate around electronegative base sites rather than the anionic phosphate groups [85]. Considering the experimental results showing that high-valent cations can have opposite effects on the elasticities of DNA and RNA duplexes, Fu et al. used MD simulations for 20 bp dsDNA and dsRNA in trivalent ion solutions (i.e., CoHex³⁺). They found that these results were caused by different binding modes of the cations on dsDNA and dsRNA [86]. More recently, Cruz-Leon et al. combined high-resolution magnetic tweezers (MT) experiments with MD simulations (parmbsc1 force field on 33 bp dsDNA) to show that increasing ion concentration leads to a decrease in helical radius and crookedness, an increase in sugar pucker, and ultimately an increase in twist. This is due to the increased screening of electrostatic repulsion between phosphate groups [87].

Furthermore, MD can provide an atomistic understanding of how DNA–ion interactions vary with different metal ions (Figure 2b,c). For example, Long et al. performed MD simulations to sample the structures of a 23 bp DNA duplex in various ion solutions (such as Mg²⁺, Ca²⁺, Sr²⁺, or Ba²⁺). They demonstrated that these ions exhibit a preference for binding to the phosphate backbone rather than the major groove [88]. To investigate the competitive binding of divalent and monovalent ions to dsDNA, Xi et al. performed all-atom MD simulations for a 24 bp dsDNA in mixed Mg²⁺/Na⁺ solutions using the Amber parmbsc0 force field with the Joung/Cheatham ion model for Na⁺/Cl⁻ and the Åqvist ion model for Mg²⁺. Their comprehensive analysis suggested that the global binding of Mg²⁺ over Na⁺ to nucleic acids is primarily dependent on the surface charge density and Mg²⁺/Na⁺ concentrations [89].

2.4. Limitations

In the last 40 years, MD simulations have made significant progress in providing atomistic insights into DNA structures, including dynamics, flexibility, and ion binding. Although recent efforts combining experiments and simulations show promise for improving the accuracy of nucleic acid force fields, MD simulations are not always effective, particularly for ssDNAs [90,91]. Recently, we performed MD simulations for unstructured ssDNA (with a random sequence: 5′-CTGCCACGCCATGCCTGTGACGA-3′ at 1 M [Na⁺]) and tried to extract the bonded parameters from the equilibrium conformations. However, we found that the distributions of several angles in MD conformations deviated from those observed in PDB structures. For example, the P-C4′-P angle showed a deviation of ~11° from its optimal value, as shown in Figure 2d. In addition, the ion parameters, which are optimized based on a set of experimental solution properties such as solvation free energies, radial distribution functions, water exchange rates, and activity coefficient derivatives, could be limited in their transferability to quantitatively describe biomolecular systems [92,93]. Thus, further investigations of diverse DNA structures (e.g., ssDNA, pseudoknots, G-quadruplexes, i-motifs, and DNA complexes) in ion solutions are needed to assess the quality of these force fields [90,91,94,95].

Furthermore, equilibrium MD simulations do not always explore the conformational space sufficiently for accurate property estimation [96,97]. In MD simulations, the initial conformation is usually established based on an experimentally known structure. If the molecule has another stable conformation that is separated from the initial one by a high free energy barrier, reaching this alternative conformation at a realistic computational cost becomes challenging [91]. Sampling therefore remains an issue in some nucleic acid simulations, requiring longer simulation time scales and efficient enhanced sampling methods (e.g., temperature replica exchange, Hamiltonian and multi-dimensional replica exchange, metadynamics, and umbrella sampling). These efforts are important for future advancements [98,99,100].
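To illustrate how temperature replica exchange helps a system cross free energy barriers, the sketch below implements the standard Metropolis criterion for swapping configurations between two replicas, together with a geometric temperature ladder, which is one common (but not the only) choice. The function names and the kcal/mol energy units are assumptions made for this example.

```python
import math

KB_KCAL = 0.0019872041  # Boltzmann constant in kcal/(mol*K)


def exchange_probability(energy_i, temp_i, energy_j, temp_j, kb=KB_KCAL):
    """Metropolis probability of swapping the configurations of replicas i and j.

    P = min(1, exp[(beta_i - beta_j) * (E_i - E_j)]), with beta = 1/(kB*T).
    """
    beta_i = 1.0 / (kb * temp_i)
    beta_j = 1.0 / (kb * temp_j)
    delta = (beta_i - beta_j) * (energy_i - energy_j)
    return 1.0 if delta >= 0.0 else math.exp(delta)


def geometric_ladder(t_min, t_max, n_replicas):
    """Temperatures spaced by a constant ratio between t_min and t_max."""
    if n_replicas < 2:
        raise ValueError("need at least two replicas")
    ratio = (t_max / t_min) ** (1.0 / (n_replicas - 1))
    return [t_min * ratio ** k for k in range(n_replicas)]
```

A swap that moves the lower-energy configuration to the colder replica is always accepted; the reverse swap is accepted only with the Boltzmann-weighted probability, which lets conformations found at high temperature reach the temperature of interest.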

3. Coarse-Grained (CG) Modeling for DNAs

Due to the computational limitations of MD simulations, CG models are often utilized to study DNA structure folding, such as hybridization and melting. These models reduce the complexity of atomistic simulation systems by averaging nonessential degrees of freedom [32,33]. There are currently two primary approaches to CG DNA modeling: top-down, which involves parameter fitting to experimental data, and bottom-up, which involves analyzing parameters from all-atom MD simulations or quantum chemistry calculations [34]. Based on their design purpose and capability, existing DNA CG models fall into three main categories: 3D structure dynamics, properties/folding, and prediction [32,33,34] (see Table 1).

3.1. CG Models for DNA Structure Dynamics

Since all-atom MD simulations are limited by computational cost in the length scales and time scales they can reach (typically from nanoseconds to milliseconds), CG MD simulations can be a suitable approach for accelerating the study of large DNA structures.

For this purpose, Marrink et al. proposed an explicit solvent-based DNA CG model that is compatible with the Martini force fields, suited for MD simulations of biomolecular systems [101,102,103]. In the Martini DNA model, each nucleotide is mapped to six or seven CG beads, with one bead for the phosphate group, two for the sugar ring, and three (four) for the pyrimidines (purines) (see Figure 3g). Similar to the Martini protein force field [101], the model incorporates conventional bonded (bond length, angle, and dihedral angle) and nonbonded interactions (Lennard–Jones potential and Coulombic energy) for DNA. In addition, a new interaction was added to model directional hydrogen bonds in DNA. The force field was parameterized by combining top-down information from experiments with bottom-up information derived from reference all-atom MD simulations. For the bonded parameters, all-atom and CG MD simulations were performed on 10 ssDNAs with different sequences, respectively, and the parameters were adjusted to match the conformations available in the Martini force field to the conformational space of the reference all-atom model as closely as possible. The nonbonded parameters were derived from partitioning the nucleobases between polar and nonpolar solvents, as well as the base–base potential of mean force calculations. The model was validated by reproducing the radius of gyration of ssDNA, as well as the double helical structures and persistence length of dsDNA, as observed in atomistic simulations under high ion concentrations. It is important to note that, for dsDNA, an elastic network (which involved predefining the pairing bases and adding interactions between them) was used in the Martini model to preserve the secondary structure. 
Although the Martini DNA model cannot be used to study DNA hybridization, melting, and hairpin formation due to its inability to model directional hydrogen bonds, its speed and compatibility pave the way for large-scale modeling of complex biomolecular systems involving DNA, such as DNA–protein interactions [102].
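The bead mapping described above (one phosphate bead, two sugar beads, and three or four base beads) can be turned into a quick estimate of system size. The sketch below is a hypothetical helper rather than part of the Martini tooling, and it assumes every nucleotide carries a phosphate bead, which may not hold for a terminal nucleotide.

```python
# Beads per nucleotide component, following the mapping described in the text.
BACKBONE_BEADS = {"phosphate": 1, "sugar": 2}
BASE_BEADS = {"A": 4, "G": 4, "C": 3, "T": 3}  # purines: 4, pyrimidines: 3


def beads_per_nucleotide(base):
    """Number of CG beads for one nucleotide (7 for purines, 6 for pyrimidines)."""
    return sum(BACKBONE_BEADS.values()) + BASE_BEADS[base]


def count_cg_beads(sequence):
    """Total CG beads for a single DNA strand given as a string of A/C/G/T."""
    return sum(beads_per_nucleotide(base) for base in sequence.upper())
```

For a duplex, the count is the sum over both strands; solvent and ion beads of the explicit-solvent model come in addition.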

Recently, another sequence-dependent CG model (MADna) was proposed by Assenza and Pérez for simulating dsDNA [116]. In the MADna model, each nucleotide is represented by three effective particles located at the geometric centers of the phosphate group, sugar, and base (see Figure 3f). In the model, sequence-dependent bonded interactions, including bond, angle, and dihedral potentials, are used to connect beads within the same strand as well as to provide inter-strand links (e.g., between beads in preassigned pairing nucleotides). These interactions are tuned to reproduce the results of atomistic simulations of dsDNAs with various sequences. In addition, the model includes an excluded-volume interaction implemented through the repulsive component of a Lennard–Jones interaction, and salt-screened electrostatics are modeled via a Debye–Hückel (DH) interaction (between P beads with a charge of −0.6). Implemented in LAMMPS, MADna can capture the sequence-dependence of conformational and elastic properties of dsDNA, including main helix parameters, groove geometry, the diameter of the double helix, and spontaneous curvature quantified by bending metrics, with an accuracy comparable to atomistic simulations. Furthermore, the model can reproduce structural elastic features observed in experiments, such as the stretching and torsion moduli, negative twist–stretch coupling, twist–bend coupling, persistence length, and helical pitch. However, due to the double-stranded structure imposed by the bonded interactions in MADna, it cannot account for breaking events such as the formation of kinks or local melting.
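A Debye–Hückel interaction of the kind mentioned above can be sketched in a few lines. The bead charge of −0.6 comes from the text; the dielectric constant of water (78.4), the temperature, the kcal/mol and Å units, and the function names are assumptions for illustration and may differ from the actual MADna implementation.

```python
import math

# Physical constants in SI units
EPS0 = 8.8541878128e-12     # vacuum permittivity, F/m
KB = 1.380649e-23           # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19  # elementary charge, C
N_A = 6.02214076e23         # Avogadro constant, 1/mol
COULOMB_KCAL_A = 332.0637   # e^2/(4*pi*eps0) in kcal*Angstrom/mol


def debye_length_angstrom(ionic_strength_molar, temperature_k=298.15,
                          dielectric=78.4):
    """Debye screening length (Angstrom) at a given ionic strength (mol/L)."""
    ionic_si = ionic_strength_molar * 1000.0  # mol/L -> mol/m^3
    lam_m = math.sqrt(EPS0 * dielectric * KB * temperature_k
                      / (2.0 * N_A * E_CHARGE ** 2 * ionic_si))
    return lam_m * 1e10


def debye_huckel_energy(r_angstrom, ionic_strength_molar, q1=-0.6, q2=-0.6,
                        temperature_k=298.15, dielectric=78.4):
    """Screened Coulomb energy (kcal/mol) between two charged beads."""
    lam = debye_length_angstrom(ionic_strength_molar, temperature_k, dielectric)
    coulomb = COULOMB_KCAL_A * q1 * q2 / (dielectric * r_angstrom)
    return coulomb * math.exp(-r_angstrom / lam)
```

Raising the salt concentration shortens the screening length, so the repulsion between phosphate beads decays faster with distance; this is how an implicit-ion model of this type encodes salt dependence.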

Table 1. Existing DNA structure modeling models/methods.

Models	Representation	Type ^a	Application ^b	Available ^c
Hall et al. [119]	1 bead	Gō-like	Duplex/Triplex T_m	/
Aksimentiev et al. [77]	2 beads	ab initio	R_g, L_p, force-extension	/
oxDNA [104,105,106,107]	2 beads	Gō-like	T_m, L_p, force-extension, hybridization, dynamics, DNA–ion interaction, and nanotechnology	https://oxdna.org (accessed on 1 October 2022)
NARES-2P [108,109]	2 beads	ab initio	3D structure prediction, T_m, dynamics	/
Mittal et al. [120]	2 beads	Gō-like	T_m, Particle interactions	/
MADna [116]	3 beads	MD	dsDNA structure/elastic properties, L_p	https://github.com/saassenza/MADna (accessed on 1 October 2022)
3SPN [110,111,112,113]	3 beads	Gō-like	T_m, L_p, structure properties, dynamics, hybridization, DNA–ion interaction, nanotechnology	https://github.com/depablogroup (accessed on 1 October 2022)
TIS [115]	3 beads	Gō-like	R_g, L_p, T_m, force-extension	/
Plotkin et al. [121]	3 beads	ab initio	L_p, DNA twist, and stacking	/
Shi et al. [122]	3 beads	ab initio	3D structure prediction, salt effect, T_m, L_p	https://github.com/RNA-folding-lab/DNAfold (accessed on 1 October 2022)
BioModi [114]	3 beads	Gō-like	Hybridization and self-assembly kinetics, salt-dependent L_p	/
Dorfman et al. [123,124,125]	3 beads	ab initio	T_m, dynamics, structure properties, triplex forming	/
Nordenskiöld et al. [126]	5 beads	MD	dsDNA L_p, L_T	/
SIRAH [127]	6 beads	MD	dsDNA T_m, transitions, and dynamics	/
“sugar” CG [128]	6 beads	MD	dsDNA transitions, DNA–ion interaction	/
MARTINI [102]	6/7 beads	MD	R_g, L_p, 3D structure, DNA–ion interaction, DNA–protein complexes	http://cgmartini.nl/ (accessed on 1 October 2022)
HiRe-DNA [118]	6/7 beads	ab initio	dsDNA 3D structure, T_m	/
UNRES-like DNA [117]	6/7/8 beads	ab initio	dsDNA 3D structure, structure properties, and hybridization	/
3dDNA [44]	all-atom	structure assembly	3D structure prediction for DNAs with single, double, and multi-chains	http://biophy.hust.edu.cn/new/3dRNA (accessed on 1 October 2022)
Saiz et al. [129]	all-atom	structure assembly	ssDNA 3D structure prediction	/
Rahim et al. [130]	all-atom	structure assembly	ssDNA 3D structure prediction	/

^a ab initio: modeling DNA structure from sequence only; Gō-like: predefined secondary structure or base-pairing network is needed; MD: 3D structure is needed; structure assembly: constructing DNA 3D structures based on the secondary structure. ^b indicates what the models can be used for. T_m: melting temperature; R_g: radius of gyration; L_p: persistence length; L_T: torsion persistence length. ^c indicates the open source code or web server of each model/method, and ‘/’ indicates that the model is unavailable.

3.2. CG Models for DNA Structure Folding

Since the above CG models developed for MD simulations require known DNA 3D structures as input, it is difficult to use them to study DNA folding processes such as hybridization, melting, and hairpin formation.

Generally, the Gō-type model is very effective in studying the folding of macromolecules (protein, RNA, and DNA) [104,105,106,107,110,111,112,113,115,131,132,133,134,135]. It achieves this by only considering the interactions that occur at the given native contact sites. oxDNA is one outstanding representative of this type of model and can capture the structural, thermodynamic, and mechanical properties of DNA [104,105,106,107]. In this model, DNA is represented as a string of rigid nucleotides with interaction sites for backbone, stacking, and hydrogen bonding interactions (see Figure 3a). The pairwise potential comprises eight interactions (see Table 2), including connectivity between neighboring backbones, the favorable stacking interactions between adjacent bases, coaxial stacking, and electrostatic repulsive interactions. The model was parameterized using a heuristic top-down approach, which involved reproducing well-known properties of DNA (such as the helical structure of dsDNA) and experimental results (such as melting temperatures of ds/ssDNAs). Combined with the virtual move Monte Carlo algorithm or the LAMMPS simulation software, this model has provided key insights into many different processes relevant to DNA nanotechnology and biophysics. It has also shown direct agreement with experimentally measured properties across a range of systems, including duplex hybridization, hairpin formation, DNA overstretching, thermodynamics, and structural properties of ss/dsDNAs.
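The idea of rewarding only native contacts can be made concrete with a generic Gō-like energy function. The sketch below is not the oxDNA potential: it assumes a simple 12-6 well whose minimum sits at each native distance, a purely repulsive term for all other pairs, and no bonded terms. All names and default parameters are hypothetical.

```python
import math


def go_like_energy(coords, native_contacts, epsilon=1.0, repulsive_sigma=4.0):
    """Generic Go-like nonbonded energy.

    coords: list of (x, y, z) bead positions.
    native_contacts: dict mapping (i, j) with i < j to the native distance.
    Native pairs get an attractive 12-6 well of depth epsilon at the native
    distance; every other pair is purely repulsive.
    """
    energy = 0.0
    n = len(coords)
    for i in range(n - 1):
        for j in range(i + 1, n):
            r = math.dist(coords[i], coords[j])
            if (i, j) in native_contacts:
                s = native_contacts[(i, j)] / r
                energy += epsilon * (s ** 12 - 2.0 * s ** 6)
            else:
                energy += epsilon * (repulsive_sigma / r) ** 12
    return energy


def native_fraction(coords, native_contacts, tolerance=1.2):
    """Fraction of native contacts formed (distance within tolerance * native)."""
    if not native_contacts:
        return 0.0
    formed = sum(
        1 for (i, j), r0 in native_contacts.items()
        if math.dist(coords[i], coords[j]) <= tolerance * r0
    )
    return formed / len(native_contacts)
```

Because the contact list is an input, such a model needs the base-pairing network or reference structure in advance, which is the sense of "Gō-like" used in Table 1.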

The 3SPN model is a three-site-per-nucleotide model, with one site each for the phosphate, sugar, and base, thereby rendering the investigation of DNA up to a few microns in length computationally tractable [110,111,112,113] (see Figure 3c). In 3SPN, the potential energy of a DNA system comprises seven distinct contributions (Table 2), including typical bonded potentials (intramolecular bonds, bond angles, and dihedral angles) and pairwise nonbonded interactions (e.g., intra-strand base stacking, inter-strand cross-stacking, base pairing, excluded volume contributions, and electrostatic potential). The model is parametrized using thermal denaturation experimental data at a fixed salt concentration. Through replica exchange MD simulations, 3SPN has been found to effectively reproduce many sequence/salt-dependent structural and mechanical properties of ds/ssDNAs, such as local flexibilities, minor groove width profiles, persistence lengths, melting temperatures, and hybridization rates.

Similar to 3SPN, a new three-interaction site model (TIS) has also been developed to provide a robust description of the sequence-dependent mechanical and thermodynamic properties of ss/ds DNAs [115]. The TIS model includes sequence-dependent stacking, hydrogen bonding, and electrostatic interactions, as well as bond-stretching and bond angle potentials (Table 2). The force constants for the stretching and bending potentials were guided by a Boltzmann inversion procedure using a large representative set of DNA PDB structures, and the parameters in the stacking interactions were calculated using a learning procedure, ensuring faithful reproduction of experimentally measured melting temperatures (i.e., a top-down approach). The model can accurately predict the salt-dependent persistence lengths of ss/dsDNA and melting temperatures of DNA hairpins, which represent a significant improvement over most of the current CG models.
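Boltzmann inversion, mentioned above for the stretching and bending terms, converts an observed distribution P(x) of a structural variable into an effective potential U(x) = −k_B T ln P(x). The sketch below is a generic illustration, not the TIS parameterization: it assumes the convention U = ½k(x − x₀)², ignores Jacobian corrections for distances and angles, and uses hypothetical function names.

```python
import math
import statistics

KB_KCAL = 0.0019872041  # Boltzmann constant in kcal/(mol*K)


def harmonic_from_samples(samples, temperature_k=298.15, kb=KB_KCAL):
    """Fit U(x) = 0.5*k*(x - x0)^2 assuming the samples are Gaussian.

    For a harmonic potential the variance is kB*T/k, so k = kB*T / variance.
    Returns (x0, k).
    """
    x0 = statistics.fmean(samples)
    variance = statistics.pvariance(samples, mu=x0)
    return x0, kb * temperature_k / variance


def boltzmann_invert(samples, bin_width, temperature_k=298.15, kb=KB_KCAL):
    """Histogram the samples and return {bin_center: U}, with min(U) = 0."""
    counts = {}
    for x in samples:
        b = math.floor(x / bin_width)
        counts[b] = counts.get(b, 0) + 1
    total = len(samples)
    pmf = {
        (b + 0.5) * bin_width: -kb * temperature_k * math.log(c / total)
        for b, c in counts.items()
    }
    u_min = min(pmf.values())
    return {center: u - u_min for center, u in pmf.items()}
```

In practice the samples would be bond lengths or angles measured between CG sites in a set of PDB structures; narrow distributions translate into stiff force constants.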

3.3. Ab Initio CG Models

The CG models introduced in the previous section typically utilize a Gō-type potential, which imposes penalties on deviations from a reference structure, to constrain the range of conformations explored by a CG model of DNA. However, this approach also has the potential to restrict the ability of the model to accurately predict structures based solely on sequence information.
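To make the idea of a structure-based penalty concrete, the following sketch builds a native-contact list from a reference structure and evaluates a 12-10 contact energy, which is one common Gō-type form. The cutoff, the sequence-separation threshold, the well depth, and the function names are hypothetical and do not correspond to any specific model in Table 2.

```python
import numpy as np


def native_contacts(ref_coords, cutoff=12.0, min_separation=3):
    """List (i, j, r0) for bead pairs closer than `cutoff` in the reference."""
    ref = np.asarray(ref_coords, dtype=float)
    contacts = []
    for i in range(len(ref)):
        for j in range(i + min_separation, len(ref)):
            r0 = np.linalg.norm(ref[i] - ref[j])
            if r0 < cutoff:
                contacts.append((i, j, r0))
    return contacts


def go_energy(coords, contacts, epsilon=1.0):
    """12-10 contact energy; each native pair has its minimum -epsilon at r0."""
    coords = np.asarray(coords, dtype=float)
    energy = 0.0
    for i, j, r0 in contacts:
        r = np.linalg.norm(coords[i] - coords[j])
        x = r0 / r
        energy += epsilon * (5.0 * x**12 - 6.0 * x**10)
    return energy
```

Because only pairs present in the reference structure contribute, any conformation other than the reference has a higher energy, which is exactly why such a potential cannot be used to predict an unknown fold from sequence alone.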

In contrast to the Gō-like models mentioned above, Plotkin et al. introduced an alternative CG model for DNA [121] that does not use any structure-based potential. In this model, phosphate and sugar groups are represented by one CG spherical residue each, while bases are represented by rigid-body ellipsoids to model their stereochemistry. The total potential includes eight purely physicochemical interactions (Table 2). In addition to the usual local bonded interactions, the model includes electrostatic repulsion between phosphates, van der Waals interactions between any two beads, and base–base hydrogen bonding. These effective interactions were parameterized through all-atom simulations. For example, local potentials along the backbone were obtained from the statistics on conformations obtained from all-atom simulations, and base–base/backbone interactions were obtained from the best fit between van der Waals interactions in an all-atom model and an anisotropic potential between effective ellipsoids. By employing the LAMMPS package, the model generated stable double-stranded helices with both major and minor grooves for dsDNA and predicted the persistence lengths for ss/dsDNA that were comparable to experimental values. Furthermore, the model examined the degree of stacking and twist as functions of temperature, salt concentration, and sequence for ss/dsDNA.

UNRES-like DNA is a physics-based middle-resolution CG model [117]. In this model, the sugar (S) is represented by a neutral bead, the phosphate (P) is represented by a negatively charged bead, and each base (B) is reduced to a set of rigid bipolar beads (e.g., 4 for T and 5 for A) (see Figure 3h). The total potential energy is a summation of ten interaction potentials (Table 2). The parameters of bonded interactions were derived to reproduce the behavior of model systems in the all-atom representation. Nonbonded interactions were approximated using Lennard–Jones, excluded volume, and electrostatic interactions of charges and dipoles. The model was parameterized in a bottom-up fashion with only small adjustments to obtain the correct balance of key interactions. Using an efficient R-RATTLE rigid-body integration algorithm, the model successfully folded three short dsDNAs from separated complementary strands, despite underestimation of persistence lengths of ss/dsDNA.

HiRE-RNA is another high-resolution CG model designed for both RNA and DNA. In this model, each nucleotide is represented by six or seven beads: one for the phosphate (P), four for the sugar, and one/two for the pyrimidine/purine bases [118,136] (see Figure 3i). The force field of this model is expressed as a sum of local bonded, nonbonded, electrostatic, and hydrogen-bond terms (Table 2). Notably, the hydrogen bond interactions in the model consist of three terms: a two-body interaction (distance and angle), a three-body term (to avoid multiple hydrogen bonds of just one base), and a four-body term (representing stacking between two base pairs). The equilibrium geometrical parameters were initially derived from known structures and subsequently refined through the analysis of long MD simulations for a 15 nt Poly(A) molecule. By using replica exchange molecular dynamics (REMD), the model can accurately predict the correct double helix structure from a completely random configuration and allows for the study of dissociation curves as well as the sequence effect on the melting curves of the duplexes.
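The replica exchange step used in REMD can be sketched as follows. This is a generic Metropolis exchange criterion between two temperatures, not code from HiRE-RNA; the geometric temperature ladder, the energy units (kcal/mol), and all function names are assumptions made for illustration.

```python
import math
import random

KB = 0.0019872041  # Boltzmann constant in kcal/(mol*K)


def temperature_ladder(t_min, t_max, n_replicas):
    """Geometrically spaced temperatures from t_min to t_max (n_replicas >= 2)."""
    ratio = (t_max / t_min) ** (1.0 / (n_replicas - 1))
    return [t_min * ratio**k for k in range(n_replicas)]


def exchange_probability(energy_i, energy_j, temp_i, temp_j):
    """Metropolis probability min(1, exp[(beta_i - beta_j) * (E_i - E_j)])."""
    beta_i = 1.0 / (KB * temp_i)
    beta_j = 1.0 / (KB * temp_j)
    delta = (beta_i - beta_j) * (energy_i - energy_j)
    return 1.0 if delta >= 0 else math.exp(delta)


def attempt_swap(energies, temps, k, rng):
    """Try to swap the configurations of replicas k and k+1 (tracked by energy)."""
    p = exchange_probability(energies[k], energies[k + 1], temps[k], temps[k + 1])
    if rng.random() < p:
        energies[k], energies[k + 1] = energies[k + 1], energies[k]
        return True
    return False
```

A swap is always accepted when the colder replica currently holds the higher-energy configuration; otherwise it is accepted with a probability that decays with the energy gap and the temperature spacing, which is why neighbouring temperatures must be close enough for exchanges to occur.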

On the other hand, NARES-2P is a physics-based CG DNA model with only two interaction sites: one for the phosphate (P) and one for the base (B) (see Figure 3b) [108,109]. As in UNRES [117], the effective energy function of the NARES-2P model originates from the PMF of a polynucleotide in water. The energy includes van der Waals or electrostatic interactions between any two beads, virtual bonded interactions, and sugar–base–rotamer energy terms (Table 2). Additionally, a restraint energy was introduced to maintain selected geometric parameters (e.g., site–site distances) within the desired range. These potential energy terms were parameterized using Boltzmann inversion and by fitting to the PMF calculated from the all-atom potential energy surface. The NARES-2P model was built into the UNRES/MD platform, which enables canonical and replica-exchange simulations of nucleic acids to be carried out. Through a global-optimization conformational space annealing algorithm, the model can not only find the native fold for simple DNA duplexes but also reproduce the thermodynamics of folding, although the calculated melting temperatures are generally higher than the experimental values.

Recently, we have also presented a new CG model to fold DNA 3D structures based only on the sequence. In this model, each nucleotide is simplified to three beads corresponding to the phosphate (with a negative charge), sugar, and base (see Figure 4a). The total energy of the system is composed of eight potentials, similar to the RNA CG model previously developed by our team [122,137,138,139,140,141]. The parameters for the bonded potentials, including bond length, bond angle, and bond dihedral, were derived from the Boltzmann inversion of the corresponding atomistic distribution functions obtained through statistical analysis of the experimental structures from the PDB. The excluded volume interaction between the CG beads is modeled by a purely repulsive Lennard–Jones potential. The orientation-dependent base-pairing interaction for the possible canonical Watson–Crick base pairs does not require any predefined structural information (see Figure 4b). The sequence-dependent base-stacking and coaxial-stacking (see Figure 4b) were parameterized using well-established experimental DNA thermodynamic parameters [30,142], and a conformational entropy change was included in the Monte Carlo simulation. It is important to note that the electrostatic interactions between the phosphate beads were also taken into account using the DH approximation, in combination with the counterion condensation theory and tightly-bound ion model [143,144], to predict DNA structures in monovalent/divalent ion solutions.
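The two nonbonded terms named above can be illustrated with a minimal sketch: a Debye–Hückel (DH) screened Coulomb energy between two charged beads and a purely repulsive Lennard–Jones term. The WCA truncation used here, the dielectric constant of 78.4, and all function names are assumptions for illustration; the published model additionally rescales the phosphate charge through counterion condensation and the tightly bound ion treatment, which is not reproduced here.

```python
import math

COULOMB = 332.0637        # e^2/(4*pi*eps0) in kcal*Angstrom/(mol*e^2)
KB = 0.0019872041         # Boltzmann constant in kcal/(mol*K)
AVOGADRO = 6.02214076e23  # 1/mol


def bjerrum_length(temperature=298.15, dielectric=78.4):
    """Bjerrum length in Angstrom."""
    return COULOMB / (dielectric * KB * temperature)


def debye_length(ionic_strength, temperature=298.15, dielectric=78.4):
    """Debye length in Angstrom for an ionic strength given in mol/L."""
    density = ionic_strength * AVOGADRO * 1e-27  # particles per cubic Angstrom
    kappa_sq = 8.0 * math.pi * bjerrum_length(temperature, dielectric) * density
    return 1.0 / math.sqrt(kappa_sq)


def debye_huckel_energy(r, ionic_strength, q_i=-1.0, q_j=-1.0,
                        temperature=298.15, dielectric=78.4):
    """Screened Coulomb energy (kcal/mol) between charges q_i and q_j at r (Angstrom)."""
    screening = math.exp(-r / debye_length(ionic_strength, temperature, dielectric))
    return COULOMB * q_i * q_j / (dielectric * r) * screening


def wca_repulsion(r, sigma, epsilon):
    """Lennard-Jones potential truncated and shifted at its minimum (repulsive only)."""
    cutoff = 2.0 ** (1.0 / 6.0) * sigma
    if r >= cutoff:
        return 0.0
    x6 = (sigma / r) ** 6
    return 4.0 * epsilon * (x6 * x6 - x6) + epsilon
```

For example, `debye_length(0.1)` gives a screening length of roughly 9.6 Å, so phosphate–phosphate repulsion at 0.1 M monovalent salt is already strongly damped beyond about 1 nm.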

Using the effective Monte Carlo simulated annealing algorithm, the model successfully folded 20 dsDNAs (≤52 nt) and 20 ssDNAs (≤74 nt) into the corresponding native-like structures based on their sequences, with an overall mean RMSD of 3.4 Å from the experimental structures (Figure 4c).
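The sampling strategy named here, Monte Carlo simulated annealing, can be sketched generically as below. This is not the authors' implementation: the geometric cooling schedule, the step counts, and the callback-based interface (`energy`, `propose`) are hypothetical, and temperatures are expressed in energy units (kB = 1).

```python
import math
import random


def simulated_annealing(energy, propose, state, t_start=10.0, t_end=0.1,
                        n_temps=20, steps_per_temp=200, seed=0):
    """Metropolis Monte Carlo with geometric cooling from t_start to t_end.

    energy(state) -> float; propose(state, rng) -> new trial state.
    Returns the lowest-energy state visited and its energy.
    """
    rng = random.Random(seed)
    current, e_current = state, energy(state)
    best, e_best = current, e_current
    for k in range(n_temps):
        temp = t_start * (t_end / t_start) ** (k / (n_temps - 1))
        for _ in range(steps_per_temp):
            trial = propose(current, rng)
            e_trial = energy(trial)
            delta = e_trial - e_current
            # Metropolis rule: always accept downhill, sometimes accept uphill
            if delta <= 0 or rng.random() < math.exp(-delta / temp):
                current, e_current = trial, e_trial
                if e_current < e_best:
                    best, e_best = current, e_current
    return best, e_best
```

In a folding application the state would be the set of CG bead coordinates and `propose` would apply local moves such as bead translations or pivot rotations; the high initial temperature lets the chain escape misfolded traps before the system is cooled toward the target temperature.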

Furthermore, the model quantitatively predicted the thermodynamic stability of 27 dsDNAs (including bulge loops and internal loops) and 24 ssDNAs (including a double hairpin and a pseudoknot), with a mean deviation of predicted melting temperatures from the corresponding experimental data of only ~2.0 °C (Figure 5). For example, the two predicted transition temperatures (~48.8 °C and ~72.0 °C) for a DNA pseudoknot at 0.1 M [Na⁺] closely match the experimental data (~52.6 °C and ~70.7 °C), as shown in Figure 5c. In addition, the model reproduced the stability of ssDNAs/dsDNAs under extensive monovalent or mixed monovalent/divalent ion conditions, with the predicted melting temperatures consistent with the available experiments (Figure 5).
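A melting temperature can be read off from folded fractions sampled at several temperatures, as sketched below. The example assumes a simple two-state, unimolecular transition (for instance a single hairpin); it is not the procedure of the reviewed model, and multi-step transitions such as the pseudoknot above would need one such analysis per transition. The function names and the thermodynamic values used in the usage note are made up for illustration.

```python
import numpy as np

R_GAS = 0.0019872041  # gas constant in kcal/(mol*K)


def fraction_folded(temp_k, delta_h, delta_s):
    """Two-state folded fraction for folding enthalpy/entropy (kcal/mol, kcal/(mol*K))."""
    temp_k = np.asarray(temp_k, dtype=float)
    delta_g = delta_h - temp_k * delta_s
    return 1.0 / (1.0 + np.exp(delta_g / (R_GAS * temp_k)))


def melting_temperature(temps_k, fractions):
    """Temperature at which the folded fraction crosses 0.5 (linear interpolation).

    Assumes the folded fraction decreases monotonically with temperature.
    """
    temps = np.asarray(temps_k, dtype=float)
    frac = np.asarray(fractions, dtype=float)
    order = np.argsort(frac)  # np.interp needs increasing x values
    return float(np.interp(0.5, frac[order], temps[order]))
```

With hypothetical values `delta_h = -40.0` and `delta_s = -0.12`, the two-state expression gives a midpoint at `delta_h / delta_s`, and `melting_temperature` applied to fractions sampled every 2 K should recover approximately the same value.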

Despite recent advancements, the present ab initio models have limitations in predicting large DNAs with complex structures, indicating the need for further improvement in the energy function and sampling methods [145,146,147,148].

3.4. Discussion and Comparison of These CG Models

As shown in Table 1 and Figure 3, the degree of coarse-graining differs among the CG models. For instance, oxDNA uses two beads, TIS uses three beads, and HiRE uses six or seven beads [104,115,118]. Generally, more elaborate models can capture more detailed interactions, but their use in modeling the structures of large DNAs may be limited. For example, although the two-, three-, and four-body hydrogen bond interactions can be defined in the HiRE model, it is only applicable to small DNAs (<100 nt) [118]. Conversely, the oxDNA model can be used to simulate DNA nanostructures (>1000 nt) [107]. Predefined secondary structure information (i.e., Gō-like) is very important for CG models to simulate large DNAs [104,105,106,107,110,111,112,113].

Furthermore, these models were designed for different purposes, as outlined in Table 1. Some are suitable for CG MD simulations to capture DNA structure dynamics at large time and length scales (e.g., Martini and MADna) [102,116,126,127,128]. These models can reproduce details of DNA structures (e.g., helix parameters and groove geometry) and structural elastic features (e.g., persistence length and twist–stretch coupling) in most cases [102,116]. However, they generally require native/near-native 3D structures as inputs. Some other CG models were developed to simulate DNA folding, such as oxDNA, 3SPN, and TIS, which can be used to predict the thermodynamic or kinetic properties (such as melting temperatures or folding rates) of DNA [104,112,115]. In order to ensure that the DNA can fold into the correct final structure, additional secondary structure constraints are usually necessary in these models. Moreover, some ab initio CG models (such as HiRE, NARES-2P, and our model) have also been proposed to simulate 3D structure folding for DNA based only on its sequence [77,108,117,118,121,122,123]. Notably, these models can be used to predict 3D DNA structures, as well as their corresponding thermodynamic stability. However, they are only applicable to small DNAs (<100 nt).

4. DNA Structure Assembly Method for 3D Structure Construction

Since all-atom MD simulations for DNAs generally require known 3D structures as input, and DNA nanostructures are generally assembled by simple fragments (e.g., double helices), it is crucial to quickly build DNA 3D structures from sequences, especially for large DNAs. In this section, we will review several DNA structure assembly methods based on DNA secondary structures.

Due to considerable progress in RNA 3D structure prediction [37,38,39,40], two indirect ssDNA 3D structure prediction methods have been proposed with the aid of RNA models [129,130]. For example, in the pipeline presented by Saiz et al., a secondary structure was first predicted using Mfold [31] based on the given sequence. Subsequently, a corresponding 3D RNA structure was constructed using RNA structure prediction methods (such as Assemble and RNAComposer). The 3D RNA structure was then converted into a DNA structure by replacing the nucleotide U with T, and the resulting 3D structures were refined through energy minimization, as shown in Figure 6a. Although these methods were only tested on several small ssDNA hairpins (7–27 nt) and their accuracy was not very high (the RMSDs between predicted and experimental structures were larger than 4.0 Å), they offered a new framework for investigating related ssDNA nanotechnology.
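The RNA-to-DNA conversion step of such a pipeline can be sketched at the level of a PDB text file. The sketch below only renames residues (A, C, G, U to DA, DC, DG, DT) and drops the 2'-hydroxyl atoms; it assumes fixed-column PDB ATOM records and leaves the thymine methyl group and hydrogens to be rebuilt during the subsequent energy minimization. The function name and atom-name set are hypothetical and are not taken from the cited pipelines.

```python
RNA_TO_DNA = {"A": "DA", "C": "DC", "G": "DG", "U": "DT"}
DROPPED_ATOMS = {"O2'", "HO2'", "O2*"}  # 2'-hydroxyl atoms absent in DNA


def rna_pdb_to_dna(pdb_text):
    """Rename RNA residues to DNA names and remove 2'-OH atoms in PDB text."""
    converted = []
    for line in pdb_text.splitlines():
        if line.startswith(("ATOM", "HETATM")):
            atom_name = line[12:16].strip()  # PDB columns 13-16
            res_name = line[17:20].strip()   # PDB columns 18-20
            if res_name in RNA_TO_DNA:
                if atom_name in DROPPED_ATOMS:
                    continue  # skip the 2'-hydroxyl
                line = line[:17] + RNA_TO_DNA[res_name].rjust(3) + line[20:]
        converted.append(line)
    return "\n".join(converted)
```

The output is only a starting model: the missing methyl group on each former uracil and any strained sugar geometry still have to be repaired by a molecular modeling package before the structure is used.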

Recently, Xiao et al. proposed a direct template-based method, 3dDNA, which is an extension of their previous 3dRNA. This method aims to construct DNA 3D structures by assembling 3D templates of the smallest secondary elements (SSEs) [44], as illustrated in Figure 6b. First, DNA is decomposed into SSEs based on the given secondary structural information. Second, the corresponding 3D template for each SSE is retrieved from the well-defined DNA fragment library. Subsequently, the selected template of each SSE is assembled with its parent SSE by superposing them using the Kabsch algorithm, with reference to the two common base pairs. The resulting assembly models are further refined by energy minimization with the AMBER force field to repair the chain connectivity of the assembled structures. To evaluate its performance, 3dDNA was benchmarked on three test sets with different numbers of chains. The results showed that 3dDNA can predict DNA 3D structures with a mean RMSD of approximately 2.36 Å for structures with one or two chains, and less than 4 Å for structures with three or more chains. These results indicate a significant improvement compared to the indirect methods [44,129,130].
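The superposition step can be illustrated with a compact NumPy version of the Kabsch algorithm. This is a generic sketch rather than code from 3dDNA: in an assembly setting the two point sets would be matching atoms of the shared base pairs, and the function names are illustrative.

```python
import numpy as np


def kabsch(mobile, target):
    """Rotation R and translation t minimizing sum ||R @ mobile_i + t - target_i||^2."""
    p = np.asarray(mobile, dtype=float)
    q = np.asarray(target, dtype=float)
    p_centroid, q_centroid = p.mean(axis=0), q.mean(axis=0)
    covariance = (p - p_centroid).T @ (q - q_centroid)
    u, _, vt = np.linalg.svd(covariance)
    # Flip the last axis if needed so that R is a proper rotation (det = +1)
    d = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0
    rotation = vt.T @ np.diag([1.0, 1.0, d]) @ u.T
    translation = q_centroid - rotation @ p_centroid
    return rotation, translation


def superpose(coords, rotation, translation):
    """Apply the rigid transform to an (N, 3) coordinate array."""
    return np.asarray(coords, dtype=float) @ rotation.T + translation


def rmsd(a, b):
    """Root-mean-square deviation between two matched (N, 3) arrays."""
    diff = np.asarray(a, dtype=float) - np.asarray(b, dtype=float)
    return float(np.sqrt((diff**2).sum(axis=1).mean()))
```

The transform is computed from the shared atoms only and then applied to the whole fragment, so that a template is carried into the frame of its parent element; the same `rmsd` function, applied after superposition, gives the kind of accuracy measure quoted above.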

Since these fragment assembly methods heavily rely on the known secondary structure, which can be challenging to determine or predict accurately, especially for large complex DNAs, achieving accurate predictions of DNA 3D structures still seems to be a long-term challenge.

5. Discussion

The rapid advancement of MD simulations and DNA modeling has led to extensive insights into DNA structures at both macroscopic and microscopic scales [32,33,34,50,51,90]. However, the increasing utilization of DNA-based bioengineering and nanotechnology, as well as the discovery of non-B DNA structures with unique biological functions, has further intensified the requirement for DNA modeling. Here, we reviewed the recent advancements in DNA structure dynamics and folding, including MD simulations, CG modeling, and fragment assembly. Our purpose was to enhance DNA structure-based applications and further promote the development of DNA modeling.

In addition to the methods reviewed above, many computational models specially designed for DNA nanostructure construction or simulation have also been developed (e.g., MrDNA, DAEDALUS, and Adenita) [149,150,151]. Due to space limitations, we cannot delve into all of them in detail. Furthermore, the field of biology has seen significant advancements in recent years due to the application of machine learning techniques [152,153,154,155,156]. For example, 3D structure prediction methods such as AlphaFold2 [157] and RoseTTAFold [158] have gained popularity due to their ability to accurately predict protein structures. These deep learning methods could also improve the accuracy of DNA simulations by capturing more complex interactions between atoms whenever possible. However, since deep learning models require large datasets for training, the limited number of known DNA structures challenges the application of these methods in DNA modeling. With the development of advanced hardware, highly accurate force fields, large amounts of experimental data, and refined computer modeling techniques, DNA modeling has the potential to not only explain a large number of experimental results [69,86,87], but also to serve as a guiding tool for new and exciting discoveries [159,160].

Author Contributions

Conceptualization: Y.-Z.S., Z.-C.M. and Y.-L.T.; data curation: Z.-C.M., B.-G.Z. and Y.-L.T.; formal analysis: Y.-Z.S. and Z.-C.M.; funding acquisition: B.-G.Z. and Y.-Z.S.; investigation: Z.-C.M., Y.-Z.S. and J.L.; methodology: Z.-C.M., Y.-Z.S. and B.-G.Z.; supervision: Y.-Z.S. and B.-G.Z.; validation: Z.-C.M., Y.-Z.S. and Y.-L.T.; writing—original draft: Z.-C.M., Y.-L.T. and Y.-Z.S.; writing—review and editing: Z.-C.M., Y.-Z.S. and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by grants from the National Natural Science Foundation of China (11971367 to B.-G.Z., 12205223 to Y.-L.T. and 11605125 to Y.-Z.S.) and the Department of Education of Hubei Province (Q20221705 to Y.-L.T.).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created.

Conflicts of Interest

The authors declare no conflict of interest.

Sample Availability

Not applicable.

References

Ferry, G. The structure of DNA. Nature 2019, 575, 35–36.
Neidle, S. Beyond the double helix: DNA structural diversity and the PDB. J. Biol. Chem. 2021, 296, 100553.
Nieuwland, C.; Hamlin, T.A.; Fonseca Guerra, C.; Barone, G.; Bickelhaupt, F.M. B-DNA structure and stability: The role of nucleotide composition and order. ChemistryOpen 2022, 11, e202100231.
Guiblet, W.M.; Cremona, M.A.; Harris, R.S.; Chen, D.; Eckert, K.A.; Chiaromonte, F.; Huang, Y.-F.; Makova, K.D. Non-B DNA: A major contributor to small- and large-scale variation in nucleotide substitution frequencies across the genome. Nucleic Acids Res. 2021, 49, 1497–1516.
Bansal, A.; Kaushik, S.; Kukreti, S. Non-canonical DNA structures: Diversity and disease association. Front. Genet. 2022, 13, 959258.
Tateishi-Karimata, H.; Sugimoto, N. Roles of non-canonical structures of nucleic acids in cancer and neurodegenerative diseases. Nucleic Acids Res. 2021, 49, 7839–7855.
Belotserkovskii, B.P.; De Silva, E.; Tornaletti, S.; Wang, G.; Vasquez, K.M.; Hanawalt, P.C. A Triplex-forming Sequence from the Human c-MYC Promoter Interferes with DNA Transcription. J. Biol. Chem. 2007, 282, 32433–32441.
Robinson, J.; Raguseo, F.; Nuccio, S.P.; Liano, D.; Di Antonio, M. DNA G-quadruplex structures: More than simple roadblocks to transcription? Nucleic Acids Res. 2021, 49, 8419–8431.
Varshney, D.; Spiegel, J.; Zyner, K.; Tannahill, D.; Balasubramanian, S. The regulation and functions of DNA and RNA G-quadruplexes. Nat. Rev. Mol. Cell Biol. 2020, 21, 459–474.
Zok, T.; Kraszewska, N.; Miskiewicz, J.; Pielacinska, P.; Zurkowski, M.; Szachniuk, M. ONQUADRO: A database of experimentally determined quadruplex structures. Nucleic Acids Res. 2022, 50, D253–D258.
Seeman, N.; Sleiman, H. DNA nanotechnology. Nat. Rev. Mater. 2017, 3, 17068.
Hu, Q.; Li, H.; Wang, L.; Gu, H.; Fan, C. DNA Nanotechnology-Enabled Drug Delivery Systems. Chem. Rev. 2019, 119, 6459–6506.
Ma, W.; Zhan, Y.; Mao, C.; Xie, X.; Lin, Y. The biological applications of DNA nanomaterials: Current challenges and future directions. Signal Transduct. Target. Ther. 2021, 6, 351.
Butovskaya, E.; Heddi, B.; Bakalar, B.; Richter, S.N.; Phan, A.T. Major G-Quadruplex Form of HIV-1 LTR Reveals a (3 + 1) Folding Topology Containing a Stem-Loop. J. Am. Chem. Soc. 2018, 140, 13654–13662.
Bryant, Z.; Oberstrass, F.C.; Basu, A. Recent developments in single-molecule DNA mechanics. Curr. Opin. Struct. Biol. 2012, 22, 304–312.
Kriegel, F.; Ermann, N.; Lipfert, J. Probing the mechanical properties, conformational changes, and interactions of nucleic acids with magnetic tweezers. J. Struct. Biol. 2017, 197, 26–36.
Haynes, P.J.; Main, K.H.S.; Akpinar, B.; Pyne, A.L.B. Atomic Force Microscopy of DNA and DNA-Protein Interactions. Methods Mol. 2022, 2476, 43–62.
Di, W.; Gao, X.; Huang, W.; Sun, Y.; Lei, H.; Liu, Y.; Li, W.; Li, Y.; Wang, X.; Qin, M.; et al. Direct Measurement of Length Scale Dependence of the Hydrophobic Free Energy of a Single Collapsed Polymer Nanosphere. Phys. Rev. Lett. 2019, 122, 047801.
Minhas, V.; Sun, T.; Mirzoev, A.; Korolev, N.; Lyubartsev, A.P.; Nordenskiöld, L. Modeling DNA Flexibility: Comparison of Force Fields from Atomistic to Multiscale Levels. J. Phys. Chem. B 2020, 124, 38–49.
Jones, M.S.; Ashwood, B.; Tokmakoff, A.; Ferguson, A.L. Determining Sequence-Dependent DNA Oligonucleotide Hybridization and Dehybridization Mechanisms Using Coarse-Grained Molecular Simulation, Markov State Models, and Infrared Spectroscopy. J. Am. Chem. Soc. 2021, 143, 17395–17411.
He, J.; Wang, J.; Tao, H.; Xiao, Y.; Huang, S.-Y. HNADOCK: A nucleic acid docking server for modeling RNA/DNA–RNA/DNA 3D complex structures. Nucleic Acids Res. 2019, 47, W35–W42.
Zhang, Y.; Zhou, H.; Ou-Yang, Z.-C. Stretching Single-Stranded DNA: Interplay of Electrostatic, Base-Pairing, and Base-Pair Stacking Interactions. Biophys. J. 2001, 81, 1133–1143.
Wang, J.; Wolf, R.M.; Caldwell, J.W.; Kollman, P.A.; Case, D.A. Development and testing of a general amber force field. J. Comput. Chem. 2005, 26, 114.
Zhang, Y.; Zhang, Y.; McCready, M.J.; Maginn, E.J. Evaluation and refinement of the general AMBER force field for nineteen pure organic electrolyte solvents. J. Chem. Eng. 2018, 3488–3502.
Hart, K.; Foloppe, N.; Baker, C.M.; Denning, E.J.; Nilsson, L.; MacKerell, A.D., Jr. Optimization of the CHARMM additive force field for DNA: Improved treatment of the BI/BII conformational equilibrium. J. Chem. Theory Comput. 2012, 8, 348–362.
Jones, D.; Allen, J.E.; Yang, Y.; Bennett, W.F.D.; Gokhale, M.; Moshiri, N.; Rosing, T.S. Accelerators for Classical Molecular Dynamics Simulations of Biomolecules. J. Chem. Theory Comput. 2022, 18, 4047–4069.
Nomidis, S.K.; Kriegel, F.; Vanderlinden, W.; Lipfert, J.; Carlon, E. Twist-Bend Coupling and the Torsional Response of Double-Stranded DNA. Phys. Rev. Lett. 2017, 118, 217801.
Marko, J.F.; Siggia, E.D. Fluctuations and Supercoiling of DNA. Science 1994, 265, 506–508.
Toan, N.M.; Thirumalai, D. On the origin of the unusual behavior in the stretching of single-stranded DNA. J. Chem. Phys. 2012, 136, 235103.
SantaLucia, J., Jr.; Hicks, D. The Thermodynamics of DNA Structural Motifs. Annu. Rev. Biophys. Biomol. Struct. 2004, 33, 415–440.
Zuker, M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 2003, 31, 3406.
Ingólfsson, H.I.; Lopez, C.A.; Uusitalo, J.J.; de Jong, D.H.; Gopal, S.M.; Periole, X.; Marrink, S.J. The power of coarse graining in biomolecular simulations. WIREs Comput. Mol. Sci. 2014, 4, 225–248.
Dans, P.D.; Walther, J.; Gómez, H.; Orozco, M. Multiscale simulation of DNA. Curr. Opin. Struct. Biol. 2016, 37, 29–45.
Sun, T.; Minhas, V.; Korolev, N.; Mirzoev, A.; Lyubartsev, A.P.; Nordenskiöld, L. Bottom-Up Coarse-Grained Modeling of DNA. Front. Mol. Biosci. 2021, 8, 645527.
Reshetnikov, R.; Stolyarova, A.; Zalevsky, A.; Panteleev, D.Y.; Pavlova, G.V.; Klinov, D.V.; Golovin, A.V.; Protopopova, A.D. A coarse-grained model for DNA origami. Nucleic Acids Res. 2018, 46, 1102–1112.
Walther, J.; Dans, P.D.; Balaceanu, A.; Hospital, A.; Bayarri, G.; Orozco, M. A multi-modal coarse grained model of DNA flexibility mappable to the atomistic level. Nucleic Acids Res. 2020, 48, e29.
Krokhotin, A.; Houlihan, K.; Dokholyan, N.V. iFoldRNA v2: Folding RNA with constraints. Bioinformatics 2015, 31, 2891–2893.
Jossinet, F.; Ludwig, T.E.; Westhof, E. Assemble: An interactive graphical tool to analyze and build RNA architectures at the 2D and 3D levels. Bioinformatics 2010, 26, 2057–2059.
Popenda, M.; Szachniuk, M.; Antczak, M.; Purzycka, K.J.; Lukasiak, P.; Bartol, N.; Blazewicz, J.; Adamiak, R.W. Automated 3D structure composition for large RNAs. Nucleic Acids Res. 2012, 40, e112.
Li, J.; Zhang, S.; Zhang, D.; Chen, S.-J. Vfold-Pipeline: A web server for RNA 3D structure prediction from sequences. Bioinformatics 2022, 38, 4042–4043.
Zhao, Y.; Huang, Y.; Gong, Z.; Wang, Y.; Man, J.; Xiao, Y. Automated and fast building of three-dimensional RNA structures. Sci. Rep. 2012, 2, 734.
Wang, J.; Mao, K.; Zhao, Y.; Zeng, C.; Xiang, J.; Zhang, Y.; Xiao, Y. Optimization of RNA 3D structure prediction using evolutionary restraints of nucleotide–nucleotide interactions from direct coupling analysis. Nucleic Acids Res. 2017, 45, 6299–6309.
Wang, J.; Wang, J.; Huang, Y.; Xiao, Y. 3dRNA v2.0: An Updated Web Server for RNA 3D Structure Prediction. Int. J. Mol. Sci. 2019, 20, 4116.
Zhang, Y.; Xiong, Y.; Xiao, Y. 3dDNA: A Computational Method of Building DNA 3D Structures. Molecules 2022, 27, 5936.
Scott, W.R.P.; Hünenberger, P.H.; Tironi, I.G.; Mark, A.E.; Billeter, S.R.; Fennen, J.; Torda, A.E.; Huber, T.; Kruger, P.; Van Gunsteren, W.F. The GROMOS biomolecular simulation program package. J. Phys. Chem. 1999, 103, 3596–3607.
Robertson, M.J.; Tirado-Rives, J.; Jorgensen, W.L. Improved Peptide and Protein Torsional Energetics with the OPLS-AA Force Field. J. Chem. Theory Comput. 2015, 11, 3499–3509.
Ivani, I.; Dans, P.D.; Noy, A.; Pérez, A.; Faustino, I.; Hospital, A.; Walther, J.; Andrio, P.; Goñi, R.; Balaceanu, A.; et al. Parmbsc1: A refined force field for DNA simulations. Nat. Methods 2016, 13, 55–58.
Chen, A.A.; García, A.E. High-resolution reversible folding of hyperstable RNA tetraloops using molecular dynamics simulations. Proc. Natl. Acad. Sci. USA 2013, 110, 16820–16825.
Gallardo, A.; Bogart, B.M.; Dutagaci, B. Protein–Nucleic Acid Interactions for RNA Polymerase II Elongation Factors by Molecular Dynamics Simulations. J. Chem. Inf. Model. 2022, 62, 3079–3089.
Salsbury, A.M.; Lemkul, J.A. Recent developments in empirical atomistic force fields for nucleic acids and applications to studies of folding and dynamics. Curr. Opin. Struct. Biol. 2021, 67, 9–17.
Kameda, T.; Awazu, A.; Togashi, Y. Molecular dynamics analysis of biomolecular systems including nucleic acids. Biophys. Physicobiology 2022, 19, e190027.
Ghoshdastidar, D.; Bansal, M. Dynamics of physiologically relevant noncanonical DNA structures: An overview from experimental and theoretical studies. Brief. Funct. Genom. 2018, 18, 192–204.
Frank-Kamenetskii, M.D.; Prakash, S. Fluctuations in the DNA double helix: A critical review. Phys. Life Rev. 2014, 11, 153–170.
Zgarbová, M.; Šponer, J.; Otyepka, M.; Cheatham, T.E., III; Galindo-Murillo, R.; Jurečka, P. Refinement of the Sugar–Phosphate Backbone Torsion Beta for AMBER Force Fields Improves the Description of Z- and B-DNA. J. Chem. Theory Comput. 2015, 11, 5723–5736.
Strelnikov, I.A.; Kovaleva, N.A.; Klinov, A.P.; Zubova, E.A. C-B-A test of DNA force fields. ACS Omega 2023, 8, 10253–10265.
Panczyk, T.; Wojton, P.; Wolski, P. Mechanism of unfolding and relative stabilities of G-quadruplex and I-motif noncanonical DNA structures analyzed in biased molecular dynamics simulations. Biophys. Chem. 2019, 250, 106173.
Panczyk, T.; Wojton, P.; Wolski, P. Molecular Dynamics Study of the Interaction of Carbon Nanotubes with Telomeric DNA Fragment Containing Noncanonical G-Quadruplex and i-Motif Forms. Int. J. Mol. Sci. 2020, 21, 1925.
Liu, T.; Yu, T.; Zhang, S.; Wang, Y.; Zhang, W. Thermodynamic and kinetic properties of a single base pair in A-DNA and B-DNA. Phys. Rev. E 2021, 103, 042409.
Xu, S.; Zhan, J.; Man, B.; Jiang, S.; Yue, W.; Gao, S.; Guo, C.; Liu, H.; Li, Z.; Wang, J.; et al. Real-time reliable determination of binding kinetics of DNA hybridization using a multi-channel graphene biosensor. Nat. Commun. 2017, 8, 14902.
Galindo-Murillo, R.; Roe, D.R.; Cheatham, T.E., III. On the absence of intrahelical DNA dynamics on the μs to ms timescale. Nat. Commun. 2014, 5, 5152.
Galindo-Murillo, R.; Roe, D.R.; Cheatham, T.E. Convergence and reproducibility in molecular dynamics simulations of the DNA duplex d(GCACGAACGAACGAACGC). Biochim. Biophys. Acta 2015, 1850, 1041–1058. [Google Scholar] [CrossRef]
Nikolova, E.N.; Kim, E.; Wise, A.A.; O’brien, P.J.; Andricioaei, I.; Al-Hashimi, H.M. Transient Hoogsteen base pairs in canonical duplex DNA. Nature 2011, 470, 498–502. [Google Scholar] [CrossRef]
Yang, C.; Kim, E.; Pak, Y. Free energy landscape and transition pathways from Watson–Crick to Hoogsteen base pairing in free duplex DNA. Nucleic Acids Res. 2015, 43, 7769–7778. [Google Scholar] [CrossRef]
Chakraborty, D.; Wales, D.J. Probing helical transitions in a DNA duplex. Phys. Chem. Chem. Phys. 2017, 19, 878. [Google Scholar] [CrossRef] [PubMed]
Marin-Gonzalez, A.; Vilhena, J.G.; Perez, R.; Moreno-Herrero, F. A molecular view of DNA flexibility. Q. Rev. Biophys. 2021, 54, e8. [Google Scholar] [CrossRef]
Ghoshdastidar, D.; Bansal, M. Flexibility of flanking DNA is a key determinant of transcription factor affinity for the core motif. Biophys. J. 2022, 121, 3987–4000. [Google Scholar] [CrossRef] [PubMed]
Qiang, X.-W.; Dong, H.-L.; Xiong, K.-X.; Zhang, W.; Tan, Z.-J. Understanding sequence effect in DNA bending elasticity by molecular dynamic simulations. Commun. Theor. Phys. 2021, 73, 075601. [Google Scholar] [CrossRef]
Liebl, K.; Dršata, T.; Lankas, F.; Lipfert, J.; Zacharias, M. Explaining the striking difference in twist-stretch coupling between DNA and RNA: A comparative molecular dynamics analysis. Nucleic Acids Res. 2015, 43, 10143–10156. [Google Scholar] [CrossRef] [PubMed]
Chen, C.; Pettitt, B.M. The Effects of Flexibility on dsDNA-dsDNA Interactions. Life 2022, 12, 699. [Google Scholar] [CrossRef]
Wu, Y.-Y.; Bao, L.; Zhang, X.; Tan, Z.-J. Flexibility of short DNA helices with finite-length effect: From base pairs to tens of base pairs. J. Chem. Phys. 2015, 142, 125103. [Google Scholar] [CrossRef]
Marin-Gonzalez, A.; Vilhena, J.G.; Moreno-Herrero, F.; Perez, R. DNA Crookedness Regulates DNA Mechanical Properties at Short Length Scales. Phys. Rev. Lett. 2019, 122, 048102. [Google Scholar] [CrossRef]
Liebl, K.; Zacharias, M. Accurate modeling of DNA conformational flexibility by a multivariate Ising model. Proc. Natl. Acad. Sci. USA 2021, 118, e2021263118. [Google Scholar] [CrossRef] [PubMed]
Marin-Gonzalez, A.; Vilhena, J.G.; Perez, R.; Moreno-Herrero, F. Understanding the mechanical response of double-stranded DNA and RNA under constant stretching forces using all-atom molecular dynamics. Proc. Natl. Acad. Sci. USA 2017, 114, 7049–7054. [Google Scholar] [CrossRef] [PubMed]
Bao, L.; Zhang, X.; Shi, Y.-Z.; Wu, Y.-Y.; Tan, Z.-J. Understanding the Relative Flexibility of RNA and DNA Duplexes: Stretching and Twist-Stretch Coupling. Biophys. J. 2017, 112, 1094–1104. [Google Scholar] [CrossRef]
Bouchal, T.; Durník, I.; Kulhánek, P. Bending of Canonical and G/T Mismatched DNAs. J. Chem. Inf. Model. 2021, 61, 6000–6011. [Google Scholar] [CrossRef]
Sharma, M.; Predeus, A.V.; Mukherjee, S.; Feig, M. DNA Bending Propensity in the Presence of Base Mismatches: Implications for DNA Repair. J. Phys. Chem. B 2013, 117, 6194–6205. [Google Scholar] [CrossRef]
Maffeo, C.; Ngo, T.T.M.; Ha, T.; Aksimentiev, A. A Coarse-Grained Model of Unstructured Single-Stranded DNA Derived from Atomistic Simulation and Single-Molecule Experiment. J. Chem. Theory Comput. 2014, 10, 2891–2896. [Google Scholar] [CrossRef]
Rossetti, G.; Dans, P.D.; Gomez-Pinto, I.; Ivani, I.; Gonzalez, C.; Orozco, M. The structural impact of DNA mismatches. Nucleic Acids Res. 2015, 43, 4309–4321. [Google Scholar] [CrossRef]
Bouchal, T.; Durník, I.; Illík, V.; Réblová, K.; Kulhánek, P. Importance of base-pair opening for mismatch recognition. Nucleic Acids Res. 2020, 48, 11322–11334. [Google Scholar] [CrossRef]
Lavery, R.; Maddocks, J.H.; Pasi, M.; Zakrzewska, K. Analyzing ion distributions around DNA. Nucleic Acids Res. 2014, 42, 8138–8149. [Google Scholar] [CrossRef] [PubMed]
Tolokh, I.S.; Thomas, D.G.; Onufriev, A.V. Explicit ions/implicit water generalized Born model for nucleic acids. J. Chem. Phys. 2018, 148, 195101. [Google Scholar] [CrossRef]
Sun, L.Z.; Qian, J.L.; Cai, P.; Xu, X. Mutual effects between single-stranded DNA conformation and Na⁺-Mg²⁺ ion competition in mixed salt solutions. Phys. Chem. Chem. Phys. 2022, 24, 20867–20881. [Google Scholar] [CrossRef]
Xue, J.; Wang, P.; Li, X.; Tan, R.; Zong, W. Transformation characteristics of A-DNA in salt solution revealed through molecular dynamics simulations. Biophys. Chem. 2022, 288, 106845. [Google Scholar] [CrossRef] [PubMed]
Sarkar, S.; Singh, P.C. The combined action of cations and anions of ionic liquids modulates the formation and stability of G-quadruplex DNA. Phys. Chem. Chem. Phys. 2021, 23, 24497–24504. [Google Scholar] [CrossRef] [PubMed]
Pasi, M.; Maddocks, J.H.; Lavery, R. Analyzing ion distributions around DNA: Sequence-dependence of potassium ion distributions from microsecond molecular dynamics. Nucleic Acids Res. 2015, 43, 2412–2423. [Google Scholar] [CrossRef] [PubMed]
Fu, H.; Zhang, C.; Qiang, X.-W.; Yang, Y.-J.; Dai, L.; Tan, Z.-J.; Zhang, X.-H. Opposite Effects of High-Valent Cations on the Elasticities of DNA and RNA Duplexes Revealed by Magnetic Tweezers. Phys. Rev. Lett. 2020, 124, 058101. [Google Scholar] [CrossRef] [PubMed]
Cruz-Leon, S.; Vanderlinden, W.; Muller, P.; Forster, T.; Staudt, G.; Lin, Y.Y.; Lipfert, J.; Schwierz, N. Twisting DNA by salt. Nucleic Acids Res. 2022, 50, 5726–5738. [Google Scholar] [CrossRef]
Long, M.P.; Alland, S.; Martin, M.E.; Isborn, C.M. Molecular dynamics simulations of alkaline earth metal ions binding to DNA reveal ion size and hydration effects. Phys. Chem. Chem. Phys. 2020, 22, 5584–5596. [Google Scholar] [CrossRef]
Xi, K.; Wang, F.-H.; Xiong, G.; Zhang, Z.-L.; Tan, Z.-J. Competitive Binding of Mg²⁺ and Na⁺ Ions to Nucleic Acids: From Helices to Tertiary Structures. Biophys. J. 2018, 114, 1776–1790. [Google Scholar] [CrossRef]
Krüger, A.; Zimbres, F.M.; Kronenberger, T.; Wrenger, C. Molecular Modeling Applied to Nucleic Acid-Based Molecule Development. Biomolecules 2018, 8, 83. [Google Scholar] [CrossRef] [PubMed]
Oweida, T.J.; Kim, H.S.; Donald, J.M.; Singh, A.; Yingling, Y.G. Assessment of AMBER Force Fields for Simulations of ssDNA. J. Chem. Theory Comput. 2021, 17, 1208–1217. [Google Scholar] [CrossRef]
Dans, P.D.; Ivani, I.; Hospital, A.; Portella, G.; González, C.; Orozco, M. How accurate are accurate force-fields for B-DNA? Nucleic Acids Res. 2017, 45, 4217–4230. [Google Scholar] [CrossRef]
Cruz-León, S.; Grotz, K.K.; Schwierz, N. Extended magnesium and calcium force field parameters for accurate ion–nucleic acid interactions in biomolecular simulations. J. Chem. Phys. 2021, 154, 171102. [Google Scholar] [CrossRef]
Castelli, M.; Doria, F.; Freccero, M.; Colombo, G.; Moroni, E. Studying the Dynamics of a Complex G-Quadruplex System: Insights into the Comparison of MD and NMR Data. J. Chem. Theory Comput. 2022, 18, 4515–4528. [Google Scholar] [CrossRef]
Havrila, M.; Stadlbauer, P.; Islam, B.; Otyepka, M.; Šponer, J. Effect of Monovalent Ion Parameters on Molecular Dynamics Simulations of G-Quadruplexes. J. Chem. Theory Comput. 2017, 13, 3911–3926. [Google Scholar] [CrossRef]
Lazim, R.; Suh, D.; Choi, S. Advances in Molecular Dynamics Simulations and Enhanced Sampling Methods for the Study of Protein Systems. Int. J. Mol. Sci. 2020, 21, 6339. [Google Scholar] [CrossRef] [PubMed]
van Gunsteren, W.F.; Daura, X.; Hansen, N.; Mark, A.E.; Oostenbrink, C.; Riniker, S.; Smith, L.J. Validation of Molecular Simulation: An Overview of Issues. Angew. Chem. Int. Ed. 2018, 57, 884–902. [Google Scholar] [CrossRef] [PubMed]
Betz, R.M.; Dror, R.O. How Effectively Can Adaptive Sampling Methods Capture Spontaneous Ligand Binding? J. Chem. Theory Comput. 2019, 15, 2053–2063. [Google Scholar] [CrossRef]
Markthaler, D.; Fleck, M.; Stankiewicz, B.; Hansen, N. Exploring the Effect of Enhanced Sampling on Protein Stability Prediction. J. Chem. Theory Comput. 2022, 18, 2569–2583. [Google Scholar] [CrossRef] [PubMed]
Kasavajhala, K.; Lam, K.; Simmerling, C. Exploring Protocols to Build Reservoirs to Accelerate Temperature Replica Exchange MD Simulations. J. Chem. Theory Comput. 2020, 16, 7776–7799. [Google Scholar] [CrossRef]
de Jong, D.H.; Singh, G.; Bennett, W.F.D.; Arnarez, C.; Wassenaar, T.A.; Schäfer, L.V.; Periole, X.; Tieleman, D.P.; Marrink, S.J. Improved Parameters for the Martini Coarse-Grained Protein Force Field. J. Chem. Theory Comput. 2012, 9, 687–697. [Google Scholar] [CrossRef]
Uusitalo, J.J.; Ingólfsson, H.I.; Akhshi, P.; Tieleman, D.P.; Marrink, S.J. Martini Coarse-Grained Force Field: Extension to DNA. J. Chem. Theory Comput. 2015, 11, 3932–3945. [Google Scholar] [CrossRef]
Souza, P.C.T.; Alessandri, R.; Barnoud, J.; Thallmair, S.; Faustino, I.; Grünewald, F.; Patmanidis, I.; Abdizadeh, H.; Bruininks, B.M.H.; Wassenaar, T.A.; et al. Martini 3: A general purpose force field for coarse-grained molecular dynamics. Nat. Methods 2021, 18, 382–388. [Google Scholar] [CrossRef] [PubMed]
Ouldridge, T.E.; Louis, A.A.; Doye, J.P.K. Structural, mechanical, and thermodynamic properties of a coarse-grained DNA model. J. Chem. Phys. 2011, 134, 085101. [Google Scholar] [CrossRef] [PubMed]
Ouldridge, T.; Louis, A.; Doye, J. DNA Nanotweezers Studied with a Coarse-Grained Model of DNA. Phys. Rev. Lett. 2010, 104, 178101. [Google Scholar] [CrossRef] [PubMed]
Šulc, P.; Romano, F.; Ouldridge, T.E.; Rovigatti, L.; Doye, J.P.; Louis, A.A. Sequence-dependent thermodynamics of a coarse-grained DNA model. J. Chem. Phys. 2012, 137, 135101. [Google Scholar] [CrossRef]
Snodin, B.E.K.; Randisi, F.; Mosayebi, M.; Šulc, P.; Schreck, J.S.; Romano, F.; Ouldridge, T.E.; Tsukanov, R.; Nir, E.; Louis, A.A.; et al. Introducing improved structural properties and salt dependence into a coarse-grained model of DNA. J. Chem. Phys. 2015, 142, 234901. [Google Scholar] [CrossRef]
He, Y.; Maciejczyk, M.; Ołdziej, S.; Scheraga, H.A.; Liwo, A. Mean-Field Interactions between Nucleic-Acid-Base Dipoles can Drive the Formation of a Double Helix. Phys. Rev. Lett. 2013, 110, 098101. [Google Scholar] [CrossRef]
He, Y.; Liwo, A.; Scheraga, H.A. Optimization of a Nucleic Acids united-RESidue 2-Point model (NARES-2P) with a maximum-likelihood approach. J. Chem. Phys. 2015, 143, 243111. [Google Scholar] [CrossRef]
Knotts, T.A.; Rathore, N.; Schwartz, D.C.; De Pablo, J.J. A coarse grain model for DNA. J. Chem. Phys. 2007, 126, 084901. [Google Scholar] [CrossRef]
Sambriski, E.; Schwartz, D.; de Pablo, J. A Mesoscale Model of DNA and Its Renaturation. Biophys. J. 2009, 96, 1675–1690. [Google Scholar] [CrossRef]
Hinckley, D.M.; Freeman, G.S.; Whitmer, J.K.; De Pablo, J.J. An experimentally-informed coarse-grained 3-site-per-nucleotide model of DNA: Structure, thermodynamics, and dynamics of hybridization. J. Chem. Phys. 2013, 139, 144903. [Google Scholar] [CrossRef]
Freeman, G.S.; Hinckley, D.M.; Lequieu, J.P.; Whitmer, J.K.; de Pablo, J.J. Coarse-grained modeling of DNA curvature. J. Chem. Phys. 2014, 141, 165103. [Google Scholar] [CrossRef]
Markegard, C.B.; Fu, I.W.; Reddy, K.A.; Nguyen, H.D. Coarse-Grained Simulation Study of Sequence Effects on DNA Hybridization in a Concentrated Environment. J. Phys. Chem. B 2015, 119, 1823–1834. [Google Scholar] [CrossRef]
Chakraborty, D.; Hori, N.; Thirumalai, D. Sequence-Dependent Three Interaction Site Model for Single- and Double-Stranded DNA. J. Chem. Theory Comput. 2018, 14, 3763–3779. [Google Scholar] [CrossRef]
Assenza, S.; Perez, R. Accurate sequence-dependent coarse-grained model for conformational and elastic properties of double-stranded DNA. J. Chem. Theory Comput. 2022, 18, 3239–3256. [Google Scholar] [CrossRef]
Maciejczyk, M.; Spasic, A.; Liwo, A.; Scheraga, H.A. DNA Duplex Formation with a Coarse-Grained Model. J. Chem. Theory Comput. 2014, 10, 5020–5035. [Google Scholar] [CrossRef]
Cragnolini, T.; Derreumaux, P.; Pasquali, S. Coarse-Grained Simulations of RNA and DNA Duplexes. J. Phys. Chem. B 2013, 117, 8047–8060. [Google Scholar] [CrossRef]
Wang, K.W.; Barker, K.; Benner, S.; Betancourt, T.; Hall, C.K. Development of a simple coarse-grained DNA model for analysis of oligonucleotide complex formation. Mol. Simul. 2018, 44, 1004–1015. [Google Scholar] [CrossRef]
Ding, Y.; Mittal, J. Insights into DNA-mediated interparticle interactions from a coarse-grained model. J. Chem. Phys. 2014, 141, 184901. [Google Scholar] [CrossRef]
Morriss-Andrews, A.; Rottler, J.; Plotkin, S.S. A systematically coarse-grained model for DNA and its predictions for persistence length, stacking, twist, and chirality. J. Chem. Phys. 2010, 132, 035105. [Google Scholar] [CrossRef]
Mu, Z.-C.; Tan, Y.-L.; Zhang, B.-G.; Liu, J.; Shi, Y.-Z. Ab initio predictions for 3D structure and stability of single- and double-stranded DNAs in ion solutions. PLoS Comput. Biol. 2022, 18, e1010501. [Google Scholar] [CrossRef]
Kenward, M.; Dorfman, K.D. Brownian dynamics simulations of single-stranded DNA hairpins. J. Chem. Phys. 2009, 130, 095101. [Google Scholar] [CrossRef]
Linak, M.C.; Dorfman, K.D. Analysis of a DNA simulation model through hairpin melting experiments. J. Chem. Phys. 2010, 133, 125101. [Google Scholar] [CrossRef] [PubMed]
Linak, M.C.; Tourdot, R.; Dorfman, K.D. Moving beyond Watson–Crick models of coarse grained DNA dynamics. J. Chem. Phys. 2011, 135, 205102. [Google Scholar] [CrossRef]
Korolev, N.; Luo, D.; Lyubartsev, A.P.; Nordenskiöld, L. A Coarse-Grained DNA Model Parameterized from Atomistic Simulations by Inverse Monte Carlo. Polymers 2014, 6, 1655–1675. [Google Scholar] [CrossRef]
Dans, P.D.; Zeida, A.; Machado, M.R.; Pantano, S. A coarse grained model for atomic-detailed DNA simulations with explicit electrostatics. J. Chem. Theory Comput. 2010, 6, 1711–1725. [Google Scholar] [CrossRef]
Kovaleva, N.A.; Koroleva Kikot, I.P.; Mazo, M.A.; Zubova, E.A. The “sugar” coarse-grained DNA model. J. Mol. Model. 2017, 23, 66. [Google Scholar] [CrossRef]
Jeddi, I.; Saiz, L. Three-dimensional modeling of single stranded DNA hairpins for aptamer-based biosensors. Sci. Rep. 2017, 7, 1178. [Google Scholar] [CrossRef]
Sabri, M.Z.; Hamid, A.A.A.; Hitam, S.M.S.; Rahim, M.Z.A. The assessment of three dimensional modelling design for single strand DNA aptamers for computational chemistry application. Biophys. Chem. 2020, 267, 106492. [Google Scholar] [CrossRef]
Prieto, L.; de Sancho, D.; Rey, A. Thermodynamics of Go-type models for protein folding. J. Chem. Phys. 2005, 123, 154903. [Google Scholar] [CrossRef]
Šulc, P.; Romano, F.; Ouldridge, T.E.; Doye, J.P.; Louis, A.A. A nucleotide-level coarse-grained model of RNA. J. Chem. Phys. 2014, 140, 235102. [Google Scholar] [CrossRef]
Hyeon, C.; Thirumalai, D. Mechanical unfolding of RNA hairpins. Proc. Natl. Acad. Sci. USA 2005, 102, 6789–6794. [Google Scholar] [CrossRef]
Hyeon, C.; Thirumalai, D. Capturing the essence of folding and functions of biomolecules using coarse-grained models. Nat. Commun. 2011, 2, 487. [Google Scholar] [CrossRef]
Denesyuk, N.A.; Thirumalai, D. Coarse-Grained Model for Predicting RNA Folding Thermodynamics. J. Phys. Chem. B 2013, 117, 4901–4911. [Google Scholar] [CrossRef]
Pasquali, S.; Derreumaux, P. HiRE-RNA: A high resolution coarse-grained energy model for RNA. J. Phys. Chem. B 2010, 114, 11957–11966. [Google Scholar] [CrossRef]
Shi, Y.-Z.; Wang, F.-H.; Wu, Y.-Y.; Tan, Z.-J. A coarse-grained model with implicit salt for RNAs: Predicting 3D structure, stability and salt effect. J. Chem. Phys. 2014, 141, 105102. [Google Scholar] [CrossRef]
Shi, Y.-Z.; Jin, L.; Wang, F.-H.; Zhu, X.-L.; Tan, Z.-J. Predicting 3D Structure, Flexibility, and Stability of RNA Hairpins in Monovalent and Divalent Ion Solutions. Biophys. J. 2015, 109, 2654–2665. [Google Scholar] [CrossRef]
Shi, Y.-Z.; Jin, L.; Feng, C.-J.; Tan, Y.-L.; Tan, Z.-J. Predicting 3D structure and stability of RNA pseudoknots in monovalent and divalent ion solutions. PLoS Comput. Biol. 2018, 14, e1006222. [Google Scholar] [CrossRef]
Jin, L.; Shi, Y.-Z.; Feng, C.-J.; Tan, Y.-L.; Tan, Z.-J. Modeling Structure, Stability, and Flexibility of Double-Stranded RNAs in Salt Solutions. Biophys. J. 2018, 115, 1403–1416. [Google Scholar] [CrossRef]
Jin, L.; Tan, Y.-L.; Wu, Y.; Wang, X.; Shi, Y.-Z.; Tan, Z.-J. Structure folding of RNA kissing complexes in salt solutions: Predicting 3D structure, stability, and folding pathway. RNA 2019, 25, 1532–1548. [Google Scholar] [CrossRef]
SantaLucia, J., Jr.; Allawi, H.T.; Seneviratne, P.A. Improved Nearest-Neighbor Parameters for Predicting DNA Duplex Stability. Biochemistry 1996, 35, 3555–3562. [Google Scholar] [CrossRef]
Tan, Z.J.; Chen, S.J. Electrostatic correlations and fluctuations for ion binding to a finite length polyelectrolyte. J. Chem. Phys. 2005, 122, 044903. [Google Scholar] [CrossRef]
Tan, Z.; Zhang, W.; Shi, Y.; Wang, F. RNA folding: Structure prediction, folding kinetics and ion electrostatics. Adv. Exp. Med. Biol. 2015, 827, 143–183. [Google Scholar]
Tan, Y.-L.; Wang, X.; Shi, Y.-Z.; Zhang, W.; Tan, Z.-J. rsRNASP: A residue-separation-based statistical potential for RNA 3D structure evaluation. Biophys. J. 2022, 121, 142–156. [Google Scholar] [CrossRef]
Tan, Y.-L.; Wang, X.; Yu, S.; Zhang, B.; Tan, Z.-J. cgRNASP: Coarse-grained statistical potentials with residue separation for RNA structure evaluation. NAR Genom. Bioinform. 2023, 5, lqad016. [Google Scholar] [CrossRef]
Li, Z.; Yang, Y.; Zhan, J.; Dai, L.; Zhou, Y. Energy Functions in De Novo Protein Design: Current Challenges and Future Prospects. Annu. Rev. Biophys. 2013, 42, 315–335. [Google Scholar] [CrossRef]
Xiong, P.; Wu, R.; Zhan, J.; Zhou, Y. Pairing a high-resolution statistical potential with a nucleobase-centric sampling algorithm for improving RNA model refinement. Nat. Commun. 2021, 12, 2777. [Google Scholar] [CrossRef]
Maffeo, C.; Aksimentiev, A. MrDNA: A multi-resolution model for predicting the structure and dynamics of DNA systems. Nucleic Acids Res. 2020, 48, 5135–5146. [Google Scholar] [CrossRef]
Veneziano, R.; Ratanalert, S.; Zhang, K.; Zhang, F.; Yan, H.; Chiu, W.; Bathe, M. Designer nanoscale DNA assemblies programmed from the top down. Science 2016, 352, 1534. [Google Scholar] [CrossRef]
de Llano, E.; Miao, H.; Ahmadi, Y.; Wilson, A.J.; Beeby, M.; Viola, I.; Barisic, I. Adenita: Interactive 3D modelling and visualization of DNA nanostructures. Nucleic Acids Res. 2020, 48, 8269–8275. [Google Scholar] [CrossRef]
Zeng, C.; Jian, Y.; Vosoughi, S.; Zeng, C.; Zhao, Y. Evaluating native-like structures of RNA-protein complexes through the deep learning method. Nat. Commun. 2023, 14, 1060. [Google Scholar] [CrossRef] [PubMed]
Si, Y.; Yan, C. Improved protein contact prediction using dimensional hybrid residual networks and singularity enhanced loss function. Briefings Bioinform. 2021, 22, bbab341. [Google Scholar] [CrossRef]
Si, Y.; Yan, C. Improved inter-protein contact prediction using dimensional hybrid residual networks and protein language models. Briefings Bioinform. 2023, 24, bbad039. [Google Scholar] [CrossRef]
Huang, B.; Du, Y.; Zhang, S.; Li, W.; Wang, J.; Zhang, J. Computational prediction of RNA tertiary structures using machine learning methods. Chin. Phys. B 2020, 29, 08704. [Google Scholar] [CrossRef]
Li, J.; Zhu, W.; Wang, J.; Li, W.; Gong, S.; Zhang, J.; Wang, W. RNA3DCNN: Local and global quality assessments of RNA 3D structures using 3D deep convolutional neural networks. PLoS Comput. Biol. 2018, 14, e1006514. [Google Scholar] [CrossRef] [PubMed]
Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef]
Baek, M.; DiMaio, F.; Anishchenko, I.; Dauparas, J.; Ovchinnikov, S.; Lee, G.R.; Wang, J.; Cong, Q.; Kinch, L.N.; Schaeffer, R.D.; et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science 2021, 373, 871–876. [Google Scholar] [CrossRef]
Schlick, T.; Portillo-Ledesma, S.; Myers, C.G.; Beljak, L.; Chen, J.; Dakhel, S.; Darling, D.; Ghosh, S.; Hall, J.; Jan, M.; et al. Biomolecular modeling and simulation: A prospering multi-disciplinary field. Annu. Rev. Biophys. 2021, 50, 267–301. [Google Scholar] [CrossRef]
Verkhivker, G.M.; Agajanian, S.; Hu, G.; Tao, P. Allosteric Regulation at the Crossroads of New Technologies: Multiscale Modeling, Networks, and Machine Learning. Front. Mol. Biosci. 2020, 7, 136. [Google Scholar] [CrossRef]

Figure 1. (a) DNA-only structures released in the Protein Data Bank (PDB) (http://www.rcsb.org/, accessed on 1 October 2022) per year. Bars: total number of entries available. Line: number of structures released annually. (b–f) Three-dimensional (left) and secondary (right) structures for (b) B-form dsDNA and typical non-B-form DNAs: (c) DNA hairpin; (d) triplex; (e) G-quadruplex; (f) i-motif. The 3D structures are shown with PyMol (http://www.pymol.org, accessed on 1 October 2022).

Figure 2. (a) Schematic diagram of MD simulations. (b) Diagram of calculations on the flexibility of dsDNA in MD simulations. Left: initial conformation. Right: equilibrium conformation. (c) MD simulations for short ssDNA in ion solutions. (d) Distributions of the angle between contiguous P-C4′-P atoms from experimental structures (red) and MD simulated conformations (blue). The 3D structures are shown with PyMol (http://www.pymol.org).
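
The angle statistic in panel (d) can be illustrated with a short sketch. It computes the angle at each C4′ atom formed with its two flanking P atoms, which is one plausible reading of "the angle between contiguous P-C4′-P atoms". The function names, the indexing convention P(i)-C4′(i)-P(i+1), and the toy coordinates are assumptions made for illustration only; they are not taken from the simulations discussed in the article.

```python
import math

def angle_deg(a, b, c):
    """Angle in degrees at vertex b formed by the points a-b-c (3D tuples)."""
    v1 = [ai - bi for ai, bi in zip(a, b)]
    v2 = [ci - bi for ci, bi in zip(c, b)]
    dot = sum(x * y for x, y in zip(v1, v2))
    norm = math.sqrt(sum(x * x for x in v1)) * math.sqrt(sum(y * y for y in v2))
    # Clamp to [-1, 1] to guard against rounding errors before acos.
    cos_theta = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cos_theta))

def p_c4_p_angles(p_atoms, c4_atoms):
    """Angles P(i)-C4'(i)-P(i+1) along one strand (assumed indexing)."""
    return [angle_deg(p_atoms[i], c4_atoms[i], p_atoms[i + 1])
            for i in range(len(p_atoms) - 1)]

# Toy coordinates (hypothetical, not from any PDB entry or trajectory).
p_atoms = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (-1.0, 1.0, 1.0)]
c4_atoms = [(0.0, 0.0, 0.0), (0.0, 1.0, 1.0)]
angles = p_c4_p_angles(p_atoms, c4_atoms)
```

Collecting such angles over all frames of a trajectory and over a set of experimental structures would give two histograms that could be compared in the manner of panel (d).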

Figure 3. Representations of several DNA CG models. (a) oxDNA: two beads [104,105,106,107]. (b) NARES-2P: the united sugar bases (B’s) and the united phosphate groups (P’s) serve as interaction sites [108,109]. (c) 3SPN: three beads [110,111,112,113]. (d) BioModi: three beads [114]. (e) TIS-DNA: three beads [115]. (f) MADna: three beads [116]. (g) MARTINI-DNA: six/seven beads [102]. (h) UNRES-like DNA: six/seven/eight beads [117]. (i) HiRE-DNA: six/seven beads [118]. All 3D structures (ball-stick: CG; cartoon: all-atom) are shown with PyMol (http://www.pymol.org).

Figure 4. (a) CG representation of a DNA fragment in our model superimposed on the all-atom representation. (b) Schematic representation of base-pairing and base-stacking interaction. (c) 3D structures of ssDNAs predicted by our model compared with existing models.

Figure 5. (a) Melting temperatures as functions of [Na⁺] for three dsDNAs with different sequences. (b) Melting temperatures as functions of [Mg²⁺] for the dsDNA at different [Na⁺]s. (c) Comparisons between predictions and experiments for a DNA pseudoknot at 0.1 M [Na⁺].

Figure 6. Workflows of DNA structure assembly methods. (a) Workflow of two indirect 3D structure prediction methods for ssDNA with the aid of the RNA structure prediction methods [129,130]. (b) Workflow of 3dDNA for DNA 3D structure prediction [44].

Table 2. Potentials explicitly included in typical DNA CG models ^a.

Model	$U_{b}$	$U_{a}$	$U_{d}$	$U_{exc}$	$U_{bp}$	$U_{bs}$	$U_{cs}$	$U_{el}$	$U_{pp}$	$U_{ps}$	$U_{pb}$	$U_{ss}$	$U_{sb}$	$U_{bb}$
oxDNA	√			√	√	√	√	√	√
3SPN	√	√	√	√	√	√	√	√
TIS	√	√		√	√	√		√
Plotkin et al.	√	√	√		√				√		√		√	√
UNRES-like DNA	√	√	√						√	√	√	√	√	√
HiRE-DNA	√	√	√		√				√
NARES-2P	√	√							√		√
Shi et al.	√	√	√	√	√	√	√	√

^a indicates the main potentials used in typical DNA CG models, and √ indicates that the potential is explicitly included in the model. $U_{b}$, $U_{a}$, and $U_{d}$ are potentials of bond length, angle, and dihedral for neighbor CG beads, respectively. $U_{exc}$: excluded volume interaction; $U_{bp}$: base pairing or hydrogen bonding interactions; $U_{bs}$: base stacking interactions; $U_{cs}$: coaxial stacking interactions; $U_{el}$: electrostatic repulsive interactions; $U_{pp}$, $U_{ps}$, and $U_{pb}$ are interactions between phosphate and phosphate/sugar/base; $U_{ss}$ and $U_{sb}$ are interactions between sugar and sugar/base, respectively; and $U_{bb}$: base–base interactions.
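
As a minimal sketch of how the terms in Table 2 combine, the total energy of a CG model can be viewed as the sum of the potentials that the model explicitly includes. The example below uses the terms ticked for 3SPN in the table. The harmonic form, the parameter values, and the helper names (`harmonic`, `total_energy`) are assumptions for illustration; each published model defines its own functional forms and parameters.

```python
# Terms ticked for 3SPN in Table 2 (symbols follow the table header).
TERMS_3SPN = ("U_b", "U_a", "U_d", "U_exc", "U_bp", "U_bs", "U_cs", "U_el")

def harmonic(x, x0, k):
    """Harmonic penalty k * (x - x0)^2; an assumed, generic functional form."""
    return k * (x - x0) ** 2

def total_energy(term_values, included_terms):
    """Sum only the energy terms that a given model explicitly includes."""
    missing = [t for t in included_terms if t not in term_values]
    if missing:
        raise KeyError(f"missing energy terms: {missing}")
    return sum(term_values[t] for t in included_terms)

# Placeholder values (hypothetical numbers and units, not model parameters).
values = {name: 0.0 for name in TERMS_3SPN}
values["U_b"] = harmonic(4.1, 4.0, 10.0)  # one bond stretched by 0.1
values["U_a"] = harmonic(1.6, 1.5, 5.0)   # one angle bent by 0.1
energy = total_energy(values, TERMS_3SPN)
```

Passing a different tuple of term names switches the bookkeeping to another row of the table without changing the summation itself.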

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

