- freely available
Int. J. Mol. Sci. 2012, 13(4), 5125-5137; doi:10.3390/ijms13045125
Published: 23 April 2012
Abstract: Biomolecular systems that can process information are sought for computational applications, because of their potential for parallelism and miniaturization and because their biocompatibility also makes them suitable for future biomedical applications. DNA has been used to design machines, motors, finite automata, logic gates, reaction networks and logic programs, amongst many other structures and dynamic behaviours. Here we design and program a synthetic DNA network to implement computational paradigms abstracted from cellular regulatory networks. These show information processing properties that are desirable in artificial, engineered molecular systems, including robustness of the output in relation to different sources of variation. We show the results of numerical simulations of the dynamic behaviour of the network and preliminary experimental analysis of its main components.
Computing is the study of natural and artificial information processes . Natural computing includes the implementation of computational paradigms abstracted from natural phenomena either on traditional electronic hardware or on alternative physical media such as in biomolecular (DNA, RNA) computing . Biocompatible systems that can process information are sought for future biomedical applications. Biomolecular systems are potentially important for these and more general computational applications because of their intrinsically high information storage capabilities and parallelism.
DNA is used to build synthetic molecular machines because the simplicity of its structure and interactions allows control of its assembly through information stored in nucleotide sequences [3,4]. The Watson–Crick double helix is formed by linking together two antiparallel strands of DNA that have complementary base sequences, a process known as hybridization. Attractive interactions between complementary nucleotides contribute to the stability of the structure: adenine (A) on one strand pairs with thymine (T) on the other, while cystosine (C) pairs with guanine (G). It is the remarkable specificity of the interactions between these complementary nucleotides, together with the availability of routine commercial synthesis, that allows DNA to be used as engineering material with which to design complex systems and structures capable of self-assembly and parallel operation [3,4]. This versatile molecule has been used to design machines [5–13], finite automata [14,15], logic gates [16,17], reaction networks [18–20] and logic circuits [21,22], amongst many other structures and dynamic systems. Refer to [3,4,23] for reviews of DNA devices and machines.
Here we present preliminary work towards an autonomous, synthetic DNA network, comprised solely of DNA molecules. This DNA network implements computing paradigms abstracted from natural cellular biochemical reaction networks.
Cellular regulation is achieved through complex networks of interactions among biochemicals and cellular structures. Recurrent network motifs, classifiable in terms of function, architecture, dynamics, or biochemical process [24–26], have been identified. Alon and colleagues investigated transcription networks in the bacterium E. coli and the yeast S. cerevisiae whose information processing role is to determine the rate of production of specific proteins as a function of the environment [25,26]. In these networks, the nodes are genes and the edges represent transcriptional regulation of one gene by the protein product of another gene. Two important motifs in E. coli and S. cerevisiae transcription networks are the type-1 coherent feed-forward loop (C1-FFL) and the type-1 incoherent feed-forward loop (I1-FFL). In the C1-FFL (Figure 1a) both paths are positive: X activates both Z and an activator of Z. In the I1-FFL (Figure 1b), the direct path is positive and the indirect path is negative, i.e., X activates Z and an inhibitor of Z. Experimental and computational approaches have shown that the C1-FFL shows sign-sensitive delay that can protect against brief input fluctuations , whereas the I1-FFL is responsible for functions such as pulse generation , adaptation [29,30], fold-change detection  and amplitude filtering . Figure 1c shows an example of pulse generation using the I1-FFL network motif.
Other networks have been investigated and display interesting information processing properties. Acar et al.  investigate inducibility and network-dosage invariance (i.e., invariance to the number of copies of a gene network in a cell) in the yeast galactose network. Their results revealed that, in general, the presence of two network components, one positive and one negative regulator, is the minimal requirement for network-dosage invariance.
In this work, we develop an implementation of an information processing paradigm abstracted from the yeast galactose network using DNA molecular computation. In particular, we aim to show that a DNA system can be designed and programmed to implement an information processing function that is robust to changes in network dosage and is thus capable of contributing to network behaviour that is reliable despite the stochasticity inherent in molecular systems. We choose to implement, on the DNA network, the basic information processing function of the I1-FFL: pulse generation. We note that the I1-FFL network motif  has two components, one positive and one negative regulator, and thus satisfies the minimal requirement for a network whose activity is robust to changes in network dosage .
The paper is organized as follows. Section 2 discusses the toolbox, i.e., the DNA processes and structures from which our DNA network is composed. The network itself is described in Section 3. Section 4 contains the results of numerical simulations, and conclusions are presented in Section 5.
2. The Toolbox of DNA Processes
Figure 2 illustrates an operation that is often harnessed in the design of dynamic DNA systems: toehold hybridization followed by branch migration resulting in strand displacement . The system depicted in Figure 2 can be considered an information processing system that detects the presence of input S3 and releases output S1 in response. Using the same simple mechanism of toehold-mediated strand displacement, in 2006 Seelig et al.  showed experimental results for DNA-based digital logic circuits and in 2009, Soloveichik et al.  discussed reaction cascades with unimolecular and bimolecular kinetics. They illustrate their method, for example, by simulating DNA reaction modules that corresponds to the Lotka–Volterra oscillator.
Loop-containing DNA structures are interesting elements for information processing systems, as energy and information can be stored in the loop in a form that cannot readily be accessed [37,38]. The neck of the hairpin controls access to the information stored in the loop: the neck is opened using toehold-mediated strand displacement (Figure 3a). Dirks and Pierce  demonstrate a system of two hairpins in which the toehold required to open the neck of one is hidden in the loop of the other and vice versa. Polymerisation of the loops is triggered when one of the loops is opened by an initiator DNA.
Here, we introduce a new structure, a “gated hairpin loop” (Figure 3), that is used in our DNA network. The gated hairpin (Figure 3b) has a second level of control, i.e., a gate. A second neck controls the accessibility of the toehold used to open the loop and thus to activate the primary loop domain. Preliminary experimental results show that the loop is not opened until the gate is unlocked with the “key” K (Figure 3c). (Please refer to the Supplementary Material for methods and sequences.) The gated hairpin loop is a useful component for the implementation of the DNA network because, before unlocking the “gate”, the information held within this component is not accessible by the other network components.
3. The DNA Network Design
The DNA network is shown in Figure 4. It is abstracted from the topology and functionality of the I1-FFL network motif  and network-dosage invariance of the galactose signalling pathway of yeast . Its abstract representation is identical to the I1-FFL shown in Figure 1b. In our DNA implementation, the nodes are complexes of DNA strands. The network has two inputs, Ix and Ky, and its activity is measured by the concentration of the output Z*. Its design is based on the principle of toehold-mediated strand displacement (Figure 2) and the gated hairpin structure (Figure 3).
To initiate both positive and negative reaction pathways, input Ix reacts with hairpin Hx (as described in Figure 3a) forming X* and allowing access to the toehold sequestered in the loop of Hx (blue). Following the positive reaction pathway, X* can then react with hairpin Hz to form Z*, in which the toehold sequestered in the loop of Hz (yellow) is activated.
Independently, and in parallel, the second input, strand Ky, activates the gated hairpin loop Gy (as described in Figure 3b), forming Gy*. The latter reacts with X* in the negative regulatory pathway, opening the hairpin and forming product Y*. Y* binds to double-stranded complex Aux in a reaction mediated by its exposed toehold (green), displacing strand Inh from Aux and forming Waste. The displaced single strand Inh hybridizes to Z*, forming the structure InhZ in which the domain that encodes the active state of Z* is inhibited.
The output of the network is measured by the concentration of Z*. The toehold activated by production of Z* (the yellow loop domain that is sequestered in hairpin Hz) is reactive and could be used to cascade downstream reactions. It could act as the input signal to another network or could be designed to regulate the production of its own initiator, input strand Ix, providing feedback to the network.
The gated hairpin loop Gy is not active unless input strand Ky is present to open its gate (Figure 3c). In the absence of input Ky, the network only has the positive reaction pathway and input Ix reacts stoichiometrically to produce output Z*. In the presence of Ky, through production of Gy*, a proportion of Ix is diverted to the negative reaction pathway, leading to inhibition of Z*. The use of the gated hairpin therefore adds an independent control to the output of the network.
Because the two reaction pathways run in parallel, it is important to consider the effects of the timing of the inputs on the network behaviour. We consider two scenarios. In Scenario A, input Ky is added first, such that its reaction with Gy to form Gy* is substantially complete before the second input Ix is added. In Scenario B, both inputs are added simultaneously. Two effects can contribute to a transient overproduction of output Z* that is later compensated by production of Inh, resulting in a pulse of Z*. In Scenario B, production of intermediate Gy* does not begin until both inputs are added, so the initial reaction rate of intermediate X* with Gy*, corresponding to entry into the negative pathway, is zero: this guarantees a pulse of Z*. Even in Scenario A, in which both pathways are active when Ix is added, a relative delay in the production of Inh resulting from the greater length of the negative pathway gives rise to pulse production.
This network motif is designed to operate far-from-equilibrium, as are the natural biochemical networks that inspired it. The ratios between forward and reverse reaction rates are determined by the free energy changes in each reaction and can be made large by design. Slow reverse reactions will have little effect on transient phenomena (pulse generation). However, if a sufficiently long time is allowed to elapse then reverse reactions—however slow—will ensure that the final state of the system is in thermodynamic equilibrium, independent of the details of reaction pathways. If the output of the network motif cascades forward to actuate downstream processes that are similarly far-from-equilibrium then this equilibrium state is never relevant to its operation.
3.1. Programming the DNA Network
In order to achieve a steady-state concentration of output Z* that is invariant to network dosage, i.e., to the concentrations of network components, it is necessary to ensure that the ratio between the time-integrated quantities of input Ix (and therefore of intermediate X*) that flow through the positive and negative reaction pathways is invariant. We assume below that the concentrations of network components Hx, Hz, Gy and Aux and of input Ky are in sufficient excess over the initial concentration of input Ix that perturbations resulting from the reactions triggered by addition of Ix are small. The rate of activation of the negative pathway depends on the concentration of intermediate Gy* which is, in general, time-dependent and a function of the input Ky. In Scenario A, when all Gy* is formed before Ix is added, the time-dependence of the production of Ky plays no part in the behaviour of the network. Network-dosage invariance of the steady-state output is achieved if the concentrations of Hz and the smaller of the concentrations of Gy and of Ky are scaled together ensuring a constant branching ratio between the two reaction pathways. In Scenario B, Ky and Ix are added simultaneously and the time-dependence of the production of Gy* does affect the output of the network. In this case, network-dosage invariance is achieved if the concentrations of Hx, Hz, Gy and Ky are scaled together (see Supplementary Material). Note that the dynamic component of the network output (the pulse) is not, in general, network-dosage invariant.
The network can be configured in three ways—positive, balanced and negative—defined by its behaviour in Scenario A. The branching ratio between positive and negative pathways is determined by the concentrations of Gy* and Hz which compete for reaction with intermediate X*. In a balanced network, the time-integrated branching ratio is 1:1 and addition of input Ix has no effect on the steady-state concentration of output Z*. In an unbalanced network the proportions of Ix that enter the positive and negative pathways are unequal with the result that the output concentration of Z* is changed by addition of Ix: it can be increased or decreased (though not, of course, below zero), depending on the relative concentrations of Gy* and Hz.
4. Simulation Results
We have investigated the behaviour of the proposed DNA network through chemical kinetics simulations. The network was modelled by the following ordinary differential equations:
All reactions, with the exception of the reaction of Inh with Z*, involve toehold-mediated strand displacement to open a secondary structure loop. Rate constants k1 to k6 are set at 105M−1s−1 [35,38]. We have assumed that all reactions are irreversible (rates of reverse reactions can be six orders of magnitude slower for appropriate toehold lengths ). Initial concentrations of DNA molecules are specified in Supplementary Material, Table 1: concentrations of network components are of the order of 1 μM. Simulations were performed in Matlab  using the ODE solver ode15s.
Simulation results are shown in Figure 5. The graphs show the concentration of the network output Z* as a function of time. Unless stated otherwise, the initial concentration of input Ix is set to 100 nM. All network components are present at t = 0 s.
Figure 5a shows simulation results for Scenario A (Section 3): input Ky is added first, at t = 0 s, and input Ix is added at t = 700 s when the reaction of Ky with Gy to produce Gy* is substantially complete. Simulation S1 is unbalanced positively, i.e., the initial concentration of Hz is greater than that of Gy resulting in a non-zero steady-state concentration of output Z*. In simulation 2S1, the initial concentrations of all network components are doubled: the concentration of Ky is greater than that of Gy in both cases. The two networks generate similar pulses, and the steady-state outputs of the two networks are the same, demonstrating that the steady-state output of the DNA network is robust to changes in network dosage. The same network behaviour—pulse generation and network-dosage invariant steady-state output—is displayed for balanced (S2 and 2S2) and negative (S3 and 2S3) networks.
Figure 5b shows simulation results for Scenario B: inputs Ix and Ky are delivered simultaneously at t = 0 s. S4-S6 correspond to positive, balanced and negative networks and 2S4–2S6 to the same networks in which the concentrations of network components and input Ky are doubled. Again, under these conditions the steady-state component of the output is robust to the change in network dosage. Note that, as expected, the steady-state output of the balanced network (S5) is different from the zero output of the same network in Scenario A (S2, Figure 5a): this asymmetry is a result of the initial unbalance in the network during production of Gy*. Also shown in Figure 5b is the result of simulation S7 in which all initial concentrations are the same as in 2S5 with the exception of the concentration of Ky, which is as in S5 (half that in 2S5). The outputs of 2S5 and S7 are different, as expected, demonstrating that the concentration of input Ky must be scaled with those of network components in order to achieve network-dosage invariance.
Simulation results shown in Figure 5a,b confirm that the dynamical component of the output Z* (the pulse) changes with network dose. (For an expanded view of the pulses in Figure 5a,b see Figure 1a,b in Supplementary Material.)
The balanced network has another robust behaviour. Once input Ky has had time to react—whether in Scenario A or Scenario B—the steady-state level of output Z* is unchanged by subsequent addition of small quantities of input Ix. Once a steady (and balanced) concentration of Gy* is established, these subsequent stimuli Ix cause equal activation of the positive and negative reaction pathways, resulting in a pulse of Z* but no change in its steady-state concentration. Figure 5c shows the effects of subsequent additions of input Ix. Simulation S8 is the same as Simulation S5, except that two further quantities of Ix were added after the initial pulse of output Z* had died away. (For a plot of the state space see Figure 2 in Supplementary Material.) The second and third additions of Ix have no effect on the steady-state output of the network, as expected. The results of Simulation S9 are also shown: S9 has the same quantity of input Ix added as in Simulation S8, except that it is all added at once at t = 0 s (initial conditions are otherwise the same as Simulations 5 and 8). The steady-state output in Simulation S9 is different from S5 and S8, demonstrating that the lack of effect of the later additions of Ix is a result of the timing of the inputs and not of saturation of the output of the network.
Figures 3–5 in Supplementary Material present additional simulation results showing robustness of output to subsequent stimuli, the limits of the desired network behaviour and results of extreme imbalance between the activation and repression pathways.
We have investigated the implementation, in a synthetic DNA reaction network, of computing paradigms abstracted from two different cellular biochemical reaction networks, the yeast galactose network  and the type-1 incoherent feed-forward loop network motif . The results of chemical kinetics simulations show that the proposed DNA network can be programmed to implement transient pulse generation with a steady-state output that can be made robust to changes in network dosage. This network has another interesting property: it can be configured such that the steady-state output is proportional to the initial dose of one of the inputs but insensitive to subsequent additions which generate only transient output pulses.
This work was supported by EPSRC Grants EP/F055951/1 and EP/F056605/1.
- Conflict of Interest The authors declare no conflict of interest.
- Denning, P.J. Computing is a natural science. Commun. ACM 2007, 50, 13–18.
- Kari, L.; Rozenberg, G. The many facets of natural computing. Commun. ACM 2008, 51, 72–83.
- Bath, J.; Turberfield, A.J. DNA nanomachines. Nat. Nanotechnol 2007, 2, 275–284.
- Seeman, N.C. DNA in a material world. Nature 2003, 421, 427–431.
- Yurke, B.; Turberfield, A.J.; Mills, A.P.; Simmel, F.C.; Neumann, J.L. A DNA-fuelled molecular machine made of DNA. Nature 2000, 406, 605–608.
- Bath, J.; Green, S.J.; Allen, K.E.; Turberfield, A.J. Mechanism for a directional, processive, and reversible DNA motor. Small 2009, 5, 1513–1516.
- Green, S.; Bath, J.; Turberfield, A. Coordinated chemomechanical cycles: A mechanism for autonomous molecular motion. Phys. Rev. Lett 2008, 101, 1–4.
- Muscat, R.; Bath, J.; Turberfield, A. A programmable molecular robot. Nano Lett 2011, 11, 982–987.
- Shin, J.S.; Pierce, N.A. A synthetic DNA walker for molecular transport. J. Am. Chem. Soc 2004, 126, 10834–10835.
- Sherman, W.B.; Seeman, N.C. A precisely controlled DNA biped walking device. Nano Lett 2004, 4, 1203–1207.
- Goodman, R.P.; Heilemann, M.; Doose, S.; Erben, C.M.; Kapanidis, A.N.; Turberfield, A.J. Reconfigurable, braced, three-dimensional DNA nanostructures. Nat. Nanotechnol 2008, 3, 93–96.
- Omabegho, T.; Sha, R.; Seeman, N.C. A bipedal DNA Brownian motor with coordinated legs. Science 2009, 324, 67–71.
- Wickham, S.F.J.; Endo, M.; Katsuda, Y.; Hidaka, K.; Bath, J.; Sugiyama, H.; Turberfield, A.J. Direct observation of stepwise movement of a synthetic molecular transporter. Nat. Nanotechnol 2011, 6, 4–7.
- Benenson, Y.; Paz-Elizur, T.; Adar, R.; Keinan, E.; Livneh, Z.; Shapiro, E. Programmable and autonomous computing machine made of biomolecules. Nature 2001, 414, 430–434.
- Stojanovic, M.N.; Stefanovic, D. A deoxyribozyme-based molecular automaton. Nat. Biotechnol 2003, 21, 1069–1074.
- Seelig, G.; Soloveichik, D.; Zhang, D.Y.; Winfree, E. Enzyme-free nucleic acid logic circuits. Science 2006, 314, 1585–1588.
- Macdonald, J.; Li, Y.; Sutovic, M.; Lederman, H.; Pendri, K.; Lu, W.; Andrews, B.L.; Stefanovic, D.; Stojanovic, M.N. Medium scale integration of molecular logic gates in an automaton. Nano Lett 2006, 6, 2598–2603.
- Zhang, D.Y.; Turberfield, A.J.; Yurke, B.; Winfree, E. Engineering entropy-driven reactions and networks catalyzed by DNA. Science 2007, 318, 1121–1125.
- Yin, P.; Choi, H.; Calvert, C.; Pierce, N. Programming biomolecular self-assembly pathways. Nature 2008, 451, 318–322.
- Soloveichik, D.; Seelig, G.; Winfree, E. DNA as a universal substrate for chemical kinetics. Proc. Natl. Acad. Sci. USA 2010, 107, 5393–5398.
- Ran, T.; Kaplan, S.; Shapiro, E. Molecular implementation of simple logic programs. Nat. Nanotechnol 2009, 4, 642–648.
- Qian, L.; Winfree, E. Scaling up digital circuit computation with DNA strand displacement cascades. Science 2011, 332, 1196–1201.
- Simmel, F.C.; Dittmer, W.U. DNA nanodevices. Small 2005, 1, 284–299.
- Wolf, D.M.; Arkin, A.P. Motifs, modules and games in bacteria. Curr. Opin. Microbiol 2003, 6, 125–134.
- Milo, R.; Shen-Orr, S.; Itzkovitz, S.; Kashtan, N.; Chklovskii, D.; Alon, U. Network motifs: Simple building blocks of complex networks. Science 2002, 298, 824–827.
- Shen-Orr, S.S.; Milo, R.; Mangan, S.; Alon, U. Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet 2002, 31, 64–68.
- Alon, U. Network motifs: Theory and experimental approaches. Nat. Rev. Genet 2007, 8, 450–461.
- Basu, S.; Mehreja, R.; Thiberge, S.; Chen, M.T.; Weiss, R. Spatiotemporal control of gene expression with pulse-generating networks. Proc. Natl. Acad. Sci. USA 2004, 101, 6355–6360.
- Ma, W.; Trusina, A.; El-Samad, H.; Lim, W.A.; Tang, C. Defining network topologies that can achieve biochemical adaptation. Cell 2009, 138, 760–773.
- Bleris, L.; Xie, Z.; Glass, D.; Adadey, A.; Sontag, E.; Benenson, Y. Synthetic incoherent feedforward circuits show adaptation to the amount of their genetic template. Mol. Syst. Biol 2011, 7, doi:10.1038/msb.2011.49.
- Goentoroa, L.; Shoval, O.; Kirschner, M.; Alon, U. The incoherent feedforward loop can provide fold-change detection in gene regulation. Mol. Cell 2009, 36, 894–899.
- Kaplan, S.; Bren, A.; Dekel, E.; Alon, U. The incoherent feed-forward loop can generate non-monotonic input functions for genes. Mol. Syst. Biol 2008, 4, doi:10.1038/msb.2008.43.
- Alon, U. An Introduction to Systems Biology Design Principles of Biological Circuits, 1st ed ed.; Chapman and Hall/CRC: London, UK, 2007.
- Acar, M.; Pando, B.F.; Arnold, F.H.; Elowitz, M.B.; van Oudenaarden, A. A General mechanism for network-dosage compensation in gene circuits. Science 2010, 329, 1656–1660.
- Yurke, B.; Mills, A.P., Jr. Using DNA to power nanostructures. Genet. Programm. Evolvable Mach 2003, 4, 111–122.
- SantaLucia, J. A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. Proc. Natl. Acad. Sci. USA 1998, 95, 1460–1465.
- Turberfield, A.J.; Mitchell, J.C.; Yurke, B.; Mills, A.P.; Blakey, M.I.; Simmel, F.C. DNA fuel for free-running nanomachines. Phys. Rev. Lett 2003, 90, doi:10.1103/PhysRevLett.90.118102.
- Green, S.J.; Lubrich, D.; Turberfield, A.J. DNA hairpins: Fuel for autonomous DNA devices. Biophys. J 2006, 91, 2966–2975.
- Dirks, R.M.; Pierce, N.A. Triggered amplification by hybridization chain reaction. Proc. Natl. Acad. Sci. USA 2004, 101, 15275–5278.
- Zhang, D.Y.; Winfree, E. Control of DNA strand displacement kinetics using toehold exchange. J. Am. Chem. Soc 2009, 131, 17303–17314.
- MATLAB, version 7.10.0 (R2010a); The MathWorks Inc: Natick, MA, USA, 2010.
© 2012 by the authors; licensee Molecular Diversity Preservation International, Basel, Switzerland. This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).