DNA Structure and the Golden Ratio Revisited

: B-DNA, the informational molecule for life on earth, appears to contain ratios structured around the irrational number 1.618..., often known as the “golden ratio”. This occurs in the ratio of the length:width of one turn of the helix; the ratio of the spacing of the two helices; and in the axial structure of the molecule which has ten-fold rotational symmetry. That this occurs in the information-carrying molecule for life is unexpected, and suggests the action of some process. What this process might be is unclear, but it is central to any understanding of the formation of DNA, and so life.


Introduction
An interesting feature of living organisms on earth is a seeming tendency for their morphology to be structured around the "golden ratio", an irrational number, often denoted by the Greek letter Φ (Phi). Some 35 years ago it was proposed that even DNA might also be structured around Φ which, were it the case, would be remarkable, with ". . . considerable historical, theological, mathematical and, above all, aesthetical interest" [1]. Since then, knowledge of the atomic structure together with the ability to model DNA have both improved, providing the opportunity to better test, with greater precision, to what extent DNA might show structure(s) with ratios convergent toward Φ.
This unique ratio (Φ) has been studied since antiquity, with the discovery of many fascinating mathematical properties. These, and the occurrences of this ratio in the natural world, have been discussed by many other authors [2][3][4].
An increasingly accurate approximation to Φ is found in any number series constructed using the simple rule that, beginning with any two real numbers (and the proviso that both cannot be zero), each successive number in the series is the sum of its two predecessors. Applying this rule in its simplest form (where the first two numbers are 0 and 1) gives the Fibonacci series ([0], 1, 1, 2, 3, 5, 8, 13, 21, 34 . . . ). Division of a number in such a series by its immediate predecessor converges to Φ as the series progresses, while Φ may be calculated by: The reciprocal of Φ, often given the symbol φ (phi) is: and division of a number in the Fibonacci series by its immediate successor converges to φ as the series progresses.
The golden ratio appears in many unanticipated ways in a variety of geometrical constructions [3,4]. Of relevance to the work presented here is the geometry of the golden triangle. This is a ubiquitous motif in a regular pentagon with an inscribed pentagram, Figure 1, having a number of mathematical curiosities.
A golden triangle (gnomon) is isosceles, with the length of the two equal sides Φ (or Φ −1 ) times that of the length of the base, Figure 1. A golden triangle (gnomon) is the only triangle in which the vertex angle, 36 • (108 • ), is half (three times) that of the base angles. These angles are simply related to π, or perhaps the more elegant circle constant τ = 2π, [5]: (1π)/5 = (1τ)/10 = 0.6283 radians = 36 • (2π)/5 = (2τ)/10 = 1.2566 radians = 72 • (3π)/5 = (3τ)/10 = 1.8849 radians = 108 • which sum to π radians, as expected for a triangle, in an elegant manner: Harel et al. [1] first noted that: (a) the ratio of the length of one turn of a B-DNA helix to its width, and (b) the ratio of the major and minor groove separation were both close to Φ. In a living cell B-DNA is a dynamic molecule, and its conformation and degree of twist may vary during transcription, coiling, interaction with cations, and also slightly depending on the actual base sequence. A free strand of B-DNA in solution, with the ends unconstrained, has ∼10.5 base pairs per turn of the helix [6]. This compares to the "canonical" B-DNA molecule, which completes one rotation of the helix after 10 base pairs, as seen in the high humidity crystalline state, in hydrated DNA aggregates and notably also when the B-DNA is supercoiled on nucleosomes within the cell [6][7][8]. It is this latter "canonical" 10 base pair per helical turn B-DNA that most closely approximates a Φ-structure in its form and which is modelled here in this work. Although it may represent an idealised mean of the B-DNA molecule, this does not diminish the significance if it is found to be structured with ratios convergent toward Φ, as this would be unexpected and seem to imply some unknown mechanism, perhaps with relevance to the origin of DNA and so also life.
The results presented here confirm and extend the observations of Harel et al. [1], that the B-DNA molecule is closely structured around Φ. Why this is the case remains unclear and is discussed in the second half of the paper. , containing 20 golden triangles, with a "golden diamond" highlighted. This is shown in expanded view (right) to illustrate its construction consisting of a golden triangle and a golden gnomon (shown shaded) and relationships to Φ. Angles are expressed in radians.

Materials and Methods
The nucleic acid modelling software package 3DNA, Version 2.4.4 [9] was used to construct an atomic model of B-DNA (the hydrated form found in living cells) using the "fiber" command for B-DNA model number 4 [10] and subsequent refinements [11,12]: This produces a Protein Data Bank (PDB) file containing the atomic co-ordinates of B-DNA, with a twist of 36.0 • and a rise of 3.375 Å (1 Å = 0.1 nm) each base pair, for five repetitions of the artificial base sequence adenine, thymine, cytosine and guanine (ATCG), chosen to average out any variations in the structure that occur with non-random base sequences and consequent slight variation in electrostatic forces [8,13,14]. Twenty repetitions of the ATCG sequence were used for the axial view shown in Figure 2 and five for the linear view (shown in part in Figure 3). PyMOL version 1.8.6.0 [15] was subsequently used to visualise the molecule.
The canonical B-DNA molecule is found to contain ratios approximating very closely to Φ in at least three aspects.

Length: Diameter
Franklin and Gosling [16] provided the first measurement of the diameter of the DNA molecule: "Thus. . . we find that the phosphate groups or phosphorus atoms lie on a helix of diameter about 20Å. . . ". This measurement has often been cited in subsequent literature. It is, however, a rounded value for the distance between the phosphorus atoms, these being the most massive and so providing the greatest signal in the X-ray diffraction technique used. The outermost atoms are however not phosphorus, but the oxygens of the phosphate groups which give the B-DNA molecule modelled here a diameter of 20.978 Å, Figure 2.
As noted above, the canonical B-DNA molecule completes one rotation after 10 base pairs, as seen in the crystalline state and when supercoiled on the nucleosome, with a mean pitch of 3.375 Å giving a mean linear distance for one helical rotation of 33.750 Å, Figure 3. As previously noted, there are slight differences in the electrostatic forces existing between the bases along the molecule and so the exact distance will vary slightly depending on the base code.
Given these mean values however, the ratio of the length of one turn of the molecule to its width is 33.750 : 20.978 = 1.6088 or within 0.5% of the value of Φ (1.61803. . . ). This is slightly closer to Φ in comparison to the previously reported value (1.6031) [1]

Major: Minor Groove
The arrangement of the double helices of B-DNA form a relatively deep and wide major, and a shallow, narrower minor groove. The internal groove widths vary somewhat due to hydration and ion interactions and the encoding of the bases.
Various measurements of the internal groove width have been reported [12,17] of between 11.6-14 Å for major, and 6 Å for minor internal groove widths respectively, and the ratio of these is a poor approximation to Φ. However, rather than measure the internal groove width, a more relevant measure of the distance of separation of the two helices are the distances between the phosphorus atoms in the phosphate "backbone". Figure 3 shows part of the DNA molecule constructed as described above, with just the phosphorus atoms plotted. The distances along the z-dimension of the molecule are shown for three phosphorus atoms (8.277, 28.527 and 42.027 Å). Normalising these relative to the first value gives distances of [0], 20.571 and 33.750 Å and so a ratio for the spacing of the two helices of 33.750 : 20.571 = 1.6407 or within 1.4% of Φ. Again this ratio is closer to Φ than the previously published ratio (1.538) [1].

Axial Symmetry
The axial structure of B-DNA is shown in Figures 2 and 4. The rotation of each base pair by 36 • , in the canonical B-DNA molecule, gives a complete 360 • rotation every 10 base pairs and results in a decagonal structure and ten-fold rotational symmetry. These properties alone relate the axial structure of the molecule to Φ. Although a molecule with a decagonal form might be expected to show some simple relationship to Φ, and while there is always a danger in seeing patterns where there may be none, the axial form of B-DNA is neatly encompassed by 10 "golden diamonds" (refer Figure 1), the template of which is shown in Figure 4B, and overlaid on Figure 4A such that points (a) and (c) are located on the outermost hydrogens and point (b) is between the outer oxygen atoms. Any "ball and stick" visualisation of a molecule is an approximation to reality and in Figure 4 the CHNOP atoms are shown scaled at 0.20, 0.10, 0.15, 0.25 and 0.30 of their Van Der Waals radii respectively for clarity, but the same effect is observed if the scale factors are all 1.0.
In Figure 5 two polygons are plotted with origins at the axial centre (0,0) of the molecule. The co-ordinates for the polygon in the "12-o'clock" position are the mean of the x,y co-ordinates for the four hydrogens (centred at ±2.8125, 8.6565 Å) and the outer oxygen pair (0.0, 10.4890 Å). This polygon is close to a "golden diamond" with the distance from the origin to the hydrogens of 9.1019 Å and from the hydrogens to the oxygens of 3.3568 Å or a ratio of 2.7115. For a "golden diamond" this ratio would be Φ 2 = 2.618. . . requiring a co-ordinate of 0.0,10.700 Å, and this is plotted with a dashed line, which passes directly through the outer oxygens.
The co-ordinates for the polygon at "6-o'clock" are for the two inner hydrogens of each group of four (±2.760, −8.4955 Å) and the outer oxygen pair (0.0, −10.4890 Å. This gives side lengths of 8.9335 and 3.4041 Å respectively and a ratio of 2.6243 Φ 2 . That the axial structure of B-DNA can be closely represented by 10 "golden diamonds" is unexpected, and there are many geometrical relationships involving Φ in such a decagonal structure, some of which are summarised in Figure 6.

Discussion
These results show a closer convergence to Φ for the ratio of the length to the width of the molecule and for the spacing of the helices than was obtained by [1], and support their observations. The occurrence of Φ in the axial plane has not previously been discussed.
Unlike the inanimate world, life needs to store and process information. Such information, at a cellular level, includes how to construct and then regulate the timing and quantity of enzymes and other biomolecules for many complex biochemical pathways, as well as to control the fluxes of reactants and products through the cell membrane. For nearly one third of the Universe's history (∼3.8-4.1 billion years or ∼1×10 14 bacterial generations), DNA has acted as such a storage system, transmitting this information with high fidelity [18][19][20][21]. Over that time, the structure of DNA appears unchanged, with all known life using the same four bases (ATCG). Therefore the structure of B-DNA (with its apparent convergence toward Φ), has also been preserved and transmitted over this entire period.
The arrangement of atoms in a molecule (and so its shape) are determined solely by the physical laws established at the origin of the Universe. That B-DNA, the molecule used to store and transmit information, should be composed in such a way that those physical laws result in a molecule with multiple geometries apparently closely approximating Φ is remarkable and seems to imply some underlying factor. What this factor is must then be part of any explanation for the origin of DNA (and life): why should the canonical B-DNA molecule be structured around Φ when presumably many other alternatives, see for example [22], might have been possible?
There is, of course, always a risk of seeing patterns where there are none, so how close must an approximation to Φ be, before it can be said that it is equal to Φ?
Various authors [23,24] have criticised claims that Φ ratios can be found in natural systems, and argue that if enough measurements are made of any given object, there is bound to be some ratio that could be found that might be close to Φ, with [24] proposing (somewhat arbitrarily) that a ratio should be within ±2% of Φ (i.e. between 1.58-1.66) to be considered as a possible candidate for a structure having Φ proportions. As shown in the results above, the ratios calculated here are well within this 2% threshold. Furthermore, they are not based on taking many different measurements of the molecule, but only the simple natural measurements that define it (length, width, spacing of the helices and the axial shape).
The RMS error for the precision of the B-DNA model used here is given as 0.73 Å for all atoms and 0.02 Å for the base atoms [9,10]. As the calculation of Φ involves the ratio of two measurements, the effect of any error in measurement is therefore doubled. Using the lowest RMS error value (0.02Å) would then give an RMS error of 0.04 Å and this means that the ratio of the length:width of the B-DNA molecule (1.6087 Å) and the ratio of the Major:Minor groove phosphorus atoms (1.6407 Å) are both within the RMS error range relative to Φ. In terms of the axial symmetry measure, if the "best fit" ratio of 2.6243 Å were used, this too is within ±0.04 Å of Φ 2 = 2.6180. . . although in this case, as these measurements are not on the base atoms of the molecule, a lower level of precision than 0.04 Å could have been used.
Therefore, to the level of precision available, the B-DNA molecule cannot be said not to have ratios within it which converge to Φ.
A second criticism is that even if an object does have a Φ structure, this does not mean anything, random events occur, and this is just such an occurrence.
Without a knowledge of the full range of possible geometries available to life in selecting a molecule to encode information, it is not possible to assess statistically how likely or unlikely any apparent convergences to a geometry approaching Φ might be. Work on synthetic DNA [22,25] and alternatives to RNA [26,27] suggest that other molecular structures, which do not appear to contain ratios convergent toward Φ are possible. Therefore, it does not seem, a priori, an inevitability that the information-carrying molecule for life should necessarily be structured around Φ.
If a convergence to Φ were seen in just one aspect of B-DNA, that might be considered a fortuitous chance. However, the B-DNA molecule appears to show a convergence toward Φ in at least three ways (each independent to the others), and all related to the fundamental structure of the molecule. The measurements used here are the simple natural ones (length of one helical turn, width and spacing of the helices) which define it. So, while it cannot be ruled out that the Φ structure of DNA is just a random occurrence, it would seem that the chance of this occurring randomly is much smaller than the option that it was produced by some mechanism, especially given that the occurrence of any mathematical pattern(s) in the natural world has invariably implied some underlying cause(s). What this cause is remains to be determined, but it would seem fundamental to any understanding of the origin of life on earth.
There is clear evidence, especially in plants, that macro-scale structures based around Φ occur, and confer selective benefits [28,29]. At the molecular scale, natural selection also operates, both in terms of determining chemical pathways and specific biomolecules. For example, by favouring a species with the ability to process a particular compound, a particular biochemical pathway might be selected for and by selecting for a more efficient catalyst, a particular biomolecule would be selected for. In selecting for the shape of that given biomolecule, natural selection would be expected to favour those shapes which confer the greatest physical functionality, in this case, as a catalyst [30]. However, the shapes of these biomolecules would not be expected to follow any particular mathematical or geometrical system, their shape being dictated solely by their function, which is to process still smaller molecules such as glucose, methane and so on.
What the selection pressure was that might have led to B-DNA having a structure based around Φ remains to be determined. If there were one, it would seem to be specific to the requirements of DNA, unless other biological molecules can also be found which also show Φ patterns. As DNA "processes" information rather than acting as a catalyst, there are different and additional selection pressures that might act on it, compared to an enzyme, for example, those that improve information storage density, copying fidelity and physical stability [31,32], and these different requirements may be where a selection pressure can be applied.
The difficulty of forming DNA de novo in any sort of plausible prebiotic environment [26] has led to the hypothesis that it instead evolved from a pre-existing "RNA world".
RNA uses the bases AGC and uracil (U) instead of thymine (T). Single stranded, RNA can form complimentary base pairings with other parts of the RNA molecule, allowing it to form loops, complex three-dimensional shapes and short double helical structures. These short helical structures have 11 base pairs per turn, with a form similar to A-DNA [33], and do not possess a Φ structure. Therefore, if there were some selection process favouring a Φ structure, it would seem that it would only have been able to act on RNA after the RNA itself had already formed (so leading to DNA), but by that point, three of the four nucleotide bases were already "decided", and a different selection pressure would need to be posited for them. Furthermore, in the RNA-world scenario, while RNA stored information, it also acted as an enzyme, and in that role particularly, its shape is critical. The strongest selection pressure on an RNA molecule, in a hypothetical RNA world (or protein world before it) might therefore be to generate a shape that would act as the most efficient catalyst. Therefore, if an RNA world were the precursor to life based on DNA, none of the benefits of DNA being based around Φ (if there are any) would seem to be realised until DNA had formed, so leading to the question: why was RNA selected for?
RNA bases and RNA itself are not stable [34,35], nor are ribose sugars [19,36], except perhaps in the presence of borate [37]. Recently a possible chemical pathway (requiring phosphate and borate compounds among others) that could lead to the production of pyrimidine nucleosides and ribose under prebiotic conditions, and requiring alternating wet and dry cycles has been proposed [38]. Whether this would be possible at the time life originated on earth is debatable, as it would appear then that earth was essentially, if not completely, covered by sea [39][40][41]. In addition, it is also possible that the required borates may not have been produced on earth at this time [42]. These and other difficulties with RNA being the first form of molecule used for information storage and replication are well discussed [35,43], and have led to the hypothesis that the RNA world was preceded by a protein or other evolving system of chemistry [44][45][46]. If this were the case, then the question is what was the selective pressure which caused this pre-RNA world to evolve to an RNA world with bases which would ultimately evolve to the DNA molecule as we have it, with a Φ structure? What these selective pressures could be is unclear.
Mineral surfaces may adsorb, stabilize and potentially catalyse reactions of ribose, phosphate and nucleobases, as well as adsorbing DNA and RNA molecules [47,48], and various minerals have been suggested as playing a fundamental role in the origin of life [49,50]. The minerals investigated in these studies all have regular crystalline structures.
With this in mind, it is interesting to note that quasicrystals [51,52] form structures which, at the atomic scale, have atoms arranged in explicit multiples of Φ [53,54], and that quasicrystals have been found (in meteorite fragments) in the natural environment [55]. Speculatively then, could some such quasicrystal, perhaps containing phosphate and borate, have provided a Φ patterned ionic surface template (see for example [54]) on which the molecules to produce short strands of RNA, or even DNA, might have been adsorbed and arranged? The bonds holding the RNA/DNA molecule together are stronger than the Van der Waals or ionic bonds with which it would be adsorbed to the quasicrystal surface, so potentially allowing strands of RNA/DNA to form and be released. If this were possible, then it might start to explain why DNA has the Φ structure it does: it is an artefact of a particular mineral template that formed the first RNA/DNA strands. In the absence of experimental data, this must remain speculative. However, if the case, why such a process should have been the one to result in life's information-carrying molecule with a mathematical structure having Φ proportions seems to raise almost teleological questions.

Conclusions
The structure of canonical B-DNA converges to Φ in the ratio of the length:width of one turn of the helix; in the spacing of the two helices; and in the axial arrangement of the atoms. This is unexpected and suggests the action of some external process. The determination of what this process might be remains to be found, but is central to any understanding of the origin of life.