Abstract
Biological processes often involve the attachment and detachment of extended molecules to substrates. Here, the classical dimer model is used to investigate the effects of molecular geometry on the free energy, which governs both the equilibrium state and the reaction dynamics. We present a simplified version of Fisher’s derivation of the partition function of a two-dimensional dimer model at a filling factor of one, which takes into account the blocking of two adjacent sites by each dimer. Physical consequences of the dimer geometry on the entropy that are not reflected in simpler theories are identified. Specifically, for dimers adsorbing on the DNA double helix, the dimer geometry gives a persistently nonzero entropy, and there is a significant charge inversion, as the force binding the particles to the lattice increases relative to the thermal energy. This is not true of the simple lattice gas model, in which all the sites are independent.
Keywords:
entropy; dimer model; trace theorems; pfaffians; cruciform matrices; DNA charge; biological physics
1. Introduction
From metabolism to drug design, the free energy of a system controls its biological processes. The free-energy minimum determines the equilibrium state. The free energy gradients drive the kinetics of the biochemical reactions.
The free energy contains both energy and entropy terms. The energy of complex biomolecules can now be calculated quite accurately, especially using the quantum–chemical methods introduced by Hohenberg and Kohn [1] and extensively developed by many authors in subsequent work. The entropy, on the other hand, is less extensively studied, even though the free energy involves the difference between the energy and entropy terms. The entropy term is known to play an important role in at least some biochemical processes. An example of an entropy-driven biochemical reaction can be found in sickle-cell disease, where carbon dioxide causes hemoglobin polymerization and oxygen from respiration reverses the process [2,3]. In addition, strategies for controlling the entropy are being used in self-assembling systems to generate novel materials in fields like colloids, macromolecular systems and nonequilibrium assembly [4].
Biomolecules are typically long-chain molecules. Arrays of one-dimensional chains on a surface can be created by numerical methods like self-avoiding walks, as can two-dimensional self-avoiding membranes (see Chapter 10 of Plischke and Bergersen [5]). Analytic results, however, are few. The simplest “long-chain” molecule is the dimer, and the first analytic derivations of the partition function of such a system were by Temperley, Fisher, and Kasteleyn [6,7,8] for the classical dimer model. The simplest closed-form expression for the partition function of a one-dimensional chain appears in Fisher’s 1961 paper [7], and we will be presenting a simplified version of his derivation. In the dimer model, when a dimer attaches to a lattice, it blocks two adjacent sites from other dimers attaching. It is much more difficult to determine the number of possible arrangements of dimers on a lattice than of monomers, because a monomer only blocks a single site.
Fisher’s solution method for the dimer model builds monomers as well as dimers into the initial formal construction of the partition function. The monomer term complicates the derivation. Although one set of anticommuting matrices, similar to the Dirac matrices of relativistic quantum mechanics, is used to enforce the dimer constraint, the presence of the monomer term requires an additional set of anticommuting matrices in a product form, reminiscent of the chirality operator $\gamma^5$, to create homogeneity. Even with this construction, Fisher was unable to find an analytic solution to complete the evaluation of the partition function on a two-dimensional lattice with both monomers and dimers included. He therefore dropped the monomer term to reach his final analytic expression for the partition function Z in two dimensions.
Because of this, we present, beginning in Section 4, a simpler method of solving Fisher’s dimer model that neglects the monomer term from the beginning and hence restricts the simple square lattice to be completely filled with dimers, with no empty sites, which we refer to as a filling fraction of one. We then need only a single set of anticommuting operators that live on the links of the lattice and that we refer to as link fields. This method also gives Fisher’s final analytic expression for the partition function Z in two dimensions. This result Z is for a finite lattice with the sites arranged in rows and columns.
Beginning in Section 8, we specifically use this result for the special case of a two-leg-ladder lattice with rows and columns to construct, in the end, the partition function for an infinitely long one dimensional chain of sites containing not just dimers but also monomers of two colors. We use this to produce results using the dimer model that parallel the results found in previous papers for the DNA double helix [9,10], which use a lattice gas model. We find results from the dimer model that differ from the simpler lattice gas models, in which all the sites are independent. Perhaps the most important of these is that the dimer geometry gives a persistently nonzero entropy, as the force binding the particles to the lattice increases relative to the thermal energy.
2. Partition Function, Entropy, and Occupancy
Although the classical dimer problem only requires classical physics, classical statistical mechanics is plagued with oddities like Gibbs’s paradox. It is therefore more reliable to start from the quantum-mechanical expression for the partition function of the grand canonical ensemble,
$$Z = e^{-\beta\Omega} = \mathrm{Tr}\,\exp\Big[-\beta\Big(H - \sum_j \mu_j N_j\Big)\Big],$$
where $\Omega$ is the free energy of this ensemble (often called the grand canonical potential), H is the quantum Hamiltonian of the system, $N_j$ is the number of particles of type j in the system, and $\mu_j$ is the chemical potential of the particles of type j. The thermal parameter $\beta = 1/(k_B T)$ specifies the temperature T of the system, where $k_B$ is Boltzmann’s constant. The trace here is over quantum states of the system. Taking the trace is equivalent to summing over all the possible configurations of the system.
The entropy is then given by $S = -(\partial\Omega/\partial T)_{\mu}$, and the mean number of particles of type j is given by $\langle N_j \rangle = -(\partial\Omega/\partial\mu_j)_{T}$, where the subscripts indicate the quantities that are held constant in taking the partial derivatives.
Fisher’s dimer model describes dimers adsorbed on a two-dimensional lattice. Eventually, we will take the limit in which the lattice becomes large, and for that reason, it is more useful to work with quantities per lattice site, such as the energy per site, the entropy per site, and the number of particles of a given type per site, which is the occupancy per site. It is then helpful to define the logarithm of the partition function per site as
which means that the entropy per site can be written as
The derivative with respect to gives
and then the entropy becomes
A simple example of a partition function is that of a lattice gas. Suppose that there are two kinds of particles, like red balls and blue balls. There are $N!$ ways to fill the N sites of the lattice, where $N_r$ and $N_b$ are the numbers of red and blue balls, with $N_r + N_b = N$. This is divided by $N_r!$, the number of ways to swap red balls and still have the same configuration, and then by $N_b!$, the number of ways to swap blue balls without changing the configuration. This gives the number of configurations of $N_r$ red balls and $N_b$ blue balls,
$$\frac{N!}{N_r!\,N_b!}.$$
There are no forces between the balls, merely a force holding them on the lattice sites, and so the Hamiltonian is
while the chemical potential term becomes . The partition function is then
The activity of the red balls is and of the blue balls is . If is the number of states in the trace that contain red balls and blue balls, then the partition function for this system becomes
where . With the binomial theorem, this can be written as
which is the partition function for red and blue balls on a lattice. The sites are independent, so that the factor is the partition function contribution from each site. For an infinite lattice, the logarithm of the partition function per site is then
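The binomial-theorem step can be checked numerically. The following is a minimal sketch, assuming that the site activities are plain positive numbers called `r` and `b` and that every one of the sites holds either a red or a blue ball; the function name and the numerical values are hypothetical and chosen only for illustration.

```python
from math import comb, isclose

def lattice_gas_partition_sum(n_sites, r, b):
    """Sum the weighted configuration counts over all fillings of n_sites."""
    total = 0.0
    for n_red in range(n_sites + 1):
        n_blue = n_sites - n_red
        # comb(N, N_r) = N! / (N_r! N_b!) is the number of distinct colorings.
        total += comb(n_sites, n_red) * r**n_red * b**n_blue
    return total

# Hypothetical activities and lattice size, chosen only for illustration.
r, b, n_sites = 0.7, 1.9, 12
z_sum = lattice_gas_partition_sum(n_sites, r, b)
z_closed = (r + b) ** n_sites  # closed form from the binomial theorem
print(isclose(z_sum, z_closed, rel_tol=1e-12))
```

The explicit sum and the closed form agree to rounding error, which is the sense in which each independent site contributes one factor of the sum of the two activities.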
The entropy per site is then given by
Since is contained in the activities r and b, , and similarly for b. Inserting the explicit form of the partition function yields the entropy per site of the two-color lattice gas,
We can similarly find the mean particle numbers by differentiating by the chemical potentials. For the red balls r,
The number of red balls per site is given by
Substituting the partition function per site for the red balls, we have
The derivative of r with respect to is given by , and so the average occupation of the red balls is
Similarly, for blue balls, we have
and these two occupancies add to one, as they should, because every site holds either a red or a blue ball.
Generalizing this argument to include red, blue, and green balls, we have
with the partition function per site in the infinite lattice limit represented as
the entropy per site as
and the occupancies as
If one simply wants vacant sites, or holes, in the lattice rather than green balls, the activity of the green balls is replaced by 1, because holes have neither an energy nor a chemical potential.
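The occupancies and the entropy per site of an independent-site lattice gas can be evaluated for any number of colors. The sketch below is hedged: it assumes that each activity is an exponential whose exponent is linear in the thermal parameter, so that the thermal derivative of an activity a is a ln a, and it reports the entropy in units of Boltzmann's constant. The function names and the sample activities are hypothetical.

```python
from math import log, isclose

def occupancies(activities):
    """Mean occupancy per site of each species: n_i = a_i / z."""
    z = sum(activities)
    return [a / z for a in activities]

def entropy_per_site(activities):
    """Entropy per site in units of k_B: ln z - (1/z) * sum_i a_i ln a_i.

    Assumes a_i = exp(beta * c_i) with c_i independent of beta, so that
    beta * d(a_i)/d(beta) = a_i * ln(a_i).
    """
    z = sum(activities)
    return log(z) - sum(a * log(a) for a in activities) / z

# Hypothetical activities: red, blue, and a vacancy with activity 1.
acts = [2.0, 0.5, 1.0]
n = occupancies(acts)
s = entropy_per_site(acts)
# For independent sites the same number follows from the mixing entropy.
mixing = -sum(p * log(p) for p in n)
print(n, s, mixing)
```

Under these assumptions the entropy per site coincides with the mixing entropy of the occupancies, and it vanishes when one activity dominates all the others.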
3. The Lattice and Its Dual
Dimer models describe the behavior of rods with ends that occupy sites on a lattice. However, two dimers cannot have their ends on the same site, and this is the fundamental constraint that makes determination of the allowed dimer configurations difficult. A physical example of a dimer might be a hydrogen molecule, with each of the two atoms held to a separate lattice site through electrostatic attraction. The lattice may be completely covered with dimers, or the coverage may be less than complete. In the latter case, the empty sites may be called vacancies, or alternatively regarded as occupied by monomers. The lattice can be of any dimension, although pictures of one- and two-dimensional lattices are easiest to draw.
Dimer models can be either classical or quantum. The difference is, basically, that in the quantum dimer model, the dimers can tunnel between sites, while classically, they can only move by thermally activated diffusive processes. The quantum dimer model is the basis for Anderson’s resonating-valence-bond theory of antiferromagnetism.
Fisher [7] considered a simple square lattice in two dimensions, with dimers placed randomly on this lattice. He then attempted to determine the number of different dimer arrangements. This result could, for example, be used to determine the entropy, because entropy is the logarithm of the number of possible states of a system. Fisher discovered that he could solve this problem when the lattice was fully occupied and there were no vacancies. The key, he found, was to randomly distribute “half dimers” on all sites of the lattice, as in the lattice gas problem. He then threw away all arrangements in which a half dimer was not connected to a neighbor because the orientations did not match. The remaining configurations were those in which dimers completely covered the lattice.
The trick he used to accomplish this feat was to tag each dimer with a member of a sufficiently large set of anticommuting objects $A^\alpha$ such that
$$A^\alpha A^\beta + A^\beta A^\alpha = 2\delta^{\alpha\beta} I,$$
so that $(A^\alpha)^2 = I$, with I the identity, where the As were represented by matrices. He then used the trace theorems that are employed in particle physics to simplify calculations involving products of Dirac matrices [11,12,13]. These caused the unwanted configurations to vanish, and he identified those that remained with a Pfaffian. Its evaluation gives the number of dimer configurations.
When the lattice is completely covered with dimers, the calculation simplifies because there are no monomers. The fraction of sites covered by dimers is the “filling fraction”, , and this complete dimer coverage corresponds to .
The trace theorems evaluate products of the As. We will need four different As for each site in the lattice, and for all of them to be independent matrices, their dimension d will be quite large. Fortunately, all we need is the anticommutation law, because it defines their algebra. We will never have to actually write down any of these matrices.
We consider dimers adsorbed on a simple square lattice of sites, indicated by the red points in Figure 1. The links are the lines joining each pair of neighboring sites (red, dashed). The dual lattice, shown in Figure 2, is the lattice of blue points located at the midpoints of the links of the original red lattice.
Figure 1.
A simple square lattice with lattice points represented by red dots and the links represented by red dashed lines.
Figure 2.
A simple square lattice with lattice points represented by red dots and the links represented by red dashed lines. The blue dots label dual lattice points, and the links of the dual lattice are blue dashed lines.
Let us denote the lattice sites by Roman subscripts . The anticommuting objects A actually live on the links of the lattice, or equivalently on the sites of the dual lattice. Because they live on the links, one can think of them as "link fields" that anticommute. Let us use Greek superscripts to indicate the links or sites of the dual lattice, so that these anticommuting link fields can be written as . These link fields are regarded here as operators that act on some set of vectors in a vector space. The introduction of a representation by using a set of basis functions allows these fields to be represented by matrices of some suitable dimension d. The trace is then the sum of the diagonal elements , which is invariant under a unitary transformation.
The next step is to develop the trace theorems involving the link fields that are needed here. Because Kronecker deltas and identity matrices are so easy to sum out, it is easy to lose track of them. It is clearer to write the anticommutation law as
$$\{A^\alpha, A^\beta\} = 2 g^{\alpha\beta},$$
where $g^{\alpha\beta} = \delta^{\alpha\beta}$ is the Euclidean-space metric. When matrices of dimension d are used to represent the $A^\alpha$, this becomes $\{A^\alpha, A^\beta\} = 2 g^{\alpha\beta} I_d$, with $I_d$ the d-dimensional identity matrix. We will need to evaluate expressions involving $\mathrm{Tr}(A^\alpha A^\beta \cdots)$, where Tr means the trace over a product of matrices representing the $A^\alpha$.
A proof of the trace theorems of these matrices is given in Appendix A. A simpler way to look at the procedure is as follows. In the general product $A^{\alpha_1} A^{\alpha_2} \cdots A^{\alpha_n}$, use anticommutation to move As with the same index next to one another as pairs. This introduces a parity factor $(-1)^P$. Then, replace each matched pair with the identity, since the anticommutator gives $A^\alpha A^\alpha = I_d$. The trace is then $(-1)^P d$ if the product consists only of matched pairs. Otherwise, there will be at least one factor that has unmatched indices, and the trace of the entire product will vanish. The trace theorems for the $A^\alpha$ are as follows:
- 3.1 If the general product can be rearranged so that adjacent pairs of indices are the same, then $\mathrm{Tr}(A^{\alpha_1} \cdots A^{\alpha_n}) = (-1)^P d$, where P is the appropriate parity index for the rearrangement.
- 3.2 If the general product cannot be rearranged so that adjacent pairs of indices are the same, then $\mathrm{Tr}(A^{\alpha_1} \cdots A^{\alpha_n}) = 0$.
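Although the argument never requires explicit matrices, the two trace theorems can be checked on a small hand-built representation. The sketch below is an illustration only: it assumes four anticommuting matrices of dimension four built from Kronecker products of Pauli matrices, which is one possible representation and not necessarily the one intended in the text. All names are hypothetical.

```python
def kron(a, b):
    """Kronecker product of two square matrices stored as nested lists."""
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]

# Four mutually anticommuting matrices, each squaring to the identity.
A = [kron(SX, I2), kron(SY, I2), kron(SZ, SX), kron(SZ, SY)]
d = len(A[0])

def anticommutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] + ba[i][j] for j in range(d)] for i in range(d)]

def scaled_identity(c):
    return [[c if i == j else 0 for j in range(d)] for i in range(d)]

def trace_of_product(indices):
    """Trace of A[i1] A[i2] ... for a list of 0-based indices."""
    prod = scaled_identity(1)
    for idx in indices:
        prod = matmul(prod, A[idx])
    return trace(prod)

# Check the algebra {A_p, A_q} = 2 delta_pq I.
algebra_ok = all(
    anticommutator(A[p], A[q]) == scaled_identity(2 if p == q else 0)
    for p in range(4) for q in range(4)
)
print(algebra_ok)
print(trace_of_product([0, 0, 1, 1]))  # pairs already adjacent
print(trace_of_product([0, 1, 0, 1]))  # one interchange needed
print(trace_of_product([0, 1, 2, 2]))  # unmatched indices remain
```

In this representation a product that pairs up after an even number of interchanges has trace d, one interchange flips the sign, and a product with unmatched indices is traceless.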
4. The Dimer Model
One of the simplest models of a system containing diatomic molecules is that of a lattice gas of rigid dimers, each of which fills two nearest-neighbor sites of a space lattice of sites. Fisher was only able to completely evaluate his result for a lattice completely filled with dimers (a filling fraction of one), leaving no vacancies or monomers, as shown in Figure 3. Furthermore, because we will eventually represent the lattice with its dimer arrangements by matrices, the y-axis is inverted so that the first site is in the upper left corner, like the usual initial matrix element.
Figure 3.
Complete filling of dimers (green) on the simple square lattice (red dots and dashes).
We suppose that the dimers do not interact with one another apart from the geometric constraint that only one dimer can be attached to a given site. The dimers are bound to the lattice sites, and we let the total binding energy of a dimer be , which is twice the binding energy to each site because a dimer has two ends. We follow Fisher [7] in allowing the binding energy of dimers aligned in the two orthogonal directions to be different, and call them and . Then, the partition function becomes
Here, is the activity of an x-oriented dimer, and we also let be the activity of a y-oriented dimer, following Fisher, who allowed the dimer activities in the two directions to differ. The trace over quantum states adds together the contributions of states with the same numbers and of dimers oriented in the two directions. If we define the number of such states to be , an equivalent expression for the partition function is
which is actually valid for any filling fraction .
We now invent, following Fisher, a snake-like path through the lattice that allows us to use a one-dimensional numbering scheme to label the lattice sites. The numbering begins at the upper left corner and weaves back and forth along the x-direction, as shown in Figure 4. We note that this one-dimensional numbering scheme, serpentine numbering, also works for a three-dimensional lattice if one draws it on a long sheet of paper and then folds the paper with accordion-like pleats. For a lattice with rows of sites (red) arranged one below another in the vertical y-direction and columns parallel to one another in the horizontal x-direction, we have a lattice of sites. The virtue of this numbering is that it is easy to determine the signs resulting from the interchanges required to move identical As adjacent to one another. For the x-links it is obvious, because the two ends of a dimer are on adjacent sites, so no interchange is needed, and the sign is plus. For a y-link, consider the following example: the link between sites 27 and 40 in Figure 4. The y-links (vertical red dashed lines) look like the ties between two rails on a railroad track, and these ties have two ends. Thus, moving from position 27 to 39 (adjacent to 40) requires six interchanges to the right along row three, and then another six interchanges back to the left along row four, for twelve in total. Because a tie has two ends, you will always obtain an even number of interchanges, and this will crucially make all terms positive in our construction of the partition function below.
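The serpentine numbering and the parity of the interchanges can be sketched in a few lines. This is an illustration under stated assumptions: rows and columns are counted from one, odd rows run left to right, and the eleven-column width is inferred from the example of sites 27 and 40 rather than quoted from the text. The function names are hypothetical.

```python
def serpentine_index(row, col, n_cols):
    """1-based serpentine site number for 1-based (row, col)."""
    if row % 2 == 1:                 # odd rows run left to right
        return (row - 1) * n_cols + col
    return row * n_cols - col + 1    # even rows run right to left

def interchanges_for_y_link(row, col, n_cols):
    """Interchanges needed to bring the two ends of a y-link together."""
    j = serpentine_index(row, col, n_cols)
    k = serpentine_index(row + 1, col, n_cols)
    return k - j - 1

# The example from the text: the link between sites 27 and 40.
n_cols = 11
print(serpentine_index(3, 5, n_cols), serpentine_index(4, 5, n_cols))
print(interchanges_for_y_link(3, 5, n_cols))

# Every y-link on a hypothetical 6 x 11 lattice needs an even number.
all_even = all(
    interchanges_for_y_link(row, col, n_cols) % 2 == 0
    for row in range(1, 6) for col in range(1, n_cols + 1)
)
print(all_even)
```

The two ends of a vertical link sit symmetrically about the turn of the snake, which is why the count of intervening sites is always even.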
Figure 4.
Serpentine numbering of the lattice sites in green. The row and column numbers of the lattice are also shown. The red dots and dashes outline the direct lattice, and the blue dots and dashes the dual lattice.
The dual lattice has twice as many sites as the original lattice because there are two links per site in the square lattice. Consequently, if the original lattice has sites, the dual lattice has links or points. However, there are only distinct adjacent pairs of points in the original lattice, and means that all of these distinct adjacent pairs of points are occupied by dimers. This makes it generally useful to consider as even, because if it is odd, there is at least one vacancy.
An alternative way of labeling the links is to make the superscript of the link field a pair of site indices where j and k label the two ends of the link, where . This leads to the link-field labels of Figure 5.
Figure 5.
Alternative way of numbering of links of the lattice by using pairs of site indices, shown in blue, on top of Figure 4.
The link fields can be arranged as the upper right triangle of an matrix with zeros down the diagonal, suggestive of an antisymmetric matrix. For a smaller example with three rows and four columns, but otherwise similar to the example above, the upper right triangle of entries is shown in Equation (29) and pictorially in Figure 6.
The link fields , , and refer to x-directed links, and the link fields , , , and refer to y-directed links. There are no near-neighbor links corresponding to the elements for which zero is entered. In other words, the entries sloping diagonally upward in the direction lower left corner to upper right corner are y-directed links. The remaining ones sloping downward from the upper left to lower right corners are x-directed links in the pattern shown in the matrix M in Equation (30).
Figure 6.
A smaller example with three rows and four columns, but otherwise similar to Figure 5, where the serpentine numbering is shown in green and the alternative numbering in blue. As before, the red dots and dashes outline the direct lattice, and the blue dots and dashes the dual lattice.
5. Inclusion of Constraints on the Partition Function by Use of a Child’s Toy
Suppose that the lattice has square holes with sides parallel to x and y at each lattice site. Into each of these holes, we insert a toy, like a child’s top with a square shaft that just fits the hole, as shown in Figure 7. The disk that provides most of the top’s angular momentum has a green line painted on it normal to one of the faces of the shaft. This means that the green line can be oriented four ways—north, south, east, and west.
Figure 7.
Square holes in a simple square lattice with a toy with a square shaft that can fit into a hole in one of four directions.
The orientations of the disks are random, so they are distributed like a lattice gas of four colors, as shown in Figure 8. Sometimes, the green lines of neighboring disks point toward one another, and for any adjacent pair, the probability of this occurring is 1/16, since each of the two disks independently takes one of four orientations.
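The chance that two neighboring toys point at each other can be counted directly. The sketch below assumes that the four orientations are equally likely and independent on the two sites; the direction labels and the function name are hypothetical.

```python
from itertools import product

# Unit steps for the four orientations; y grows downward, as in the text.
STEP = {"N": (0, -1), "S": (0, 1), "E": (1, 0), "W": (-1, 0)}

def facing_fraction(offset):
    """Count orientation pairs in which two neighbors point at each other.

    offset is the (dx, dy) step from the first site to the second.
    Returns (number of facing pairs, total number of pairs).
    """
    opposite = (-offset[0], -offset[1])
    pairs = list(product(STEP, repeat=2))
    hits = sum(1 for a, b in pairs
               if STEP[a] == offset and STEP[b] == opposite)
    return hits, len(pairs)

print(facing_fraction((1, 0)))  # horizontal neighbors
print(facing_fraction((0, 1)))  # vertical neighbors
```

Only one of the sixteen equally likely orientation pairs joins two half-dimers into a dimer on a given link, whatever the direction of the link.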
Figure 8.
Toys randomly arranged on the simple square lattice, shown as the red square holes and connecting dashes. The half-dimers are represented by the green lines for this particular member of the ensemble, with the red circles the outlines of the toys in Figure 7. Not all the half-dimers point toward each other in pairs, and so taking the trace over link fields will cause this distribution to be removed from the ensemble.
Now, suppose that the green lines represent “half-dimers”, and when green lines point toward one another on adjacent sites, those two half-dimers join up to form a complete dimer between those two sites. In Figure 9, the figure on the left shows the half-dimers pointing toward one another on adjacent sites, and the figure on the right shows the corresponding dimer arrangement.
Figure 9.
Half-dimers arranged on the simple square lattice. In the left figure, the half-dimers all point toward one another in pairs, so that taking the trace over link fields will retain this as a member of the ensemble, unlike the random arrangement of Figure 8. This corresponds to the completely filled dimer arrangement on the right, which is the same as in Figure 3.
We construct the partition function in a way that allows the constraints of no double occupancy and a completely filled dimer arrangement to be imposed. Associate with each lattice site j a function
where the sum is over the nearest neighbors of site j, as emphasized by the subscript on the sum. The quantity is the activity of a dimer on the link , and is either if the link is in the x-direction, or if the link is in the y-direction. The factor is the link field associated with the link . The square root of the activity is taken because it is the activity of a half-dimer, the object represented by the green line on the disk of the toy. It takes two of these factors, , to give the activity of the dimer on the link. One factor can be thought of as emanating from the site j, and the other from the site ℓ.
The partition function is constructed as a product of these factors , one rooted on each lattice site j. To see how this will work, suppose that we have a product of only two lattice sites called j and k. The product is
If sites j and k are not nearest neighbors, then none of the link fields are common to both factors, and taking the trace simply gives zero, so the term vanishes. None of the sixteen terms in the last line will then contribute to the partition function.
The situation is different when j and k are nearest neighbors. Then, there will be one link field in common. Suppose, for example, that it is on an x-directed link, arising from
because j and k are nearest neighbors in the x-direction with k to the right of j. Since the order of subscripts on a link does not matter, . Thus, this term survives when the trace is taken.
In this way, the partition function,
is an expression in which the only surviving terms are those which give dimer arrangements, as shown, for example, in Figure 9. The numbering is one-dimensional, , because we use one-dimensional serpentine ordering.
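The way the trace discards unwanted arrangements can be imitated on a very small lattice without writing down any matrices. The sketch below is a simplified illustration, not the construction of the text: it expands the plain site-ordered product of the half-dimer factors, applies the pairing rule of the trace theorems to each term, and omits the extra factors that the text later inserts between odd and even sites. The lattice sizes, activities, and function names are hypothetical.

```python
from itertools import product
from math import isclose, sqrt

def serpentine_index(row, col, n_cols):
    """1-based serpentine site number for 1-based (row, col)."""
    if row % 2 == 1:
        return (row - 1) * n_cols + col
    return row * n_cols - col + 1

def site_terms(n_rows, n_cols, x, y):
    """Map each site j to its (link, activity) terms, link = (j, k), j < k."""
    terms = {}
    for row in range(1, n_rows + 1):
        for col in range(1, n_cols + 1):
            j = serpentine_index(row, col, n_cols)
            for r2, c2, activity in ((row, col + 1, x), (row + 1, col, y)):
                if r2 <= n_rows and c2 <= n_cols:
                    k = serpentine_index(r2, c2, n_cols)
                    link = (min(j, k), max(j, k))
                    terms.setdefault(j, []).append((link, activity))
                    terms.setdefault(k, []).append((link, activity))
    return terms

def normalized_trace(links):
    """Trace of a product of link fields divided by d: +1, -1, or 0."""
    if any(links.count(link) % 2 for link in set(links)):
        return 0  # an unmatched link field makes the trace vanish
    # Each interchange of two different link fields contributes a factor -1.
    inversions = sum(
        1
        for i in range(len(links))
        for j in range(i + 1, len(links))
        if links[i] > links[j]
    )
    return -1 if inversions % 2 else 1

def trace_partition_function(n_rows, n_cols, x, y):
    """Sum over all half-dimer choices, one per site, in serpentine order."""
    terms = site_terms(n_rows, n_cols, x, y)
    total = 0.0
    for choice in product(*(terms[j] for j in sorted(terms))):
        sign = normalized_trace([link for link, _ in choice])
        if sign:
            weight = 1.0
            for _, activity in choice:
                weight *= sqrt(activity)  # half-dimer activity
            total += sign * weight
    return total

# Hypothetical lattice of 2 rows and 3 columns with activities x = 2, y = 3.
print(trace_partition_function(2, 3, 2.0, 3.0))
```

For two rows and three columns the three complete coverings have weights y^3, x^2 y, and x^2 y, so the call above should reproduce their sum, with every surviving term entering with a plus sign.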
6. Introduction of the Antisymmetric Matrix
A more concise notation is useful. Furthermore, we want the contributions of the even-numbered and the odd-numbered sites to the partition function to be written slightly differently. We begin by writing
so that , which is, of course, either or , and where the subscript “" on the sum is omitted, although the restriction of the sum in to nearest neighbors of j will remain understood.
Now, consider the product,
which appears in the partition function. The number of sites is chosen to be even, and we insert between each odd–even pair, giving
Then, for odd-numbered sites, we have
and for even-numbered sites, we have
These relations can be combined as
and the original product becomes
Finally, let us set
The quantities are operators formed as linear combinations of the link fields. The link fields satisfy the simple anticommutation law
or, for d-dimensional matrix representations thereof,
We now show that the operators also anticommute, and calculate the values of their anticommutators. We have
Writing this in terms of the anticommutators of the link fields gives
where gives unity if and denote the same link field and zero otherwise. They only denote the same link field if the lattice sites j and k are nearest neighbors, and then only if and , that is, . The anticommutator becomes
which reduces to
Finally, we write this in terms of the notation of Section 5. This gives
if j and k are nearest neighbors, and zero otherwise.
Furthermore, , the square root of the activity of a dimer on the link , which does not depend on the order of the indices. Thus, the anticommutator of the operators and can be written as
the activity x or y of the dimer occupying that link.
The expression giving the partition function is
For the lattice to be completely filled with dimers (a filling factor of one), the number of lattice sites must be even, to avoid a single unavoidable but unwanted leftover vacant site. As a reminder of this, let us temporarily set , and write the partition function as
To make further progress, we need to find a way to evaluate the product that appears in the partition function
To do this, we successively use the anticommutator of the operators from Equation (50).
Let us write the anticommutator as
and use this expression to rearrange the product as
The second term on the right has displaced one step to the right, and can be written with the aid of the anticommutation rule as
We continue to move all the way to the right, requiring interchanges. We then take the trace of both sides, and use the cyclic property of the trace to move back to the beginning, thereby reproducing the trace of the left side accompanied by . In the course of performing this, we “contract” and for , producing terms containing a factor of , each accompanied by . This result is
The factors of two can cancel. Furthermore, for is the top row of a Pfaffian of order .
Appendix B contains a brief description of Pfaffians and an example of how a simple Pfaffian is evaluated. On iterating, each step produces a single additional factor of in every one of the terms resulting from the trace on the right. The end result is that the trace of Equation (57) produces the Pfaffian
If j and k are extended to cover the entire lattice, the complete entries in the Pfaffian and the upper right triangle of the corresponding antisymmetric matrix will be filled in. In the present context, the only nonvanishing activities are on links with j and k nearest-neighbor sites, and all the remaining entries are zero. This result is found in Fisher’s paper [14] and references therein, and is written in detail in J. C. Baker’s MS thesis [15].
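The statement that the Pfaffian counts the dimer arrangements can be tested on a small lattice. The sketch below assumes that the upper right triangle holds the activity of each nearest-neighbor link with a plus sign under serpentine numbering, evaluates the Pfaffian by expansion along the first row, and compares it with a brute-force count. The function names are hypothetical, and the recursive expansion is only practical for small lattices.

```python
def serpentine_index(row, col, n_cols):
    """1-based serpentine site number for 1-based (row, col)."""
    if row % 2 == 1:
        return (row - 1) * n_cols + col
    return row * n_cols - col + 1

def dimer_matrix(n_rows, n_cols, x, y):
    """Antisymmetric matrix with link activities in the upper triangle."""
    n = n_rows * n_cols
    m = [[0] * n for _ in range(n)]
    for row in range(1, n_rows + 1):
        for col in range(1, n_cols + 1):
            j = serpentine_index(row, col, n_cols)
            for r2, c2, w in ((row, col + 1, x), (row + 1, col, y)):
                if r2 <= n_rows and c2 <= n_cols:
                    k = serpentine_index(r2, c2, n_cols)
                    lo, hi = min(j, k) - 1, max(j, k) - 1
                    m[lo][hi] = w    # upper right triangle: +activity
                    m[hi][lo] = -w   # antisymmetry
    return m

def pfaffian(m):
    """Pfaffian of an antisymmetric matrix by expansion along the first row."""
    n = len(m)
    if n == 0:
        return 1
    if n % 2:
        return 0
    total = 0
    for j in range(1, n):
        if m[0][j] == 0:
            continue
        keep = [k for k in range(1, n) if k != j]
        minor = [[m[a][b] for b in keep] for a in keep]
        sign = -1 if (j - 1) % 2 else 1
        total += sign * m[0][j] * pfaffian(minor)
    return total

def count_coverings(n_rows, n_cols, filled=frozenset()):
    """Brute-force count of complete dimer coverings of the lattice."""
    free = [(r, c) for r in range(n_rows) for c in range(n_cols)
            if (r, c) not in filled]
    if not free:
        return 1
    r, c = free[0]
    total = 0
    for r2, c2 in ((r, c + 1), (r + 1, c)):
        if r2 < n_rows and c2 < n_cols and (r2, c2) not in filled:
            total += count_coverings(n_rows, n_cols,
                                     filled | {(r, c), (r2, c2)})
    return total

# Three rows and four columns, unit activities, so each covering counts once.
print(pfaffian(dimer_matrix(3, 4, 1, 1)), count_coverings(3, 4))
```

With unit activities both numbers are the count of complete coverings of the twelve-site lattice, which is 11.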
Earlier, an example with three rows and four columns of sites was discussed, and is shown in Figure 6. The antisymmetric matrix associated with the Pfaffian of this lattice example, written in block form, is
and that block form consists of the nine blocks, denoted as
as shown in Figure 10.
Figure 10.
The block matrix form of the antisymmetric matrix. The red lines outline the blocks, and the light blue X, Y and labels in the background signify the X, Y and blocks of the matrix.
This is also a block-tridiagonal matrix, where the blocks X, Y, and 0 are $4 \times 4$ blocks. These blocks are
and
while “0” stands for a block of sixteen zeros. In what follows, we will be using the fact that matrices can be multiplied block by block, and this is shown in Appendix C.
Consider the upper right triangle that forms the corresponding Pfaffian. The first row has x-oriented links between the sites, and so the block X has entries x in the direction of the diagonal. For the example, there are four columns, so and there are three entries x in the block X. They are offset from the diagonal by one, so the dimensions of the block X shown are . In general, the blocks X are .
Between each row and the one below it, there are vertical y links, and so there are entries of y up the diagonal line from the lower left to upper right corner. In the example, , and there are four of these so the block Y is . In general, the blocks Y are of dimension , just like the blocks X, as they must be to fill the original matrix of dimension , because . There are blocks with entries to account for the entries in the antisymmetric matrix M.
In general, then, when the lattice of sites has rows and columns, the dimension of each individual block is . The matrix of blocks, on the other hand, has dimension . The total number of matrix elements is then , which is the square of the number of sites .
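The block structure described above can be verified by slicing the full matrix. The sketch below assumes serpentine numbering, positive activities in the upper right triangle, and the three-row, four-column example; the numerical activities and all function names are hypothetical.

```python
def serpentine_index(row, col, n_cols):
    """1-based serpentine site number for 1-based (row, col)."""
    if row % 2 == 1:
        return (row - 1) * n_cols + col
    return row * n_cols - col + 1

def dimer_matrix(n_rows, n_cols, x, y):
    """Antisymmetric matrix with link activities in the upper triangle."""
    n = n_rows * n_cols
    m = [[0] * n for _ in range(n)]
    for row in range(1, n_rows + 1):
        for col in range(1, n_cols + 1):
            j = serpentine_index(row, col, n_cols)
            for r2, c2, w in ((row, col + 1, x), (row + 1, col, y)):
                if r2 <= n_rows and c2 <= n_cols:
                    k = serpentine_index(r2, c2, n_cols)
                    lo, hi = min(j, k) - 1, max(j, k) - 1
                    m[lo][hi], m[hi][lo] = w, -w
    return m

def block(m, p, q, size):
    """Return block (p, q) of m, with 0-based block indices."""
    return [row[q * size:(q + 1) * size] for row in m[p * size:(p + 1) * size]]

n_rows, n_cols, x, y = 3, 4, 2, 3   # hypothetical activities x = 2, y = 3
M = dimer_matrix(n_rows, n_cols, x, y)

# Expected blocks: X has x just above and -x just below the diagonal,
# Y has y along the line from the lower left to the upper right corner.
X = [[x if c == r + 1 else -x if c == r - 1 else 0 for c in range(n_cols)]
     for r in range(n_cols)]
Y = [[y if r + c == n_cols - 1 else 0 for c in range(n_cols)]
     for r in range(n_cols)]
minus_Y = [[-v for v in row] for row in Y]
ZERO = [[0] * n_cols for _ in range(n_cols)]

def expected_block(p, q):
    if p == q:
        return X
    if q == p + 1:
        return Y
    if q == p - 1:
        return minus_Y
    return ZERO

structure_ok = all(
    block(M, p, q, n_cols) == expected_block(p, q)
    for p in range(n_rows) for q in range(n_rows)
)
print(structure_ok, len(M) * len(M[0]))
```

The check confirms nine blocks of sixteen entries each, 144 entries in all, with X on the block diagonal and Y or its negative on the neighboring block diagonals.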
7. Eigenvalues and Eigenvectors of the Antisymmetric Matrix
The scheme that makes the calculation simplest is to diagonalize the X block and use those eigenvectors as the basis to transform the Y block. This will lead to a cruciform version of the big matrix, which will be convenient for the calculations. Therefore, first we investigate the eigenvalues and eigenvectors of the block X. The eigenvalue equation for X is given by
where is an eigenvalue and v a column vector with entries . The set of linear equations represented by this equation has the form
We can make all the linear equations look the same if we imagine padding the list with and and then apply the boundary condition . Then, we have
The generic equation is the difference equation
which can be rewritten as
Regarding a as a lattice spacing, we recognize this equation as the discretized version of the first-order differential equation
with exponential solutions that are growing or decaying if the eigenvalue is real. If is imaginary, the solutions oscillate.
A solution that satisfies the boundary conditions is given by
This gives for k even but for k odd.
The boundary condition at is
which is correct. At we have
We also require this to vanish, which occurs if the quantity in brackets vanishes, that is, if
Taking logarithms of both sides gives
where ω is a winding number that gives the number of times that the point circles the singularity that terminates the branch cut of the logarithm of . As a result, the angle is quantized, that is, it acquires only discrete values given by
so that only takes the values
that is, the discrete values . The values and are the boundary points where the solution vanishes that were added by padding the ends of the list. The components of the eigenvector are
with A the normalization constant determined from . Figure 11 shows the discrete values of the angle for the example with eleven columns.
Figure 11.
Discrete values of the angle for the example with eleven columns.
The previous derivation of the eigenvalue is unchanged. Insert one of the terms of the generic solution —either one will do—into the difference equation
Substituting yields
and as long as the eigenvalue satisfies this equation, is a solution to the difference equation. When we include the boundary-condition requirements, the angle is restricted to the discrete values , and the eigenvalues of X are
Figure 12 shows the values of the eigenvalue for the example with eleven columns.
Figure 12.
Eigenvalue for the example with eleven columns.
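As a numerical sanity check on the spectrum of X, the following minimal sketch compares the eigenvalues computed by NumPy with the closed form 2ix cos(ωπ/(n+1)), ω = 1, …, n. It assumes that X is the n × n antisymmetric tridiagonal block with +x on the superdiagonal and −x on the subdiagonal; the helper name `x_block` and the test value x = 0.7 are illustrative choices, not taken from the text. The labeling of ω may differ from the convention used above, so only the set of eigenvalues is compared.

```python
import numpy as np

def x_block(n, x):
    """n-by-n antisymmetric tridiagonal block: +x above the diagonal, -x below (assumed form)."""
    off = np.ones(n - 1)
    return x * (np.diag(off, 1) - np.diag(off, -1))

n, x = 11, 0.7  # eleven columns as in the figures; x = 0.7 is an arbitrary test value
eigs = np.linalg.eigvals(x_block(n, x))
omega = np.arange(1, n + 1)
closed_form = 2 * x * np.cos(omega * np.pi / (n + 1))  # imaginary parts of 2ix cos(...)

# The spectrum should be purely imaginary, so compare the sorted imaginary parts.
print(np.allclose(eigs.real, 0.0, atol=1e-10))
print(np.allclose(np.sort(eigs.imag), np.sort(closed_form), atol=1e-10))
```

For odd n the value ω = (n+1)/2 gives a zero eigenvalue, which is why the comparison uses an absolute tolerance.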
The expression for the eigenfunction,
can be rewritten by inserting the explicit form of the angle . This gives
giving
Let us now find the normalization constant using . The components of the eigenvector are with . Substituting the expression for gives
We then have
Now, because is always even, giving
where . The normalization constant is then given by the finite sum
which gives
Then, since a shift in phase of gives another factor of ,
The value of the sum over cosines is , and the normalization factor is
Taking A to be real and positive, the normalized eigenvectors are
with and eigenvalue . The subscript k distinguishes the various components of a given eigenvector . The values and are the boundary points where vanishes. Figure 13 shows the first four eigenvectors and the last eigenvector for the example with eleven columns, for various winding numbers ω. The points show the values, while the lines are merely to guide the eye.
Figure 13.
First four and the last eigenvectors for the example with eleven columns.
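The normalized eigenvectors can be checked in the same spirit. The sketch below assumes the components take the form sqrt(2/(n+1)) i^k sin(kωπ/(n+1)), which agrees with the construction above up to an overall phase that affects neither normalization nor diagonalization. It verifies that the matrix U built from these columns is unitary and brings X to diagonal form; the variable names are illustrative.

```python
import numpy as np

n, x = 11, 0.7  # arbitrary test values
off = np.ones(n - 1)
X = x * (np.diag(off, 1) - np.diag(off, -1))   # assumed form of the block X

k = np.arange(1, n + 1)[:, None]       # component index (rows of U)
omega = np.arange(1, n + 1)[None, :]   # eigenvector label (columns of U)
theta = omega * np.pi / (n + 1)

# Assumed components: sqrt(2/(n+1)) * i^k * sin(k * omega * pi / (n+1))
U = np.sqrt(2.0 / (n + 1)) * (1j ** k) * np.sin(k * theta)
lam = 2j * x * np.cos(theta.ravel())   # expected eigenvalues, one per column of U

unitary = np.allclose(U.conj().T @ U, np.eye(n))
diagonalized = np.allclose(U.conj().T @ X @ U, np.diag(lam))
print(unitary, diagonalized)
```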
The previous example of a lattice of sites that has three rows and four columns of sites was shown in Figure 6. This lattice has sites, so the dimensions of the antisymmetric matrix M that gives the square of the partition function through
is , that is, . This matrix M provides an example of the block-matrix form of the antisymmetric matrix M having the Pfaffian array as its upper right triangle, and is given in terms of the dimer activities x and y by the matrix shown in Figure 10.
This particular matrix has 12 rows and 12 columns, and consists of blocks called X, Y, and zero. The matrix is divided into blocks by the red lines in Figure 10, and, in general, such a matrix has the form
For clarity, the zeros are not shown. This is a tridiagonal block matrix, and this form always arises when using serpentine numbering of the lattice sites. The size of each block is the number of sites along a single row of the lattice, as is most easily seen by examining the blocks Y. The number of blocks is the square of the number of rows in the lattice of sites.
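The block structure can be made concrete for the lattice with three rows and four columns. The sketch below builds the antisymmetric matrix directly from the lattice geometry with serpentine numbering and compares its determinant with the square of a brute-force sum over complete dimer coverings. It assumes that every bond enters the upper right triangle with a positive sign (x for horizontal bonds, y for vertical bonds); the function names are illustrative, and the brute-force enumeration is only a check, not the method of the text.

```python
import numpy as np

def serpentine_matrix(m, n, x, y):
    """Antisymmetric matrix for m rows and n columns of sites, serpentine numbering."""
    def site(r, c):
        # Even rows are numbered left to right, odd rows right to left.
        return r * n + (c if r % 2 == 0 else n - 1 - c)
    M = np.zeros((m * n, m * n))
    for r in range(m):
        for c in range(n):
            if c + 1 < n:   # horizontal bond, activity x
                i, j = sorted((site(r, c), site(r, c + 1)))
                M[i, j], M[j, i] = x, -x
            if r + 1 < m:   # vertical bond, activity y
                i, j = site(r, c), site(r + 1, c)
                M[i, j], M[j, i] = y, -y
    return M

def brute_force_Z(m, n, x, y):
    """Sum of x^(horizontal) * y^(vertical) over all complete dimer coverings."""
    def fill(covered):
        free = [s for s in range(m * n) if s not in covered]  # row-major site labels
        if not free:
            return 1.0
        s = free[0]
        r, c = divmod(s, n)
        total = 0.0
        if c + 1 < n and (s + 1) not in covered:
            total += x * fill(covered | {s, s + 1})
        if r + 1 < m:
            total += y * fill(covered | {s, s + n})
        return total
    return fill(frozenset())

m, n, x, y = 3, 4, 0.7, 1.3   # lattice of Figure 6; activities are arbitrary test values
M = serpentine_matrix(m, n, x, y)
Z = brute_force_Z(m, n, x, y)
print(np.isclose(np.linalg.det(M), Z ** 2))
print(np.allclose(M[:n, n:2 * n], y * np.fliplr(np.eye(n))))  # Y block is antidiagonal
```

With x = y = 1 the brute-force sum counts the 11 complete coverings of the 3 × 4 lattice, and the determinant is 121.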
Earlier, we described the diagonalization procedure for a matrix with the form of the block X. That same procedure can be used to find eigenvectors and eigenvalues of the block form of M. One can manipulate the blocks just like matrix elements, because block matrices can be multiplied block by block. The notation, though, can be confusing if the matrix, which is M, is denoted by M both when it is written as matrix elements and when it is written in blocks. Consequently, let us call the block form B, writing
when it is in block form, and
when it is written in terms of its matrix elements .
We will need here our earlier observation that, in general, when the lattice of sites has rows and columns, the dimension of each individual block is . The matrix of blocks, on the other hand, has the dimension . The total number of matrix elements is then , which is the square of the number of sites .
The eigenvectors of the block matrix are the vectors u with components that satisfy
The components will actually be vectors, so that a given may have components as in the eigenvectors of the block X given in Equation (90). That, however, does not concern us here. Eventually, though, the transformed matrix M will be cruciform.
Writing the eigenvector in terms of its components, the eigenvalue equation is
The eigenvectors can be found from the rows, which are given by
with boundary conditions
just as in the example of the diagonalization of the X matrix earlier in this section. Since the block matrix B is an array of blocks, the eigenvector has components.
This difference equation again has a solution of the form or, in order to keep the angle positive, a linear combination consisting of and . We write these eigenvectors as
and apply the boundary condition to obtain
and so , and then
Either of the two terms appearing here could be used to find the eigenvalue by inserting it into the difference equation. We choose , and have
which, after canceling , gives
so that
is an matrix that represents the “eigenvalue” of a block, irrespective of whether or not X and Y are diagonal or even simultaneously diagonalizable.
The second boundary condition quantizes the angle , just as it quantized the angle for the matrix X. It gives
so that
Taking the logarithm of both sides shows that
where W is another winding number arising from crossing the branch cut in the logarithm. This is what provides the complete set of eigenvectors, characterized by the various values that the angle can take on as W assumes the values . Then,
are the various values of . The values and are the boundary points where the solution vanishes that were added by padding the ends of the list. The components of the eigenvector are
with A the normalization constant determined from .
The expression for the eigenfunction,
can be rewritten by inserting the explicit form of the angle . This produces
giving
The normalization constant A was found earlier in this section, in the diagonalization of the X matrix, by requiring that . This gives
and so the components of the normalized eigenvector are
and the associated eigenvalue is
These are the block eigenvalues of the block matrix B. Writing these down the diagonal gives B in diagonal form,
Each of these eigenvalues is actually an matrix forming a block of the now-diagonal block matrix B, and if written out in full represents the original matrix M, although that matrix is not yet cruciform. We now set out to make it so.
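The block diagonalization can be verified numerically without assuming that X and Y commute. The sketch below assumes that B has X on the block diagonal, Y on the block superdiagonal and −Y on the block subdiagonal, and that the block eigenvalues can be written as X + 2i cos(Wπ/(m+1)) Y for W = 1, …, m; the angle convention may differ from the one used above. It then compares the determinant of B with the product of the determinants of the m blocks.

```python
import numpy as np

def tridiag_antisym(size):
    """Antisymmetric tridiagonal pattern: +1 above the diagonal, -1 below."""
    off = np.ones(size - 1)
    return np.diag(off, 1) - np.diag(off, -1)

m, n, x, y = 3, 4, 0.7, 1.3            # three rows, four columns; arbitrary activities
X = x * tridiag_antisym(n)             # assumed form of the block X
Y = y * np.fliplr(np.eye(n))           # assumed form of the block Y (antidiagonal)

# Block-tridiagonal matrix: X on the diagonal, Y above it, -Y below it.
B = np.kron(np.eye(m), X) + np.kron(tridiag_antisym(m), Y)

W = np.arange(1, m + 1)
block_eigs = [X + 2j * np.cos(w * np.pi / (m + 1)) * Y for w in W]
det_from_blocks = np.prod([np.linalg.det(b) for b in block_eigs])
det_direct = np.linalg.det(B)
print(np.isclose(det_from_blocks.real, det_direct), abs(det_from_blocks.imag) < 1e-9)
```

The Kronecker-product form makes explicit why the block manipulation works: only the m × m pattern of blocks is diagonalized, while X and Y are carried along untouched.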
The block X is not yet in diagonal form, although we know its eigenvalues, which are with . Thus, when diagonalized, we have
The question is, what happens to Y when we transform both X and Y using the unitary transformation U that diagonalizes X? The short answer is that the vertical dimer activity y shows up along the antidiagonal, which is the “diagonal” that runs from the lower left to upper right corners of a matrix. This produces the cruciform matrix
where . We then have a block matrix consisting of these cruciform blocks, and its determinant gives the square of the partition function.
This cruciform shape results from the transformation U that diagonalizes X, which is constructed from the normalized eigenvectors of X with the components of each vector forming the columns of U, giving
The block Y, written explicitly in terms of the activity, has entries y along the antidiagonal and can be written as y times the matrix with entries of 1 on the antidiagonal. To see what effect such a matrix has on another matrix, a two-dimensional example should be sufficient. We have
It seems that the effect of a matrix with unity along the antidiagonal (the anti-identity?) is to reverse the rows of the matrix it multiplies. This result implies that the matrix element of is
The general matrix element then consists of the product of the component and with , or in terms of the components of the eigenvector, and . The matrix elements of are
which is derived in detail in Appendix D.
These are the terms that appear on the antidiagonal of Y after transforming to the representation in which X is diagonal. All matrix elements off the antidiagonal of Y are zero. This matrix Y can finally be written as
The transformed matrix becomes
This matrix is a block of the matrix B, which, because it has the form of a cross, is called a cruciform matrix.
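Both statements above, that the anti-identity reverses the rows of the matrix it multiplies and that Y stays on the antidiagonal in the basis that diagonalizes X, can be checked with a short sketch. It assumes the eigenvector components sqrt(2/(n+1)) i^k sin(kωπ/(n+1)) as the columns of U, which fixes each column only up to a phase; the phases of the antidiagonal entries therefore depend on this choice, and only their modulus is compared with y.

```python
import numpy as np

n, y = 4, 1.3                           # four columns; y is an arbitrary test value
J = np.fliplr(np.eye(n))                # ones on the antidiagonal
A = np.arange(n * n).reshape(n, n)
rows_reversed = np.array_equal(J @ A, A[::-1])   # left multiplication reverses the rows

k = np.arange(1, n + 1)[:, None]
omega = np.arange(1, n + 1)[None, :]
U = np.sqrt(2.0 / (n + 1)) * (1j ** k) * np.sin(k * omega * np.pi / (n + 1))

Y_t = U.conj().T @ (y * J) @ U          # Y in the basis where X is diagonal
anti = np.fliplr(np.eye(n, dtype=bool)) # mask selecting the antidiagonal
off_antidiagonal_zero = np.allclose(Y_t[~anti], 0.0)
antidiagonal_modulus = np.abs(Y_t[anti])
print(rows_reversed, off_antidiagonal_zero, np.allclose(antidiagonal_modulus, y))
```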
The partition function is given by the Pfaffian of this matrix B, and since the square of a Pfaffian is a determinant, it is more convenient to evaluate the square of the partition function, which is the determinant of B. The block eigenvalues form the diagonal elements of B, while all other blocks are zero. The determinant of a diagonalized block matrix like this is the product of the determinants of the individual blocks, and the square of the partition function is the determinant.
The determinant of a cruciform matrix is easily determined by expanding by the first row of the matrix and then in each minor expanding by the last row. Then each new minor is expanded by the first row and each of the resulting minors by their bottom rows. This repetition creates factors of the type . The brackets indicate the floor function, which gives the greatest integer less than or equal to x. The square of the partition function is evaluated in Appendix E, with the result
The square root of this, the partition function itself, is also evaluated in Appendix E and is given by
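The row-by-row expansion of a cruciform determinant is equivalent to pairing index k with index n+1−k, which decouples the matrix into 2 × 2 blocks, plus a single central entry when n is odd. The sketch below illustrates this with generic diagonal and antidiagonal entries; the function names and the random test values are illustrative and are not the entries of the blocks in the text.

```python
import numpy as np

def cruciform(diag, anti):
    """Matrix with diag[k] at (k, k) and anti[k] at (k, n-1-k); zero elsewhere."""
    n = len(diag)
    C = np.diag(np.asarray(diag, dtype=complex))
    for k in range(n):
        if k != n - 1 - k:          # for odd n the central entry is taken from diag
            C[k, n - 1 - k] = anti[k]
    return C

def cruciform_det(diag, anti):
    """Product of 2-by-2 determinants over the floor(n/2) index pairs (k, n-1-k)."""
    n = len(diag)
    det = 1.0 + 0.0j
    for k in range(n // 2):
        det *= diag[k] * diag[n - 1 - k] - anti[k] * anti[n - 1 - k]
    if n % 2 == 1:
        det *= diag[n // 2]
    return det

rng = np.random.default_rng(0)
checks = []
for n in (4, 5):
    d = rng.standard_normal(n) + 1j * rng.standard_normal(n)
    a = rng.standard_normal(n) + 1j * rng.standard_normal(n)
    checks.append(np.isclose(np.linalg.det(cruciform(d, a)), cruciform_det(d, a)))
print(checks)
```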
8. Dimer Partition Function in the Large Lattice Limit
We can calculate the partition function explicitly if we take the limit as the lattice becomes large, and . It is better to find this limit of the logarithm of Z, so that products become sums and we have
In the infinite limit, and are much greater than unity, and so and , while . The basic rule for converting sums to integrals is
with and . Using this, the sum over ω becomes
Choosing the integration variable as is convenient, giving a factor , and when is large, the lower limit becomes . The upper limit is , and so the integral takes the form
The analog for the sum over W is
The double sum is then
The integral is given by Equation (4.224.9) of reference [16] as
Here, or , so that the integrals can be evaluated, although we have no need to do so. The entropy is an extensive quantity, so the logarithm of the partition function is proportional to as the number of sites becomes large. The result diverges as the lattice becomes infinite, and the only meaningful expression is the “partition function per site”,
Fisher gives this result and follows it with a derivation of the entropy and an excellent discussion of the result and of the associated dimer densities per site and , which will not be reiterated here [7].
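The approach to the large-lattice limit can be illustrated numerically. The sketch below uses the standard closed-form product for the dimer partition function on a finite lattice (due to Kasteleyn and to Temperley and Fisher), written here as an assumption because the finite-lattice expression is not reproduced in this section. For x = y = 1 the logarithm of the partition function per site is known to tend to G/π, with G Catalan's constant.

```python
import numpy as np

CATALAN = 0.915965594177219   # Catalan's constant G

def log_Z(m, n, x=1.0, y=1.0):
    """log of the dimer partition function for m rows and n columns (closed-form product)."""
    j = np.arange(1, m + 1)[:, None]
    k = np.arange(1, n + 1)[None, :]
    terms = (4 * x**2 * np.cos(np.pi * k / (n + 1))**2
             + 4 * y**2 * np.cos(np.pi * j / (m + 1))**2)
    return 0.25 * np.sum(np.log(terms))   # assumes m*n is even, so no term vanishes

per_site = {size: log_Z(size, size) / size**2 for size in (8, 32, 128, 512)}
limit = CATALAN / np.pi
for size, value in per_site.items():
    print(size, value, limit - value)
```

The difference from the limiting value shrinks roughly in proportion to the inverse linear size, as expected for a boundary correction on a lattice with free edges.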
9. Colored y Dimers and the Partition Function for Dimers and Monomers
The partition function in Equation (127) is applicable to any lattice formed from rows and columns of lattice points. Suppose that the lattice has the form of a two-leg ladder, with rows and columns and that it is completely filled with dimers, . A dimer arrangement on this lattice is shown in the top of Figure 14. The y-direction dimers are colored orange to distinguish them from the x dimers, which are colored green. Looking down on this ladder so that it is viewed edge-on, as in the bottom picture, it looks like a linear lattice filled with dimers and orange monomers.
Figure 14.
A two-leg ladder ((top) figure) viewed from above ((bottom) figure). The y dimers are shown in orange, with the x dimers shown in green. The bottom picture could be considered to be a linear lattice in which the orange y dimers appear to be monomers. The red dots and dashes outline the lattices.
What is the effect of randomly coloring the vertical dimers red and blue and assigning them corresponding activities and , just like the red and blue balls discussed earlier? If we could find the partition function for this system, we could have a linear system of dimers, monomers, and vacant sites, even if this problem has not been solved analytically in two dimensions in Fisher’s work.
We can see how to do this by starting with the expression
because is the number of states accessible to the system when there are precisely horizontal dimers and vertical dimers. If the vertical y-dimers come in two colors, and , the number of distinct states increases, and does so by a factor of , where is the number of red y-dimers and is the number of blue y-dimers, and . The sum over gains the additional factor
and we have
The link fields are colorless and do not distinguish between and , even though they throw away a large fraction of the random arrangements of the child’s toys. Thus, the rest of the calculation proceeds as before, the only change being the replacement , giving
We now return to the two-leg ladder. Here, we want a very long ladder, , while for two legs, . Evaluating the partition function for gives , and we have the logarithm of the partition function given by
The angle is , and so the cosine is , giving
Combining the first term with the sum brings a factor of two inside the log in the sum, giving
This is the correct result for the two-leg ladder. It is not, however, correct for the linear chain. To see why, imagine squeezing the rungs of the ladder together, producing the orange disks from the original y-oriented dimers, as shown in Figure 15. This results in a double dimer, one from each leg of the ladder, on pairs of sites that should hold only a single dimer. In the logarithm of the partition function, the origin of the double dimers is the term, which because we have logarithms, gives . The correct result for the linear chain, , has replaced by x,
Fisher implies that he confirmed this result by including vacancies in his original formulation and evaluating the Pfaffian, and states that this result is consistent with other ways of determining configurations of a monomer–dimer linear chain.
Figure 15.
Imagine squeezing the rungs of the ladder together, producing the orange disks from the original y-oriented dimers.
10. The Linear Chain
This gives the partition function for an infinitely long linear chain by letting and using the integral representation introduced above. This gives
Evaluating the integral gives where and . In the infinite limit of a very long chain,
Under the square root, we have , giving
Then the log of the partition function per site is
for a long monomer–dimer chain. This allows the treatment of systems involving the attachment of monomers and dimers to a long polymeric chain, such as charged dimers and point-like ions binding electrostatically to a DNA strand.
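An independent check of the long-chain result is provided by the elementary transfer recursion for an open chain, in which the last site carries either a monomer or the end of a dimer. The sketch below is not the Pfaffian route followed in the text. It assumes that the per-site result takes the standard closed form ln[(y + sqrt(y² + 4x))/2], with x the dimer activity and y the monomer activity; the function names are illustrative.

```python
import numpy as np

def log_Z_chain(N, x, y):
    """log of the monomer-dimer partition function of an open chain with N sites."""
    z_prev, z_curr = 1.0, y                  # Z_0 (empty chain) and Z_1 (one monomer)
    for _ in range(N - 1):
        # Last site holds a monomer (weight y) or closes a dimer (weight x).
        z_prev, z_curr = z_curr, y * z_curr + x * z_prev
    return np.log(z_curr)

def log_lambda(x, y):
    """Assumed closed form for the log of the partition function per site."""
    return np.log((y + np.sqrt(y**2 + 4 * x)) / 2)

x, y, N = 0.7, 1.3, 400                      # arbitrary activities and chain length
per_site_numeric = log_Z_chain(N, x, y) / N
per_site_closed = log_lambda(x, y)
print(per_site_numeric, per_site_closed)
```

The two numbers differ by a term of order 1/N that comes from the free ends of the chain. For much longer chains the recursion would need rescaling to avoid floating-point overflow.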
For colored vertical dimers, we replace the activity y by , with the activity of red y-dimers and y the activity of blue y-dimers. Making this change in the partition function of the one-dimensional very long chain, we have
We can now identify the red y-dimers with monomers with activity and the blue y-dimers with vacancies with activity , because vacancies have neither energy nor chemical potential, and so . We will call the activity of the x dimers . Then the log of the partition function per site that includes y dimers and vacancies is
In general, the entropy per site is given by the expression
Therefore, the entropy for the dimer model becomes
where twice the derivative of the partition function is
The derivative of the activity with respect to is
Therefore, the derivative of the partition function per site is
This means that the entropy is given by
Simplifying, we have
Similarly, the mean site occupation by the perpendicular dimers is
Therefore, taking the derivative, we have
The derivative of the activity with respect to is
and this is zero applied to the activity of a different type of particle. Then the mean site occupation by the perpendicular dimers is
Simplifying, we have
Factoring out from the parentheses, we have
Now we see that the quantity inside the large parentheses is , and so the mean site occupancy reduces to
The same steps lead to the mean site occupancy for the vacancies v, which is
Since the parallel dimers each occupy two sites, the mean occupancy per site must be doubled, and we have the mean site occupancy for the parallel dimers as
Taking the derivative, we have
Simplifying yields
The sum of the occupancies for the three species should be 1, and so let us check that. The sum is
Substituting in the numerator, we have
Factoring from the numerator and expanding, we have
Factoring from the numerator produces
The last factor on the right is exactly , and so the right-hand side reduces to one, and we have
Therefore, the total occupancy is, correctly, unity.
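The sum rule can also be confirmed numerically. The sketch below assumes that the log of the partition function per site has the form ln[(s + sqrt(s² + 4x))/2] with s the sum of the monomer and vacancy activities, and obtains each occupancy as the logarithmic derivative with respect to the corresponding activity, doubled for the parallel dimers. The names `mono`, `vac` and `x` for the three activities are illustrative, and the derivatives are taken by central differences rather than analytically.

```python
import numpy as np

def log_lambda(x, mono, vac):
    """Assumed log partition function per site for dimers, monomers and vacancies."""
    s = mono + vac
    return np.log((s + np.sqrt(s**2 + 4 * x)) / 2)

def occupancies(x, mono, vac, h=1e-6):
    """Mean site occupancies (parallel dimers, monomers, vacancies) by central differences."""
    def log_derivative(index):
        up, down = [x, mono, vac], [x, mono, vac]
        up[index] *= 1 + h
        down[index] *= 1 - h
        # Approximates a * d(ln lambda)/da for the activity a at position `index`.
        return (log_lambda(*up) - log_lambda(*down)) / (2 * h)
    # Each parallel dimer covers two sites, hence the factor of two.
    return 2 * log_derivative(0), log_derivative(1), log_derivative(2)

n_par, n_mono, n_vac = occupancies(x=0.7, mono=1.3, vac=1.0)   # arbitrary activities
print(n_par, n_mono, n_vac, n_par + n_mono + n_vac)
```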
11. Results for the Entropy, Average Occupation, and Total Charge
Under certain conditions, DNA molecules condense. In this process, the DNA chain rolls into a tight toroid. If a DNA strand were simply placed in aqueous solution, this would not happen, because the phosphate groups that form the DNA backbone are negative, so that one length of the chain repels another. However, in a solution containing cations, the positive cations are electrostatically attracted to the negative backbone, neutralizing it or even inverting the charge. When the charge is inverted, the DNA chain dressed by the cations becomes positive, allowing it to coil up compactly.
Understanding the statistical mechanics of such processes gives insight into the evolution of complex life and the dynamics that maintains it. This led two of the authors to apply methods borrowed from the physics of interacting particles and quantum field theory to the electrostatics of biomolecules [9,10] using lattice gas models. Such models regard all ions, monomers, dimers, or larger polyions as pointlike particles, completely abandoning the geometric constraints that come from the extended nature of the adsorbing species. All the geometry inherent in attaching a polyion flat against a surface so that it covers several lattice sites is lost.
In the plots of this section, any numerical parameters used were taken from our earlier work [9,10]. In these plots, the horizontal axis is the dimensionless product . Because parallel dimers occupy two sites, while the perpendicular dimers only occupy one, we set . Strong binding to the lattice occurs when is large and negative, while the weak binding region is where is positive. Assuming that the DNA double helix is in equilibrium with a solution of dimers, all the chemical potentials must be the same and be equal to the chemical potential of dimers in solution given by . The plots are all for the single physiological temperature of K, with .
The Fisher dimer model, as formulated in the previous sections, describes parallel dimers, with activity , perpendicular dimers, with activity , and vacancies with activity v. These vacancies are on the negative backbone, and each of these vacant sites carries a charge of , in units of the magnitude of the electron charge. The dimers have charge , with one positive charge on each end. Then, electrostatic attraction causes the dimers to bind to the vacant sites. They can do so in two ways, first by lying flat, parallel to the DNA chain, and binding to two vacant sites, so that they have an activity of . Alternatively, they can protrude at right angles to the DNA chain, with only one end bound to the chain and activity . These two possibilities are called parallel and perpendicular dimers, with the perpendicular dimers acting as the monomers in the dimer, monomer, and vacancy model.
Figure 16 shows the entropies given by the Fisher dimer model and by the non-interacting lattice gas model. The striking feature is the way that entropy appears to plateau in the strong binding region. The lattice gas model does not show this because all the dimers lie flat on the lattice by . The nonzero value of the entropy in the Fisher dimer model suggests that disorder persists in this region even as the binding force becomes quite strong.
Figure 16.
The plot of the two entropies given by the lattice gas model and the Fisher model of dimers.
To understand this disorder in more detail, Figure 17 shows the site occupancies for the Fisher dimer model. In the region of strong binding, both the site occupancies for the parallel and perpendicular dimers also plateau, even though the number of vacancies drops to near zero. This indicates that the disorder that leads to the nonvanishing entropy consists of a mixture of parallel and perpendicular dimers, occupying practically all the sites and accounting for the nonzero value of the entropy as the binding force becomes strong.
Figure 17.
The average occupation of species in the Fisher model of dimers showing species mixing even at large negative binding energy. The purple dashed line is a numerical check showing that the sum of occupancies is one, as demonstrated analytically in the text.
For comparison, Figure 18 shows the corresponding site occupancies for the lattice gas model. There, the perpendicular dimer occupancy drops to zero in the region where the binding force is strong, as does the vacancy density, and the lattice is completely occupied by parallel dimers. An interpretation of the contrasting behavior between the two models is that the actual shape of a dimer spanning two sites when it lies parallel is responsible for the disorder, while in the lattice gas model, the parallel dimers are treated as though they are monomers, but with twice the binding energy.
Figure 18.
The average occupation of species in the lattice gas model, for comparison with Figure 17.
The phenomenon of the DNA strand rolling up compactly requires excess charge, and so Figure 19 shows the total charge . This charge is positive when the binding force is strong because in that region there are few negative vacancies, but still a large number of perpendicular dimers, while each parallel dimer neutralizes two sites. That persistent positive charge in the Fisher model does not appear in the lattice gas model, where the charge drops to near zero in that region. In the weak binding region, the charge becomes small in the same way in both models, and the two curves lie on top of one another.
Figure 19.
A comparison of the total charge vs. the binding energy on the lattice between the lattice gas model and the Fisher model.
In the context of DNA compaction, this excess charge is the physically most important feature seen from the Fisher model. This is because it leads to larger and more persistent charge inversion than is seen in the lattice gas model. It does so because once a sequence of parallel dimer–vacancy–parallel dimer forms, the only way the vacancy can be filled is with a perpendicular dimer. This must be the persistent disorder that occurs in the region of strong binding force.
12. Discussion
We have described a simplified version of Fisher’s derivation of the partition function of the completely filled dimer model. This provides the entropy as calculated by Fisher, and the mathematics described has some relevance to the Ising spin model. While this work was in progress, we became aware of the paper by Allegra and Fortin [17] that proposes the use of Grassmann variables to solve the dimer problem including monomers, obtaining the partition function as a product of two Pfaffians. That opens the way for entropy and site occupancy calculations for the two-dimensional sheet following methods described here.
After extracting the partition function for a one-dimensional dimer–monomer–vacancy system from that for a two-leg ladder, we compared the entropy and site occupations as a function of the binding energy to the lattice with the similar results for the lattice gas. The entropy appeared to plateau at a nonzero value in the strong-binding-force region, suggesting that disorder persists in that region.
In this context, the dimers represent dimeric polyions with a unit positive charge on each end that neutralizes the unit negative charge on the backbone lattice site, so the parallel dimers are effectively charge-neutral. The monomers represent the same dimers but oriented perpendicular to the surface so that only one end neutralizes the backbone lattice site to which it is attached, and it makes a unit-positive-charge contribution to the charge density on the chain, while each vacant site contributes its unit negative charge. In the strong-binding-force region, most lattice sites are occupied and few vacancies remain, but the perpendicular dimers scattered among the parallel dimers are a source of disorder. This leads to the persistent excess charge in the strong-binding region, and provides the disorder that accounts for the persistent plateau in the entropy there as well.
In our earlier paper [10] using the lattice gas model, we included electrostatic repulsion among the dimers, monomers, and negative empty sites on the DNA backbone to one-loop order. That did produce a reduction in the entropy by up to of the total, but only in the weak-binding region of the plot. Consequently, this reduction would not affect the strong-binding region where the dimer model entropy appears to plateau at a nonzero value and the excess positive charge persists. Because this strong-binding region is the more important one, the geometry of the extended dimer occupying more than one site is the physically more important effect.
In other words, our conclusion from this work is that if physical consequences are the concern, the extended geometry of the attached molecules has more influence than the electrostatic many-particle interactions of species that carry electrical charge. When determining entropy, the shape of the attached molecule and the volume it excludes does matter, and ways of including these effects are needed.
Author Contributions
Writing—original draft preparation, J.C.B., M.F.B. and T.M.; writing—review and editing, J.C.B., M.F.B. and T.M. All authors contributed to this paper equally. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
This is not relevant to our study. This does not require ethical approval.
Data Availability Statement
There is no data involved. This is a theoretical study.
Acknowledgments
We are grateful to Kevin R. Ward, Department of Emergency Medicine, University of Michigan, for introducing us to the role of charge in biological systems and to the practical consequences that could be achieved by controlling the charge distribution on biological molecules.
Conflicts of Interest
Author John C. Baker was employed by the company CACI International Inc. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Appendix A. Trace Theorems of the Aμ Matrices
We now prove the trace theorems that we need in order to study the statistical mechanics of the dimer model on a square lattice. The commutation law with shows that
This ensures that the trace of a single matrix vanishes, because inserting and using the cyclic property of the trace yields
where to obtain the third equality we anticommuted and , introducing a minus sign. Now, the product can be regarded as some d-dimensional matrix M with matrix elements . Then, . Thus
This can only be true if
This is easily generalized to the trace of any product of an odd number of matrices, because then
since an odd number of interchanges is required to move past other matrices. We have
so that the trace is zero,
However, the trace of the product of an even number of matrices does not always vanish. The simplest example is
Taking the trace gives
where in the last equality we used the cyclic property of the trace. Since the last trace on the right is the same as the expression we started with, moving the last term on the right to the left side gives
after canceling a factor of two. When .
Now, let us tackle the general case of arbitrary numbers of both matched pairs and unmatched s. Any product can be reduced to sums of products of the and a leftover term by repeated application of the anticommutation relation . Consider
Continuing to move to the end in the last term gives
The metric commutes with all the , and can be moved to the front of each term. We then have the general pattern
with the signs alternating from one term to the next, and the last having an overall parity factor of because interchanges are needed to move to the end.
Although we could now apply this general formula, which we will call the contraction theorem, to continue to reduce the sub-products like , let us first take the trace of this expression and use the cyclic property to see that
Here, the trace on the left is the same as the last trace on the right, and if n is even, they are added to give
However, if n is odd, they cancel, and the sum of all terms remaining on the right must vanish,
A second consequence of n being odd is that it is not possible to continue the reduction to all possible pairs, that is, a product of . There will always be one left over. In that case, since , the trace becomes either zero if at least one factor of vanishes, or we have just . Consequently, the trace always vanishes if n is odd.
For n even, we have
because picks out the term or terms in the sum over k that have and replaces the factor by the d-dimensional identity, which can be omitted from the remaining product. The simplest case is when only one term has , and that is the only possibility that we will encounter, because the construction we will use can only place at most two on a single link, or dual site, of the lattice. Then, the trace above reduces to
with P the appropriate permutation index. Then the contraction theorem can be applied again to the remaining product. Continuing the process eventually reduces the right side to a permutation factor times the trace of , which is d, or to a product of unmatched , the trace of which vanishes.
Appendix B. Pfaffians and Determinants
Pfaffians are rarely discussed. They do not appear in common mathematical physics textbooks, but they have shown up in the physics literature—there is a Pfaffian quantum Hall state, for example [18,19].
A Pfaffian is similar to a determinant, but it is taken on the upper right triangle of an antisymmetric matrix. One of the Pfaffian’s most important properties is that its square is equal to the determinant of the corresponding antisymmetric matrix M,
where an example of dimension is the antisymmetric matrix
The corresponding Pfaffian is sometimes written as
Here, we use the convention that M is an antisymmetric matrix, is its determinant, and is the Pfaffian of the upper right triangle of M (with the diagonal, which consists only of zeros, deleted).
The Pfaffian expansion by the first row is like a regular determinant with the notable caveat that both the rth row and column and the sth row and column are deleted. Suppose that we illustrate with an example of order ,
Expanding by the top row gives a first term of
Including the remaining terms, which have alternating signs just like a determinant, we have
Continuing, we expand each of these minors by its first row. The first gives
Including the rest of the terms, the Pfaffian is
This is the full expansion of the Pfaffian.
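The expansion by the first row translates directly into a short recursive routine. The sketch below is a minimal illustration, suitable only for small matrices because its cost grows factorially; the function name `pfaffian` is illustrative. It checks that the square of the Pfaffian equals the determinant for a 4 × 4 example and for a random antisymmetric 6 × 6 matrix.

```python
import numpy as np

def pfaffian(M):
    """Pfaffian of an antisymmetric matrix by recursive expansion along the first row."""
    M = np.asarray(M, dtype=float)
    n = M.shape[0]
    if n == 0:
        return 1.0
    if n % 2 == 1:
        return 0.0
    total = 0.0
    for j in range(1, n):
        if M[0, j] == 0.0:
            continue
        # Delete both the first row and column and the j-th row and column.
        keep = [i for i in range(n) if i not in (0, j)]
        total += (-1) ** (j - 1) * M[0, j] * pfaffian(M[np.ix_(keep, keep)])
    return total

upper = np.array([[0, 1, 2, 3],
                  [0, 0, 4, 5],
                  [0, 0, 0, 6],
                  [0, 0, 0, 0]], dtype=float)
M4 = upper - upper.T                       # antisymmetric 4-by-4 example
print(pfaffian(M4), np.linalg.det(M4))     # Pfaffian is 1*6 - 2*5 + 3*4 = 8

rng = np.random.default_rng(1)
R = rng.standard_normal((6, 6))
M6 = R - R.T
print(np.isclose(pfaffian(M6) ** 2, np.linalg.det(M6)))
```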
Appendix C. Block by Block Multiplication
In this context, it is important to realize that matrices can be multiplied block by block. Let us justify this using a simple example. Suppose that we have a matrix A divided into four blocks, , , , and ,
and a similar division of a second matrix B,
The matrix elements of the product of these two matrices are
The sum over j can be divided up into two sums,
Suppose that and . This gives the upper left entry in the block product. The matrix elements that appear in the product are
where we wrote the result as a block-matrix product in the last equality. If and , we obtain the lower left entry in the block product. The matrix elements that appear in the sum are
If and , we obtain the lower right entry in the block product. The matrix elements that appear in the sum are
Now, if we calculate the product by merely using block indices, we obtain
where we offset the individual blocks using brackets. This is the same result. The conclusion is that block matrices can be multiplied block by block.
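The same conclusion can be confirmed numerically for blocks of unequal size. The sketch below splits two random 5 × 5 matrices at an arbitrary position p, forms the product block by block, and compares it with the ordinary matrix product; the matrices and the split position are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 5))
B = rng.standard_normal((5, 5))
p = 2   # split position: blocks are 2x2, 2x3, 3x2 and 3x3

A11, A12, A21, A22 = A[:p, :p], A[:p, p:], A[p:, :p], A[p:, p:]
B11, B12, B21, B22 = B[:p, :p], B[:p, p:], B[p:, :p], B[p:, p:]

# Multiply as if the blocks were ordinary matrix elements, keeping the order of factors.
blockwise = np.block([
    [A11 @ B11 + A12 @ B21, A11 @ B12 + A12 @ B22],
    [A21 @ B11 + A22 @ B21, A21 @ B12 + A22 @ B22],
])
print(np.allclose(blockwise, A @ B))
```

The only requirement is that both matrices are split at the same position, so that the inner dimensions of each block product match.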
Appendix D. Evaluation of tω,ω′
This is a derivation of the matrix element of the block Y in antidiagonal form.
Simplifying somewhat, the sum becomes
The next task is to evaluate the sum of the product of sines
First, the product of sines should be turned into a sum of cosines via the identity $\sin a \sin b = \tfrac{1}{2}[\cos(a-b) - \cos(a+b)]$. This gives
which can be put into exponential form as
where Re indicates that the real part of the sum is required. Because is a positive integer, there is a factor of in each exponential. We also notice that each term can be written as an exponential to the power, and we have
Each term is now in the form of a geometric series [20] and can be summed as
giving
Distributing the exponentials in each term causes the first exponential in each numerator to contain and to reduce to simpler exponentials as
By Euler’s formula, the first term in each numerator reduces to and , respectively, yielding
The denominator of the first term in the brackets vanishes at , and so this geometric-series result cannot be used when . Those instances will be discussed in Section 4.
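The geometric-series step, and the reason it fails when a denominator vanishes, can be illustrated with a generic sum of complex exponentials. This is a sketch under stated assumptions: the angle `theta`, the upper limit `n`, and both function names are placeholders chosen here, not the specific arguments that appear in this appendix.

```python
import cmath

def geometric_sum_direct(theta, n):
    """Sum of exp(i*k*theta) for k = 1..n, term by term."""
    return sum(cmath.exp(1j * k * theta) for k in range(1, n + 1))

def geometric_sum_closed(theta, n):
    """Closed form r(1 - r^n)/(1 - r) with r = exp(i*theta); undefined when r = 1."""
    r = cmath.exp(1j * theta)
    return r * (1 - r ** n) / (1 - r)

theta, n = 0.7, 9
direct = geometric_sum_direct(theta, n)
closed = geometric_sum_closed(theta, n)
print(abs(direct - closed) < 1e-12)

# At theta = 0 every term equals 1, so the direct sum is simply n,
# while the closed form divides by zero and cannot be evaluated.
print(geometric_sum_direct(0.0, n).real)
```

The direct sum stays finite at the singular angle, which is why those cases have to be handled by returning to the original sum rather than the closed form.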
Moving forward from here depends on whether the exponents of the sign terms inside the Re operation are odd or even. If is even then must also be even, so that both terms inside the square brackets have the same parity in the first term in their numerators. When and are both even, it is easy to see that reduces to
When and are both odd, the situation is more complicated. The first terms in each of the numerators equal , and the sum simplifies to the form
From here, the numerators and denominators are close to being sines and cosines. The factor multiplies both the numerator and denominator of the first exponential term, and multiplies the top and bottom of the second term, producing
Now, both terms in brackets have a sine in the numerator and a cosine in the denominator. Since and , is
Because each term inside the Re operation is purely imaginary, this is identically 0. Therefore, the matrix elements of Y will be zero everywhere except where . These elements could not be determined through this method because they cause a singularity in a denominator.
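The pattern of vanishing elements can be checked numerically on a closely related sum. The sketch below is an assumption-laden illustration: it uses an alternating-sign sum of products of sines with angles kωπ/(n+1), which is a standard construction that yields an antidiagonal matrix, and it is not claimed to be the exact sum evaluated in this appendix. The function name `alternating_sine_sum` is hypothetical.

```python
import numpy as np

def alternating_sine_sum(n):
    """T[w, w'] = sum_{k=1}^{n} (-1)^k sin(k w pi/(n+1)) sin(k w' pi/(n+1))."""
    k = np.arange(1, n + 1)
    w = np.arange(1, n + 1)
    S = np.sin(np.outer(w, k) * np.pi / (n + 1))   # S[w-1, k-1]
    signs = (-1.0) ** k                            # assumed alternating weight
    return (S * signs) @ S.T

n = 6
T = alternating_sine_sum(n)

# Every element vanishes except on the antidiagonal (w' = n + 1 - w),
# where the value is -(n + 1)/2.
expected = -(n + 1) / 2 * np.fliplr(np.eye(n))
print(np.allclose(T, expected))
```

Evaluating the sum directly, as done here, sidesteps the singular denominator that appears in the closed-form geometric-series result for the nonvanishing elements.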
To show the behavior of the elements of when , one must backtrack to the general form of the sum , which now is
and substitute to obtain
It is useful to break the arguments of the sines into separate terms to look for identities. This gives
The trigonometric identity allows this to be written as
Because k and are both integers, this reduces to
where the change in signs occurs via . Now, the trigonometric identity is used to give
which has been seen before in solving the normalization of , where the cosine sum is , and so we have
The matrix elements can thus be written as
Recalling that allows this to be written as
This can also be written as
Appendix E. Evaluating the Determinant and the Partition Function
The determinant of a cruciform matrix is easily determined by expanding by the first row of the matrix and then in each minor expanding by the last row. Then, each new minor is expanded by the first row and each of the resulting minors by their bottom rows. This repetition creates factors of the type . The brackets indicate the floor function, which gives the greatest integer less than or equal to x.
It is not immediately obvious that this is the correct expansion. This becomes clearer if we consider the general cruciform matrix
The expansion begins with the first row as
where the only two elements in row one are multiplied by their minors. A factor of is necessary to obtain the correct sign since it depends on the number of rows in the matrix. Then the remaining minors are expanded by their bottom rows. Note that this minor has rows, since its first row was eliminated,
Notice that the second term in this expansion was in the row and first column and so there is an additional factor of . In general, when taking a determinant, a negative sign is included with any element whose row and column positions sum to an odd number. The first term in the expansion of will always be positive because the subscripts of the elements on the main diagonal always sum to an even number. The sums of the row and column numbers of the elements on the antidiagonal will depend on whether n is odd or even. However, if the matrix has an odd number of rows and columns, then the minor obtained by expanding by a row will have an even number of rows and columns and vice versa. Therefore, either the element in the position will have a negative sign or the one in position will have a negative sign, but they will not both have one. The sign of the second term is then always negative, since . After factoring out the remaining determinant, the expansion by the first row can be written as
The remaining determinant is expanded in the same way. This process is repeated until the matrix is fully expanded. The only thing to be aware of at this point is that if the matrix has odd dimension, the final remaining determinant will consist of a single element.
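The result of this repeated first-row and last-row expansion can be sketched numerically. The example assumes a cruciform matrix means one whose only nonzero entries lie on the main diagonal and the antidiagonal; the helper names `make_cruciform` and `cruciform_det` are introduced here for illustration. Each outer pair of rows contributes a 2 x 2 factor, and an odd dimension leaves the single center element.

```python
import numpy as np

def make_cruciform(diag, anti):
    """Build a matrix with `diag` on the main diagonal and `anti` on the antidiagonal."""
    n = len(diag)
    m = np.zeros((n, n))
    for i in range(n):
        m[i, n - 1 - i] = anti[i]
        m[i, i] = diag[i]   # at the center of an odd matrix the diagonal value is kept
    return m

def cruciform_det(m):
    """Determinant as a product of 2 x 2 factors pairing row i with row n-1-i."""
    n = m.shape[0]
    det = 1.0
    for i in range(n // 2):          # floor(n/2) pairs
        j = n - 1 - i
        det *= m[i, i] * m[j, j] - m[i, j] * m[j, i]
    if n % 2 == 1:                   # odd dimension: leftover center element
        c = n // 2
        det *= m[c, c]
    return det

rng = np.random.default_rng(2)
for n in (4, 5):
    m = make_cruciform(rng.normal(size=n), rng.normal(size=n))
    print(n, np.isclose(cruciform_det(m), np.linalg.det(m)))
```

The pairing works because reordering rows and columns by the same permutation, which leaves the determinant unchanged, brings the matrix into block-diagonal form with 2 x 2 blocks.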
The square of the partition function involves the evaluation of a cruciform matrix as
where the transformed matrix is
where . The first iteration in the expansion of this cruciform matrix is
A second iteration gives
which is enough to see the pattern in the terms in the resulting product. In general, this determinant of the eigenvalue matrix reduces to the form
where s is a placeholder positive integer for now. When is odd, the center element of the matrix multiplies the result after the product is taken. This becomes the last term in the odd expansion.
Ideally, what is needed is a product over ω spanning the entire matrix , but the extra middle term for matrices of odd dimension prevents a single product expression over ω. Recall that ω was the original subscript to and that s is just a placeholder. To change to a product over ω, the expression must be rewritten. We substitute and pull the factor of out of the square in the second term. Then, a factor of is pulled out of all of the terms. This leaves
Because of the behavior of i when exponentiated, the sign terms inside the bracket reduce via . The sign will always be since . Therefore, can be written as
where has been moved outside of the product.
The next step in working toward a product over ω is to factor the first bracketed term. This yields
where s skips over in the odd case for the moment. To arrive at this factorization, it is necessary to realize, from Equation (A70), that , which makes the cross terms cancel. We will show this now. Since ,
After splitting up the fraction, this is
which becomes
Through the trigonometric identity , the expression becomes
This is quickly reduced to
Then by the relationship is expressed as
which reduces to
A factor of is introduced, and then the relationship is used in reverse, giving
This relationship causes all of the cross-terms to cancel out and allows the factoring of the first term in the product of .
This relationship is also useful for reducing the extra term in the odd expansion. In the case where , it is shown here that .
The term can now be easily manipulated. The first term is 0 due to the relationship above. Now the factor in front of the needs to be investigated. First we see that
Then because , and because this is the case where is odd, , and the above product reduces to
Now, the leftover term in the odd expansion can be written as . The product can be written as a single product from to instead of being split into cases and instead of going to .
where a 2 has been factored out of each term in the product, giving a out front. The two different sines, and , will take opposite signs as ω goes from 1 to . In this form of , the term will take all of the values that were formerly split between the combination of the two sines because the product now goes to .
The arguments in can be reduced slightly. By the trigonometric identity , any angle in the form can be written as . Combining this with the definition of the state function in Equation (A63) gives
After pulling the terms between the products out, the state function is
Equation (A84) is the exact square of the partition function of a finite lattice as a product of terms. If both and are odd, then there is a term in the product for which and , which means that both the x and y parts vanish identically. The whole product is then multiplied by zero, correctly reflecting the fact that a lattice with an odd number of points cannot be occupied solely by dimers. Therefore, the lattice is required to have an even number of points and either or or both must be even.
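The vanishing of the product for a lattice with an odd number of sites can be illustrated with the widely quoted closed form for the number of dimer coverings of an m x n lattice with unit weights, Z = Π_j Π_k [4cos²(πj/(m+1)) + 4cos²(πk/(n+1))]^{1/4}. This form is assumed here as a stand-in; it is not claimed to reproduce the notation or the weights of Equation (A84). Both function names are hypothetical, and the brute-force counter is included only as an independent check on small lattices.

```python
import math
from functools import lru_cache

def dimer_count_closed_form(m, n):
    """Unit-weight dimer coverings of an m x n lattice from the product formula."""
    prod = 1.0
    for j in range(1, m + 1):
        for k in range(1, n + 1):
            cx = 2.0 * math.cos(math.pi * j / (m + 1))
            cy = 2.0 * math.cos(math.pi * k / (n + 1))
            prod *= cx * cx + cy * cy   # zero when both cosines vanish
    return prod ** 0.25

def dimer_count_brute_force(m, n):
    """Count coverings by always placing a dimer on the first empty site."""
    full = (1 << (m * n)) - 1

    @lru_cache(maxsize=None)
    def fill(mask):
        if mask == full:
            return 1
        pos = 0
        while (mask >> pos) & 1:
            pos += 1
        r, c = divmod(pos, n)
        total = 0
        if c + 1 < n and not ((mask >> (pos + 1)) & 1):   # horizontal dimer
            total += fill(mask | (1 << pos) | (1 << (pos + 1)))
        if r + 1 < m and not ((mask >> (pos + n)) & 1):   # vertical dimer
            total += fill(mask | (1 << pos) | (1 << (pos + n)))
        return total

    return fill(0)

for m, n in [(2, 2), (2, 3), (3, 3), (4, 4)]:
    print((m, n), round(dimer_count_closed_form(m, n)), dimer_count_brute_force(m, n))
```

For the 3 x 3 lattice the factor with j = k = 2 has both cosines equal to zero (up to rounding), so the whole product collapses to zero, in agreement with the brute-force count.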
Because the entropy will eventually be required, it would be best to have a partition function that is not squared. If it is assumed that is even, the partition function can be manipulated into a more usable form. Holding even instead of is an arbitrary choice. Moving the 2 back inside the products makes the partition function take the form
In the case where is odd, and focusing only on the product over W, it is possible to write this as
where . Comparing the factor with to the factor with in the cosine shows that one of those terms is the negative of the other via
This relationship holds for the pair and and so on. Thus, for each term in the product, there will appear a complex conjugate of that term. The only exception is the unpaired term where , and so that term must be written separately. With the terms paired off, the product can be cut in half.
Substituting a gives
The last product reduces to via the identity , as can be seen here,
The sign factor may be combined with the already existing sign factor to cancel as when is even and is odd. Then,
Using the same sort of expansion over the as was used for the product over gives
Equation (A84) is the exact partition function of a finite lattice as a product of terms. If both and are odd, then there is a term in the product for which and , which means that both the x and y parts vanish identically. The whole product is then multiplied by zero, correctly reflecting the fact that a lattice with an odd number of points cannot be occupied solely by dimers. Therefore, the lattice is required to have an even number of points, and either or or both must be even.
In the case for which is even, the solution follows the same process. The key differences are that the product over will not have an unpaired term when is even and the sign term in Z reduces identically to 1. Thus, the partition function is
References
1. Hohenberg, P.; Kohn, W. Inhomogeneous Electron Gas. Phys. Rev. 1964, 136, B864.
2. Bishop, M.F.; Ferrone, F.A. The Sickle-Cell Fiber Revisited. Biomolecules 2023, 13, 413.
3. Cao, Z.; Ferrone, F.A. Homogeneous Nucleation in Sickle Hemoglobin: Stochastic Measurements with a Parallel Method. Biophys. J. 1997, 72, 342–352.
4. Zhang, X.; Dai, X.; Gao, L.; Xu, D.; Wan, H.; Wang, Y.; Yan, L.T. The entropy-controlled strategy in self-assembling systems. Chem. Soc. Rev. 2023, 52, 6806–6837.
5. Plischke, M.; Bergersen, B. Equilibrium Statistical Mechanics; World Scientific: Singapore, 2006.
6. Temperley, H.N.V.; Fisher, M.E. Dimer problem in statistical mechanics-an exact result. Phil. Mag. 1961, 6, 1061–1063.
7. Fisher, M.E. Statistical Mechanics of Dimers on a Plane Lattice. Phys. Rev. 1961, 124, 1664–1672.
8. Kasteleyn, P.W. The Statistics of Dimers on a Lattice: I. The number of dimer arrangements on a quadratic lattice. Physica 1961, 27, 1209–1225.
9. Bishop, M.F.; McMullen, T. Lattice-gas model of DNA charge inversion by a positively charged polyelectrolyte. Phys. Rev. E 2006, 74, 021906.
10. Sievert, M.D.; Bishop, M.F.; McMullen, T. Entropy of Charge Inversion in DNA including One-Loop Fluctuations. Entropy 2023, 25, 1373.
11. Feynman, R.P. Quantum Electrodynamics; Frontiers in Physics; W. A. Benjamin: New York, NY, USA, 1961.
12. Feynman, R.P. The Theory of Positrons. Phys. Rev. 1949, 76, 749.
13. Thomson, M. Modern Particle Physics; Cambridge University Press: Cambridge, UK, 2013.
14. Fisher, M.E. On the Dimer Solution of Planar Ising Models. J. Math. Phys. 1966, 7, 1776–1781.
15. Baker, J.C., III. Application of the Fisher Dimer Model to DNA Condensation. Master’s Thesis, Virginia Commonwealth University, Richmond, VA, USA, 2017.
16. Gradshteyn, I.; Ryzhik, I. Table of Integrals, Series, and Products; Elsevier: Boston, MA, USA, 2007.
17. Allegra, N.; Fortin, J.Y. Grassmannian representation of the two-dimensional monomer-dimer model. Phys. Rev. E 2014, 89, 062107.
18. Moore, G.; Read, N. Nonabelions in the Fractional Quantum Hall Effect. Nucl. Phys. B 1991, 360, 362–396.
19. Fradkin, E.; Nayak, C.; Tsvelik, A.; Wilczek, F. A Chern-Simons effective field theory for the Pfaffian quantum Hall state. Nucl. Phys. B 1998, 516, 704–718.
20. Dwight, H.B. Table of Integrals and Other Mathematical Data, 4th ed.; Macmillan: Toronto, ON, Canada, 1961.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).