Entropies of the Classical Dimer Model

John C. Baker; Marilyn F. Bishop; Tom McMullen

doi:10.3390/e27070693

,

and

¹

CACI International Inc., 16480 Commerce Dr., King George, VA 22485-5860, USA

²

Department of Physics, Virginia Commonwealth University, Richmond, VA 23284-2000, USA

^*

Author to whom correspondence should be addressed.

Entropy2025, 27(7), 693;https://doi.org/10.3390/e27070693

This article belongs to the Section Entropy and Biology

Version Notes

Order Reprints

Abstract

Biological processes often involve the attachment and detachment of extended molecules to substrates. Here, the classical dimer model is used to investigate these geometric effects on the free energy, which governs both the equilibrium state and the reaction dynamics. We present a simplified version of Fisher’s derivation of the partition function of a two-dimensional dimer model at filling factor

ν = 1

, which takes into account the blocking of two adjacent sites by each dimer. Physical consequences of the dimer geometry on the entropy that are not reflected in simpler theories are identified. Specifically, for dimers adsorbing on the DNA double helix, the dimer geometry gives a persistently nonzero entropy and there is a significant charge inversion as the force binding the particles to the lattice increases relative to the thermal energy, which is not true of the simple lattice gas model for the dimers, in which all the sites are independent.

Keywords:

entropy; dimer model; trace theorems; pfaffians; cruciform matrices; DNA charge; biological physics

1. Introduction

From metabolism to drug design, the free energy of a system controls its biological processes. The free-energy minimum determines the equilibrium state. The free energy gradients drive the kinetics of the biochemical reactions.

The free energy contains both energy and entropy terms. The energy of complex biomolecules can now be calculated quite accurately, especially using the quantum–chemical methods introduced by Hohenberg and Kohn [1] and extensively developed by many authors in subsequent work. The entropy, on the other hand, is less-extensively studied, although the free energy involves the difference between the energy and entropy terms. The entropy term is known to play an important role in at least some biochemical processes. An example of an entropy-driven biochemical reaction can be found in sickle-cell disease, where the carbon dioxide causes hemoglobin polymerization and oxygen from respiration reverses the process [2,3]. In addition, strategies for controlling the entropy are being used in self-assembling systems to generate novel materials in fields like colloids, macromolecular systems and nonequilibrium assembly [4].

Biomolecules are typically long-chain molecules. Arrays of one-dimensional chains on a surface can be created by numerical methods like self-avoiding walks, as can two-dimensional self-avoiding membranes [See Chapter 10 of Plishke and Bergersen, [5]]. Analytic results, however, are few. The simplest “long-chain” molecule is the dimer, and the first analytic derivations of the partition function of such a system were by Temperley, Fisher, and Kasteleyn [6,7,8] for the classical dimer model. The simplest closed form expression for the partition function of a one-dimensional chain appears in Fisher’s 1961 paper [7], and we will be presenting a simplified version of their derivation. In the dimer model, when a dimer attaches to a lattice, it blocks two adjacent sites from other dimers attaching. It is much more difficult to determine the number of possible arrangement of dimers on a lattice than for only monomers, because a monomer only blocks a single site.

Fisher’s solution method for the dimer model builds monomers as well as dimers into the initial formal construction of the partition function. The monomer term complicates the derivation. Although one set of anticommuting matrices, similar to the Dirac matrices

γ^{μ}

of relativistic quantum mechanics, is used to enforce the dimer constraint, the presence of the monomer term requires an additional set of anticommuting matrices in a product form, reminiscent of the chirality operator

γ^{5}

, to create homogeneity. Even with this construction, Fisher was unable to find an analytic solution to complete the evaluation of the partition function on a two-dimensional lattice with both monomers and dimers included. He therefore dropped the monomer term to reach their final analytic expression for the partition function Z in dimension

d = 2

.

Because of this, we present, beginning in Section 4, a simpler method of solving Fisher’s dimer model by neglecting the monomer term from the beginning, and hence restricting the monomer distribution on the simple square

d = 2

lattice to be completely filled with dimers, which we refer to as filling fraction

ν = 1

, the completely filled lattice with no empty sites. We then need only a single set of anticommuting operators

A^{μ}

that live on the links of the lattice and that we refer to as link fields. This method also gives Fisher’s final analytic expression for the partition function Z in

d = 2

. This result Z is for a finite lattice with the sites arranged in

n_{r}

rows and

n_{c}

columns.

Beginning in Section 8, we specifically use this result for the special case of a two-leg-ladder lattice with

n_{r} = 2

rows and

n_{c} \to \infty

columns to construct, in the end, the partition function for an infinitely long one dimensional chain of sites containing not just dimers but also monomers of two colors. We use this to produce results using the dimer model that parallel the results found in previous papers for the DNA double helix [9,10], which use a lattice gas model. We find results from the dimer model that differ from the simpler lattice gas models, in which all the sites are independent. Perhaps the most important of these is that the dimer geometry gives a persistently nonzero entropy, as the force binding the particles to the lattice increases relative to the thermal energy.

2. Partition Function, Entropy, and Occupancy

Although the classical dimer problem only requires classical physics, classical statistical mechanics is plagued with oddities like Gibbs’s paradox. This means that it is more reliable to start from the quantum-mechanical expression for the partition function of the grand canonical ensemble,

Z \equiv e^{- β Ω_{G}} = Tr (e^{- β (H - \sum_{j} μ_{j} N_{j}}),

(1)

where

Ω_{G}

is the free energy of this ensemble (often called the grand canonical potential), H is the quantum Hamiltonian of the system, N is the number of particles in the system and

μ_{j}

the chemical potential of the particle of type j. The thermal parameter

β \equiv \frac{1}{k_{B} T}

specifies the temperature T of the system, where

k_{B}

is Boltzmann’s constant. The trace here is over quantum states of the system. Taking the trace is equivalent to summing over all the possible configurations of the system.

The entropy is then given by

S = - {(\frac{\partial Ω_{G}}{\partial T})}_{V, {μ_{j}}}

, and the mean number of particles of type j is given by

⟨N_{j}⟩ = - {(\frac{\partial Ω_{G}}{\partial μ_{j}})}_{T, V, {μ_{k \neq j}}}

, where the subscripts indicate the quantities that are held constant in taking the partial derivatives.

Fisher’s dimer model describes dimers adsorbed on a two-dimensional lattice. Eventually, we will take the limit in which the lattice becomes large, and for that reason, it is useful to express the quantities as energy per lattice site and entropy per lattice site. Therefore, it is more reasonable to consider the entropy per site and the number of particles of a given type per site, which is the occupancy per site. It is then helpful to define the logarithm of partition function per site as

ln Z_{site} = lim_{N \to \infty} \frac{1}{N} ln Z,

(2)

which means that the entropy per site can be written as

S_{site} = lim_{N \to \infty} \frac{1}{N} \frac{\partial}{\partial T} {(\frac{1}{β} ln Z)}_{V, {μ_{γ}}} = \frac{\partial}{\partial T} (\frac{1}{β} ln Z_{site}) = - k_{B} β^{2} \frac{\partial}{\partial β} (\frac{1}{β} ln Z_{site}) .

(3)

The derivative with respect to

β

gives

\frac{\partial}{\partial β} (\frac{1}{β} ln Z_{site}) = - \frac{1}{β^{2}} ln Z_{site} + \frac{1}{β Z_{site}} \frac{\partial Z_{site}}{\partial β},

(4)

and then the entropy becomes

S_{site} = k_{B} ln Z_{site} - \frac{k_{B} β}{Z_{site}} \frac{\partial Z_{site}}{\partial β} .

(5)

A simple example of partition function is that of a lattice gas. Suppose that there are two kinds of particles, like red balls and blue balls. There are

(N_{r} + N_{b})! = N!

ways to fill the N sites of the lattice, where

N_{r}

and

N_{b}

are the numbers of red and blue balls. This

N!

is divided by

N_{r}!

, the number of ways to swap red balls, and still has the same configuration, then by

N_{b}!

, the number of ways to swap blue balls without changing the configuration. This gives the number of configurations of

N_{r}

red balls and

N_{b}

blue balls,

g_{N} (N_{r}, N_{b}) = \frac{N!}{N_{b}! N_{r}!} .

(6)

There are no forces between the balls, merely a force holding them on the lattice sites, and so the Hamiltonian is

H = ε_{r} N_{r} + ε_{b} N_{b},

(7)

while the chemical potential term becomes

μ_{r} N_{r} + μ_{b} N_{b}

. The partition function is then

Z = Tr (e^{- β (ε_{r} N_{r} + ε_{b} N_{b} - μ_{r} N_{r} - μ_{b} N_{b})}) .

(8)

The activity of the red balls is

r = e^{- β (ε_{r} - μ_{r})}

and of the blue balls is

b = e^{- β (ε_{b} - μ_{b})}

. If

g_{N} (N_{r}, N_{b})

is the number of states in the trace that contain

N_{r}

red balls and

N_{b}

blue balls, then the partition function for this system becomes

Z = \sum_{N_{r}, N_{b}} g (N_{r}, N_{b}) r^{N_{r}} b^{N_{b}} = \sum_{N_{r}, N_{b}} \frac{N!}{N_{r}! (N - N_{r})!} r^{N_{r}} b^{N_{b}},

(9)

where

N = N_{r} + N_{b}

. With the binomial theorem, this can be written as

Z = {(r + b)}^{N},

(10)

which is the partition function for red and blue balls on a lattice. The sites are independent, so that the factor

r + b

is the partition function contribution from each site. For an infinite lattice, the logarithm of the partition function per site is then

ln Z_{site} = lim_{N \to \infty} \frac{1}{N} ln {(r + b)}^{N} = ln (r + b) .

(11)

The entropy per site is then given by

S_{site} = k_{B} ln (r + b) - \frac{k_{B} β}{r + b} \frac{\partial (r + b)}{\partial β} .

(12)

Since

β

is contained in the activities r and b,

\frac{\partial r}{\partial β} = - (ε_{r} - μ_{r}) r = \frac{1}{β} r ln r

, and similarly for b. Inserting the explicit form of the partition function yields the entropy per site of the two-color lattice gas,

S_{site} = - k_{B} ln (r + b) + \frac{k_{B} β}{(r + b)} (r ln r + b ln b) .

(13)

We can similarly find the mean particle numbers by differentiating by the chemical potentials. For the red balls r,

⟨N_{r}⟩ = \frac{1}{β} \frac{\partial ln Z}{\partial μ_{r}} .

(14)

The number of red balls per site

⟨n_{r}⟩

is given by

⟨n_{r}⟩ = lim_{N \to \infty} \frac{⟨N_{r}⟩}{N} = \frac{1}{β} \frac{\partial}{\partial μ_{r}} ln Z_{site} = \frac{1}{β Z_{site}} \frac{\partial Z_{site}}{\partial μ_{r}} .

(15)

Substituting the partition function per site for the red balls, we have

⟨n_{r}⟩ = \frac{1}{β (r + b)} \frac{\partial (r + b)}{\partial μ_{r}} .

(16)

The derivative of r with respect to

μ_{r}

is given by

\frac{\partial r}{\partial μ_{r}} = β r

, and so the average occupation of the red balls is

⟨n_{r}⟩ = \frac{r}{r + b} .

(17)

Similarly, for blue balls, we have

⟨n_{b}⟩ = \frac{b}{r + b},

(18)

and these two expressions add to one, which they should.

Generalizing this argument to include red, blue, and green balls, we have

Z = {(r + b + g)}^{N},

(19)

with the partition function per site in the infinite lattice limit represented as

ln Z_{site} = ln (r + b + g),

(20)

the entropy per site as

S_{site} = - k_{B} ln (r + b) + \frac{k_{B} β}{(r + b)} (r ln r + b ln b + g ln g),

(21)

and the occupancies as

⟨n_{r}⟩ = \frac{r}{r + b + g}, ⟨n_{b}⟩ = \frac{b}{r + b + g}, ⟨n_{g}⟩ = \frac{g}{r + b + g} .

(22)

If one simply wants vacant sites rather than the green balls or holes in the lattice, the activity is replaced by

g = e^{- β (ε_{g} - μ_{g})} = 1

, because holes have neither an energy nor a chemical potential.

3. The Lattice and Its Dual

Dimer models describe the behavior of rods with ends that occupy sites on a lattice. However, two dimers cannot have their ends on the same site, and this is the fundamental constraint that makes determination of the allowed dimer configurations difficult. A physical example of a dimer might be a hydrogen molecule, with each of the two atoms held to a separate lattice site through electrostatic attraction. The lattice may be completely covered with dimers, or the coverage may be less than complete. In the latter case, the empty sites may be called vacancies, or alternatively regarded as occupied by monomers. The lattice can be of any dimension, although pictures of one- and two-dimensional lattices are easiest to draw.

Dimer models can be either classical or quantum. The difference is, basically, that in the quantum dimer model, the dimers can tunnel between sites, while classically, they can only move by thermally activated diffusive processes. The quantum dimer model is the basis for Anderson’s resonating-valence-bond theory of antiferromagnetism.

Fisher [7] considered a simple square lattice in two dimensions, with dimers placed randomly on this lattice. He then attempted to determine the number of different dimer arrangements. This result could, for example, be used to determine the entropy, because entropy is the logarithm of the number of possible states of a system, Fisher discovered that he could solve this problem when the lattice was fully occupied and there were no vacancies. The key, he found, was to randomly distribute “half dimers” on all sites of the lattice, as in the lattice gas problem. He then threw away all arrangements in which a half dimer was not connected to a neighbor because the orientations did not match. The remaining configurations were those in which dimers completely covered the lattice.

The trick he used to accomplish this feat was to tag each dimer with a member of a sufficiently large set of anticommuting objects

A_{j}

such that

{[A_{i}, A_{j}]}_{+} = A_{i} A_{j} + A_{j} A_{i} = 2 δ_{i j},

(23)

so that

A_{j}^{2} = I

, with I the identity, where the

A_{j}

s were represented by matrices. He then used the trace theorems that are employed in particle physics to simplify calculations involving products of Dirac matrices [11,12,13]. These caused the unwanted configurations to vanish, and he identified those that remained with a Pfaffian. Its evaluation gives the number of dimer configurations.

When the lattice is completely covered with dimers, the calculation simplifies because there are no monomers. The fraction of sites covered by dimers is the “filling fraction”,

ν

, and this complete dimer coverage corresponds to

ν = 1

.

The trace theorems evaluate products of the

A_{i}

s. We will need four different

A_{i}

s for each site in the lattice, and for all of them to be independent matrices, their dimension d will be quite large. Fortunately, all we need is the commutation law, because it defines their algebra. We will never have to actually write down any of these matrices.

We consider dimers adsorbed on a simple square lattice of sites, indicated by the red points in Figure 1. The dual lattice is formed by connecting the midpoints of the links of the original lattice. The links are the lines joining each pair of neighboring sites (red, dashed). The dual lattice is the lattice of blue points located at the midpoints of the links of the original red lattice shown in Figure 2.

Figure 1. A simple square lattice with lattice points represented by red dots and the links represented by red dashed lines.

Figure 2. A simple square lattice with lattice points represented by red dots and the links represented by red dashed lines. The blue dots label dual lattice points, and the links of the dual lattice are blue dashed lines.

Let us denote the lattice sites by Roman subscripts

i, j, \dots

. The anticommuting objects A actually live on the links of the lattice, or equivalently on the sites of the dual lattice. Because they live on the links, one can think of them as "link fields" that anticommute. Let us use Greek superscripts

α, β, γ, \dots

to indicate the links or sites of the dual lattice, so that these anticommuting link fields can be written as

A^{μ}

. These link fields are regarded here as operators that act on some set of vectors

| α ⟩

in a vector space. The introduction of a representation by using a set of basis functions allows these fields

A^{μ}

to be represented by matrices of some suitable dimension d. The trace is then the sum of the diagonal elements

⟨ α | A^{μ} | α ⟩

, which is invariant under a unitary transformation.

The next step is to develop the trace theorems involving the fields

A^{μ}

that are needed here. Because Kronecker deltas and identity matrices are so easy to sum out, it is easy to lose track of them. It is clearer to write the anticommutation law as

{[A^{μ}, A^{ν}]}_{+} = A^{μ} A^{ν} + A^{ν} A^{μ} = 2 g^{μ ν},

(24)

where

g^{μ ν} \equiv δ^{μ ν}

, the Euclidean-space metric. When matrices of dimension d are used to represent the

A^{μ}

, this becomes

g^{μ ν} \equiv δ^{μ ν} I_{d}

with

I_{d}

the d-dimensional identity matrix. We will need to evaluate expressions involving

\frac{1}{d} Tr (\dots)

, where

Tr (\dots)

means the trace over a product of matrices representing the

A^{μ}

.

A proof of the trace theorems of these matrices is given in Appendix A. A simpler way to look at the procedure is as follows. In the general product

A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}}

, use anticommutation to move

A^{μ}

s with the same index next to one another as pairs. This introduces a parity factor

{(- 1)}^{P}

. Then, replace each matched pair with

I_{d}

, since the anticommutator

{[A^{μ}, A^{ν}]}_{+} = 2 g^{μ ν}

gives

{(A^{μ})}^{2} = g^{μ μ} = I_{d}

. You will obtain

Tr (I_{d}) = d

if the product consists only of matched pairs. Otherwise, there will be at least one

g^{μ ν}

factor that has unmatched indices, and the entire product will vanish. The trace theorems for the

A^{μ}

are as follows:

3.1: If the general product $A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}}$ can be rearranged so that adjacent pairs of indices are the same, then

$\frac{1}{d} Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}}) = {(- 1)}^{P},$

(25)

where P is the appropriate parity index for the rearrangement.
3.2: If the general product $A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}}$ cannot be rearranged so that adjacent pairs of indices are the same, then

$Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}}) = 0 .$

(26)

4. The Dimer Model

One of the simplest models of a system containing diatomic molecules is that of lattice gas of

N_{d}

rigid dimers, each of which fills two nearest neighbor sites of a space lattice of

N_{sites}

sites. Fisher was only able to completely evaluate their result for a lattice completely filled,

ν = 1

, with dimers, leaving no vacancies or monomers, as shown in Figure 3. Furthermore, because we will eventually represent the lattice with its dimer arrangements by matrices, the y-axis is inverted so that the site

(x = 1, y = 1)

is in the upper left corner like the usual initial matrix element

M_{11}

.

Figure 3. Complete filling of dimers (green) on the simple square lattice (red dots and dashes).

We suppose that the dimers do not interact with one another apart from the geometric constraint that only one dimer can be attached to a given site. The dimers are bound to the lattice sites, and we let the total binding energy of a dimer be

ε

, which is twice the binding energy to each site because a dimer has two ends. We follow Fisher [7] in allowing the binding energy of dimers aligned in the two orthogonal directions to be different, and call them

ε_{x}

and

ε_{y}

. Then, the partition function becomes

Z = Tr (e^{- β [(ε_{x} - μ_{x}) N_{x} + (ε_{y} - μ_{y}) N_{y}]}) .

(27)

Here,

x \equiv e^{- β (ε_{x} - μ_{x})}

is the activity of an x-oriented dimer, and we also let

y \equiv e^{- β (ε_{y} - μ_{y})}

be the activity of a y-oriented dimer, following Fisher, who allowed the dimer activities in the two directions to differ. The trace over quantum states adds together the contributions of states with the same numbers

N_{x}

and

N_{y}

of dimers oriented in the two directions. If we define the number of such states to be

g (N_{x}, N_{y})

, an equivalent expression for the partition function is

Z = \sum_{N_{x}, N_{y}} g (N_{x}, N_{y}) x^{N_{x}} y^{N_{y}} .

(28)

which is actually valid for any filling fraction

ν

.

We now invent, following Fisher, a snake-like path through the lattice that allows us to use a one-dimensional numbering scheme to label the lattice sites. The numbering begins at the upper left corner and weaves back and forth along the x-direction, as shown in Figure 4. We note that this one-dimensional numbering scheme, serpentine numbering, also works for a three-dimensional lattice if one draws it on a long sheet of paper and then folds the paper with accordion-like pleats. For a lattice with

n_{r}

rows of sites (red) arranged one below another in the vertical y-direction and

n_{c}

columns parallel to one another in the horizontal x-direction, we have a lattice of

N_{sites} = n_{r} n_{c}

sites. The virtue of this numbering is that it is easy to determine the signs resulting from the interchanges required to move identical

A^{μ}

s adjacent to one another. For the x-links it is obvious, because the two ends of a dimer are on adjacent sites, so no interchange is needed, and the sign is plus. For a y-link, consider the following example: the link between sites 27 and 40 in Figure 4. The y-links (vertical red dashed lines) look like the ties between two rails on a railroad track, and these ties have two ends. Thus, to move

A^{μ}

from position 27 to 39 (adjacent to 40) requires moving to the right by six interchanges (

27 \to 28 \to 29 \to 30 \to 31 \to 32 \to 33

) along row three, and then moving back left along row 4 by another six interchanges (

33 \to 34 \to 35 \to 36 \to 37 \to 38 \to 39

). Because a tie has two ends, you will always obtain an even number of interchanges, and this will crucially make all terms positive in our construction of the partition function below.

Figure 4. Serpentine numbering of the lattice sites in green. The row and column numbers of the lattice are also shown The red dots and dashes outline the direct lattice, and the blue dots and dashes the dual lattice.

The dual lattice has twice as many sites as the original lattice because there are two links per site in the square lattice. Consequently, if the original lattice has

N_{sites}

sites, the dual lattice has

2 N_{sites}

links or points. However, there are only

\frac{1}{2} N_{sites}

distinct adjacent pairs of points in the original lattice, and

ν = 1

means that all of these

\frac{1}{2} N_{sites}

distinct adjacent pairs of points are occupied by dimers. This makes it generally useful to consider

N_{sites}

as even, because if it is odd, there is at least one vacancy.

An alternative way of labeling the links is to make the superscript

μ

of the link field

A^{μ}

a pair of site indices

μ = (j, k)

where j and k label the two ends of the link, where

j < k

. This leads to the link-field labels of Figure 5.

Figure 5. Alternative way of numbering of links of the lattice by using pairs of site indices, shown in blue, on top of Figure 4.

The link fields can be arranged as the upper right triangle of an

N_{sites} \times N_{sites}

matrix with zeros down the diagonal, suggestive of an antisymmetric matrix. For a smaller

12 \times 12

example with four rows and three columns but similar to the example above, the upper right triangle of entries

A^{j k}

is shown in Equation (29) and shown pictorially in Figure 6.

A^{j k} = (\begin{matrix} 0 & A^{1, 2} & 0 & 0 & 0 & 0 & 0 & A^{1, 8} & 0 & 0 & 0 & 0 \\ 0 & A^{2, 3} & 0 & 0 & 0 & A^{2, 7} & 0 & 0 & 0 & 0 & 0 \\ 0 & A^{3, 4} & 0 & A^{3, 6} & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & A^{4, 5} & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & A^{5, 6} & 0 & 0 & 0 & 0 & 0 & A^{5, 12} \\ 0 & A^{6, 7} & 0 & 0 & 0 & A^{6, 11} & 0 \\ 0 & A^{7.8} & 0 & A^{7, 10} & 0 & 0 \\ 0 & A^{8, 9} & 0 & 0 & 0 \\ 0 & A^{9, 10} & 0 & 0 \\ 0 & A^{10, 11} & 0 \\ 0 & A^{11, 12} \\ 0 \end{matrix}) .

(29)

The link fields

A^{1, 2}

,

A^{2, 3}

, and

A^{3, 4}

refer to x-directed links, and the link fields

A^{4, 5}

,

A^{3, 6}

,

A^{2, 7}

, and

A^{1, 8}

refer to y-directed links. There are no near-neighbor links corresponding to the elements for which zero is entered. In other words, the entries sloping diagonally upward in the direction lower left corner to upper right corner are y-directed links. The remaining ones sloping downward from the upper left to lower right corners are x-directed links in the pattern shown in the matrix M in Equation (30).

M = (\begin{matrix} 0 & x & 0 & 0 & 0 & 0 & 0 & y & 0 & 0 & 0 & 0 \\ 0 & x & 0 & 0 & 0 & y & 0 & 0 & 0 & 0 & 0 \\ 0 & x & 0 & y & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & y & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & x & 0 & 0 & 0 & 0 & 0 & y \\ 0 & x & 0 & 0 & 0 & y & 0 \\ 0 & x & 0 & y & 0 & 0 \\ 0 & y & 0 & 0 & 0 \\ 0 & x & 0 & 0 \\ 0 & x & 0 \\ 0 & x \\ 0 \end{matrix}) .

(30)

Figure 6. A smaller

12 \times 12

example with four rows and three columns, but similar to Figure 5, where the serpentine numbering is shown in green, and the alternative numbering in blue. As before, the red dots and dashes outline the direct lattice, and the blue dots and dashes the dual lattice.

5. Inclusion of Constraints on the Partition Function by Use of a Child’s Toy

Suppose that the lattice has square holes with sides parallel to x and y at each lattice site. Into each of these holes, we insert a toy, like a child’s top with a square shaft that just fits the hole, as shown in Figure 7. The disk that provides most of the top’s angular momentum has a green line painted on it normal to one of the faces of the shaft. This means that the green line can be oriented four ways—north, south, east, and west.

Figure 7. Square holes in a simple square lattice with a toy with a square shaft that can fit into a hole in one of four directions.

The orientations of the disk are random, so they are distributed like a lattice gas of four colors, as shown in Figure 8. Sometimes, the green lines of neighboring disks point toward one another, and for any adjacent pair, the probability of this occurring is

1 / (4 \times 4) = 1 / 16

.

Figure 8. Toys randomly arranged on the simple square lattice, shown as the red square holes and connecting dashes. The half-dimers are represented by the green lines for this particular member of the ensemble, with the red circles the outlines of the toys in Figure 7. Not all the half-dimers point toward each other in pairs, and so taking the trace over link fields will cause this distribution to be removed from the ensemble.

Now, suppose that the green lined represent “half-dimers”, and when green lines point toward one another on adjacent sites, those two half-dimers join up to form a complete dimer between those two sites. In Figure 9, the figure on the left shows the half-dimers pointing toward one another on adjacent sites, and the figure on the right shows the corresponding dimer arrangement.

Figure 9. Half-dimers arranged on the simple square lattice. In the left figure, the dimers all point toward one another in pairs, so that taking the trace over link fields will retain this as a member of the ensemble, unlike the random arrangement of Figure 8. This corresponds to the completely filled dimer arrangement on the right, which is the same as in Figure 3.

We construct the partition function in a way that allows the constraints of no double occupancy and a

ν = 1

completely filled dimer arrangement to be imposed. Associate with each lattice site j a function

V_{j} = \sum_{\begin{matrix} l = 1 \\ n n \end{matrix}} \sqrt{z_{j l}} A^{j l},

(31)

where the sum is over the nearest neighbors of site j, as emphasized by the subscript

n n

on the sum. The quantity

z_{j l}

is the activity of a dimer on the link

j l

, and is either

x_{j l}

if the link is in the x-direction, or

y_{j l}

if the link is in the y-direction. The factor

A^{j k}

is the link field associated with the link

j l

. The square root of the activity is taken because it is the activity of a half-dimer, the object represented by the green line on the disk of the toy. It takes two of these factors,

{(\sqrt{z_{j l}})}^{2} = z_{j l}

, to give the activity of the dimer on the link. One factor can be thought of as emanating from the site j, and the other from the site ℓ.

The partition function is constructed as a product of these factors

V_{j}

, one rooted on each lattice site j. To see how this will work, suppose that we have a product of only two lattice sites called j and k. The product is

\begin{matrix} V_{j} V_{k} & = & (\sum_{\begin{matrix} l = 1 \\ n n \end{matrix}}^{4} \sqrt{z_{j l}} A^{j l}) (\sum_{\begin{matrix} m = 1 \\ n n \end{matrix}}^{4} \sqrt{z_{k m}} A^{k m}) \\ = & (\sqrt{x_{j l_{1}}} A^{j l_{1}} + \sqrt{y_{j l_{2}}} A^{j l_{2}} + \sqrt{x_{j l_{3}}} A^{j l_{3}} + \sqrt{x_{j l_{4}}} A^{j l_{4}}) \times \\ \times (\sqrt{x_{k m_{1}}} A^{k m_{1}} + \sqrt{y_{k m_{2}}} A^{k m_{2}} + \sqrt{x_{k m_{3}}} A^{k m_{3}} + \sqrt{y_{k m_{4}}} A^{k m_{4}}) \\ = & \sqrt{x_{j l_{1}}} \sqrt{k_{k m_{1}}} A^{j l_{1}} A^{k m_{1}} + \sqrt{x_{j l_{1}}} \sqrt{y_{k m_{2}}} A^{j l_{1}} A^{k m_{2}} \\ + \sqrt{x_{j l_{1}}} \sqrt{x_{k m_{3}}} A^{j l_{1}} A^{k m_{3}} + \sqrt{x_{j l_{1}}} \sqrt{y_{k m_{4}}} A^{j l_{1}} A^{k m_{4}} + \dots . \end{matrix}

(32)

If sites j and k are not nearest neighbors, then none of the link fields are common to both factors, and taking

\frac{1}{d} T r \dots

simply gives zero and the term vanishes. None of the sixteen terms in the last line then will contribute to the partition function.

The situation is different when j and k are nearest neighbors. Then, there will be one link field in common. Suppose, for example, that it is on an x-directed link, arising from

V_{j} V_{k} = \dots + \sqrt{x_{j l_{1}}} \sqrt{x_{k m_{3}}} A^{j l_{1}} A^{k m_{3}} + \dots = \dots + \sqrt{x_{j k}} \sqrt{x_{k j}} A^{j k} A^{k j} + \dots,

(33)

because j and k are nearest neighbors in the x-direction with k to the right of j. Since the order of subscripts on a link does not matter,

\sqrt{x_{k j}} \sqrt{x_{k j}} = {(\sqrt{x_{k j}})}^{2} = x^{2}

. Thus, this term survives when the trace is taken.

In this way, the partition function,

Z = \frac{1}{d} Tr \prod_{j = 1}^{N_{sites}} V_{j} = \frac{1}{d} Tr V_{1} V_{2} \dots V_{N_{sites}},

(34)

is an expression in which the only surviving terms are those which give

ν = 1

dimer arrangements, as shown, for example, in Figure 9. The numbering is one-dimensional,

123 \dots N_{sites}

, because we use one-dimensional serpentine ordering.

6. Introduction of the Antisymmetric Matrix

A more concise notation is useful. Furthermore, we want the contributions of the even-numbered and the odd-numbered sites to the partition function to be written slightly differently. We begin by writing

V_{j} = \sum_{l = 1}^{4} v_{j l} A^{j l},

(35)

so that

v_{j l} \equiv \sqrt{z_{j l}}

, which is, of course, either

\sqrt{x}

or

\sqrt{y}

, and where the subscript “

n n

" on the sum is omitted, although the restriction of the sum in

V_{j}

to nearest neighbors of j will remain understood.

Now, consider the product,

\prod_{j = 1}^{N_{sites}} V_{j} = V_{1} V_{2} \dots V_{N_{sites}},

(36)

which appears in the partition function. The number of sites is chosen to be even, and we insert

1 = (- i) i

between each odd–even pair, giving

\begin{matrix} \prod_{j = 1}^{N_{sites}} V_{j} & = & V_{1} (- i) i V_{2} V_{3} (- i) i V_{4} \dots V_{N_{sites} - 1} (- i) i V_{N_{sites}} \\ = & (- i V_{1}) (i V_{2}) (- i V_{3}) (i V_{4}) \dots (- i V_{N_{sites} - 1}) (i V_{N_{sites})} . \end{matrix}

(37)

Then, for odd-numbered sites, we have

Q_{j = o d d} \equiv (- i) V_{j = o d d} = \sum_{l = 1}^{4} (- i) v_{j l} A^{j l},

(38)

and for even-numbered sites, we have

Q_{j = e v e n} \equiv i V_{j = e v e n} = \sum_{l = 1}^{4} i v_{j l} A^{j l} .

(39)

These relations can be combined as

Q_{j} = \sum_{l = 1}^{4} {(- i)}^{j} i V_{j l} A^{j l},

(40)

and the original product becomes

\prod_{j = 1}^{N_{sites}} V_{j} = \prod_{j = 1}^{N_{sites}} Q_{j} .

(41)

Finally, let us set

Z = \frac{1}{d} Tr \prod_{j = 1}^{N_{sites}} Q_{j} = \frac{1}{d} Tr Q_{1} Q_{2} \dots Q_{N_{sites}} .

(42)

The quantities

Q_{j}

are operators formed as linear combinations of the link fields. The link fields satisfy the simple anticommutation law

{[A^{μ}, A^{ν}]}_{+} = A^{μ} A^{ν} + A^{ν} A^{μ} \equiv 2 g^{μ ν} = 2 δ^{μ ν},

(43)

or, for d-dimensional matrix representations thereof,

{[A^{μ}, A^{ν}]}_{+} = A^{μ} A^{ν} + A^{ν} A^{μ} \equiv 2 g^{μ ν} = 2 δ^{μ ν} I_{d} .

(44)

We now show that the

Q_{j}

operators also anticommute, and calculate the values of their anticommutators. We have

{[Q_{j}, Q_{k}]}_{+} = Q_{j} Q_{k} + Q_{k} Q_{j} = \sum_{l = 1}^{4} q_{j l} A^{j l} \sum_{m = 1}^{4} q_{k m} A^{k m} + \sum_{m = 1}^{4} q_{k m} A^{k m} \sum_{l = 1}^{4} q_{j l} A^{j l} .

(45)

Writing this in terms of the anticommutators of the link fields gives

{[Q_{j}, Q_{k}]}_{+} = \sum_{l = 1}^{4} q_{j l} \sum_{m = 1}^{4} q_{k m} (A^{j l} A^{k m} + A^{k m} A^{j l}) = 2 \sum_{l = 1}^{4} \sum_{m = 1}^{4} q_{j l} q_{k m} δ^{(j l), (k m)},

(46)

where

δ^{(j l), (k m)}

gives unity if

j l

and

k m

denote the same link field and zero otherwise. They only denote the same link field

A^{μ}

if the lattice sites j and k are nearest neighbors, and then only if

m = j

and

l = k

, that is,

δ^{(j l), (k m)} = δ_{j m} δ_{k l}

. The anticommutator becomes

{[Q_{j}, Q_{k}]}_{+} = 2 \sum_{l = 1}^{4} \sum_{m = 1}^{4} q_{j l} q_{k m} δ_{j m} δ_{k l},

(47)

which reduces to

{[Q_{j}, Q_{k}]}_{+} = 2 q_{j k} q_{k j} .

(48)

Finally, we write this in terms of the notation

q_{j l} = {(- 1)}^{j} i v_{j l}

of Section 5. This gives

{[Q_{j}, Q_{k}]}_{+} = 2 v_{j k} v_{k j},

(49)

if j and k are nearest neighbors, and zero otherwise.

Furthermore,

v_{j l} = \sqrt{z_{j l}}

, the square root of the activity of a dimer on the link

j l

, which does not depend on the order of the indices. Thus, the anticommutator of the operators

Q_{j}

and

Q_{k}

can be written as

{[Q_{j}, Q_{k}]}_{+} = 2 {(v_{j k})}^{2} = 2 z_{j k},

(50)

the activity x or y of the dimer occupying that link.

The expression giving the partition function is

Z = \frac{1}{d} Tr \prod_{j = 1}^{N_{sites}} Q_{j} = \frac{1}{d} Tr Q_{1} Q_{2} \dots Q_{N_{sites}} .

(51)

In order to have the lattice completely filled with dimers (filling factor

ν = 1

), the number

N_{sites}

of lattice sites must be even in order to avoid a single essential but unwanted left over vacant site. As a reminder of this, let us temporarily set

N_{sites} = 2 h

, and write the partition function as

Z = \frac{1}{d} Tr Q_{1} Q_{2} \dots Q_{2 h} .

(52)

To make further progress, we need to find a way to evaluate the product

\prod_{j = 1}^{2 h} Q_{j}

that appears in the partition function

Z = \frac{1}{d} Tr \prod_{j = 1}^{2 h} Q_{j} .

(53)

To do this, we successively use the anticommutator of the operators

Q_{j}

from Equation (50).

Let us write the anticommutator as

{[Q_{j}, Q_{k}]}_{+} = 2 z_{j k} - Q_{k} Q_{j},

(54)

and use this expression to rearrange the product as

\prod_{j = 1}^{2 h} Q_{j} = Q_{1} Q_{2} \dots Q_{2 h} = 2 z_{12} Q_{3} Q_{4} \dots Q_{2 h} + (- 1) Q_{2} Q_{1} Q_{3} Q_{4} \dots Q_{2 h} .

(55)

The second term on the right has

Q_{1}

displaced one step to the right, and can be written with the aid of the anticommutation rule as

\begin{matrix} (- 1) Q_{2} Q_{1} Q_{3} Q_{4} \dots Q_{2 h} & = & (- 1) Q_{2} (2 z_{13}) Q_{4} \dots Q_{2 h} + {(- 1)}^{2} Q_{2} Q_{3} Q_{1} Q_{4} \dots Q_{2 h} \\ = & (- 1) (2 z_{13}) Q_{2} Q_{4} Q_{5} \dots Q_{2 h} + {(- 1)}^{2} Q_{2} Q_{3} Q_{1} Q_{4} \dots Q_{2 h} . \end{matrix}

(56)

We continue to move

Q_{1}

all the way to the right, requiring

2 h - 1

interchanges. We then take the trace of both sides, and use the cyclic property of the trace to move

Q_{1}

back to the beginning, thereby reproducing the trace of the left side accompanied by

{(- 1)}^{2 h - 1} = - 1

. In the course of performing this, we “contract”

Q_{1}

and

Q_{k}

for

k = 2, 3, \dots, 2 h

, producing

2 h - 1

terms containing a factor of

z_{1 k}

, each accompanied by

{(- 1)}^{k - 1}

. This result is

2 Tr \prod_{j = 1}^{2 h} Q_{j} = \sum_{k = 2}^{2 h} {(- 1)}^{k - 1} (2 z_{1 k}) Tr [Q_{2} Q_{3} Q_{4} \dots Q_{k - 1} Q_{k + 1} \dots Q_{2 h}] .

(57)

The factors of two can cancel. Furthermore,

z_{1 k}

for

k = 2, 3, \dots, 2 h

is the top row of a Pfaffian of order

2 h

.

Appendix B contains a brief description of Pfaffians and an example of how a simple Pfaffian is evaluated. On iterating, each step produces a single additional factor of

z_{k l}

in every one of the terms resulting from the trace on the right. The end result is that the trace of Equation (57) produces the Pfaffian

Tr \prod_{j = 1}^{2 h} Q_{j} = Pf [z_{j k}] .

(58)

If j and k are extended to cover the entire lattice, the complete entries in the Pfaffian and the upper right triangle of the corresponding antisymmetric matrix will be filled in. In the present context, the only nonvanishing activities

z_{j k}

are on links with j and k nearest-neighbor sites, and all the remaining entries are zero. This result is found in Fisher’s paper [14] and references therein, and is written in detail in J. C. Baker’s MS thesis [15].

Earlier, an example with three rows and four columns of sites was discussed, and is shown in Figure 6. The

12 \times 12

antisymmetric matrix associated with the Pfaffian of this

3 \times 4

lattice example, written in block form, is

{}^{(3 \times 4)}M = (\begin{matrix} 0 & x & 0 & 0 & 0 & 0 & 0 & y & 0 & 0 & 0 & 0 \\ - x & 0 & x & 0 & 0 & 0 & y & 0 & 0 & 0 & 0 & 0 \\ 0 & - x & 0 & x & 0 & y & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & - x & 0 & y & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & - y & 0 & x & 0 & 0 & 0 & 0 & 0 & y \\ 0 & 0 & - y & 0 & - x & 0 & x & 0 & 0 & 0 & y & 0 \\ 0 & - y & 0 & 0 & 0 & - x & 0 & x & 0 & y & 0 & 0 \\ - y & 0 & 0 & 0 & 0 & 0 & - x & 0 & y & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & - y & 0 & x & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & - y & 0 & - x & 0 & x & 0 \\ 0 & 0 & 0 & 0 & 0 & - y & 0 & 0 & 0 & - x & 0 & x \\ 0 & 0 & 0 & 0 & - y & 0 & 0 & 0 & 0 & 0 & - x & 0 \end{matrix}) .

(59)

and that block form consists of the nine

4 \times 4

blocks, denoted as

{}^{(3 \times 4)}M = (\begin{matrix} X & Y & 0 \\ - Y & X & Y \\ 0 & - Y & X \end{matrix}),

(60)

as shown in Figure 10.

Figure 10. The block matrix form of the

12 \times 12

antisymmetric matrix. The red lines outline the

4 \times 4

blocks, and the light blue X, Y and

- Y

labels in the background signify the X, Y and

- Y

blocks of the matrix.

This is also a tridiagonal matrix, where the blocks X, Y, and 0 are

4 \times 4

blocks. These blocks are

X = (\begin{matrix} 0 & x & 0 & 0 \\ - x & 0 & x & 0 \\ 0 & - x & 0 & x \\ 0 & 0 & - x & 0 \end{matrix})

(61)

and

Y = (\begin{matrix} 0 & 0 & 0 & y \\ 0 & 0 & y & 0 \\ 0 & y & 0 & 0 \\ y & 0 & 0 & 0 \end{matrix}),

(62)

while “0” stands for a

4 \times 4

block of sixteen zeros. In what follows, we will be using the fact that matrices can be multiplied block by block, and this is shown in Appendix C.

Consider the upper right triangle that forms the corresponding Pfaffian. The first row has

n_{c} - 1

x-oriented links between the

n_{c}

sites, and so the block X has

n_{c} - 1

entries x in the direction of the diagonal. For the

3 \times 4

example, there are four columns, so

n_{c} - 1 = 3

and there are three entries x in the block X. They are offset from the diagonal by one, so the dimensions of the block X shown are

4 \times 4

. In general, the blocks X are

(n_{c} - 1 + 1) \times (n_{c} - 1 + 1) = n_{c} \times n_{c}

.

Between each row and the one below it, there are

n_{c}

vertical y links, and so there are

n_{c}

entries of y up the diagonal line from the lower left to upper right corner. In the

3 \times 4

example,

n_{c} = 4

, and there are four of these so the block Y is

4 \times 4

. In general, the blocks Y are of dimension

n_{c} \times n_{c}

, just like the blocks X, as they must be to fill the original matrix of dimension

N_{sites} \times N_{sites}

, because

N_{sites} = n_{r} n_{c}

. There are

n_{r}^{2}

blocks with

n_{c}^{2}

entries to account for the

N_{sites}^{2}

entries in the antisymmetric matrix M.

In general, then, when the lattice of sites has

n_{r}

rows and

n_{c}

columns, the dimension of each individual block is

n_{c} \times n_{c}

. The matrix of blocks, on the other hand, has dimension

n_{r} \times n_{r}

. The total number of matrix elements is then

n_{r}^{2} n_{c}^{2}

, which is the square of the number of sites

N_{sites} = n_{r} n_{c}

.

7. Eigenvalues and Eigenvectors of the Asymmetric Matrix

The scheme that makes the calculation simplest is to diagonalize the X block and use those eigenvectors as the basis to transform the Y block. This will lead to a cruciform version of the big matrix, which will be convenient for the calculations. Therefore, first we investigate the eigenvalues and eigenvectors of the block X. The eigenvalue equation for X is given by

X v = λ v,

(63)

where

λ

is an eigenvalue and v a column vector with entries

v_{1}, v_{2}, \dots, v_{n_{c}}

. The set of linear equations represented by this equation have the form

(\begin{matrix} - λ & x & 0 & \dots & 0 \\ - x & - λ & x & \dots & 0 \\ ⋮ & ⋱ & ⋱ & ⋱ & ⋮ \\ 0 & \dots & - x & - λ & x \\ 0 & \dots & 0 & - x & - λ \end{matrix}) (\begin{matrix} v_{1} \\ v_{2} \\ v_{3} \\ ⋮ \\ v_{n_{c} - 1} \\ v_{n_{c}} \end{matrix}) = (\begin{matrix} - λ v_{1} + x v_{2} \\ - x v_{1} - λ v_{2} + x v_{3} \\ ⋮ \\ - x v_{n_{c} - 2} - λ v_{n_{c} - 1} + x v_{n_{c}} \\ - x v_{n_{c} - 1} - λ v_{n_{c}} \end{matrix}) = (\begin{matrix} 0 \\ 0 \\ ⋮ \\ 0 \\ 0 \end{matrix}) .

(64)

We can make all the linear equations look the same if we imagine padding the list with

v_{0}

and

v_{n_{c} + 1}

and then apply the boundary condition

v_{0} = v_{n_{c} + 1} = 0

. Then, we have

(\begin{matrix} - x v_{0} - λ v_{1} + x v_{2} \\ - x v_{1} - λ v_{2} + x v_{3} \\ ⋮ \\ - x v_{n_{c} - 2} - λ v_{n_{c} - 1} + x v_{n_{c}} \\ - x v_{n_{c} - 1} - λ v_{n_{c}} + x v_{n_{c} + 1} \end{matrix}) = (\begin{matrix} 0 \\ 0 \\ ⋮ \\ 0 \\ 0 \end{matrix}) .

(65)

The generic equation is the difference equation

- x v_{k - 1} - λ v_{k} + x v_{k + 1},

(66)

which can be rewritten as

\frac{v_{k + 1} - v_{k - 1}}{2 a} = \frac{λ}{2 a x} v_{k} .

(67)

Regarding a as a lattice spacing, we recognize this equation as the discretized version of the first-order differential equation

\frac{d v}{d t} = \frac{λ}{2 a x} v (t),

(68)

with exponential solutions

v (t) = e^{\frac{λ t}{2 a x}}

that are growing or decaying if the eigenvalue

λ

is real. If

λ

is imaginary, the solutions oscillate.

A solution that satisfies the boundary conditions is given by

v_{k} = A (e^{i k φ} - e^{i π k} e^{- i k φ}) = A [e^{i k φ} - {(- 1)}^{k} e^{- i k φ}] .

(69)

This gives

sin (k φ)

for k even but

cos (k φ)

for k odd.

The boundary condition at

k = 0

is

v_{0} = A [1 - {(- 1)}^{0} 1] = A (1 - 1) = 0,

(70)

which is correct. At

k = n_{c} + 1

we have

\begin{matrix} v_{k = n_{c} + 1} & = & A [e^{i (n_{c} + 1) φ} - {(- 1)}^{(n_{c} + 1)} e^{- i (n_{c} + 1) φ}] \\ = & A [e^{i (n_{c} + 1) φ} - {(e^{i π})}^{(n_{c} + 1)} e^{- i (n_{c} + 1) φ}] \\ = & A [e^{i (n_{c} + 1) φ} - e^{- i (n_{c} + 1) (π - φ)}] . \end{matrix}

(71)

We also require this to vanish, which occurs if the quantity in brackets vanishes, that is, if

e^{i (n_{c} + 1) φ} = e^{- i (n_{c} + 1) (π - φ)} .

(72)

Taking logarithms of both sides gives

i (n_{c} + 1) φ = - i (n_{c} + 1) (π - φ) + 2 π i ω,

(73)

where ω is a winding number that gives the number of times that the point

φ + 2 π ω

circles the singularity that terminates the branch cut of the logarithm of

e^{i (n_{c} + 1) φ}

. As a result, the angle

φ

is quantized, that is, it acquires only discrete values given by

2 (n_{c} + 1) φ = (n_{c} + 1) π + 2 π ω,

(74)

so that

φ

only takes the values

φ = \frac{π}{2} + \frac{ω}{n_{c} + 1} π,

(75)

that is, the discrete values

φ = \frac{π}{2} + \frac{π}{n_{c} + 1}, \frac{π}{2} + \frac{2 π}{n_{c} + 1}, \dots, \frac{π}{2} + \frac{n_{c} π}{n_{c} + 1}

. The values

φ = 0

and

φ = π

are the boundary points where the solution vanishes that were added by padding the ends of the list. The components of the eigenvector

v_{k}^{ω}

are

v_{k}^{ω} = A [e^{i k φ_{ω}} - {(- 1)}^{k} e^{- i k φ_{ω}}],

(76)

with A the normalization constant determined from

\sum_{k = 1}^{n_{c} - 1} {|v_{k}^{ω}|}^{2} = 1

. Figure 11 shows the discrete values of the angle

φ_{ω}

for the example with eleven columns.

Figure 11. Discrete values of the angle

φ_{ω}

for the example with eleven columns.

The previous derivation of the eigenvalue

λ

is unchanged. Insert one of the terms of the generic solution

v_{k} = e^{\pm i k φ}

—either one will do—into the difference equation

v_{k + 1} - v_{k - 1} = \frac{λ}{x} v_{k} .

(77)

Substituting

v_{k} = e^{i k φ}

yields

\frac{λ}{x} = e^{i φ} - e^{- i φ} = 2 i sin φ,

(78)

and as long as the eigenvalue

λ

satisfies this equation,

v_{k} = e^{\pm i k φ}

is a solution to the difference equation. When we include the boundary-condition requirements, the angle

φ

is restricted to the discrete values

φ_{ω}

, and the eigenvalues of X are

λ_{ω} = 2 i sin φ_{ω} .

(79)

Figure 12 shows the values of the eigenvalue

λ_{ω}

for the example with eleven columns.

Figure 12. Eigenvalue

λ_{ω}

for the example with eleven columns.

The expression for the eigenfunction,

v_{k}^{ω} = A [e^{i k φ_{ω}} - {(- 1)}^{k} e^{- i k φ_{ω}}],

(80)

can be rewritten by inserting the explicit form of the angle

φ_{ω} = \frac{π}{2} + \frac{ω π}{n_{c} + 1}

. This gives

\frac{1}{A} v_{k}^{ω} = e^{i k (\frac{π}{2} + \frac{ω π}{n_{c} + 1})} - e^{i π k} e^{- i k (\frac{π}{2} + \frac{ω π}{n_{c} + 1})} = e^{i \frac{π}{2} k} [e^{i k \frac{ω π}{n_{c} + 1}} - e^{- i k \frac{ω π}{n_{c} + 1}}] = (i^{k}) [2 i sin (\frac{π ω k}{n_{c} + 1})],

(81)

giving

v_{k}^{ω} = 2 A i^{k + 1} sin (\frac{π ω k}{n_{c} + 1}) .

(82)

Let us now find the normalization constant using

\sum_{k = 1}^{n_{c} - 1} {|v_{k}^{ω}|}^{2} = 1

. The components of the eigenvector are

v_{k}^{ω} = A [e^{i k φ_{ω}} - {(- 1)}^{k} e^{- i k φ_{ω}}]

with

φ_{ω} = (\frac{π}{2} + \frac{ω π}{n_{c} + 1})

. Substituting the expression for

φ_{ω}

gives

\begin{matrix} {|v_{k}^{ω}|}^{2} & = & {|A|}^{2} [e^{i k φ_{ω}} - {(- 1)}^{k} e^{- i k φ_{ω}}] [e^{i k φ_{ω}} - {(- 1)}^{k} e^{- i k φ_{ω}}] \\ = & {|A|}^{2} [1 - {(- 1)}^{k} e^{2 i k φ_{ω}} - {(- 1)}^{k} e^{- 2 i k φ_{ω}} + {(- 1)}^{2 k}] . \end{matrix}

(83)

We then have

{|v_{k}^{ω}|}^{2} = {|A|}^{2} [1 + {(- 1)}^{2 k} - 2 {(- 1)}^{k} cos (2 k φ_{ω})] .

(84)

Now,

{(- 1)}^{2 k} = 1

because

2 k

is always even, giving

{|v_{k}^{ω}|}^{2} = {|A|}^{2} [2 - 2 {(- 1)}^{2 k} cos (2 k φ_{ω})],

(85)

where

i^{k} {(- i)}^{k} = 1

. The normalization constant is then given by the finite sum

\sum_{k = 1}^{n_{c}} {|v_{k}^{ω}|}^{2} = \sum_{k = 1}^{n_{c}} {|A|}^{2} [2 - 2 {(- 1)}^{2 k} cos (2 k φ_{ω})] = 1,

(86)

which gives

\frac{1}{2 {|A|}^{2}} = \sum_{k = 1}^{n_{c}} [1 - {(- 1)}^{k} cos (2 k φ_{ω})] .

(87)

Then, since a shift in phase of

k π

gives another factor of

{(- 1)}^{k}

,

\sum_{k = 1}^{n_{c}} {(- 1)}^{k} cos (2 k φ_{ω}) = \sum_{k = 1}^{n_{c}} {(- 1)}^{2 k} cos [2 k (\frac{π}{2} + \frac{ω π}{n_{c} + 1})] = \sum_{k = 1}^{n_{c}} cos (\frac{2 π k ω}{n_{c} + 1}) .

(88)

The value of the sum over cosines is

- 1

, and the normalization factor is

|A| = \frac{1}{\sqrt{2 (n_{c} + 1)}} .

(89)

Taking A to be real and positive, the normalized eigenvectors are

v_{k}^{ω} = \frac{1}{\sqrt{2 (n_{c} + 1)}} [e^{i k φ_{ω}} - {(- 1)}^{k} e^{- i k φ_{ω}}] = \sqrt{\frac{2}{n_{c} + 1}} i^{k + 1} sin (\frac{π k ω}{n_{c} + 1}),

(90)

with

φ = \frac{π}{2} + \frac{ω π}{n_{c} + 1}

and eigenvalue

λ_{ω} = 2 i x sin φ_{ω}

. The subscript k distinguishes the various components of a given eigenvector

v^{ω}

. The values

φ = 0

and

π

are the boundary points where

v_{k}^{ω}

vanishes. In Figure 13, are the first four eigenvectors for

n_{c} = 11

and various winding numbers ω. The points show the values, while the lines are merely to guide the eye.

Figure 13. First four and the last eigenvectors

v_{k}^{ω}

for the example with eleven columns.

The previous example of a lattice of sites that has three rows and four columns of sites was shown in Figure 6. This lattice has

3 \times 4 = 12

sites, so the dimensions of the antisymmetric matrix M that gives the square of the partition function through

Z^{2} = {(Pf [M])}^{2} = det [M]

(91)

is

12 \times 12

, that is,

N_{sites} \times N_{sites}

. This matrix M provides an example of the block-matrix form of the antisymmetric matrix M having the Pfaffian array as its upper right triangle, and is given in terms of the dimer activities x and y by the matrix shown in Figure 10.

This particular matrix has 12 rows and 12 columns, and consists of

3 \times 3 = 9

blocks called X, Y, and zero. The matrix is divided into blocks by the red lines in Figure 10, and, in general, such a matrix has the form

M = (\begin{matrix} X & Y \\ - Y & X & ⋱ \\ ⋱ & ⋱ & Y \\ - Y & X \end{matrix}) .

(92)

For clarity, the zeros are not shown. This is a tridiagonal block matrix, and this form always arises when using serpentine numbering of the lattice sites. The size of each block is the number of sites along a single row of the lattice, as is most easily seen by examining the blocks Y. The number of blocks is the square of the number of rows in the lattice of sites.

Earlier, we described the diagonalization procedure for a matrix with the form of the block X. That same procedure can be used to find eigenvectors and eigenvalues of the block form of M. One can manipulate the blocks just like matrix elements, because block matrices can be multiplied block by block. The notation, though, can be confusing if the matrix, which is M, is denoted by M both when it is written as matrix elements and when it is written in blocks. Consequently, let us call the block form B, writing

B = (\begin{matrix} X & Y \\ - Y & X & ⋱ \\ ⋱ & ⋱ & Y \\ - Y & X \end{matrix})

(93)

when it is in block form, and

M = (\begin{matrix} 0 & x \\ - x & 0 & ⋱ \\ ⋱ & ⋱ & x \\ - x & 0 \end{matrix})

(94)

when it is written in terms of its matrix elements

m_{j k} = x, y, 0

.

We will need here our earlier observation that, in general, when the lattice of sites has

n_{r}

rows and

n_{c}

columns, the dimension of each individual block is

n_{c} \times n_{c}

. The matrix of blocks, on the other hand, has the dimension

n_{r} \times n_{r}

. The total number of matrix elements is then

n_{r}^{2} n_{c}^{2}

, which is the square of the number of sites

N_{sites} = n_{r} n_{c}

.

The eigenvectors of the block matrix are the vectors u with components

u_{j}

that satisfy

B u = Λ u .

(95)

The components

u_{j}

will actually be vectors, so that a given

u_{j}

may have components

v_{k}

as in the eigenvectors of the block X given in Equation (90). That, however, does not concern us here. Eventually, though, the transformed matrix M will be cruciform.

Writing the eigenvector in terms of its components, the eigenvalue equation is

\begin{matrix} B u & = & (\begin{matrix} X & Y \\ - Y & X & ⋱ \\ ⋱ \\ ⋱ & ⋱ & Y \\ - Y & X \end{matrix}) (\begin{matrix} u_{1} \\ ⋮ \\ u_{l} \\ ⋮ \\ u_{n_{r}} \end{matrix}) = (\begin{matrix} X u_{1} + Y u_{2} + \dots \\ ⋮ \\ - Y u_{l - 1} + X u_{l} + Y u_{l + 1} + \dots \\ ⋮ \\ - Y u_{n_{r} - 1} + X u_{n_{r}} + + \dots \end{matrix}) \\ = & λ (\begin{matrix} u_{1} \\ ⋮ \\ u_{l} \\ ⋮ \\ u_{n_{r}} \end{matrix}) = λ u . \end{matrix}

(96)

The eigenvectors can be found from the rows, which are given by

- Y u_{l - 1} + X u_{l} + Y u_{l + 1} = Λ u,

(97)

with boundary conditions

u_{0} = u_{n_{r} + 1} = 0,

(98)

just as in the example of the diagonalization of the X matrix earlier in this section. Since the block matrix B is an

n_{r} \times n_{r}

array of blocks, the eigenvector has

n_{r}

components.

This difference equation has solution, once again, of the form

e^{i l θ}

or, in order to keep the angle

θ

positive, a linear combination consisting of

e^{i l θ}

and

e^{- i l θ}

. We write these eigenvectors as

u_{l} = A e^{i l θ} + B {(- 1)}^{l} e^{- i l θ},

(99)

and apply the

l = 0

boundary condition to obtain

u_{0} = A + B,

(100)

and so

B = - A

, and then

u_{l} = A [e^{i l θ} - {(- 1)}^{l} e^{- i l θ}] = 0,

(101)

Either of the two terms appearing here could be used to find the eigenvalue

Λ

by inserting it into the difference equation. We choose

e^{i l θ}

, and have

- Y e^{i (l - 1) θ} + X e^{i l θ} + Y e^{i (l + 1) θ} = Λ e^{i l θ},

(102)

which, after canceling

e^{i l θ}

, gives

Λ = - Y e^{- i θ} + X + Y e^{i θ},

(103)

so that

Λ = X + 2 i Y sin θ

(104)

is an

n_{c} \times n_{c}

matrix that represents the “eigenvalue” of a block, irrespective of whether or not X and Y are diagonal or even simultaneously diagonalizable.

The second boundary condition

u_{m + 1} = 0

quantizes the angle

θ

, just as it quantized the angle

φ

for the matrix X. It gives

u_{n_{r} + 1} = A [e^{i (n_{r} + 1) θ} - {(- 1)}^{n_{r} + 1} e^{- i (n_{r} + 1) θ}] = 0,

(105)

so that

e^{i (n_{r} + 1) θ} = {(- 1)}^{n_{r} + 1} e^{- i (n_{r} + 1) θ} = e^{i π (n_{r} + 1)} e^{- i (n_{r} + 1) θ} .

(106)

Taking the logarithm of both sides shows that

(n_{r} + 1) θ = (n_{r} + 1) π + 2 π W,

(107)

where W is another winding number arising from crossing the branch cut in the logarithm. This is what provides the complete set of eigenvectors, characterized by the various values that the angle

θ = θ_{W}

can take on as W assumes the values

W = 1, 2, \dots n_{r}

. Then,

θ_{W} = \frac{π}{2} + \frac{π W}{n_{r} + 1}

(108)

are the various values of

θ_{W}

. The values

θ = 0

and

θ = π

are the boundary points where the solution vanishes that were added by padding the ends of the list. The components of the eigenvector

u_{l}^{W}

are

u_{l}^{W} = A [e^{i k θ_{W}} - {(- 1)}^{l} e^{- i k θ_{W}}],

(109)

with A the normalization constant determined from

\sum_{k = 1}^{n_{c} - 1} {|u_{k}^{W}|}^{2} = 1

.

The expression for the eigenfunction,

u_{l}^{W} = A [e^{i l θ_{W}} - {(- 1)}^{l} e^{- i l θ_{W}}] = 0,

(110)

can be rewritten by inserting the explicit form of the angle

θ_{W} = \frac{π}{2} + \frac{W π}{n_{r} + 1}

. This produces

\frac{1}{A} u_{l}^{W} = e^{i l (\frac{π}{2} + \frac{W π}{n_{r} + 1})} - e^{i π l} e^{- i l (\frac{π}{2} + \frac{W π}{n_{r} + 1})} = e^{i \frac{π}{2} l} [e^{i l \frac{W π}{n_{r} + 1}} - e^{- i l \frac{W π}{n_{r} + 1}}] = {(i)}^{l} [2 i sin (\frac{π W l}{n_{r} + 1})],

(111)

giving

u_{l}^{W} = 2 A {(i)}^{l + 1} sin (\frac{π l W}{n_{r} + 1}) .

(112)

The normalization constant A is found earlier in this section in the diagonalization of the X matrix by requiring that

\sum_{l = 1}^{n_{r}} {|u_{l}^{W}|}^{2} = 1

. This gives

A = \frac{1}{\sqrt{2 (n_{r} + 1)}},

(113)

and so the components of the normalized eigenvector are

u_{l}^{W} = \sqrt{\frac{2}{n_{r} + 1}} i^{l + 1} sin (\frac{π l W}{n_{r} + 1}),

(114)

and the associated eigenvalue is

Λ_{W} = X + 2 i Y sin θ_{W} .

(115)

These are the block eigenvalues

Λ_{W}

of the block matrix B. Writing these down the diagonal gives B in diagonal form,

B = (\begin{matrix} X + 2 i Y sin (θ_{1}) \\ X + 2 i Y sin (θ_{2}) \\ X + 2 i Y sin (θ_{3}) \\ ⋱ \\ X + 2 i Y sin (θ_{n_{r}}) \end{matrix}) .

(116)

Each of these eigenvalues is actually an

n_{c} \times n_{c}

matrix forming a block of the now-diagonal block matrix B, and if written out in full represents the original

N_{sites} \times N_{sites}

matrix M, although that matrix is not yet cruciform. We now set out to make it so.

The block X is not yet in diagonal form, although we know its eigenvalues, which are

λ_{ω} = 2 i x sin φ_{ω}

with

ω = 1, 2, \dots n_{c}

. Thus, when diagonalized, we have

X = (\begin{matrix} 2 i x sin (φ_{1}) \\ 2 i x sin (φ_{2}) \\ 2 i x sin (φ_{3}) \\ ⋱ \\ 2 i x sin (φ_{n_{c}}) \end{matrix}) .

(117)

The question is, what happens to Y when we transform both X and Y using the unitary transformation U that diagonalizes X? The short answer is that the vertical dimer activity y shows up along the antidiagonal, which is the “diagonal” that runs from the lower left to upper right corners of a matrix. This produces the cruciform matrix

Λ_{W} = (\begin{matrix} λ_{1} & {(- 1)}^{n_{c} - 1} ξ \\ λ_{2} & {(- 1)}^{n_{c} - 2} ξ \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ {(- 1)}^{1} ξ & λ_{n_{c} - 1} \\ {(- 1)}^{0} ξ & λ_{n_{c}} \end{matrix}),

(118)

where

ξ = 2 i^{n_{c}} y sin θ_{W}

. We then have a block matrix consisting of these cruciform blocks, and its determinant gives the square of the partition function.

This cruciform shape results from the transformation U that diagonalizes X, which is constructed from the normalized eigenvectors

v^{ω}

of X with the components of each vector

v^{ω}

forming the columns of U, giving

U = (\begin{matrix} v_{1}^{1} & \dots & v_{1}^{ω} & \dots & v_{1}^{n_{c}} \\ ⋮ & ⋱ \\ v_{k}^{1} & v_{k = ω}^{ω} \\ ⋮ & ⋱ \\ v_{n_{c}}^{1} & v_{n_{c}}^{n_{c}} \end{matrix}) .

(119)

The block Y, written explicitly in terms of the activity, has entries y along the antidiagonal and can be written as y times the matrix with entries of 1 on the antidiagonal. To see what effect such a matrix has on another matrix, a two-dimensional example should be sufficient. We have

\begin{matrix} \frac{1}{y} U^{†} Y U & = & (\begin{matrix} u_{11}^{*} & u_{21}^{*} \\ u_{12}^{*} & u_{22}^{*} \end{matrix}) (\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}) (\begin{matrix} u_{11} & u_{12} \\ u_{21} & u_{22} \end{matrix}) = (\begin{matrix} u_{11}^{*} & u_{21}^{*} \\ u_{12}^{*} & u_{22}^{*} \end{matrix}) (\begin{matrix} u_{21} & u_{22} \\ u_{11} & u_{12} \end{matrix}) \\ = & (\begin{matrix} u_{11}^{*} u_{21} + u_{21}^{*} u_{11} & u_{11}^{*} u_{22} + u_{21}^{*} u_{12} \\ u_{12}^{*} u_{21} + u_{22}^{*} u_{11} & u_{12}^{*} u_{22} + u_{22}^{*} u_{12} \end{matrix}) . \end{matrix}

(120)

It seems that the effect of a matrix with unity along the antidiagonal (the anti-identity?) is to reverse the rows of the matrix it multiplies. This result implies that the matrix element

t_{12}

of

\frac{1}{y} U^{†} Y U

is

t_{12} = u_{11}^{*} u_{n_{c} 2} + u_{21}^{*} u_{(n_{c} - 1) 2} + u_{31}^{*} u_{(n_{c} - 2) 2} + \dots + u_{k 1}^{*} u_{(n_{c} - k + 1) 2} + \dots + u_{n_{c} 1}^{*} u_{(n_{c} - n_{c} + 1) 2} .

(121)

The general matrix element then consists of the product of the component

u_{k ω}^{*}

and

u_{k^{'} ω^{'}}

with

k^{'} = n_{c} - k + 1

, or in terms of the components of the eigenvector,

v_{k}^{ω *}

and

v_{k^{'} = n_{c} - k + 1}^{ω^{'}}

. The matrix elements of

\frac{1}{y} U^{†} Y U

are

t_{ω, ω^{'}} = \sum_{k = 1}^{n_{c}} v_{k}^{ω *} v_{n_{c} - k + 1}^{n_{c} - ω} = i^{n_{c} - 1} {(- 1)}^{n_{c} - ω} δ_{ω^{'}, n_{c} + 1 - ω},

(122)

which is derived in detail in Appendix D.

These are the terms that appear on the antidiagonal of Y after transforming to the representation in which X is diagonal. All matrix elements off the antidiagonal of Y are zero. This

n_{c} \times n_{c}

matrix Y can finally be written as

Y = y (\begin{matrix} i^{n_{c} - 1} {(- 1)}^{n_{c} - 1} \\ \dots \\ i^{n_{c} - 1} {(- 1)}^{n_{c} - (n_{c} - 1)} \\ i^{n_{c} - 1} {(- 1)}^{n_{c} - n_{c}} \end{matrix}) .

(123)

The transformed matrix

Λ_{W} = X + 2 i sin (θ_{W}) Y

becomes

Λ_{W} = (\begin{matrix} 2 i x sin (φ_{1}) & {(- 1)}^{n_{c} - 1} ξ) \\ 2 i x sin (φ_{2}) & {(- 1)}^{n_{c} - 2} ξ \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ {(- 1)}^{1} ξ & 2 i x sin (φ_{n_{c} - 1}) \\ {(- 1)}^{0} ξ & 2 i x sin (φ_{n_{c}}) \end{matrix}) .

(124)

This matrix

Λ_{W}

is a block of the matrix B, which, because it has the form of a cross, is called a cruciform matrix.

The partition function is given by the Pfaffian of this matrix B, and since the square of a Pfaffian is a determinant, it is more convenient to evaluate the square of the partition function, which is the determinant of B. The block eigenvalues

Λ_{W}

form the diagonal elements of B, while all other blocks are zero. The determinant of a diagonalized block matrix like this is the product of the determinants of the individual blocks, and the square of the partition function is the determinant.

Z^{2} = det [B] = \prod_{W = 1}^{n_{r}} det Λ_{W} .

(125)

The determinant of a cruciform matrix is easily determined by expanding by the first row of the matrix and then in each minor expanding by the last row. Then each new minor is expanded by the first row and each of the resulting minors by their bottom rows. This repetition creates

⌊ \frac{1}{2} (n + 1) ⌋

factors of the type

(d_{11} d_{n n} - d_{1 n} d_{n 1})

. The brackets

⌊ x ⌋

indicate the floor function, which gives the highest integer lower than x. The square of the partition function is evaluated in Appendix E, with the result

Z^{2} = {(- 1)}^{n_{r} ⌊ \frac{n_{c}}{2} ⌋} \prod_{ω = 1}^{n_{c}} \prod_{W = 1}^{n_{r}} [2 x cos (\frac{π ω}{n + 1}) + 2 i y cos (\frac{π W}{n_{r} + 1})] .

(126)

The square root of this, the partition function itself, is also evaluated in Appendix E and is given by

Z = 2^{\frac{n_{c}}{2} ⌊ \frac{n_{r}}{2} ⌋} \prod_{ω = 1}^{\frac{n_{c}}{2}} \prod_{W = 1}^{⌊ \frac{n_{r}}{2} ⌋} [x^{2} (1 + cos (\frac{2 π ω}{n_{c} + 1})) + y^{2} (1 + cos (\frac{2 π W}{n_{r} + 1}))] \times \{\begin{matrix} 1 & n_{r} even \\ x^{\frac{n_{c}}{2}} & n_{r} odd \end{matrix} .

(127)

8. Dimer Partition Function in the Large Lattice Limit

We can calculate the partition function explicitly if we take the limit as the lattice becomes large,

n_{r} \to \infty

and

n_{c} \to \infty

. It is better to find this limit of the logarithm of Z, so that products become sums and we have

\begin{matrix} ln Z & = & ln (2^{\frac{n_{c}}{2} ⌊ \frac{n_{r}}{2} ⌋}) + \sum_{ω = 1}^{\frac{n_{c}}{2}} \sum_{W = 1}^{⌊ \frac{n_{r}}{2} ⌋} ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2} [1 + cos (\frac{2 π W}{n_{r} + 1})]\} + \\ + \{\begin{matrix} 0 & n_{r} even \\ \frac{n_{c}}{2} ln x & n_{r} odd \end{matrix} . \end{matrix}

(128)

In the infinite limit,

n_{r}

and

n_{c}

are much greater than unity, and so

n_{r} + 1 \sim n_{r}

and

n_{c} + 1 \sim n_{c}

, while

⌊ \frac{n_{r}}{2} / 2 ⌋ \sim \frac{n_{r}}{2}

. The basic rule for converting sums to integrals is

\sum_{k = 1}^{k_{\max}} f (x_{k}) Δ x = \int_{a}^{b} f (x) d x,

(129)

with

Δ x = \frac{b - a}{k_{\max} - 1}

and

x_{k} = a + (k - 1) Δ x

. Using this, the sum over ω becomes

\sum_{ω = 1}^{\frac{n_{c}}{2}} ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2} [1 + cos (\frac{2 π W}{n_{r} + 1})]\} \sim \int_{a}^{b} f (x) d x,

(130)

Choosing the integration variable as

t = \frac{2 π ω}{n_{c}}

is convenient, giving a factor

d t = \frac{d t}{d ω} d ω = \frac{2 π}{n_{c}} d ω

, and when

n_{c}

is large, the lower limit

ω = 1

becomes

a \approx 0

. The upper limit is

b = \frac{2 π}{n_{c}} \frac{n_{c}}{2} = π

, and so the integral takes the form

\begin{matrix} \sum_{ω = 1}^{\frac{n_{c}}{2}} ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2} [1 + cos (\frac{2 π W}{n_{r} + 1})]\} \\ \sim \int_{0}^{π} \frac{n_{c}}{2 π} d t ln \{x^{2} [1 + cos t] + y^{2} [1 + cos (\frac{2 π W}{n_{r}})]\} . \end{matrix}

(131)

The analog for the sum over W is

\begin{matrix} \sum_{W = 1}^{\frac{n_{r}}{2}} ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2} [1 + cos (\frac{2 π W}{n_{r} + 1})]\} \\ \sim \int_{0}^{π} \frac{n_{r}}{2 π} d s ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c}})] + y^{2} [1 + cos s]\} . \end{matrix}

(132)

The double sum is then

\begin{matrix} \sum_{ω = 1}^{\frac{n_{c}}{2}} \sum_{W = 1}^{\frac{n_{r}}{2}} ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2} [1 + cos (\frac{2 π W}{n_{r} + 1})]\} \\ \sim \frac{n_{c} n_{r}}{4 π^{2}} \int_{0}^{π} d s \int_{0}^{π} d t ln \{x^{2} [1 + cos t] + y^{2} [1 + cos s]\} . \end{matrix}

(133)

The integral is given by Equation (4.224.9) of reference [16] as

\int_{0}^{π} d s ln (a + b cos s) = π ln (\frac{a + \sqrt{a^{2} - b^{2}}}{2}) for a \geq | b | > b .

(134)

Here,

a = x^{2} (1 + cos t) + y^{2}

or

y^{2} (1 + cos s)

, so that the integrals can be evaluated, although we have no need to do so. The entropy is an extensive quantity, with its logarithm proportional to

N_{sites}

as the number of sites becomes large. The result diverges as the lattice becomes infinite, and the only meaningful expression is the “partition function per site”,

\begin{matrix} ln Z_{site} & \equiv & lim_{N_{sites} \to \infty} \frac{1}{N_{sites}} \sum_{ω = 1}^{\frac{n_{c}}{2}} \sum_{W = 1}^{\frac{n_{r}}{2}} ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2} [1 + cos (\frac{2 π W}{n_{r} + 1})]\} . \end{matrix}

(135)

This result is given by Fisher and followed by a derivation of the entropy and an excellent discussion of the result and associated dimer densities per site

n_{x}

and

n_{y}

that will not be reiterated here [7].

9. Colored y Dimers and the Partition Function for Dimers and Monomers

The partition function in Equation (127) is applicable to any lattice formed from

n_{r}

rows and

n_{c}

columns of lattice points. Suppose that the lattice has the form of a two-leg ladder, with

n_{r} = 2

rows and

n_{c}

columns and that it is completely filled with dimers,

ν = 1

. A dimer arrangement on this lattice is shown in the top of Figure 14. The y-direction dimers are colored orange to distinguish them from the x dimers, which are colored green. Looking down on this ladder so that it is viewed edge-on, as in the bottom picture, it looks like a linear lattice filled with dimers and orange monomers.

Figure 14. A two-leg ladder ((top) figure) viewed from above ((bottom) figure). The y dimers are shown in orange, with the x dimers shown in green. The bottom picture could be considered to be a linear lattice in which the orange y dimers appear to be monomers. The red dots and dashes outline the lattices.

What is the effect of randomly coloring the vertical dimers red and blue and assigning them corresponding activities

y_{r}

and

y_{b}

, just like the red and blue balls discussed earlier? If we could find the partition function for this system, we could have a linear system of dimers, monomers, and vacant sites, even if this problem has not been solved analytically in two dimensions in Fisher’s work.

We can see how to do this by starting with the expression

Z = \sum_{N_{x}, N_{y}} g (N_{x}, N_{y}) x^{N_{x}} y^{N_{y}},

(136)

because

g (N_{x}, N_{y})

is the number of states accessible to the system when there are precisely

N_{x}

horizontal dimers and

N_{y}

vertical dimers. If the vertical y-dimers come in two colors,

y_{r}

and

y_{b}

, the number of distinct states increases, and does so by a factor of

\frac{N_{y}!}{N_{r}! N_{b}!}

, where

N_{r}

is the number of red y-bonds and

N_{b}

is the number of blue y-dimers, and

N_{r} + N_{b} = N_{y}

. The sum over

N_{y}

gains the additional factor

\sum_{N_{r} = 0}^{N_{y}} \frac{N_{y}!}{N_{r}! N_{b}!} y_{r}^{N_{r}} y_{b}^{N_{b}} = \sum_{N_{r} = 0}^{N_{y}} \frac{N_{y}!}{N_{r}! (N_{y} - N_{r})!} y_{r}^{N_{r}} y_{b}^{N_{y} - N_{r}} = {(y_{r} + y_{b})}^{N_{y}},

(137)

and we have

Z = \sum_{N_{x}, N_{y}} g (N_{x}, N_{y}) x^{N_{x}} \sum_{N_{r} = 0}^{N_{y}} \frac{N_{y}!}{N_{r}! N_{b}!} y_{r}^{N_{r}} y_{b}^{N_{b}} = \sum_{N_{x}, N_{y}} g (N_{x}, N_{y}) x^{N_{x}} {(y_{r} + y_{b})}^{N_{y}} .

(138)

The link fields

A^{μ}

are colorless and do not distinguish between

y_{r}

and

y_{b}

, even though they throw away a large fraction of the random arrangements of the child’s toys. Thus, the rest of the calculation proceeds as before, the only change being the replacement

y \to y_{r} + y_{b}

, giving

\begin{matrix} Z & = & 2^{\frac{n_{c}}{2} ⌊ \frac{n_{r}}{2} ⌋} \prod_{ω = 1}^{\frac{n_{c}}{2}} \prod_{W = 1}^{⌊ \frac{n_{r}}{2} ⌋} [x^{2} (1 + cos (\frac{2 π ω}{n_{c} + 1})) + {(y_{r} + y_{b})}^{2} (1 + cos (\frac{2 π W}{n_{r} + 1}))] \\ \times \{\begin{matrix} 1 & n_{r} even \\ x^{\frac{n_{c}}{2}} & n_{r} odd \end{matrix} . \end{matrix}

(139)

We now return to the two-leg ladder. Here, we want a very long ladder,

n_{c} \to \infty

, while for two legs,

n_{r} = 2

. Evaluating the partition function for

n_{r} = 2

gives

⌊ \frac{n_{r}}{2} ⌋ = 1

, and we have the logarithm of the partition function given by

\begin{matrix} ln Z & = & ln (2^{\frac{n_{c}}{2} ⌊ \frac{n_{r}}{2} ⌋}) + \sum_{ω = 1}^{\frac{n_{c}}{2}} \sum_{W = 1}^{⌊ \frac{n_{r}}{2} ⌋} ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2} [1 + cos (\frac{2 π W}{n_{r} + 1})]\} + \\ + \{\begin{matrix} 0 & n_{r} even \\ \frac{n_{c}}{2} ln x & n_{r} odd \end{matrix} \\ = & ln 2^{\frac{n_{c}}{2}} + \sum_{ω = 1}^{\frac{n_{c}}{2}} \sum_{W = 1}^{1} ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2} [1 + cos (\frac{2 π W}{2 + 1})]\} \\ = & \frac{n_{c}}{2} ln 2 + \sum_{ω = 1}^{\frac{n_{c}}{2}} ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2} [1 + cos (\frac{2 π}{3})]\} . \end{matrix}

(140)

The angle

\frac{2 π}{3}

is

120^{\circ}

, and so the cosine is

- \frac{1}{2}

, giving

ln Z = \frac{n_{c}}{2} ln 2 + \sum_{ω = 1}^{\frac{n_{c}}{2}} ln \{x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + \frac{1}{2} y^{2}\} .

(141)

Combining the first term with the sum brings a factor of two inside the log in the sum, giving

ln Z = \sum_{ω = 1}^{\frac{n_{c}}{2}} ln \{2 x^{2} [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2}\} .

(142)

This is the correct result for the two-leg ladder. It is not, however, correct for the linear chain. To see why, imagine squeezing the rungs of the ladder together, producing the orange disks from the original y-oriented dimers, as shown in Figure 15. This results in a double dimer, one from each leg of the ladder, on pairs of sites that should only a single dimer. In the logarithm of the partition function, the origin of the double dimers is the

x^{2}

term, which because we have logarithms, gives

ln x^{2} = ln x + ln x

. The correct result for the linear chain,

n_{r} = 1

, has

x^{2}

replaced by x,

ln Z = \sum_{ω = 1}^{\frac{n_{c}}{2}} ln \{2 x [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2}\} when n_{r} = 1 .

(143)

Fisher implies that he confirmed this result by including vacancies in their original formulation and evaluating the Pfaffian, and states that this result is consistent with other ways of determining configurations of a monomer–dimer linear chain.

Figure 15. Imagine squeezing the rungs of the ladder together, producing the orange disks from the original y-oriented dimers.

10. The Linear Chain

This gives the partition function for an infinitely long linear chain by letting

n_{c} \to \infty

and using the integral representation introduced above. This gives

\begin{matrix} ln Z_{site} & = & lim_{n_{c} \to \infty} \frac{1}{n_{c}} \sum_{ω = 1}^{\frac{n_{c}}{2}} ln \{2 x [1 + cos (\frac{2 π ω}{n_{c} + 1})] + y^{2}\} \\ = & \int_{0}^{π} \frac{d t}{2 π} ln [2 x (1 + cos t) + y^{2}] . \end{matrix}

(144)

Evaluating the integral gives

π ln (\frac{a + \sqrt{a^{2} + b^{2}}}{2})

where

a = 2 x + y^{2}

and

b = 2 x

. In the infinite limit of a very long chain,

ln Z_{site} = \frac{1}{2} ln (\frac{2 x + y^{2} + \sqrt{{(2 x + y^{2})}^{2} - {(2 x)}^{2}}}{2}) .

(145)

Under the square root, we have

4 x^{2} + 4 x y^{2} + y^{4} - 4 x^{2} = y^{2} (4 x + y^{2})

, giving

\begin{matrix} ln Z_{site} & = & \frac{1}{2} ln (\frac{4 x + 2 y \sqrt{4 x + y^{2}} + 2 y^{2}}{4}) = \frac{1}{2} ln (\frac{(4 x + y^{2}) + 2 y \sqrt{4 x + y^{2}} + y^{2}}{4}) \\ = & \frac{1}{2} ln ({[\frac{\sqrt{4 x + y^{2}} + y}{2}]}^{2}) . \end{matrix}

(146)

Then the log of the partition function per site is

ln Z_{site} = ln (\frac{\sqrt{4 x + y^{2}} + y}{2}) n_{r} = 1, n_{c} \to \infty

(147)

for a long monomer–dimer chain. This allows systems involving the attachment of monomers and dimers to a long polymeric chain, such as charged dimers and point-like ions binding electrostatically to a DNA strand.

For colored vertical dimers, we replace the activity y by

y_{b} + y_{r}

, with

y_{r}

the activity of red y-dimers and y the activity of blue y-dimers. Making this change in the partition function of the one-dimensional very long chain, we have

ln Z_{site} = ln (\frac{\sqrt{4 x + {(y_{r} + y_{b})}^{2}} + y_{r} + y_{b}}{2}) n_{r} = 1, n_{c} \to \infty .

(148)

We can now identify the red y-dimers with monomers with activity

z_{⊥} = e^{- β (ε_{⊥} - μ_{⊥})}

and the blue y-dimers with vacancies with activity

v = 1

, because vacancies have neither energy nor chemical potential, and so

v = e^{- β (ε_{v} - μ_{v})} = e^{0} = 1

. We will call the activity of the x dimers

x_{‖} = e^{- β (ε_{‖} - μ_{‖})}

. Then the log of the partition function per site that includes y dimers and vacancies is

ln Z_{site} = ln (\frac{1}{2} [z_{⊥} + v + \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}]), n_{r} = 1, n_{c} \to \infty .

(149)

In general, the entropy per site is given by the expression

S_{site} = k_{B} ln Z_{site} - \frac{k_{B} β}{Z_{site}} \frac{\partial Z_{site}}{\partial β} .

(150)

Therefore, the entropy for the dimer model becomes

S_{site} = k_{B} ln Z_{site} - \frac{1}{2} \frac{k_{B} β}{Z_{site}} \frac{\partial}{\partial β} (z_{⊥} + v + \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}),

(151)

where twice the derivative of the partition function is

\begin{matrix} 2 \frac{\partial}{\partial β} Z_{site} & = & \frac{\partial}{\partial β} ((z_{⊥} + v) + \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}) = \frac{\partial z_{⊥}}{\partial β} + \frac{\partial v}{\partial β} \\ + \frac{1}{2} {[{(z_{⊥} + v)}^{2} + 4 x_{‖}]}^{- \frac{1}{2}} \frac{\partial}{\partial β} [{(z_{⊥} + v)}^{2} + 4 x_{‖}] . \end{matrix}

(152)

The derivative of the activity

a = e^{- β (ε_{a} - μ_{a})}

with respect to

β

is

\frac{\partial}{\partial β} a = - (ε_{a} - μ_{a}) a = \frac{1}{β} a ln a .

(153)

Therefore, the derivative of the partition function per site is

\begin{matrix} 2 \frac{\partial}{\partial β} Z_{site} & = & \frac{1}{β} z_{⊥} ln z_{⊥} + 0 \\ + \frac{1}{2} {[{(z_{⊥} + v)}^{2} + 4 x_{‖}]}^{- \frac{1}{2}} [2 (z_{⊥} + v) \frac{1}{β} z_{⊥} ln z_{⊥} + \frac{4}{β} x_{‖} ln x_{‖}] . \end{matrix}

(154)

This means that the entropy is given by

\begin{matrix} S_{site} & = & k_{B} ln Z_{site} - \frac{1}{2} \frac{k_{B} β}{Z_{site}} \{\frac{1}{β} z_{⊥} ln z_{⊥} + \frac{1}{2} {[{(z_{⊥} + v)}^{2} + 4 x_{‖}]}^{- \frac{1}{2}} \times \\ \times [2 (z_{⊥} + v) \frac{1}{β} z_{⊥} ln z_{⊥} + \frac{4}{β} x_{‖} ln x_{‖}]\} . \end{matrix}

(155)

Simplifying, we have

\begin{matrix} S_{site} & = & k_{B} ln Z_{site} - \frac{k_{B}}{2 Z_{site}} \{z_{⊥} ln z_{⊥} + {[{(z_{⊥} + v)}^{2} + 4 x_{‖}]}^{- \frac{1}{2}} \times \\ \times [(z_{⊥} + v) z_{⊥} ln z_{⊥} + 2 x_{‖} ln x_{‖}]\} . \end{matrix}

(156)

Similarly, the mean site occupation by the perpendicular dimers

z_{⊥}

is

n_{⊥} = \frac{1}{β Z_{site}} \frac{\partial Z_{site}}{\partial μ_{⊥}} = \frac{1}{2 β Z_{site}} \frac{\partial}{\partial μ_{⊥}} (z_{⊥} + v + \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}) .

(157)

Therefore, taking the derivative, we have

n_{⊥} = \frac{1}{2 β Z_{site}} (\frac{\partial z_{⊥}}{\partial μ_{⊥}} + 0 + \frac{1}{2} {[{(z_{⊥} + v)}^{2} + 4 x_{‖}]}^{- \frac{1}{2}} \frac{\partial}{\partial μ_{⊥}} [{(z_{⊥} + v)}^{2} + 4 x_{‖}]) .

(158)

The derivative of the activity

a = e^{- β (ε_{a} - μ_{a})}

with respect to

μ_{a}

is

\frac{\partial}{\partial μ_{a}} = \frac{\partial}{\partial μ_{a}} e^{- β (ε_{a} - μ_{a})} = β a,

(159)

and this is zero applied to the activity of a different type of particle. Then the mean site occupation by the perpendicular dimers

z_{⊥}

is

n_{⊥} = \frac{1}{2 β Z_{site}} (β z_{⊥} + \frac{1}{2} {[{(z_{⊥} + v)}^{2} + 4 x_{‖}]}^{- \frac{1}{2}} [2 (z_{⊥} + v) β z_{⊥}]) .

(160)

Simplifying, we have

n_{⊥} = \frac{z_{⊥}}{2 Z_{site}} (1 + (z_{⊥} + v) {[{(z_{⊥} + v)}^{2} + 4 x_{‖}]}^{- \frac{1}{2}}) .

(161)

Factoring out

{[{(z_{⊥} + v)}^{2} + 4 x_{‖}]}^{- \frac{1}{2}}

from the parentheses, we have

n_{⊥} = \frac{z_{⊥}}{2 \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}} \frac{1}{Z_{site}} (\sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}} + (z_{⊥} + v)) .

(162)

Now we see that the quantity inside the large parentheses is

2 Z_{site}

, and so the mean site occupancy reduces to

n_{⊥} = \frac{z_{⊥}}{\sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}} .

(163)

The same steps lead to the mean site occupancy for the vacancies v, which is

n_{v} = \frac{v}{\sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}} .

(164)

Since the parallel dimers

x_{‖}

each occupy two sites, the mean occupancy per site must be doubled, and we have the mean site occupancy for the parallel dimers as

n_{‖} = 2 \frac{1}{β Z_{site}} \frac{\partial Z_{site}}{\partial μ_{‖}} = \frac{2}{2 β Z_{site}} \frac{\partial}{\partial μ_{‖}} (z_{⊥} + v + \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}) .

(165)

Taking the derivative, we have

n_{‖} = \frac{1}{β Z_{site}} \frac{1}{2} {[{(z_{⊥} + v)}^{2} + 4 x_{‖}]}^{- \frac{1}{2}} (4 β x_{‖}) .

(166)

Simplifying yields

n_{‖} = \frac{2 x_{‖}}{Z_{site} \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}} .

(167)

The sum of the occupancies for the three species should be 1, and so let us check that. The sum is

n_{⊥} + n_{‖} + n_{v} = \frac{[(z_{⊥} + v) Z_{site} + 2 x_{‖}]}{Z_{site} \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}} .

(168)

Substituting

Z_{site}

in the numerator, we have

n_{⊥} + n_{‖} + n_{v} = \frac{1}{Z_{site} \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}} \{\frac{1}{2} (z_{⊥} + v) [(z_{⊥} + v) + \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}] + 2 x_{‖}\} .

(169)

Factoring

\frac{1}{2}

from the numerator and expanding, we have

n_{⊥} + n_{‖} + n_{v} = \frac{1}{2 Z_{site} \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}} [{(z_{⊥} + v)}^{2} + 4 x_{‖} + (z_{⊥} + v) \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}] .

(170)

Factoring

\sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}

from the numerator produces

n_{⊥} + n_{‖} + n_{v} = \frac{\sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}}{2 Z_{site} \sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}}} [\sqrt{{(z_{⊥} + v)}^{2} + 4 x_{‖}} + (z_{⊥} + v)] .

(171)

The last factor on the right is exactly

2 Z_{site}

, and so the right-hand side reduces to one, and we have

n_{⊥} + n_{‖} + n_{v} = 1 .

(172)

Therefore, the total occupancy is, correctly, unity.

11. Results for the Entropy, Average Occupation, and Total Charge

Under certain conditions, DNA molecules condense. In this process, the DNA chain rolls into a tight toroid. If a DNA strand were simply placed in aqueous solution, this would not happen, because the phosphate groups that form the DNA backbone are negative, so that one length of the chain repels another. However, in a solution containing cations, the positive cations are electrostatically attracted to the negative backbone, neutralizing it or even inverting the charge. When the charge is inverted, the DNA chain dressed by the cations becomes positive, allowing it to coil up compactly.

Understanding the statistical mechanics of such processes gives insight into the evolution of complex life and the dynamics that maintains it. This led two of the authors to use methods borrowed from the physics of interacting particles and quantum field theory to the electrostatics of biomolecules [9,10] using lattice gas models. Such models regard all ions, monomers, dimers, or larger polyions as pointlike particles, completely abandoning the geometric constraints that come from the extended nature of the adsorbing species. All the geometry inherent in attaching a polyion flat against a surface so that it covers several lattice sites is lost.

In the plots of this section, any numerical parameters used were taken from our earlier work [9,10]. In these plots, the horizontal axis is the dimensionless product

β ε_{‖}

. Because parallel dimers occupy two sites, while the perpendicular dimers only occupy one, we set

β ε_{⊥} = \frac{1}{2} β ε_{‖}

. Strong binding to the lattice occurs when

β ε_{‖}

is large and negative, while the weak binding region is where

β ε_{‖}

is positive. Assuming that the DNA double helix is in equilibrium with a solution of dimers, all the chemical potentials must be the same and be equal to the chemical potential of dimers in solution given by

β μ_{dimer}

. The plots are all for the single physiological temperature of

T ≃ 310

K, with

β μ_{dimer} ≃ 0.79

.

The Fisher dimer model, as formulated in the previous sections, describes parallel dimers, with activity

x_{‖}

, perpendicular dimers, with activity

z_{⊥}

, and vacancies with activity v. These vacancies are on the negative backbone, and each of these vacant sites carries a charge of

- 1

, in units of the magnitude of the electron charge. The dimers have charge

+ 2

, with one positive charge on each end. Then, electrostatic attraction causes the dimers to bind to the vacant sites. They can do so in two ways, first by lying flat, parallel to the DNA chain, and binding to two vacant sites, so that they have an activity of

x_{‖}

. Alternatively, they can protrude at right angles to the DNA chain, with only one end bound to the chain and activity

z_{⊥}

. These two possibilities are called parallel and perpendicular dimers, with the perpendicular dimers acting as the monomers in the dimer, monomer, and vacancy model.

Figure 16 shows the entropies given by the Fisher dimer model and by the non-interacting lattice gas model. The striking feature is the way that entropy appears to plateau in the strong binding region. The lattice gas model does not show this because all the dimers lie flat on the lattice by

β ε_{‖} \sim - 10

. The nonzero value of the entropy in the Fisher dimer model suggests that disorder persists in this region even as the binding force becomes quite strong.

Figure 16. The plot of the two entropies given by the lattice gas model and the Fisher model of dimers.

To understand this disorder in more detail, Figure 17 shows the site occupancies for the Fisher dimer model. In the region of strong binding, both the site occupancies for the parallel and perpendicular dimers also plateau, even though the number of vacancies drops to near zero. This indicates that the disorder that leads to the nonvanishing entropy consists of a mixture of parallel and perpendicular dimers, occupying practically all the sites and accounting for the nonzero value of the entropy as the binding force becomes strong.

Figure 17. The average occupation of species in the Fisher model of dimers showing species mixing even at large negative binding energy. The purple dashed line is a numerical check showing that the sum of occupancies is one, as demonstrated analytically in the text.

For comparison, Figure 18 shows the corresponding site occupancies for the lattice gas model. There, the perpendicular dimer occupancy drops to zero in the region where the binding force is strong, as does the vacancy density, and the lattice is completely occupied by parallel dimers. An interpretation of the contrasting behavior between the two models is that the actual shape of a dimer spanning two sites when it lies parallel is responsible for the disorder, while in the lattice gas model, the parallel dimers are treated as though they are monomers, but with twice the binding energy.

Figure 18. The average occupation of species in the Fisher model of dimers showing species mixing even at large negative binding energy.

The phenomenon of the DNA strand rolling up compactly requires excess charge, and so Figure 19 shows the total charge

n_{⊥} + n_{v}

. This charge is positive when the binding force is strong because in that region there are few negative vacancies, but still a large number of perpendicular dimers, while each parallel dimer neutralizes two sites. That persistent positive charge in the Fisher model does not appear in the lattice gas model, where the charge drops to near zero in that region. In the weak binding region, the charge becomes small in the same way in both models, and the two curves lie on top of one another.

Figure 19. A comparison of the total charge vs. the binding energy on the lattice between the lattice gas model and the Fisher model.

In the context of DNA compaction, this excess charge is the physically most important feature seen from the Fisher model. This is because it leads to larger and more persistent charge inversion than is seen in the lattice gas model. It does so because once a sequence of parallel dimer–vacancy–parallel dimer forms, the only way the vacancy can be filled is with a perpendicular dimer. This must be the persistent disorder that occurs in the region of strong binding force.

12. Discussion

We have described a simplified version of Fisher’s derivation of the partition function of the completely filled

ν = 1

dimer model. This provides the entropy as calculated by Fisher, and the mathematics described has some relevance to the Ising spin model. While this work was in progress, we became aware of the paper by Allegra and Fortin [17] that proposes the use of Grassmann variables to solve the

d = 2

dimer problem including monomers, obtaining the partition function as a product of two Pfaffians. That opens the way for entropy and site occupancy calculations for the two-dimensional sheet following methods described here.

After extracting the partition function for a one-dimensional dimer–monomer–vacancy system from that for a

ν = 1

two-leg ladder, we compared the entropy and site occupations as a function of the binding energy to the lattice with the similar results for the lattice gas. The entropy appeared to plateau at a nonzero value in the strong-binding-force region, suggesting that disorder persists in that region.

In this context, the dimers represent dimeric polyions with a unit positive charge on each end that neutralizes the unit negative charge on the backbone lattice site, so the parallel dimers are effectively charge-neutral. The monomers represent the same dimers but oriented perpendicular to the surface so that only one end neutralizes the backbone lattice site to which it is attached, and it makes a unit-positive-charge contribution to the charge density on the chain, while each vacant site contributes its unit negative charge. In the strong-binding-force region, most lattice sites are occupied and few vacancies remain, but the perpendicular dimers scattered among the parallel dimers is a source of disorder. This leads to the persistent excess charge in the strong-binding region, and provides the disorder that accounts for the persistent plateau in the entropy there as well.

In our earlier paper [10] using the lattice gas model, we included electrostatic repulsion among the dimers, monomers, and negative empty sites on the DNA backbone to one-loop order. That did produce a reduction in the entropy by up to

1 / 3

of the total, but only in the weak-binding region of the plot. Consequently, this reduction would not affect the strong-binding region where the dimer model entropy appears to plateau at a nonzero value and the excess positive charge persists. Because this strong-binding region is the more important one, the geometry of the extended dimer occupying more than one site is the more important physically.

In other words, our conclusion from this work is that if physical consequences are the concern, the extended geometry of the attached molecules has more influence than the electrostatic many-particle interactions of species that carry electrical charge. When determining entropy, the shape of the attached molecule and the volume it excludes does matter, and ways of including these effects are needed.

Author Contributions

Writing—original draft preparation, J.C.B., M.F.B. and T.M.; writing—review and editing, J.C.B., M.F.B. and T.M. All authors contributed to this paper equally. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This is not relevant to our study. This does not require ethical approval.

Data Availability Statement

There is no data involved. This is a theoretical study.

Acknowledgments

We are grateful to Kevin R. Ward, Department of Emergency Medicine, University of Michigan, for introducing us to the role of charge in biological systems and to the practical consequences that could be achieved by controlling the charge distribution on biological molecules.

Conflicts of Interest

Author John C. Baker was employed by the company CACI International Inc. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A. Trace Theorems of the A^μ Matrices

We now prove the trace theorems that we need in order to study the statistical mechanics of the

ν = 1

dimer model on a square lattice. The commutation law with

ν = μ

shows that

2 A^{μ} A^{ν} = 2 g^{μ μ} = 2 I_{d} .

(A1)

This ensures that the trace of a single

A^{μ}

matrix vanishes, because inserting

{(A^{ν})}^{2} = I_{d}

and using the cyclic property of the trace yields

Tr (A^{μ}) = Tr [A^{μ} {(A^{ν})}^{2}] = Tr (A^{ν} A^{μ} A^{ν}) = Tr (- A^{μ} A^{ν} A^{ν}),

(A2)

where to obtain the third equality we anticommuted

A^{ν}

and

A^{μ}

, introducing a minus sign. Now, the product

- A^{μ} A^{ν} A^{ν}

can be regarded as some d-dimensional matrix M with matrix elements

m_{i j}

. Then,

Tr (- M) = \sum_{j = 1}^{d} (- m_{j j}) = - \sum_{j = 1}^{d} m_{j j} = - Tr (M)

. Thus

Tr (A^{μ}) = Tr (- A^{μ} A^{ν} A^{ν}) = - Tr (A^{μ} A^{ν} A^{ν}) = - Tr (A^{μ}) .

(A3)

This can only be true if

Tr (A^{μ}) = 0 .

(A4)

This is easily generalized to the trace of any product of an odd number of

A_{j}

matrices, because then

\begin{matrix} Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{2 n + 1}}) & = & Tr [A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{2 n + 1}} {(A^{ν})}^{2}] = Tr (A^{ν} A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{2 n + 1}} A^{ν}) \\ = & Tr [{(- 1)}^{2 n + 1} A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{2 n + 1}} A^{ν} A^{ν}], \end{matrix}

(A5)

since an odd number of interchanges is required to move

A^{ν}

past

2 n + 1

other matrices. We have

Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{2 n + 1}}) = {(- 1)}^{2 n + 1} Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{2 n + 1}}) = - Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{2 n + 1}}),

(A6)

so that the trace is zero,

Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{2 n + 1}}) = 0 .

(A7)

However, the trace of the product of an even number of

A^{μ}

matrices does not always vanish. The simplest example is

A^{μ} A^{ν} = 2 g^{μ ν} - A^{ν} A^{μ} .

(A8)

Taking the trace gives

Tr (A^{μ} A^{ν}) = 2 Tr (g^{μ ν}) + Tr (- A^{ν} A^{μ}) = 2 Tr (g^{μ ν}) - Tr (A^{ν} A^{μ}) = 2 Tr (g^{μ ν}) - Tr (A^{μ} A^{ν}),

(A9)

where in the last equality we used the cyclic property of the trace. Since the last trace on the right is the same as the expression we started with, moving the last term on the right to the left side gives

Tr (A^{μ} A^{ν}) = Tr (g^{μ ν}),

(A10)

after canceling a factor of two. When

ν = μ, Tr (g^{μ ν}) = Tr (g^{μ μ}) = d

.

Now, let us tackle the general case of arbitrary numbers of both matched pairs and unmatched

A^{μ}

s. Any product can be reduced to sums of products of the

g^{μ ν}

and a leftover term by repeated application of the anticommutation relation

A^{μ} A^{ν} = 2 g^{μ ν} - A^{ν} A^{μ}

. Consider

\begin{matrix} A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}} & = & (2 g^{μ_{1} μ_{2}} - A^{μ_{2}} A^{μ_{1}}) A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}} \\ = & 2 g^{μ_{1} μ_{2}} A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}} - A^{μ_{2}} A^{μ_{1}} A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}} . \end{matrix}

(A11)

Continuing to move

A^{μ_{1}}

to the end in the last term gives

\begin{matrix} A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}} & = & 2 g^{μ_{1} μ_{2}} A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}} - A^{μ_{2}} (2 g^{μ_{1} μ_{3}} - A^{μ_{3}} A^{μ_{1}}) A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}} \\ = & 2 g^{μ_{1} μ_{2}} A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}} - A^{μ_{2}} (2 g^{μ_{1} μ_{3}}) A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}} \\ + A^{μ_{2}} A^{μ_{3}} A^{μ_{1}} A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}} . \end{matrix}

(A12)

The metric

g^{μ ν}

commutes with all the

A^{μ}

, and can be moved to the front of each term. We then have the general pattern

\begin{matrix} A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}} & = & 2 g^{μ_{1} μ_{2}} A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}} - 2 g^{μ_{1} μ_{3}} A^{μ_{2}} A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}} \\ + 2 g^{μ_{1} μ_{4}} A^{μ_{2}} A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}} - \dots \\ + {(- 1)}^{n - 1} A^{μ_{2}} A^{μ_{3}} A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}} A^{μ_{1}}, \end{matrix}

(A13)

with the signs alternating from one term to the next, and the last having an overall parity factor of

{(- 1)}^{n - 1}

because

n - 1

interchanges are needed to move

A^{μ_{1}}

to the end.

Although we could now apply this general formula, which we will call the contraction theorem, to continue to reduce the sub-products like

A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}}

, let us first take the trace of this expression and use the cyclic property to see that

\begin{matrix} Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}}) & = & 2 Tr (g^{μ_{1} μ_{2}} A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}}) - 2 Tr (g^{μ_{1} μ_{3}} A^{μ_{2}} A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}}) \\ + 2 Tr (g^{μ_{1} μ_{4}} A^{μ_{2}} A^{μ_{3}} A^{μ_{5}} \dots A^{μ_{n}}) - \dots \\ + {(- 1)}^{n - 1} Tr (A^{μ_{1}} A^{μ_{2}} A^{μ_{3}} A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}}) . \end{matrix}

(A14)

Here, the trace on the left is the same as the last trace on the right, and if n is even, they are added to give

\begin{matrix} Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}}) & = & Tr (g^{μ_{1} μ_{2}} A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}}) - Tr (g^{μ_{1} μ_{3}} A^{μ_{2}} A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}}) \\ + Tr (g^{μ_{1} μ_{4}} A^{μ_{2}} A^{μ_{3}} A^{μ_{5}} \dots A^{μ_{n}}) - \dots . \end{matrix}

(A15)

However, if n is odd, they cancel, and the sum of all terms remaining on the right must vanish,

\begin{matrix} Tr (g^{μ_{1} μ_{2}} A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}}) & - & Tr (g^{μ_{1} μ_{3}} A^{μ_{2}} A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}}) \\ + & Tr (g^{μ_{1} μ_{4}} A^{μ_{2}} A^{μ_{3}} A^{μ_{5}} \dots A^{μ_{n}}) - \dots = 0 . \end{matrix}

(A16)

A second consequence of n being odd is that it is not possible to continue the reduction to all possible pairs, that is, a product of

g^{μ ν}

. There will always be one

A^{μ}

left over. In that case, since

g^{μ ν} = δ^{μ ν} I_{d}

, the trace becomes either zero if at least one factor of

g^{μ ν}

vanishes, or we have just

Tr (A^{μ}) = 0

. Consequently, the trace always vanishes if n is odd.

For n even, we have

\begin{matrix} Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}}) & = & Tr (g^{μ_{1} μ_{2}} A^{μ_{3}} A^{μ_{4}} \dots A^{μ_{n}}) - Tr (g^{μ_{1} μ_{3}} A^{μ_{2}} A^{μ_{4}} A^{μ_{5}} \dots A^{μ_{n}}) \\ + Tr (g^{μ_{1} μ_{4}} A^{μ_{2}} A^{μ_{3}} A^{μ_{5}} \dots A^{μ_{n}}) - \dots \\ = & \sum_{k = 2}^{n} {(- 1)}^{k} Tr (g^{μ_{1} μ_{k}} \prod_{\begin{matrix} l = 2 \\ l \neq k \end{matrix}}^{n} A^{μ_{l}}), \end{matrix}

(A17)

because

g^{μ ν} = δ^{μ ν} I_{d}

picks out the term or terms in the sum over k that have

μ_{k} = μ_{1}

and replaces the factor

g^{μ_{1} μ_{k}}

by the d-dimensional identity, which can be omitted from the remaining product. The simplest case is when only one term has

μ_{k} = μ_{1}

, and that is the only possibility that we will encounter, because the construction we will use can only place at most two

A^{μ}

on a single link, or dual site, of the lattice. Then, the trace above reduces to

Tr (A^{μ_{1}} A^{μ_{2}} \dots A^{μ_{n}}) = {(- 1)}^{P} Tr (\prod_{l = 1}^{n - 2} A^{μ_{l}}),

(A18)

with P the appropriate permutation index. Then the contraction theorem can be applied again to the remaining product. Continuing the process eventually reduces the right side to a permutation factor

{(- 1)}^{P}

times the trace of

I_{d}

, which is d, or to a product of unmatched

A^{μ}

, the trace of which vanishes.

Appendix B. Pfaffians and Determinants

Pfaffians are rarely discussed. They do not appear in common mathematical physics textbooks, but they have shown up in the physics literature—there is a Pfaffian quantum Hall state, for example [18,19].

A Pfaffian is similar to a determinant, but it is taken on the upper right triangle of an antisymmetric matrix. One of the Pfaffian’s most important properties is that its square is equal to the determinant of the corresponding antisymmetric matrix M,

{(Pf [M])}^{2} = det [M],

(A19)

where an example of dimension

2 h

is the antisymmetric matrix

M = (\begin{matrix} 0 & a_{1, 2} & a_{1, 3} & a_{1, 4} & \dots & a_{1, 2 h} \\ - a_{1, 2} & 0 & a_{2, 3} & a_{2, 4} & \dots & a_{2, 2 h} \\ - a_{1, 3} & - a_{2, 3} & 0 & a_{3, 4} & \dots & a_{3, 2 h} \\ - a_{1, 4} & - a_{2, 4} & - a_{3, 4} & 0 & ⋱ & ⋮ \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋱ & a_{2 h - 1, 2 h} \\ - a_{1, 2 h} & - a_{2, 2 h} & - a_{3, 2 h} & \dots & - a_{2 h - 1, 2 h} & 0 \end{matrix}) .

(A20)

The corresponding Pfaffian is sometimes written as

Pf [M] = \begin{matrix} | a_{1, 2} & a_{1, 3} & a_{1, 4} & \dots & a_{1, 2 h} \\ a_{2, 3} & a_{2, 4} & \dots & a_{2, 2 h} \\ a_{3, 4} & \dots & a_{3, 2 h} \\ ⋱ & ⋮ \\ a_{2 h - 1, 2 h} \end{matrix}| .

(A21)

Here, we use the convention that M is an antisymmetric matrix,

Det [M]

is its determinant, and

Pf [M]

is the Pfaffian of the upper right triangle of M (with the diagonal, which consists only of zeros, deleted).

The Pfaffian expansion by the first row is like a regular determinant with the notable caveat that both the rth row and column and the sth row and column are deleted. Suppose that we illustrate with an example of order

2 h = 6

,

Pf [M] = \begin{matrix} | a_{1, 2} & a_{1, 3} & a_{1, 4} & a_{1, 5} & a_{1, 6} \\ a_{2, 3} & a_{2, 4} & a_{2, 5} & a_{2, 6} \\ a_{3, 4} & a_{3, 5} & a_{3, 6} \\ a_{4, 5} & a_{4, 6} \\ a_{5, 6} \end{matrix}| .

(A22)

Expanding by the top row gives a first term of

Pf [M] = a_{1, 2} \times \begin{matrix} | a_{3, 4} & a_{3, 5} & a_{3, 6} \\ a_{4, 5} & a_{4, 6} \\ a_{5, 6} \end{matrix}| + \dots .

(A23)

Including the remaining terms, which have alternating signs just like a determinant, we have

\begin{matrix} Pf [M] & = & a_{1, 2} \times \begin{matrix} | a_{3, 4} & a_{3, 5} & a_{3, 6} \\ a_{4, 5} & a_{4, 6} \\ a_{5, 6} \end{matrix}| - a_{1, 3} \times \begin{matrix} | a_{2, 4} & a_{2, 5} & a_{2, 6} \\ a_{4, 5} & a_{4, 6} \\ a_{5, 6} \end{matrix}| \\ + & a_{1, 4} \times \begin{matrix} | a_{2, 3} & a_{2, 5} & a_{2, 6} \\ a_{3, 5} & a_{3, 6} \\ a_{5, 6} \end{matrix}| - a_{1, 5} \times \begin{matrix} | a_{2, 3} & a_{2, 4} & a_{2, 6} \\ a_{3, 4} & a_{3, 6} \\ a_{4, 6} \end{matrix}| \\ + & a_{16} \times \begin{matrix} | a_{2, 3} & a_{2, 4} & a_{2, 5} \\ a_{3, 4} & a_{3, 5} \\ a_{4, 5} \end{matrix}| . \end{matrix}

(A24)

Continuing, we expand each of these minors by its first row. The first gives

\begin{matrix} | a_{3, 4} & a_{3, 5} & a_{3, 6} \\ a_{4, 5} & a_{4, 6} \\ a_{5, 6} \end{matrix}| = a_{3, 4} \times \begin{matrix} | a_{4, 5} & a_{4, 6} \\ a_{5, 6} \end{matrix}| = a_{3, 4} a_{5, 6} .

(A25)

Including the rest of the terms, the Pfaffian is

\begin{matrix} Pf [M] & = & a_{1, 2} (a_{3, 4} a_{5, 6} - a_{3, 5} a_{4, 6} + a_{3, 6} a_{4, 5}) - a_{1, 3} (a_{2, 4} a_{5, 6} - a_{2, 5} a_{4, 6} + a_{2, 6} a_{4, 5}) \\ + a_{1, 4} (a_{2, 3} a_{5, 6} - a_{2, 5} a_{3, 6} + a_{2, 6} a_{3, 5}) - a_{1, 5} (a_{2, 3} a_{4, 6} - a_{2, 4} a_{3, 6} + a_{2, 6} a_{3, 4}) \\ + a_{1, 6} (a_{2, 3} a_{4, 5} - a_{2, 4} a_{3, 5} + a_{2, 5} a_{3, 4}) \\ = & a_{1, 2} a_{3, 4} a_{5, 6} - a_{1, 2} a_{3, 5} a_{4, 6} + a_{1, 2} a_{3, 6} a_{4, 5} - a_{1, 3} a_{2, 4} a_{5, 6} + a_{1, 3} a_{2, 5} a_{4, 6} \\ - a_{1, 3} a_{2, 6} a_{4, 5} + a_{1, 4} a_{2, 3} a_{5, 6} - a_{1, 4} a_{2, 5} a_{3, 6} + a_{1, 4} a_{2, 6} a_{3, 5} - a_{1, 5} a_{2, 3} a_{4, 6} \\ + a_{1, 5} a_{2, 4} a_{3, 6} - a_{1, 5} a_{2, 6} a_{3, 4} + a_{1, 6} a_{2, 3} a_{4, 5} - a_{1, 6} a_{2, 4} a_{3, 5} + a_{1, 6} a_{2, 5} a_{3, 4} . \end{matrix}

(A26)

This is the full expansion of the Pfaffian.

Appendix C. Block by Block Multiplication

In this context, it is important to realize that matrices can be multiplied block by block. Let us justify this using a simple example. Suppose that we have a

2 n \times 2 n

matrix A divided into four

n \times n

blocks,

A^{(1, 1)}

,

A^{(1, 2)}

,

A^{(2, 1)}

, and

A^{(2, 2)}

,

A = (\begin{matrix} A^{(1, 1)} & A^{(1, 2)} \\ A^{(2, 1)} & A^{(2, 2)} \end{matrix}),

(A27)

and a similar division of a second

2 n \times 2 n

matrix B divided similarly,

B = (\begin{matrix} B^{(1, 1)} & B^{(1, 2)} \\ B^{(2, 1)} & B^{(2, 2)} \end{matrix}) .

(A28)

The matrix elements of the product of these two matrices are

{[A B]}_{i k} = \sum_{j = 1}^{2 n} A_{i j} B_{j k} .

(A29)

The sum over j can be divided up into two sums,

{[A B]}_{i k} = \sum_{j = 1}^{n} A_{i j} B_{j k} + \sum_{j = n + 1}^{2 n} A_{i j} B_{j k} .

(A30)

Suppose that

1 \leq i \leq n

and

1 \leq k \leq n

. This gives the upper left entry in the block product. The matrix elements that appear in the product are

{[A B]}_{i k} = \sum_{j = 1}^{n} {[A^{(1, 1)}]}_{i j} {[B^{(1, 1)}]}_{j k} + \sum_{j = n + 1}^{2 n} {[A^{(1, 2)}]}_{i j} {[B^{(2, 1)}]}_{j k} = A^{(1, 1)} B^{(1, 1)} + A^{(1, 2)} B^{(2, 1)},

(A31)

where we wrote the result as a block-matrix product in the last equality. If

n + 1 \leq i \leq 2 n

and

1 \leq k \leq n

, we obtain the lower left entry in the block product. The matrix elements that appear in the sum are

{[A B]}_{i k} = \sum_{j = 1}^{n} {[A^{(1, 1)}]}_{i j} {[B^{(1, 2)}]}_{j k} + \sum_{j = n + 1}^{2 n} {[A^{(1, 2)}]}_{i j} {[B^{(2, 2)}]}_{j k} = A^{(1, 1)} B^{(1, 2)} + A^{(1, 2)} B^{(2, 2)} .

(A32)

If

n + 1 \leq i \leq 2 n

and

n + 1 \leq k \leq 2 n

, we obtain the lower right entry in the block product. The matrix elements that appear in the sum are

{[A B]}_{i k} = \sum_{j = 1}^{n} {[A^{(2, 1)}]}_{i j} {[B^{(1, 2)}]}_{j k} + \sum_{j = n + 1}^{2 n} {[A^{(2, 2)}]}_{i j} {[B^{(2, 2)}]}_{j k} = A^{(2, 1)} B^{(1, 2)} + A^{(2, 2)} B^{(2, 2)} .

(A33)

Now, if we calculate the product by merely using block indices, we obtain

A B = (\begin{matrix} [A^{(1, 1)} B^{(1, 1)} + A^{(1, 2)} B^{(2, 1)}] & [A^{(1, 1)} B^{(1, 2)} + A^{(2, 1)} B^{(2, 2)}] \\ [A^{(2, 1)} B^{(1, 1)} + A^{(2, 2)} B^{(2, 1)}] & [A^{(2, 1)} B^{(1, 2)} + A^{(2, 2)} B^{(2, 2)}] \end{matrix}),

(A34)

where we offset the individual blocks using brackets. This is the same result. The conclusion is that block matrices can be multiplied block by block.

Appendix D. Evaluation of t_ω,ω′

This is a derivation of the matrix element of the block Y in antidiagonal form.

\begin{matrix} t_{ω, ω^{'}} = \sum_{k = 1}^{n_{c}} v_{k}^{ω *} v_{n_{c} - k + 1}^{ω^{'}} & = & \frac{2}{n + 1} \sum_{k = 1}^{n_{c}} [\sqrt{\frac{2}{n_{c} + 1}} {(- i)}^{k + 1} sin (\frac{π ω k}{n_{c} + 1})] \\ \times [\sqrt{\frac{2}{n_{c} + 1}} i^{n_{c} - k + 2} sin (\frac{π ω^{'} (n_{c} - k + 1)}{n_{c} + 1})] . \end{matrix}

(A35)

Simplifying somewhat, the sum becomes

t_{ω, ω^{'}} = \frac{2}{n_{c} + 1} i^{n_{c} + 3} \sum_{k = 1}^{n_{c}} {(- 1)}^{k + 1} sin (\frac{π ω k}{n_{c} + 1}) sin (\frac{π ω^{'} (n_{c} - k + 1)}{n_{c} + 1}) .

(A36)

The next task is to evaluate the sum of the product of sines

S = 2 \sum_{k = 1}^{n_{c}} {(- 1)}^{k + 1} sin (\frac{π ω k}{n_{c} + 1}) sin (\frac{π ω^{'} (n_{c} + 1 - k)}{n_{c} + 1}) .

(A37)

First, the product of sines should be turned into a sum of cosines via the identity

sin u sin v = \frac{1}{2} [cos (u - v) - cos (u + v)]

. This gives

S = \sum_{k = 1}^{n_{c}} \{{(- 1)}^{k + 1} [cos (\frac{π ω k}{n_{c} + 1} - \frac{π ω^{'} (n_{c} + 1 - k)}{n_{c} + 1}) - cos (\frac{π ω k}{n_{c} + 1} + \frac{π ω^{'} (n_{c} + 1 - k)}{n_{c} + 1})]\},

(A38)

which can be put into exponential form as

S = Re \{\sum_{k = 1}^{n_{c}} [{(- 1)}^{k + 1} (e^{i \frac{(π ω k - π ω^{'} (n_{c} + 1 - k))}{n_{c} + 1}} - e^{i \frac{(π ω k + π ω^{'} (n_{c} + 1 - k))}{n_{c} + 1}})]\},

(A39)

where Re indicates that the real part of the sum is required. Because

ω^{'}

is a positive integer, there is a factor of

e^{\pm i π ω^{'}} = {(- 1)}^{ω^{'}}

in each exponential. We also notice that each term can be written as an exponential to the

k^{th}

power, and we have

S = {(- 1)}^{ω^{'} + 1} Re \{\sum_{k = 1}^{n_{c}} [{(- e^{i π \frac{(ω + ω^{'})}{n_{c} + 1}})}^{k} - {(- e^{i π \frac{(ω - ω^{'})}{n_{c} + 1}})}^{k}]\} .

(A40)

Each term is now in the form of a geometric series [20] and can be summed as

\sum_{k = 1}^{n_{c}} r^{k} = r \sum_{k = 1}^{n_{c}} r^{k - 1} = r (\frac{r^{n_{c}} - 1}{r - 1}),

(A41)

giving

S = {(- 1)}^{ω^{'} + 1} Re [(- e^{i π \frac{(ω + ω^{'})}{n_{c} + 1}}) \frac{{(- e^{i π \frac{(ω + ω^{'})}{n_{c} + 1}})}^{n_{c}} - 1}{(- e^{i π \frac{(ω + ω^{'})}{n_{c} + 1}} - 1)} - (- e^{i π \frac{(ω - ω^{'})}{n_{c} + 1}}) \frac{{(- e^{i π \frac{(ω - ω^{'})}{n_{c} + 1}})}^{n_{c}} - 1}{(- e^{i π \frac{(ω - ω^{'})}{n_{c} + 1}} - 1)}] .

Distributing the exponentials in each term will cause the first exponential in each numerator contain

\frac{n_{c} + 1}{n_{c} + 1}

and reduce to simpler exponentials as

S = {(- 1)}^{ω^{'} + 1} Re [- \frac{{(- 1)}^{n_{c} + 1} e^{i π (ω + ω^{'})} + e^{i π \frac{(ω + ω^{'})}{n_{c} + 1}}}{1 + e^{i π \frac{(ω + ω^{'})}{n_{c} + 1}}} + \frac{{(- 1)}^{n_{c} + 1} e^{i π (ω - ω^{'})} + e^{i π \frac{(ω - ω^{'})}{n_{c} + 1}}}{1 + e^{i π \frac{(ω - ω^{'})}{n_{c} + 1}}}] .

(A42)

By Euler’s Formula, the first term in each numerator reduces to

{(- 1)}^{ω + ω^{'} + n + 1}

and

{(- 1)}^{ω - ω^{'} + n + 1}

, respectively, yielding

S = {(- 1)}^{ω^{'} + 1} Re [- \frac{{(- 1)}^{ω + ω^{'} + n_{c} + 1} + e^{\frac{i π (ω + ω^{'})}{n_{c} + 1}}}{1 + e^{\frac{i π (ω + ω^{'})}{n_{c} + 1}}} + \frac{{(- 1)}^{ω - ω^{'} + n_{c} + 1} + e^{\frac{i π (ω - ω^{'})}{n_{c} + 1}}}{1 + e^{\frac{i π (ω - ω^{'})}{n_{c} + 1}}}] .

(A43)

The denominator of the first term in the brackets is discontinuous at

ω + ω^{'} = n_{c} + 1

, and so this geometric expansion can not be solved for instances when

ω + ω^{'} = n_{c} + 1

. Those instances will be discussed in Section 4.

Moving forward from here depends on whether the exponents of the sign terms inside the Re operation are odd or even. If

ω + ω^{'}

is even then

ω - ω^{'}

must also be even, so that both terms inside the square brackets have the same parity in the first term in their numerators. When

ω + ω^{'} + n + 1

and

ω - ω^{'} + n + 1

are both even, it is easy to see that

S

reduces to

S = {(- 1)}^{ω^{'} + 1} Re (- 1 + 1) = 0 .

(A44)

When

ω + ω^{'} + n + 1

and

ω - ω^{'} + n + 1

are both odd, the situation is more complicated. The first terms in each of the numerators equal

- 1

, and the sum simplifies to the form

S = {(- 1)}^{ω^{'} + 1} Re [\frac{- e^{\frac{i π (ω + ω^{'})}{n_{c} + 1}} - 1}{e^{\frac{i π (ω + ω^{'})}{n_{c} + 1}} + 1} + \frac{e^{\frac{i π (ω - ω^{'})}{n_{c} + 1}} - 1}{e^{\frac{i π (ω - ω^{'})}{n_{c} + 1}} + 1}] .

(A45)

From here, the numerators and denominators are close to being sines and cosines. The factor

e^{- \frac{i π (ω + ω^{'})}{2 (n_{c} + 1)}}

is multiplied both the numerator and denominator of the first exponential term, and

e^{- \frac{i π (ω - ω^{'})}{2 (n_{c} + 1)}}

is multiplied on the top and bottom of the second term, producing

S = {(- 1)}^{ω^{'} + 1} Re [- \frac{e^{\frac{i π (ω + ω^{'})}{2 (n_{c} + 1)}} - e^{- \frac{i π (ω + ω^{'})}{2 (n_{c} + 1)}}}{e^{\frac{i π (ω + ω^{'})}{2 (n_{c} + 1)}} + e^{- \frac{i π (ω + ω^{'})}{2 (n_{c} + 1)}}} + \frac{e^{\frac{i π (ω - ω^{'})}{2 (n_{c} + 1)}} - e^{- \frac{i π (ω - ω^{'})}{2 (n_{c} + 1)}}}{e^{\frac{i π (ω - ω^{'})}{2 (n_{c} + 1)}} + e^{- \frac{i π (ω - ω^{'})}{2 (n_{c} + 1)}}}] .

(A46)

Now, both terms in brackets have a sine in the numerator and a cosine in the denominator. Since

e^{i x} - e^{- i x} = 2 i sin x

and

e^{i x} + e^{- i x} = 2 cos x

,

S

is

S = {(- 1)}^{ω^{'} + 1} Re [- \frac{2 i sin (\frac{(ω + ω^{'}) π}{2 (n_{c} + 1)})}{2 cos (\frac{(ω + ω^{'}) π}{2 (n_{c} + 1)})} + \frac{2 i sin (\frac{(ω - ω^{'}) π}{2 (n_{c} + 1)})}{2 cos (\frac{ω - ω^{'}) π}{2 (n_{c} + 1)})}] .

(A47)

Because each term inside the Re operation is purely imaginary this is identically 0. Therefore, the matrix element

t_{ω ω^{'}}

matrix Y will be zero everywhere except those elements where

ω + ω^{'} = n_{c} + 1

. These elements in the matrix could not be determined through this method because they cause a singularity in a denominator.

To show the behavior of the elements of

t_{ω ω^{'}}

when

ω = ω^{'}

, one must backtrack to the general form of the sum

S

, which now is

S = 2 δ_{ω, n_{c} + 1 - ω^{'}} \sum_{k = 1}^{n_{c}} {(- 1)}^{k + 1} sin (\frac{π ω k}{n_{c} + 1}) sin (\frac{π ω^{'} (n_{c} + 1 - k)}{n_{c} + 1}),

(A48)

and substitute

ω = n_{c} + 1 - ω^{'}

to obtain

S = 2 δ_{ω, n_{c} + 1 - ω^{'}} \sum_{k = 1}^{n_{c}} {(- 1)}^{k + 1} sin (\frac{π (n_{c} + 1 - ω^{'}) k}{n_{c} + 1}) sin (\frac{π ω^{'} (n_{c} + 1 - k)}{n_{c} + 1}) .

(A49)

It is useful to break the arguments of the sines into separate terms to look for identities. This gives

S = δ_{ω, n_{c} + 1 - ω^{'}} \sum_{k = 1}^{n_{c}} {(- 1)}^{k + 1} sin (π k - \frac{π ω^{'} k}{n_{c} + 1}) sin (π ω^{'} - \frac{π ω^{'} k}{n_{c} + 1}) .

(A50)

The trigonometric identity

sin (u - v) = sin u cos u - cos u sin v

allows this to be written as

\begin{matrix} S & = & δ_{ω, n_{c} + 1 - ω^{'}} \sum_{k = 1}^{n_{c}} {(- 1)}^{k + 1} [sin (π k) cos (\frac{π ω^{'} k}{n_{c} + 1}) - cos (π k) sin (\frac{π ω^{'} k}{n_{c} + 1})] \\ \times [sin (π ω^{'}) cos (\frac{π ω^{'} k}{n_{c} + 1}) - cos (π ω^{'}) sin (\frac{π ω^{'} k}{n_{c} + 1})] . \end{matrix}

(A51)

Because k and

ω^{'}

are both integers, this reduces to

\begin{array}{l} S & = & δ_{ω, n_{c} + 1 - ω^{'}} \sum_{k = 1}^{n_{c}} {(- 1)}^{k + 1} [{(- 1)}^{k + 1} sin (\frac{π ω^{'} k}{n_{c} + 1})] [{(- 1)}^{ω^{'} + 1} sin (\frac{π ω^{'} k}{n_{c} + 1})] \end{array}

(A52)

\begin{matrix} = & δ_{ω, n_{c} + 1 - ω^{'}} {(- 1)}^{ω^{'} - 1} \sum_{k = 1}^{n_{c}} {sin}^{2} (\frac{π ω^{'} k}{n_{c} + 1}), \end{matrix}

(A53)

where the change in signs occurs via

{(- 1)}^{k + 1} {(- 1)}^{k + 1} {(- 1)}^{ω^{'} + 1} = {(- 1)}^{2 k} {(- 1)}^{3} {(- 1)}^{ω^{'}} = {(- 1)}^{ω^{'} + 1} = {(- 1)}^{ω^{'} - 1}

. Now, the trig identity

2 {sin}^{2} (u) = 1 - cos (2 u)

is used to give

S = δ_{ω, n_{c} + 1 - ω^{'}} {(- 1)}^{ω^{'} - 1} \sum_{k = 1}^{n_{c}} [1 - cos (\frac{2 π ω^{'} k}{n + 1})],

(A54)

which has been seen before in solving the normalization of

u_{q l}

, where the cosine sum is

- 1

, and so we have

S = {(- 1)}^{ω^{'} - 1} (n_{c} + 1) δ_{ω, n_{c} + 1 - ω^{'}} .

(A55)

The matrix elements can thus be written as

t_{ω, ω^{'}} = \frac{i^{n_{c} + 3}}{n_{c} + 1} S = \frac{i^{n_{c} + 3}}{n_{c} + 1} {(- 1)}^{ω^{'} - 1} (n_{c} + 1) δ_{ω, n_{c} + 1 - ω^{'}} .

(A56)

Recalling that

i^{n} = i^{n mod 4}

allows this to be written as

t_{ω, ω^{'}} = i^{n_{c} - 1} {(- 1)}^{ω^{'} - 1} δ_{ω, n_{c} + 1 - ω^{'}} .

(A57)

This can also be written as

t_{ω, ω^{'}} = i^{n_{c} - 1} {(- 1)}^{n_{c} - ω} δ_{ω^{'}, n_{c} + 1 - ω} .

(A58)

Appendix E. Evaluating the Determinant and the Partition Function

The determinant of a cruciform matrix is easily determined by expanding by the first row of the matrix and then in each minor expanding by the last row. Then, each new minor is expanded by the first row and each of the resulting minors by their bottom rows. This repetition creates

⌊ \frac{1}{2} (n + 1) ⌋

factors of the type

(d_{11} d_{n n} - d_{1 n} d_{n 1})

. The brackets

⌊ x ⌋

indicate the floor function, which gives the highest integer lower than x.

It is not immediately obvious that this is the correct expansion. This becomes clearer if we consider the general cruciform matrix

\bar{C}

| \bar{C} | = (\begin{matrix} d_{11} & d_{1, n} \\ d_{22} & d_{2, n - 1} \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ d_{n - 1, 2} & d_{n - 1, n - 1} \\ d_{n 1} & d_{n n} \end{matrix}) .

(A59)

The expansion begins with the first row as

\begin{matrix} | \bar{C} | = d_{11} & |\begin{matrix} d_{22} & d_{2, n - 1} & 0 \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ d_{n - 1, 2} & d_{n - 1, n - 1} \\ 0 & d_{n n} \end{matrix}| \\ + {(- 1)}^{n - 1} d_{1, n} |\begin{matrix} 0 & d_{22} & d_{2, n - 1} \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ d_{n - 1, 2} & d_{n - 1, n - 1} \\ d_{n 1} & 0 \end{matrix}|, \end{matrix}

(A60)

where the only two elements in row one are multiplied by their minors. A factor of

{(- 1)}^{n - 1}

is necessary to obtain the correct sign since it depends on the number of rows in the matrix. Then the remaining minors are expanded by their bottom rows. Note that this minor has

n - 1

rows, since its first row was eliminated,

\begin{matrix} | \bar{C} | = d_{11} d_{n n} & |\begin{matrix} d_{22} & d_{2, n - 1} \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ d_{n - 1, 2} & d_{n - 1, n - 1} \end{matrix}| \\ + {(- 1)}^{n - 2} {(- 1)}^{n - 1} d_{1 n} d_{n 1} |\begin{matrix} d_{22} & d_{2, n - 1} \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ d_{n - 1, 2} & d_{n - 1, n - 1} \end{matrix}| . \end{matrix}

(A61)

Notice that the second term in this expansion

d_{n 1}

was in the

n - 1

row and first column and so there is an additional factor of

{(- 1)}^{n - 2}

. In general, when taking a determinant, a negative sign is included with any element whose row and column positions sum to an odd number. The first term in the expansion of

| \bar{C} |

will always be positive because the subscripts of the elements on the main diagonal always sum to an even number. The sums of the row and column numbers of the elements on the antidiagonal will depend on whether n is odd or even. However, if the matrix

\bar{C}

has an odd number of rows and columns, then the minor obtained by expanding by a row will have an even number of rows and columns and vice versa. Therefore, either the element in the position

d_{1 n}

will have a negative sign or the one in position

d_{n 1}

will have a negative sign, but they will not both have one. The sign of the second term is then always negative, since

{(- 1)}^{n - 1} {(- 1)}^{n - 2} = - 1

. After factoring out the remaining determinant, the expansion by the first row can be written as

| \bar{C} | = (d_{11} d_{n n} - d_{1 n} d_{n 1}) |\begin{matrix} d_{22} & d_{2, n - 1} \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ d_{n - 1, 2} & d_{n - 1, n - 1} \end{matrix}| .

(A62)

The remaining determinant is expanded in the same way. This process is repeated until the matrix is fully expanded. The only thing to be aware of at this point is that if the matrix has odd dimension, the final remaining determinant will be of a single term.

The square of the partition function involves the evaluation of a cruciform matrix as

Z^{2} = det [B] = \prod_{W = 1}^{n_{r}} det Λ_{W},

(A63)

where the transformed matrix

Λ_{W} = X + 2 i sin (θ_{W}) Y

is

Λ_{W} = (\begin{matrix} 2 i x sin (φ_{1}) & {(- 1)}^{n_{c} - 1} ξ) \\ 2 i x sin (φ_{2}) & {(- 1)}^{n_{c} - 2} ξ \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ {(- 1)}^{1} ξ & 2 i x sin (φ_{n_{c} - 1}) \\ {(- 1)}^{0} ξ & 2 i x sin (φ_{n_{c}}) \end{matrix}),

(A64)

where

ξ = 2 i^{n_{c}} y sin θ_{W}

. The first iteration in the expansion of this cruciform matrix

Λ_{W}

is

\begin{matrix} det Λ_{W} = {[2 i x & sin (φ_{1})] [2 i x sin (φ_{n_{c}})] - [{(- 1)}^{n_{c} - 1} ξ] [{(- 1)}^{0} ξ]} \times \\ \times |\begin{matrix} 2 i x sin (φ_{2}) & {(- 1)}^{n_{c} - 2} ξ \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ {(- 1)}^{1} ξ & 2 i x sin (φ_{n_{c} - 1}) \end{matrix}| . \end{matrix}

(A65)

A second iteration gives

\begin{matrix} det Λ_{W} = {[2 i x & sin (φ_{1})] [2 i x sin (φ_{n_{c}})] - [{(- 1)}^{n_{c} - 1} ξ] [{(- 1)}^{0} ξ]} \\ \times {[2 i x sin (φ_{2})] [2 i x sin (φ_{n_{c} - 1})] - [{(- 1)}^{n_{c} - 2} ξ] [{(- 1)}^{1} ξ]} \\ \times |\begin{matrix} 2 i x sin (φ_{3}) & {(- 1)}^{n_{c} - 3} ξ \\ ⋱ & ⋰ \\ ⋰ & ⋱ \\ {(- 1)}^{2} ξ & 2 i x sin (φ_{n_{c} - 2}) \end{matrix}|, \end{matrix}

(A66)

which is enough to see the pattern in the terms in the resulting product. In general, this determinant of the eigenvalue matrix reduces to the form

\begin{matrix} det Λ_{W} = \prod_{s = 1}^{⌊ \frac{n}{2} ⌋} & \{[2 i x sin (φ_{s})] [2 i x sin (φ_{n_{c} + 1 - s})] - {(- 1)}^{n_{c} - s} {(- 1)}^{s - 1} ξ^{2}\} \\ \times \{\begin{matrix} 1, & n even \\ 2 i x sin (φ_{\frac{(n_{c} + 1)}{2}}) + {(- 1)}^{n_{c} - \frac{(n_{c} + 1)}{2}} ξ, & n_{c} odd \end{matrix} \end{matrix},

(A67)

where s is a placeholder positive integer for now. When

n_{c}

is odd, the center term of the matrix is multiplied after the product is taken. This is what becomes last term in the odd expansion.

Ideally what is needed is a product of ω over the entire matrix

n_{c}

, but the extra term in the middle for odd matrices disallows a single product expression over ω. Recall that ω was the original subscript to

φ

and that s is just a placeholder. To make the change to a product over ω, the expression must be rewritten. We substitute

ξ

and pull the factor of

i^{(n_{c} - 1)}

out of the square in the second term. Then, a factor of

i^{2}

is pulled out of all of the terms. This leaves

\begin{matrix} | Λ_{W} | & = & \prod_{s = 1}^{⌊ \frac{n_{c}}{2} ⌋} i^{2} \{[2 x sin (φ_{s})] [2 x sin (φ_{n_{c} + 1 - s})) {] - (- 1)}^{n_{c} - 1} {(i)}^{2 (n_{c} - 2)} {[2 i y sin (θ_{W})]}^{2}\} \\ \times \{\begin{matrix} 1, & n_{c} even \\ [2 i x sin (φ_{\frac{(n_{c} + 1)}{2}}) + {(- 1)}^{\frac{(n_{c} - 1)}{2}} 2 i^{n_{c}} y sin (θ_{W})], & n_{c} odd \end{matrix} . \end{matrix}

(A68)

Because of the behavior of i when exponentiated, the sign terms inside the bracket reduce via

{(- 1)}^{n_{c} - 1} {(i)}^{2 (n_{c} - 2)} = {(- 1)}^{n_{c} - 1} {(- 1)}^{n_{c}} = - 1

. The sign will always be

- 1

since

i^{2} = - 1

. Therefore,

| Λ_{W} |

can be written as

\begin{matrix} | Λ_{W} | & = & {(- 1)}^{⌊ \frac{n_{c}}{2} ⌋} \prod_{s = 1}^{⌊ \frac{n_{c}}{2} ⌋} \{[2 x sin (φ_{s})] [2 x sin (φ_{n_{c} + 1 - s})] + {[2 i y sin (θ_{W})]}^{2}\} \\ \times \{\begin{matrix} 1, & n_{c} even \\ [2 i x sin (φ_{\frac{(n_{c} + 1)}{2}}) + {(- 1)}^{\frac{(n_{c} - 1)}{2}} 2 i^{n_{c}} y sin (θ_{W})], & n_{c} odd \end{matrix}, \end{matrix}

(A69)

where

i^{2}

has been moved outside of the product.

The next step in working toward a sum over ω is to factor the first bracketed term. This yields

\begin{matrix} | Λ_{W} | = {(- 1)}^{⌊ \frac{n_{c}}{2} ⌋} \prod_{s = 1}^{⌊ \frac{n_{c}}{2} ⌋} & \{[2 x sin (φ_{s}) + 2 i y sin (θ_{ω})] [2 x sin (φ_{n_{c} + 1 - s}) + 2 i y sin (θ_{W})]\} \\ \times \{\begin{matrix} 1, & n_{c} even \\ [2 i x sin (φ_{\frac{(n_{c} + 1)}{2}}) + {(- 1)}^{\frac{(n_{c} - 1)}{2}} 2 i^{n_{c}} y sin (θ_{W})], & n_{c} odd \end{matrix} \end{matrix},

(A70)

where s skips over

\frac{n_{c} + 1}{2}

in the odd case for the moment. To arrive at this factorization, it is necessary to realize, from Equation (A70), that

sin (φ_{s}) = - sin (φ_{n_{c} + 1 - s})

, which makes the cross terms cancel. We will show this now. Since

φ_{s} = \frac{π}{2} + \frac{π s}{n_{c} + 1}

,

sin (φ_{n_{c} + 1 - s}) = sin (\frac{π}{2} + \frac{π (n_{c} + 1 - s)}{n_{c} + 1}) .

(A71)

After splitting up the fraction this is

sin (φ_{n_{c} + 1 - s}) = sin (\frac{π}{2} + \frac{π (n_{c} + 1)}{n_{c} + 1} - \frac{π s}{n_{c} + 1}) .

(A72)

which becomes

sin (φ_{n_{c} + 1 - s}) = sin (\frac{π}{2} + π - \frac{π s}{n_{c} + 1}) .

(A73)

Through the trig identity

sin (u \pm v) = sin u cos v s . \pm cos u sin v

, the expression becomes

sin (φ_{n_{c} + 1 - s}) = sin (\frac{π}{2}) cos (π - \frac{π s}{n_{c} + 1}) + cos (\frac{π}{2}) sin (π - \frac{π s}{n_{c} + 1}) .

(A74)

This is quickly reduced to

sin (φ_{n_{c} + 1 - s}) = cos (π - \frac{π s}{n_{c} + 1}) .

(A75)

Then by

cos (u \pm v) = cos u cos v s . \mp sin u sin v

the relationship is expressed as

sin (φ_{n_{c} + 1 - s}) = cos (π) cos (\frac{π s}{n_{c} + 1}) + sin (π) sin (\frac{π s}{n_{c} + 1}),

(A76)

which reduces to

sin (φ_{n_{c} + 1 - s}) = - cos (\frac{π s}{n_{c} + 1}) .

(A77)

A factor of

sin (\frac{π}{2})

is introduced, and then the relationship

sin (u \pm v) = sin u cos v s . \pm cos u sin v

is used in reverse, giving

sin (φ_{n_{c} + 1 - s}) = - sin (\frac{π}{2} + \frac{π s}{n_{c} + 1}) = sin (φ_{s}) .

(A78)

This relationship causes all of the cross-terms to cancel out and allows the factoring of the first term in the product of

| Λ_{W} |

.

This relationship is also useful for reducing the extra term in the odd expansion. In the case where

s = \frac{n_{c} + 1}{2}

, it is shown here that

sin (φ_{s}) = 0

.

sin φ_{s} = sin (φ_{\frac{n_{c} + 1}{2}}) = - sin (\frac{π}{2} + \frac{π}{n_{c} + 1} \frac{n_{c} + 1}{2}) = sin (π) = 0 .

(A79)

The term

[2 i x sin (φ_{\frac{(n_{c} + 1)}{2}}) + {(- 1)}^{\frac{(n_{c} - 1)}{2}} 2 i^{n_{c}} y sin (θ_{W})]

can now be easily manipulated. The first term is 0 due to the relationship above. Now the factor

{(- 1)}^{\frac{(n_{c} - 1)}{2}} i_{c}^{n}

in front of the

sin (θ_{W})

needs to be investigated. First we see that

{(- 1)}^{\frac{(n_{c} - 1)}{2}} i^{n_{c}} = {(i^{2})}^{\frac{(n_{c} - 1)}{2}} i^{n_{c}} = {(i)}^{2 n_{c} - n_{c} - 1} i^{n_{c}} = {(i)}^{2 n_{c}} {(i)}^{- 1} .

(A80)

Then because

i^{- 1} = - i

, and because this is the case where

n_{c}

is odd,

i^{2 n_{c}} = - 1

, and the above product reduces to

{(- 1)}^{\frac{(n_{c} - 1)}{2}} i^{n_{c}} = (- 1) (- i) = i .

(A81)

Now, the leftover term in the odd expansion can be written as

2 i y sin (θ_{W})

. The product can be written as a single product from

ω = 1

to

n_{c}

instead of being split into cases and instead of going to

⌊ \frac{n_{c}}{2} ⌋

.

\begin{matrix} | Λ_{W} {| = (- 1)}^{⌊ \frac{n_{c}}{2} ⌋} 2^{n_{c}} \prod_{ω = 1}^{n_{c}} [x sin (φ_{ω}) + i y sin (θ_{W})] \end{matrix},

(A82)

where a 2 has been factored out of each term in the product, giving a

2^{n_{c}}

out front. The two different sines,

sin (φ_{ω})

and

sin (φ_{n_{c} + 1 - ω})

, will take opposite signs as ω goes from 1 to

⌊ \frac{n_{c}}{2} ⌋

. In this form of

Λ_{W}

, the term

sin (φ_{ω})

will take all of the values that were formerly split between the combination of the two sines because the product now goes to

n_{c}

.

The arguments in

Λ_{W}

can be reduced slightly. By the trig identity

sin (u \pm v) = sin u cos v s . \pm cos u sin v

, any angle in the form

(\frac{π}{2} + \frac{π s}{n_{c} + 1})

can be written as

cos (\frac{π ω}{n_{c} + 1})

. Combining this with the definition of the state function in Equation (A63) gives

Z^{2} = \prod_{W = 1}^{n_{r}} {(- 1)}^{⌊ \frac{n_{c}}{2} ⌋} 2^{n_{c}} \prod_{ω = 1}^{n_{c}} [x cos (\frac{π ω}{n_{c} + 1}) + i y cos (\frac{π W}{n_{r} + 1})] .

(A83)

After pulling the terms between the products out, the state function is

Z^{2} = {(- 1)}^{n_{r} ⌊ \frac{n_{c}}{2} ⌋} 2^{n_{r} n_{c}} \prod_{W = 1}^{n_{r}} \prod_{ω = 1}^{n_{c}} [x cos (\frac{π ω}{n + 1}) + i y cos (\frac{π W}{n_{r} + 1})] .

(A84)

Equation (A84) is the exact square of the partition function of a finite

n_{r} \times n_{c}

lattice as a product of

n_{r} n_{c}

terms. If both

n_{r}

and

n_{c}

are odd, then there is a term in the product for which

ω = \frac{1}{2} (n_{c} + 1)

and

W = \frac{1}{2} (n_{r} + 1)

, which means that both the x and y parts vanish identically. The whole product is then multiplied by zero, correctly reflecting the fact that a lattice with an odd number of points cannot be occupied solely by dimers. Therefore, the lattice is required to have an even number of points and either

n_{r}

or

n_{c}

or both must be even.

Because the entropy will eventually be required, it would be best to have a partition function that is not squared. If it is assumed that

n_{c}

is even, the partition function can be manipulated into a more useable form. Holding

n_{c}

even instead of

n_{r}

is an arbitrary choice. Moving the 2 back inside the products makes the partition function take the form

Z^{2} = {(- 1)}^{n_{r} ⌊ \frac{n_{c}}{2} ⌋} \prod_{W = 1}^{n_{r}} \prod_{ω = 1}^{n_{c}} [2 x cos (\frac{π ω}{n_{c} + 1}) + 2 i y cos (\frac{π W}{n_{r} + 1})] .

(A85)

In the case where

n_{r}

is odd, and focusing only on the product over W, it is possible to write this as

{Z_{\begin{matrix} n_{c} = even \\ n_{r} = odd \end{matrix}}}^{2} = {(- 1)}^{n_{r} ⌊ \frac{n_{c}}{2} ⌋} \prod_{W = 1}^{n_{r}} \prod_{ω = 1}^{n_{c}} [a + 2 i y cos (\frac{π W}{n_{r} + 1})],

(A86)

where

a = 2 x cos (\frac{π ω}{n_{c} + 1})

. Comparing the factor with

W = 1

to the factor with

W = n_{r} - 1

in the cosine shows that one of those terms is the negative of the other via

cos (\frac{π n_{r}}{n_{r} + 1}) = cos (π - \frac{π}{n_{r} + 1}) = - cos (\frac{π}{n_{r} + 1}) .

(A87)

This relationship holds for the pair

W = 2

and

W = n_{r} - 1

and so on. Thus, for each term in the product, there will appear a complex conjugate of that term. The only exception is the unpaired term where

cos (\frac{π n_{r}}{n_{r} + 1}) = cos (\frac{π}{2}) = 0

, and so that term must be written separately. With the terms paired off, the product can be cut in half.

{Z_{\begin{matrix} n = even \\ m = odd \end{matrix}}}^{2} = {(- 1)}^{m ⌊ \frac{n}{2} ⌋} \{\prod_{r = 1}^{n} \prod_{q = 1}^{⌊ \frac{m}{2} ⌋} [a^{2} - {(2 i y cos (\frac{π q}{m + 1}))}^{2}]\} \{\prod_{r = 1}^{n} (a + 0)\} .

(A88)

Substituting a gives

\begin{matrix} {Z_{\begin{matrix} n_{c} = even \\ n_{r} = odd \end{matrix}}}^{2} & = & {(- 1)}^{n_{r} ⌊ \frac{n_{c}}{2} ⌋} \{\prod_{ω = 1}^{n_{c}} \prod_{W = 1}^{⌊ \frac{n_{r}}{2} ⌋} [4 x^{2} {cos}^{2} (\frac{π ω}{n_{c} + 1}) + 4 y^{2} {cos}^{2} (\frac{π W}{n_{r} + 1})]\} \\ \times \{\prod_{ω = 1}^{n_{c}} 2 x cos (\frac{π ω}{n_{c} + 1})\} . \end{matrix}

(A89)

The last product reduces to

x^{n_{c}} {(- 1)}^{\frac{n_{c}}{2}}

via the identity

\prod_{k = 1}^{n - 1} cos \frac{k π}{n} = \frac{sin \frac{π n}{2}}{2^{n - 1}}

, as can be seen here,

\prod_{ω = 1}^{n_{c}} 2 x cos (\frac{π ω}{n_{c} + 1}) = 2^{n_{c}} x^{n_{c}} \prod_{ω = 1}^{n_{c}} cos (\frac{π ω}{n_{c} + 1}) = 2^{n_{c} + 1} x^{n_{c}} \frac{sin \frac{π (n_{c} + 1))}{2}}{2^{n_{c}}} = x^{n_{c}} {(- 1)}^{\frac{n_{c}}{2}} .

(A90)

The sign factor

{(- 1)}^{\frac{n_{c}}{2}}

may be combined with the already existing sign factor to cancel as

{(- 1)}^{n_{r} ⌊ \frac{n_{c}}{2} ⌋} {(- 1)}^{\frac{n_{c}}{2}} = 1

when

n_{c}

is even and

n_{r}

is odd. Then,

{Z_{\begin{matrix} n_{c} = even \\ n_{r} = odd \end{matrix}}}^{2} = x^{n_{c}} \prod_{ω = 1}^{n_{c}} \prod_{W = 1}^{⌊ \frac{n_{r}}{2} ⌋} [4 x^{2} {cos}^{2} (\frac{π ω}{n_{c} + 1}) + 4 y^{2} {cos}^{2} (\frac{π W}{n_{r} + 1})] .

(A91)

Using the same sort of expansion over the

n_{c}

as was used for the product over

n_{r}

gives

{Z_{\begin{matrix} n_{c} = even \\ n_{r} = odd \end{matrix}}}^{2} = x^{n_{c}} \prod_{ω = 1}^{\frac{n_{c}}{2}} \prod_{W = 1}^{⌊ \frac{n_{r}}{2} ⌋} {[4 x^{2} {cos}^{2} (\frac{π ω}{n_{c} + 1}) + 4 y^{2} {cos}^{2} (\frac{π W}{n_{r} + 1})]}^{2} .

(A92)

Equation (A84) is the exact partition function of a finite

n_{r} \times n_{c}

lattice as a product of

n_{r} n_{c}

terms. If both

n_{c}

and

n_{r}

are odd, then there is a term in the product for which

r ω = \frac{1}{2} (n_{c} + 1)

and

W = \frac{1}{2} (n_{r} + 1)

, which means that both the x and y parts vanish identically. The whole product is then multiplied by zero, correctly reflecting the fact that a lattice with an odd number of points can not be occupied solely by dimers. Therefore, the lattice is required to have an even number of points, and either

n_{r}

or

n_{c}

or both must be even.

In the case for which

n_{r}

is even, the solution follows the same process. The key differences are that the product over

n_{r}

will not have an unpaired term when

n_{r}

is even and the sign term in Z reduces identically to 1. Thus, the partition function is

Z = 2^{\frac{n_{c}}{2} ⌊ \frac{n_{r}}{2} ⌋} \prod_{ω = 1}^{\frac{n_{c}}{2}} \prod_{W = 1}^{⌊ \frac{n_{r}}{2} ⌋} [x^{2} (1 + cos (\frac{2 π ω}{n_{c} + 1})) + y^{2} (1 + cos (\frac{2 π W}{n_{r} + 1}))] \times \{\begin{matrix} 1 & n_{r} even \\ x^{\frac{n_{c}}{2}} & n_{r} odd \end{matrix} .

(A93)

References

Hohenberg, P.; Kohn, W. Inhomogeneous Electron Gas. Phys. Rev. 1964, 136, B864. [Google Scholar] [CrossRef]
Bishop, M.F.; Ferrone, F.A. The Sickle-Cell Fiber Revisited. Biomolecules 2023, 13, 413. [Google Scholar] [CrossRef] [PubMed]
Cao, Z.; Ferrone, F.A. Homogeneous Nucleation in Sickle Hemoglobin: Stochastic Measurements with a Parallel Method. Biophys. J. 1997, 72, 342–352. [Google Scholar] [CrossRef] [PubMed]
Zhang, X.; Dai, X.; Gao, L.; Xu, D.; Wan, H.; Wang, Y.; Yan, L.T. The entropy-controlled strategy in self-assembling systems. Chem. Soc. Rev. 2023, 52, 6806–6837. [Google Scholar] [CrossRef] [PubMed]
Plischke, M.; Bergersen, B. Equilibrium Statistical Mechanics; World Scientific: Singapore, 2006. [Google Scholar]
Temperley, H.N.V.; Fisher, M.E. Dimer problem in statistical mechanics-an exact result. Phil. Mag. 1961, 6, 1061–1063. [Google Scholar] [CrossRef]
Fisher, M.E. Statistical Mechanics of Dimers on a Plane Lattice. Phys. Rev. 1961, 124, 1664–1672. [Google Scholar] [CrossRef]
Kasteleyn, P.W. The Statistics of Dimers on a Lattice: I. The number of dimer arrangements on a quadratic lattice. Physica 1961, 27, 1209–1225. [Google Scholar] [CrossRef]
Bishop, M.F.; McMullen, T. Lattice-gas model of DNA charge inversion by a positively charged polyelectrolyte. Phys. Rev. E 2006, 74, 021906. [Google Scholar] [CrossRef] [PubMed]
Sievert, M.D.; Bishop, M.F.; McMullen, T. Entropy of Charge Inversion in DNA including One-Loop Fluctuations. Entropy 2023, 25, 1373. [Google Scholar] [CrossRef] [PubMed]
Feynman, R.P. Quantum Electrodynamics; Frontiers in Physics; W. A. Benjamin: New York, NY, USA, 1961. [Google Scholar]
Feynman, R.P. The Theory of Positrons. Phys. Rev. 1949, 76, 749. [Google Scholar] [CrossRef]
Thomson, M. Modern Particle Physics; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar]
Fisher, M.E. On the Dimer Solution of Planar Ising Models. J. Math. Phys. 1966, 7, 1776–1781. [Google Scholar] [CrossRef]
Baker, J.C., III. Application of the Fisher Dimer Model to DNA Condensation. Master’s Thesis, Virginia Commonwealth University, Richmond, VA, USA, 2017. [Google Scholar]
Gradshteyn, I.; Ryzhik, I. Table of Integrals, Series, and Products; Elsevier: Boston, MA, USA, 2007. [Google Scholar]
Allegra, N.; Fortin, J.Y. Grassmannian representation of the two-dimensional monomer-dimer model. Phys. Rev. E 2014, 89, 062107. [Google Scholar] [CrossRef] [PubMed]
Moore, G.; Read, N. Nonabelions in the Fractional Quantum Hall Effect. Nucl. Phys. B 1991, 360, 362–396. [Google Scholar] [CrossRef]
Fradkin, E.; Nayak, C.; Tsvelik, A.; Wilczek, F. A Chern-Simons effective field theory for the Pfaffian quantum Hall state. Nucl. Phys. B 1998, 516, 704–718. [Google Scholar] [CrossRef]
Dwight, H.B. Table of Integrals and Other Mathematical Data, 4th ed.; Macmillan: Toronto, ON, Canada, 1961. [Google Scholar]

Figure 1. A simple square lattice with lattice points represented by red dots and the links represented by red dashed lines.

Figure 2. A simple square lattice with lattice points represented by red dots and the links represented by red dashed lines. The blue dots label dual lattice points, and the links of the dual lattice are blue dashed lines.

Figure 3. Complete filling of dimers (green) on the simple square lattice (red dots and dashes).

Figure 4. Serpentine numbering of the lattice sites in green. The row and column numbers of the lattice are also shown The red dots and dashes outline the direct lattice, and the blue dots and dashes the dual lattice.

Figure 5. Alternative way of numbering of links of the lattice by using pairs of site indices, shown in blue, on top of Figure 4.

Figure 6. A smaller

12 \times 12

example with four rows and three columns, but similar to Figure 5, where the serpentine numbering is shown in green, and the alternative numbering in blue. As before, the red dots and dashes outline the direct lattice, and the blue dots and dashes the dual lattice.

Figure 7. Square holes in a simple square lattice with a toy with a square shaft that can fit into a hole in one of four directions.

Figure 8. Toys randomly arranged on the simple square lattice, shown as the red square holes and connecting dashes. The half-dimers are represented by the green lines for this particular member of the ensemble, with the red circles the outlines of the toys in Figure 7. Not all the half-dimers point toward each other in pairs, and so taking the trace over link fields will cause this distribution to be removed from the ensemble.

Figure 9. Half-dimers arranged on the simple square lattice. In the left figure, the dimers all point toward one another in pairs, so that taking the trace over link fields will retain this as a member of the ensemble, unlike the random arrangement of Figure 8. This corresponds to the completely filled dimer arrangement on the right, which is the same as in Figure 3.

Figure 10. The block matrix form of the

12 \times 12

antisymmetric matrix. The red lines outline the

4 \times 4

blocks, and the light blue X, Y and

- Y

labels in the background signify the X, Y and

- Y

blocks of the matrix.

Figure 11. Discrete values of the angle

φ_{ω}

for the example with eleven columns.

Figure 12. Eigenvalue

λ_{ω}

for the example with eleven columns.

Figure 13. First four and the last eigenvectors

v_{k}^{ω}

for the example with eleven columns.

Figure 14. A two-leg ladder ((top) figure) viewed from above ((bottom) figure). The y dimers are shown in orange, with the x dimers shown in green. The bottom picture could be considered to be a linear lattice in which the orange y dimers appear to be monomers. The red dots and dashes outline the lattices.

Figure 15. Imagine squeezing the rungs of the ladder together, producing the orange disks from the original y-oriented dimers.

Figure 16. The plot of the two entropies given by the lattice gas model and the Fisher model of dimers.

Figure 17. The average occupation of species in the Fisher model of dimers showing species mixing even at large negative binding energy. The purple dashed line is a numerical check showing that the sum of occupancies is one, as demonstrated analytically in the text.

Figure 18. The average occupation of species in the Fisher model of dimers showing species mixing even at large negative binding energy.

Figure 19. A comparison of the total charge vs. the binding energy on the lattice between the lattice gas model and the Fisher model.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Entropies of the Classical Dimer Model

Abstract

1. Introduction

2. Partition Function, Entropy, and Occupancy

3. The Lattice and Its Dual

4. The Dimer Model

5. Inclusion of Constraints on the Partition Function by Use of a Child’s Toy

6. Introduction of the Antisymmetric Matrix

7. Eigenvalues and Eigenvectors of the Asymmetric Matrix

8. Dimer Partition Function in the Large Lattice Limit

9. Colored y Dimers and the Partition Function for Dimers and Monomers

10. The Linear Chain

11. Results for the Entropy, Average Occupation, and Total Charge

12. Discussion

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Trace Theorems of the A^μ Matrices

Appendix B. Pfaffians and Determinants

Appendix C. Block by Block Multiplication

Appendix D. Evaluation of t_ω,ω′

Appendix E. Evaluating the Determinant and the Partition Function

References

Article Metrics

Citations

Article Access Statistics

Entropies of the Classical Dimer Model

Abstract

1. Introduction

2. Partition Function, Entropy, and Occupancy

3. The Lattice and Its Dual

4. The Dimer Model

5. Inclusion of Constraints on the Partition Function by Use of a Child’s Toy

6. Introduction of the Antisymmetric Matrix

7. Eigenvalues and Eigenvectors of the Asymmetric Matrix

8. Dimer Partition Function in the Large Lattice Limit

9. Colored y Dimers and the Partition Function for Dimers and Monomers

10. The Linear Chain

11. Results for the Entropy, Average Occupation, and Total Charge

12. Discussion

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Trace Theorems of the Aμ Matrices

Appendix B. Pfaffians and Determinants

Appendix C. Block by Block Multiplication

Appendix D. Evaluation of tω,ω′

Appendix E. Evaluating the Determinant and the Partition Function

References

Article Metrics

Citations

Article Access Statistics

Appendix A. Trace Theorems of the A^μ Matrices

Appendix D. Evaluation of t_ω,ω′