Open Access
This article is

- freely available
- re-usable

*Entropy*
**2013**,
*15*(11),
4956-4968;
doi:10.3390/e15114956

Article

Beyond Landauer Erasure

^{1}

School of Physics and Astronomy, University of Glasgow, Glasgow G12 8QQ, UK

^{2}

Centre for Quantum Computation and Communication Technology (Australian Research Council), Centre for Quantum Dynamics, Griffith University, Brisbane, Queensland 4111, Australia

^{*}

Author to whom correspondence should be addressed.

Received: 1 August 2013; in revised form: 29 October 2013 / Accepted: 31 October 2013 / Published: 13 November 2013

## Abstract

**:**

In thermodynamics, one considers thermal systems and the maximization of entropy subject to the conservation of energy. A consequence is Landauer’s erasure principle, which states that the erasure of one bit of information requires a minimum energy cost equal to $kTln\left(2\right)$, where T is the temperature of a thermal reservoir used in the process and k is Boltzmann’s constant. Jaynes, however, argued that the maximum entropy principle could be applied to any number of conserved quantities, which would suggest that information erasure may have alternative costs. Indeed, we showed recently that by using a reservoir comprising energy degenerate spins and subject to conservation of angular momentum, the cost of information erasure is in terms of angular momentum rather than energy. Here, we extend this analysis and derive the minimum cost of information erasure for systems where different conservation laws operate. We find that, for each conserved quantity, the minimum resource needed to erase one bit of memory is ${\lambda}^{-1}ln\left(2\right)$, where λ is related to the average value of the conserved quantity. The costs of erasure depend, fundamentally, on both the nature of the physical memory element and the reservoir with which it is coupled.

Keywords:

thermodynamics; information erasure; maximum entropy principle; spin reservoirClassification:

**PACS**05.70.-a; 03.65.Ta; 03.67.-a; 89.70.-a

## 1. Introduction

The idea of a link between information and thermodynamics can be traced back to Maxwell’s demon, a supposed microscopic intelligent being, the actions of which might present a challenge to the second law of thermodynamics [1,2,3]. This idea was made quantitative by Szilard, who showed, by means of a simple one-molecule gas, that information acquisition, for example, by a Maxwell demon, is necessarily accompanied by an entropy increase of not less than $kln\left(2\right)$ [4], where k is Boltzmann’s constant. A closely related phenomenon is the demonstration, due to Landauer, that erasing an unknown bit of information requires energy to be dissipated as heat, amounting to not less than $kTln\left(2\right)$ [5,6,7], where T is the temperature of the environment surrounding the bit.

Information theory is usually cast in a form that is independent of any particular physical realization. In particular, information processing could take place within degenerate manifolds of identical energy. In such a situation, it is difficult to see what role the energy could play. It would then be natural to question the place of Landauer’s erasure principle relying, as it does, on the thermal energy of the surrounding reservoir. Indeed, we have shown that information can be erased at the cost of spin angular momentum and not energy [8]. We explore this idea further here. We show, in particular, that the energy does not have a special place among conserved quantities. Rather, Landauer’s principle needs to be supplemented with costs associated with each of the conserved quantities.

The theoretical foundation for our new result lies in Jaynes’ maximum entropy principle [9,10], which is colloquially called the MaxEnt principle. It applies when we have only partial information about a system. The assignment of a probability distribution to account for the missing information that maximizes the entropy “...is the least biased estimate possible on the given information; i.e., it is maximally noncommittal with respect to the missing information” [9]. It has repercussions for erasure, because the entropy of the environment increases as it absorbs the entropy of the erased information, and any physical resources linked to the entropy change as a result.

The organisation of this paper is as follows. In Section 2, we review the MaxEnt principle in as much as it applies to information erasure. We then review an archetypical erasure model in Section 3. The main results, the minimum cost of erasure in terms of a number of conserved quantities, are presented in Section 4, and we end with a discussion in Section 5.

## 2. General Principles for Information Erasure

To perform information erasure, we need a system, which we shall call a reservoir, that is capable of absorbing the entropy of the unwanted information. Typically, this means that the reservoir will be large in that it will have very many degrees of freedom. The kind of reservoir we wish to analyze differs from the familiar thermal reservoir in that it is not necessarily characterised by an average energy or temperature. Instead, the only information we have about the reservoir is the expectation values for a number of physical variables. Let these expectation values be written as $\langle {\widehat{V}}_{k}\rangle $, where ${\widehat{V}}_{k}$ is a physical variable, such as energy or angular momentum, etc., and k is an integer that indexes different variables. We assume that the variables are independent, and so in quantum theory, they are represented by commuting self-adjoint operators. Then, according to the MaxEnt principle, the best description of the state of the system is one for which entropy is maximized, in which case, the density operator for the reservoir is given by:
and its Shannon entropy is [9]:
Here, ${\lambda}_{k}$ are Lagrange multipliers. If the system is altered in such a way that the expectation values, $\langle {\widehat{V}}_{k}\rangle $, and operators, ${\widehat{V}}_{k}$, change independently of each other, then a small change in entropy is given by [9]:
Here, $\delta \langle {\widehat{V}}_{k}\rangle $ denotes a change in the expectation value of ${\widehat{V}}_{k}$, whereas $\langle \delta {\widehat{V}}_{k}\rangle $ is the expectation value of a change in the operator, ${\widehat{V}}_{k}$, itself. The variables are not necessarily conserved, but rather, their expectation values are presumed to be known at all times.

$$\widehat{\rho}=exp\left[-\left({\lambda}_{0}+\sum _{k}{\lambda}_{k}{\widehat{V}}_{k}\right)\right]$$

$${S}_{r}={\lambda}_{0}+\sum _{k}{\lambda}_{k}\langle {\widehat{V}}_{k}\rangle \phantom{\rule{4pt}{0ex}}$$

$$\delta {S}_{r}=\sum _{k}{\lambda}_{k}(\delta \langle {\widehat{V}}_{k}\rangle -\langle \delta {\widehat{V}}_{k}\rangle )\phantom{\rule{4pt}{0ex}}$$

Let the memory be a two-state system that is initially in a maximally mixed state with a Shannon entropy of ${S}_{m}=ln2$. Erasing the information represented by this state will entail a physical process that leaves the memory in a predetermined pure state, and thus, the memory will suffer a change in entropy of $\delta {S}_{m}=-ln2$. If the erasure process is reversible, the total entropy of the combined system will be unchanged, and so:
Given $\delta {S}_{m}=-ln2$, the entropy of the reservoir will necessarily be increased by $\delta {S}_{r}=ln2$ in the process. If the erasure leaves the variables, ${\widehat{V}}_{k}$, unchanged (i.e., $\delta {\widehat{V}}_{k}=0$), then the only way an entropy increase can be accommodated is by an increase in the expectation values as follows:
which becomes:
This result shows that the reservoir undergoes physical changes as a result of the erasure. In particular, the changes to the variables, $\langle {\widehat{V}}_{k}\rangle $, represent the physical costs of the process. There is no distinction between the relative sizes of the cost associated with each variable; the only requirement is that the weighted sum of the costs be equal to $ln2$. The cost of erasure could be paid in terms of just a single variable, say $k=1$, in which case:
If ${\widehat{V}}_{1}$ represents the Hamiltonian, $\widehat{H}$, then ${\lambda}_{1}=1/kT$, where k is Boltzmann’s constant and T is the temperature, and we recover Landauer’s principle:
This need not be the case, however; the erasure mechanism we reported in [8] corresponds to ${\widehat{V}}_{1}$ representing the z component of spin angular momentum, ${\widehat{J}}_{z}$, in which case, the corresponding cost of erasure is in terms of spin angular momentum and not energy. More will be said about this result later.

$$0=\delta {S}_{r}+\delta {S}_{m}\phantom{\rule{4pt}{0ex}}$$

$$\delta {S}_{r}=\sum _{k}{\lambda}_{k}\delta \langle {\widehat{V}}_{k}\rangle $$

$$ln2=\sum _{k}{\lambda}_{k}\delta \langle {\widehat{V}}_{k}\rangle \phantom{\rule{4pt}{0ex}}$$

$$\delta \langle {\widehat{V}}_{1}\rangle =\frac{1}{{\lambda}_{1}}ln2\phantom{\rule{4pt}{0ex}}$$

$$\delta \langle \widehat{H}\rangle =kTln2\phantom{\rule{4pt}{0ex}}$$

## 3. Archetypal Erasure Model

As the forgoing demonstrates, erasure entails transferring entropy from memory to the reservoir. A generic way to do this is as follows [8]. Let the memory be represented by a two-state system with energy eigenstates, $|0\rangle $ and $|1\rangle $, which we will refer to as logic states. Next, imagine the memory is in an initially unknown state and that we wish to reset it by forcing it into the logic state, $|0\rangle $. Let the two states, $|0\rangle $ and $|1\rangle ,$ be initially degenerate, with energy of zero. We can erase the memory by placing it in contact with a conventional thermal reservoir at temperature T and then inducing an energy splitting between the states, so that $|0\rangle $ has energy of zero, but the state, $|1\rangle $, has energy of E, as illustrated in Figure 1. The splitting is induced quasi-statically, that is, sufficiently slowly so that the memory system is in thermal equilibrium with the thermal reservoir. The state of the memory when the energy splitting is E is governed by the Boltzmann (or maximum entropy) distribution with density operator:
The work required to increase the splitting from E to $E+dE$, while in contact with the reservoir, is, given the probability of occupying the state, $|1\rangle $, multiplied by $dE$, that is $dW={e}^{-E/kT}{(1+{e}^{-E/kT})}^{-1}dE$. The total work in increasing the splitting from zero to infinity is $W=\int dW=kTln2$, and the final state of the memory system is the logic state, $|0\rangle $, as required. The memory is removed from the reservoir and the energy degeneracy then restored. The erasure here is driven by maximizing the entropy subject to the conservation of energy as the energy gap between the states of the memory system grows.

$$\widehat{\rho}=\frac{|0\rangle \langle 0|+{e}^{-E/kT}|1\rangle \langle 1|}{1+{e}^{-E/kT}}\phantom{\rule{4pt}{0ex}}$$

**Figure 1.**Energy level diagram for the archetypal erasure model. For clarity, only a few representative energy levels in the continuous energy spectrum of the reservoir are shown. The reservoir and memory remain in thermal equilibrium and exchange energy, while the energy gap between the states of the memory is slowly increased. The size of the circles on each level indicates the relative probabilities of the states.

## 4. Erasure with Multiple Costs

We now use the principle underlying the archetypal erasure model to construct a more general erasure process. Consider the situation in which the memory logic states, $|0\rangle $ and $|1\rangle $, are associated with different eigenvalues of another conserved observable in addition to energy. For definiteness, we take this observable to be the z component of angular momentum and the memory to be a spin-$\frac{1}{2}$ particle with $|0\rangle $ representing the eigenstate with eigenvalue $-\frac{1}{2}\hslash $ and $|1\rangle $ representing the eigenstate with eigenvalue $\frac{1}{2}\hslash $. Let the reservoir of the previous section be constructed of a collection of N such particles. Imagine that the information known about the reservoir is the expectation values of its total energy and the z component of angular momentum, $\langle \widehat{H}\rangle $ and $\langle {\widehat{J}}_{z}\rangle $, respectively. According to the MaxEnt principle, the best description of the state of the reservoir is given by:
where μ, β and γ are appropriate Lagrange multipliers. We will investigate the situation where β is independent of γ in the following section. For the moment, however, let us consider the situation where the internal spin and energy are correlated. A case in point is where each spin particle experiences the same magnetic field orientated in the z direction. Then, the logical states will be Zeeman shifted in energy with, for example, $|1\rangle $ being at an energy ε higher than $|0\rangle $. In this case, we can write the Hamiltonian, $\widehat{H}$, as a sum of an internal term, ${\widehat{H}}_{in}$, corresponding to the Zeeman shift, and an external term, ${\widehat{H}}_{ex}$, corresponding to the motional degrees of freedom of the particles. Because ${\widehat{H}}_{in}$ and ${\widehat{J}}_{z}$ share the same eigenstates, we can write:
and it immediately follows that knowing both $\langle {\widehat{H}}_{in}\rangle $ and $\langle {\widehat{J}}_{z}\rangle $ does not give any more information about the state of the reservoir than knowing just one. In fact, in this case, Equation (10) reduces to:

$$\widehat{\rho}=exp[-(\mu +\beta \widehat{H}+\gamma {\widehat{J}}_{z})]$$

$${\widehat{H}}_{in}=\frac{\epsilon}{\hslash}{\widehat{J}}_{z}\phantom{\rule{4pt}{0ex}}$$

$$\widehat{\rho}=exp[-(\mu +\beta {\widehat{H}}_{ex}+\gamma \hslash {\widehat{H}}_{in}/\epsilon )]=exp[-(\mu +\beta {\widehat{H}}_{ex}+\gamma {\widehat{J}}_{z})]\phantom{\rule{4pt}{0ex}}$$

If we allow the internal spin states and external motion to be at the same temperature T, then $\beta =\gamma \hslash /\epsilon =1/kT$ and:

$$\widehat{\rho}=exp\{-[\mu +\beta ({\widehat{H}}_{ex}+{\widehat{H}}_{in})]\}=exp\{-[\mu +\beta ({\widehat{H}}_{ex}+\epsilon {\widehat{J}}_{z}/\hslash )]\}\phantom{\rule{4pt}{0ex}}$$

It is convenient to use a set of basis states, $|n,\nu \rangle $, to describe the internal collective spin state of the reservoir in terms of the number of spins, n, in the logical state, $|1\rangle $ (i.e., spin up), and the multiplicity index, ν, which uniquely specifies an arrangement of n spins in the state, $|1\rangle $, and $N-n$ in the state, $|0\rangle $. The full basis set is given by $\left\{\right|n,{\nu}_{n}\rangle :n=0,\dots ,N;{\nu}_{n}=1,\dots ,\left({}_{n}^{N}\right)\}$. The energy and z component of the spin of the state, $|n+1,\nu \rangle $, is ε and ℏ, respectively, higher than the state, $|n,\nu \rangle $. According to Equation (13), the probability, ${P}_{n,\nu}$, describing the occupation of the reservoir state, $|n,\nu \rangle $, is proportional to ${e}^{-n\epsilon /kT}$. Taking into account the $\left({}_{n}^{N}\right)$-fold degeneracy of this state then leads to the normalised probability distribution:
which is independent of the index, ν.

$${P}_{n,\nu}=\frac{{e}^{-n\epsilon /kT}}{{(1+{e}^{-\epsilon /kT})}^{N}}$$

As before, the memory spin is in an unknown (maximally mixed) state, and we wish to return it to the logical zero state. The first stage of our erasure scheme entails putting the memory spin in thermal and spin exchange contact with the reservoir, letting the combined system come to equilibrium, and then separating the memory spin from the reservoir. At this point, the state of the memory spin is:
with ${p}_{1}={e}^{-\epsilon /kT}/(1+{e}^{-\epsilon /kT})=1-{p}_{0}$. We assume throughout that the reservoir is sufficiently large so that the temperature, T, is unaffected by the erasure process and that there are a large number of ancillary spins, all in the state, $|0\rangle $, at our disposal. A CNOToperation [11,12] is then performed on the memory spin and one ancilla spin, with the former being the control qubit; this yields the state, ${p}_{0}|00\rangle \langle 00|+{p}_{1}|11\rangle \langle 11|$, where $|xy\rangle $ represents the state, $|x\rangle $, of the memory spin and $|y\rangle $ of the ancilla spin. The spin cost of this operation is $\hslash {p}_{1}={e}^{-\epsilon /kT}\hslash /(1+{e}^{-\epsilon /kT})$, and the energy cost is $\epsilon {p}_{1}={e}^{-\epsilon /kT}\epsilon /(1+{e}^{-\epsilon /kT})$. This system is again placed in thermal and spin exchange contact with the reservoir. The thermal and spin exchange between the reservoir and the memory-ancilla system is constructed to leave all states unchanged, except for the following mapping:
where $|n,\nu \rangle |ij\rangle $ represents the reservoir collective state, $|n,\nu \rangle $, and memory-ancilla system state, $|ij\rangle $. The thermal and spin exchange continues for sufficient time for the reservoir and memory-ancilla system to equilibrate. The state of the memory-ancilla system is then given by:
where now ${p}_{1}={e}^{-2\epsilon /kT}/(1+{e}^{-2\epsilon /kT})=1-{p}_{0}$. Another ancilla spin is added, and a CNOT operation is performed as before to yield the state, ${p}_{0}|000\rangle \langle 000|+{p}_{1}|111\rangle \langle 111|$, with spin and energy costs, ${e}^{-2\epsilon /kT}\hslash /(1+{e}^{-2\epsilon /kT})$ and ${e}^{-2\epsilon /kT}\epsilon /(1+{e}^{-2\epsilon /kT})$, respectively. The combined memory-ancilla system is put in thermal and spin-exchange contact with the reservoir with the mapping:

$${p}_{0}|0\rangle \langle 0|+{p}_{1}|1\rangle \langle 1|$$

$$|2,1\rangle |00\rangle \leftrightarrow |0,1\rangle |11\rangle $$

$${p}_{0}|00\rangle \langle 00|+{p}_{1}|11\rangle \langle 11|$$

$$|3,1\rangle |000\rangle \leftrightarrow |0,1\rangle |111\rangle \phantom{\rule{4pt}{0ex}}$$

The process is repeated. Figure 2 illustrates the situation after five cycles. After m cycles, the memory-ancilla spins are in the logical zero state and the logical 1 state with probabilities ${p}_{0}$ and ${p}_{1}$, where ${p}_{1}={e}^{-m\epsilon /kT}/(1+{e}^{-m\epsilon /kT})=1-{p}_{0}$. The upper limit for m is the number of spins, N, in the reservoir. In the limit of many repetitions and large N, the memory-ancilla system approaches a pure state, where each spin is in the logical zero state, $|0\rangle $. In this limit, all the ancillary spins used in the process have been returned to their initial state, $|0\rangle $, and the information represented by the initial state of the memory spin has been erased. The total spin cost, $\Delta {J}_{z}$, of the CNOT operations of the whole process approaches:
This sum is bounded by:
If we include the spin of the initial state, then the spin cost is given by $\Delta {J}_{z}^{\prime}$ where:
Similarly, the energy cost, $\Delta E$, of erasing the memory of the memory spin is given by:
which includes the initial energy of the memory spin.

$$\Delta {J}_{z}=\sum _{n=1}^{\infty}\frac{{e}^{-n\epsilon /kT}}{1+{e}^{-n\epsilon /kT}}\hslash \phantom{\rule{4pt}{0ex}}$$

$$\frac{kT\hslash}{\epsilon}ln(1+{e}^{-\epsilon /kT})<\Delta {J}_{z}<\frac{kT\hslash}{\epsilon}ln\left(2\right)\phantom{\rule{4pt}{0ex}}$$

$$\Delta {J}_{z}^{\prime}=\sum _{n=0}^{\infty}\frac{{e}^{-n\epsilon /kT}}{1+{e}^{-n\epsilon /kT}}\hslash =\Delta {J}_{z}+\frac{1}{2}\hslash \phantom{\rule{4pt}{0ex}}$$

$$\frac{kT\hslash}{\epsilon}ln\left(2\right)<\Delta {J}_{z}^{\prime}<\frac{kT\hslash}{\epsilon}ln(1+{e}^{\epsilon /kT})\phantom{\rule{4pt}{0ex}}$$

$$kTln\left(2\right)<\Delta E<kTln(1+{e}^{\epsilon /kT})\phantom{\rule{4pt}{0ex}}$$

**Figure 2.**Energy and angular momentum diagram for the erasure model with multiple costs. The energy and the z component of the angular momentum of each system has discrete levels in multiples of ε and ℏ, respectively. For convenience, the lowest angular momentum level is labelled with zero angular momentum. To erase the contents of the memory, the reservoir and memory-ancilla systems are allowed to equilibrate by exchanging energy and angular momentum; next, the gap between the states of the memory-ancilla system are increased, and then, the process is repeated.

It is interesting to relate the two costs via dimensionless quantities. Let $\alpha ={e}^{-\epsilon /kT}/(1+{e}^{-\epsilon /kT})$ be the probability that a reservoir spin is in logical 1 state. The average energy of the reservoir spin is $\alpha \epsilon $, and the average ${\widehat{J}}_{z}$ value is $(\alpha -\frac{1}{2})\hslash $. We can rewrite the lower bounds in terms of α by noting that:
so that:
The first of these is the energy cost due to Landauer [5]. In order to compare with our general result Equation (6), we need to add appropriately-scaled versions of these two costs as follows:
The right side is greater by a factor of two compared to that expected from Equation (6), due to the fact that in order for the internal spin states of the reservoir to absorb the entropy of the memory, two different physical variables are changed, and each is associated with a corresponding physical cost. The actual change in the entropy of the reservoir is just $ln2$, however. This example of information erasure is clearly not as efficient as it could be.

$$\begin{array}{ccc}\hfill \frac{kT}{\epsilon}& =& \frac{1}{ln\left(\frac{1-\alpha}{\alpha}\right)}\hfill \end{array}$$

$$\begin{array}{ccc}\hfill \Delta E& >& \frac{\epsilon}{ln\left(\frac{1-\alpha}{\alpha}\right)}ln\left(2\right)=kTln2\phantom{\rule{4pt}{0ex}}\hfill \end{array}$$

$$\begin{array}{ccc}\hfill \Delta {J}_{z}^{\prime}& >& \frac{\hslash}{ln\left(\frac{1-\alpha}{\alpha}\right)}ln\left(2\right)=\hslash \frac{kT}{\epsilon}ln2\phantom{\rule{4pt}{0ex}}\hfill \end{array}$$

$$\begin{array}{c}\hfill \beta \Delta E+\frac{\beta \epsilon}{\hslash}\Delta {J}_{z}^{\prime}>2ln2\phantom{\rule{4pt}{0ex}}\end{array}$$

## 5. Erasure without an Energy Cost

We have arrived at a strong link between the energy and angular momentum costs due to the relationship between the Zeeman splitting and the angular momentum. Yet, it is not necessary for this energy and angular momentum to be so linked. By modifying, or better, removing, the link between the the energy and angular momentum of the states, it is possible to change the costs of erasure. In particular, if the magnetic field is removed, the logical states, $|0\rangle $ and $|1\rangle $, become degenerate in energy, and, as Equation (11) indicates, the Hamiltonian term, ${\widehat{H}}_{in}$, vanishes. This means that only the expression on the right of Equation (12) holds, i.e.:
where β and γ are independent Lagrange multipliers. Examples of collections of spin particles that are described by density operators of this form are given by cold atomic gases confined in optical dipole force traps [13]. The internal states of the particles are described by the collective states, $|n,\nu \rangle $, exactly as before, except that now, they are associated with a probability, ${P}_{n,\nu}$, that is proportional to ${e}^{-n\gamma \hslash}$ in accordance with Equation (28). The normalised probability distribution is easily found to be:
We can relate the Lagrange multiplier, γ, to the average of the z component of the spin of the reservoir linebreak by:
where $\langle {\widehat{J}}_{z}\rangle =(\alpha -\frac{1}{2})N\hslash $ for $0\le \alpha \le 1$ as before. We also let the logical states of the memory and ancilla particle be energy degenerate. The erasure process proceeds as in the previous section and is represented in Figure 3. The only differences are that, due to the energy degeneracy of all the states involved, the memory-ancilla system is brought into equilibrium with the reservoir by the exchange of spin angular moment only, the cost of performing the CNOT operations is also in terms of spin angular momentum and not energy and the parametrisation of the probability distribution is in terms of γ instead of β. With this in mind, we find that in the limit of many repetitions and large N, the memory-ancilla system approaches a pure state, where each spin is in the logical zero state. The total angular momentum cost of the whole process is given by Equation (19) with $\beta =1/kT$ replaced with γ, i.e.,8]:
which, correspondingly, is bounded by:
If we include the spin of the initial state, then the cost is:
Hence:
with the value of γ given by Equation (30). This is the total cost of the erasure; there is no energy cost in this case. Clearly, the costs associated with erasure depend on the physical nature of the memory system and the reservoir and need not include an energy term, in contradistinction to the suggestion of Landauer and many others.

$$\widehat{\rho}=exp[-(\mu +\beta {\widehat{H}}_{ex}+\gamma {\widehat{J}}_{z})]$$

$${P}_{n,\nu}=\frac{{e}^{-n\gamma \hslash}}{{(1+{e}^{-\gamma \hslash})}^{N}}\phantom{\rule{4pt}{0ex}}$$

$$\gamma =\frac{1}{\hslash}ln\left(\frac{1-\alpha}{\alpha}\right)\phantom{\rule{4pt}{0ex}}$$

$$\Delta {J}_{z}=\sum _{n=1}^{\infty}\frac{{e}^{-n\gamma \hslash}}{1+{e}^{-n\gamma \hslash}}\hslash $$

$${\gamma}^{-1}ln(1+{e}^{-\gamma \hslash})<\Delta {J}_{z}<{\gamma}^{-1}ln\left(2\right)\phantom{\rule{4pt}{0ex}}$$

$$\Delta {J}_{z}^{\prime}=\sum _{n=0}^{\infty}\frac{{e}^{-n\gamma \hslash}}{1+{e}^{-n\gamma \hslash}}=\Delta {J}_{z}+\frac{1}{2}\hslash \phantom{\rule{4pt}{0ex}}$$

$${\gamma}^{-1}ln\left(2\right)<\Delta {J}_{z}^{\prime}<{\gamma}^{-1}ln(1+{e}^{\gamma \hslash})\phantom{\rule{4pt}{0ex}}$$

$$\Delta {J}_{z}^{\prime}>{\gamma}^{-1}ln\left(2\right)$$

**Figure 3.**Angular momentum diagram for the erasure model without an energy cost. The z component of angular momentum of both systems has discrete levels in multiples of ℏ. All states of both systems are degenerate in energy. The reservoir and memory-ancilla systems are allowed to equilibrate by exchanging angular momentum only; next, the gap between the states of the memory-ancilla system is increased, and then, the process is repeated.

This an important point and deserves some emphasis. Landauer’s principle provides a basis for claiming an equivalence between thermodynamic and information entropies. However, the equivalence rests on the erasure of information being necessarily associated with the dissipation of energy and with it, the increase of a corresponding amount of thermodynamic entropy. Our finding breaks this association. As a consequence, information entropy can no longer be claimed to be equivalent to thermodynamic entropy. Rather, how we might think of information entropy depends on the physical system used to store the information. That we need to consider a physical system to store information is, however, consistent with Landauer’s catchcry that “information is physical" [14].

We now examine the absence of the energy cost in more detail. Note that a cost of zero energy also occurs in Landauer’s erasure when the reservoir temperature is zero. The erasure in this case can be thought of as a “cooling” process; the reservoir simply absorbs the heat of the memory bit. One may wonder if the absence of an energy cost in the present case is associated with the reservoir being at zero temperature. However, a degenerate reservoir in thermal equilibrium (at any temperature) has a flat probability distribution across all states and, so, cannot act to cool the memory bit. We conclude that it is not appropriate to consider the $T=0$ cooling mechanism for the degenerate case. We should rather think of the cost of erasure not in terms of energy, but of the quantity defining the logic states.

Further insight into this issue is given by considering the non-degenerate model discussed in Section 4 in the degenerate limit, $\epsilon \to 0$, where the limit is approached in such a way that the $\Delta {J}_{z}$ cost given by Equation (26) is unchanged. The question we wish to address is whether the probability distribution in Equation (29) can be consistently associated with a temperature $T=0$. To answer it, we examine the behaviour of T as $\epsilon \to 0$, such that the probability distributions in Equations (14) and (29) are equal, i.e.,
and keeping the $\Delta {J}_{z}$ cost is fixed irrespective of the values of T and ε. This means we need to keep the parameter, γ, fixed, and thus from Equation (30), we need to keep α fixed. From Equation (24), we see that this requires $T\to 0$ linearly with ε. In principle this, could be achieved as follows. We could slowly reduce the Zeeman splitting ε and, at the same time, reduce the temperature of the reservoir as $T=\epsilon /k$. The angular momentum distribution of the reservoir, Equation (29), is unchanged in this process. One might then argue that Equation (29) is therefore associated with $T=0$. We could, however, subsequently raise the temperature of the reservoir to some finite value $T>0$ while keeping the energy levels of the reservoir spins degenerate (i.e., while maintaining a zero Zeeman splitting $\epsilon =0$). As the spins are degenerate, no energy is exchanged between the external motional degrees of freedom and the internal spin degrees of freedom. Thus, the probability distribution of the reservoir remains unchanged irrespective of the temperature of the reservoir. We could equally associate the distribution Equation (29) with any arbitrary temperature. This means that Equation (29) cannot be consistently associated with any single temperature. The $T=0$ cooling mechanism, therefore, is not operating here. We are thus led to the conclusion that energy (and, thus, work) does not play a role in erasure in the degenerate case.

$${P}_{n,\nu}=\frac{{e}^{-n\epsilon /kT}}{{(1+{e}^{-n\epsilon /kT})}^{N}}=\frac{{e}^{-n\gamma \hslash}}{{(1+{e}^{-n\gamma \hslash})}^{N}}\phantom{\rule{4pt}{0ex}}$$

The distribution Equation (29), moreover, represents a state of maximum entropy for a fixed z component of angular momentum. Its ability to absorb the entropy of the memory spin is due to the maximization of entropy of the combined reservoir-memory-ancilla system subject to the conservation of the z component of angular momentum. The conservation of energy is trivially satisfied here due to the degeneracy and, as such, the conservation of energy does not influence the equilibrium states. Again, we are led to the conclusion that the $T=0$ cooling mechanism cannot operate in the degenerate case.

Finally, one may wonder whether the effect of a small residual magnetic field that induces a correspondingly small energy splitting ε between the spin states would void our assertion that Equation (35) represents the total cost of the erasure. Indeed, Equation (25) indicates that an energy splitting of ε, however small, leads to an energy cost of $kTln2$. However, any known magnetic field can be eliminated systematically, in principle. We can assume, therefore, that any residual magnetic field is small and able to be estimated only. It will presumably vary from the site of one spin to another. Consider the local residual field at the site of an arbitrary spin given by position vector, $\mathbf{r}$. Any component in the $x-y$ plane will induce the precession of the spin orientation, which will lead to inefficiencies and, thus, a higher spin cost or incomplete erasure. It will not, however, fundamentally change the character of the cost of erasure. In contrast, any component in the z direction will produce a Zeeman splitting in energy, which we shall represent as $\epsilon \left(\mathbf{r}\right)$ for a spin located at $\mathbf{r}$. If the spins in the memory-ancilla system experience this splitting, the CNOT operation will incur an energy cost in addition to the angular momentum cost in Equation (35). The total energy cost of the CNOT operations will be given by:
where ${\mathbf{r}}_{n}$ is the position of the ancilla spin, which is the target of the n-th CNOT operation. Let $\epsilon \left({\mathbf{r}}_{n}\right)$ for any given n be a stochastic variable with zero mean, i.e., $\overline{\epsilon \left({\mathbf{r}}_{n}\right)}=0$, where the overlining represents the ensemble average. It then follows that:
On average, there is no net energy cost: the erasure process is as likely to result in an energy gain as an energy loss. The fundamental point to be made here is that Landauer’s cost of $kTln2$ per bit is an average cost, and here, the average energy cost is zero. We conclude that a small, unpredictable, residual magnetic field does not void our result that, in principle, the cost of erasure is given in terms of angular momentum and not energy.

$$\Delta E=\sum _{n=1}^{\infty}\frac{{e}^{-n\gamma \hslash}}{1+{e}^{-n\gamma \hslash}}\epsilon \left({\mathbf{r}}_{n}\right)$$

$$\overline{\Delta E}=0\phantom{\rule{4pt}{0ex}}$$

## 6. Discussion

We are now able to state general results. To this end, we write the costs of erasure in terms of a dimensionless parameter. A convenient measure of the state of the reservoir for this is given by the number, n, of particles that are in the logical $|1\rangle $ state. We can call this parameter the Hamming weight of the reservoir. To keep our analysis general, we shall refer to the particles in the reservoir as qubits [11,12] rather than spins. We have already seen that the increase in the entropy of the reservoir, $\Delta S$, for the erasure of one bit is bounded below by $ln\left(2\right)$, i.e.:
The actual change, $\Delta S$, due to a Hamming cost of $\Delta n$ can be approximated to first order using:
Combining these two results yields:
For a reservoir comprising N qubits with an average Hamming weight of $n=\alpha N$, the entropy is:
and so, on performing the derivative in Equation (41), we find the lower bound on the Hamming cost:
which agrees with Equation (35) for the value of γ given by Equation (30). Our erasure scheme is therefore optimal in this sense.

$$\Delta S>ln\left(2\right)\phantom{\rule{4pt}{0ex}}$$

$$\Delta S\approx \frac{dS}{dn}\Delta n\phantom{\rule{4pt}{0ex}}$$

$$\Delta n>ln\left(2\right){\left(\frac{dS}{dn}\right)}^{-1}\phantom{\rule{4pt}{0ex}}$$

$$S=-N\left[\right(1-\alpha )ln(1-\alpha )+\alpha ln(\alpha \left)\right]$$

$$\Delta n>\frac{ln\left(2\right)}{ln\left(\frac{1-\alpha}{\alpha}\right)}$$

Similarly, if the logical states, $|0\rangle $, $|1\rangle $, of the qubits are eigenstates of a physical variable ${\widehat{V}}_{k}$ with corresponding eigenvalues, zero, ${v}_{k}$, then the average value of ${\widehat{V}}_{k}$ for the reservoir is $\langle {\widehat{V}}_{k}\rangle =\alpha {v}_{k}N$, and the entropic cost can be reexpressed as:
Using Equations (39) and (42), we then find the corresponding cost for arbitrary erasure schemes:
The total cost of erasure is therefore:
for M physical variables. The right side is M times the right side of Equation (6), due to the fact that here, the expectation values, $\langle {\widehat{V}}_{k}\rangle $, vary in proportion to α, whereas in Equation (6), the expectation values, $\langle {\widehat{V}}_{k}\rangle $, are assumed to vary independently. Information erasure is clearly inefficient when more than one physical variable is associated with the degree of freedom of the reservoir that absorbs the entropy of the memory.

$$\Delta S\approx \frac{dS}{d\langle {\widehat{V}}_{k}\rangle}\Delta \langle {\widehat{V}}_{k}\rangle \phantom{\rule{4pt}{0ex}}$$

$$\Delta \langle {\widehat{V}}_{k}\rangle >\frac{{v}_{k}ln\left(2\right)}{ln\left(\frac{1-\alpha}{\alpha}\right)}\phantom{\rule{4pt}{0ex}}$$

$$ln\left(\frac{1-\alpha}{\alpha}\right)\sum _{k=1}^{M}\frac{1}{{v}_{k}}\Delta {V}_{k}>Mln\left(2\right)$$

These results open up a range of topics for investigation. For example, the operation of Carnot ‘heat’ engines operating with angular momentum reservoirs and generating angular momentum ‘work’ (or some other resource) instead of mechanical work. Another possibility is the use of a combination of different types of reservoir. For example, a Maxwell’s demon can operate on a single thermal reservoir to extract work from the reservoir. However, there is an associated unmitigated cost in that the memory of the demon has to be erased. Bennett’s argument [15] is to use a thermal reservoir following Landauer’s erasure principle to do this, so that the extracted work is (more than) balanced by the cost of erasure. However, given the forgoing, we now know that the memory of the demon can be erased using an entirely different reservoir at no cost in energy. The cost instead could be in terms of angular momentum, say [8]. Fundamentally, however, the irreducible cost is information theoretic, i.e., a cost in terms of Shannon entropy. This suggests the operation of generalized Carnot cycles between different kinds of reservoirs, the fundamental principle of operation being a movement of entropy from one reservoir to another. These issues will be explored elsewhere.

## Acknowledgments

S.M.B. thanks the Royal Society and the Wolfson Foundation for financial support. J.A.V. thanks Terry Rudolph, David Jennings, Sania Jevtic and Matteo Lostaglio for useful discussions and gratefully acknowledges financial support from the Australian Research Council through the Centre of Excellence Grant No. CE110001027.

## Conflicts of Interest

The authors declare no conflict of interest.

## References

- Maxwell, J.C. Theory of Heat; Longmans, Green, and Co.: London, UK, 1871. [Google Scholar]
- Maxwell’s Demon; Harvey, H.S.; Lex, A.F. (Eds.) Adam Hilger: Bristol, UK, 1990.
- Maxwell’s Demon 2; Harvey, H.S.; Lex, A.F. (Eds.) Institute of Physics Publishing: Bristol, UK, 2003.
- Szilard, L. Über die Entropieverminderung in eigem thermodynamischen system bei eingriffen intelligenter wesen. Zeits. Phys.
**1929**, 53, 840–856. [Google Scholar] [CrossRef] - Landauer, R. Irreversibility and heat generation in the computing process. IBM J. Res. Dev.
**1961**, 5, 183–191. [Google Scholar] [CrossRef] - Plenio, M.; Vitelli, V. The physics of forgetting: Landauer’s erasure principle and information theory. Contemp. Phys.
**2001**, 42, 25–60. [Google Scholar] [CrossRef] - Maroney, O.J.E. Generalizing Landauer’s principle. Phys. Rev. E
**2009**, 79, 031105. [Google Scholar] [CrossRef] [PubMed] - Vaccaro, J.A.; Barnett, S.M. Information erasure without an energy cost. Proc. R. Soc. A
**2011**, 467, 1770–1778. [Google Scholar] [CrossRef] - Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev.
**1957**, 106, 620–630. [Google Scholar] [CrossRef] - Jaynes, E.T. Probability Theory: The Logic of Science; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar]
- Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
- Barnett, S.M. Quantum Information; Oxford University Press: Oxford, UK, 2009. [Google Scholar]
- Leanhardt, A.E.; Shin, Y.; Chikkatur, A.P.; Kielpinski, D.; Ketterle, W.; Pritchard, D.E. Bose-Einstein condensates near a microfabricated surface. Phys. Rev. Lett.
**2003**, 90, 100404. [Google Scholar] [CrossRef] [PubMed] - Landauer, R. Information is Physical. Phys. Today
**1991**, 44, 23–29. [Google Scholar] [CrossRef] - Bennett, C.H. The thermodynamics of computation—A review. Int. J. Theor. Phys.
**1982**, 21, 905–940. [Google Scholar] [CrossRef]

© 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).