Relative Entropy, Interaction Energy and the Nature of Dissipation

Gaveau, Bernard; Granger, Léo; Moreau, Michel; Schulman, Lawrence S.

doi:10.3390/e16063173

Open AccessArticle

Relative Entropy, Interaction Energy and the Nature of Dissipation

by

Bernard Gaveau

¹,

Léo Granger

²,

Michel Moreau

³ and

Lawrence S. Schulman

^4,*

¹

Laboratoire analyse et physique mathématique, 14 avenue Félix Faure, 75015 Paris, France

²

Max Planck Institute for the Physics of Complex Systems, Nöthnitzer Str. 38, D-01187 Dresden, Germany

³

Université Pierre et Marie Curie, LPTMC, case 121, 4 pl. Jussieu, 75252 Paris cedex 05, France

⁴

Physics Department, Clarkson University, Potsdam, New York 13699-5820, USA

^*

Author to whom correspondence should be addressed.

Entropy 2014, 16(6), 3173-3206; https://doi.org/10.3390/e16063173

Submission received: 10 February 2014 / Revised: 20 April 2014 / Accepted: 23 May 2014 / Published: 6 June 2014

(This article belongs to the Special Issue Complex Systems)

Download

Browse Figures

Versions Notes

Abstract

: Many thermodynamic relations involve inequalities, with equality if a process does not involve dissipation. In this article we provide equalities in which the dissipative contribution is shown to involve the relative entropy (a.k.a. Kullback-Leibler divergence). The processes considered are general time evolutions both in classical and quantum mechanics, and the initial state is sometimes thermal, sometimes partially so. By calculating a transport coefficient we show that indeed—at least in this case—the source of dissipation in that coefficient is the relative entropy.

Keywords:

relative entropy; dissipation; thermodynamic inequalities; interaction energy

1. Introduction

The distinction between heat and work, between the uncontrollable flow of energy of molecular processes and the controllable flow of energy usable by an agent, underlies all of thermodynamics, and is implicitly incorporated in the equation dE = đW + đQ. This distinction is defined by human subjectivity and by the human technological ability to extract work from the flow of energy of microscopic processes.

On the other hand, assuming that the evolution is given by the fundamental laws of dynamics, classical or quantum, one must represent the evolution of a system at a fundamental microscopic level by the action of a unitary operator on a quantum state (density operator) in quantum situations or by the action of a symplectic operator on a classical state (probability distribution) in classical situations. The state of the system evolves according to the Heisenberg equation of motion or according to the Liouville equation respectively. An immediate consequence is that the entropy of the exact microscopic state of a system that is isolated or is coupled only to an external source of work stays constant during the evolution. In particular, the microscopic state cannot tend to an equilibrium state. Thus, the information content of the exact microscopic state stays constant during the evolution. The problem is that in practice it is impossible both to define the microscopic state and to follow its exact evolution. The definition of the state and the representation of the exact evolution using unitary or symplectic dynamics are thus untenable idealizations. Nevertheless, and this constitutes a paradox, these idealizations cannot be ignored or dismissed, because it is precisely the difference between the exact evolution and its standard approximations which explains and can be used to predict dissipative effects, both of energy and information.

In thermodynamics, in kinetic theories or in stochastic dynamics, the exact microscopic state of a system is replaced by an approximate or “coarse-grained” state and the corresponding exact evolution is replaced by an evolution of the corresponding coarse-grained state (or, in standard thermodynamics by a quasi-static or formal evolution). There are two main reasons for using these approximate states and evolutions:

(1): As discussed, it is impossible—even in principle—to specify the exact state of a large system and follow its evolution. An attempt at extremely high precision would modify the system, even in a classical context (related to Maxwell’s demon). And it is even worse for quantum systems. Moreover, this would be useless.
(2): Only slow variables (on the time scales of microscopic processes) can be measured with confidence and stability. As a result, an observer can only describe the system as a state of minimal information (or maximal entropy) compatible with the observed slow variables [1,2].

The coarse-grained state is thus a statistical data structure which summarizes at a given moment the knowledge of the observer. The evolution of this coarse-grained state merely reflects the evolution of the knowledge of the observer about the system. The observer cannot follow the microscopic processes, but only the slow variables which can be measured and used, and as a consequence there is a loss of information about the details of the microscopic processes; in the traditional language of thermodynamics, entropy increases or is produced. The observer, reflecting a particular state of knowledge (or more precisely, a lack of knowledge), describes the state of the system as a state of minimal information (or maximal entropy) compatible with observation. Thus, entropy is not a kind of substance flowing from one part of a system to another part or mysteriously produced internally by the physical system, as is often suggested by many texts of thermodynamics or statistical physics: it is only the observer’s partial inability to relate the exact microscopic theory to a reduced macroscopic description in order to use the system as a source of useful work or information. This is what is measured as an increase of entropy or by entropy production. The macroscopic state is the result of a statistical inference (specifically, maximum entropy) for the given, observed, macroscopic variables (which are the slow variables of the system [1,2]. This point of view on the nature of entropy was emphasized by Jaynes, who observed [3,4], “The expression ‘irreversible process’ represents a semantic confusion…”

The difference between the exact evolution of the microscopic ideal state and the evolution of its coarse-grained approximation is what is called “dissipation,” both of information content and of energy or other “useful” variables. Standard thermodynamics uses the maximal coarse graining of equilibrium, and the idealized evolution is not modeled explicitly, so dissipation can be taken into account only by inequalities. For more detailed coarse-graining (as in hydrodynamics, Boltzmann’s equation, kinetic theories or stochastic thermodynamics) one can obtain an estimate for the dissipative effects, for example, by the calculation of transport coefficients.

In this article, our main purpose is to prove that the relative entropy term between the initial and final states measures dissipation. In our approach, “dissipation” is defined as the difference between the maximal work that the physicist thinks could be extracted from a system when using the thermodynamic or quasi-static theory to make predictions, and the work that is actually extracted because the system is evolving according to the exact dynamics, classical or quantum, independently of what the physicist thinks (see also our use of relative entropy in [5] and [6], where the context was more limited [7]). Moreover, in the present context we find that the relative entropy terms are proportional to the square of the interaction energy. In all standard theories, dissipative effects are measured by the transport coefficients of energy or momentum or concentration of chemical species. Thus, we need prove that the relative entropy allows the calculation of transport coefficients. Indeed, we show below that the relative entropy terms provide the calculation of the thermal conductivity between two general quantum systems, initially at thermal equilibrium at different temperatures. This is a kind of Fourier law, except that we do not suppose a linear regime, so that the temperature dependence is more complicated than simple linearity. Moreover, our exact calculation of the transport coefficient shows that it is indeed proportional to the square of the interaction energy, which confirms that for vanishingly small interaction energy no transfer occurs in finite time. In other words, no power or finite rate of information flow can be extracted from a system if one does not have at the same time dissipative effects.

In the following material, we first consider a system comprised of two components, A and B. We make no specific hypotheses on the size of the systems, and we do not introduce thermal reservoirs. Thus, the identities we derive are in effect exact tautologies. In Sections 3 to 5, we present several identities. We here mention two examples: (1) a derivation of the Brillouin-Landauer estimate of the energy necessary to change the information content of a system; (2) an estimate of the work that can be extracted from a two-part system in interaction with an external source of work in terms of non-equilibrium free energies and relative entropy of the state before and after the evolution. Similar identities were also obtained recently by Esposito et al. [8], Reeb and Wolf [9], and Takara et al. [10] Continuing, we study the effect of an external agent on an (otherwise) isolated system; again we obtain an identity relating the work to the difference of internal (not the free) energies along with the usual dissipative terms. Then, we derive the relation between the relative entropy and the heat conductivity in a quantum system. Finally, we define a general notion of coarse-graining or reduced description, which includes the usual notions. In some of our examples one or both systems are initially at thermal equilibrium, but only the initial temperatures appear explicitly in the definition of the non-equilibrium free energies. The latter are no longer state functions because they depend explicitly on the initial temperature and not on the actual effective temperature. No coarse graining by an effective final or intermediate thermal state is used, and neither system is a reservoir.

2. Notations and Basic Identities

2.1. States and Entropy

Many results will be valid both in classical and quantum contexts. We denote by ρ either a probability distribution function over a classical phase space, or a density matrix in the quantum case. We denote by Tr either the integral on the phase space, or the trace operation. Thus ρ is a positive quantity and satisfies Tr ρ = 1. The entropy of ρ is

S (ρ) = - Tr ρ \log ρ .

(1)

It is defined up to a multiplicative constant. (Classically ρ should be divided by a dimensional constant to render it dimensionless.)

The relative entropy (see [11]) is defined by

S (ρ | ρ^{'}) = Tr (ρ (\log ρ - \log ρ^{'})),

(2)

where ρ and ρ′ are states.

One has

S (ρ | ρ^{'}) \geq 0,

(3)

and S(ρ|ρ′) does not depend on the units in phase space. Moreover S(ρ|ρ′) = 0 if and only if ρ = ρ′.

Writing S(ρ|ρ′) as − Tr ρ log ρ′ − (− Tr ρ log ρ), suggests the following interpretation: Suppose the true state is ρ, but the observer thinks that the state is ρ′. S(ρ|ρ′) is then the true average of the missing information minus the estimate of the missing information.

2.2. The Basic Identity

If we add and subtract S(ρ′) in the second member of Equation (2), we obtain the basic identity

S (ρ | ρ') = S (ρ') - S (ρ) - Tr ((ρ - ρ') \log ρ') .

(4)

Most of our results follow from this identity.

When ρ′ is a thermal state at (inverse) temperature β [12],

ρ^{'} = ρ_{β} = \frac{e^{- β H}}{Z (β, H)},

(5)

where

Z (β, H) = Tr e^{- β H}

(6)

is the partition function. With ρ′ = ρ_β, the identity (4) reduces to

S (ρ | ρ_{β}) = S (ρ_{β}) - S (ρ) + β Tr ((ρ - ρ_{β}) H) .

(7)

Here H is a given function or operator.

Defining the free energy of state ρ by

F (ρ, H) = Tr (ρ H) - \frac{1}{β} S (ρ),

(8)

we obtain

S (ρ | ρ_{β}) = β (F (ρ, H) - F (ρ_{β}, H)),

(9)

and F (ρ_β, H) is the equilibrium free energy related to the partition function by

Z (β, H) = \exp (- β F (ρ_{β}, H)) .

(10)

Equation (9) is important for applications, because its right hand side can be related to energy dissipation (up to the factor β), which gives a clear physical meaning to the relative entropy (see Section 3).

2.3. Evolution Operators and Entropy

We assume that the system (classical or quantum) evolves under the action of an arbitrary operator U (symplectic or unitary). If ρ is a state, we denote by ρ^(U) the new state after the evolution U.

Entropy is conserved by the evolution

S (ρ^{(U)}) = S (ρ) .

(11)

For example, in the quantum case, we have ρ^(U) = Uρ U^†, where U is the propagator: $i \frac{d U}{d t} = [H, U]$ , U|_t₌₀ = 1, with H a possibly time-dependent Hamiltonian.

If ϕ(ρ) is a functional of ρ which evolves with U, and ϕ(ρ^(U)) is the functional after evolution of ρ, we denote the variation of ϕ(ρ) after the evolution U in the following way

δ^{(U)} (ϕ (ρ)) = ϕ (ρ^{(U)}) - ϕ (ρ) .

(12)

Remark 1: Many of our results are valid for a general evolution U which is not symplectic or unitary, for example stochastic evolution.

3. Two Systems in Interaction

A basic procedure in thermodynamics is to consider the evolution and properties of an otherwise isolated two-part system. Although it is often the case that the overall system conserves energy, for the subsystems more general behavior is often seen.

3.1. Hypotheses

We assume that the system is formed of two parts, A and B, in interaction. At time-0, the state is a product state

ρ_{0} = ρ_{A, 0} \otimes ρ_{B, 0} .

(13)

After the evolution U, the state is ρ^(U) and we denote by $ρ_{A}^{(U)}$ and $ρ_{B}^{(U)}$ its marginals,

ρ_{A}^{(U)} = {Tr}_{B} ρ^{(U)} and ρ_{B}^{(U)} = {Tr}_{A} ρ^{(U)},

(14)

which are then states on A and B respectively. We also assume that there is a quantity H that is conserved by the evolution and H has the form

H = H_{A} + H_{B} + V_{A B},

(15)

where H_A and H_B are quantities depending only on A and B respectively and V_AB is an interaction term. Then, if we denote

E (ρ) = Tr (ρ H) = E_{A} (ρ) + E_{B} (ρ) + E_{V} (ρ)

(16)

E_{A} (ρ) = Tr (ρ H_{A}) = Tr (ρ_{A} H_{A})

(17)

E_{B} (ρ) = Tr (ρ H_{B}) = Tr (ρ_{B} H_{B})

(18)

E_{V} (ρ) = Tr (ρ V_{A B}),

(19)

our hypothesis is that

δ^{(U (t))} E (ρ) \equiv E (ρ^{(U)}) - E (ρ_{0}) = 0 .

(20)

In particular this is the case if U is time-evolution with Hamiltonian H.

Remark 2: For this situation, certain results are also valid without the assumption that the evolution U preserves the energy H.

If ρ is a state corresponding to a system formed of two parts, A and B, and ρ_A and ρ_B are its marginals (as in Equation (14)), then the relative entropy S (ρ|ρ_A ⊗ ρ_B) is the same as the mutual information of the associated distributions. It can be interpreted as the amount of information in ρ that comes from the fact that A and B are in interaction (see [11]). This quantity will appear in many of our relations below (e.g., Equation (28)) as part of the dissipation.

3.2. Relation between a State and Its Marginals

Assuming Equation (13) (that the initial state is a product state), one has the identity

δ^{(U (t))} S (ρ_{A}) + δ^{(U (t))} S (ρ_{B}) = S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) .

(21)

Indeed, using the conservation of the entropy of ρ during the evolution U,

This is because one evidently has $- Tr (ρ^{(U)} \log ρ_{A}^{(U)}) = - Tr (ρ_{A}^{(U)} \log ρ_{A}^{(U)})$ . Note that Equation (21) requires that U preserve the entropy. One has also the well-known inequality

S (ρ^{(U)}) \leq S (ρ_{A}^{(U)}) + S (ρ_{B}^{(U)}),

(22)

which is a particular case of

S (ρ) \leq S (ρ_{A}) + S (ρ_{B})

(23)

for any state ρ.

Note that the stronger result, Equation (21), is obtained by retaining the relative entropy term in this equation. The same remark will apply in most of the following results.

3.3. The Case Where A is Initially in a Thermal State

At time 0 we take ρ_A_,0 to be thermal with temperature β_A,

ρ_{A, 0} = ρ_{A, β_{A}} = \frac{e^{- β_{A} H_{A}}}{Z_{A} (β_{A})},

(24)

where Z_A(β_A) = Z_A(β_A, H_A) is the partition function, (6). From Equation (7) with $ρ \to ρ_{A}^{(U)}$ and $ρ_{β} \to ρ_{A, β_{A}}$ , we deduce (note that this requires that H_A be independent of time)

δ^{(U (t))} S (ρ_{A}) - β_{A} δ^{(U (t))} E_{A} (ρ_{A}) = - S (ρ_{A}^{(U)} | ρ_{A, β_{A}}),

(25)

and as a consequence

δ^{(U (t))} S (ρ_{A}) - β_{A} δ^{(U (t))} E_{A} (ρ_{A}) \leq 0.

(26)

The last two equations do not require that U be a unitary evolution conserving the entropy, nor that it conserve the energy.

Remark 3: This inequality can be found in [13] as an unnumbered equation. Its consequences were not deduced in that reference.

Remark4: Note that it is the initial temperature that appears in Equations (25) and (26). Moreover, $ρ_{A}^{(U)}$ is not in general an equilibrium state.

Suppose that B starts in an arbitrary initial state ρ_B_,0, while A begins in the thermal state $ρ_{A, β_{A}}$ . Combining Equations (21) and (25), we obtain

β_{A} δ^{(U (t))} E_{A} (ρ) + δ^{^{(U (t))}} S (ρ_{B}) = S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) + S (ρ_{A}^{(U)} | ρ_{A, β_{A}}) .

(27)

The last equation requires that U preserve entropy, since that feature is used in the derivation of Equation (21). It also remains valid if the Hamiltonian of B, H_B, depends on an external parameter varying with time, so that B receives work from an external agent. This is because the entropy-preserving property only depends on U being unitary (or symplectic). On the other hand, H_A should be time independent (see the parenthetical remark before Equation (25)). Then if U conserves energy

β_{A} (δ^{(U (t))} E_{B} (ρ) + δ^{(U (t))} E_{V} (ρ)) = δ^{(U (t))} S (ρ_{B}) - [S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) + S (ρ_{A}^{(U)} | ρ_{A, β_{A}})] .

(28)

These relations imply the following inequalities:

(1)

If U preserves entropy, even if H_B depends on an external parameter varying with time

β_{A} δ^{(U (t))} E_{A} (ρ) \geq - δ^{(U (t))} S (ρ_{B}) .

(29)

(2)

If U conserves entropy and the total energy, one has

β_{A} (δ^{(U (t))} E_{B} (ρ) + δ^{(U (t))} E_{V} (ρ)) \leq δ^{(U (t))} S (ρ_{B}),

(30)

with the following interpretations. Suppose U conserves the entropy; then we couple a system B (initially in an arbitrary state ρ_B_,0) to system A (initially in thermal equilibrium) and that we want to lower the entropy of B so that δ^(U(t)) S(ρ_B) ≤ 0. Then, the energy of A must increase by at least

δ^{(U (t))} E_{A} (ρ) \geq \frac{1}{β_{A}} | δ^{(U (t))} S (ρ_{B}) |

(31)

even if B receives work from an external source (so that H_B depends on an external parameter). Moreover, if the total energy is conserved, the sum of the energy of B and the coupling energy must decrease by at least:

δ^{(U (t))} E_{B} (ρ) + δ^{(U (t))} E_{V} (ρ) \leq \frac{1}{β_{A}} δ^{(U (t))} S (ρ_{B}) < 0.

(32)

Thus lowering the entropy of a system B, coupled to a system initially at equilibrium, costs transfers of energy from B to A or to the interaction energy; thus the sum of B’s energy and the interaction energy must decrease, but B’s energy alone need not decrease. This is a result analogous to those of Brillouin [14] and Landauer [15] (reprinted in Leff and Rex [16]), even if system B receives work from an external source. But note again that only the temperature β_A appears. This is the initial temperature at the beginning of the evolution U, so that system A is not necessarily a thermal bath, because its temperature may vary during the evolution U.

3.4. The Case of Equality in Equation (31)

It is important to study the case where the previous inequalities are changed into equalities, because this occurs if and only if strong conditions are verified: expressing these conditions is one of the advantages of our approach.

Equation (31) was derived under the hypothesis δ^(U) S(ρ_B) < 0. Then by Equation (27) one has $S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) = S (ρ_{A}^{(U)} | ρ_{A, β_{A}}) = 0$ . This implies $ρ_{A}^{(U)} = ρ_{A, β_{A}}$ and $ρ^{(U)} = ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}$ . Because $ρ_{A}^{(U)} = ρ_{A, β_{A}}, δ^{(U)} E_{A} (ρ_{A}) = 0$ , so by Equation (31), δ^(U) S(ρ_B) = 0. But we could also derive δ^(U) S(ρ_B) = 0 from Equation (21), because δ^(U(t)) S(ρ_A) = 0 and $S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) = 0$ .

3.5. Both Systems A and B are at Equilibrium

Assume that A and B are initially at thermal equilibrium at different temperatures. Then, one has

(1)

For a general evolution

δ^{(U (t))} S (ρ_{A}) - β_{A} δ^{(U (t))} E_{A} (ρ_{A}) = - S (ρ_{A}^{(U)} | ρ_{A, β_{A}}),

(33)

δ^{(U (t))} S (ρ_{B}) - β_{B} δ^{(U (t))} E_{B} (ρ_{B}) = - S (ρ_{B}^{(U)} | ρ_{B, β_{B}}) .

(34)

(1)

If U conserves entropy

δ^{(U (t))} S (ρ_{A}) + δ^{(U (t))} S (ρ_{B}) = S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) .

(35)

(1)

If U conserves energy

δ^{(U (t))} E_{A} (ρ_{A}) + δ^{(U (t))} E_{B} (ρ_{B}) + δ^{(U (t))} E_{V} (ρ) = 0.

(36)

Then, we conclude

(A)

If U conserves entropy: Combining Equations (33), (34), and (35) yields

β_{A} δ^{(U (t))} E_{A} (ρ_{A}) + β_{B} δ^{(U (t))} E_{B} (ρ_{B}) = S (ρ^{(U)} | ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}),

(37)

It is easy to check directly that

S (ρ^{(U)} | ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}) = S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) + S (ρ_{A}^{(U)} | ρ_{A, β_{A}}) + S (ρ_{B}^{(U)} | ρ_{B, β_{B}}) .

Thus

(38)

This last identity implies the Clausius-like inequality

0 \leq δ^{(U (t))} S (ρ_{A}) + δ^{(U (t))} S (ρ_{B}) \leq β_{A} δ^{(U (t))} E_{A} (ρ_{A}) + β_{B} δ^{(U (t))} E_{B} (ρ_{B}) .

(39)

(B)

For a general evolution U: Combining Equations (33) and (34)

δ^{(U (t))} E_{A} (ρ_{A}) + δ^{(U (t))} E_{B} (ρ_{B}) = T_{A} [δ^{(U (t))} S (ρ_{A}) + S (ρ_{A}^{(U)} | ρ_{A, β_{A}})] + T_{B} [δ^{(U (t))} S (ρ_{B}) + S (ρ_{B}^{(U)} | ρ_{B, β_{B}})],

(40)

and thus

δ^{(U (t))} E_{A} (ρ_{A}) + δ^{(U (t))} E_{B} (ρ_{B}) \geq T_{A} δ^{(U (t))} S (ρ_{A}) + T_{B} δ^{(U (t))} S (ρ_{B}) .

(41)

This may be viewed as an inequality for free energies of A and B at the respective temperatures T_A and T_B. Note again that during the time evolution neither A nor B need remain in thermal states.

3.6. Case of Equality in (39) (2nd) and (41)

(A)

U conserves entropy. Equality in Equation (39) implies immediately that $ρ_{A}^{(U)} = ρ_{A, β_{A}}$ and $ρ_{B}^{(U)} = ρ_{B, β_{B}}$ , in which case the energy of A and the energy of B have not changed and δ^(U(t)) S(ρ_A) = δ^(U(t)) S(ρ_B) = 0. From the first equality in Equation (38) one has

ρ^{(U)} = ρ_{A}^{(U)} \otimes ρ_{B}^{(U)} = ρ_{A, β_{A}} \otimes ρ_{B, β_{B}},

(42)

and one deduces that the state ρ has not changed.

(B)

General evolution U. If one has equality in Equation (41), it follows from Equation (40) the same results as above: the state ρ has not changed.

3.7. Interaction Energy and Relative Entropy

It is often assumed that the interaction energy between the parts of the complete system can be neglected but, obviously, if this were exactly true the subsystems would evolve independently. Of course, an interaction can be small but nevertheless have significant impact when it persists for long times. However, there are cases where even for short times the interaction cannot be neglected. Assume then that U conserves entropy and energy. Divide Equations (33) and (34) by β_B and add; then use the conservation of energy Equation (36) to eliminate δ^(U) E_B(ρ) and deduce after some calculations

- δ^{(U (t))} E_{V} (ρ) = (1 - \frac{β_{A}}{β_{B}}) δ^{(U (t))} E_{A} (ρ_{A}) + T_{B} S (ρ^{(U)} | ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}),

(43)

so that

- δ^{(U (t))} E_{V} (ρ) \geq (1 - \frac{β_{A}}{β_{B}}) δ^{(U (t))} E_{A} (ρ_{A}) .

(44)

In case of equality in Equation (44), one deduces that $ρ^{(U)} = ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}$ so that the state has not changed and δ^(U) E_A(ρ_A) = δ^(U) E_V (ρ) = 0. Moreover if δ^(U) E_A(ρ_A) is positive and T_A is larger than T_B, the interaction energy V is necessarily not zero and δ^(U) E_V (ρ) is negative.

Finally, if one could neglect the interaction energy, Equation (44) implies that energy flows from the hot to the cold system.

3.8. The Case β_A = β_B

Again assume that U conserves both entropy and energy. From Equation (43) and the conservation of energy, one deduces

- δ^{(U (t))} E_{V} (ρ) = δ^{(U (t))} E_{A} (ρ_{A}) + δ^{(U (t))} E_{B} (ρ_{B}) = \frac{1}{β} S (ρ^{(U)} | ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}),

(45)

so that δ^(U) E_V (ρ) ≤ 0. Thus when A and B are initially at thermal equilibrium at the same temperature, the sum of the energies of A and B can only increase at the expense of the interaction energy [17].

4. Two Systems in Interaction With a Work Source

The problem of converting heat into work, first treated by Carnot, was at the origin of classical thermodynamics. Here, to address this issue, we explicitly introduce a work source interacting with two systems A and B, before focusing in Section 5 on the interaction of one system with a work source.

4.1. Hypotheses

We consider two systems A and B in interaction, with system A coupled to a work source. We represent the action of the work source by parameters, collectively denoted by λ, so that H_A = H_A(λ). Thus we assume that H_B and V are independent of λ. The action of the work source is given by an evolution of the parameters λ(t) imposed by an external agent. The total system A+B has a unitary or symplectic evolution U(t) depending explicitly on time-t. Clearly, U(t) conserves entropy but does not conserve energy, and instead one has the identity

δ^{(U (t))} E_{A} (ρ) + δ^{(U (t))} E_{B} (ρ) + δ^{(U (t))} E_{V} (ρ) + δ^{(U (t))} W = 0,

(46)

with the following notation

δ^{(U (t))} E_{B} (ρ) = Tr ((ρ^{(U)} - ρ_{0}) H_{B})

(47)

δ^{(U (t))} E_{V} (ρ) = Tr ((ρ^{(U)} - ρ_{0}) H_{V})

(48)

δ^{(U (t))} E_{A} (ρ) = Tr (ρ^{(U)} H_{A} (λ^{(U)}) - ρ_{0} H_{A} (λ_{0})) .

(49)

Here, λ₀ is the initial value of the parameter λ and λ^(U) is its final value at the end of the evolution U, this being an abbreviation for U(t), t being the final time. Note that Equation (49) extends the definition given near Equation (12). Such an extension is needed because we now allow changes in the Hamiltonian, represented by the additional variable λ. Equation (46) defines the work δ^(U)W, which is taken to be positive if the source receives work from the system A + B. (Note that this is opposite to the usual convention which was implicit in the opening paragraph of this paper.)

We assume that initially A and B are in independent thermal states, but A depends on the work source parameter λ. The complete initial state at time 0 is thus

ρ_{0} = ρ_{A, β_{A,} λ_{0}} \otimes ρ_{B, β_{B}},

(50)

with

ρ_{A, β_{A,} λ_{0}} = \frac{e^{- β_{A} H_{A} (λ_{0})}}{Z_{A} (β_{A}, λ_{0})},

(51)

Z_{A} (β_{A}, λ_{0}) = Tr (e^{- β_{A} H_{A} (λ_{0})}),

(52)

and

Z_{A} (β_{A}, λ_{0}) = \exp (- β_{A} F_{A} (β_{A}, λ_{0})) .

(53)

Here, F_A(β_A, λ₀) denotes the equilibrium free energy for A. For a general state ρ of a system with energy H we define the non equilibrium free energy of the state ρ at temperature T to be

F (β, ρ) = Tr (ρ H) - \frac{1}{β} S (ρ) .

(54)

In particular, for subsystem A one can define the non equilibrium free energy of the state $ρ_{A}^{(U)}$ at temperature β_A to be

F_{A}^{(U)} (β_{A}) = F_{A} (β_{A}, ρ_{A}^{(U)}) = Tr (ρ_{A}^{(U)} H_{A} (λ^{(U)})) - \frac{1}{β_{A}} S (ρ_{A}^{(U)}) .

(55)

In both of the above formulas temperature is not necessarily related to the state ρ.

4.2. Identities for the Work

We next establish the following two relations

(56)

and

(57)

with $F_{A}^{(U)}$ the non equilibrium free energy of $ρ_{A}^{(U)}$ calculated at the initial temperature T_A, namely Equation (55), $F_{A}^{(U)} = {Tr}_{A} (ρ_{A}^{(U)} H_{A} (λ^{(U)}) - T_{A} S (ρ_{A}^{(U)})$ . We will comment on these relations in Par. 4.3 Remark 5: Here the free energy of Equation (55) is not a state function, because it is calculated at the initial temperature of A. Note our notation: When we write F_A(β_A, λ₀) this is the equilibrium free energy of the thermal state of A at temperature β_A and external parameter λ₀. When we write $F_{A}^{(U)}$ we mean the non-equilibrium free energy, as defined above.

Proof of Equation (56): One again starts from the fundamental identities Equations (7) and (25)

δ^{(U (t))} S (ρ_{B}) - β_{B} δ^{(U (t))} E_{B} (ρ) = - S (ρ_{B}^{(U)} | ρ_{B, β_{B}}),

(58)

δ^{(U (t))} S (ρ_{A}) - β_{A} {Tr}_{A} ((ρ_{A}^{(U)} - ρ_{A, β_{A}, λ_{0}}) H_{A} (λ_{0})) = - S (ρ_{A}^{(U)} | ρ_{A, β_{A}, λ_{0}}),

(59)

and

δ^{(U (t))} S (ρ_{A}) + δ^{(U (t))} S (ρ_{B}) = S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) .

(60)

Note that Equation (59) is just Equation (7) with the substitutions $ρ \to ρ_{A}^{(U)}$ and $ρ_{B} \to ρ_{A, β_{A}, λ_{0}}$ . Therefore it contains the initial H_A(λ₀) (referring to $ρ_{B} \to ρ_{A, β_{A}, λ_{0}}$ ), not the final one. Equation (60) is likewise a rewriting of Equation (35).

Now add Equations (58) and (59) and subtract Equation (60), using the fact that

S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) + S (ρ_{A}^{(U)} | ρ_{A, 0}) + S (ρ_{B}^{(U)} | ρ_{B, 0}) = S (ρ^{(U)} | ρ_{A, 0} \otimes ρ_{B, 0})

(61)

(this is the same as our unnumbered equation between Equations (37) and (38)) we obtain

- β_{B} δ^{(U (t))} E_{B} (ρ) - β_{A} {Tr}_{A} ((ρ_{A}^{(U)} - ρ_{A, β_{A}, λ_{0}}) H_{A} (λ_{0})) = - S (ρ^{(U)} | ρ_{A, 0} \otimes ρ_{B, 0}) .

(62)

Conservation of energy Equation (46) gives

(63)

We eliminate the second trace in the right hand side of Equation (63) using Equation (62), multiply by T_A to obtain Equation (56).

Proof of Equation (57): In Equation (56), we replace the relative entropy term, using

(64)

and use the definition of $F_{A}^{(U)}$ of Equation (55)

- \frac{1}{β_{A}} S - (ρ_{A}^{(U)} | ρ_{A, β_{A}, λ_{0}}) Tr (ρ_{A}^{(U)} (H_{A} (λ^{(U)}) - H_{A} (λ_{0}))) = F_{A} (β_{A}, λ_{0}) - F_{A}^{(U)} .

(65)

4.3. Inequalities for the Work

From the identities of Equations (56) and (57), we deduce immediately corresponding inequalities

δ^{(U (t))} W \leq - δ^{(U (t))} E_{V} (ρ) - {Tr}_{A} (ρ_{A}^{(U)} (H_{A} (λ^{(U)}) - H_{A} (λ_{0}))) - (1 - \frac{β_{B}}{β_{A}}) δ^{(U (t))} E_{B} (ρ)

(66)

and

δ^{(U (t))} W \leq - δ^{(U (t))} E_{V} (ρ) - (1 - \frac{β_{B}}{β_{A}}) δ^{(U (t))} E_{B} (ρ) + (F_{A} (β_{A}, λ_{0}) - F_{A}^{(U)}) .

(67)

The interpretation of inequality (67) is straightforward. If one can neglect the interaction energy, and if T_A = T_B, one gets an analogue of the familiar thermodynamic inequality giving an upper bound between the work received by the work source and the variation of the free energy of A,

δ^{(U (t))} W \leq F_{A} (β_{A}, λ_{0}) - F_{A}^{(U)} .

(68)

Note that this relation is not restricted to cycles, nor to exchanges with thermal baths (which would stay in their initial thermal states).

Remark 6: Equation (57) contains much more information than inequalities Equations (67) and (68), since it expresses the difference between the maximum work that can be delivered by system A and the work effectively extracted from A, which is the energy dissipated in the process. It is expressed in terms of relative entropies, and it will be shown in Section 6 that it can be explicitly estimated, which yields a calculation of transport coefficients from first principles.

4.4. The Case of Equalities in Equations (66) and (67)

If one has equality in Equation (66), the relative entropy of Equation (56) must be equal to 0,

S (ρ^{(U)} | ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}) = 0,

(69)

so $ρ^{(U)} = ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}$ and the final state has come back to its initial value. If we have equality in Equation (67), both relative entropies of Equation (57) are equal to 0. In this case $ρ_{B}^{(U)}$ has come back to its initial value $ρ_{B, β_{B}}$ and δ^(U)E_B(ρ) = 0. Then, one has

δ^{(U (t))} W = - δ^{(U (t))} E_{V} (ρ) + F_{A} (β_{A}, λ_{0}) - F_{A}^{(U)} .

(70)

4.5. Case Where A is not Initially in Thermal Equilibrium

We shall now assume that the initial state is

ρ_{0} = ρ_{A, 0} \otimes ρ_{B, β_{0}},

(71)

ρ_A_,0 being a general state.

The following identity also holds:

- δ^{(U (t))} W = δ^{(U (t))} F_{A} (β_{B}, ρ_{A}) + δ^{(U (t))} E_{V} (ρ) + T_{B} (S (ρ_{A}^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) + S (ρ_{B}^{(U)} | ρ_{B, β_{B}})) .

(72)

This equation, true for any initial state ρ_A, can be found in [8,18]. Note the temperature of B appearing in the non equilibrium free energy of A

F_{A} (β_{B}, ρ_{A}) = E_{A} (ρ_{A}) - T_{B} S (ρ_{A}) .

(73)

If no work is performed, Equation (72) reduces to Equation (28) upon exchanging the labels A and B. Proof: Using S(ρ^(U)) = S(ρ₀) and the definition of the thermal state, one has

S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) + S (ρ_{B}^{(U)} | ρ_{B, β_{B}}) = - S (ρ_{A, 0}) - S (ρ_{B, β_{B}}) + S (ρ_{A}^{(U)}) + β_{B} E_{B} (ρ_{B}^{(U)}) + \log Z_{B} (β_{B}) .

(74)

Then,

\log Z_{B} (β_{B}) = - β_{B} E_{B} (ρ_{B, β_{B}}) + S (ρ_{B, β_{B}}),

(75)

and Equation (74) becomes

S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) + S (ρ_{B}^{(U)} | ρ_{B, β_{B}}) = δ^{(U (t))} S (ρ_{A}) + β_{B} δ^{(U (t))} E_{B} (ρ_{B}) .

(76)

Using the conservation of energy, Equation (46), one obtains

δ^{(U (t))} W = - δ^{(U (t))} E_{V} (ρ) - δ^{(U (t))} F_{A} (β_{B}, ρ_{A}) - T_{B} [S (ρ^{(U)} | ρ_{A}^{(U)} \otimes ρ_{B}^{(U)}) + S (ρ_{B}^{(U)} | ρ_{B, β_{B}})] .

(77)

Here

δ^{(U (t))} F_{A} (β_{B}, ρ_{A}) = δ^{(U (t))} E_{A} (ρ_{A}) - T_{B} δ^{(U (t))} S (ρ_{A})

(78)

is the variation of the non equilibrium free energy of A calculated at the initial temperature T_B of B and

δ^{(U (t))} E_{A} (ρ_{A}) = {Tr}_{A} (H_{A} (λ^{(U)}) ρ_{A}^{(U)}) - {Tr}_{A} (H_{A} (λ_{0}) ρ_{A, 0}) .

(79)

In particular

δ^{(U (t))} W \leq - δ^{(U (t))} E_{V} (ρ) - δ^{(U (t))} F_{A} (β_{B}, ρ_{A}),

(80)

which gives a general upper bound for the work production from heat exchanges between an arbitrary system A and a system B initially at equilibrium (not necessarily a heat bath). In this relation, equality is realized if and only if the two relative entropy terms of Equation (77) are zero, which means that

ρ_{B}^{(U)} = ρ_{B, β_{B}} and ρ^{(U)} = ρ_{A}^{(U)} \otimes ρ_{B, β_{B}} .

(81)

Remark 7: By convention, a thermal bath is in a thermal state which is assumed to remain constant during the evolution. Our system B is not a thermal bath in this sense; its state varies during the evolution.

5. A System Coupled Only to an External Work Source

While Carnot and many others primarily considered model machines exchanging heat with several reservoirs, new thermodynamic relations have recently been announced [19] concerning exchanges of a single system with a work source. We now focus on this case.

5.1. Hypotheses

We consider a system coupled only to an external work source, so that the Hamiltonian of the system is H(λ).

At time t = 0, the state of the system is supposed to be a thermal state $ρ_{β_{0}} (λ_{0})$ . The external observer imposes an evolution λ(t) of the parameter λ from λ₀ to λ^(U), inducing a unitary or symplectic evolution U of the whole system. The work that the external observer must perform to realize this evolution is obviously the variation of the energy of the system. With the convention of Section 4.1, we denote by δ^(U(t))W the work counted positive if the external source receives it from the system. We are now in a particular case of Section 4.1 when the system is A, there is no system B and no V. Thus from Equation (46)

δ^{(U (t))} W = - δ^{(U (t))} E (ρ) = Tr (ρ_{β_{0}} (λ_{0}) H (λ_{0}) - ρ^{(U)} H (λ^{(U)})) .

(82)

5.2. Identities for the Work

From Equations (56) and (57) we obtain immediately

δ^{(U (t))} W = - Tr (ρ^{(U)} (H (λ^{(U)}) - H (λ_{0}))) - \frac{1}{β_{0}} S (ρ^{(U)} | ρ_{β_{0}} (λ_{0}))

(83)

and

δ^{(U (t))} W = F (β_{0}, λ_{0}) - F^{(U)},

(84)

with F^(U) the non equilibrium free energy at temperature β₀.

F^{(U)} = Tr (ρ^{(U)} H (λ^{(U)})) - \frac{1}{β_{0}} S (ρ^{(U)})

(85)

We now prove the following identity

δ^{(U (t))} W = F (β_{0}, λ_{0}) - F (β_{0}, λ^{(U)}) - \frac{1}{β_{0}} S (ρ^{(U)} | ρ_{β_{0}} (λ^{(U)})) .

(86)

This is a particular case of the result of [19].

Proof of Equation (86): We start from Equation (84) written as

δ^{(U (t))} W = F (β_{0}, λ_{0}) - F^{(U)} = F (β_{0}, λ_{0}) - F (β_{0}, λ^{(U)}) + F (β_{0}, λ^{(U)}) - F^{(U)} .

(87)

Now

β_{0} (F^{(U)} - F (β_{0}, λ^{(U)})) = - S (ρ^{(U)}) + β_{0} Tr (H (λ^{(U)}) ρ^{(U)}) - β_{0} F (β_{0}, λ^{(U)}) .

(88)

But

S (ρ^{(U)} | ρ_{β_{0}} (λ^{(U)})) = - S (ρ^{(U)}) + β_{0} Tr (H (λ^{(U)}) ρ^{(U)}) + \log Z (β_{0}, λ^{(U)}),

(89)

so that comparing Equations (88) and (89), one has

β_{0} (F^{(U)} - F (β_{0}, λ^{(U)})) = S (ρ^{(U)} | ρ_{β_{0}} (λ^{(U)})),

(90)

and from Equation (87) we then deduce Equation (86).

Remark 8: Since the transition under discussion is adiabatic, free energy is less suitable for inequalities of the form (86) than is internal energy. See Section 5.6.

5.3. Inequalities for the Work

5.3.1. From Equation (83)

From Equation (83) we deduce

δ^{(U (t))} W \leq - Tr (ρ^{(U)} (H (λ^{(U)}) - H (λ_{0}))),

(91)

with equality if and only if $ρ^{(U)} = ρ_{β_{0}} (λ_{0})$ ,i.e., the final state is the initial state.

5.3.2. From Equation (86)

From Equation (86) we deduce

δ^{(U (t))} W \leq F (β_{0}, λ_{0}) - F (β_{0}, λ^{(U)})

(92)

with equality if and only if

ρ^{(U)} = ρ_{β_{0}} (λ^{(U)}) .

(93)

That is, ρ^(U) is the thermal state at the initial temperature and final value λ^(U) of λ. Note that a necessary condition for this is that the entropy of the final thermal state is the same as the entropy of the initial state.

5.4. Relation to the Identity of Jarzynski

Let z denote a point in the phase space of the system. In this section we assume that the dynamics is classical.

We denote by z(s|z₀) the classical trajectory of the phase space point at time s starting from z₀ at time s = 0, for the classical evolution U. The external observer imposes the variation λ(s) of λ from λ₀ to λ^(U) = λ(t). The identity of Jarzynski is [20]:

(94)

Because the exponential function is strictly convex, Jensen’s inequality implies that

(95)

so that using Equation (94) and taking the logarithm, one obtains

δ^{(U (t))} W \leq F (β_{0}, λ_{0}) - F (β_{0}, λ^{(U)}),

(96)

which is the inequality (92).

But if the inequality (96) is an equality, we deduce $ρ^{(U)} = ρ_{β_{0}} (λ^{(U)})$ as in Equation (93), but we also deduce that the inequality of Jensen (95) is an equality. Because the exponential function is strictly convex, this implies that the differences

H (z (t | z_{0}), λ^{(U)}) - H (z_{0}, λ_{0}) = C,

(97)

where C is a constant independent of z₀ (but obviously dependent on λ₀, λ^(U) and t); in other words, the “microscopic work” is independent of the microscopic trajectory. Although this equality would seem impossible, it turns out that identity (97) can be realized for certain systems and evolutions of λ (see appendix A).

5.5. Effective Temperatures

Let H(λ) be a Hamiltonian depending on λ and ρ a state (classical or quantum) with energy E(ρ) = Tr (ρH(λ)). We can define two temperatures for ρ.

(i)

The temperature β_e(ρ, λ) is the temperature such that

E (ρ) = E (β_{e}, λ),

(98)

with $E (β_{e}, λ) = Tr (ρ_{β_{e}} (λ) H (λ))$ . It is known that ∂E(β, λ)/∂β < 0, so that Equation (98) defines β_e unambiguously. The basic identity (4) shows that

S (ρ_{β_{e}} (λ)) - S (ρ) = S (ρ | ρ_{β_{e}} (λ)),

(99)

so that

S (ρ_{β_{e}} (λ)) \geq S (ρ),

(100)

which is the well known fact that $ρ_{β_{e}} (λ)$ maximizes the entropy among all states ρ having a fixed energy. The quantity β_e(ρ, λ) can be called the effective temperature.

(ii)

There is a second temperature β_a(ρ, λ) such that

S (ρ) = S (β_{a}, λ) .

(101)

In this definition, S(β, λ) is the entropy of a thermal state with temperature β and external parameter λ. We call this the adiabatic temperature, and by the same arguments as given above it is well-defined. From Equation (100) and Equation (101), one has

S (β_{e}, λ) \geq S (β_{a}, λ) .

(102)

Because

{\frac{\partial E}{\partial S (β, λ)} |}_{_{λ fixed}} = \frac{1}{β}

(103)

we deduce from Equation (102) that

E (β_{a}, λ) \leq E (β_{e}, λ) = E (ρ)

(104)

and

β_{a} \geq β_{e} .

(105)

Because S is a strictly increasing function of E (for λ fixed), one sees that in Equation (102) or (104), one has equality if and only if β_a = β_e. Moreover, one has the identity

S (ρ | ρ_{β_{a}} (λ)) - S (ρ | ρ_{β_{e}} (λ)) = S (ρ_{β_{e}} (λ) | ρ_{β_{a}} (λ)),

(106)

which can immediately be verified.

5.6. A More Precise Expression for the Work

In thermodynamics, for an adiabatic evolution, the work is related to the internal energy by dE = −dW, rather than to the free energy. Similarly, the work is related to the adiabatic temperature rather than to the effective energy temperature.

Given the state ρ^(U) (corresponding to the evolution U, the parameter varying from λ₀ to λ^(U)) we can define the adiabatic temperature $β_{a}^{(U)}$ such that

S (β_{a}^{(U)}, λ^{(U)}) = S (ρ^{(U)}) = S (β_{0}, λ_{0}) .

(107)

We prove the following identity

δ^{(U (t))} W = E (β_{0}, λ_{0}) - E (β_{a}^{(U)}, λ^{(U)}) - \frac{1}{β_{a}^{(U)}} S (ρ^{(U)} | ρ_{β_{a}^{(U)}} (λ^{(U)})) .

(108)

Proof of Equation (108): One has by definition (82)

- δ^{(U (t))} W = E (ρ^{(U)}) - E (β_{0}, λ_{0}) = E (ρ^{(U)}) - E (β_{a}^{(U)}, λ^{(U)}) + E (β_{a}^{(U)}, λ^{(U)}) - E (β_{0}, λ_{0}) .

(109)

Then

(110)

because $S (ρ^{(U)}) = S (β_{a}^{(U)}, λ^{(U)})$ by the definition (107). From this result and Equation (109) we deduce Equation (108).

As a consequence of Equation (108), we deduce the inequality

δ^{(U (t))} W \leq E (β_{0}, λ_{0}) - E (β_{a}^{(U)}, λ^{(U)}) .

(111)

In standard thermodynamics, for system thermally isolated and coupled to a work source, one has dE = −dW, because δ^(U(t)) S = 0 for an adiabatic (thermally isolated) process and we recover equality in Equation (111). In this situation, the inequality (92) comparing the work to the difference of free energies is not relevant, because the temperature does not remain constant.

Note that the work upper bound (111), given in terms of energy and the adiabatic temperature, is sharper than the bound given by (92), which is in terms of free energy. This is proved in the next subsection.

5.7. Upper Bounds on the Work Delivered by a System. Comparison of Equations (92) and (111)

We next show that using internal energy for the work inequality gives a sharper result than using the free energy. Specifically,

δ^{(U (t))} W \leq E (β_{0}, λ_{0}) - E (β_{a}^{(U)}, λ^{(U)}) \leq F (β_{0}, λ_{0}) - F (β_{0}, λ^{(U)}) .

(112)

Proof of Equation (112): We need only prove that

Δ \equiv F (β_{0}, λ_{0}) - E (β_{0}, λ_{0}) - (F (β_{0}, λ^{(U)}) - E (β_{a}^{(U)}, λ^{(U)})) \geq 0.

(113)

Using the definition of the equilibrium free energy and Equation (107) we have

Δ = - \frac{1}{β_{0}} [S (β_{a}^{(U)}, λ^{(U)}) - S (β_{0}, λ^{(U)})] + [E (β_{a}^{(U)}, λ^{(U)}) - E (β_{0}, λ^{(U)})] .

(114)

Note that in Equation (114) all terms involving λ are evaluated at λ^(U). Therefore

Δ = \int_{β_{0}}^{β_{a}^{(U)}} [\frac{\partial E (β, λ^{(U)})}{\partial β} - \frac{1}{β_{0}} \frac{\partial S (β, λ^{(U)})}{\partial β}] d β .

(115)

But

\frac{\partial S (β, λ)}{\partial β} = \frac{\partial S}{\partial E} \frac{\partial E (β, λ)}{\partial β} = β \frac{\partial E (β, λ)}{\partial β} .

(116)

Using Equation (116) in Equation (115), we obtain

Δ = \int_{β_{0}}^{β_{a}^{(U)}} \frac{\partial E (β, λ^{(U)})}{\partial β} [1 - \frac{β}{β_{0}}] d β .

(117)

But $\frac{\partial E (β, λ^{(U)})}{\partial β}$ < 0, so that ∆ ≥ 0. Note that this does not depend on which of β₀ and $β_{a}^{(U)}$ is larger.

5.8. The Case of Equalities in Equations (111) and (92)

5.8.1. Equality in Equation (111)

In this case, one has $S (ρ^{(U)} | ρ_{β_{a}^{(U)}} (λ^{(U)})) = 0$ in Equation (108) so

ρ^{(U)} = ρ_{β_{a}^{(U)}} (λ^{(U)}) .

(118)

In particular, ρ^(U) is a thermal state so that

β_{e}^{(U)} = β_{a}^{(U)} .

(119)

However, if one has equality in Equation (111), this does not improve the upper bound of Equation (92) for the free energy,

δ^{(U (t))} W \leq F (β_{0}, λ_{0}) - F (β_{0}, λ^{(U)}) .

(120)

In other words, the optimal bound for δ^(U(t)) W is given by the internal energy and not the free energy (so that the internal energy in general yields a better bound than that given in [20]).

5.8.2. Equality in Equation (92)

From Equation (93) we deduce that

ρ^{(U)} = ρ_{β_{0}} (λ^{(U)}),

(121)

so that ρ^(U) is a thermal state and thus

β_{e}^{(U)} = β_{0} = β_{a}^{(U)} .

(122)

This implies that we also have equality in Equation (111)

δ^{(U (t))} W = E (β_{0}, λ_{0}) - E (β_{0}, λ^{(U)}) .

(123)

5.9. The Case λ^(U) = λ₀

If one assumes that the final value λ^(U) of λ is equal to its initial value, we see immediately that $β_{a}^{(U)} = β_{0}$ .Indeed

S (β_{a}^{(U)}, λ_{0}) = S (ρ^{(U)}) = S (β_{0}, λ_{0}),

(124)

so that the temperatures are equal $β_{a}^{(U)} = β_{0}$ . In this case, one has from Equation (108)

δ^{(U (t))} W = - \frac{1}{β_{0}} S (ρ^{(U)} | ρ_{β_{0}} (λ_{0})) \leq 0,

(125)

with equality if and only if

ρ^{(U)} = ρ_{β_{0}} (λ_{0}),

(126)

so that the state has returned to its initial value.

Remark 9: If the external observer imposes a variation λ(t) of the control parameter with λ(0) = λ₀, λ(t₁) = λ₁, λ(t_f) = λ₀, inequality (125) says that at the end of the cycle, the observer has always lost work. In particular, the work that the external observer has put in the system in the time interval [0, t₁] cannot be entirely recovered in the time interval [t₁, t_f] whatever one does, except if the final state ρ^(U) is the initial state.

Remark 10: When λ^(U) = λ₀, one can also recover Equation (125) from the identity (86). This identity reduces to

δ^{(U (t))} W = - \frac{1}{β_{0}} S (ρ^{(U)} | ρ_{β_{0}} (λ_{0})) .

(127)

6. Relative Entropy, Energy Dissipation and Fourier’s Law

In this Section we derive dissipation in the quantum context and show it to be intimately related to the relative entropy.

6.1. The Born Approximation

A quantum system has a Hamiltonian

H = H_{0} + V .

(128)

Let $ψ_{k}^{(0)}, E_{k}^{(0)}$ be the eigenstates and eigenvalues of H₀. In the Born approximation, the state $| ψ_{n}^{(0)} 〉$ becomes at time t a state $| ψ_{n} (t) 〉$ with

| ψ_{n} (t) 〉 = \sum_{k} a_{k}^{(n)} (t) e^{- i E_{k}^{(0)} t / ℏ} | ψ_{k}^{(0)} 〉 .

(129)

The quantities $a_{k}^{(n)} (t) = δ_{k, n} + {\tilde{a}}_{k}^{(n)} (t)$ satisfy

i ℏ \frac{d {\tilde{a}}_{k}^{(n)}}{d t} = \sum_{l} V_{k, l} (t) (δ_{l, n} + {\tilde{a}}_{l}^{(n)}),

(130)

where

V_{k, l} (t) = V_{k, l} \exp (\frac{i}{ℏ} (E_{k}^{0} - E_{l}^{(0)}) t) .

(131)

We assume here V_n,_n = 0 for all n. One readily deduces that in the Born approximation

a_{k}^{(n)} (t) = \frac{V_{k, n}}{E_{k}^{(0)} - E_{n}^{(0)}} (1 - e^{i (E_{k}^{(0)} - E_{n}^{(0)}) \frac{t}{ℏ}}) (k \neq n)

(132)

and by unitarity $\sum_{k} {| a_{k}^{(n)} (t) |}^{2} = 1,$ , so to second order in ${\tilde{a}}_{k}^{(n)}$

2 Re {\tilde{a}}_{n}^{(n)} (t) = - \sum_{k \neq n} {| {\tilde{a}}_{k}^{(n)} (t) |}^{2} .

(133)

Let ρ₀ be an initial state diagonal in the basis $ψ_{n}^{(0)}$

ρ_{0} = \sum p_{0, n} | ψ_{n}^{(0)} 〉 〈 ψ_{n}^{(0)} | .

(134)

Then, at time t, the state becomes

(135)

If L is a Hermitian operator diagonal in the basis $ψ_{n}^{(0)}$ with eigenvalues λ_n, using Equation (135) one obtains in the Born approximation

Tr (L (ρ^{(U (t))} - ρ_{0})) = \frac{1}{2} \sum_{k \neq n} {| {\tilde{a}}_{k}^{(n)} (t) |}^{2} (λ_{k} - λ_{n}) (p_{0, n} - p_{0, k}) .

(136)

6.2. Two Interacting Systems

We consider two quantum systems A, B with Hamiltonians H_A, H_B respectively, interacting. Denote by V = V_A,_B the interaction energy and

H = H_{A} + H_{B} + V .

(137)

We call $| ψ_{A, k}^{(0)} 〉, E_{A, k}^{(0)} (resp. | ψ_{B, l}^{(0)} 〉, E_{B, l}^{(0)})$ the eigenstates and eigenvalues of H_A (resp. H_B), and we apply the Born approximation to H, with H₀ = H_A + H_B. The non perturbed Hamiltonian H₀ has eigenstates $| ψ_{A, k}^{(0)} 〉 | ψ_{B, l}^{(0)} 〉$ with eigenvalues $E_{A, k}^{(0)} + E_{B, l}^{(0)}$ .

We assume that at time t = 0, the state of the system A + B is ρ₀ = ρ_A ⊗ ρ_B with

(138)

so that they are diagonal in the eigenbasis of H_A and H_B and therefore commute with H_A +H_B. At time t, the initial state ρ₀ = ρ_A ⊗ ρ_B evolves to ρ(t). Then

S (ρ (t) | ρ_{A} \otimes ρ_{B}) = Tr (ρ (t) \log ρ (t)) - Tr (ρ (t) \log (ρ_{A} \otimes ρ_{B})) .

(139)

But S(ρ(t)) = S(ρ₀) by unitarity of the evolution, so that

S (ρ (t) | ρ_{A} \otimes ρ_{B}) = - Tr [(ρ (t) - ρ_{A} \otimes ρ_{B}) (\log ρ_{A} + \log ρ_{B})] .

(140)

This is of the form of Equation (136) with

L = - (\log ρ_{A} + \log ρ_{B}) .

(141)

L has eigenvectors $| ψ_{A, k}^{(0)} 〉 | ψ_{B, l}^{(0)} 〉$ with eigenvalues log p_A,_k + log p_B,_l; ρ_A ⊗ ρ_B has the same eigenvectors with eigenvalues p_A,_k + p_B,_l. Applying Equation (136), one obtains in the Born approximation

(142)

Notice that the quantity in the right hand side is automatically non-negative. Here, we have

(143)

with $V_{(k, l)}^{(n, m)} = 〈 ψ_{A, n}^{(0)} \otimes ψ_{B, m}^{(0)} | V | ψ_{A, k}^{(0)} \otimes ψ_{B, l}^{(0)} 〉 .$ .

We also deduce from this result that in this approximation S(ρ(t)|ρ_A ⊗ ρ_B) = 0 if and only if V = 0 (recall that the diagonal elements of V are 0).

6.3. The Case Where Both Initial States are Thermal

Assume that at time $t = 0, ρ_{A} = ρ_{A, β_{A}}$ and $ρ_{B} = ρ_{B, β_{B}}$ are the thermal states of A and B respectively. From Equation (37) one has

β_{A} δ^{(U (t))} E_{A} (ρ_{A}) + β_{B} δ^{(U (t))} E_{B} (ρ_{B}) = S (ρ (t) | ρ_{A, β_{B}}) .

(144)

Moreover, from conservation of energy

δ^{(U (t))} E_{A} (ρ_{A}) + δ^{(U (t))} E_{B} (ρ_{B}) + δ^{(U (t))} E_{V} (ρ) = 0,

(145)

so that eliminating δ^(U(t)) E_B(ρ_B), one obtains

(β_{A} - β_{B}) δ^{(U (t))} E_{A} (ρ_{A}) = β_{B} δ^{(U (t))} E_{V} (ρ) + S (ρ (t) | ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}) .

(146)

We now estimate both terms on the right hand side of Equation (146).

6.3.1. Estimate of the Relative Entropy

From Equation (142) we deduce

(147)

Moreover, when t → ∞, as in the usual Born approximation, Equation (143) shows that

{| {\tilde{a}}_{(k, l)}^{(n, m)} (t) |}^{2} ≃ \frac{2 π}{ℏ} {| V_{(k, l)}^{(n, m)} |}^{2} δ (E_{A, k}^{(0)} + E_{B, l}^{(0)} - E_{A, n}^{(0)} - E_{B, m}^{(0)}) t .

(148)

Thus if f_A and f_B denote the density of states for A and B, we obtain from Equation (147)

(149)

6.3.2. Estimate of the Interaction Energy

Because δ^(U(t)) V (ρ) = −δ^(U(t)) E_A(ρ) − δ^(U(t)) E_B(ρ), one has

δ^{(U (t))} V_{A, B} (ρ) = Tr (- (H_{A} + H_{B}) (ρ (t) - ρ_{A, β_{A}} \otimes ρ_{B, β_{A}})) .

(150)

This is of the form of Equation (136) with L = −H_A − H_B, and so

(151)

Up to a sign, this expression is formally identical to the expression Equation (142), except that the difference of energies (E_A,_k + E_B,_l − E_A,_n − E_B,_m) replaces the quantity β_A(E_A,_k − E_A,_n) + β_B(E_B,_l − E_B,_m). As a consequence E_A,_k + E_B,_l − E_A,_n − E_B,_m partially cancels the denominator of ${| {\tilde{a}}_{(k, l)}^{(n, m)} (t) |}^{2}$ and one sees that $β_{B} δ^{(U (t))} E_{V} (ρ)$ is negligible when t → ∞.

Then from Equations (146) and (149), one sees that

δ^{(U (t))} E_{A} (ρ_{A}) ≃ \frac{S (ρ (t) | ρ_{A, β_{A}} \otimes ρ_{B, β_{B}})}{β_{A} - β_{B}} ≃ (β_{A} - β_{B}) K t .

(152)

In Equation (152) K is the positive constant

(153)

with

(154)

It is obvious that φ ≥ 0. Note that K does not vanish for β_A close to β_B.

The expression (152) is a form of Fourier’s law for heat transport from B to A, (β_A − β_B)K being the rate of dissipation. In this case, one sees that the significance of the relative entropy $S (ρ (t) | ρ_{A, β_{A}} \otimes ρ_{B, β_{B}})$ is that of a transport coefficient, here the transport of energy from one part of a system to another part.

7. Coarse Grained States

Coarse-graining is omnipresent in in macroscopic and mesoscopic physics, since microscopic variables are often not what is observed. In general coarse-graining represents a loss of information, hence an increase in entropy. In this section we consider a variety of coarse-graining procedures, and consistent with our work in this article, relative entropy plays a significant role both in the definition of coarse-graining and in the measures of entropy increase.

7.1. Definition

Let ρ and ρ′ be two states of the same system (classical or quantum). We say that ρ′ is obtained from ρ by a coarse graining operation if

Tr ((ρ - ρ') \log ρ') = 0.

(155)

The idea is that the information associated with ρ′ (namely log ρ′) is the same whether one averages with ρ′ or with the more detailed distribution ρ. Using the basic identity, Equation (4), we can say that ρ′ is obtained from ρ by a coarse graining operation, if and only if

S (ρ') - S (ρ) = S (ρ | ρ') .

(156)

In particular, S(ρ′) ≥ S(ρ), so that the entropy increases by coarse-graining. (See the comment after Equation (3).)

A coarse-graining mapping is a mapping Γ which associates to any state ρ (or to some states of a given class), a coarse grained state ρ′ = Γ(ρ).

7.2. Examples of Coarse-graining Mappings

Example 1: Maximum entropy.

Let A₁,…, A_n be observables of the system, so they are either functions in the phase space or hermitian operators on the Hilbert space of the system. We consider the class of states ρ such that

Tr (A_{i ρ}) < \infty, i = 1, .., n .

(157)

One can then consider the state ρ′ such that ρ′ has maximal entropy given the relation

Tr (A_{i} ρ^{'}) = Tr (A_{i} ρ) .

(158)

It is immediately seen that

ρ^{'} = C \exp (\sum_{i = 1}^{n} α_{i} A_{i}),

(159)

where C is a normalization constant and α_i are the “conjugate parameters”, (provided ρ′ is normalizable). The mapping Γ: ρ → ρ′ is indeed a coarse grain mapping in the sense of the previous definition, because by Equation (159)

Tr ((ρ - ρ^{'}) \log ρ^{'}) = \sum_{i = 1}^{n} Tr ((ρ - ρ^{'}) A_{i}) = 0.

(160)

In particular, one has Equation (156).

The case of the thermal state is the best known, where one takes A = H, the Hamiltonian of the system.

Example 2: Naive coarse graining; the observables as characteristic functions.

(i)

Classical case: Let Z be the phase space of the system and {Z_i} a finite partition of Z ( $Z = \cup_{i = 1}^{n} Z_{i}$ and Z_i∩Z_j = Ø for i ≠ j). We choose $A_{i} = χ_{Z_{i}}$ (i.e. the characteristic function of Z_i). This is a particular case of example 1 and if ρ is a state

ρ^{'} = C \exp (\sum_{i = 1}^{n} α_{i} χ_{Z_{i}}) .

(161)

Using the condition (156), namely,

\int_{Z_{i}} ρ^{'} d z = \int_{Z_{i}} ρ d z,

(162)

one can deduce from Equations (161) and (162)

ρ^{'} |_{Z_{i}} = \frac{1}{Vol (Z_{i})} \int_{Z_{i}} ρ d z or ρ^{'} = \sum_{i = 1}^{n} (ρ^{'} |_{Z_{i}}) χ_{Z_{i}} .

(163)

This equation implies that ρ′ is normalized ∫ ρ′dz = 1. We recover the usual coarse graining.

(ii)

Quantum case: Let $ℋ$ be the Hilbert space of the system and P_i a resolution of the identity by orthogonal projectors

Id = \sum P_{i} and P_{i} P_{j} = P_{i} δ_{i, j} .

(164)

Then the analogue of Equation (163) is

ρ^{'} = \sum_{i = 1}^{n} \frac{Tr {(ρ P)}_{i}}{\dim P_{i} (ℋ)} P_{i} .

(165)

Example 3: Coarse graining by marginals.

(i)

Classical case: We assume that the system consists of several parts, and that its phase space is a Cartesian product, $Z = \prod_{i = 1}^{n} Z_{i}$ , corresponding to various subsystems with phase space Z_i. If ρ is a state on Z, we denote by ρ_i its marginal probability distribution on Z_i, so

ρ_{i} (z_{i}) = \int \dots \int ρ (ζ_{1}, \dots, ζ_{i - 1}, z_{i}, ζ_{i + 1}, \dots, ζ_{n}) \prod_{j \neq i} d ζ_{j} .

(166)

Let Γ be the mapping that associates the product of its marginals to ρ(z)

Γ (ρ) (z_{1}, \dots, z_{n}) = \prod_{i = 1}^{n} ρ_{i} (z_{i}) .

(167)

Then the condition (155) is satisfied. It is easy to see that $\prod_{i = 1}^{n} ρ_{i} (z_{i})$ is the state ρ′ that maximizes the entropy among all the states ρ″ such that ${ρ^{″}}_{i} = ρ_{i}$ for any i.

(i)

Quantum case. The Hilbert space of the system is

ℋ = \otimes_{i = 1}^{n} ℋ_{i},

(168)

where the $ℋ_{i}$ are the Hilbert spaces of the subsystems. If ρ is a state, then its marginal state on $ℋ_{i}$ is the partial trace on the Hilbert space $K_{i}$ , which is the tensor product of the Hilbert spaces $ℋ_{j}$ for j different from i

ρ_{i} = T r_{K_{i}} ρ

(169)

and the mapping Γ,

Γ (ρ) = \otimes_{n}^{i = 1} ρ_{i},

(170)

is a coarse grained mapping. Γ(ρ) is again the state ρ′ which maximizes the entropy among all states ρ″ such that ${ρ^{″}}_{i} = ρ_{i}$ for all i.

Example 4: Decomposition of Z.

If $Z = \cup_{i = 1}^{n} Z_{i}$ , but the Z_i do not form a partition of Z (they can have intersections of non-zero measure), one can still apply Example 1 to $A_{i} = χ_{Z_{i}}$ and obtain

ρ^{'} = C \exp (\sum α_{i} χ_{z_{i}}) .

(171)

But now Equation (163) is no longer valid because, for given z, there will be in general several i with z ∈ Z_i.

7.3. Coarse Graining and Relative Entropy

(i)

The case of the naive coarse-graining is distinguished among all types of coarse-graining by the following property. Let $Z = \cup_{i = 1}^{n} Z_{i}$ a partition of the phase space and p, q two probability distributions on Z. Let $\bar{p} = Γ p$ and $\bar{q} = Γ q$ be the coarse grained states of p and q associated to this partition. Then one has

S (\bar{p} | \bar{q}) \leq S (p | q) .

(172)

Proof: call $p_{i} = \int_{Z_{i}} p d z$ and $q_{i} = \int_{Z_{i}} q d z$ . We have, using the definition of $\bar{p}$ and $\bar{q}$

S (\bar{p} | \bar{q}) = \sum_{i = 1}^{n} p_{i} \log \frac{p_{i}}{q_{i}} .

(173)

Now

(174)

But $\int_{Z_{i}} \frac{q (z)}{\int_{Z_{i}} q (z^{'}) d z^{'}} d z = 1$ . We use the fact that the function x log x is convex, so that for each i

(175)

Therefore from Equation (175)

\sum_{i} p_{i} \log \frac{p_{i}}{q_{i}} \leq \sum_{i} \int_{Z_{i}} p (z) \log \frac{p (z)}{q (z)} d z = S (p | q) .

(176)

(ii)

For the coarse-graining associated to subsystems one has $Z = \prod_{i = 1}^{n} Z_{i}$ and if p, q are states on Z, the coarse grained states are $\tilde{p} = \otimes_{i = 1}^{n} p_{i}, \tilde{q} = \otimes_{i = 1}^{n} q_{i}$ and we deduce immediately that

S (\tilde{p} | \tilde{q}) = \sum_{i = 1}^{n} S (p_{i} | q_{i}) .

(177)

Consider the case i = 1, and call z = (z₁, z′) with z′ = (z₂,…, z_n) and call Z′ = Z₂ × ⋯ × Z_n. Then

(178)

Now $\int_{Z^{'}} d z^{'} \frac{q (z_{1}, z^{'})}{\int_{Z^{'}} q (z_{1}, z^{″}) d z^{″}} = 1$ . As in Equation (175), we use the convexity of x log x and deduce that

(179)

From Equation (177) we deduce that for the coarse graining mapping associated to the division of $Z = \prod_{i = 1}^{n} Z_{i}$ in n subsystems, one has

S (\tilde{p} | \tilde{q}) \leq n S (p | q) .

(180)

Remark 11: The upper bound of Equation (180) cannot be improved. Indeed consider the case where: p(z₁,…, z_n) = p₁(z₁)δ(z₁ − z₂) … δ(z_n₋₁ − z_n) q(z₁,…, z_n) = q₁(z₁)δ(z₁ − z₂) … δ(z_n−₁ − z_n). Then p_i = p₁ and q_i = q₁, but S(p|q) = S(p₁|q₁) and $S (\bar{p} | \bar{q}) = n S (p_{1} | q_{1})$ .

(iii)

Thermal coarse graining.

Let Z be a phase space, and p and q two probability distributions on Z, H(z) a function of z ∈ Z.

Let $\tilde{p}$ and $\tilde{q}$ be the thermal coarse grained probability distributions of p and q, respectively, with respect to H. So

\tilde{p} (z) = \frac{1}{Z (β (p))} \exp (- β (p) H (z)) .

(181)

where β(p) is the effective temperature of p, i.e., ${〈 H 〉}_{p} = {〈 H 〉}_{\tilde{p}}$ .

Assuming that p − q is small, an obvious bound, after straightforward calculations (expanding to second order in p − q), is

S (\tilde{p} | \tilde{q}) \leq \frac{{〈 H^{2} 〉}_{p}}{{〈 H^{2} 〉}_{\tilde{p}} - {({〈 H 〉}_{p})}^{2}} S (p | q) .

(182)

This bound is surely not optimal, because if p and q are already thermal states, $S (\tilde{p} | \tilde{q}) = S (p | q)$ . Note though that even without the hypotheses on p and q, $S (\tilde{p} | \tilde{q}) \leq S (p | \tilde{q})$ .

8. Conclusions

The results in this article are used to obtain upper bounds for entropy production or energy variation in various situations of thermodynamic interest, with many such results either new or sharper than similar known bounds. Furthermore, the energy dissipated in these processes is expressed in terms of relative entropies, which not only gives a general microscopic interpretation of dissipation, but also, in relevant examples, leads to an explicit, first principles, evaluation of dissipation terms, analogous to the Fourier law.

Although relative entropy has made appearances in many contexts, especially with respect to information theory, our results on a generalized Fourier heat law relates it in a direct way to the notion of dissipation as understood in physics.

Acknowledgments

We (B.G. and L.S.) are grateful to the visitor program of the Max Planck Institute for the Physics of Complex Systems, Dresden, for its hospitality during the time that much of this work was performed. We also thank one of the referees for an extremely careful reading of the manuscript, leading we believe to considerable improvement in presentation and content.

Author Contributions

The authors contributed equally to the presented mathematical framework and the writing of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendixe

A. An Example of Trajectory-Independent Microscopic Work

We exhibit a Hamiltonian H(z, λ) and an evolution λ(t) of the external parameter such that

H (z (t_{f} | z_{0}), λ (t_{f})) - H (z_{0}, λ_{0}) = C,

(A1)

with C independent of z₀.

Take the harmonic oscillator

H (x, p, λ) = \frac{p^{2}}{2} + \frac{ω^{2} x^{2}}{2} - ω^{2} λ x .

(A2)

Call

(A3)

the solution with λ = 0.

For λ(s) a function of time s, the solutions of the Hamiltonian equations starting from (x₀, p₀) at s = 0 are

x (t | x_{0}, p_{0}) = \bar{x} (t) + ω \int_{0}^{t} λ (s) \sin (ω (t - s)) d s

(A4)

p (t | x_{0}, p_{0}) = \bar{p} (t) + ω^{2} \int_{0}^{t} λ (s) \cos (ω (t - s)) d s

(A5)

with $\frac{d x}{d t} = p$ and $\frac{d p}{d t} - - ω^{2} x + ω^{2} λ (t) .$ Assume that λ₀ = 0. Define $Λ_{c} (t) \equiv \int_{0}^{t} λ (s) \cos ω (t - s) d s$ and $Λ_{s} (t) \equiv \int_{0}^{t} λ (s) \sin ω (t - s) d s$ . Then

(A6)

We can impose a condition on t such that this quantity does not depend on x₀ and p₀, namely

(A7)

Then using these two equalities, one has

H (x (t), p (t), λ (t)) - H (x_{0}, p_{0}, 0) = - \frac{ω^{2}}{2} {(λ (t))}^{2} .

(A8)

Thus if λ(t) ≠ 0, we can arrange that the microscopic work is independent of the initial condition and is non zero.

B. An Exactly Solvable Model

The system A + B is formed of two two-levels atoms. The Hamiltonians of A and B are

\begin{matrix} H_{A} = (\begin{matrix} 0 & 0 \\ 0 & E_{A} \end{matrix}) & and & H_{B} = (\begin{matrix} 0 & 0 \\ 0 & E_{B} \end{matrix}) \end{matrix},

(A9)

with eigenstates |0_A〉, |+_A〉, |0_B〉, |+_B〉, so that the total Hamiltonian is in the basis |0_A, 0_B〉, |+_A, 0_B〉, |0_A, +_B〉, |+_A, +_B〉:

H = (\begin{matrix} 0 & 0 & 0 & w \\ 0 & E_{A} & 0 & 0 \\ 0 & 0 & E_{B} & 0 \\ w * & 0 & 0 & E_{A} + E_{B} \end{matrix})

(A10)

where w is the interaction energy.

Calling E₀ = E_A + E_B, the eigenvalues of H are

λ_{\pm} = \frac{E_{0} \pm \sqrt{E_{0}^{2} + 4 {| w |}^{2}}}{2} .

(A11)

as well as E_A and E_B. The eigenstates of E_A and E_B are |+_A, 0_B〉, |0_A, +_B〉, and the eigenstates of λ_± are

| φ_{\pm} 〉 = \frac{1}{N_{\pm}} (w | 0_{A}, 0_{B} 〉 + λ_{\pm} | +_{A}, +_{B} 〉),

(A12)

so that

| 0_{A}, 0_{B} 〉 = \frac{N_{+} N_{-}}{w (λ_{-} - λ_{+})} (\frac{λ_{-}}{N_{-}} | φ_{+} 〉 - \frac{λ_{+}}{N_{+}} | φ_{-} 〉)

(A13)

| +_{A} +_{B} 〉 = \frac{N_{+} N_{-}}{λ_{+} - λ_{-}} (\frac{1}{N_{-}} | φ_{+} 〉 - \frac{1}{N_{+}} | φ_{-} 〉) .

(A14)

Here $N_{\pm} = \sqrt{{| w |}^{2} + {| λ_{\pm} |}^{2}}$ is the normalization factor.

The initial state is $ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}$ :

(A15)

Using these formulas one can compute

ρ (t) = e^{- i H t} ρ_{A, β_{A}} \otimes ρ_{B, β_{B}} e^{i H t} = U (t) ρ_{A, β_{A}} \otimes ρ_{B, β_{B}} U {(t)}^{+}

(A16)

and verify that

Tr (H_{A} ρ (t)) = \frac{E_{A}}{Z_{A} Z_{B}} (e^{- β_{A} E_{A}} + e^{- β_{A} E_{A} - β_{B} E_{B}} \frac{{| λ_{+} e^{- i λ + t} - λ_{-} e^{- i λ - t} |}^{2}}{{(λ_{-} - λ_{+})}^{2}} {| ω |}^{2} \frac{{| e^{- i λ + t} - e^{- i λ - t} |}^{2}}{{(λ_{-} - λ_{+})}^{2}}) .

(A17)

Then

S (ρ (t) | ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}) = Tr ((β_{A} H_{A} + β_{B} H_{B}) (ρ (t) - ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}))

(A18)

δ E_{V} (ρ) = - Tr ((H_{A} + H_{B}) (ρ (t) - ρ_{A, β_{A}} \otimes ρ_{B, β_{B}}))

(A19)

and

(A20)

(A21)

Using Equation (146), one obtains

δ^{(U (t))} E_{A} (ρ) = \frac{E_{A}}{Z_{A} Z_{B}} (1 - e^{- β_{A} E_{A} - β_{A} E_{B}}) {| w |}^{2} \frac{\sin^{2} ((λ_{+} - λ_{-}) t)}{{(λ_{-} - λ_{+})}^{2}} .

(A22)

Here these quantities are periodic functions of period $\frac{2 π}{λ_{+} - λ_{-}} = \frac{2 π}{\sqrt{E_{0}^{2} + 4 {| w |}^{2}}}$ . Near resonance, where λ₊ ≃ λ₋, w ≃ 0, E₀ = E_A +E_B ≃ 0 and we recover that δ^(U(t)) E_A(ρ) ≃ K(β_A − β_B)t from Equation (A22).

C. Example: Forced Harmonic Oscillator

We take the Hamiltonian

H = - \frac{1}{2} \frac{d^{2}}{d x^{2}} + \frac{ω^{2} x^{2}}{2} + λ (t) x = H_{0} + λ (t) x,

(A23)

with the condition λ(0) = 0. The classical action is

(A24)

where C(t) does not depend on x or x′. The quantum propagator is

G (x, t | x', 0) ≃ \exp (i S (x, t | x', 0)),

(A25)

where “≃” indicates that we have not written the normalization factor. This factor does not depend on x or x′ and is at the moment unimportant. The thermal state for λ = 0 is

ρ_{β} ≃ \exp (- \frac{i ω}{2 \sin (i ω β)} ((y^{2} + {y^{'}}^{2}) \cos i ω β - 2 y y^{'})) .

(A26)

The time-evolved state at time-t is

ρ (t, x | x^{'}) = \iint G (x, t | y) ρ_{β} (y | y^{'}) G (y^{'} t | x^{'}) * d y d y^{'} .

(A27)

The energy at time-t, using λ(t) = 0, is

E (t) = \int d x H_{0, x} ρ (t, x | x^{'}) |_{x^{'} = x},

(A28)

with $H_{0, x} = - \frac{1}{2} \frac{d^{2}}{d x^{2}} + \frac{ω^{2} x^{2}}{2}$ . Define

I_{1} = \int_{0}^{y} λ (s) \frac{\sin ω s}{\sin ω t} d s

(A29)

I_{2} = \int_{0}^{t} λ (s) \frac{\sin (ω (t - s)}{\sin ω t} d s

(A30)

A = - i I_{1} + i I_{2} (\frac{\sin ω t}{\sin i ω β} - \frac{\sin (ω t + i ω β)}{\sin i ω β})

(A31)

A^{'} = i I_{1} + i I_{2} (\frac{\sin ω t}{\sin i ω β} + \frac{\sin (i ω β - ω t)}{\sin i ω β}) .

(A32)

The calculation of the double Gaussian integral in Equation (A27) gives

ρ (t, x | x^{'}) = \frac{1}{N (t)} \exp (- \frac{ω}{2} (x^{2} + {x^{'}}^{2}) \coth ω β + \frac{ω x x^{'}}{\sinh ω β} + A x + A^{'} x^{'}),

(A33)

where N(t) is the normalization factor

N (t) = \exp (\frac{1}{4} \frac{{(A + A^{'})}^{2}}{ω \coth ω β - \frac{ω}{\sinh ω β}}) \frac{\sqrt{2 π}}{\sqrt{2 (ω \coth ω β - \frac{ω}{\sinh ω β})}} .

(A34)

The action of the Hamiltonian on the propagated state is

H_{0, x} ρ (t, x | x^{'}) = (\frac{1}{2} ω \coth ω β - \frac{1}{2} {((- ω x \coth ω β + \frac{ω x^{'}}{\sinh ω β}) + A)}^{2} + \frac{ω^{2} x^{2}}{2}) ρ (t, x | x^{'}) .

(A35)

We define the variable X as

X = x - \frac{A + A^{'}}{2 (- ω \coth ω β + \frac{ω}{\sinh ω β})} .

(A36)

Then the energy of the propagated state at time t is

(A37)

and E(0) is the value of E(t) at t = 0, so that

E (t) - E (0) = \frac{1}{8} (\frac{{(A + A^{'})}^{2}}{{(\coth ω β - \frac{1}{\sinh ω β})}^{2}} - {(A - A^{'})}^{2}) .

(A38)

Finally using the values of A and A′ in terms of I₁ and I₂, we obtain

E (t) - E (0) = \frac{1}{2} (I_{1}^{2} + I_{2}^{2} + 2 I_{1} I_{2} \cos ω t) .

(A39)

This is independent of β and is positive. As a corollary, this result is valid if one propagates any eigenstate of the Hamiltonian H₀. One can also derive the classical energy

E (t) - E (0) = {〈 H (x (t | x_{0}, p_{0}), p (t | x_{0}, p_{0}), λ = 0) - H (x_{0}, p_{0}, λ = 0) 〉}_{ρ_{β}}_{(λ = 0)},

(A40)

where ρ_β(λ = 0) is the classical thermal state. One uses the equations of motion

x (t | x_{0}, p_{0}) = x_{0} \cos ω t + p_{0} \frac{\sin ω t}{ω} + ω \int_{0}^{t} λ (s) \sin ω (t - s) d s

(A41)

p (t | x_{0}, p_{0}) = - ω x_{0} \sin ω t + p_{0} \cos ω t + ω^{2} \int_{0}^{t} λ (s) \cos ω (t - s) d s

(A42)

ρ_{β} (λ = 0) = \frac{1}{N_{β}} \exp (- β (\frac{p^{2}}{2} + \frac{ω^{2} x^{2}}{2})),

(A43)

and then

E (t) - E (0) = \frac{1}{2} ω^{4} ({(\int_{0}^{t} λ (s) \cos ω (t - s) d s)}^{2} + {(\int_{0}^{t} λ (s) \sin ω (t - s) d s)}^{2}) .

(A44)

If λ(0) = 0 but λ(t) ≠ 0, one gets

(A45)

This can be negative, for example if λ(t) = t:

E (t) - E (0) = 1 - \frac{t^{2} ω^{2}}{2} - \cos ω t < 0.

(A46)

References and Notes

Landau, L.D.; Lifshitz, E.M. Statistical Physics; Pergamon Press: Oxford, UK, 1980. [Google Scholar]
Schulman, L.S.; Gaveau, B. Coarse grains: The emergence of space and order. Found. Phys 2001, 31, 713–731. [Google Scholar]
Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev 1957, 106, 620–630. [Google Scholar]
Jaynes, E.T. Information theory and statistical mechanics II. Phys. Rev 1957, 108, 171–190. [Google Scholar]
Gaveau, B.; Schulman, L.S. Master equation based formulation of non-equilibrium statistical mechanics. J. Math. Phys 1996, 37, 3897–3932. [Google Scholar]
Gaveau, B.; Schulman, L.S. A general framework for non-equilibrium phenomena: The master equation and its formal consequences. Phys. Lett. A 1997, 229, 347–353. [Google Scholar]
In [5] and [6] we found the relative entropy to be related to dissipation, but we there dealt with stochastic dynamics and our conclusions were limited to short times.
Esposito, M.; Lindenberg, K.; van den Broeck, C. Entropy production as correlation between system and reservoir. New J. Phys 2010, 12, 013013. [Google Scholar]
Reeb, D.; Wolf, M.M. (Im-)Proving Landauer’s Principle 2013, arXiv, 1306.4352v2.
Takara, K.; Hasegawa, H.H.; Driebe, D.J. Generalization of the second law for a transition between nonequilibrium states. Phys. Lett. A 2010, 375, 88–92. [Google Scholar]
Cover, T.M.; Thomas, J.A. Elements of Information Theory; Wiley: New York, NY, USA, 1991. [Google Scholar]
Henceforth we suppress the qualifier “inverse” in referring to various β’s as temperatures.
Partovi, M.H. Quantum thermodynamics. Phys. Lett. A 1989, 137, 440–444. [Google Scholar]
Brillouin, L. Science and Information Theory, 2nd ed.; Academic Press: New York, NY, USA, 1962. [Google Scholar]
Landauer, R. Irreversibility and heat generation in the computing process. IBM J. Res. Dev 1961, 5, 183–191. [Google Scholar]
Leff, H.S.; Rex, A.F. Maxwell’s Demon: Entropy, Information, Computing; Princeton University Press: Princeton, NJ, USA, 1990. [Google Scholar]
Schulman, L.S.; Gaveau, B. Ratcheting Up Energy by Means of Measurement. Phys. Rev. Lett 2006, 97, 240405. [Google Scholar]
Esposito, M.; van den Broeck, C. Second law and Landauer principle far from equilibrium. Europhys. Lett 2011, 95, 40004. [Google Scholar]
Kawai, R.; Parrondo, J.M.R.; Van den Broeck, C. Dissipation: The Phase-Space Perspective. Phys. Rev. Lett 2007, 98, 080602. [Google Scholar]
Jarzynski, C. Nonequilibrium Equality for Free Energy Differences. Phys. Rev. Lett 1997, 78, 2690–2693. [Google Scholar]

© 2014 by the authors; licensee MDPI, Basel, Switzerland This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

Share and Cite

MDPI and ACS Style

Gaveau, B.; Granger, L.; Moreau, M.; Schulman, L.S. Relative Entropy, Interaction Energy and the Nature of Dissipation. Entropy 2014, 16, 3173-3206. https://doi.org/10.3390/e16063173

AMA Style

Gaveau B, Granger L, Moreau M, Schulman LS. Relative Entropy, Interaction Energy and the Nature of Dissipation. Entropy. 2014; 16(6):3173-3206. https://doi.org/10.3390/e16063173

Chicago/Turabian Style

Gaveau, Bernard, Léo Granger, Michel Moreau, and Lawrence S. Schulman. 2014. "Relative Entropy, Interaction Energy and the Nature of Dissipation" Entropy 16, no. 6: 3173-3206. https://doi.org/10.3390/e16063173

APA Style

Gaveau, B., Granger, L., Moreau, M., & Schulman, L. S. (2014). Relative Entropy, Interaction Energy and the Nature of Dissipation. Entropy, 16(6), 3173-3206. https://doi.org/10.3390/e16063173

Article Menu

Relative Entropy, Interaction Energy and the Nature of Dissipation

Abstract

1. Introduction

2. Notations and Basic Identities

2.1. States and Entropy

2.2. The Basic Identity

2.3. Evolution Operators and Entropy

3. Two Systems in Interaction

3.1. Hypotheses

3.2. Relation between a State and Its Marginals

3.3. The Case Where A is Initially in a Thermal State

3.4. The Case of Equality in Equation (31)

3.5. Both Systems A and B are at Equilibrium

3.6. Case of Equality in (39) (2nd) and (41)

3.7. Interaction Energy and Relative Entropy

3.8. The Case βA = βB

4. Two Systems in Interaction With a Work Source

4.1. Hypotheses

4.2. Identities for the Work

4.3. Inequalities for the Work

4.4. The Case of Equalities in Equations (66) and (67)

4.5. Case Where A is not Initially in Thermal Equilibrium

5. A System Coupled Only to an External Work Source

5.1. Hypotheses

5.2. Identities for the Work

5.3. Inequalities for the Work

5.3.1. From Equation (83)

5.3.2. From Equation (86)

5.4. Relation to the Identity of Jarzynski

5.5. Effective Temperatures

5.6. A More Precise Expression for the Work

5.7. Upper Bounds on the Work Delivered by a System. Comparison of Equations (92) and (111)

5.8. The Case of Equalities in Equations (111) and (92)

5.8.1. Equality in Equation (111)

5.8.2. Equality in Equation (92)

5.9. The Case λ(U) = λ0

6. Relative Entropy, Energy Dissipation and Fourier’s Law

6.1. The Born Approximation

6.2. Two Interacting Systems

6.3. The Case Where Both Initial States are Thermal

6.3.1. Estimate of the Relative Entropy

6.3.2. Estimate of the Interaction Energy

7. Coarse Grained States

7.1. Definition

7.2. Examples of Coarse-graining Mappings

7.3. Coarse Graining and Relative Entropy

8. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

Appendixe

A. An Example of Trajectory-Independent Microscopic Work

B. An Exactly Solvable Model

C. Example: Forced Harmonic Oscillator

References and Notes

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.8. The Case β_A = β_B

5.9. The Case λ^(U) = λ₀