Foundations of Nonequilibrium Statistical Mechanics in Extended State Space

Gujrati, Purushottam Das

doi:10.3390/foundations3030030

Open AccessReview

Foundations of Nonequilibrium Statistical Mechanics in Extended State Space

by

Purushottam Das Gujrati

Department of Physics, School of Polymer Science and Polymer Engineering, The University of Akron, Akron, OH 44325, USA

Foundations 2023, 3(3), 419-548; https://doi.org/10.3390/foundations3030030

Submission received: 21 May 2023 / Revised: 10 August 2023 / Accepted: 14 August 2023 / Published: 23 August 2023

(This article belongs to the Section Physical Sciences)

Download

Browse Figures

Versions Notes

Abstract

:

The review provides a pedagogical but comprehensive introduction to the foundations of a recently proposed statistical mechanics (

μ

NEQT) of a stable nonequilibrium thermodynamic body, which may be either isolated or interacting. It is an extension of the well-established equilibrium statistical mechanics by considering microstates

\{m_{k}\}

in an extended state space in which macrostates (obtained by ensemble averaging

\hat{A}

) are uniquely specified so they share many properties of stable equilibrium macrostates. The extension requires an appropriate extended state space, three distinct infinitessimals

d_{α} = (d, d_{e}, d_{i})

operating on various quantities q during a process, and the concept of reduction. The mechanical process quantities (no stochasticity) like macrowork are given by

\hat{A} d_{α} q

, but the stochastic quantities

{\hat{C}}_{α} q

like macroheat emerge from the commutator

{\hat{C}}_{α}

of

d_{α}

and

\hat{A}

. Under the very common assumptions of quasi-additivity and quasi-independence, exchange microquantities

d_{e}

q

_{k}

such as exchange microwork and microheat become nonfluctuating over

\{m_{k}\}

as will be explained, a fact that does not seem to have been appreciated so far in diverse branches of modern statistical thermodynamics (fluctuation theorems, quantum thermodynamics, stochastic thermodynamics, etc.) that all use exchange quantities. In contrast, dq

_{k}

and

d_{i}

q

_{k}

are always fluctuating. There is no analog of the first law for a microstate as the latter is a purely mechanical construct. The second law emerges as a consequence of the stability of the system, and cannot be violated unless stability is abandoned. There is also an important thermodynamic identity

d_{i} Q \equiv d_{i} W

\geq 0

with important physical implications as it generalizes the well-known result of Count Rumford and the Gouy-Stodola theorem of classical thermodynamics. The

μ

NEQT has far-reaching consequences with new results, and presents a new understanding of thermodynamics even of an isolated system at the microstate level, which has been an unsolved problem. We end the review by applying it to three different problems of fundamental interest.

Keywords:

microstates; macrostates; nonequilibrium; system-intrisic and medium-intrisic quantities; internal variables; uniqueness; internal equilibrium; mechanical and stochastic quantities; fluctuating and nonfluctuating quantities; reduction; microfriction; fluctuation theorem; free expansion

1. Introduction

Thermodynamics is the study of physical systems in nature that eventually evolve in time to stationary macrostates, in which any disturbance generates restoring forces to bring them back to the stationary macrostates [1,2,3], which makes them stable macrostates, usually called equilibrium (EQ) macrostate

M_{eq}

, satisfying certain stability conditions. Any disturbance to modify macrostates of the system invariably results in nonequilibrium (NEQ) processes so that they abound in nature and obey the well-established second law [4,5,6,7]. The law is also obeyed by biological systems [8,9]. However, NEQ processes are not well-understood, as the corresponding thermodynamics (NEQT) is not yet fully developed, despite it having a long history of various competing schools [10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28], among which are the most widely known schools of local equilibrium thermodynamics, rational thermodynamics, extended thermodynamics, and GENERIC thermodynamics [21,29]. They mostly deal with the time evolution of macroscopic quantities only; the latter emerge as instantaneous averages over microstates in a more fundamental and statistical approach, and are used to characterize any thermodynamic process and the resulting nonnegative entropy generation

Δ_{i} S \geq 0

, as first proposed by Clausius [30,31]. In contrast, the equilibrium (EQ) thermodynamics (EQT) in which

Δ_{i} S \equiv 0

is based on the original ideas of Carnot, Clapeyron, Clausius, Thomson, Maxwell, and many others [3,12,32,33,34,35,36,37,38,39,40,41,42,43,44,45], and has by now been firmly established in statistical physics, thanks to Boltzmann [46,47] and Gibbs [48], who established that classical EQ thermodynamics is a direct consequence of the EQ statistical mechanics [33,34,37,38] that deals directly with microstates

\{m_{k}\}

of the Hamiltonian

H

of the system, and their equilibrium probabilities

\{p_{k}^{eq}\}

that together specify the EQ macrostate

M_{eq}

. In contrast, EQT deals directly with

M_{eq}

without any need to know

\{m_{k}\}

and

\{p_{k}^{eq}\}

.

In general, the collection

\{m_{k}, p_{k}\}

of microstates and their probabilities is used in a statistical description of a macrostate

M

of the system

Σ

that may be isolated or interacting with a medium

\tilde{Σ}

, as shown in Figure 1. The same microstate set

\{m_{k}\}

determines different macrostates depending on the probabilities

p_{k} = p_{k} (M)

with which

m_{k}

appears in

M

. As

H

is by definition deterministic,

m_{k}

is also deterministic. Thus, it is independent of

p_{k}

, but is specified by its energies

E_{k}

and the parameters defining

H

. Because of this,

\{m_{k}\}

and

\{E_{k}\}

are the same for any of its possible macrostates

M

including

M_{eq}

. This allows

\{E_{k}\}

to be treated as purely mechanical, which is then supplemented by

\{p_{k}\}

to add stochasticity to the mechanical system. Such a description has proven very useful in EQ statistical mechanics [33,34], where the concepts of the entropy

S_{eq} = S (M_{eq})

that was first introduced by Clausius [30,31] as a state function of

M_{eq}

, and the temperature T are the new concepts that play a central role in the resulting EQ thermodynamics of

Σ

. As such, it is very common to use them to distinguish a thermodynamic system from a mechanical system by recognizing that the concept of heat (a consequence of a particular commutator as described later) is novel to thermodynamics but is not applicable to a mechanical system, which is traditionally taken to be described by a purely conservative Hamiltonian

H

. We use

X ≐ (N, E, V, \dots)

to collectively denote the number of particles N, their energy E, the volume V occupied by them, etc., as representing the common thermodynamic extensive state variables that determine

M_{eq} \equiv M (X)

in the state space

S_{X}

spanned by

X

. We call them observables. As observables, these variables can be controlled from the outside of the system. We will allow

X = X (t)

to have time dependence in this work; here, t denotes the time. For the moment, we suppress the suffix “eq” for notational simplicity unless necessary as we are dealing with

S_{X}

. To be useful, S and T must uniquely refer to the thermodynamic state

M (X)

. This unique relationship is what is meant by S being a state function of

X

, which when inverted gives E a state function of

ζ ≐ (S, w), w ≐ X ∖ E

, where

∖ E

stands for deleting E from the set preceding it. Being functions of

M

of

Σ

, S and T, an intensive field, must be interrelated in some fashion such as

1 / T = \partial S / \partial E,

(1)

(see Equation (129)) so only one of them can be treated as a primitive concept, which we take to be the entropy. The goal of NEQT is to then specify it in terms of

X

in the state space

S_{X}

. In this respect, having S a state function considerably simplifies the study as we then deal with

M_{eq}

. When this cannot be done, we must go beyond

S_{X}

to an extended state space in which the NEQ entropy also becomes a state function, which is the central theme of this review. In this space, a uniform global temperature of the body is defined as its unique field by the above derivative in the extended state space. Thus, our goal will be to identify the NEQ entropy in this space.

Although S plays important roles in diverse fields ranging from classical thermodynamics of Clausius [3,10,12,13,17,20,24,25,30,33,39,40,41,42,43,44,46,47,48,49,50,51,52,53,54,55,56,57,58], quantum mechanics and uncertainty [59,60,61], black holes [62,63,64], coding and computation [65,66,67], to information technology [68,69,70,71,72], it does not seem to have a standard definition in all cases, even though it is well-defined under EQ conditions, as extensively discussed in the literature; see, for example, [46,47,48,73,74,75,76,77,78]. As

S_{eq}

is uniquely determined by

\{p_{k}^{eq}\}

as a state function,

p_{k}^{eq}

’s must be unique functions in

S_{X}

, as is well-known [33]. Requiring this uniqueness will be a guiding force in our endeavor to formulate the NEQ statistical mechanics. Whether S has any physical significance in a NEQ macrostate

M

has been a topic of extensive debate; see for example [73,74,75,76,77] and references therein. The problem arises because it is not clear if, and how,

M

can be uniquely identified. Because of the lack of uniqueness, introducing

S (M)

as a state function becomes nontrivial. The same concern also applies to

\{p_{k}\}

.

Recently, we have been able to extend the classical concept of Clausius entropy from EQ states to NEQ states where irreversible entropy is generated [75,76,77,78]. That approach is an outgrowth of an earlier review [79] in this journal about a possible source of stochasticity that is required in a thermodynamic system, even though its mechanics is completely deterministic due to its Hamiltonian dynamics so that heat and temperature have no mechanical analogs. Not appreciating that the source of stochasticity is independent from the deterministic (mechanical) aspect has been a source of bitter debate between Boltzmann, Zermelo, Poincare, and many others ([45,56,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96], and references cited in there). The dispute required Boltzmann to propose the ideas of molecular chaos and of the ergodicity hypothesis [91] that have played a major role in EQ statistical mechanics. We discuss these important ansatze in [92,93,94,95] with an emphasis on Kac’s ring model [97,98] in more detail, where we find that they are not fulfilled in a deterministic dynamics. We infer, as is commonly believed, that one needs a stochastic dynamics for the ansatz to be satisfied. Both these ideas can only be supported by a stochastic dynamics as discussed in these references [99,100,101,102].

It is clear that we need to supplement a purely mechanical approach by supplementing it with stochasticity. We accomplish treating both aspects separately but unifying them together and enabling uniqueness by using an extended state space

S_{Z}

spanned by extensive state variables (compactly denoted by

Z = Z (t) = X (t) \cup ξ (t)

as an extension of

X (t)

in this review) to obtain a state function S. In general,

Z

includes the observables but possibly some more independent variables, compactly denoted by

ξ (t)

required for an NEQ situation, as will become clear later. The additional state variable

ξ

, when properly chosen as will be described later, allows for a unique description of the macrostate

M (Z)

in

S_{Z}

. Once such a state space has been uncovered for

M (Z)

, its entropy

S (M)

also become a state function

S (Z)

in

S_{Z}

. This again requires its

p_{k}

’s to be unique functions in

S_{Z}

, just as

p_{k}^{eq}

are in

S_{X}

. Thus, the identification of an appropriate

S_{Z}

immediately solves the problem of obtaining a unique statistical mechanics of an NEQ system as it directly leads to

p_{k}

as a unique function of

m_{k}

and

M (Z)

in

S_{Z}

.

In order for such an approach to work, and in particular for

S (Z)

, which itself is a system quantity, it is crucial that we deal with only system-intrinsic (SI) quantities (they are determined by the system), and not medium-intrinsic (MI) quantities (they are primarily determined by the medium) for the simple reason that utilizing

\{m_{k}\}

requires their specification by the Hamiltonian of the system and so require SI-quantities for its specification. (We will use body to refer to

Σ, \tilde{Σ}

, and

Σ_{0}

, and BI-quantities to refer to quantities of a body.) As will become clear in the following, these quantities capture the internal processes going on within the system. They cannot be fully captured by the MI-quantities, even though they have been traditionally used in thermodynamics, for the simple reason that they retain the memory of the medium and can depend on the system only weakly. Thus, they will require additional steps to study internal processes. There has been a long debate about the relevance and significance of the two kinds of quantities that ensued from a very different perspective [103,104,105,106,107], but did not capture the importance these quantities acquire in our approach.

The SI-quantities allow us to develop our NEQ statistical mechanics, which for brevity is identified as the

μ

NEQT, with

μ

referring to the microstates

\{m_{k}\}

, in which we directly capture internal processes that are responsible for irreversibility. As the collection

\{m_{k}\}

is the central object in the

μ

NEQT, the latter deals with quantities such as

\{E_{k}, p_{k}\}

. At the microstate level, there are fluctuations that are essential in a statistical treatment, and are properly captured in the

μ

NEQT through the fluctuations in

E_{k}

and

p_{k}

over

m_{k}

. In contrast, the use of the MI-quantities does not directly describe

\{m_{k}\}

so it cannot properly yield a statistical mechanical description of an NEQ process in a system. This is one of our most important conclusions. In particular, an important consequence of the

μ

NEQT as will be shown later is that MI-quantities, after reduction (being averaged over the microstates of the medium) under commonly accepted conditions of quasi-additivity and quasi-independence, do not exhibit any fluctuations. This explains why they are not suitable in developing the statistical mechanics. We call the resulting version of the microstate NEQT the

\overset{˚}{μ}

NEQT; the circle on

μ

is a reminder for the use of “exchange” microquantities derived from the MI-quantities in its formulation. The most prominent are the exchange (also called external) microwork

d_{e} W_{k} = d_{e} W, \forall k

, and the exchange (also called external) microheat

d_{e} Q_{k} = d_{e} Q, \forall k

, thus, explicitly exhibiting that they have no fluctuations. Because of this, it does not directly capture internal processes at the microstate level, which require additional steps to describe irreversibility as mentioned above. The corresponding macroscopic NEQT from the two approaches are called the MNEQT and the

\overset{˚}{M} NEQT

, respectively; here, M stands for the macroscopic description in terms of macrostates, the circle again having the same connotation as above. There are no fluctuations in these theories, as is well-known. The

\overset{˚}{M} NEQT

is the standard formulation of classical thermodynamics and has been discussed extensively by many prominent scientists [13,18,33,39,41,42,51,108], some including internal variables that play an important role in our approach.

It should be obvious from the above discussion that we need to make a clear distinction between fluctuating (Fl) and nonfluctuating (NFl) quantities. In addition, we also recognize that there are many other macrostates in

S_{Z}

for which neither S nor the corresponding

p_{k}

’s are unique functions in

S_{Z}

, so S must be treated independently of

Z

. Our previous work did not consider such states, but they will be considered in this review. For this purpose, we will find it convenient to introduce the following state variable sets:

S, E, w ≐ X ∖ E, W ≐ Z ∖ E, Z, ζ = (S, W), χ = (S, Z),

(2)

and the corresponding state spaces

S_{S}, S_{E}, S_{W}, S_{Z}, S_{ζ} ≐ S_{S} \cup S_{W}

, and

S_{\emptyset} ≐ S_{S} \cup S_{Z}

, where the suffix denotes the variable set forming the state space.

We should emphasize that internal variables also appear in mechanical systems. A simple example is that of two particles in a system, whose interior is hidden in the lab from us so that we cannot see where the particles are inside the system. From outside the system, we can only be aware of the position of the center of mass by observing its motion in the lab. However, there is no way to determine their separation within the system. This separation and the corresponding relative motion are examples of the internal variable and its motion, and play a role in the dynamics of the mechanical system. Thus, it should not come as a surprise that such internal variables will also be relevant in a thermodynamic system. Indeed, we will see later in Section 14 that this relative motion becomes the source of “microfriction”, resulting in friction, when we treat the system in thermodynamics.

To appreciate at a more fundamental level the distinction between a mechanical and a thermodynamic system, we first realize that both systems are usually separated from their surroundings

\tilde{Σ}

by some clear partitions, the most common being the walls between them; see Figure 1. We collectively call them containers or walls that contain the system [109]. In this review, we find it convenient to not include the container as part of the system, but use it to determine the boundary conditions for the equations of motion or as defining parameters in the Hamiltonian

H

of the system

Σ

. As

H

plays the role of E, the parameters

w

and

W

are obtained by taking out E from

X

and

Z

, respectively:

w = (V, \dots), W = (V, \dots, ξ);

(3)

where ⋯ refers to the rest of the elements in

X

besides V [110]. As will become evident below, these parameters denote the work parameter in the Hamiltonian, which we will denote by

H (x| w)

or

H (x| W)

, respectively. For simplicity in the following, we will always use

W

for the work parameter to refer to both cases and express the Hamiltonian as

H (x| W)

. The parameter can be varied in a process with a concomitant change in

H (x| W)

due to the work done by the system. This is in accordance with the work–energy theorem of mechanics that states that the change in the energy is due to work alone. Also,

x = x (t) ≐ \{r (t), p (t)\}

is the dynamical variable and denotes the collection of coordinates

\{r (t)\}

and momenta

\{p (t)\}

of the N particles in the phase space

Γ (x |W)

of

Σ

[110]. As internal variables play no role in EQ,

W = w

in EQ. For any

x \in Γ (x |W)

, the deterministic energy of

Σ

in the state specified by

x

is

E_{x} (W) = H (x| W)

, which need not be constant. However, there is no stochasticity so there is no concept of heat. Thus, the Hamiltonian itself cannot explain the fundamental difference between the two systems [79].

We elaborate further. Mechanics is a branch of the physical science to study the deterministic behavior of the system in the presence of known forces and radiation in time. The central concept is that of energy whose changes are governed by deterministic Hamiltonian equation of motion in

Γ (x |w)

with deterministic boundary conditions such as at the walls confining the system

Σ

(see Figure 1a) that generate deterministic wall potentials acting on the particles. Accordingly, a point

x (t) \in Γ (x |w)

is uniquely determined by

x (t_{0})

at some reference time

t_{0}

. A central aspect of the equation is that it uniquely determines the properties of the system in the future (

t > t_{0}

) as well as in the past (

t < t_{0}

) [111,112]. We will assume in this review that the fundamental weak nuclear forces are not included in our discussion [113]. A movie of such a deterministic process in the future, when run backward for the past, will appear just as natural with no hint of the direction of the time flow. Thus, starting from

x

, which also identifies a microstate [110], at t, the state undergoes a unique state transformation

x ⟼ x^{'}

(4)

in the same interval

Δ t

for any

Δ t

. If we now consider an ensemble of the same mechanical system, each prepared in the state

x

at

t = 0

, then at

t = Δ t

, each system will be in the same state

x^{'}

. In the language of probability theory [114], we say that

x^{'}

follows with certainty from

x

in

Δ t ≷ 0

. (This will be useful later to associate the concept of a constant entropy to a mechanical system but not heat.)

But the above invariance is contrary to our daily experience as a rule [115,116,117,118,119,120,121,122,123,124]. For example, the initial state

x_{0}

may be when all the gas particles are confined to a small portion of the container [109] located at the center of the container. We are not interested in particle momenta. As the gas expands spontaneously, it occupies the entire volume uniformly. However, once the gas has occupied the entire volume in the state

x_{1}

, the reverse evolution is not seen in nature. Similarly, the cream mixed in a cup of coffee does not ever unmix on its own. The smoke from a burning piece of wood only spreads out in the room, but never confines itself on its own. If we run the movies in any of these cases backward, we immediately realize that the backward movies do not represent physical phenomena that are consistent with our daily experience.

This lack of time-reversal invariance of the equations of motion is a natural fact of daily life where we deal with macroscopic systems [125] that eventually evolve in time to

M_{eq}

. This is at the root of the second law of thermodynamics, and can be easily explained as follows. It happens here that each member of the above ensemble that was initially prepared in the same state

x

evolves during a fixed interval

Δ t

into different states

x^{(n)} = x^{'},

x^{″},

$x^{‴}$

, \dots

for different

n = 1, 2, 3, \dots

x ⟶ x^{(n)}, n = 1, 2, 3, \dots,

(5)

then the certainty implied in Equation (4) is lost so that most often it would happen that the states of different members after

Δ t

would have no discernible pattern for

x^{(n)}

and appear haphazard for the members. The result is ([126], pp. 1–14) a loss of physical determinism [127]. Thus, the mapping

x ⤏ x^{'}

(6)

in (5) between

x

and one of its evolved states

x^{'}

is one-to-many, and the mapping becomes unpredictable, i.e., stochastic [114]. One possible explanation of the loss of certainty at the level of states lies in the presence of stochasticity in the system due to the uncontrollable interactions with the surroundings, as discussed elsewhere [79] and elaborated later in Section 7. This is the foundations of classical probability theory by Laplace, and used to formulate the idea of density matrix by Landau [59,128] and von Neumann [129]. In this case, the mapping (6) cannot be reversed, and we cannot perform time-reversal of the evolution anymore. It is the success of a probabilistic approach to nonequilibrium thermodynamics that prompted Maxwell [50] and Boltzmann [130,131] to promote the “ergodic hypothesis” to achieve EQ. One of our aims in this review is to follow the consequences of this stochasticity in the dynamics such as in the Brownian motion [132] and Langevin’s equation [133], and extend the concept of ergodicity to a special class of NEQ states [134] that has been identified as internal equilibrium states; see Definition 9.

1.1. Scope of the Review

It should be obvious that the scope of the NEQ statistical mechanics, the

μ

NEQT, is more general than that of the equilibrium statistical mechanics, to be denoted here simply as the

μ

EQT in short, in that the attempts are now mostly to deal with the most general time evolution of microscopic quantities in the former. The instantaneous averages of these quantities over microstates

\{m_{k}\}

are used to specify the instantaneous macrostates

M

required to characterize any thermodynamic process

P

in time in the MNEQT. Thus, the tasks in the

μ

NEQT and the MNEQT are more difficult and their foundations less developed, which justifies the motivation of this review. The exception is the validity of the first law in terms of exchange (or external) work and heat between

Σ

and

\tilde{Σ}

in thermodynamics, which plays a central role in the

\overset{˚}{M} NEQT

. These MI-quantities are determined uniquely by

\tilde{Σ}

regardless of

P

being reversible or irreversible, and are easily identified under generally acceptable conditions such as

\tilde{Σ}

always being in EQ, quasi-additivity and quasi-independence; see later. Some of the approaches in the

\overset{˚}{M} NEQT

employ the enlarged state space

S_{Z}

[18,21,42,108]. Being associated with an EQ

\tilde{Σ}

, the MI-quantities including the exchange (or external) entropy carry no information about irreversibilities going on within the system. In contrast, the MNEQT based on the use of the SI-quantities include, by definition, these irreversible contributions so they are directly obtainable. One of our goals, besides laying down the foundations of the

μ

NEQT, is to justify the MNEQT from the

μ

NEQT.

A system in EQ always has its observables uniformly distributed throughout the system so it is uniform in

S_{X}

[33]. In contrast, an NEQ system is not uniform and requires additional information about the nonuniformity to uniquely specify its states, which is provided by a proper choice of internal variables in

ξ

. The set

ξ

allows us to treat

Σ

as uniform in the state space

S_{Z}

(see Section 5.7) so that there is a unique thermodynamic temperature and other fields for the entire system even though it is still nonuniform in

S_{X}

. This is very useful to obtain a proper thermodynamics of the system. For example, the single thermodynamic temperature T even for a nonuniform system satisfies Clausius’s theorem that heat flows from hot to cold. This is what makes the

μ

NEQT in the extended space

S_{Z}

so useful and desirable.

Various microquantities associated with

Σ

(having microstates

\{m_{k}\}

),

\tilde{Σ}

(having microstates

\{{\tilde{m}}_{\tilde{k}}\}

), and

Σ_{0}

(having microstates

\{m_{0 k_{0}}\}

) carry the suffix

k, \tilde{k}

, and

k_{0}

, respectively. However, we are only interested in microquantities associated with

\{m_{k}\}

as our focus is on

Σ

. This means that microquantities of

\tilde{Σ}

and

Σ_{0}

must be manipulated so that they can be associated with

\{m_{k}\}

. To accomplish this, we introduce the principle of reduction, which accounts for the correlation introduced by mutual interactions between

Σ

and

\tilde{Σ}

. Under commonly accepted conditions about

\tilde{Σ}

, the principle shows that the effect of

\tilde{Σ}

on

Σ

can be incorporated by treating its microquantities in the form of exchange (or external) quantities having no fluctuations. This is what makes the MI-quantities play such an important role in classical thermodynamics, but makes them unsuitable to extract fluctuations in a statistical theory.

Our goal here is to provide a comprehensive and self-contained introduction to our recently developed NEQ statistical mechanics (

μ

NEQT), in which we study deterministic time evolution of individual microstates in

\{m_{k}\}

along Hamiltonian trajectories in

\{γ_{k}\}

during

P

. When quantities associated with these trajectories are averaged over them using their probabilities, the result is the MNEQT, an extension of the equilibrium thermodynamics to describe NEQ processes. This consistency with the MNEQT is not only a check on the validity of the

μ

NEQT, but also a justification of the MNEQT by the

μ

NEQT. The use of the SI-quantities in the

μ

NEQT allows for directly obtaining quantities such as

Δ_{i} S

after averaging. Thus, the

μ

NEQT is an extension of the EQ statistical mechanics [33,34], the

μ

EQT, that was originally developed by Boltzmann [46,47] and Gibbs [48], and limited to

Δ_{i} S \equiv 0

.

We will follow deterministic trajectories

\{γ_{k}\}

during

P

between two macrostates

M_{in}

and

M_{fin}

. Only the latter determine the trajectories so they are the same for all processes

P

between them. This makes

\{γ_{k}\}

independent of the trajectory probabilities

\{p_{γ_{k}}\}

controlling various

P

’s, which is similar to

\{m_{k}\}

being independent of the microstate probabilities

\{p_{k}\}

. The extended state space

S_{Z}

is chosen appropriately to uniquely specify

\{m_{k}\}

and

\{γ_{k}\}

in it. This uniqueness is an important aspect of the

μ

NEQT and the MNEQT as it is missing in other contemporary NEQT theories [10,12,13,17,18,19,20,21,24,25,26,27,28,99,135,136,137,138,139,140,141,142,143,144,145,146,147]. The instantaneous

E_{k}

along

γ_{k}

can only change mechanically due to the variation in

W

. This variation is responsible for the net change

\{Δ E_{k}\}

along

\{γ_{k}\}

, and is only determined by

M_{in}

and

M_{fin}

and not by

\{p_{γ_{k}}\}

as noted above. To complete the formulation of the

μ

NEQT, we determine the unique

\{p_{γ_{k}} (P)\}

for any

P

in

S_{Z}

, which is another exceptional aspect of the

μ

NEQT. This way, the deterministic aspect of a process (the mechanical work) has been separated from the stochastic aspect (the heat) in thermodynamics in a unique way in the

μ

NEQT for any

P

, NEQ or not. With the unique probabilities in hand, all calculation can be carried out exactly in the

μ

NEQT, once

S_{Z}

has been identified. In the

\overset{˚}{μ}

NEQT, the trajectory probabilities need to be determined using additional steps such as using the master equation [54], Fokker–Planck equation [37,102], etc., which are phenomenological.

Being deterministic, microquantities associated with

\{m_{k}\}

or

\{γ_{k}\}

are not constrained by the second law, which is a macroscopic law based on stochasticity. This is not surprising, as the Hamiltonian dynamics has nothing to say about the second law. For the MNEQT, we need to determine various thermodynamic averages over

\{γ_{k}\}

using

\{p_{γ_{k}}\}

. Thus, the development of the

μ

NEQT is carried out in two independent stages. First we determine mechanical quantities as if the system is a mechanical one following Hamiltonian dynamics. Its stochastic aspects are captured by

\{p_{γ_{k}} (P)\}

, which determine not only mechanical averages such as work but also the stochastic averages such as heat and entropy. It is the latter that finds itself manifested in the second law for appropriate choices of

\{p_{k}\}

and

\{p_{γ_{k}}\}

. By simply modifying the second stage, we are able to investigate the catastrophic consequences of violating the second law. This proves the usefulness of our approach. With

\{Δ E_{k}\}

and

\{p_{γ_{k}} (P)\}

in hand, we now have a complete NEQ statistical mechanics to describe any process

P

. The division in the two distinct and independent stages is of central importance to the

μ

NEQT and the MNEQT [148,149,150,151,152,153,154,155,156,157].

We have successfully applied the

μ

NEQT recently to study free expansion [154], to provide a correct application of microwork and microheat [155,156] in the various modern fluctuation theorems [26,158,159], and to describe viscous dissipation [157] associated with the dynamics of a Brownian particle (BP) [115,132,133,140] in its medium by developing an alternative to the stochastic Langevin description [38,99]. The above applications clearly show the usefulness of the

μ

NEQT. However, our previous studies were mostly limited to microworks; microheats were not treated as extensively. One of our major incentives here is to overcome this limitation to determine the

μ

NEQT for which the central requirement is the unique microstate probability

p_{k}

in the state space

S_{Z}

. This ensures that

M (Z)

and

S (Z)

are uniquely identified in

S_{Z}

. Such macrostates are said to be in internal equilibrium (IEQ) in

S_{Z}

and written as

M_{ieq}

or

M (Z)

, as opposed to EQ macrostates

M_{eq} = M (X)

in

S_{X}

. The unique entropy

S (Z)

has the maximum possible value for a given

Z

so it has no memory of where the microstate has come from. Once

M

becomes uniquely specified as

M (Z)

in

S_{Z}

, it satisfies the extension of the ergodic hypothesis for

M_{ieq}

; see Section 14 for an example.

But the applications so far of the

μ

NEQT have provided only a piecewise and incomplete description of the

μ

NEQT [148,149,150,151,152,153,154,155,156,157] that was restricted in scope to highlight its NEQ aspects in the limited context. This comprehensive review aims to overcome this limitation and provide a complete introduction to the foundation of the

μ

NEQT by assimilating and extending together the previous results and by including missing details and newer aspects that emerge from the use of the SI-quantities in the extended state space

S_{Z}

, where

m_{k}

and

M

are uniquely specified in an IEQ macrostate

M_{ieq}

just as they are uniquely specified for an EQ macrostate

M_{eq}

in the EQ state space

S_{X}

. The

μ

NEQT has met with success, as we will describe in this review, so it is desirable to introduce it to a wider class of readers.

Due to its microscopic SI-nature, the

μ

NEQT provides a more detailed description of fluctuations in a thermodynamic process that are hidden in the MNEQT. For this reason, therefore, the former is highly desirable from both a theoretical and experimental point of view. It is an extension of the MNEQT [77,78,134,148,149,152,153,160] to the microstate level, which brings about a very close parallel with

μ

EQT [32,33,34,36].

A microstate

m_{k}, k = 1, 2, \dots

, carries an index k; the set

\{m_{k}\}

forms a countable set and is specified by its energy set

\{E_{k} (W)\}

; however, we will usually suppress

W

in

m_{k}

and

E_{k} (W)

, unless necessary. In a macrostate

M

,

m_{k}

’s appear with a probability

p_{k} (M)

; see Section 7 for details. For simplicity, we will also not explicitly show the argument

M

in

p_{k}

; the dependence is always implicit. In the rest of the review, all quantities pertaining to

M

are identified as macroquantities, while those pertaining to

m_{k}

are identified as microquantities that always have the microstate index k of

m_{k}

or of

x_{k}

in

H (x_{k}| W)

; see Definition 4. After statistical averaging over microstates using their probabilities

p_{k}

(see Equation (12) for its proper definition), we obtain quantities without k or

x

.

A microquantity associated with

m_{k}

will always carry the index k (see later). A macrostate

M

and a macroquantity associated with it do not carry the index k so it is always easy to distinguish the two kinds of quantities. We will continue to use “quantity” to stand for both microquantity and macroquantity, unless clarity is needed.

1.2. System-Intrinsic and Medium-Intrinsic Thermodynamics

As the medium is always taken to be in EQ, its properties do not change even if the system is out of equilibrium. This has made the choice of MI-description (

\overset{˚}{M} NEQT

) very convenient to formulate classical thermodynamics [13,18,33,39,41,42,51,108], in which one uses the exchange macroheat

Δ_{e} Q = T_{0} Δ_{e} S

in terms of the exchange entropy (see Equation (46)) and the exchange macrowork

Δ_{e} W

(see Equation (135c)) such as

Δ_{e} W = P_{0} Δ_{e} V = P_{0} Δ V

for the PV-macrowork; see Equation (94) for the first law as an example. Here,

T_{0}

and

P_{0}

are the temperature and pressure of the medium (see Figure 1), which remain the same for all possible states of the system. This has made the

\overset{˚}{M} NEQT

a highly desirable thermodynamic theory as it is applicable in all cases. The main problem with this theory is that it is not directly applicable to an isolated system in Figure 1b for which exchange quantities are identically zero, but which provides the most cogent formulation of the second law

Δ S_{0} \geq 0

; see Equation (213) in Proposition 3. It is useful only for an interacting system in Figure 1a for which the second law is stated indirectly in terms of irreversible entropy generation

Δ_{i} S \geq 0

; see Equation (67c). Indeed, all irreversible quantities including irreversible macrowork are indirectly determined.

In contrast, the MNEQT provides an SI-description involving quantities associated with the system alone so it is applicable to both systems in Figure 1 by explicitly taking into account the EQ properties of the medium, when it is present. All irreversible quantities including macroworks and macroheats are contained in this approach so they are determined directly in the MNEQT.

We elaborate on the distinction between the MNEQT and the

\overset{˚}{M} NEQT

. The exchange quantities

d_{e} Z

require the system

Σ

to be embedded in a medium

\tilde{Σ}

(see Figure 1a) and are controlled by

\tilde{Σ}

[154] so that

d_{e} Z = - d_{e} \tilde{Z}

(see Section 2) and are easy to handle and measure, as

\tilde{Σ}

is normally taken to be in equilibrium with no irreversibility (

d_{i} \tilde{Z} = 0

) so that

d_{e} \tilde{Z} = d \tilde{Z}

. Thus, the exchange quantities do not directly provide any information about

d_{i} Z

and any irreversibility as mentioned above. As an example, the lost macrowork due to irreversibility in the

\overset{˚}{M} NEQT

is defined as

{\overset{˚}{d}}_{lost} W = {\overset{˚}{d}}_{rev} W - {\overset{˚}{d}}_{irr} W \geq 0,

where various

{\overset{˚}{d}}_{rev} W

and

{\overset{˚}{d}}_{irr} W

refer to the exchange macroworks along two distinct processes: a reversible and an irreversible. We have used a new notation

\overset{˚}{d}

to ensure that any

\overset{˚}{d} W

is not confused with

d W

in the MNEQT. It is easy to see that

{\overset{˚}{d}}_{lost} W

is precisely the irreversible macrowork

d_{i} W

, which is determined by the actual process.

Similar distinctions can also be noted between the

\overset{˚}{μ}

NEQT and the

μ

NEQT; they differ at least in the following important ways, with sweeping consequences, as we will see:

A.: The internal microwork $Δ_{i} W_{k}$ has no analog in the former because it uses the following questionable conjecture:

$Δ_{e} W_{k} \overset{?}{=} - Δ E_{k},$

(7)

(see Section 15) which is often used in fluctuation theorems [99,135,136,137,138,139,140,141,142,143,144,145,146,147]; the use of $\overset{?}{=}$ is a reminder of its possible questionable nature, which is justified later in Theorem 7. In these fluctuation theorems, one begins with the conventional form of the first law $d E = d_{e} Q - d_{e} W$ in terms of exchange macroquantities, but identifies

$d_{e} W \overset{?}{=} - \sum_{k} p_{k} d E_{k}, d_{e} Q \overset{?}{=} \sum_{k} E_{k} d p_{k} .$

As a consequence of the above identification, no distinction can be made between fluctuating microwork

$d W_{k} \equiv - d E_{k},$

which is an identity in accordance with the work–energy theorem (see Theorem 6) and nonfluctuating exchange microwork

$d_{e} W_{k} = d_{e} W, \forall k;$

see Theorem 7. The distinction is always maintained in the latter, in which we also show (see Section 10.1) why the above identification cannot be rigorously justified. Similar conclusions as above are obtained by replacing infinitesimal $d_{α}$ by accumulation $Δ_{α}$ , properly defined in Section 13 along a process $P$ .
B.: Consequently, the microforce imbalance ( $μ$ FI) that results in fluctuating $Δ_{i} W_{k} = - Δ_{i} E_{k}$ , a ubiquitous quantity, is absent in the former in that $Δ_{i} W_{k} = Δ W_{k} - Δ_{e} W_{k} \equiv 0$ but is always present ( $Δ_{i} W_{k} \neq 0$ ) in the latter.
C.: The former results in a first law of thermodynamics ( $Δ E_{k} = Δ_{e} Q_{k} - Δ_{e} W_{k}$ ) for each $m_{k}$ , while the latter has it hold ( $Δ E = Δ_{e} Q - Δ_{e} W$ ) only for a $M$ ; however, see Equation (243).
D.: The lost or dissipated macrowork $Δ_{lost} W$ measured by the average $Δ_{i} W_{k}$ should be absent in the former due to its above conjecture, but is always present in the latter.
E.: The exchange microwork $Δ_{e} W_{k}$ depends on the entire trajectory $γ_{k}$ in the former to make it fluctuating over $γ_{k}$ , while in the latter, $Δ W_{k}$ depends only on the terminal microstates of $γ_{k}$ , and $Δ_{e} W_{k} \equiv Δ_{e} W$ is nonfluctuating (it is the same for all $γ_{k}$ ’s).

1.3. Main Results

The review emphasizes the very close parallel with EQ statistical mechanics (

μ

EQT) that is clearly seen in the microstate probabilities and the existence of IEQ partition functions for

M_{ieq}

. There are also major differences mainly in new concepts, some of which are very counter-intuitive, such as ubiquitous

d_{i} E_{k}

, microforce imbalance (

μ

FI) and internal microwork

d_{i} W_{k}

resulting from it, etc., for any macrostates including

M_{eq}

that have not been appreciated so far. They have been introduced previously [77,78,150,156,157] but now receive detailed explanation here. For example, it is a well-known fact that

d_{i} E = 0

[12] (see Equation (53a)) for any

M

; yet

d_{i} E_{k}

is fluctuating and so can be different from zero, its average. The presentation here is simple enough to reach even an untrained reader. To accomplish this goal, we only focus on some examples borrowed from undergraduate physics so that a reader will not be lost; however, it does require an open mind to learn new concepts that are counter-intuitive and perplexing, as it is very hard to shake off old preconceptions.

Remark 1.

As μEQT only deals with EQ processes, the second law plays no role here. However, the situation in the μNEQT is different, where we deal with NEQ processes. As the second law does not operate at the microstate level, our development of the μNEQT is not limited by this law. To make contact with thermodynamics, however, we will have to impose it at the level of macrostate. By investigating the internal inconsistencies that emerge if the second law is violated, we are able to conclude that the law cannot violated for a stable system. This is one of the most important benefits of our approach.

Throughout this review, we work in the enlarged state space

S_{Z}

so we include at least one internal variable

ξ

as a prototype to make our discussion more realistic, as will become clear in Section 4 and Section 14. The main emphasis here will be to demonstrate the ubiquitous nature of internal changes such as

d_{i} E_{k}

, a new concept whose existence has not been previously appreciated in various fluctuation theorems [26,158,159]. Not recognizing its existence has resulted in the conjecture

d_{e} W_{k} = - d E_{k} = -

d_{e} E_{k}

(see Equation (7)), used extensively in the

\overset{˚}{μ}

NEQT. This is contrary to a central result of the

μ

NEQT; see Theorem 6. It is the microforce imbalance (

μ

FI) between the internal and external microforces, a hitherto unrecognized purely mechanical concept at the microstate level in EQ and NEQ thermodynamics, that generates

d_{i} E_{k}

and is present in all processes, whether they are thermodynamic or not, as we will demonstrate. This is the most important outcome of the our approach; see Proposition 2. It emphasizes the importance of SI-quantities (such as in

d E_{k} = - d W_{k}

) that are very different from the MI-quantities (such as in

d_{e} E_{k} = d {\tilde{W}}_{k}

) for any

γ_{k}

, even if the trajectory belongs to a reversible process. The use of generalized work

d W = - d E_{m}

in Equation (234a) as isentropic change allows us to calculate microscopic work (microwork)

d W_{k}

, which changes

E_{k}

but not

p_{k}

. This is because

m_{k}

, whose concept is independent of

p_{k}

, uniquely determines

E_{k}

for a fixed work set

W

; see Definition 5. Therefore,

d E_{k}

is uniquely determined by

d W

and does not have any contribution from the change in

p_{k}

. On the other hand, the generalized heat

d Q

allows us to introduce microscopic heat (microheat)

d Q_{k}

, which does not change

E_{k}

but changes

p_{k}

. The above mutually exclusive nature of

d W_{k}

and

d Q_{k}

proves to be a great simplification and allows us to treat

d W_{k}

and

d Q_{k}

as purely a mechanical and a stochastic concept, respectively, in the development of the

μ

NEQT. In addition, as

d E_{k}

does not have any contribution from

d p_{k}

, it has no microheat contribution, so there is no first law for

m_{k}

in the

μ

NEQT.

As

E_{k}

is fluctuating,

d E_{k} = - d W_{k}

is also fluctuating and is uniquely determined as

d E_{k} ≐ E_{k} (W + d W) - E_{k} (W)

for

m_{k}

; the (slow or fast) nature of the process is irrelevant. The latter only controls

p_{k}

. This provides a simplification in evaluating the cumulative change

Δ W_{k}

, which is independent of the nature of

P

between two macrostates; see Remark 71 and the discussion following it. The fluctuating microwork

d W_{k}

is different from

Δ {\tilde{W}}_{k} = Δ \tilde{W}

, which is the microwork done by the working medium on

m_{k}

after reduction, and which depends strongly on the nature of

P

but is the same for all microstates for a given

P

.

The most important new results that emerge in the

μ

NEQT are the following:

a clear separation of different kinds of work and heat and their fluctuations that emerge from $d_{α}$ ;
additional thermodynamic forces for irreversibility due to internal variables;
stochasticity resulting from a nonvanishing commutator ${\hat{C}}_{α} ≐ d_{α} \hat{A} - \hat{A} d_{α}$ ;
exchange microquantities are nonfluctuating, which makes them useless for directly obtaining fluctuations and irreversibility;
the fundamental identity $Δ_{i} W = Δ_{i} Q$ between irreversible macrowork and macroheat generalizing the result of Count Rumford and the Gouy-Stodola theorem;
the origin of work dissipation $Δ_{i} W > 0$ in an irreversible process;
the uniqueness of macrostates and microstate probabilities in the enlarged state space for $M (Z)$ determined by the experimental setup;
the $μ$ NEQT justifies the MNEQT as the $μ$ EQT justifies the EQT.

1.4. Layout

The layout of the paper is the following. In the next section, we introduce our notation, definitions, and new concepts, which may be unfamiliar to many readers but are justified in the following sections. We describe here our basic approach that a thermodynamic description is equivalent to treating microquantities as purely mechanical without any consideration of stochasticity, to be followed by bringing in microstate probabilities to determine macroquantities, just as in EQ statistical mechanics. Microstate probabilities are not truly microquantities as they are not independent of each other. The stochasticity adds the dimension of entropy, without which we only have a mechanical description of an NEQ body in

S_{Z}

. An arbitrary macrostate

M_{arb}

is divided into an EQ macrostate

M_{eq}

and an NEQ macrostate

M_{neq}

; the latter is further divided into an IEQ (internal equilibrium) macrostate

M_{ieq}

and an NIEQ (non-internal equilibrium) macrostate

M_{nieq}

. The IEQ macrostates share all the properties of EQ macrostates, except that the former have nonvanishing irreversible entropy generation

Δ_{i} S > 0

. The principle of reduction is also introduced here. In Section 3, we discuss the mathematical properties of and manipulations with the linear operators

d_{α}

, and give some examples for clarification. The origin of internal variables is explained in Section 4, where we show that they also emerge in mechanical descriptions so that they are not unique to thermodynamics. This explains why we need the enlarged state space

S_{Z}

for microscopic mechanical descriptions as well. We finally present the fundamentals of the

μ

NEQT in Section 5. This is a very important section, where we present various axioms and requirements of the

μ

NEQT. We then discuss stochasticity to derive a very general formulation of the entropy in terms of

\{p_{k}\}

, which is then used to obtain the unique form of

\{p_{k}\}

for

M_{ieq}

. An important and surprising aspect of the

μ

NEQT is obtained in the equality of internal microwork (a mechanical microquantity) and microheat (a stochastic microquantity) even though they have distinct origins. At this stage, we have a complete and unique NEQ statistical mechanics (the

μ

NEQT) in

S_{Z}

. We identify SI-macroquantities and use them to derive the MNEQT for

M_{ieq}

exemplified by the Gibbs fundamental relation in

S_{Z}

, which is then generalized to obtain the Gibbs fundamental relations for

M_{nieq}

in

S_{Z}

.

In Section 6 and Section 7, we begin to introduce the mechanical and stochastic aspects of the

μ

NEQT, respectively. In Section 6, we use

W

to identify microforces that operate in the mechanical formulation of the body so they are also present in its thermodynamic formulation. We use them to introduce the concept of microforce imbalance in Section 6.4, which captures the mechanical disparity between

Σ

and

\tilde{Σ}

. The imbalance is responsible for the internal microwork. In Section 6.5, we derive the extension of the work–energy theorem of mechanics in

S_{Z}

. In Section 7, we revisit a previous proposal for the origin of stochasticity and extend it further by discussing the effect of correlations between

Σ

and

\tilde{Σ}

, and introducing the principle of reduction in Section 7.2. We then discuss quasi-independence in Section 7.3, and the simplification it brings about in thermodynamic considerations after reduction, especially with respect to the effects produced by

\tilde{Σ}

on

Σ

, which is discussed in Section 7.4 and Section 7.5. The discussion, which forms a very important part of the review, shows why classical thermodynamics works so well.

In Section 8, we discuss the properties of the unique entropy

S_{ieq}

for

M_{ieq}

in

S_{Z}

, and discuss its approximate formulation as a flat distribution that is commonly used in EQ statistical mechanics. This distribution neglects any fluctuations in the entropy, which are always present in the body. Despite this, it correctly gives the entropy so it can always be used to determine it as it simplifies the calculation. We show that the entropy additivity requires quasi-independence in Section 8.1 so the latter should not be confused with the principle of additivity for

W

. Using this flat distribution, we provide a simple proof of the second law for

M_{ieq}

in

S_{Z}

in Section 8.3 by simply counting the number of distinct microstates as the system evolves in time, which can only increase with time; see Theorem 8. This direct proof is supplemented by Theorem 9 in Section 8.4 that the law is simply a direct consequence of the stability of the system so it does not need to be included as an additional part of Axiom 2 in the

μ

NEQT; see Section 5). In Section 9, we show that a violation [161] of the second law results in internally inconsistent thermodynamics for stable physical systems, and cannot be taken seriously (see Conclusion 7), even though thermodynamic instabilities arise in approximate calculations such as van der Waals equations or mean field, but are always removed from consideration; see Remark 58. Therefore, we will always assume that we are dealing with a stable system for which the law is always valid, as noted in Section 1, except in Section 9. In Section 10, we initiate the formulation of the

μ

NEQT by focusing on the two most important concepts, those of generalized or BI-microwork and microheat for

Σ_{b}

. We show that various micro- and macroheats emerge from the nonvanishing commutator

{\hat{C}}_{α}

introduced in Equation (229). For a fuller understanding, we first revisit in Section 10.1 the ensemble average of a fluctuating state variable, and its change in a process

P

. We show that for

Z \in Z

such as E belonging to

S_{Z}

, its change

d Z

consists of two independent process contributions in orthogonal state spaces

S_{Z}

and

S_{S}

, a mechanical one

d Z_{m}

at fixed

\{p_{k}\}

in

S_{Z}

, and a stochastic one

d Z_{s}

at fixed

\{Z_{k}\}

in

S_{S}

. Thus,

d Z ≐ d 〈Z〉 \in S_{χ}

. In contrast, the stochastic state variable

S \in S_{S}

has only stochastic contributions belonging to

S_{S}

. For E,

d E_{m}

represents the negative of the generalized macrowork

d W

, and

d E_{s}

the generalized macroheat

d Q

in the body. Their statistical interpretation is covered in Section 10.2, where we show that

d W_{k}

is purely mechanical, and

d Q_{k}

purely stochastic. In Section 11, we discuss how

d_{e} p_{k}

and

d_{i} p_{k}

are determined, and how they determine the forms of various microworks, microheats, and microentropies. We also give a general proof of the identity

d_{i} E \equiv 0

, even if

d_{i} E_{k} \neq 0, \forall k

. This now completes the formulation of the unique NEQ statistical mechanics (

μ

NEQT) in

S_{Z}

.

The only thing remaining for a complete formulation of the

μ

NEQT is to identify the choice of

S_{Z}

, which is discussed in Section 12. This is a very important section that describes how the choice of

S_{Z}

is dictated by the way an experiment is performed, which must not come as a surprise for an NEQ process. This is because the observation and relaxation times play important roles here. By ordering various internal variables with their relaxation times in decreasing order, we show that only those internal variables have to considered whose relaxation times are greater than the observation time to uniquely specify the macrostate in

S_{Z}

. We show how the unique microstate probability is identified. We consider the possibilities of fluctuating (Fl) and nonfluctuating (NFl) work parameter

W

. It will be convenient to take the parameters to be fixed so that they are the same for all microstates. We introduce the Legendre transform

E_{k}^{L}

of the microenergy

E_{k}

, which proves to be very useful in expressing

p_{k}

. The discussion justifies that once

S_{Z}

has been identified in which

M

becomes uniquely specified, the microstate probabilities are also uniquely specified. No auxiliary step is required to determine

p_{k}

. This is what makes the

μ

NEQT so useful. The discussion is easily extended to consider a microstate that is not unique in

S_{Z}

.

So far, we have provided a complete formulation of the

μ

NEQT for any

M_{arb}

at each instant. To proceed further to extend the

μ

NEQT for any process, we need to introduce a trajectory ensemble and determination of various path and process quantities, which is taken over in Section 13. We show that different trajectory quantities have different trajectory probabilities (path microprobabilities), which has not been appreciated so far. This finally provides a complete description of the

μ

NEQT for any process.

We now turn to some of the applications of the

μ

NEQT in the next three sections. In Section 14, we use it to describe the origin of microfricton at the microstate level. A new NEQ work fluctuation theorem is derived in Section 15 between any two arbitrary macrostates. In Section 16, we use the

μ

NEQT to study the quantum and classical free expansion using our work fluctuation theorem. The final section provides a brief discussion of our conclusions and a summary.

2. Notation, Definitions and New Concepts

Before proceeding further, it is useful to introduce in this section our notation to describe various systems and their behavior, and new concepts for their understanding without much or any explanation (that will be offered later in the review where we discuss them). We also give various definitions and briefly discuss new concepts such as various forms of NEQ work and heat that need to be carefully distinguished for a precise formulation of the

μ

NEQT. Various important concepts are highlighted in the form of Remarks to draw the attention of the reader. It is the hope that a reader can always come back here to be refreshed in case of confusion. In this sense, this section plays an important role in the review for the purpose of bookkeeping.

2.1. Systems and State Variables

Definition 1.

A system Σ is a collection of material particles and radiation enclosed in a region of space defined by some parameter

W

, and its Hamiltonian dynamics is determined by the Hamiltonian

H (x |W)

. A system can be embedded in a medium

\tilde{Σ}

, which is extremely large compared to it, and with which it interacts. The combined system

Σ_{0}

formed by Σ and

\tilde{Σ}

is commonly treated as an isolated system.

Remark 2.

For convenience, we take the parameter

W

to be fixed so that it is the same for all microstates, even though it is not hard to take it to be unfixed so that it changes over the microstates. We will refer to them as nonfluctuating and fluctuating, respectively.

We draw attention to Figure 1 to introduce the notation. The review mostly deals with statistical mechanics of macroscopically large systems

Σ

; however, we will also digress a bit to discuss small systems. In both cases,

Σ

is extremely small compared to the medium

\tilde{Σ}

; see Figure 1b. The medium

\tilde{Σ}

consists of two parts: a work source

{\tilde{Σ}}_{w}

and a heat source

{\tilde{Σ}}_{h}

, both of which can interact with

Σ

directly but not with each other. This separation allows us to study work and heat exchanges between

Σ

and

\tilde{Σ}

separately. We will continue to use

\tilde{Σ} = {\tilde{Σ}}_{w} \cup {\tilde{Σ}}_{h}

to refer to both of them together. The collection

Σ_{0} = Σ \cup \tilde{Σ}

forms an isolated system, which we assume to be stationary. We remark that the concept of an isolated system in a laboratory is an important approximation [79,162] but extremely useful as no such system locally exists in reality. We need to always keep this in mind.

The system in Figure 1a is an isolated system, which we may not be able to divide into a medium and a system. Each medium in Figure 2, although not interacting with each other, has a similar relationship with

Σ

. In case they were mutually interacting, they can be treated as a single medium. The collection

Σ_{0} = Σ \cup {\tilde{Σ}}_{1} \cup {\tilde{Σ}}_{2}

forms an isolated system. In the following, we will mostly focus on Figure 1 to introduce the notation, which can be easily extended to Figure 2 or to an extension with several mediums.

Definition 2.

Observables

X = (E, V, N, \dots)

of a system are extensive quantities that can be controlled from outside the system, and internal variables

ξ = (ξ_{1}, ξ_{2}, ξ_{3}, \dots)

are extensive quantities that cannot be controlled from outside the system. Their collection

Z = X \cup ξ

is called the set of extensive state variables of Σ forming

S_{Z}

, which we may simply write as

S

when no confusion will arise. The set

W

or a subset of it may be fixed or may fluctuate over all microstates of Σ.

Definition 3.

A system-intrinsic (SI) quantity is a quantity that pertains to the system Σ alone and can be used to characterize the system. A medium-intrinsic (MI) quantity is a quantity that is determined by the medium

\tilde{Σ}

alone and can be used to characterize it and also the exchanges between Σ and

\tilde{Σ}

. No external exchange is allowed for

Σ_{0}

.

We use a suffix 0 to denote all quantities pertaining to

Σ_{0}

, a tilde

(\tilde{})

for all quantities pertaining to

\tilde{Σ}

, and no suffix for all quantities pertaining to

Σ

, even if it is isolated. Thus, the set of observables is denoted by

X_{0}, \tilde{X}

, and

X

, respectively, and the set of state variables by

Z_{0}, \tilde{Z}

, and

Z

, respectively, in the state space

S_{Z}

; the set of internal variables are

ξ_{0}, \tilde{ξ}

, and

ξ

, respectively.

Remark 3.

We will use the term “body” to refer to any of

Σ, \tilde{Σ},

and

Σ_{0}

in this review and use

Σ_{b}

to denote it. However, to avoid notational complication, we will use the notation suitable for Σ for

Σ_{b}

if no confusion would arise in the context. The mechanical aspect of a body is described by its Hamiltonian

H (x| W)

, and we refer to all quantities pertaining to it as body-intrinsic (BI), which includes SI, MI, and ISI (for the isolated system) as the case may be.

The discussion below is mostly for a body

Σ_{b}

, but the notation is suited for a system. Thus, it covers the three systems

Σ, \tilde{Σ}

, and

Σ_{0}

, unless mentioned otherwise.

Definition 4.

A microstate of

H (x |W)

represents the instantaneous deterministic state of

Σ_{b}

. The quantum microstates are specified by a set of good quantum numbers, which we usually denote by k as a single quantum number for simplicity; we take

k \in N, N

denoting the set of natural numbers. In the classical case, we use a small cell

δ x_{k}

of volume

h^{3 N}

around

x_{k} = x

as the microstate

m_{k}

[163]; the collection

\{δ x_{k}\}

covers the entire phase space Γ. A microstate

m_{k}

appears with probability

p_{k}

that is central for statistical mechanics.

Below, we clarify the definition further.

2.2. Microstates and Macrostates

Remark 4.

In order to obtain a microscopic understanding of thermodynamics, we need to focus on the countable set of microstates

{\{m_{k}\}}_{k = 1, 2, \dots}

. Then

E_{k} = H (x_{k}| W)

(8)

denotes the microenergy of

m_{k}

. In explicit form, the microenergy for

m_{k}

of Σ will be expressed as

E_{k} (W)

, for

{\tilde{m}}_{\tilde{k}}

of

\tilde{Σ}

it will be expressed as

{\tilde{E}}_{\tilde{k}} (\tilde{w})

(see Equation (28a)), and for

m_{0 k_{0}}

of

Σ_{0}

it will be expressed as

E_{0 k_{0}} (W, \tilde{w})

(see Equation (28b)).

Remark 5.

For clarity and ease of presentation, we will assume each microstate to be nondegenerate, i.e., a singlet. Extending the discussion to degenerate microstates is trivial, as discussed in Section 15.

We now identify microstates

\{m_{k}\}

. In quantum mechanics, they refer to the countable microstates of the Hamiltonian of a bounded body, with k denoting the set of quantum numbers. In classical mechanics, they are usually identified as follows. We will normally employ a discretization of the classical phase space

Γ

of a bounded system by dividing it into countable nonoverlapping cells

δ x_{k}

, centered at

x_{k}

and of some small size, commonly taken to be

{(2 π ℏ)}^{3 N}

. The cells cover the entire phase space

Γ

. To account for the identical nature of the particles, the number of cells and the volume of the phase space are assumed to be divided by

N!

to count distinct microstates

m_{k} ≐ δ x_{k}

, indexed by k

= 1, 2, \dots

; the center of

δ x_{k}

is at

x_{k}

. The energy and probability of these cells are denoted by

\{E_{k}, p_{k}\}

in which

E_{k} (W)

is a function of

W

. The microstates obey deterministic evolution of the Hamiltonian

H (x |W)

of the body. For

Σ_{b} = \tilde{Σ}

,

\{{\tilde{m}}_{\tilde{k}}\}

appear with probabilities

\{{\tilde{p}}_{\tilde{k}}\}

; for

Σ_{b} = Σ_{0}

,

\{m_{0, k_{0}}\}

appear with probabilities

\{p_{0 k_{0}}\}

.

With the discretization, we will use the same symbol

Γ

to denote the space occupied by microstates

m_{k}

.

Claim 1.

It is through the changes in microstate probabilities that a thermodynamic process

P

gets its stochastic nature. In contrast, constant

p_{k}

’s describe a mechanical process, which is deterministic.

A thermodynamic process

P

between any two arbitrary states (we will instead use

\overset{˚}{P}

to denote a process between two equilibrium (EQ) terminal microstates) is understood in the context of the MNEQT [12,51] as a temporal sequence of macrostates

M (t)

of the body which keep changing during

P

due to changes in

{m_{k} (t)}

and/or

{p_{k} (t)}

. The rate of time variation (fast or slow compared to the equilibration time

τ_{eq}

) determines the (reversible or irreversible) nature of

P

.

Definition 5.

At the microscopic level, the state of

Σ_{b}

is specified by microstates set

\{m_{k}\}

, their energy set

\{E_{k}\}

, and their probability set

\{p_{k}\}

. For the same set

\{m_{k}, E_{k}\}

, different choices of

\{p_{k}\}

describe different macrostates

M

(see Definition 6), one of which,

M_{e q}

, corresponding to

\{p_{k}^{e q}\}

, specifies an EQ macrostate having the maximum entropy; all other states have smaller entropies and are called nonequilibrium (NEQ) macrostates.

It is important to draw attention to the following important distinction between the Hamiltonian

H

required for a microstate and the average energy E of a macrostate. While the thermodynamic energy accounts for the stochasticity through microstate probabilities, the use of the Hamiltonian is going to be restricted to a particular microstate. In other words, the Hamiltonian depends on

x

and

W

but the energy depends on the entropy S and

W

. The energy

E_{k}

of

m_{k}

, on the other hand, depends only on

W

and denotes the value of

H

associated with

m_{k}

; see Equation (8). In the following, we will always treat Hamiltonians and microstate energies as equivalent descriptions, which does not depend on knowing

{p_{k}}

; the average energies depend on

{p_{k}}

for their definition; see Equation (12) with

q = E

and

q_{k} = E_{k}

.

Definition 6.

A macrostate

M

in

S

is a collection

\{m_{k}, p_{k}\}

of microstates

m_{k}

and their probabilities

p_{k}, k = 1, 2, \dots

for a

Σ_{b}

. Quantities that are the same for all microstates are called macroquantities as they refer to the macrostates

M

. Quantities that refer to microstates are called microquantities, and carry the suffix k when associated with the microstate

m_{k}

such as

X_{k}

or

Z_{k}

, which are the microanalogs of

X

or

Z

, respectively; however, see Remark 14. We will simply use “quantity” to refer to both of these quantities in short.

For example, we will refer to

d W_{k}

as the microwork; similarly, we will refer to

d {\tilde{W}}_{k}

as the external microwork,

d_{e} W_{k}

as the exchange microwork, and

d_{i} W_{k}

as the internal microwork. The corresponding macroworks are denoted by

d W, d \tilde{W}, d_{e} W

, and

d_{i} W

. We thus see that there are various possible notions of works in NEQT.

A macrostate

M

is usually described by the state variable

Z

in thermodynamics but functions of

Z

can also be used to characterize

M

. They are all macroquantities. In statistical mechanics, microstates of the Hamiltonian are used to describe

M

at the microstate level.

Remark 6.

Microquantities can be divided into two kinds: pure and mixed. A pure microquantity such as

E_{k}

is determined solely by

m_{k}

but not by

M

. A mixed microquantity such as microheat and microentropy is one that is also determined by

M

. With this caveat in mind, we will call both kinds microquantities.

We find the shorthand notation [12,13,51]

d_{α} = (d, d_{e}, d_{i})

(9)

quite useful in the following for the various infinitesimal contributions. Thus,

d_{α} E_{k} = d E_{k}, d_{e} E_{k}, d_{i} E_{k}

will refer to microenergy change, exchange microenergy change, and internal microenergy change, respectively. We similarly use

d_{α} Q_{k} = d Q_{k}, d_{e} Q_{k}, d_{i} Q_{k}

for various forms of microheats, and

d_{α} S_{k} = d S_{k}, d_{e} S_{k}, d_{i} S_{k}

for various forms of microentropies; see Equation (27a) and Remark 14. In particular, the random variable dq should not be confused with the differential of q, which may not even be defined; see Remark 20. We will refer to

d Q_{k}

and

d S_{k}

as microheat and microentropy, respectively. The corresponding macroquantities are denoted by

d_{α} E, d_{α} Q

, and

d_{α} S

, respectively, without the index k. The following notation generalizes the physics of various infinitesimals and their relationship.

2.3. Micro–Macro Variables

Notation 1.

We introduce the sets of state variables

χ_{k} ≐ \{S_{k}, Z_{k}\}, χ ≐ \{S, Z\}, ζ_{k} ≐ \{S_{k}, W_{k}\}, ζ ≐ (S, W),

(10a)

and infinitesimals

d_{α} θ_{k} ≐ \{d_{α} χ_{k}, d_{α} W_{k}, d_{α} Q_{k}\}, d_{α} θ ≐ \{d_{α} χ, d_{α} W, d_{α} Q\} .

(10b)

Notation 2.

We introduce a compact notation

[q]

for the collection

\{q_{k}, q\}

:

[q] ≐ \{q_{k}, q\} .

(11a)

and

[d_{α} q]

to cover all of the following quantities:

[d_{α} q] \in [d_{α} θ] ≐ \{[d_{α} χ], [d_{α} W], [d_{α} Q]\} .

(11b)

Thus,

[χ] ≐ \{χ_{k}, χ\} \in S_{χ},

[ζ] ≐ \{ζ_{k}, ζ\} \in S_{ζ}, [d_{α} χ] ≐ \{d_{α} χ_{k}, d_{α} χ\} \in S_{χ}

, etc. For specificity, we use

χ_{k}^{j}

and

χ^{j}

to refer to the jth element of

χ_{k}

and

χ

, respectively. Similarly, we use

ζ_{k}^{j}, ζ^{j}

for the jth element of

ζ_{k}

and

ζ

, respectively, and

d_{α} θ_{k}^{j}, d_{α} θ^{j}

for the jth element of

d_{α} θ_{k}

and

d_{α} θ

, respectively.

2.4. Random Variable and Average

Remark 7.

In the language of probability theory,

M (t)

can be thought of as a random variable with outcomes

m_{k}

with probability

p_{k} (t)

. A microquantity

q_{k}

associated with

m_{k}

appears with probability

p_{k} (t)

at time t. Thus,

q_{k}

denotes an outcome of a random variable q, and usually forms a fluctuating (Fl) microquantity.

Definition 7.

The ensemble average for

\{q_{k}\}

or of the random variable q is defined by

\hat{A} q (t) = q (t) or \bar{q} (t) or 〈q〉 (t) \equiv \sum_{k} p_{k} (t) q_{k}

(12)

for a countable set

\{p_{k}\} (t)

that satisfies the sum rule

\sum_{k} p_{k} (t) = 1

(13)

due to the conservation of probability. We can also extend Equation (12) to q for which

q_{k} = q, \forall k

. We have used

\hat{A}

to denote the above averaging operator in Equation (12).

In thermodynamics, it is customary to use the simpler notation

q

for

〈q〉 = \hat{A} q

, which we will also follow in this review, such as

E, S

, etc., for the average energy, entropy, etc. However, we will also use the notation

\hat{A} q, \bar{q}

or

〈q〉

, when clarity is needed, as we will see in Section 10 that such a convention can lead to confusion if care is not exercised. We wish to emphasize that

\hat{A} q = q

does not imply that

\hat{A} = 1

, except when

q_{k} = q, \forall k

.

Remark 8.

To avoid confusion with the notation

d_{α} χ_{k}

, which can either mean

d_{α} (χ_{k})

as

d_{α}

acting on

χ_{k}

, or

{(d_{α} (χ))}_{k}

denoting the microquantity associated with

d_{α} (χ)

, we will continue to use

d_{α} χ_{k}

for the former, and

d_{α} {\bar{χ}}_{k}

for the latter, where

χ = \bar{χ}

stands for the macroquantity associated with

χ_{k}

; see Section 10.1 for details. However, we will simply use

d_{α} χ_{k}

in

d_{α} θ_{k}

to simplify the notation, but we will always use the specific notation when clarity is needed.

In this review, we will not consider a constant random variable. Hence, a random variable will always have fluctuating outcomes.

Notation 3.

We use modern notation [13,51] and its extension (see Figure 1), which will be extremely useful to understand the usefulness of our novel approach. Any infinitesimal and extensive

Σ_{b}

-intrinsic quantity dq

(t)

(see Equation (11b)) during an arbitrary infinitesimal process

d P

can be partitioned as

d q (t) \equiv d_{e} q (t) + d_{i} q (t),

(14a)

where

d_{e}

q

(t)

is the change caused by exchange (“e”) with the surroundings such as the medium and

d_{i}

q

(t)

is its change due to internal or irreversible (“i”) processes going on within

Σ_{b}

. As mentioned earlier, the term external quantity will also be used for an exchange quantity to emphasize its external nature in this review. The partition also applies to the outcome

d q_{k}

as follows:

d q_{k} (t) \equiv d_{e} q_{k} (t) + d_{i} q_{k} (t),

(14b)

As an example, we have (see Equation (27a) for the definition of

S_{k}

)

d E_{k} = d_{e} E_{k} + d_{i} E_{k}, d S_{k} = d_{e} S_{k} + d_{i} S_{k}

(15)

for

Σ_{b}

; here

d_{i} E_{k}

or

d_{i} S_{k}

does not have to vanish or have a particular sign even though

d_{i} E = 0

(see Equation (53a)) or

d_{i} S \geq 0

(see Equation (67c)). We see that the linear operators

d_{α}

satisfy

d \equiv d_{e} + d_{i} .

(16)

Claim 2.

An extensive quantity of

Σ_{b}

is additive over its various macroscopic parts, but the energy E is usually quasi-additive; see Section 5.6.

For the sake of clarity, we will take V as a symbolic representation of

X

, and a single

ξ

as an internal variable in many examples. Then,

w = (V), W = (V, ξ)

, and

Z = (E, V, ξ)

.

2.5. Different States in NEQT

Definition 8.

An equilibrium (EQ) macrostate is a uniform macrostate having the maximum possible entropy in

S_{X}

.

Definition 9.

A nonequilibrium macrostate can be classified into two classes:

(a): Internal-equilibrium macrostate (IEQ): The nonequilibrium entropy $S (X, t)$ for such a macrostate is a state function $S (Z)$ in the larger nonequilibrium state space $S_{Z}$ spanned by $Z$ ; $S_{X}$ is a proper subspace of $S_{Z}$ : $S_{X} \subset S_{Z}$ . As there is no explicit time dependence, there is no memory of the initial macrostate in IEQ macrostates.
(b): Non-internal-equilibrium macrostate (NIEQ): The nonequilibrium entropy for such a macrostate is not a state function of the state variable $Z$ . Accordingly, we denote it by $S (Z, t)$ with an explicit time dependence. The explicit time dependence gives rise to memory effects in these NEQ macrostates that lie outside the nonequilibrium state space $S_{Z}$ . An NIEQ macrostate in $S_{Z}$ becomes an IEQ macrostate in a larger state space $S_{Z^{'}}, Z^{'} \supset Z$ , with a proper choice of $Z^{'}$ .

Definition 10.

An arbitrary macrostate

M_{a r b}

of a system refers to all possible thermodynamic states, which include EQ macrostates, and NEQ macrostates with and without the memory of the initial macrostate. From now on, we denote an arbitrary macrostate by

M

, NEQ macrostates by

M_{n e q}

, EQ macrostates by

M_{e q}

, and IEQ macrostates by

M_{i e q}

.

Different choices of

\{p_{k}\}

for the same set

\{m_{k}, E_{k}\}

describe different macrostates for a given

W

, one of which corresponding to

\{p_{k}^{eq}\}

uniquely specifies the EQ macrostate

M_{eq}

; all other states are called NEQ macrostates

M_{n eq}

. Among

M_{neq}

are some special macrostates

M_{ieq}

that are said to be in internal equilibrium (IEQ); the rest are nonIEQ macrostates

M_{nieq}

. An arbitrary macrostate

M

refers to either an EQ or an NEQ macrostate; the latter can be either

M_{ieq}

or

M_{nieq}

.

2.6. Mechanical Description

Claim 3.

There are two distinct approaches to handling state variable

W

for a macrostate

\{m_{k}, p_{k}\}

of

Σ_{b}

; see Remark 2 and Definition 2.

Nonfluctuating (NFl) approach: It can be treated as a nonfluctuating (fixed) parameter in the Hamiltonian of $Σ_{b}$ so that it is the same for all of its microstates. If we alter $W$ , it changes the same way for all $m_{k}$ ’s. We say that $W$ is a NFl-parameter over $m_{k}$ ’s. This results in fluctuating generalized microforce

$F_{w k} ≐ - \partial E_{k} / \partial W$

(17a)

over $m_{k}$ ’s, with its ensemble average (see Equation (12)), given by the generalized macroforce

$F_{w} ≐ \sum_{k} p_{k} F_{w k} = - \partial E / \partial W .$

(17b)

Even though $F_{w k}$ is a microvariable, we find it useful conceptually to think of it as the outcome of a random variable $F_{w}$ on $m_{k}$ . We use the notation $\{W, F_{w k}\}$ to compactly refer to this case.
Fluctuating (Fl) approach: Alternatively, we let $W$ fluctuate over $m_{k}$ ’s and think of it conceptually as a random variable $W$ with outcomes $\{W_{k}\}$ , even though $W_{k}$ is a microvariable. To be consistent with the NFl-approach (see below), we require that $F_{w}$ becomes nonfluctuating (fixed) defined by

$\forall k, - \partial E_{k} / \partial W_{k} = - \partial E / \partial W = F_{w} .$

(18)

In this view, the macroforce $F_{w}$ is fixed (so it is the same for all macrostates) with the result that $m_{k} (W_{k})$ is determined by the fluctuating random variable $W$ over $m_{k}$ ’s, with its average (see Equation (112)) given by

$W ≐ \sum_{k} p_{k} W_{k} .$

(19)

We use the notation $\{W_{k}, F_{w}\}$ to compactly refer to this case.

The same two approaches apply as well if we replace

W

by

w

, and

F_{w}

by

f_{w}

in the above equations.

Claim 4.

The presence of a parameter in the Hamiltonian

H (x| W)

of the body brings forth the Legendre-transformed Hamiltonian

H^{L} (x| W^{L})

as the most important quantity to consider, where

W_{w}^{L}

is the work parameter in

H^{L}

; see Section 6.3.

Claim 5.

The nonfluctuating (NFl) parameter

W

results in fluctuating (Fl) microfield

\{F_{w k}\}

that plays the role of

W^{L}

, and fluctuating

\{W_{k}\}

results in a NFl workfield

F_{w}

that plays the role of

W^{L}

. As noted in Remark 2, we find it convenient to take

W

and

F_{w}

as the parameters, respectively, as will become clear later in the review.

We provide an intuitive understanding of the two approaches. For the NFl-

W

, we use the microstates

\{m_{k}\}

of

H (x| W)

so that every microstate is specified by the same

W

. If we use the same Hamiltonian

H (x| W)

for the Fl-

W

case, this will require considering different Hamiltonian

H (x_{k}| W_{k})

for different microstates so that their slopes are all equal to (

- F_{w}

); see Equation (18). This is quite cumbersome. It is well-known that in this case, it is most convenient to consider

H^{L} (x| F_{w})

with

W^{L} = F_{w}

so that every microstate is specified by the same

W^{L}

, which plays the role of the work-parameter in

H^{L} (x| F_{w})

; see Section 6.3.

Remark 9.

We now explain the concept of consistency noted above. Consider

E_{k} (W)

for some microstate

m_{k}

in the NFl-approach, and determine

F_{w k}

at some

W

. Using the variation

d W

, we determine the change

d E_{k}^{N F l} = - F_{w k} \cdot d W

. In the Fl-approach, we choose that particular value

W_{k}

at which

E_{k}

has the NFl slope

F_{w}

as shown in Equation (18). We emphasize that only the particular

\{W_{k}\}

is considered that satisfies Equation (18). We then determine the variation

d W_{k}

so that

d E_{k}^{F l} = - F_{w} \cdot d W_{k}

, as follows from Equations (17a), has exactly the same value as

d E_{k}^{N F l}

. Therefore, we do not have to distinguish between

d E_{k}^{N F l}

and

d E_{k}^{F l}

, and use the simpler notation

d E_{k}

for both of them. As a consequence, we can make the following:

Claim 6.

We have the same microwork in both approaches:

d W_{k} = F_{w k} \cdot d W = F_{w} \cdot d W_{k} = - d E_{k} .

(20)

Remark 10.

In the NFl approach, we introduce the Legendre transform

E_{k}^{L, N F l} (F_{w k}) = E_{k} (W) + F_{w k} \cdot W,

(21a)

as a function of

F_{w k}

with

W = \partial E_{k}^{N F l} (F_{w k}) / \partial F_{w k} .

(21b)

In the Fl approach, we introduce the Legendre transform

E_{k}^{L, F l} (F_{w}) = E_{k} (W_{k}) + F_{w} \cdot W_{k},

(22a)

as a function of

F_{w}

with

W_{k} = \partial E_{k}^{N F l} (F_{w}) / \partial F_{w} .

(22b)

We see that the above definitions of the Legendre transform

E_{k}^{L}

of

E_{k}

in the two approaches can be compactly denoted by

E_{k}^{L} (b) = E_{k} (a) + Φ (a, b)

(23a)

in terms of a scalar function

Φ (a, b) ≐ a \cdot b;

(23b)

see also Section 6.3. It is clear from Equation (23a) that it is sufficient to investigate the behavior of

E_{k}

; the behavior of

E_{k}^{L}

is easily obtained from it. Therefore, we will mostly focus on

E_{k}

in the review.

Remark 11.

As microstates

m_{k}

play the central and important role in our approach involving the Hamiltonian, the microstate energies

\{E_{k}\}

represent the outcomes of a random variable E over the microstates. Thus, we always deal with a fluctuating microstate energy. Consequently, the corresponding “macroforce”

f_{s} ≐ - \partial E / \partial S = - T

(24)

(see Equation (1) or equivalently Equation (129)) always appears as a NFl-parameter for

Σ_{b}

, which can be combined with

f_{w}

and

F_{w}

as

f ≐ \{f_{s}, f_{w}\}, F ≐ \{f_{s}, F_{w}\}

(25)

to represent the relevant macroforces.

Remark 12.

It follows from the above Remark that we can either consider the case

\{W, F_{w k}\}

or

\{W_{k}, F_{w}\}

. In both cases, we obtain the same thermodynamics. A NFl parameter can be treated as a deterministic parameter for whichq

_{k} =

q

, \forall k

, as the probability of q is unity (certainty).

Remark 13.

The work parameter may be a function of time t. By taking one of the components of the work parameter

w

to be simply t, it is also possible to include t as a separate parameter in

H (x |t, W)

as is common in mechanics [164].

Definition 11.

In general,

p_{k}

are functions of the microquantity

X_{k}

or

Z_{k}

in

S_{X}

or

S_{Z}

, respectively, and are implicit functions of t through the latter; they may also depend explicitly on time t if not unique in the state space. For an EQ or an IEQ macrostate,

p_{k}

have no explicit dependence on t; see Section 12 for details. As

p_{k}

always satisfies the sum rule (see Equation (13)) over any

M

, it is also an ensemble quantity because of this, and should be treated as a mixed microquantity; it is not determined by

m_{k}

alone so it is not a true microquantity.

Definition 12.

The collection

\{m_{k}, p_{k}\}

provides a complete microscopic or statistical mechanical description of thermodynamics of any arbitrary macrostate

M

in some state space

S

in which one deals with macroscopic or ensemble averages using

\{p_{k}\}

(see Definition 7) over

\{m_{k}\}

of microstate variables.

2.7. Entropy and Stochastic Description

Definition 13.

A state function entropy S for

M_{e q}

or

M_{i e q}

is defined thermodynamically by the Gibbs fundamental relation up to a constant.

Definition 14.

Statistical entropy S, often called the Gibbs entropy, for

M

is defined by its microstates by the Gibbs formulation (see Equation (116)),

S \equiv 〈S〉 = \sum_{k} p_{k} S_{k} = - \sum_{k} p_{k} ln p_{k},

(26a)

with its differential given by

d S = d 〈S〉 = - \sum_{k} (η_{k} + 1) d p_{k} \equiv - \sum_{k} {\hat{η}}_{k} d p_{k}

(26b)

where

S_{k}

is defined by

S_{k} \equiv - η_{k} ≐ - ln p_{k};

(27a)

in terms of Gibbs’ index of probability ([48], p. 16)

η_{k} ≐ ln p_{k},

(27b)

and where we have also introduced

{\hat{η}}_{k} ≐ η_{k} + 1 .

(27c)

Remark 14.

The quantity

S_{k}

and any deterministic function of it are mixed microquantities for the simple reason that

p_{k}

satisfies the sum rule in Equation (13), which requires considering all the microstates; see also Definition 11. However, S is a macroquantity that is also a state variable.

This property of

S_{k}

should not be forgotten.

Remark 15.

Being additive, S is extensive. As a consequence,

S_{k}

must be extensive.

As

\tilde{Σ}

is taken to be in EQ, its Hamiltonian is defined by its observable

\tilde{X}

; the internal variable

\tilde{ξ}

plays no role. Thus, we will express its Hamiltonian as

\tilde{H} (\tilde{x} |\tilde{w}) .

(28a)

We will also assume that

\tilde{Σ}

is weakly interacting with

Σ

, a point discussed carefully in Section 5.6. By neglecting their mutual interaction, we have quasi-additivity of their Hamiltonians to determine the Hamiltonian of

Σ_{0}

:

H_{0} (x_{0} |W, \tilde{w}) \approx H (x |W) + \tilde{H} (\tilde{x} |\tilde{w}),

(28b)

and states the quasi-additivity of the microstate energies; see Equation (119). We also assume the following additivity in this case:

W_{0} \equiv W + \tilde{w};

(28c)

2.8. Reduction

Very often, we need to define an ensemble average over a composite system such as

Σ_{0}

formed by two or more systems. We focus on

Σ_{0} = Σ \cup \tilde{Σ}

. A microquantity q

_{0 k_{0}}

associated with

Σ_{0}

may also refer to a microquantity q

_{k}

associated with

Σ

, or a microquantity

{\tilde{q}}_{\tilde{k}}

associated with

\tilde{Σ}

.

Definition 15.

The ensemble average over

m_{0 k_{0}}

of a composite microquantity q

_{0 k_{0}}

of

Σ_{0}

is given by the joint probability

p_{0 k_{0}} \equiv p (k| \tilde{k}) p_{\tilde{k}} = p_{k} p (\tilde{k}| k)

(29)

to be used in the following two equivalent ways:

\begin{matrix} q_{0} & = \sum_{k} p_{k} \sum_{\tilde{k}} \frac{p (k| \tilde{k})}{p_{k}} p_{\tilde{k}} q_{0 k_{0}} \end{matrix}

(30a)

\begin{matrix} = \sum_{k} p_{k} \sum_{\tilde{k}} p (\tilde{k}| k) q_{0 k_{0}} . \end{matrix}

(30b)

This averaging is properly discussed in Section 7. The conditional probabilities

p (k| \tilde{k})

and

p (\tilde{k}| k)

contain all the information about the correlation between

Σ

and

\tilde{Σ}

due to their mutual interaction, which will be considered in detail in Section 5.6 and Section 7. Here, we use the above definition to define the conditional microquantity q

_{0 k}

given that

Σ

is in the microstate

m_{k}

.

Definition 16.

The reduction of the composite microquantity

q_{0 k_{0}}

to a conditional

Σ_{0}

-microquantity q

_{0 k}

is defined by

q_{0 k} ≐ \sum_{\tilde{k}} \frac{p (k| \tilde{k})}{p_{k}} p_{\tilde{k}} q_{0 k_{0}} = \sum_{\tilde{k}} p (\tilde{k}| k) q_{0 k_{0}} .

(31)

Here, the conditional microquantity q

_{0 k}

associated with

Σ_{0}

carries the suffix k and not

k_{0}

, and is obtained under the condition that Σ is in the microstate

m_{k}

, and requires conditionally averaging over all the microstates

{\tilde{m}}_{\tilde{k}}

of

\tilde{Σ}

using the reduced or conditional probability

p (k| \tilde{k}) / p_{k}

; see Section 7 for details.

It is evident, but also easily verified, that the conditional microquantity associated with

Σ

is the same as q

_{k}

. For

{\tilde{q}}_{\tilde{k}}

, we find that

{\tilde{q}}_{k} ≐ \sum_{\tilde{k}} p (\tilde{k}| k) {\tilde{q}}_{\tilde{k}},

(32)

and can be very different from

{\tilde{q}}_{\tilde{k}}

.

Claim 7.

When the two bodies in the above definition are quasi-independent (see Definition 28 and Section 7.3 for full details), then

p (k| \tilde{k}) \approx p_{k}, p (\tilde{k}| k) \approx p_{\tilde{k}} .

(33)

Remark 16.

A composite microquantity

χ_{0 k_{0}}^{j}

and a medium microquantity

{\tilde{χ}}_{\tilde{k}}^{j}

are easily reduced to the conditional microquantities

χ_{0 k}^{j}

and

{\tilde{χ}}_{k}^{j}

ascribed to

m_{k}

, respectively, by using quasi-independence condition as

χ_{0 k}^{j} \approx \sum_{\tilde{k}} p_{\tilde{k}} χ_{0 k_{0}}^{j}, {\tilde{χ}}_{k}^{j} \approx \sum_{\tilde{k}} p_{\tilde{k}} {\tilde{χ}}_{\tilde{k}}^{j} = {\tilde{χ}}^{j},

(34)

so that their averages following Equation (12) finally give

{\bar{χ}}_{0}^{j}

and

{\tilde{χ}}^{j}

approximately compared to its exact formulation in Equation (31). Here, the conditional quantities

χ_{0 k}^{j}

and

{\tilde{χ}}_{k}^{j}

require conditional averaging over all the microstates

{\tilde{m}}_{\tilde{k}}

with their probabilities

p_{\tilde{k}}

, given that Σ is in the microstate

m_{k}

; the reduced or conditional probability approximately becomes unity due to Equation (33). The last equation in Equation (34) follows from Theorem 1.

Remark 17.

The above reduction plays a very important role in the formulation of the NEQ statistical mechanics (μNEQT) of the system Σ by reducing all microquantities in

Σ_{0}

to conditional microquantities under the condition that Σ is in microstate

m_{k}

.

For a medium microquantity

{\tilde{q}}_{\tilde{k}}

, we obtain a very important result, which we quote as a Theorem because of its extreme importance.

Theorem 1.

Under quasi-independence approximation, the conditional

{\tilde{q}}_{k}

is simply given by the macroquantity

\tilde{q}

:

{\tilde{q}}_{k} \approx \tilde{q}, \forall k .

(35)

Proof.

By replacing

{\tilde{χ}}_{\tilde{k}}^{j}

by

{\tilde{q}}_{\tilde{k}}

and

{\tilde{χ}}_{k}^{j}

by

{\tilde{q}}_{k}

in Equation (34), we obtain the ensemble average on the right side, which proves the theorem. ☐

The application and general proof of this important theorem is deferred to Section 7.5, where it is restated slightly differently as Theorem 7, where we justify Remark 16, which is used in the simple proof given above.

2.9. Process Quantities

Remark 18.

For a state variable q

\in \{S, Z\}

for

Σ_{b}

, its microstate analog q

_{k}

is trivially identified as the microstate value q takes on

m_{k}

, and appears as the coefficient of

p_{k}

in the right-hand side of Equation (12), the ensemble average. We now consider the process quantity

d_{q k}

and consider its ensemble average

〈d q〉 \equiv \hat{A} d q_{k} ≐ \sum_{k} p_{k} d q_{k}

(36a)

if we follow the convention adopted in Equation (12). However,

〈d q〉

above is not the same as

d \hat{A} q \equiv d q or d \bar{q} (t) or d 〈q〉 ≐ \sum_{k} p_{k} d {\bar{q}}_{k} \equiv d (\sum_{k} p_{k} q_{k});

(36b)

we have also introduced the microstate analog

d {\bar{q}}_{k}

for dq or

d 〈q〉

to make sure that we distinguish dq

_{k}

and

d {\bar{q}}_{k} = {(d \hat{A} q)}_{k} ≐ d q_{k} + q_{k} d η_{k},

(36c)

so that

d 〈q〉 = \hat{A} d \bar{q}

. This distinction becomes very important for q

= E

and S, as we will see in Section 10.1; see also Definition 23.

Definition 17.

In mechanics, the generalized or BI-microwork by

Σ_{b}

with parameter

W

is defined as the microwork done by the fluctuating microforce

F_{w k}

d W_{k} ≐ F_{w k} \cdot d W = - (\partial E_{k} / \partial W) \cdot d W = - d E_{k},

(37a)

with

F_{w k}

in its component form is given by

F_{w k} = (P_{k}, . . ., A_{k}) = (f_{w k}, A_{k});

(37b)

here, ⋯ denotes microfields corresponding to the rest of the state variables in

w

besides V, and

f_{w k} ≐ - \partial E_{k} / \partial w, A_{k} ≐ - \partial E_{k} / \partial ξ,

(37c)

with

A_{k}

representing the microaffinity.

With fluctuating

\{W_{k}\}

, it is defined as the microwork done over the fluctuating generalized displacement

d W_{k}

d W_{k} ≐ F_{w} \cdot d W_{k} = - (\partial E_{k} / \partial W_{k}) \cdot d W_{k};

(38)

see Claim 6. The generalized or BI-macrowork done by

Σ_{b}

after ensemble averaging (see Equation (19)) in both approaches are the same:

d W = 〈d W〉 = F_{w} \cdot d W

(39)

Explicitly, we express

F_{w}

in its component form as

F_{w} = (P (t), . . ., A (t)) = (f_{w} (t), A (t));

(40)

see Figure 1. Here, ⋯ denotes the macrofields corresponding to the rest of the state variables in

w

besides V, and

f_{w} ≐ - \partial E / \partial w .

(41)

The SI-affinity

A ≐ - \partial E / \partial ξ

(42)

corresponding to

ξ

[12,51] is nonzero, except in EQ, when it vanishes:

A_{eq} \equiv A_{0} = 0 = 0

[13,51].

The SI-macrowork

d W_{ξ}

done by

Σ

as the internal variable

ξ

varies is

d W_{ξ} \equiv d_{i} W_{ξ} ≐ A \cdot d ξ \geq 0 .

(43)

Even for an isolated NEQ system,

d W_{ξ}

will not vanish; it vanishes only in EQ, since

ξ

does no work when

A_{0} = 0

. Because of this,

d_{e} W_{ξ} \equiv 0

so that

d W_{ξ} \equiv d_{i} W_{ξ}

. However,

f_{w}, d \tilde{W}

and

d_{e} W

are unaffected by the presence of

ξ

.

Definition 18.

In statistical mechanics, generalized or SI-microheat for

Σ_{b}

is defined as

d Q_{k} ≐ - T (η_{k} + 1) d η_{k} \equiv - T {\hat{η}}_{k} d η_{k};

(44a)

see Equation (255). The average of

d Q_{k}

is the generalized or SI-macroheat

d Q ≐ \sum_{k} E_{k} d p_{k} \equiv T d S .

(44b)

Remark 19.

We will use “generalized” or “SI” interchageably in this review.

Conclusion 1.

The SI-macroheat or the generalized macroheat

d Q

d Q ≐ T d S,

(45)

is identified as the Clausius equality; see Remark 42.

This interesting equality should be distinguished from the well-known Clausius inequality

d_{e} Q ≐ T_{0} d_{e} S \leq T_{0} d S .

(46)

Thus, Equation (93a) allows us to uniquely identify generalized heat and work as independent of each other.

Remark 20.

The d in

d W, d Q, d W_{k}

, and

d Q_{k}

does not denote any differential operator on some quantity

W, Q, W_{k}

, and

Q_{k}

, respectively. Conventionally, one uses a symbol đ or some other symbols in thermodynamics to emphasize this distinction. However, we follow the standard notation of mechanics for

d W

and

d W_{k}

to emphasize these mechanical concept of work. We also use the same symbol for

d Q

and

d Q_{k}

. If we extend Equation (36a) to also include

d W_{k}

and

d Q_{k}

, then we could also use

\bar{d W}

and

\bar{d Q}

for

d W

and

d Q

, respectively, but we will use the simpler notation

d W

and

d Q

. This should not cause any confusion.

It follows from Equations (45) and (46) that the irreversible macroheat is

d_{i} Q = \{\begin{matrix} (T - T_{0}) d S + T_{0} d_{i} S \\ (T - T_{0}) d_{e} S + T d_{i} S \end{matrix};

(47)

2.10. $Σ_{0}$ (Isolated Body) and $\tilde{Σ}$ (Medium)

Remark 21.

As an isolated body cannot exchange anything with its surroundings, we must always have

d_{e} θ_{0} (t) \equiv 0, d_{e} θ_{0 k_{0}} (t) \equiv 0, d_{e} θ_{0 k} (t) \equiv 0, \forall k_{0}, k;

(48a)

see Definition 3. The last equality emerges from reduction; see Remark 16.

Remark 22.

For a medium

\tilde{Σ}

, which is assumed to be in EQ and weakly interacting with and quasi-independent of the system Σ in microstate

m_{k}

, we must have

d_{i} \tilde{θ} (t) \equiv 0, d_{i} {\tilde{θ}}_{\tilde{k}} (t) \neq 0, d_{i} {\tilde{θ}}_{k} (t) = 0

(49)

after reduction of

d_{i} {\tilde{θ}}_{\tilde{k}} (t)

from

\tilde{k}

to k. The last equality follows from Theorem 1 by replacing

{\tilde{q}}_{k}

by

d_{i} {\tilde{θ}}_{k} = d_{i} \tilde{θ}

, a NFl macroquantity, even though

d_{i} {\tilde{θ}}_{\tilde{k}}

is a Fl microquantity over

{\tilde{m}}_{\tilde{k}}

.

Remark 23.

As we always use microstates

m_{k}

’s with fluctuating energies

E_{k}

, we find it useful and simple to use the notation in which

W

is fluctuating with

W_{k}

over microstates. This means that we will consider the state variable

Z

fluctuating with

Z_{k}

over microstates as if we are dealing with the fixed field approach with

Z

given by the extension of Equation (19)

Z = \sum_{k} p_{k} Z_{k} .

(50)

The approach also covers the fluctuating workfield approach if we simply replace each

W_{k}

by a fixed

W

.

3. Mathematical Digression on $\{d_{α}\}$

In NEQT, there are various forms of work and heat

[d W]

and

[d Q]

. Therefore, it is necessary to distinguish between them. Let us consider the Clausius equality in Equation (45) relating the SI-macroheat

d Q

and the entropy change

d S

. It would be naïve to take this equality to conjecture that

d_{α} Q = T d_{α} S,

for the simple reason that the exchange macroheat

d_{e} Q

is a MI-quantity so it must be determined by the medium alone. The presence of T in the above conjecture

d_{e} Q = T d_{e} S

raises doubts about the conjecture as T has nothing to do with the medium. Therefore, it is important to understand the role of the operators

d_{α}

, which is explained in this section. This makes this section extremely important in the review.

3.1. Generalizing $d \equiv d_{e} + d_{i}$

The linear operators

d_{α}

satisfy not only the identities in Equations (14a) and (14b), but also the following identities:

\begin{matrix} d_{α} (a q_{1} + b q_{2}) & = a d_{α} q_{1} + b d_{α} q_{2}, \\ d_{α} (q_{1} q_{2}) & = q_{1} d_{α} q_{2} + (d_{α} q_{2}) q_{2}; \end{matrix}

(51)

here q

_{1}

and q

_{2}

are two extensive random variables, and a and b are two pure numbers.

The generalization of de Groot–Prigogine notation in Notation 3 provides a very compact description of NEQ processes in the

μ

NEQT. The original notation [13,51] is restricted to the entropy, particle number, energy, and volume changes

d S, d N, d E

, and

d V

, respectively, for

Σ_{b}

; see Figure 1 for

d Z \to d X = d S, d N, d E

and

d V

:

\begin{matrix} d [S] & \equiv d_{e} [S] + d_{i} [S], \end{matrix}

(52a)

\begin{matrix} d [N] & \equiv d_{e} [N] + d_{i} [N], \end{matrix}

(52b)

\begin{matrix} d [E] & \equiv d_{e} [E] + d_{i} [E], \end{matrix}

(52c)

\begin{matrix} d [V] & \equiv d_{e} [V] + d_{i} [V], \end{matrix}

(52d)

As no internal process can change the energy [12], we have

d_{i} E \equiv 0 .

(53a)

The surprising fact is that

d_{i} E_{k} \neq 0

, as we will establish below; see Theorem (6). Similarly,

d_{i} V = 0 .

(53b)

We have also assumed that

d_{i} N = 0

, but this is no consequence as we are assuming no chemical reaction in the review. We should emphasize that the partitions above have nothing to do with the partitions in Equations (238) and (247a), respectively. The original partition in Equation (52b) is not relevant in the review as we do not consider any chemical reaction, so

d N \equiv d_{e} N

. Observe that the above partitions are defined only for macroscopic extensive observables for a body. We have extended the notation to not only all extensive state variables in

[χ]

but for

[d_{α} W], [d_{α} Q]

for any body

Σ_{b}

. We thus have

\begin{matrix} d W_{k} & = d_{e} W_{k} + d_{i} W_{k}, d Q_{k} = d_{e} Q_{k} + d_{i} Q_{k}, \end{matrix}

(54a)

\begin{matrix} d W & = d_{e} W + d_{i} W, d Q = d_{e} Q + d_{i} Q; \end{matrix}

(54b)

For

Σ_{b}

an isolated system

Σ_{0}

, it follows from Equation (48a) that

d_{e} W_{0} \equiv 0, d_{e} Q_{0} \equiv 0 .

(55a)

For

Σ_{b}

a medium

\tilde{Σ}

, it follows from Equation (49) that

d_{i} \tilde{W} \equiv 0, d_{i} \tilde{Q} \equiv 0 .

(55b)

Note that

d W, d Q

, etc., do not represent changes in any SI-macrovariable; see Remark 20.

Remark 24.

We mostly focus on

\{q_{k}\}

or

\{d_{α} q_{k}\}

in the μNEQT, from which we obtain the information about the corresponding macroquantity q or

d_{α}

q, respectively, by ensemble averaging. The approach in this sense is to effectively discuss

[q]

or

[d_{α} q]

, without explicitly showing the suffix k, unless clarity is needed. We will, however, use

{[q]}_{k}

or

{[d_{α} q]}_{k}

when we consider specific cases.

We now consider the three systems separately for clarity below so we need

[q], [d_{α} q], [\tilde{q}],

[d_{α} \tilde{q}]

, and

[q_{0}], [d_{α} q_{0}]

for

Σ, \tilde{Σ}

(not necessarily in EQ) and

Σ_{0}

, respectively, which satisfy additivity for

Σ_{0}

so that

{[q_{0}]}_{k_{0}} = {[q]}_{k} + {[\tilde{q}]}_{\tilde{k}}, {[d_{α} q_{0}]}_{k_{0}} = {[d_{α} q]}_{k} + {[d_{α} \tilde{q}]}_{\tilde{k}},

(56)

where we have explicitly shown microstate indices for

Σ, \tilde{Σ}

, and

Σ_{0}

; here and in the following, q

\in χ

, and

d_{α}

q

\in d_{α} θ

. For these equations to hold, we need to assume that

Σ

and

\tilde{Σ}

interact so weakly that their interactions can be neglected (recall that

[E]

is one of the possible

[q]

) and that

Σ

and

\tilde{Σ}

are quasi-independent [148]; see Section 7.3. We also consider their partitions as shown in Equation (14a).

Remark 25.

The medium

\tilde{Σ}

in Equation (56) need not be in EQ, so Equation (56) also applies to a system Σ consisting of two subsystems

Σ_{1}

and

Σ_{2}

interacting with each other satisfying quasi-additivity and quasi-independence. All we need to do is to take

Σ_{0} \to Σ, Σ \to Σ_{1}

, and

\tilde{Σ} \to Σ_{2}

. We can also have Σ embedded in a medium

\tilde{Σ}

, distinct from the previous

\tilde{Σ}

. It follows from Equation (56) that

q_{k} = q_{k_{1}} + q_{k_{2}}, d_{α} q_{k_{0}} = d_{α} q_{k_{1}} + d_{α} q_{k_{2}},

(57a)

and

q = q_{1} + q_{2}, d_{α} q = d_{α} q_{1} + d_{α} q_{2} .

(57b)

Explicitly, we have

\begin{matrix} d q & = d q_{1} + d q_{2}, d q_{k} = d q_{1 k_{1}} + d q_{2 k_{2}}, \\ d_{e} q & = d_{e} q_{1} + d_{e} q_{2}, d_{e} q_{k} = d_{e} q_{1 k_{1}} + d_{e} q_{2 k_{2}}, \\ d_{i} q & = d_{i} q_{1} + d_{i} q_{2}, d_{i} q_{k} = d_{i} q_{1 k_{1}} + d_{i} q_{2 k_{2}} . \end{matrix}

(58)

in which we must treat

d_{e}

q

_{j}, d_{e} q_{j k_{j}}, j = 1, 2

, carefully. As usual,

[d_{e} q]

is the exchange with

\tilde{Σ}

, but

[d_{e} q_{1}], [d_{e} q_{2}]

each have two exchanges; one exchange involving the suffix m is with

\tilde{Σ}

, and the other exchange is with the other subsystem. Thus, we have

[d_{e} q_{1}] = [d_{e} q_{1 m}] + [d_{e} q_{12}], [d_{e} q_{2}] = [d_{e} q_{2 m}] + [d_{e} q_{21}],

(59)

in which

[d_{e} q_{12}], [d_{e} q_{21}]

stand for mutual exchanges between the subsystems.

Remark 26.

For an isolated Σ in Equation (58), we must have

{[d_{e} q]}_{k} =

{[d_{e} q_{1 m}]}_{k_{1}} = {[d_{e} q_{1 m}]}_{k_{2}} = 0

(see Remark 21), so

{[d_{e} q_{1}]}_{k_{1}} = - {[d_{e} q_{2}]}_{k_{2}} .

(60)

Remark 27.

It follows from Remark 26 that

d_{e} W_{k} = 0, d_{e} W_{1 k_{1}} = - d_{e} W_{2 k_{2}} .

(61)

We now turn back to discussing a system embedded in a medium as above, and prove the following important theorem.

Theorem 2.

We consider the system Σ and the medium

\tilde{Σ}

(not necessarily in EQ) forming the isolated system

Σ_{0}

. We prove two important identities that are extremely useful in the μNEQT:

\begin{matrix} {[d_{e} q]}_{k} & ≐ - {[d_{e} \tilde{q}]}_{\tilde{k}} = - {[d \tilde{q}]}_{\tilde{k}} + {[d_{i} \tilde{q}]}_{\tilde{k}}, \end{matrix}

(62a)

\begin{matrix} {[d q_{0}]}_{k_{0}} & \equiv {[d q]}_{k} + {[d \tilde{q}]}_{\tilde{k}} = {[d_{i} q_{0}]}_{k_{0}} = {[d_{i} q]}_{k} + {[d_{i} \tilde{q}]}_{\tilde{k}} . \end{matrix}

(62b)

Proof.

As

Σ_{0}

is isolated, there cannot be any exchange quantity, so

[d_{e} q_{0}] \equiv 0

. It follows from Equation (60) that

{[d_{e} q_{0}]}_{k_{0}} = {[d_{e} q]}_{k} + {[d_{e} \tilde{q}]}_{\tilde{k}} \equiv 0 .

The identity in Equation (62a) immediately follows. Again using the second equation in Equation (56) for

d_{α} = d

, and using

{[d_{e} q_{0}]}_{k_{0}} = 0

proves the second identity, after using Equations (14a) and (14b) in

{[d q_{0}]}_{k_{0}}

. This case is appropriate for treating

\tilde{Σ}

as another system. ☐

For

\tilde{Σ}

in EQ,

d_{i} \tilde{q} = 0

but not

d_{i} {\tilde{q}}_{\tilde{k}}

, as it is an outcome of a random variable

d_{i} \tilde{q}

; see Remark 22. Thus,

d {\tilde{q}}_{\tilde{k}} = d_{e} {\tilde{q}}_{\tilde{k}} + d_{i} {\tilde{q}}_{\tilde{k}}; d_{α} q_{0 k_{0}} = d_{α} q_{k} + d_{α} {\tilde{q}}_{\tilde{k}},

(63a)

which should undergo reduction as our interest is to investigate

Σ

in

m_{k}

. This is done in Section 7.5, where we find that

d_{e} q_{k} = - d_{e} {\tilde{q}}_{k} = - d_{e} \tilde{q} = d_{e} q, \forall k,

(64a)

showing that exchange microquantities are not random variables; see Theorem 7. For the macrostate, we have

d \tilde{q} = d_{e} \tilde{q} = - d_{e} q, d q_{0} = d_{i} q .

(64b)

For

[q] = [Z]

and

\tilde{Σ}

in EQ, we have from Equation (62b) and the general additivity

{[Z_{0}]}_{k_{0}} \equiv {[Z]}_{k} + {[\tilde{Z}]}_{\tilde{k}},

obtained by extending Equation (28c), the identity

d [Z_{0}] = d_{e} [Z] + d_{i} [Z] + d [\tilde{Z}] = d_{i} [Z],

(65a)

which shows that

d_{e} [Z] = - d [\tilde{Z}] = - d_{e} [\tilde{Z}]

(65b)

in accordance with Equation (62a). We thus have

d Z_{0 k} = d_{i} Z_{k}, d Z_{0} = d_{i} Z,

(65c)

where all quantities pertaining to

m_{0 k_{0}}

have been reduced (see Definition 4) and we have used the fact that after reduction (see Remark 22),

d_{i} {\tilde{Z}}_{k} = d_{i} \tilde{Z} = 0, \forall k .

For q

= [E]

for a macrostate

M

, we have

d E_{0 k} = d_{i} E_{k}, d E_{0} = d_{i} E_{0} = d_{i} E = 0;

(66)

the last equation follows from Equation (53a).

For q

= [S]

for a macrostate

M

, we have the standard result

d {[S]}_{0} \equiv d_{i} [S],

(67a)

from which we obtain

d S_{0 k} \equiv d_{i} S_{k}

(67b)

giving the internal entropy generation, which has no particular sign, and

d S_{0} = d_{i} S \geq 0

(67c)

for the irreversible entropy generation. We similarly have

d W_{0 k} = d_{i} W_{k}, d Q_{0 k} = d_{i} Q_{k},

(68a)

after reducing all quantities pertaining to

m_{0 k_{0}}

. For a macrostate

M

,

d W_{0} = d_{i} W \geq 0; d Q_{0} = d_{i} Q \geq 0;

(69)

see Equation (145). Here,

d_{i} W

and

d_{i} Q

are the irreversible macrowork done by and macroheat generation due to internal processes in

Σ

; see Theorem 4.

Claim 8.

The nonnegative inequalities for macroquantities

d_{i}

q in the above equations are in accordance with the second law, where ensemble averaging at each instant plays a central role. Because of this relationship with the second law, we call these quantities irreversible. There is no sign requirement for corresponding microquantities

d_{i}

q

_{k}

that do not require such averaging. To make this clear distinction, we call these microquantities simply internal.

The discussion above finally justifies Conclusion 2 of several micro- and macroworks that are distinct in nature. Intuitively, the generalized microwork

d W_{k}

denotes the mechanical work done by the system, a part

d_{e} W_{k} = d_{e} W = - d_{e} \tilde{W} = - d \tilde{W}

(70a)

of which is transferred to

{\tilde{Σ}}_{w}

through exchange and

d_{i} W_{k}

is internally spent to overcome internal processes due to the microforce force imbalance (

μ

FI) within

Σ

. Of the three, only

d W_{k}

and

d_{i} W_{k}

are the outcomes of random variables

d W

and

d_{i} W

, respectively.

Similarly, there are several micro- and macroheats that are distinct in nature. Of

d Q_{k}

, a part

d_{e} Q_{k} = d_{e} Q = - d_{e} \tilde{Q} = - d \tilde{Q}

(70b)

is transferred from

{\tilde{Σ}}_{h}

through exchange and

d_{i} Q_{k}

is internally generated by internal processes within

Σ

. Of the three, only

d Q_{k}

and

d_{i} Q_{k}

are the outcomes of random variables

d Q

and

d_{i} Q

, respectively. Similar comments also apply to

d S_{k}, d_{e} S_{k}

, and

d_{i} S_{k}

.

What has been said above can be summarized as follows (also see Claim 15):

Summary 1.

dq

_{k} = (d S_{k}, d E_{k}, d W_{k}, d Q_{k})

and

d_{i}

q

_{k} = (d_{i} S_{k}, d_{i,} E_{k}, d_{i} W_{k}, d_{i} Q_{k})

are random variables and fluctuate around their respective averagesdq and

d_{i}

q, so they have values on both sides of their averages.

This justifies Remark 22.

3.2. Consequences of Theorem 60

We show the importance of the above theorem about exchange microquantities, which is why they have been extensively exploited in modern NEQ statistical mechanics (

\overset{˚}{μ}

NEQT). We will only consider the case of fixed work parameter so we have fluctuating microforces associated with the random variable

F_{w}

. The discussion is easily extended to fluctuating work parameter. We first consider an NEQ

Σ

in the microstate

m_{k}

. The microwork

d W_{k}

is given in Equation (37a); see also Equation (40). The same equation, applied to

\tilde{Σ}

in the microstate

{\tilde{m}}_{\tilde{k}}

, gives

d {\tilde{W}}_{\tilde{k}} ≐ {\tilde{f}}_{w \tilde{k}} \cdot d \tilde{w} = {\tilde{f}}_{w \tilde{k}} \cdot d_{e} \tilde{w} = - {\tilde{f}}_{w \tilde{k}} \cdot d_{e} w,

(71a)

where we have used the fact that

\tilde{Σ}

is in EQ so

{\tilde{E}}_{\tilde{k}}

does not depend on the internal variable

\tilde{ξ}

(see Equation (28a)), so that

{\tilde{F}}_{w \tilde{k}} \to {\tilde{f}}_{w \tilde{k}},

d \tilde{W} \to d \tilde{w} = d_{e} \tilde{w}

, and

d_{i} \tilde{w} = 0

. We have also used Equation (65b) to set

d \tilde{w} = - d_{e} w

in the last equation. Thus,

d {\tilde{W}}_{\tilde{k}} = d_{e} {\tilde{W}}_{\tilde{k}} = - d_{e} W_{k},

(71b)

where we have also used Equation (70a) to derive the last equation.

Remark 28.

A careful reader will notice that we have an equality between two quantities having different and independent suffixes

\tilde{k}

and k. This implies that we can change one index, say k, and not change

\tilde{k}

. As the equality again remains valid, both sides must be independent of the suffixes. It will be justified later in Section 7.5 in a different way.

Thus,

d {\tilde{W}}_{\tilde{k}} = d \tilde{W} = d_{e} \tilde{W}, d_{e} W_{k} = d_{e} W,

(72a)

and

d_{e} W = - d \tilde{W} = - d_{e} \tilde{W} .

(72b)

This is consistent with Equation (64b) as expected. Explicitly, we have

d_{e} {\tilde{W}}_{k} = - {\tilde{f}}_{w} \cdot d_{e} w = d \tilde{W}, {\tilde{f}}_{w} \equiv \sum_{\tilde{k}} p_{\tilde{k}} {\tilde{f}}_{w \tilde{k}} = f_{0 w}, \forall k,

(73a)

where

f_{0 w}

refers to

Σ_{0}

. Thus, the exchange microwork is

d_{e} W_{k} ≐ f_{0 w} \cdot d_{e} w = d_{e} W, \forall k .

(73b)

We now identify the internal microwork

d_{i} W_{k}

:

d W_{0 k} = d W_{k} + d {\tilde{W}}_{k} = d W_{k} - d_{e} W = d_{i} W_{k},

(74)

and is explicitly given by

d_{i} W_{k} = (f_{w k} - f_{0 w}) \cdot d_{e} w + f_{w k} \cdot d_{i} w + A_{k} \cdot d ξ,

(75)

where we have allowed the possibility of an internal change

d_{i} w = d w - d_{e} w

, similar to

d_{i} ξ = d ξ

. Such a situation arises if

w

refers to polarization or magnetization, which can change due to internal processes.

We now turn to the physical significance of the three different terms in the internal microwork

d_{i} W_{k}

:

The first term is the internal microwork due to force imbalance $f_{w k} - f_{0 w}$ between the SI-microforce of $Σ$ , and the MI-macroforce of $\tilde{Σ}$ .
The second term is the internal microwork due to the internal displacement $d_{i} w$ by the SI-microforce $f_{w k}$ of $Σ$ .
The last term is due to the internal variable displacement by the SI-microaffinity $A_{k}$ .

We introduce the internal microforce imbalance (

μ

FI,

μ

for micro) between

Σ

and

\tilde{Σ}

, and the internal SI-microforce

Δ F_{w k} ≐ (f_{w k} - f_{0 w}, A_{k}), f_{w k},

(76a)

respectively, and the corresponding displacements

(d_{e} w, d ξ), d_{i} w

(76b)

to reproduce Equation (75).

The corresponding macroforce imbalance and the internal macroforce are given by

Δ F_{w} = (f_{w} - f_{0 w}, A), f_{w},

(76c)

with the same displacements as above. Here, we will take a more general view of

A

, and also extend its definition to

X

. For

w

, this means that we can treat

f_{w} - f_{0 w}

also as an affinity. By including

Δ F^{h} ≐ T_{0} - T

also as an affinity [134], we can include it with

Δ F_{w}

to form an extended set of thermodynamic forces or macroforce imbalances [51]:

Δ F ≐ (T_{0} - T, f_{w} - f_{0 w}, A) .

(76d)

Claim 9.

The extended set

Δ F

of thermodynamic forces in Equation (76d) must vanish in EQ. However,

Δ F_{w k}

need not vanish even in EQ.

3.3. Some Simple Examples

As an example, we focus on the case with

W = (V, ξ), \tilde{w} = (\tilde{V})

. The corresponding

f_{0 w}

is replaced by

P_{0}

of

\tilde{Σ}

so that (setting

d_{i} V = 0

)

\begin{matrix} d W_{k} & = P_{k} d V + A_{k} d ξ, \end{matrix}

(77a)

\begin{matrix} d_{e} W_{k} & = P_{0} d V, \forall k, d_{e} W = P_{0} d V; \end{matrix}

(77b)

\begin{matrix} d_{i} W_{k} & = (P_{k} - P_{0}) d V + A_{k} d ξ, \forall k; \end{matrix}

(77c)

we can identify the two internal parts

d_{i} W_{k V} = (P_{k} - P_{0}) d V, d_{i} W_{k ξ} = A_{k} d ξ

(78)

that make up

d_{i} W_{k}

. The corresponding macroworks are given by

\begin{matrix} d W & = P d V + A d ξ, d_{e} W = P_{0} d V, \end{matrix}

(79a)

\begin{matrix} d_{i} W & = (P - P_{0}) d V + A d ξ . \end{matrix}

(79b)

The results

d_{e} W = P_{0} d V, d_{i} W = (P - P_{0}) d V

in the absence of

ξ

are well-known in classical thermodynamics [51]. We identify the irreversible macrowork

d W_{V}

and

d W_{ξ}

due to V and

ξ

, respectively, from Equation (78):

d_{i} W_{V} = (P - P_{0}) d V \geq 0, d_{i} W_{ξ} = A d ξ \geq 0 .

(80)

The above example describes a possible NEQ situation in Figure 3a of a gas of volume V in a cylinder with a movable piston forming the system

Σ

described by

W = (V, ξ)

by considering its microstate

M

very close to

M_{eq}

so that only one internal variable is sufficient to describe it uniquely by treating

M = M_{ieq}

. A possible choice of

ξ

can be rationalized as follows. We imagine the gas to be divided into two parts of volumes

V_{1}, V_{2}

and uniform number densities

n_{1}, n_{2}

, respectively, by an imaginary wall, with the region next to the piston designated as

V_{1}

. The entire volume is not uniform if

n_{1} \neq n_{2}

, which we assume. We now define

ξ ≐ V_{1} / n_{1} - V_{2} / n_{2},

recalling that

V = V_{1} + V_{2}

; see Section 4 for a generalization to describe

M = M_{ieq}

far away from

M_{eq}

that will require many internal variables. In a given microstate

m_{k}

, the SI-pressure

P_{k}

and the affinity

A_{k}

form the corresponding microforce

F_{w k} = (P_{k}, A_{k})

(see Equation (17a)) with

P_{k} = - \partial E_{k} / \partial V, A_{k} = - \partial E_{k} / \partial ξ .

(81)

The corresponding generalized microwork

d W_{k}

is given in Equation (77a). For the medium, the generalized microforce

{\tilde{f}}_{w \tilde{k}} = P_{\tilde{k}}

determines the generalized microwork

d {\tilde{W}}_{\tilde{k}} = P_{\tilde{k}} d \tilde{V} = - P_{\tilde{k}} d V

in Equation (71a). The conditional microforce

{\tilde{f}}_{w k}

from Theorem 1 is equal to

{\tilde{f}}_{w} = P_{0}, \forall k

; here,

P_{0}

is the external pressure on the piston. Thus, the conditional microwork is

d {\tilde{W}}_{k} = d \tilde{W} = - P_{0} d V

so that

d_{e} W_{k} = - d {\tilde{W}}_{k} = P_{0} d V = d_{e} W

given in Equation (77b). The internal microwork in the gas is

d_{i} W_{k}

in Equation (77a).

The irreversible macrowork

d_{i} W = (P - P_{0}) d V + A d ξ

must be nonnegative as we prove in Theorem 4. For this to be true, each term must be nonnegative; see Equation (80). Indeed, it is easy to verify that

d_{i} W_{V} ≐ (P - P_{0}) d V \geq 0

. For

P > P_{0}

, the gas must expand so

d V > 0

. For

P < P_{0}

, the gas must contract so

d V < 0

. In both cases, the product satisfies the inequality.

This will become more clear by the following example of a spring discussed below.

The pressure difference

Δ P = P - P_{0}

(see Figure 3a) plays an important role as a macroforce imbalance in capturing dissipation. Only under mechanical equilibrium do we have the imbalance vanish (

Δ P = 0

). This imbalance is a general feature but its importance at the microstate level in NEQ statistical mechanics has not been recognized. The following examples will make it abundantly clear that a nonzero microforce imbalance like

Δ P_{k} = P_{k} - P_{0}

is just as common even in classical mechanics whenever there is absence of mechanical equilibrium as in thermodynamics, EQ or otherwise. This is because the determination of various microforces and microworks are oblivious to any stochasticity; see Remark 30 for

d_{α} W = - d_{α} E_{w}

. As a consequence, there are no restrictions on the sign of

d_{i} W_{k}

as it is purely a mechanical quantity. Therefore, our second example below covers classical mechanics as well as thermodynamics; see also Conclusion 3.

Consider a general but purely classical mechanical one-dimensional massless spring of arbitrary Hamiltonian

H (x)

with one end fixed at an immobile wall on the left and the other end with a mass m free to move; see Figure 3b with vacuum and no fluid filling the cylinder. We consider a particular microstate

m_{k}

of energy

E_{k}

given by

H

. The center of mass of m is located at x from the left wall. The free end is pulled mechanically by an external force (not necessarily a constant)

F_{0}

applied at time

t = 0

and changes x; thus, x acts as a work parameter. We do not show the center-of-mass momentum p, as it plays no role in determining work.

Initially the spring is undisturbed and has zero SI restoring spring microforce

F_{w k} = - \partial E_{k} / \partial x

; see Equation (17a). The microwork done by

F_{w k}

is the SI-work

d W_{k} = F_{w k} d x

as given in Equation (37a). The total microforce

F_{t k} = F_{0} + F_{w k}

(82)

represents the microforce imbalance (

μ

FI)

F_{t k} ≶ 0

as discussed later in Section 6.4; recall that

F_{0}

and

F_{w k}

point in opposite directions so

F_{t k}

is a difference

Δ F_{w k} = F_{w k} - |F_{0}|

. There is no mechanical equilibrium unless

Δ F_{w k} = 0

and the spring continues to stretch or contract, thereby giving rise to an oscillatory motion that will go on forever. During each oscillation,

Δ F_{w k}

is almost always nonzero, except when the mass is momentarily at the equilibrium (mechanical) position of the spring where

Δ F_{w k} = 0

. The SI-microwork done by

F_{w k}

is the spring work (see Equation (37a))

d W_{k} = - d E_{k},

(83)

while the microwork performed by the external source is

d_{e} W = - F_{0} d x

. Being a purely mechanical example, there is no dissipation. Despite this, we can introduce using our notation

d_{i} W_{k} ≐ d W_{k} - d_{e} W \equiv Δ F_{w k} d x;

(84)

this microwork can be of either sign (no second law here) and represents the work done by the

μ

FI

F_{t k} = Δ F_{w k}

. Thus,

Conclusion 2.

d W_{k}, d_{e} W_{k}

and

d_{i} W_{k}

represent different kinds of mechanical work, a result that has nothing to do with dissipation but only with the microforce imbalance; among these, only the generalized work

d W_{k}

is a SI microwork.

3.4. Manipulations with $d_{α}$

As introduced in Equation (9),

d_{α}

can be applied to micro- and macroquantities in the collection such as

d_{α} E_{k}, d_{α} W_{k}, d_{α} Q_{k}

, etc., and

d_{α} E, d_{α} W, d_{α} Q

, etc.

Definition 21.

Micropartition: The micropartition of the BI-

d θ_{k} (t)

for

Σ_{b}

is given in Equation (14b), in which

d_{e} θ_{k} (t)

is the change due to exchange with its surroundings, and

d_{i} θ_{k} (t) (t)

is the internal change within

Σ_{b}

.

The corresponding partitions for

d E_{k}

and

d S_{k}

are given in Equation (15), and those for

d W_{k}

and

d Q_{k}

in Equation (54a). For a Fl-

W

, we have

d W_{k} ≐ d_{e} W_{k} + d_{i} W_{k} .

(85)

The micropartition also applies to

d p_{k}

:

d p_{k} ≐ d_{e} p_{k} + d_{i} p_{k},

(86a)

We define

d_{α} η_{k} ≐ \frac{d_{α} p_{k}}{p_{k}} .

(86b)

Definition 22.

Macropartition: The macropartition of

d θ (t)

for

Σ_{b}

is given in Equation (14a). It consists of two parts; the exchange

d_{e} θ

is the change due to exchange with its surroundings, and

d_{i} θ

is the irreversible change occurring within

Σ_{b}

.

For the average in Equation (19) or for a NFl-

W

, we have

d W ≐ d_{e} W + d_{i} W .

(87)

In a process,

χ

undergoes infinitesimal changes

d_{α} χ_{k}

at fixed

p_{k}

, or infinitesimal changes

d_{α} p_{k}

at fixed

χ_{k}

. The changes result in two distinct ensemble averages or process quantities.

Definition 23.

Infinitesimal macroquantities

〈d_{α} q〉,

q

\in χ = \{S, Z\}

are ensemble averages

d_{α} q_{m} = 〈d_{α} q〉 = \hat{A} d_{α} q_{k} ≐ \sum_{k} p_{k} d_{α} q_{k},

(88a)

at fixed

\{p_{k}\}

so they are isentropic. They generalize the earlier definition in Equation (36a). We identify them as mechanical macroquantity and write them as

d_{α}

q

_{m}

for brevity. Infinitesimal macroquantities

d_{α} q_{s} ≐ 〈q d_{α} η〉 ≐ \sum_{k} q_{k} d_{α} p_{k},

(88b)

which are ensemble averages involving

\{d_{α} p_{k}\}

with a concomitant change

d S

in the entropy. We identify them as stochastic macroquantities and write them as

d_{α}

q

_{s}

for brevity. Together, they determine the change

d_{α}

q:

d_{α} q \equiv d_{α} \bar{q} ≐ d_{α} q_{w} + d_{α} q_{s}, q \in \{S, Z\} .

(89)

Remark 29.

The above equation shows that we must carefully distinguish

d_{α}

q

= d_{α} \bar{q}

and

d_{α}

q

_{w} = \bar{d_{α} q}

; their difference, the commutator

{\hat{C}}_{α}

q, is the stochastic quantity

d_{α}

q

_{s}

, discussed in Section 10:

{\hat{C}}_{α} q = d_{α} q - d_{α} q_{m};

(90)

see Equation (229).

Remark 30.

For E, the above distinction is the content of the extension of the first law or the law of the conservation of energy

d_{α} E = d_{α} Q - d_{α} W .

(91)

We immediately identify that

d_{α} Q = d_{α} E_{s}, d_{α} W = - d_{α} E_{w} .

(92)

For

d_{α} = d, d_{e}

, we have the SI- and MI-formulation of the first law given by (recall that

d E \equiv d_{e} E

as

d_{i} E \equiv 0

)

\begin{matrix} d E & = d Q - d W, \end{matrix}

(93a)

\begin{matrix} d_{e} E & = d_{e} Q - d_{e} W . \end{matrix}

(93b)

Remark 31.

The SI-formulation of the first law in Equation (93a) shows that

d E

can be uniquely partitioned into a stochastic component

d Q

determined by

d S

and a mechanical component

d W

determined by

d W

, which have independent origins.

Traditionally, the first law is expressed in terms of the change in the energy caused by exchange quantities and is written as

d E = d_{e} Q - d_{e} W .

(94)

As the exchange form of

d E

is written as

d_{e} E

(see Equation (52c)), this is equivalent to the first law in Equation (93b).

We now prove the following important thermodynamic identity as a theorem [75,76,134,148,149].

Theorem 3.

For any NEQ process

P

,

d_{i} Q \equiv d_{i} W \geq 0 .

(95)

Proof.

For

d_{α} = d_{i}

in Equation (91), and using Equation (53a), we have

d_{i} E = d_{i} Q - d_{i} W = 0,

(96)

from which follows the following important thermodynamic identity

d_{i} Q \equiv d_{i} W

. We defer the proof of the inequality to a later part of the review. ☐

The above equality emphasizes the well-known fact (first discovered in 1798 by Count Rumford of Bavaria [165]) that the irreversible macrowork is always equal in its value but not in its cause (see later) to the irreversible macroheat. The inequality is governed by the second law. The analysis also demonstrates the important fact that the first law in Equation (93a) can be applied either to an exchange process in Equation (93b) or to an interior process in Equation (96). Indeed, in the last formulation, the law is also applicable to an isolated system for which it is replaced by

d E_{0} = d Q_{0} - d W_{0} = 0 .

(97)

Definition 24.

For any body

Σ_{b}

, we simply refer to

d W_{k}

and

d Q_{k}

as generalized or BI-microwork and generalized or BI-microheat or simply microwork and microheat, respectively. Similarly, we refer to

d W

and

d Q

as generalized or BI-macrowork and generalized or BI-macroheat or simply macrowork and macroheat, respectively. We will always refer to

d_{e} W_{k}

and

d_{e} Q_{k}

as exchange microwork and exchange microheat, respectively. We use exchange macrowork and exchange macroheat for

d_{e} W

and

d_{e} Q

, respectively. As there is no irreversibility in mechanics, we use internal microheat for

d_{i} Q_{k}

and internal microwork for

d_{i} W_{k}

, respectively; see Claim 8. We never the use the prefix irreversible for these or other internal microquantities. We use irreversible macroheat for

d_{i} Q

and irreversible macrowork for

d_{i} W

, respectively.

As the system

Σ

is of primary interest in the

μ

NEQT, we will always reduce any microquantity associated with

\tilde{Σ}

and

Σ_{0}

to refer to the microstate

m_{k}

. Thus, all microquantities for any

Σ_{b}

will carry the suffix k of

m_{k}

. We will usually refer to

d {\tilde{W}}_{k}

as the external microwork to distinguish it from the microwork

d {\tilde{W}}_{\tilde{k}}

done by

\tilde{Σ}

in its microstate

{\tilde{m}}_{\tilde{k}}

. We will use microenergy change, exchange microenergy change, and internal microenergy change for

d E_{k}, d_{e} E_{k},

and

d_{i} E_{k}

. We will refer to

d_{α} S_{k}

as microentropy change, even though both

d_{α} S_{k}

and

d_{α} Q_{k}

are mixed microquantities; see Remark 14.

4. Internal Variables

Let us consider two noninteracting mechanical systems

Σ_{1}

and

Σ_{2}

that form a composite system

Σ

, which we take to be isolated. We assume that both

Σ_{1}

and

Σ_{2}

are physically “similar” in that each requires the same set of NFl-state variable

W

having r components, so separately they are described by Hamiltonians

E_{1 k_{1}} = H_{1 k_{1}} (W_{1})

and

E_{2 k_{2}} = H_{2 k_{2}} (W_{2})

for

m_{1 k_{1}}

and

m_{2 k_{2}}

of

Σ_{1}

and

Σ_{2}

, respectively. We assume that the number of particles

N_{1} \in W_{1} ≐ (w_{1}, ξ_{1})

and

N_{2} \in W_{2} ≐ (w_{2}, ξ_{2})

are kept fixed in the two microstates so their total N is also fixed for each microstate

m_{k}

of

Σ

given by

m_{k} = m_{1 k_{1}} \otimes m_{2 k_{2}} .

(98)

As the particle numbers are fixed, we do not consider them to be part of the work sets anymore. We choose to express the combined Hamiltonian as

E_{k} ≐ H_{k} (Z_{1}, W_{2}) = H_{1 k_{1}} (W_{1}) + H_{2 k_{2}} (W_{2})

(99)

of

m_{k}

, which is a function of

2 r + 2

state variables (which includes the microenergies

E_{1 k_{1}}

and

E_{2 k_{2}}

of

Σ_{1}

and

Σ_{2}

, respectively), from which we construct the following independent combinations:

Z ≐ Z_{1} + Z_{2}, \hat{ξ} ≐ Z_{1} / n_{1} - Z_{2} / n_{2},

(100)

so that we can equivalently express

H_{k} (Z_{1}, W_{2})

as

H_{k} (\hat{W}, ξ)

of

2 (r + 1)

variables, which excludes

E_{k}

as explained below; here,

n_{1} = N_{1} / N

and

n_{2} = N_{2} / N

,

\hat{W} ≐ W_{1} + W_{2} = (w_{1} + w_{2}, ξ_{1} + ξ_{2})

(101)

is the total initial work variable set, and

ξ

is the new set of internal variables beyond those included in

Z_{1}

and

Z_{2}

. In addition, the excluded

E_{k} ≐ E_{1 k_{1}} + E_{2 k_{2}}

is the microenergy of

m_{k}

, and carries the suffix k. The choice of new arguments for

H_{k} (W_{1}, W_{2})

is convenient as it allows it to be expressed as

H_{k} (W)

in terms of the set formed by

2 r + 1

variables

W ≐ (\hat{W}, \hat{ξ})

(102a)

of the composite system

Σ

, as is also done for

Σ_{1}

and

Σ_{2}

. The set of internal variables

ξ ≐ (ξ_{1} + ξ_{2}, \hat{ξ})

(102b)

denotes the set of internal variables for

Σ

.

Manipulating

W

will change the energy

E_{k}

of

Σ

. Thus,

d E_{k} = \frac{\partial E_{1 k_{1}}}{\partial W_{1}} \cdot d W_{1} + \frac{\partial E_{2 k_{2}}}{\partial W_{2}} \cdot d W_{2} .

(103a)

It is easy to check that

d E_{k}

is also given by

d E_{k} = \frac{\partial E_{k}}{\partial W} \cdot d W = \frac{\partial E_{k}}{\partial \hat{W}} \cdot d \hat{W} + \frac{\partial E_{k}}{\partial ξ} \cdot d \hat{ξ},

(103b)

so both representations of

H_{k}

are equivalent in all ways.

The choice of

\hat{ξ}

in terms of

n_{1}

and

n_{2}

ensures that it vanishes if the two systems form a uniform system

Σ

for which we must have

Z_{1} / N_{1} = Z_{2} / N_{2}

. However, other choices for

\hat{ξ}

can also be made as long as

\hat{ξ}

remains independent of

\hat{W}

.

Let us consider a simple example in which we only allow the energy E and volume V for each each system (

r = 1

). We have

ξ_{V}, ξ_{E k}

as work variables in forming

W

. In this case, we have

E_{k} = E_{1 k_{1}} + E_{2 k_{2}}

for the microstate energy and

V = V_{1} + V_{2}

for the total volume. By definition,

ξ_{E k} = E_{1 k_{1}} / n_{1} - E_{2 k_{2}} / n_{2}, ξ_{V} = V_{1} / n_{1} - V_{2} / n_{2} .

(104a)

The microstate energy

E_{k} (V, ξ_{V}, ξ_{E k}) = E_{1 k_{1}} (V_{1}) + E_{2 k_{2}} (V_{2})

(104b)

is a function of three (

2 r + 1

) variables. We first consider

ξ_{V}

. We have for

P_{k}

, using Equation (81),

P_{k} = n_{1} P_{1 k_{1}} + n_{2} P_{2 k_{2}}, A_{V k} = n_{1} n_{2} (P_{1 k_{1}} - P_{2 k_{2}}),

(104c)

where we have used

V_{1} = n_{1} V + n_{1} n_{2} ξ_{V}

and

V_{2} = n_{2} V - n_{1} n_{2} ξ_{V}

. As V is NFl,

P_{k}

is Fl over

m_{k}

, as we have learned.

We now use

ξ_{E k}

to express

E_{k_{1}} = n_{1} E_{k} + n_{1} n_{2} ξ_{E k}

and

E_{k_{2}} = n_{1} E_{k} - n_{1} n_{2} ξ_{E k}

. Differentiating Equation (104b) with respect to

E_{k}

and

ξ_{E k}

, respectively, and using Equation (42), we obtain

1 = n_{1} + n_{2}, A_{E} = 0,

(104d)

where

A_{E}

(see Equation (18)) is NFl, so it has no suffix k.

As

Σ

is an isolated system, it is deterministic. So the observables (

E_{k_{1}}, E_{k_{2}}, V_{1}, V_{2}

) remain constant, which means that

E_{k}

and

ξ_{E k}, \forall k

, will remain constant in time. If we allow a mutual interaction so that there is a possible energy (or volume) transfer between

Σ_{1}

and

Σ_{2}

, then this will be characterized by oscillating

ξ_{E k}

and

ξ_{V}

due to energy and volume transfers, respectively, back and forth between the two systems. On the other hand, if the interacting

Σ_{0}

become stochastic, as discussed in Section 7, it will obey the second law and

ξ_{E k}

and

ξ_{V}

will eventually vanish. This case is studied later, where it is shown that macroheat flows from hot to cold.

The above discussion can be easily extended to a composite system composed of

m > 2

subsystems by the trick proposed by Gujrati in ([77], Section 3). The trick is very simple. We use the collection

W = (\hat{W}, \hat{ξ})

introduced above for the composite system. We consider two such composite systems, and introduce their work parameters

W_{1}

and

W_{2}

, which are used in Equation (103b) for each one of them. We now treat each as a system so that we have two new systems

Σ_{1}

and

Σ_{2}

that form a new composite system

Σ

. We use

W_{1}

and

W_{2}

to obtain the new collection of

(\hat{W}, \hat{ξ})

as introduced above. This set defines a new

W = (\hat{W}, \hat{ξ})

for the new composite system, which now has

m = 4

subsystems. We then treat two such composite systems and treat each as a system to form another new composite system with

m = 8

, and so on to finally consider a composite system formed of m subsystems. We thus claim the following:

Claim 10.

The internal energy

E_{k}

of the microstate

m_{k}

of a composite system of m subsystems is a function of the work set

\{W_{1}, W_{2}, \dots, W_{m}\}

composed of their work parameters, and can be expressed as a function of

\hat{W} ≐ W_{1} + W_{2} + \dots + W_{m}

and a set

\hat{ξ}

of internal variables [77]; together, they form the set

W

for the composite system, as shown in Equation (102a).

Claim 11.

We see that the new combination

ξ

is the set of internal variables, which also plays an important role in the unique description of the composite system. As the uniqueness is just as important in a thermodynamic consideration, which will be taken up in the following sections, internal variables will play just as important a role there as here.

The above discussion is for a mechanical system with no interaction, but is easily extended to the case in which the two systems are interacting, as will be done in the following sections. The internal variables discussed above relate to a particular microstate

m_{k}

so some of them may carry the suffix k, and should be denoted as a internal microvariable

ξ_{k}

. To see this, we recall that the microenergy

E_{k}

carries the suffix k so any internal variable formed from microenergies of

Σ_{1}

and

Σ_{2}

will carry it as was the case for

ξ_{E k}

constructed above. The discussion is also easily extended to include thermodynamics, where the internal macrovariable

ξ

obeys the restrictions imposed by the second law; see Equation (43) and Corollary 1. In this case,

W_{k_{l}}

of the lth subsystem will also include the internal variable

ξ_{k_{l}}

, not to be confused with

ξ_{k}

for the system. It is clear that the complications due to

ξ_{k_{l}}

are avoided if each subsystem is in EQ so that

ξ_{k_{k}}

’s do not exist, as was the simple example considered above. Then there is a maximum number

n^{*}

of internal macrovariables in

ξ

that is determined by m. This has been discussed in recent publications [77,78], to which we refer the reader. By the addition of the suffix, it should be obvious that the above discussion is easily extended to Fl work parameter, such as Fl volume

V_{k}

for

m_{k}

, so that all microstates experience the same pressure P; see Equation (18). Thus, the above concept of internal variables is quite general. However, for the notational simplicity, we will not add the suffix to

W

and

ξ

unless needed for clarity by clearly specifying the situation.

5. Fundamentals of the $μ$ NEQT

In this section, we will usually talk about a system, but the discussion is valid for any body

Σ_{b}

. The most convenient and most common framework of describing a thermodynamic system

Σ

is in terms of the SI-set

X = (E, V, N, \dots)

of its extensive macroscopic observables, which results in the SI-set

f

of the generalized macroforces (see Equation (25)) and the state space

S_{X}

that is sufficient to uniquely describe the EQ system and its macrostate

M_{eq}

. A very important SI quantity in thermodynamics is the entropy S that in EQ is uniquely determined by

X

so that

S_{eq} ≐ S (X)

is a state function of

M_{eq}

. For an NEQ macrostate

M

, S will not be a state function in

S_{X}

, so it will depend explicitly on time. In this case,

X

no longer forms the set of state variables to uniquely describe

M

in

S_{X}

, and both

M

and S have an explicit t-dependence; see Equation (141) for the latter. This is true whether the system is noninteracting (i.e., isolated) or interacting (i.e., interacts with a medium

\tilde{Σ}

, which is external to the system

Σ

); see Figure 1.

With respect to microstates

m_{k}

, the interaction between

Σ

and

\tilde{Σ}

causes MI-exchange

[d_{e} X]

, which is then used to identify

[d_{i} X] ≐ [d X] - [d_{e} X]

; see Notation (11a). In general, the SI-change

[d q]

can be partitioned into

[d_{e} q]

and

[d_{i} q]

in accordance with Equations (14b) and (14a), respectively, in which the MI-exchange between

Σ

and

\tilde{Σ}

is caused by their interaction and

[d_{i} q]

is the change brought about by internal processes within

Σ

. In particular,

d_{i} q_{k}

represents the internal microchange, while

d_{i} q

the irreversible macrochange. The SI-force corresponding to

w

is

[f_{w}]

; see Equation (37c). There is no microanalog of

f_{s}

introduced in Equation (24).

The above discussion is restricted to any

M_{eq}

that is uniquely specified in

S_{X}, X = (E, w)

. In an NEQ macrostate

M_{neq}

,

S_{X}

is no longer a convenient state space as it cannot specify

M_{neq}

NEQ macrostate uniquely. This loss of uniqueness for

M_{neq}

has been a major obstacle in formulating an NEQ thermodynamics that can be as robust and complete as the classical EQ thermodynamics. All competing NEQT approaches belong to

\overset{˚}{M} NEQT

as discussed in Section 1 and deal only with exchange quantities that can be uniquely described in

S_{X}

, as the medium

\tilde{Σ}

is always taken to be in EQ. Thus, they cannot offer any help to overcome the nonuniqueness of

M_{neq}

.

We consider this loss of uniqueness to be the main issue in improving our current incomplete understanding of NEQ processes. Our approach to overcome this loss is to describe

M_{neq}

in an appropriately enlarged state space to

S_{Z}

by including internal variable set [12,13,18,42,51,108,134,148,166,167,168]

ξ

and identifying

Z ≐ X \cup ξ

as the set of state variables to uniquely specify

M_{neq}

. The internal variables also play a very dominant role in glassy and granular materials [169,170,171,172,173]. In all previous theories involving internal variables, they are introduced almost in an ad hoc manner without providing any physical insight into their origin. In contrast, our approach to introduce them differs from other approaches by providing a very clear and physical prescription, as discussed in Section 4. As

M_{eq}

describes a uniform system [33],

M_{neq}

invariably requires some sort of nonuniformity, as in a composite system

Σ = \cup_{i} Σ_{i}

composed of various subsystems

Σ_{i}

. At the mechanical level, this nonuniformity is captured by the parameters of the SI-Hamiltonians of

Σ_{i}

, as was the case with two subsystems in Equation (99). The internal variables as they appear in Equation (100) are mathematically required to ensure that the number of independent variables on both sides in Equation (99) are exactly the same. While their forms may not be unique, they must be independent. In terms of

Z

, we now have a complete SI-specification of

m_{k}

of

Σ

, assuming a certain choice of

ξ

. This is the uniqueness we are looking for to develop the NEQ statistical mechanics. As discussed in Section 4,

ξ

cannot be controlled from the outside of

Σ

. Therefore, its variation is due to internal processes only and may be controlled by the second law. It should be obvious from the discussion in Section 4 that

ξ

for a purely mechanical system such as

m_{k}

cannot have any connection with the second law. Only in the presence of stochasticity required for a thermodynamic system will its average behavior be governed by the second law, so it also plays an important role in our approach. However, the requirement of including internal variables for a complete specification is a mechanical necessity due to nonuniformity, but becomes critical in the NEQ statistical mechanics. We direct readers to Section 5.7 for a simple example that clarifies its importance.

In the following, we will be considering the state space

S_{Z}

in which the entropy is a state function

S (Z)

so that we will be dealing with

M_{ieq}

; see Definition 13. This means that

\{p_{k}\}

are uniquely defined to specify

M_{ieq}

. However,

\{m_{k}\}

themselves are independent of this particular choice of

\{p_{k}\}

, simply because

\{m_{k}\}

are determined by the deterministic Hamiltonian of

Σ

as discussed in Section 1, so they remain oblivious to their probabilities. It is this independence of

\{m_{k}\}

and

\{p_{k}\}

that allows us to develop the

μ

NEQT as a mechanical theory that is modified by stochasticity by extending the conventional similar approach in the

μ

EQT [33,54].

Let us consider an infinitesimal change

d Z

in

Z

that takes

M_{ieq}^{'} = M_{ieq} (Z)

to

M_{ieq}^{″} = M_{ieq} (Z + d Z)

both belonging to

S_{Z}

. If the system always stays within

S_{Z}

during this change, then the change is carried out along an IEQ process in

S_{Z}

. It is

M_{ieq}

during this change so that

d W_{k} = - d E_{k}

. If intermediate macrostates leave

S_{Z}

during this change, then the change is not carried out along an IEQ process in

S_{Z}

. Nevertheless, the microenergy change

d E_{k} = - d W_{k}

between

M_{ieq}^{'}

and

M_{ieq}^{″}

is the same in both situations. In other words,

d E_{k} = - d W_{k}

is the same between

M_{ieq}^{'}

and

M_{ieq}^{″}

, regardless of the nature of the process.

We will focus on an isolated composite system

Σ

in microstate

m_{k}

made of two subsystems

Σ_{1}

in microstate

m_{k_{1}}

and

Σ_{2}

in microstate

m_{k_{2}}

; recall Remark 25. Following from Remarks 21 and 26, we now conclude that

d q_{k} \equiv d_{i} q_{k} = d q_{k 1} + d q_{k 2} .

In particular, we have

d_{i} W_{k} = d W_{k_{1}} + d W_{k_{2}};

(105)

we can use Equation (37a) for NFl

W_{l}

and Equation (38) for Fl

W_{k_{l}}, l = 1, 2

, to determine

d W_{k_{l}}, l = 1, 2

.

Let us consider one of the above three bodies and focus on its

W

. For NFl

W

, the corresponding generalized microforce

F_{w k}

is Fl as shown in Equation (17a). For Fl

W_{k}

, the corresponding generalized microforce

F_{w}

is NFl, as shown in Equation (18). Including

E_{k}

, which is always FL, we see that

Z

for the body is Fl in the latter case.

As shown in Equation (20), the BI-microwork

d W_{k} = F_{w k} \cdot d W

and

d W_{k} = F_{w} \cdot d W_{k}

defined mechanically as force × displacement in the two cases are the same, and are fluctuating over

\{m_{k}\}

as expected due to the ubiquitous Fl microforce and Fl work parameter, respectively. The mechanically defined macrowork

d W

in each case will result in the irreversible macrowork

d_{i} W \geq 0

in accordance with the second law. It follows from Equation (105) that each side represents a mechanical microwork, showing that even

d_{i} W_{k}

is a mechanical quantity. It follows from Theorem 6 that

d_{i} E_{k} = - d_{i} W_{k}

, again emphasizing that

d_{i} W_{k}

has a mechanical origin. However, the second law puts no restriction on the Fl mechanical microanalog

d_{i} W

. For the example of the spring with the force imbalance given in Equation (82) with NFl x, the internal microwork is given in Equation (84) and can be of any sign according to the signature of the internal microforce imbalance

Δ F_{w k}

. In the presence of any microforce imbalance (see Conclusion 2) in an NEQ system,

d_{i} W_{k}

will not vanish, even if its average does. The following Remark emphasizes these points.

Remark 32.

The internal microwork

d_{i} W_{k}

within an isolated Σ due to Fl internal microforces or Fl work parameter is ubiquitous. Its presence has a purely mechanical origin, as seen in Equation (84) or in Equation (78) for NFl

W

. For Fl

W_{k}

, because of their mechanical nature, different additive parts of

d_{i} W_{k}

given in (78) are independent of

p_{k}

in that they remain the same between

M_{ieq}^{'}

and

M_{ieq}^{″}

, both belonging to

S_{Z}

, regardless of the processes between them. Despite this, the macroscopic analogs of each of these parts and

d_{i} W

are controlled by the second law; see Corollary 1. It follows that in general, determining

d_{i} W_{k}

from SI-

d W_{k}

will be a convenient way to discuss the statistical mechanics of NEQ systems; see Section 2.

We now put down the set of axioms for the formulation of the

μ

NEQT that are in addition to the axioms put forward by Callen [3]. Callen only discusses a system in equilibrium, so his two most important axioms are about the existence of the entropy function and of the stable equilibrium for EQ macrostates. We extend these axioms to NEQ macrostates below.

Axiom 1.

Fundamental Axiom The thermodynamic behavior of a system is not the behavior of a single sample, but the average behavior of a large number of independent samples, prepared identically under the same macroscopic conditions at time

t = 0

.

Axiom 2.

Axiom of Entropy Function Existence There exists an entropy function

S (M)

for

M

in any state space, which may be a function of the state variables in that state space and time t.

Axiom 3.

Axiom of IEQ Any

M_{neq}

in

S_{Z}

can always be turned into a unique

M_{ieq}

in a suitably enlarged state space

S_{Z^{'}} \supset S_{Z}, Z^{'} = Z \cup ξ^{'}

so the thermodynamic and statistical entropies are identical; see Proposition 1 and Section 12.6 for details.

Axiom 4.

Axiom of Stability The unique macrostate

M_{ieq}

for a given

Z

is stable in

S_{Z}

in that the system does not leave it if already there or returns to it if disturbed. A stable macrostate satisfies the stability conditions

d^{2} S < 0, d^{2} E > 0 .

(106)

If we consider the matrix

J

formed by

\partial^{2} S / \partial Z_{j} Z_{j^{'}}

, or the matrix

K

formed by

\partial^{2} E / \partial ζ_{j} ζ_{j^{'}}

, then all the principle minors of the determinant of

J

must be strictly negative, or the determinant of

K

must be strictly positive. By allowing

Z

to vary,

M_{ieq}

moves to the most stable macrostate

M_{eq}

, in which all thermodynamic forces (see Equation (76d)) vanish.

We do not consider the stability border

d^{2} S \to 0, d^{2} E \to 0

in the review.

It is an observed fact that nature, in her inorganic as well as organic forms, is driven towards greater stability. This tendency is just as ubiquitous in physics as it is in biology. Anything in nature that is capable of changing always changes eventually into an unchanging stable form, even in an explosion. This is also true of the Belusov reaction [51], undergoing oscillations initially but eventually ending into a stable macrostate.

Axiom 5.

Axiom of quasi-additivity Any quantity

[q]

satisfies the principle of quasi-additivity

[q] \approx \sum_{j} {[q]}_{j} .

(107a)

The above axiom also applies to

[S]

, the entropy, but requires the following additional axiom of quasi-independence, to be discussed later in Section 7.3.

Axiom 6.

Axiom of Quasi-independence For entropy to be quasi-additive, as

[S] \approx \sum_{j} {[S]}_{j},

(108)

requires the property of quasi-independence (see Claim 7) between different parts of the system.

Axiom 7.

Axiom of Reduction All microquantities carrying the suffix

\tilde{k}

and

k_{0}

, and associated with

\tilde{Σ}

and

Σ_{0}

, respectively, must be reduced to microquantities carrying the suffix k under the condition that

Σ

is in the microstate

m_{k}

in order to assess their influence on

m_{k}

.

The discussion of the rules for reduction is postponed to Section 7.4.

5.1. Fundamental Axiom

To avoid any influence of the possible changes in the system brought about by measurements, we instead prepare a large number

N_{S}

of samples or replicas under identical macroscopic conditions. The replicas are otherwise independent of each other in that they evolve independently in time. This is consistent with the requirement that different measurements should not influence each other. In the rest of this review, we will use the same term ensemble to collectively represent the samples. The average over these samples of some thermodynamic quantity then determines the thermodynamic property of the system. As the replica approach plays a central role in our formalism, we state its importance as Axiom 1, which was first proposed in [79].

Such an approach is standard in equilibrium statistical mechanics [11,33,34,36,54], but it must also apply to systems not in equilibrium. For the latter, this averaging must be carried out by ensuring that all samples have identical history, i.e., prepared at the same time

t = 0

. This is obviously not an issue for systems in equilibrium. We refer the reader to a great discussion about the status of statistical mechanics and its statistical nature by Tolman ([54], Section 25), where he clearly puts down this viewpoint of statistical mechanics as follows. We quote from p. 65:

“The methods are essentially statistical in character and only purport to give results that may be expected on the average rather than precisely expected for any particular system.....The methods being statistical in character have to be based on some hypothesis as to a priori probabilities, and the hypothesis chosen is the only postulate that can be introduced without proceeding in an arbitrary manner....”

Tolman [54] then goes on to argue on p. 67 that what statistical mechanics should strive for is to ensure

“...that the averages obtained on successive trials of the same experiment will agree with the ensemble average, thus permitting any particular individual system to exhibit a behavior in time very different from the average;”

see also the last paragraph on p. 106 in Jaynes [174].

5.2. Parameter Description

As said earlier, E is always treated as a random variable E taking the values

\{E_{k}\}

that fluctuate over

\{m_{k}\}

, regardless of how

W

is treated. The most convenient description of a system is to use the NFl-

W

so it is the same for all

\{m_{k}\}

. Per Claim 3, this results in a random SI-variable

F_{w}

with its outcome

F_{w k}

(see Equation (17a)) fluctuating over

\{m_{k}\}

so its ensemble average is the generalized (mechanical) macroforce

F_{w}

; see Equation (17b). In contrast, the conjugate field

β = 1 / T

for Fl-E is fixed.

It is possible to use a mixed parameter approach. We consider

W

having two nonoverlapping subsets

W_{1}

and

W_{2}

, with

W_{1}

a NFl-parameter

W^{NF}

. The remaining subset

W_{2}

is Fl-parameter set

W^{F}

taking the values

\{W_{k}^{F}\}

over

\{m_{k}\}

. We impose the consistency condition on

W_{k}^{F}

(see Claims 3 and 5) so that the corresponding field

F_{w k}^{F} = - \partial E_{k} / \partial W_{k}^{F} = F_{w}^{F}, \forall k

; see Equation (18). For a null set

W^{NF}

, we retrieve the field-parameter description in Claim 3. As before, the consistency requires obtaining the same MNEQT, so we must have

〈W^{F}〉 = W_{2}, 〈F_{w}^{NF}〉 = F_{w 1};

(109)

see Condition 1.

To clarify the above distinction, we consider the simpler case of NFl-

W = (V, ξ)

for a system. The energy E is a random variable E taking Fl-values

\{E_{k}\}

; their average value is determined by a fixed

f_{s} = - T

; see Equation (24) and Claim 5. In this ensemble,

T, V

and

ξ

are fixed so we can also call it a

(T

-V-

ξ)

-ensemble. In this case,

E_{k}, P_{k}

, and

A_{k}

are fluctuating over

\{m_{k}\}

. If we take

W^{NF} = (ξ)

and

W^{F} = (V)

, then

E_{k}, V_{k}

, and

A_{k}

are fluctuating over

\{m_{k}\}

with

T, P

and

ξ

kept fixed in this ensemble, which we can call a

(T

-P-

ξ)

-ensemble. We can also consider an ensemble with

W^{NF} = (V)

and

W^{F} = (ξ)

. In this ensemble,

E_{k}, P_{k}

, and

ξ_{k}

are fluctuating over

\{m_{k}\}

;

T, V

and A are kept fixed so we can call it a

(T

-V-

A)

-ensemble. For these ensembles to represent the same physical system thermodynamically, we must have

V = 〈V〉, P = 〈P〉, ξ = 〈ξ〉

, and

A = 〈A〉

in accordance with Equation (109).

Remark 33.

An NEQ ensemble is specified by the set of its NFl quantities

W^{NF}

and

F_{w}^{F}

.

5.3. Ensemble of Replicas

The discussion here provides an extension of the ideas valid for thermodynamic equilibrium macrostates

M_{eq}

to not only nonequilibrium macrostates

M_{neq}

but also to macrostates

M_{\det}

. The latter are governed by deterministic dynamics in which microstate probabilities remain constant, as will be justified below; see Claim 12. The premise of the extension to

M_{neq}

is that these ideas must be just as valid for them, as they are based on thermodynamics being an experimental science [79]. Thermodynamics (equilibrium and nonequilibrium) requires verification by performing the experiment many times over. The same premise also applies to

M_{\det}

. Therefore, we consider all these macrostates in the following, and simply use

\bar{M}

to stand for all these states. We must prepare many copies or replicas

N > > 1

of the system at the same time t under identical conditions specified by the set of extensive variables

Z (t)

that can be used to also study how the system evolves in time. We identify a replica as simply representing an “instantaneous state” of the system, i.e., one of the microstates

m_{k}

. The collection of all replicas at each instant t is the ensemble, which is specified by the set

Z (t)

and

N

. The ensemble then becomes the representation of the macrostate

\bar{M}

. Any quantity q

(Z, t)

of interest associated with

\bar{M}

is then identified as an instantaneous average over these replicas or samples, and is an explicit function of the set

Z

and possibly t. For simplicity, we will usually suppress

Z

and only exhibit the explicit dependence on t in q. By definition, the ensemble average is given by

q (t) or \bar{q} (t) or < q > (t) ≐ \frac{1}{N} \sum_{k = 1}^{W} N_{k} (t) q_{k},

(110)

where q

_{k}

is the value of q in the kth microstate

m_{k}

,

N_{k} (t)

denotes the number of samples in the kth microstate

m_{k}

at time t, and W is the total number of distinct microstates, which we assume is finite right now. We also assume

\{N_{k} (t)\}

to be a countable set. It should be obvious that

N > > W

for the above definition to make sense. The overbar on or the angular bracket around q in Equation (110) are used to indicate the average q, which is also represented simply as q, following the acceptable tradition in thermodynamics. We will use all three notations to indicate the average in this review as need be.

5.4. Concept of Probability

We now introduce the concept of ensemble probability

p_{k} (t) \equiv lim_{N \to \infty} N_{k} (t) / N, \sum_{k = 1}^{W} p_{k} (t) \equiv 1,

(111)

which is valid even if

W \to \infty

. As is well-known [114], the probabilities require the formal limit

N \to \infty

, which is going to be implicit in the following. This justifies Equation (12).

It should be stressed that the concept of probability introduced in Equation (111) is also valid for a Hamiltonian system with deterministic dynamics. All one needs to do is to prepare an ensemble with a given number

N_{k}

of replicas. As these numbers will not change because the dynamics is deterministic,

p_{k}

will not change.

It should be noted that

m_{k}

, and hence the value q

_{k}

on it, depend on

Z (t)

explicitly, but may also depend on t explicitly. In general,

p_{k} (t)

will be time-dependent as determined by the history of the process. They become history-independent and constant in time t for

M_{eq}

. As we will soon see, they remain constant in a mechanical evolution of

M_{\det}

. In this sense, there is a close parallel between

M_{eq}

and

M_{\det}

, as discussed below.

The average of the state variable

Z

, using the tradition in thermodynamics, is simply written as

Z

(see Equation (110)):

Z \equiv \sum_{k = 1}^{W} p_{k} (t) Z_{k};

(112)

here

Z_{k}

is the value of

Z

in

m_{k}

. We will also extend this tradition to

F_{w}

in Equation (40) so that

F_{w} \equiv \sum_{k = 1}^{W} p_{k} (t) F_{w k},

(113)

where, as usual,

F_{w k}

is the value of

F_{w}

in

m_{k}

.

Claim 12.

The

p_{k}

defined above in Equation (111) remains a constant of motion for a deterministic system.

This is easy to rationalize as follows. Consider a collection of microstates

\{m_{k}\}

of a system with

\{N_{k}\}

copies at some initial time

t = 0

. In a deterministic evolution,

N_{k}

’s do not change, which justifies the above claim.

Definition 25.

To distinguish the usage of constant probabilities for deterministic systems with the usage of probabilities for thermodynamic systems, where they may change spontaneously without any external intervention, we will use the term stochastic for this aspect of probabilistic behavior in

M

, but not in

M_{\det}

.

We clarify this point further. Consider an isolate system that is not in EQ. This means that, according to the Boltzmann principle, not all microstates are equally probable. In time, the system will come to equilibrium by ensuring that all microstates become equally probable. This shows how a thermodynamic system behaves in a way that allows

p_{k}

to change in time even without any external intervention. For a deterministic system such as a loaded die, this will never happen even if it is disturbed by the performance of mechanical work, like throwing, an external intervention.

For a thermodynamic system in EQ,

\{p_{k}\}

remains invariant (constant) in time. In this regard, such a system is identical to a deterministic system that obeys Liouville theorem [164], since it is well-known that an EQ system also obeys the theorem [33]. The reason is very simple. The various members of the above ensemble in EQ occupy various microstates with equal probability with the maximum entropy as shown in Section 5.5. This entropy remains a constant of motion for the EQ system.

Remark 34.

An EQ macrostate

M_{eq}

under fixed conditions of the surroundings so

p_{k}

’s do not change is no different than a deterministic macrostate

M_{\det}

, except that the former has a well-defined notion of temperature but the latter has no such notion.

5.5. Statistical Entropy for $M (t)$

We provide a very general statistical formulation of S for a general system

Σ

that is applicable to mechanical as well as thermodynamic systems. It will be shown to be identical to the thermodynamic entropy S by appealing to the third law. Our derivation demonstrates that the concept of entropy in general is of a statistical nature. We consider a state

M (t) \equiv M (Z (t), t)

of

Σ

at a given instant t. We focus on a macrostate

M (t)

of

Σ

at a given instant t, which refers to the sets

m = \{m_{k}\}

and

p = \{p_{k}\}

of microstates and their probabilities, respectively. We consider Fl-

W

but the discussion is also valid for NFl-

W

by simply setting

W_{k} = W, \forall k

. The microstates are specified by

(E_{k} (t), W_{k} (t))

, which along with

p

need not uniquely specify the macrostate

M (t)

. In the following, we will use the set

Z (t)

for

m

for simplicity. We will also denote

Z (t)

by

\bar{Z}

so that we can separate out the explicit variation due to t in addition to the implicit variation in t due to

\bar{Z}

, if any. For simplicity, we suppress t in

M

in the following. For the computation of combinatorics, the probabilities are handled as described in Section 5.4. We follow the notation used there, choosing

N = C W (\bar{Z})

with

C

some large integer constant, and

W (\bar{Z})

the number of distinct microstates

m_{k}

in the ensemble or the sample space

Γ (\bar{Z})

spanned by

\{m_{k}\}

. We will see that

W (\bar{Z})

is determined by

m_{k}

’s having nonzero probabilities [79]. We will call them available microstates.

The ensemble

Γ (\bar{Z})

above is a generalization of the ensemble introduced by Gibbs, except that the latter is restricted to an equilibrium system, whereas

Γ (\bar{Z})

refers to the system in any arbitrary macrostate so that

p_{k}

in Equation (111) may be time-dependent, and may not be unique. The samples are, by definition, independent of each other so that there are no correlations among them. Because of this, we can treat the samples in

Γ (\bar{Z})

to be the outcomes of some random variable, the macrostate

M (t)

. This independence property of the outcomes is crucial in the following. Each sample of

M (t)

is one of a microstate in

Γ (\bar{Z})

. They may be equiprobable but not necessarily. The number of ways

W

to arrange the

N

samples into

W (\bar{Z})

distinct microstates is

W \equiv N! / \prod_{k} N_{k} (t)! .

(114)

Taking its natural log, as proposed by Boltzmann, to obtain an additive quantity per sample as described in Section 5.6 (see also Axiom 6), we obtain

S \equiv (1 / N) ln W,

(115)

and using Stirling’s approximation, we see easily that it can be written as the ensemble average (see Equations (12) and (26a)),

S (\bar{Z}, t) \equiv - 〈η (t)〉 \equiv - \sum_{k = 1}^{W (\bar{Z})} p_{k} (t) ln p_{k} (t),

(116)

of the negative of Gibbs’ index of probability ([48], p. 16)

η_{k} (t) ≐ ln p_{k} (t) .

(117)

We have shown an explicit time dependence in S, which is distinct from the implicit time dependence in

\bar{Z}

, to merely express the fact that it may not be a state function in

S_{\bar{Z}}

, i.e., that

M

may not be uniquely specified in

S_{\bar{Z}}

. The above derivation clearly shows that Equation (116), which is identical in form to Equation (26a), justifies the latter for an arbitrary

M

.

The identification of entropy in Equation (116) with the Gibbs formulation of entropy is a time-honored practice for nonequilibrium states since the days of Gibbs ([48] see, in particular, chapters 11 and 12, where time dependence is discussed), and has been discussed by Tolman ([54], Ch. 13, and in particular pp. 538–539), Jaynes [174], Rice and Gray [55], and Rice [57], to name a few. There is no restriction on

p_{j} (t)

. In particular, they do not have to be given by probabilities valid for equilibrium states; see also Sethna ([36], Section 5.3.1). The definition merely follows from the observation that the index of probability is an additive quantity for independent replicas (see Fundamental Axiom) and that the entropy is merely its average value (with a negative sign). Tolman takes great care in establishing that this formulation of the entropy satisfies the second law ([54], Section 130). Tolman also shows that the Boltzmann definition of entropy is a special case of the general formulation due to Gibbs ([54], see the derivation of Equation (131.2)), just as we have argued; see Equation (208).

The identification of the entropy with the negative of the Boltzmann H-function ([54], see p. 561), the latter describing a nonequilibrium state, should leave no doubt in anyone’s mind that the Gibbs formulation of the entropy can be applied equally well to an equilibrium or a nonequilibrium system. Nevertheless, we should point out that not all subscribe to this viewpoint of ours about the Gibbs formulation of entropy, because they insist that the Gibbs entropy is a constant of motion [135]. This constancy follows immediately from the application of Liouville’s theorem in classical mechanics [32,33,34,36,54], valid for a system described by a Hamiltonian, as discussed above and as we have already discussed in Section 5.4. We thus see that our formulation of the entropy in EQ is consistent with this theorem.

The above derivation is based on fundamental principles of combinatorics and additivity, and does not require the notion of equilibrium or nonequilibrium in the system; therefore, it is always applicable for any arbitrary macrostate

M (t)

including that of a determining system; see Claim 12. To the best of our knowledge, even though such an expression has been extensively used in the literature for NEQ entropy, it has been used by simply appealing to the information entropy [72,175]. Thus, Equation (116) is a generalization of Equation (26a) to the general case, and thus justifies it for

M (t)

. We now generalize Claim 12 as follows:

Claim 13.

The probability

p_{k}

and the Gibbs entropy (see Equation (26a)) is easy to define for a

M (t)

including that of a deterministic Hamiltonian system. As the probability and the entropy for

M_{\det}

do not change as a function of time, we show in Section 10.1 that the concepts of microheat and macroheat cannot be associated with a Hamiltonian system, although the concepts of microwork and macrowork are defined.

The distinction between the Gibbs’ statistical entropy and the thermodynamic entropy should be emphasized. The latter appears in the Gibbs fundamental relation that relates the energy change

d E

with the entropy change

d S

, as is well-known in classical thermodynamics, and as we will also demonstrate below; see also Equation (93a). The concept of microstates is irrelevant for this, as it is a purely thermodynamic relation. On the other hand, the Gibbs’ statistical entropy is solely determined by

\{m_{k}\}

, so it is a statistical quantity. It then becomes imperative to show their equivalence, mainly because the statistical entropy is based on the Boltzmann idea. This equivalence has been justified elsewhere [75,76], and is summarized in the following Remark.

Remark 35.

Because of this equivalence, we will no longer make any distinction between the statistical Gibbs entropy and the thermodynamic entropy and will use the standard notation S for both of them for a macrostate

M_{ieq}

, of which

M_{eq}

is a special case.

Remark 36.

The Gibbs entropy appears as an instantaneous ensemble average; see Definition 7. This average should be contrasted with a temporal average in which a macroquantity q is considered as the average over a long period

τ_{0}

of time

q = \frac{1}{τ_{0}} \int_{0}^{τ_{0}} q (t) d t,

where q

(t)

is the value of q at time t [33]. For an EQ macrostate

M_{eq}

, both definitions give the same result provided ergodicity holds. The physics of this average is that q

(t)

at t represents a microstate of

M_{eq}

. As

M_{eq}

is invariant in time, these microstates belong to

M_{eq}

, and the time average is the same as the ensemble average if ergodicity holds. However, for an NEQ macrostate

M_{neq} (t)

, which continuously changes with time, the temporal average is not physically meaningful as the microstate at time t corresponds to

M_{neq} (t)

and not to

M_{neq} (t = 0)

in that the probabilities and

Z

are different in the two macrostates. Only the ensemble average makes any sense at any time t, as discussed in [176]. Because of this, we only consider ensemble averages in this review.

A word of caution must be offered. If S is not a state function, it cannot be measured or computed. Thus, while the statistical entropy can be computed in principle in all cases if

\{p_{k}\}

is known, there is no way to compare its value with thermodynamic entropy in all cases. Thus, no comment can be made about their relationship in general for an arbitrary

M (t)

. We have only established their equivalence for

M_{ieq}

for which the two entropies are the same.

Remark 37.

We have summarized our approach for an arbitrary macrostate in Axiom 3, which allows us to identify the two entropies in all cases. Thus, we only need to investigate the μNEQT for

M_{ieq}

to also cover

M

; see Section 5.9.

5.6. Principle of Additivity

5.6.1. Additivity

We consider a system

Σ

consisting of two nonpenetrating sub-bodies

Σ_{1}

and

Σ_{2}

at present, each specified by

W_{1}

and

W_{2}

. Later, we will generalize to any number of sub-bodies

Σ_{j}

. The principle of additivity states that

Σ

is specified by

W

given by

W ≐ \sum_{j} W_{j} .

(118a)

This principle is self-evident for nonpenetrating systems. For example, the number of particles

N \equiv \sum_{j} N_{j}

(118b)

remains an identity. (This remains true even if the bodies are interpenetrating, for which the volumes may not be additive). For nonpenetrating bodies, however, their volumes become additive:

V = \sum_{j} V_{j},

(118c)

which we will assume in this review. We will call the case of nonpenetrating bodies the discrete approach. It is evident that in this approach, the principle of additivity is valid for any number of sub-bodies

Σ_{j}, j = 1, 2, \dots

. In this case, the sum in the above equations is over all sub-bodies.

We now show that the above sample average in Equation (110) also follows immediately from the principle of additivity of quantities that are additive; see Claim 2. One considers a very large macroscopic system

Σ_{0}

of

N_{0} \equiv

N N

particles and imagines dividing it into a large number

N

of macroscopically large and nonoverlapping parts of equal size N, each representing a microstate of the system

Σ

. As the parts are macroscopically large, they will act almost independently; see Section 7.3 for details. How well this condition is satisfied depends on how large the parts are. In principle, they can be made arbitrary large to ensure their complete independence. At the same time t, these parts will be in microstates

m_{k}

of

Σ

with probabilities

p_{k} (t)

. The additivity principle states that any extensive thermodynamic quantity

X (t)

of the system

Σ_{0}

is the sum of this quantity over its various macroscopically large parts. This principle is consistent with the definition of the average in (110). One can also think of the

N

parts as representing the same measurement that has been repeated

N

times on samples prepared under identical macroscopic conditions at the same instant t.

5.6.2. Quasi-Additivity

We have enunciated the principle of additivity for

W

above. The energy E plays a very different role because of mutual interactions between various sub-bodies. We again restrict to only two sub-bodies for simplicity, which can be later generalized to any number of sub-bodies. We assume that they are weakly interacting so that their energies are quasi-additive, which we express in a form using

j = 1, 2

:

E = \sum_{j} E_{j} + U_{int} \approx \sum_{j} E_{j},

(119)

where

U_{int}

is the weak interaction energy between

Σ_{1}

and

Σ_{2}

, and can be neglected to a good approximation provided

U_{int} < < E_{sm} ≐ min \{E_{j}\} .

(120)

We can extend the discussion to many sub-bodies

\{Σ_{j}\}, j = 1, 2, \dots

, by defining

U_{int}

as the net interaction energies between all of them:

U_{int} ≐ \sum_{j > l} E_{j l},

where

E_{j l}

is the interaction energy between

Σ_{j}

and

Σ_{l}

. The inequality in Equation (120) can be made as precise as we wish by making N extremely large compared to various sub-bodies.

Remark 38.

With quasi-additivity for energies, we can extend the principle of additivity from

W

to

Z \approx \sum_{j} Z_{j},

(121)

by including quasi-additivity for the energies; see Claim 2.

However, the quasi-additivity of the entropy is altogether a different issue. The entropy additivity is strictly valid if

Σ

and

\tilde{Σ}

are (statistically) independent [3], i.e., noninteracting. However, this independence is not of any physical interest as

Σ

and

\tilde{Σ}

must be interacting with each other for any interesting thermodynamics; otherwise, there is no need to consider

\tilde{Σ}

, and the issue of additivity does not arise. Thus, we are inclined to consider them to be quasi-independent. To the best of our knowledge, the discussion of quasi-independence and its distinction from interactions between

Σ

and

\tilde{Σ}

that are weak has been carefully presented elsewhere ([148],

S_{corr}

was called

S_{int}

there; however,

S_{corr}

seems to be more appropriate) for the first time, which we summarize below. The presence of interparticle interactions that determine E and

\tilde{E}

for

Σ

and

\tilde{Σ}

, respectively, results in the thermodynamic concept of correlation lengths in them. The correlation length

λ_{corr} > a

is a property of macrostates, and can be much larger than the interparticle interaction length a between particles depending on the macrostate. In general,

λ_{corr} > > a

. A simple well-known example is of the correlation length

λ_{corr}

of a nearest neighbor Ising model, which increases very rapidly as the critical point is reached, and where it can be much larger than the nearest neighbor distance a between the spins. Two interacting Ising systems at the same temperature cannot be “independent”, so the additivity of entropy for

Σ_{0}

is replaced by the following:

S_{0} (X_{0}, t) = S (X (t), t) + \tilde{S} (\tilde{X} (t)) + S_{corr} (t),

(122a)

where

S_{corr} (t)

is a correction term to the entropy due to correlation that is present between

Σ

and

\tilde{Σ}

due to their mutual interaction. If the linear sizes l and

\tilde{l}

of the two bodies are much larger compared to

λ_{corr}

, then this correlation becomes almost nonexistent. In this case,

S_{corr} (t)

can be neglected to a good approximation so that

S_{0} (X_{0}, t) \approx S (X (t), t) + \tilde{S} (\tilde{X} (t)),

(122b)

provided

\tilde{l}, l > > λ_{corr}

. Under this condition,

Σ

and

\tilde{Σ}

are said to be quasi-independent [148], which ensures that their entropies become quasi-additive. This distinction is usually not made explicit in the literature. Usually,

\tilde{l} > > l

, but this condition was not used above so the above additivity is valid for any two bodies for which

\tilde{l}, l > > λ_{corr}

. For

\tilde{Σ}

representing a medium,

\tilde{S}

has no explicit time dependence as it is assumed to be in equilibrium, and

X_{0}

remains constant for the isolated system

Σ_{0}

.

The above quasi-additivity principle is applicable to microstates of

Σ

as well. We now focus on classical microstates represented by the sub-bodies, and apply the discussion to only two sub-bodies representing

Σ = Σ_{1}

and

\tilde{Σ} = Σ_{2}

forming the isolated system

Σ_{0}

as they are central to our statistical mechanics. We consider the energies of the microstates

m_{0 k_{0}}, m_{k}

, and

{\tilde{m}}_{\tilde{k}}

. They are related as follows:

E_{0 k_{0}} = E_{k} + {\tilde{E}}_{\tilde{k}} + E_{k, \tilde{k}}

(123a)

where we have also included the interaction energy

E_{k, \tilde{k}}

due to

U_{int}

, which is usually negligible relative to

E_{k},

{\tilde{E}}_{\tilde{k}}

. These energies are independent of the macrostates and, therefore, independent of quantities such as the temperatures and probabilities that specify macrostates of various bodies forming the system. The energies corresponding to their macrostates are related by

E_{0} = E + \tilde{E} + U_{int};

(123b)

see Equation (119). Again, the smallness of

E_{k, \tilde{k}}

results in its average

U_{int}

obtained by using

p_{0 k_{0}}

and

E_{k, \tilde{k}}

in Equation (112), being negligible relative to E and

\tilde{E}

.

Remark 39.

The assumption to neglect

E_{k, \tilde{k}}

or

U_{int}

merely makes Σ and

\tilde{Σ}

satisfy the principle of additivity. We will make this assumption in this review extensively.

Remark 40.

From now on, we will usually replace the sign “≈” by “=” unless clarity is needed.

Remark 41.

Throughout this review, we will think of the above approximate equalities as equalities to make the energies additive by neglecting the interaction energy between Σ and

\tilde{Σ}

, which is a standard practice in the field, but also assuming quasi-independence between them to make the entropies to be additive, which is not usually mentioned as a requirement in the literature.

5.7. $Σ$ in Internal EQ (IEQ)

The central concept of the

μ

NEQT is that of the internal equilibrium (IEQ) according to which the entropy S of an NEQ macrostate is a state function of the state variables in the enlarged state space

S_{Z}

[134,148,149]. The enlargement of the space relative to the EQ state space

S_{X}

is due to independent internal variables [13,18,51,108], which is sufficient to uniquely specify

M

in

S_{Z}

. We denote such a state by

M_{ieq}

. The same state cannot be uniquely specified in

S_{X}

or any other extended state space

S_{Z^{'}}

that does not have the same set of internal variables as in

Z

.

We give a simple example to clarify why and how internal variables are useful for describing an NEQ state. Consider the case of two identical bodies

Σ_{1}

and

Σ_{2}

in thermal contact at different temperatures

T_{1} (t)

and

T_{2} (t)

and energies

E_{1} (t)

and

E_{2} (t)

, respectively; we ignore other observables

N, V

, etc. Thus,

X = (E \dot{)}

for each system. We assume that each one is in an EQ state of its own at each instant. Together, they form an isolated composite system

Σ

, whose entropy

S (E_{1}, E_{2}) = S_{1} (E_{1}) + S_{2} (E_{2})

is a function of two variables at each instant t, and can be written as a state function in the enlarged state space formed by

E = E_{1} + E_{2} = c o n s t, ξ (t) = E_{1} - E_{2} .

(We have neglected the interaction energy

E_{12}

between

Σ_{1}

and

Σ_{2}

here per Remark 39.) This situation should be compared with its mechanical analog in Section 4, and in particular with Equation (104a) for

ξ_{E k}

; here,

n_{1} = n_{2} = 1 / 2

. The discussion there was purely mechanical so there was no dissipation.

We are in a position now to understand how dissipation emerges in thermodynamics. As the system approaches EQ,

E_{1} \to E_{2}

so that

ξ \to 0

. This also means that

T_{1} (t) \to T_{2} (t) = T_{eq}

, the EQ temperature. The first thing we learn from this simple example is that it clearly shows how the t-dependence in

S (E, t) \equiv S (E_{1}, E_{2})

can be replaced by invoking an extensive internal variable

ξ (t)

so that the entropy can be treated as a state function

S (E, ξ)

in the enlarged state space

S_{Z}

spanned by E and

ξ

. In other words, the system is in an IEQ state. In general, we will need to enlarge

S_{X}

by introducing an appropriate number of internal variables to form

S_{Z}

in which the system is in IEQ. Thus, we can always express S in an IEQ state as a state function

S = S_{ieq} = S (Z)

(124)

in the appropriately enlarged state space

S_{Z}

. This is carefully discussed in Section 12, where we take a different approach. As

S_{1} (E_{1})

and

S_{2} (E_{2})

, being in EQ, have their maximum value for given

E_{1} (t)

and

E_{2} (t)

,

S (E, ξ)

also has its maximum value for given

E (t)

and

ξ (t)

, but this value increases as

ξ \to 0

, and EQ is achieved. In general,

M_{ieq}

has the maximum possible entropy for the given

Z

, and continues to increase as

Z

changes and EQ is reached. For this IEQ state, it is trivial to show that the temperature (

1 / T = \partial S / \partial E

; see Equation (129)) of

Σ

is

T (t) = 2 T_{1} T_{2} / (T_{1} + T_{2})

(125a)

and its affinity

T \partial S / \partial ξ

(see Equation (133)) is given by

A (t) = (T_{2} - T_{1}) / (T_{1} + T_{2}) .

(125b)

At equilibrium,

T_{1} = T_{2} = T_{eq}

and

ξ = 0, A = 0

. Thus,

T_{1}

and

T_{2}

may be very different, yet the system as a whole can be treated as being in IEQ with a unique temperature

T (t)

, any temperature difference

T_{2} (t) - T_{1} (t)

between its parts not withstanding. The discussion can be extended easily to the case when the two bodies are in IEQs and also when they are of different sizes. In all cases, a unique temperature in accordance with Equation (129) can be defined for the composite system [77,78]. Once it is determined, we do not have to worry about the internal temperature difference between

Σ_{1}

and

Σ_{2}

. Any internal heat transfer between them is captured by

β A (t) d ξ = d_{i} S = d E_{1} (β_{1} - β_{2}),

(126)

as can be easily verified; here

d_{i} S

is the irreversible entropy generation due to macroheat exchange [51]. We thus see the affinity for

ξ

is given by

A (t) = \frac{T d_{i} S}{d ξ} = \frac{d E_{1}}{d ξ} \frac{(β_{1} - β_{2})}{β},

(127)

which vanishes as EQ is reached, a well-known feature [51] of classical thermodynamics. The analysis clearly shows how thermodynamics brings in dissipation in a mechanical system, showing the consistency of our approach using internal variables.

5.8. Gibbs Fundamental Relations for $M_{ieq} (Z)$ in $S_{Z}$ and $S_{ζ}$

We first consider the state space

S_{Z}

in which

M_{ieq} (Z)

is uniquely specified. In this space, the state function

S (Z)

results in the general form of the Gibbs fundamental relation

d S (Z) = \frac{\partial S}{\partial E} d E + \frac{\partial S}{\partial W} \cdot d W

(128a)

for the entropy, from which follows the Gibbs fundamental relation for

E (ζ)

in

S_{ζ}

spanned by

ζ ≐ (S, W)

,

d E (ζ) = \frac{\partial E}{\partial S} d S + \frac{\partial E}{\partial W} \cdot d W .

(128b)

Introducing the SI-temperature

T = 1 / β

as

T ≐ \partial E / \partial S, β = \partial S / \partial E,

(129)

and re-expressing the generalized macroforce in Equation (18) as

F_{w} = - \partial E / \partial W = T \partial S / \partial W,

(130)

we rewrite Equations (128a) and (128b) as

\begin{matrix} d S & = β d E + β d W \end{matrix}

(131a)

\begin{matrix} d E & = T d S - d W \end{matrix}

(131b)

in terms of SI macroquantities; here, we have introduced SI-macrowork

d W

as the generalized macrowork

d W ≐ F_{w} \cdot d W \equiv T \frac{\partial S}{\partial W} \cdot d W

(132)

done by the system. The derivative with respect to

ξ

determines the affinity

A ≐ T (\partial S / \partial ξ) = - (\partial E / \partial ξ),

(133)

which vanishes in equilibrium so that

A_{eq} = A_{0} = 0

. Thus, in general,

F_{w} = (f_{w}, A)

, where

f_{w} = - \partial E / \partial w = T \partial S / \partial w

(134)

is the generalized macrowork due to

w

.

Remark 42.

Comparing Equation (131) with Equation (93a) allows us to verify Conclusion 1 for the Clausius equality.

This equality must be distinguished from

d_{e} Q

in Equation (46). Thus, Equation (93a) allows us to uniquely identify the generalized macroheat

d Q = T d S

determined by

d S

and the generalized macrowork determined by

d W

to be independent of each other as they belong to orthogonal subspaces in the subspace

S_{ζ}

; see also Section 10.2. Both are SI-macroquantities. The resulting thermodynamics has been identified as the MNEQT. In terms of various components of

F_{w}

, the generalized macrowork is

d W = P d V - μ d N + \dots + A \cdot d ξ .

(135a)

We can identify various components of the macrowork as

d W_{V} = P d V, d W_{N} = μ d N, \dots,

d W_{ξ_{1}} = A_{1} d ξ_{1}, \dots,

using an obvious notation. The missing terms denote the contribution from the rest of the variables not shown, and

P ≐ - \partial E / \partial V, μ ≐ \partial E / \partial N, \dots, A ≐ - \partial E / \partial ξ,

(135b)

are the SI-fields associated with

W

, with changes

d W =

d V, d N, \dots, d ξ

being the changes in it.

In the

\overset{˚}{M} NEQT

, the first law in Equation (94) refers to exchange macroheat

d_{e} Q = T_{0} d_{e} S

(see Equation (46)) and macrowork

d_{e} W = P_{0} d_{e} V - μ_{0} d_{e} N + \dots;

(135c)

in terms of the fields (the temperature

T_{0}

, pressure

P_{0}

, chemical potential

μ_{0}

,⋯) of the medium and the corresponding macroscopic exchange quantities in all cases, regardless of the irreversibility. As the medium is in EQ, there is no contribution due to

ξ

in

d_{e} W

as the corresponding contribution

A_{0} \cdot d ξ

vanishes due to the fact that the affinity

A_{0} \equiv 0

for the medium. Our sign convention is that

d_{e} Q

is positive when it is added to

Σ

, and

d_{e} W

is positive when it is transferred to

\tilde{Σ}

.

It follows from Equations (135a) and (135c) that the irreversible macrowork, also known as dissipative work, is

d_{i} W = (P - P_{0}) d V - (μ - μ_{0}) d N + \dots + A \cdot d ξ \geq 0 .

(136)

The coefficients

P - P_{0}, μ - μ_{0}, \dots, A

are commonly known as thermodynamic forces or macroforce imbalances [51], which vanish in EQ; see Section 6.4.

Remark 43.

We have included the term associated with N for completeness in Equations (135a), (135c) and (136). We will no longer consider this term anymore.

We should compare the above equations with Equation (79). Once

d_{e} W

or

d W

has been identified, the use of the first law allows us to uniquely determine

d_{e} Q

or

d Q

, respectively.

It is clear that the root cause of dissipation is the macroforce imbalance. It drives the system towards equilibrium [41,42,75,76,134,148,149,150,152,153]. It arises due to the imbalance between the external and the average internal forces performing work; the microforce imbalance is introduced in the following section. The average force imbalances give rise to an internal work

d_{i} W

due to all kinds of force imbalances. The irreversible or dissipated work is given in Equation (136), which is generated within

Σ

.

If we include the relative velocity between a Brownian particle

Σ_{BP}

and the medium to account for the Brownian motion [148,157], we must account for [148] an additional term

- V \cdot d P_{BP}

in

d_{i} W

due to the relative velocity

V

:

d_{i} W = (P - P_{0}) d V - V \cdot d P_{BP} + A d ξ;

(137)

here,

d P_{BP} = F_{wBP} d t

is the change in the linear momentum of the Brownian particle experiencing a macroforce

F_{wBP}

. To see it, we recognize that

V \cdot d P_{BP}

must be nonpositive to comply with the second law. Thus,

F_{wBP}

must be antiparallel to

V

and describes the frictional drag. This is discussed in detail in Ref. [157]. Thus, the force is reviewed in Section 14 as the role of friction in the Langevin equation turns out to be different in the two NEQ thermodynamics. We will come back to this term later when we consider the motion of a particle attached to a spring; see Figure 3b, a system also studied by Jarzynski, so that a comparison can be made.

The irreversible macroheat

d_{i} Q

in all cases is given by Equation (47), and shows that it does not vanish when

T = T_{0}

, provided

d_{i} S > 0

. This means that the irreversible macrowork is present even if there is no temperature difference, such as in an isothermal process, as long as there exists some nonzero thermodynamic force or irreversibility. The resulting irreversible entropy generation is then given by

d_{i} S

. We summarize this [51] as

Conclusion 3.

To have dissipation, it is necessary and sufficient to have a nonzero thermodynamic force. In its absence, there can be no dissipation regardless of the time dependence of the work process; see also Remark 32. This understanding of dissipation becomes clear from the microscopic source of dissipation in Proposition 2.

5.9. Time-Dependent Gibbs Fundamental Relations for $M_{nieq} (Z)$ in $S_{Z}$

We now consider the generalization of the Gibbs fundamental relation for

M_{nieq}

, which is not uniquely specified in

S_{Z}

or

S_{ζ}

, by starting from Equation (295a) having an explicit time dependence that comes from “hidden” internal variables

ξ^{'}

in

S_{Z}

. From the state function entropy

S (Z^{'} (t))

for

M_{ieq} (t)

in

S_{Z^{'}}

, we have

d S (Z^{'} (t)) = \frac{\partial S}{\partial E} d E + \frac{\partial S}{\partial W} \cdot d W + \frac{\partial S}{\partial ξ^{'}} \cdot d ξ^{'},

where

W

is the work variable in

S_{Z}

. Expressing the last term as

\frac{\partial S}{\partial ξ^{'}} \cdot \frac{d ξ^{'}}{d t} d t,

we obtain the following generalization of the Gibbs fundamental relation for

M_{nieq} (t)

in

S_{Z}

:

d S (Z (t), t) = \frac{\partial S}{\partial E} d E + \frac{\partial S}{\partial W} \cdot d W + \frac{\partial S}{\partial t} d t,

(138a)

where

\frac{\partial S}{\partial t} ≐ \frac{\partial S}{\partial ξ^{'}} \cdot \frac{d ξ^{'}}{d t} \geq 0 .

(138b)

Definition 26.

As the presence of

\partial S / \partial t

above in

S_{Z}

is due to “hidden” internal variables in

ξ^{'}

, we will call it the hidden entropy generation rate, and

d_{i} S^{hid} (t) = \frac{\partial S}{\partial t} d t = \frac{\partial S}{\partial ξ^{'}} \cdot d ξ^{'} \geq 0,

(139a)

the hidden entropy generation. It results in a hidden irreversible macrowork

d_{i} W^{hid} ≐ T d_{i} S^{hid} = A^{'} \cdot d ξ^{'},

(139b)

in

S_{Z}

due to the hidden internal variable with affinity

A^{'}

.

In

S_{Z^{'}}

, we can identify the temperature T as the thermodynamic temperature in

S_{Z^{'}}

by the standard definition. It is clear from the above discussion that

\partial S (Z^{'} (t)) / \partial E

in

S_{Z^{'}}

has the same value as

\partial S (Z (t), t) / \partial E

in

S_{Z}

. However, there is an alternative definition of a temperature for

M

in

S_{Z}

as

d Q (Z (t), t) / d S (Z (t), t) = T_{arb}^{alt} (Z (t), t),

while

T (Z^{'} (t)) = d Q (Z^{'} (t)) / d S (Z^{'} (t))

for

M_{ieq}

in

S_{Z^{'}}

. It is easy to see that they are not the same as macroheats

d Q (Z^{'} (t)) = d E (t) + d W (Z^{'} (t))

and

d Q (Z (t), t) = d E (t) + d W (Z (t), t)

are not the same as macroworks. Thus, this definition is not a thermodynamic temperature for

M

in

S_{Z}

. Therefore, we are now set to identify

T_{arb}

(see also Equation (257)) as a thermodynamic temperature of

M_{arb}

by this T.

Remark 44.

1 / T_{arb} ≐ \partial S (Z (t), t) / \partial E

in

S_{Z}

is identified by the same derivative in the Gibbs fundamental relation in

S_{Z^{'}}

as follows:

\frac{1}{T_{arb}} = \frac{\partial S (Z^{'} (t))}{\partial E} \equiv \frac{1}{T (Z (t))},

(140a)

while the alternative nonthermodynamic temperature satisfies

T_{arb}^{alt} (Z (t), t) = T (Z (t)) [1 + d_{i} S^{hid} / d S (Z (t), t)],

(140b)

as is easily verified.

Remark 45.

As discussed above and as will be discussed in detail in Section 12.1, a macrostate

M_{nieq} (t)

with

S (Z (t), t)

can be converted to

M_{ieq} (t)

with a state function

S (Z^{'} (t))

in an appropriately chosen state space

S_{Z^{'}} \supset S_{Z}

by finding the appropriate window in which

τ_{obs}

lies as well. The needed additional internal variable

ξ^{'}

determines the hidden entropy generation rate

\partial S / \partial t

in Equation (138b) due to the non-IEQ nature of

M_{nieq} (t)

in

S_{Z}

, and ensures validity of the Gibbs relation in Equation (138a) for it, thereby not only providing a new interpretation of the temporal variation of the entropy due to hidden variables but also extending the MNEQT to

M_{nieq} (t)

in

S_{Z}

.

The above discussion strongly points towards the following possible proposition.

Proposition 1.

The MNEQT provides a very general framework to study any

M_{nieq} (t)

in

S_{Z}

, since it can be converted into a

M_{ieq} (t)

in an appropriately chosen state space

S_{Z^{'}}

, with

d_{i} S^{hid} (t)

originating from hidden internal variable

ξ^{'}

.

We now consider a process

P

to be studied in

S_{Z}

. It is natural to think of at least the initial macrostate

M^{in}

of

P

as being uniquely identified as

M_{ieq}^{in}

in

S_{Z}

. During the process,

M (t)

along

P

may turn into

M_{nieq} (t)

or remain

M_{ieq} (t)

. The former has been studied above. The latter can happen under the following two cases:

(i) all internal variables in

ξ

remain out of equilibrium;

(ii) internal variables in a subset

ξ^{'} \subset ξ

have equilibrated so that the affinity

A^{'} = T \partial S /

\partial ξ^{'}

vanishes.

In both cases,

M (t)

remains

M_{ieq} (t)

in

S_{Z}

, except that in (ii),

M (t)

can also be treated as

M_{ieq} (t)

in the proper subspaces between

S_{Z^{″}} \subset S_{Z}

and

S_{Z}

, with

Z^{″}

defined by

Z^{″} \cup ξ^{'} = Z

. Even though

A^{'} = 0

in these subspaces so that

d_{i} S^{hid} (t) = 0

and

d_{i} W^{hid} (t) = 0

, the Fl microaffinity

A_{k}^{'} \neq 0

in these subspaces, and will still play an important role in the

μ

NEQT. Therefore,

Remark 46.

We will use the state space

S_{Z}

to construct the NEQ statistical mechanics in (i) and (ii) without affecting the hidden entropy generation and hidden irreversible macrowork. This allows us to use

S_{Z}

over the entire process.

Remark 47.

In a process

P

resulting in

M_{nieq} (t)

in

S_{Z}

, it is natural to assume that the terminal macrostates in

P

are

M_{ieq}

so the affinity corresponding to

ξ^{'}

must vanish in them.

The above discussion can be easily applied to consider the case

S_{Z^{'}} \subset S_{Z}

, in which internal variables in a subset

ξ^{'}

of

ξ

have equilibrated. The result is summarized in the following:

Remark 48.

By replacing

Z

by

X

, and

Z^{'}

by

Z

, we can also express the Gibbs fundamental relation for any NEQ macrostate in

S_{X}

as

d S (X (t), t) = \frac{\partial S}{\partial E} d E + \frac{\partial S}{\partial w} \cdot d w + \frac{\partial S}{\partial t} d t,

(141)

by treating

M_{neq}

as

M_{ieq}

in

S_{Z}

. In an NEQ process

\bar{P}

between two EQ macrostates but resulting in

M_{ieq} (t)

between them in

S_{Z}

, the affinity corresponding to ξ must vanish in the terminal EQ macrostates of

\bar{P}

.

Equation (141) proves extremely useful to describe

M_{neq}

in

S_{X}

as it may not be easy to identify

ξ

in all cases.

Remark 49.

The explicit time dependence in the entropy for

M_{neq}

in

S_{X}

or

M_{nieq} (t)

in

S_{Z}

is solely due to the internal variables, which do not affect the validity of the Clausius equality

d Q = T d S

(Equation (45)), with T defined as the inverse of

\partial S / \partial E

at fixed

w, t

or

W, t

in the two state spaces, respectively; see Equation (129). As a consequence, Equation (47) remains valid for any

M

.

5.10. Consequences of the Second Law

Theorem 4.

As a consequence of the second law, the irreversible macrowork

d_{i} W

(see Equation (136)) which is equal in magnitude to the macroheat

d_{i} Q

(see Equation (95)) for any

M

is nonnegative in any real process.

Proof.

Using Equation (47), we find

\begin{matrix} T_{0} d_{i} S = (T_{0} - T) d S + d_{i} W \\ T d_{i} S = (T_{0} - T) d_{e} S + d_{i} W \end{matrix} \geq 0,

(142)

where the inequality follows from the second law

d_{i} S \geq 0

in Equation (67c); we assume T and

T_{0}

to be nonnegative. Therefore, each of the two independent contributions in each equation must be nonnegative. This thus proves that

d_{i} W = d_{i} Q \geq 0 .

(143)

☐

Corollary 1.

Different components of

d_{i} W

and

d_{i} Q

for any

M

must be individually nonnegative.

Proof.

Consider the independent components such as

d_{i} W_{V}, d_{i} W_{ξ}

, etc., of

d_{i} W

. As

d_{i} W

is nonnegative, each component must be nonnegative. □

This proves the inequalities in Equations (43) and (80). In addition, it shows that each term on the right in Equation (75) is nonnegative. We thus have a proof of a part of Remark 32 that deals with the consequences of the second law.

Corollary 2.

In any real process,

(T_{0} - T) d_{e} S \geq 0; (T_{0} - T) d S \geq 0 .

Proof.

The corollary follows from the preceding theorem. □

The first inequality merely states the well-known fact of thermodynamics that macroheat

d_{e} Q = T_{0} d_{e} S

flows from “hot” to “cold”. The second inequality also states a well-known fact about the stability in thermodynamics, which requires the entropy to increase with temperature. As EQ is reached,

T \to T_{0}

either from above (

T > T_{0}

) or from below (

T < T_{0}

). In the former case, S decreases, while it increases in the latter case.

Corollary 3.

For an isolated system

(d S \equiv d_{i} S)

or for

T = T_{0}

,

T d_{i} S = d_{i} W \geq 0 .

(144)

Proof.

Setting

d_{e} S = 0

for an isolated system or

T = T_{0}

in Equation (142) proves the theorem immediately. □

The inequalities in Equation (142) follow from the second law

d_{i} S \geq 0

in Equation (67c). Each term on the right side, being independent of each other, must be nonnegative separately, which yields

(T_{0} - T) d S \geq 0, (1 - T / T_{0}) d_{e} Q \geq 0, d_{i} W \geq 0

(145)

as consequences of the second law. In view of Equation (95), the last inequality above proves the last two inequalities in Equation (69).

5.11. Assumptions

We list the two important assumptions of our approach. They can be relaxed but we will not do that in this review.

5.11.1. N Fixed for $Σ$

In order to fix the size of

Σ

, we need to specify one of its extensive state variables. Usually, N is kept fixed to ensure a fixed size. Therefore, N is not considered part of

X = (E, V, \dots)

and

Z

from now on [177]. This also means that (i) there is no chemical reaction, and (ii) there is EQ with respect to the chemical potential. Most of the time, we will simplify the discussion by using a single internal variable; the extension to many internal variables is trivial.

Our primary interest is in studying an irreversible process

P

, which in MNEQT requires the existence of thermodynamic forces [51]. Their absence signifies that

P

represents a reversible process. It should be stressed that our notation is designed in such a way that the investigation can also apply directly to the (isolated) NEQ system

Σ_{0}

, if need be, for which no exchange with the outside is possible. In that case, the external driving must be replaced by spontaneous processes going on within

Σ_{0}

that drive it towards equilibrium. During this drive, there is dissipation within

Σ_{0}

that is found to contribute to work fluctuations in the

μ

NEQT. As is well-known, such spontaneous fluctuations are not directly captured in the

\overset{˚}{μ}

NEQT, the microstate extension of the

\overset{˚}{M} NEQT

. This makes our approach superior.

5.11.2. $\tilde{Σ}$ Always in EQ

We will assume

\tilde{Σ}

to be always in equilibrium (which requires it to be extremely large compared to

Σ

, as noted above). Any irreversibility going on within

Σ_{0}

due to internal dissipation, internal motion, internal nonuniformities, etc., is ascribed to

Σ

alone. Moreover, we assume additivity of volume, a weak interaction between, and quasi-independence of,

Σ

and

\tilde{Σ}

; the last two conditions, respectively, ensure that the energies and entropies are additive [75,76,134,148,149] but also impose some restriction on the size of

Σ

in that it cannot be too small. In particular, the size should be at least as big as the correlation length for quasi-independence as discussed there. In this study, we will assume that all required conditions necessary for the above-mentioned additivity are met.

6. Mechanical Aspects

We will consider a system in this section, but the arguments are valid for any system

Σ

.

6.1. Microstate Evolution in $S_{Z}$

The traditional formulation of statistical thermodynamics [33,48,79] is built on a mechanical approach in which

m_{k}

follows its classical or quantum mechanical Hamiltonian evolution dictated by its SI-Hamiltonian

H_{k} = H (x_{k} (t)| W (t))

, which suffices to provide the deterministic mechanical description with NFl-

W

. We will see below that k does not change as

W

changes in a process

P

. We will only consider a classical case system

Σ

, for which the change in

H_{k}

in

P

is

d H_{k} = \frac{\partial H_{k}}{\partial x_{k} (t)} \cdot d x_{k} (t) + \frac{\partial H_{k}}{\partial W (t)} \cdot d W (t) .

(146)

The first term on the right, due to the dynamical variations of

x_{k}

in the system, vanishes identically due to Hamilton’s equations of motion for any

m_{k}

. Thus, for fixed

W

, the energy

E_{k} (W) = H (x_{k}| W)

of

m_{k}

remains constant in time due to deterministic Hamiltonian dynamics. Only the variation

d W

in

S_{Z}

generates any change in

E_{k}

. Consequently, we can write

d H_{k} = d H_{k}^{(w)} = \frac{d H_{k}}{\partial W (t)} \cdot d W (t)

(147)

for all

m_{k}

, which clearly shows that only the variation

d H_{k}^{(w)}

due to

d W

is relevant. This is indicated by the superscript w on

d H_{k}^{(w)}

. We do not worry about how

x_{k}

changes dynamically in

H (x_{k}| W)

from now on, and focus, instead, on the state space

S_{Z}

, in which we can simply express the Hamiltonian as

H_{k} (W)

for any microstate, remembering that its value

E_{k} (W)

is a point in

S_{Z}

.

6.2. SI-Microwork in $S_{Z}$

The point

E_{k} (W)

in

S_{Z}

undergoes a change due to

d W

given by

d E_{k} = \frac{\partial E_{k}}{\partial W} \cdot d W = - d W_{k},

(148)

where

d W_{k} = F_{w k} \cdot d W, F_{w k} ≐ - \partial E_{k} / \partial W .

(149)

denotes the Fl-generalized microwork produced by the Fl-generalized microforce

F_{w k}

; see Definition 17. These are SI-microquantities. As

E_{k}

is uniquely determined by

W

, the microforce is a deterministic and continuous function of

W

; see below. The SI-microwork

d W_{k}

is mechanically defined work as

W

is varied, which explains why

W

is identified as the work parameter in

H

. The variation

d Z (t) ≐ (d E (t), d W (t))

in time defines a thermodynamic process

P

. The trajectory

γ_{k}

in

S_{Z}

followed by

m_{k}

during

P

as a function of time will be called the Hamiltonian trajectory. Being purely mechanical in nature, the trajectory is completely deterministic and cannot describe the evolution of the thermodynamic macrostate

M

during

P

unless supplemented by thermodynamic stochasticity over

P

; see Claim 1. This is accounted for by the variation in

p_{k} (M)

as

M

changes, and is determined by some stochastic perturbation such as the random interaction with

\tilde{Σ}

[33,59]; see Definition 25. We discuss the origin of this stochasticity in Section 7, which will allow us to introduce heat and temperature.

Since

m_{k}

and

p_{k} (M)

are independent of each other, we can treat them separately. This provides a major simplification, as described below, for studying the process

P

in terms of a Hamiltonian trajectory

γ_{k}

. We study the mechanical evolution of

m_{k}

along

γ_{k}

without being concerned about the probabilities. The effect of the probability can then be supplemented by an appropriate probability. This will lead to the introduction of the concept of SI-microheat; see Section 10, where we investigate this concept in detail for the first time.

6.3. SI-Legendre Transform

We can alternatively consider the case with

\{W_{k}\}

as the Fl-parameter. In that case, we will be dealing with

W

as a random variable with outcome

W_{k}

; see Claim 3. Let us clarify the significance of Equation (18) by considering

F_{w k} = (P_{k}, A_{k})

defined above, and show how we ensure a fixed P and A by considering Fl-

W_{k} = (V_{k}, ξ_{k})

. We consider a

m_{k}

with microenergy

E_{k} (V, ξ)

, from which we obtain

P_{k} (V, ξ)

and

A_{k} (V, ξ)

. They are functions of two variables, and we look for their crossing

W_{k} = (V_{k}, ξ_{k})

with a plane

Π

defined by

F_{w} = (P, A)

to determine

W_{k}

. We now do this for every k using the same plane

Π

. Using these crossings, we have

\forall k, P = - \partial E_{k} / \partial V_{k}, A = - \partial E_{k} / \partial ξ_{k} .

(150a)

As the two derivatives have fixed values for every k, their averages are also the same fixed values in

F_{w} = (P, A)

as required in Equation (18). The crossings

W_{k}

give the fluctuating

(V_{k}, ξ_{k})

.

Alternatively, we can easily determine

(V_{k}, ξ_{k})

by considering an NEQ SI-Legendre transform

E_{k}^{L}

of

E_{k}

, defined as

E_{k}^{L} (P, A) ≐ E_{k} (V_{k}, ξ_{k}) + P V_{k} + A ξ_{k},

(150b)

which is a function of P and A, but not of

V_{k}

and

ξ_{k}

, since

\partial E_{k}^{L} / \partial V_{k} = 0, \partial E_{k}^{L} / \partial ξ_{k} = 0

, as is easily seen using Equation (150a). We now have

V_{k} = \partial E_{k}^{L} / \partial P, ξ_{k} = \partial E_{k}^{L} / \partial A .

(150c)

After averaging over microstates in

M_{\det}

, we obtain

E^{L} (P, A) ≐ E (V, ξ) + P V + A ξ .

(151)

Remark 50.

E^{L} (P, A)

must not be confused with the NEQ enthalpy

H = E (V, ξ) + P_{0} V

.

We can generalize the above discussion for the general case of NFl

F_{w}

or Fl

\{W_{k}\}

. We first define the SI-Legendre-transformed Hamiltonian

H^{L} (F_{w}) ≐ H (W) + Φ (F_{w}, W),

(152a)

in terms of

Φ (F_{w}, W)

introduced in Equation (23b). Its microenergy

E_{k}^{L} (F_{w})

is the SI-Legendre transform of

E_{k} (W_{k})

, and is given by

E_{k}^{L} (F_{w}) ≐ E_{k} (W_{k}) + Φ (F_{w}, W_{k});

(153)

compare with

E_{k}^{L, Fl} (F_{w})

in Equation (22a). We are suppressing the suffix NFl, as it is clear from the dependence on

F_{w}

that we are dealing with Fl

W_{k}

; see Claim 5. For

E_{k}^{L} (F_{w})

,

F_{w}

plays the role of the (Legendre-transformed) NFl “work” parameter

W^{L}

(

= F_{w}

) so that the generalized (Legendre-transformed) Fl “microforce”

F_{w k}^{L}

is given by

F_{w k}^{L} ≐ - \partial E_{k}^{L} / \partial F_{w} = - W_{k},

(154)

which should be compared with the second equation in Equation (149); note the presence of the negative sign above on the right side. The extension of the generalized microwork given in the first equation in Equation (149) to this case is the Fl Legendre-transformed microwork

d W_{k}^{L} (F_{w}) = - W_{k} \cdot d F_{w},

(155)

so that

d E_{k}^{L} (F_{w}) \equiv - d W_{k}^{L} (F_{w}),

(156)

which is identical in form with Equation (148).

For the medium

\tilde{Σ}

, we have

d {\tilde{W}}_{\tilde{k}}^{L} ({\tilde{f}}_{w}) = - {\tilde{w}}_{\tilde{k}} \cdot d {\tilde{f}}_{w},

which, after reduction, yields

d {\tilde{W}}_{k}^{L} ({\tilde{f}}_{0 w}) = d {\tilde{W}}^{L} ({\tilde{f}}_{0 w}) = - \tilde{w} \cdot d {\tilde{f}}_{0 w} = - d_{e} W^{L} ({\tilde{f}}_{0 w}),

(157)

where we have replaced

{\tilde{f}}_{w}

by

{\tilde{f}}_{0 w}

of

Σ_{0}

, and used Equation (64a).

The average of

E_{k}^{L} (F_{w})

is given by

E^{L} (S, F_{w}) ≐ E (S, W) + F_{w} \cdot W

(158)

(compare with Equation (152a)), while other microquantities have their averages given by

\begin{matrix} F_{w}^{L} & ≐ - \partial E^{L} (S, F_{w}) / \partial F_{w} = - W, \\ d W^{L} (S, F_{w}) & = - W \cdot d F_{w}, \\ d E^{L} (S, F_{w}) & \equiv - d W^{L} (S, F_{w}), \end{matrix}

(159)

as is expected from the above discussion.

As considering Fl-

W

creates no additional complication, we will mostly deal with NFl-

W

in this review.

For completeness and later usage in Section 12.2, we also introduce another Legendre transform in the case that

W

is NFl, but

F_{w k}

is Fl. We quote the results that are easily derived using a similar approach as above. The SI-Legendre-transformed microenergy is

E_{k}^{L} (F_{w k}) ≐ E_{k} (W) + Φ (F_{w k}, W),

(160)

which should be compared with Equations (22a) and (153); we also have

\begin{matrix} F_{w}^{L} & ≐ - \partial E_{k}^{L} / \partial F_{w k} = - W, \\ d W_{k}^{L} (F_{w k}) & = - W \cdot d F_{w k}, \\ d E_{k}^{L} (F_{w k}) & \equiv - d W_{k}^{L} (F_{w k}), \end{matrix}

(161)

For the macroquantities, we obtain exactly the same equations as in Equations (158) and (159), which is expected in view of the consistency requirement we have imposed; see Remark 9.

6.4. Mechanical Force Imbalance (FI)

We now formalize the important mechanical concept of force imbalance (FI). It is the presence of the FI that results in an NEQ mechanical state and emerges as a central novel concept in NEQ statistical mechanics by being ubiquitous in any arbitrary macrostate

M

. For example, consider a spring being pulled by an external force

F_{0}

. This induces a spring force

F_{s}

in the opposite direction. The total force

F_{t} = F_{0} + F_{s} = F_{0} - |F_{s}|

does not usually vanish, except in stable equilibrium. For nonvanishing

F_{t}

, the spring will undergo an oscillatory motion forever, as there is no second law for a mechanical system.

We now consider a general situation of a FI to formalize it for our purpose. To this end, we focus on an isolated system

Σ

consisting of two systems

Σ_{1}

and

Σ_{2}

, with their Hamiltonians

H_{1} (W_{1})

and

H_{2} (W_{2})

, respectively; we take NFl-

W

parameters for simplicity. Assuming the quasi-additivity (see Section 5.6) of corresponding

Z_{1}

and

Z_{2}

, we have the Hamiltonian

H

of

Σ

given by

H (Z_{1}, W_{2}) = H_{1} (W_{1}) + H_{2} (W_{2}) .

(162a)

Thus, under this assumption, the microenergy

E_{k}

of

m_{k}

is given by

E_{k} (Z_{1}, W_{2}) \approx E_{k_{1}} (W_{1}) + E_{k_{2}} (W_{2})

(162b)

in terms of the microenergies

E_{k_{1}} (W_{1})

of

m_{1 k_{1}}

of

Σ_{1}

, and

E_{k_{2}} (W_{2})

of

m_{1 k_{2}}

of

Σ_{2}

. The generalized microworks by the two systems are

d W_{1 k_{1}} = F_{1 w k_{1}} \cdot d W_{1}, d W_{2 k_{2}} = F_{2 w k_{1}} \cdot d W_{2};

see Equation (37a). Here the suffixes 1 and 2 refer to

Σ_{1}

and

Σ_{2}

, respectively.

Definition 27.

The difference between SI-microforces

F_{1 w k_{1}}

and

F_{2 w k_{1}}

of

Σ_{1}

and

Σ_{2}

, respectively, that is given by

Δ F_{w k} = F_{1 w k_{1}} - F_{2 w k_{2}}

(163)

is called the internal microforce imbalance (μFI) produced by

Σ_{1}

and

Σ_{2}

.

Theorem 5.

The internal microwork

d_{i} W_{k}

by an isolated body Σ consisting of

Σ_{1}

and

Σ_{2}

is an algebraic sum of all possible internal microworks that occur inside Σ.

Proof.

The generalized microwork by Σ is the algebraic sum

d W_{k} = d_{i} W_{k} = d W_{1 k_{1}} + d W_{2 k_{2}};

(164)

see the second equation in Equation (57a). Using Equation (14a) for

d W_{1}

and

d W_{2}

, and using

d_{e} W_{1} = - d_{e} W_{2}

, which follows from Equation (60), we have

d_{i} W_{k} = Δ F_{w k} \cdot d_{e} W_{1} + F_{1 w k_{1}} \cdot d_{i} W_{1} + F_{2 w k_{1}} \cdot d_{i} W_{2} .

(165)

The first term with

Δ F_{w k}

is the internal microwork performed by it over the exchange displacement

d_{e} W_{1}

by

Σ_{1}

. The other two terms also represent internal microworks produced by the two generalized microforces over the internal displacements

d_{i} W_{1}

and

d_{i} W_{2}

, respectively. These three components exhaust all internal microworks within Σ, which proves the theorem. □

We can use Remark 27 to define

d_{e} W_{1 k_{1}}

of

Σ_{1}

as the negative of the exchange microwork by

Σ_{2}

, and vice versa. The importance of this corollary is, therefore, that it allows us to determine one of them in terms of the other one, which happens to be easier to determine, such as when it happens to be a medium, as will be seen in Section 7.5; see Theorem 7.

There is an alternate way to express

E_{k} (Z_{1}, W_{2})

in terms of

\hat{W}

and

ξ

(see Equation (100)), that describes

Σ

directly. It is easy to verify that

W_{1} = n_{1} \hat{W} + n_{1} n_{2} ξ, W_{2} = n_{2} \hat{W} - n_{1} n_{2} ξ .

(166)

In terms of these, we introduce the microforce

F_{w k}

and microaffinity

A_{k}

for

Σ

,

{\hat{F}}_{w k} = - \partial E_{k} / \partial \hat{W}, A_{k} = - \partial E_{k} / \partial ξ;

see Equation (17a). We easily verify that

\begin{matrix} {\hat{F}}_{w k} & = n_{1} F_{1 w k_{1}} + n_{2} F_{2 w k_{2}}, \end{matrix}

(167a)

\begin{matrix} A_{k} & = n_{1} n_{2} (F_{1 w k_{1}} - F_{2 w k_{2}}) = n_{1} n_{2} Δ F_{w k}, \end{matrix}

(167b)

which again shows the physical importance of the microforce imbalance

Δ F_{w k}

. The direct evaluation of

d E_{k} = - d W_{k} = - d_{i} W_{k}

using

E_{k} (\hat{W}, ξ)

gives

d_{i} W_{k} = {\hat{F}}_{w k} \cdot d \hat{W} + A_{k} \cdot d ξ .

It is easy to verify that this

d_{i} W_{k}

is identical to that in Equation (165), as expected by using Equation (166). We also see that

d_{e} W_{k} = d W_{k} - d_{i} W_{k} = 0

in accordance with Remark 27.

For

n_{1} \to 0, n_{2} \to 1

, we have

{\hat{F}}_{w k} \to F_{2 w k_{2}},

a very common situation when

Σ_{2}

becomes extremely large compared to

Σ_{1}

, such as when we consider a system in a medium; see Figure 1.

Let us split

\hat{W} ≐ (\hat{w}, ξ)

, where

\hat{w} = w_{1} + w_{2}, \hat{ξ} = ξ_{1} + ξ_{2}

(168)

(see Equation (101)) are the sum of work-observables and internal variables of

Σ_{1}

and

Σ_{2}

, respectively. Then we can re-express

d_{i} W_{k}

as

d W_{k} = {\hat{f}}_{w k} \cdot d \hat{w} + {\hat{A}}_{k} \cdot d \hat{ξ} + A_{k} \cdot d ξ,

where

{\hat{f}}_{w k} = - \partial E_{k} / \partial \hat{w}, {\hat{A}}_{k} = - \partial E_{k} / \partial \hat{ξ}

.

If we set

d_{e} \hat{w} = 0

for the isolated

Σ

, then

d W_{k}

reduces to the internal microwork done by it:

d_{i} W_{k} = {\hat{f}}_{w k} \cdot d_{i} \hat{w} + {\hat{A}}_{k} \cdot d \hat{ξ} + A_{k} \cdot d ξ .

(169)

The second term on the right is the internal microwork by

\hat{ξ}

and the third term is the internal microwork by the new internal variable

ξ

.

We now apply the above discussion to the important case in which

Σ

becomes the isolated system

Σ_{0}

, and

Σ_{1}

and

Σ_{2}

become the system of interest

Σ

and the medium

\tilde{Σ}

; see below. As the latter is always in EQ, it has no

ξ_{2}

to consider so that

W_{2} \to \tilde{w}, A_{2 k_{2}} \to {\tilde{A}}_{\tilde{k}} = 0

. The

μ

FI from Equation (163) becomes

Δ F_{w k_{0}} = F_{w k} - {\tilde{f}}_{w \tilde{k}},

(170)

which is the difference between the SI-microforce

F_{w k} ≐ - \partial E_{k} / \partial W

of

Σ

, and the MI-microforce

{\tilde{F}}_{w \tilde{k}} ≐ - \partial {\tilde{E}}_{\tilde{k}} / \partial \tilde{W}

associated with

\tilde{Σ}

. Consequently,

Δ f_{w k_{0}} = f_{w k} - {\tilde{f}}_{w \tilde{k}}, Δ A_{k_{0}} = A_{k} .

(171)

The internal microwork for

Σ_{0}

is obtained from Equation (165)

d_{i} W_{0 k_{0}} = Δ F_{w k_{0}} \cdot d_{e} W + F_{w k} \cdot d_{i} W,

(172)

where we have set

d_{i} \tilde{W} = 0

as

\tilde{Σ}

is in EQ. In addition, we also have from Remark 27

d_{e} W_{0 k_{0}} = 0, d_{e} W_{k} = - d_{e} {\tilde{W}}_{\tilde{k}} = - d {\tilde{W}}_{\tilde{k}} .

(173)

Regarding the last two equations above containing different suffixes, we must recall Remark 28, as a consequence of which Equation (194b) results.

6.5. Work–Energy Principle

The most important confirmation of the mechanical nature of microstates appears in the form of the work–energy principle (

d E_{k} = - d W_{k}

) that was proposed a while back [150,151], connecting the SI-microenergy change and the SI-generalized microwork, a principle whose importance has not been recognized in various fluctuation theorems [26,135,136,137,138,139,140,141,142,143,144,145,146,147,158,159], as the distinction between SI and MI quantities has not been properly accounted for. Because of this, it is important to emphasize this principle for microstates and to clarify its significance for the reader. Indeed, we give a more general formulation of the principle than presented earlier.

The well-known work–energy theorem of classical mechanics [33] shows that the SI-work done by the SI-force is nothing but the change in the energy (itself a SI-quantity). But the SI-aspect of the principle is never discussed, even though it is implied. However, in NEQT, there are various works that one needs to confront, as we have seen. The theorem presented below extends the previous result to all bodies and to all microworks.

Theorem 6.

Work–Energy Principle The microenergy change

d_{e} E_{k}

of a body due to parameter change must be identified with the negative of the BI-microwork

d_{e} W_{k}

. The change

d E_{k}

has two contributions; see Equation (15). The first one corresponds to the external microwork

d_{e} E_{k} = - d_{e} W_{k} = d {\tilde{W}}_{\tilde{k}}

(174)

performed by the medium on

m_{k}

and the second one to the internal microwork

d_{i} E_{k} = - d_{i} W_{k},

(175)

given in Equation (165) or (169). All these relations can be compactly expressed by

d_{α} E_{k} = - d_{α} W_{k} .

(176)

Proof.

The generalized microwork in Equation (37a) done by

F_{w k}

is exactly as in mechanics, which proves

d E_{k} = - d W_{k}

(see Equation (148)); both sides are SI-quantities as expected. This is true for

Σ, \tilde{Σ}

, and

Σ_{0}

. For the medium

\tilde{Σ}

, which we take to be in EQ,

d {\tilde{W}}_{\tilde{k}} = - d {\tilde{E}}_{\tilde{k}} = d_{e} {\tilde{W}}_{\tilde{k}} = - d_{e} {\tilde{E}}_{\tilde{k}}

, with

d {\tilde{W}}_{\tilde{k}}

given in Equation (71a). We now use Equation (62a) to relate MI-quantities with SI-quantities of Σ. From this equation, we find

d_{e} {\tilde{E}}_{\tilde{k}} = - d_{e} E_{k}

and

d_{e} {\tilde{W}}_{\tilde{k}} = - d_{e} W_{k}

. It immediately follows from these relations (see also Remark 27) and Equation (148) that

d_{e} E_{k} = - d_{e} W_{k}, d_{i} E_{k} = - d_{i} W_{k} .

This proves the theorem. □

Corollary 4.

We have also seen in Section 6.3 that

d E_{k}^{L} = - d W_{k}^{L},

(177a)

which can be generalized to

d_{α} E_{k}^{L} = - d_{α} W_{k}^{L},

(177b)

to be compared with Equation (176).

Proof.

We follow the same steps as above in Theorem 6 to trivially prove the above conclusion. □

The significance of the identity in Equation (174) cannot be overemphasized. Because

\tilde{Σ}

is in EQ, as the exchange microquantities for

Σ

are determined by the MI-microquantities, having

\tilde{Σ}

in EQ is extremely helpful since all of its microquantities, such as

d {\tilde{W}}_{\tilde{k}}

, are uniquely described in the

μ

EQT. This then determines

d_{e} W_{k}

, and in turn

d_{e} E_{k}

, from which we obtain

d_{i} E_{k}

and

d_{i} W_{k}

using

d_{i} E_{k} \equiv d E_{k} - d_{e} E_{k} = - d_{i} W_{k} .

In the process, we have identified

d_{e} E_{k}

and

d_{i} E_{k}

from the general work–energy principle. The same thread of thought justifies a similar conclusion for the Legendre-transformed energies.

We now elaborate on its significance further.

We first consider the NFl-

W

case, in which the variation

d W

determines the (mechanically defined) generalized microwork

d W_{k}

done by

F_{w k}

as seen in Equations (37a) and (37b). We similarly determine

d {\tilde{W}}_{\tilde{k}}

done by

f_{w \tilde{k}}

; see Equation (71a). From knowing

d {\tilde{W}}_{\tilde{k}}

, we determine

d_{e} E_{k}

. Without the use of the above principle, this will not be possible. We finally determine

d_{i} E_{k}

and

d_{i} W_{k}

, as discussed above. As E with outcomes

\{E_{k}\}

represents a random variable undergoing fluctuations, dE with outcomes

\{d E_{k}\}

also represents a random variable undergoing fluctuations. Accordingly,

d_{i} E_{k} \equiv - d_{i} W_{k} \neq 0

is also fluctuating over the microstates

\{m_{k}\}

so this fluctuation is ubiquitous. Similar arguments also apply to the fluctuating nature of

d E_{k}^{L}

and

d W_{k}^{L}

, and

d_{i} E_{k}^{L}

and

d_{i} W_{k}^{L}

. In this case, we use

Φ (F_{w k}, W)

(see Equation (23b)), to determine

E_{k}^{L}

; see Equation (23a). For the case of Fl-

W

, we need to instead use

Φ (F_{w}, W_{k})

to determine

E_{k}^{L}

.

For completeness, we discuss now the possibility of using only a subset

W^{NF} \subseteq W

as the NFl-parameter and the remaining subset

W^{F}

as the Fl-parameter, and taking the value

W_{k}^{F}

over

m_{k}

as described in Section 5.2. As noted earlier, we must satisfy Condition 1. Thus,

\begin{matrix} F_{w k}^{NF} & ≐ - \partial E_{k} / \partial W^{NF}, F_{w}^{F} ≐ - \partial E_{k} / \partial W_{k}^{F}, \end{matrix}

(178a)

\begin{matrix} F_{w}^{NF} & = 〈F_{w}^{F}〉, W^{NF} = 〈W^{F}〉 . \end{matrix}

(178b)

In this case, we need to use

Φ (F_{w k}^{NF}, W^{NF}, F_{w}^{F}, W_{k}^{F}) ≐ Φ (F_{w k}^{NF}, W^{NF}) + Φ (F_{w}^{F}, W_{k}^{F})

(179)

to obtain

E_{k}^{L}

.

The theorem merely represents the fact that the generalized microwork

d_{α} W_{k}

is at the expense of its microenergy loss

d_{α} E_{k}

[75,76,150,151]. For the case

W = (V, ξ)

, the corresponding microforce

F_{w k}

is (

P_{k}, A_{k})

given in Equation (81). The three different microworks

d_{α} W_{k}

and

d_{α} W

are given in Equations (77) and (79).

As an example, consider V as a NFl-parameter so that the corresponding fluctuating field (the pressure) for

m_{k}

is given by

P_{k} ≐ - \partial E_{k} / \partial V

, with

P = 〈P〉

with nonzero fluctuation

〈{(Δ P)}^{2}〉

[33]. To allow for a fluctuating

V_{k}

over

m_{k}

corresponding to a fixed P (see Equation (18)), we choose it so that

\forall k, \partial E_{k} / \partial V_{k} = - P

. To obtain the same thermodynamics in both cases, we expect that

〈V〉 = V, 〈{(Δ V)}^{2}〉 \geq 0;

here, V is the fixed volume in the NFl-description. Thus, a fixed V results in fluctuating

P_{k}

, and a fixed P results in fluctuating

V_{k}

, as already noted earlier; see Claim 5.

The isentropic change in

d E

is precisely

d E - T d S

, which is nothing but

(- d W) = - F_{w} \cdot d W

; see Equation (131). It represents the average

〈d E〉

of

d E_{k} = - d W_{k}

. It follows from Equation (176) that

〈d_{α} E〉 \equiv - d_{α} W

, a result already derived earlier, with

d_{i} W \geq 0

; see Equation (143). However, there is no constraint on the sign of the internal microwork

d_{i} W_{k} = - d_{i} E_{k}

, as will become clear below.

7. Stochastic Aspects

7.1. Origin of Stochasticity

In the example of the two noninteracting mechanical systems

Σ_{1}

and

Σ_{2}

forming the combined system

Σ

in Section 4, the discussion did not consider any physical or imaginary “wall” separating the two systems through which interactions can be transmitted. We should emphasize that a physical wall may also represent a container

Σ_{2}

used to confine the system

Σ_{1}

under investigation such as in Figure 1a, or a very thin layer separating the two subsystems as in Figure 1b. An imaginary wall may be a way to divide

Σ

into two parts

Σ_{1}

and

Σ_{2}

, a very common trick in EQ statistical mechanics [33] to study conditions of equilibration. However, to be specific, we will be considering a physical wall.

For our investigation, a wall merely allows the possibility of turning the mechanical system

Σ_{1}

into a thermodynamic system surrounded by

Σ_{2}

. It may be real or imaginary. We have used this scheme a while back to study its role for stochasticity in a deterministic system with special attention to Kac’s ring model [92,93,94,95,97,98,174,176]. For concreteness, we will think of the wall as a second system

Σ_{2}

in the following. The same discussion will also cover other kinds of walls mentioned above. A brief discussion of this approach has been given earlier in [79], which we will now elaborate in this review to extend it to microstates.

If there were no interactions between the two systems, they would be completely independent, which is of no interest to us. We have already treated this case in Section 4. The discussion there is easily extended to the case when the two systems are interacting, except that we will consider Fl-work parameters in this section to give a flavor of how to treat them. Following the approach taken in Section 4, we introduce the same two independent combinations

Z_{k}

and

ξ_{k}

by extending Equation (100) to

Z_{1 k_{1}}

and

Z_{2 k_{2}}

in place of

Z_{1}

and

Z_{2}

, respectively. Thus, we will be dealing with NFl-

F

; see Equation (18).

The simplest way to account for the wall is to treat it as rigid having a single fixed microstate

m_{2}^{(0)}

, i.e., having fixed locations of its particles. In this case, its Hamiltonian can be simply written as

H_{2}^{(0)} (W_{2})

with the suffixes referring to

m_{2}^{(0)}

and its energy

E_{2}^{(0)}

. In this case, there is no difference between NFl- or Fl-

W_{2}

for

Σ_{2}

. This case also means that k is not different from

k_{1}

.

We now consider the interacting case, with

H_{12}^{(0)}

the nonvanishing interaction Hamiltonian between

Σ_{1}

and

Σ_{2}

. This interaction causes correlations between them and results in a deterministic force of interaction between the microstate

m_{1 k_{1}}

of

Σ_{1}

and the unique microstate

m_{2}^{(0)}

of

Σ_{2}

. The energy transfer between the two microstates due to

H_{12}^{(0)}

is, therefore, deterministic so there is no possibility of any stochasticity in

Σ

.

As

Σ_{1}

and

Σ_{2}

are purely mechanical systems, it should be clear [79] that the only source of stochasticity can emerge from their mutual interaction, i.e., the interaction

H_{12}

between

Σ_{1}

and

Σ_{2}

. For the wall to produce any stochasticity in

Σ_{1}

, the wall must be allowed to have an enormously large number of possible microstates

m_{2 k_{2}}

. Therefore, we will use

Z_{1 k_{1}}

and

Z_{2 k_{2}}

or

W_{1 k_{1}}

and

W_{2 k_{2}}

for clarity. Let

E_{k}^{(0)} = H_{k}^{(0)} (W_{k}) ≐ H_{1 k_{1}} (W_{1 k_{1}}) + H_{2 k_{2}} (W_{2 k_{2}})

(180)

be the undisturbed Hamiltonian of

Σ

in

m_{k}

in the absence of any interaction; it is a function of

2 r + 2

variables (see Section 4). The above identity should be compared with the approximation in Equation (162a) for NFl-work parameters. Here,

W_{k}

is the Fl-analog of

W

in Equation (102a) with the corresponding microforce

F_{w}

; see Claim 3. The presence of the interaction Hamiltonian

H_{12 k} (Z_{1 k_{1}}, W_{2 k_{2}})

modifies

H_{k}^{(0)} (W_{k})

to the Hamiltonian

H_{k} (W_{k}, ξ_{12 k}) = H_{k}^{(0)} (W_{k}) + H_{12 k} (Z_{1 k_{1}}, W_{2 k_{2}}),

(181)

which is a function of

2 r + 3

variables; here, we have introduced

E_{k} = E_{k}^{(0)} + E_{12 k}, ξ_{12 k} = E_{k}^{(0)} - E_{12 k},

(182)

and

E_{12 k} = H_{12 k} (Z_{1 k_{1}}, W_{2 k_{2}})

. Because of

H_{12}

,

E_{k}

is different from

E_{k}^{(0)}

, even though

m_{k}

is still given by Equation (98). It should be evident from Section 4 that the presence of

Σ_{2}

even when the mutual interaction is absent requires using the internal variable

ξ

to uniquely specify the microstate of

Σ

; see Claim 11. The internal variable ensures the correct matching of the number of quantities specifying

m_{1 k_{1}}

and

m_{2 k_{2}}

together. The presence of

H_{12}

adds another quantity that now must be accounted for a unique description of

m_{k}

. This strongly suggests some modification of how

Σ

and, therefore,

Σ_{1}

should be specified in the presence of

Σ_{2}

, which we now discuss.

As discussed, the inclusion of internal variables must uniquely specify

m_{k}

of

Σ

even in the presence of

H_{12}

. Thus, nothing has changed with respect to including

H_{12}

, which plays the role of an internal variable in the sense that it is not controlled by the outside of

Σ

; the latter is mechanically described by

2 r + 3

variables. This then is similar to the discussion above, so nothing new is required for its analysis, to which we now turn.

The problem arises if we are interested in describing

Σ_{1}

by itself, and are not specifically concerned with any particular microstate

m_{2 k_{2}}

of the wall (

Σ_{2}

) in this description; see Remark 16. Effectively, the effect on

m_{1 k_{1}}

will be different from different microstates

m_{2 k_{2}}

so their effects on

m_{1 k_{1}}

such as its microenergy will appear haphazard, as we are not privy to know or focus on any particular

m_{2 k_{2}}

. It is this stochasticity that gives rise to the notion of a probability

p_{1 k_{1}}

of

m_{1 k_{1}}

and the correlation between

m_{1 k_{1}}

and

m_{2 k_{2}}

that determines their joint probability. As we are interested in quantities pertaining to

m_{1 k_{1}}

, we must reduce the probability for

m_{k}

, for which all of

\{m_{2 k_{2}}\}

must be considered by summing over conditional probabilities, as already noted in Definition 16. Thus, we will follow below the reduction as shown in Equation (31).

7.2. Process of Reduction

Let

p_{2 k_{2}} = p_{2 k_{2}} (Z_{2 k_{2}}, F_{2})

(we include

F ≐

\{- T, F_{w}\}

; see Equation (25) for the two systems for clarity) denote the BI-probability of

Σ_{2}

in

m_{2 k_{2}}

, and

p (k_{1}| k_{2}) = p (\{Z_{1 k_{1}}, F_{1}\}| \{Z_{2 k_{2}}, F_{2}\};

ξ_{12 k})

the conditional probability of the microstate

m_{1 k_{1}}

of

Σ_{1}

given

Σ_{2}

is in the microstate

m_{2 k_{2}}

. Similarly,

p (k_{2}| k_{1}) = p (\{Z_{2 k_{2}}, F_{2}\}| \{Z_{1 k_{1}}, F_{1}\}; ξ_{12 k})

is the conditional probability of the microstate

m_{2 k_{2}}

of

Σ_{2}

given

Σ_{1}

is in the microstate

m_{1 k_{1}}

. The conditional probabilities include the correlation due to the interaction energy. The joint probability

p_{k} = p_{k} (\{Z_{1 k_{1}}, F_{1}\}, \{Z_{2 k_{2}}, F_{2}\}; ξ_{12 k})

given by the identity

p_{k} = p (k_{1}| k_{2}) p_{2 k_{2}} = p (k_{2}| k_{1}) p_{1 k_{1}}

(183)

gives the probability of

m_{1 k_{1}}

and

m_{2 k_{2}}

; compare with Equation (29). If

H_{12}

vanishes identically so that

ξ_{12 k}

becomes superfluous, the two systems become independent, as already discussed in Section 4. The mathematical condition [114] for this is

p_{k} ≐ p_{1 k_{1}} p_{2 k_{2}},

(184a)

from which it follows that the conditional probability is given by

p (k_{1}| k_{2}) ≐ p_{1 k_{1}}; p (k_{2}| k_{1}) ≐ p_{2 k_{2}};

(184b)

compare it with the approximate form given in Equation (33).

We pursue the effect of this interaction further to clarify the situation for the process of reduction (see Remarks 16 and 17), which is required to reduce all the quantities associated with

Σ

to a microstate

m_{k_{1}}

of

Σ_{1}

. This requires “summing” over all states

m_{2 k_{2}}

of

Σ_{2}

. We use the joint probability

p_{k}

to determine the contribution of various Hamiltonians in Equation (181) under reduction over all states

m_{2 k_{2}}

. As a result of the reduction, we obtain the identity, called the law of total probability [114],

p_{1 k_{1}} = \sum_{k_{2}} p_{k} (\{Z_{1 k_{1}}, F_{1}\}, \{Z_{2 k_{2}}, F_{2}\}; ξ_{12 k})

(184c)

as the marginal, which no longer has any information about any particular

m_{2 k_{2}}

. After reduction,

\{Z_{2 k_{2}}, F_{2}\}

is replaced by

F_{2}

. Similarly,

ξ_{12 k}

is replaced by

ξ_{12 k_{1}}

; see below. Thus,

p_{1 k_{1}}

on the left side stands for

p_{1 k_{1}} (\{Z_{1 k_{1}}, F_{1}\}, F_{2}; ξ_{12 k_{1}})

and not for the BI-probability

p_{1 k_{1}} (\{Z_{1 k_{1}}, F_{1}\})

. As the modified probability

p_{1 k_{1}}

depends on

F_{2}

and on

ξ_{12 k_{1}}

of

Σ_{2}

, it carries the correlation effects with

Σ_{2}

because of their mutual interaction. Thus, it is not a SI probability. This will result in modifying results that are obtained by using BI-probabilities. A similar discussion uses the BI-probability

p_{1 k_{1}} (\{Z_{1 k_{1}}, F_{1}\})

in the second equation in Equation (183) for the reduction to obtain a non-BI-probability

p_{2 k_{2}} (F_{1}, \{Z_{2 k_{2}}, F_{2}\}; ξ_{12 k_{2}})

for

m_{2 k_{2}}

of

Σ_{2}

. However, we will only consider the reduction using

m_{2 k_{2}}

below.

For

H_{1} (W_{1 k_{1}})

, we have

\sum_{k_{2}} p_{k} H_{1 k_{1}} (W_{1 k_{1}}) = p_{1 k_{1}} H_{1 k_{1}} (W_{1 k_{1}}),

(185a)

which is again not BI to

Σ_{1}

since

p_{1 k_{1}}

is not one. Summing over

k_{1}

will give the ensemble average

H_{1} (F_{1}, F_{2}; ξ_{12})

, where

ξ_{12}

is defined below. As this average does not only depend on

F_{1}

, it is not a BI-macroquantity of

Σ_{1}

, although it is a SI-macroquantity of

Σ

. For

H_{2} (W_{2 k_{2}})

, we introduce an effective Hamiltonian

E_{2 k_{1}}^{(1)} = H_{2 k_{1}}^{(1)} (W_{1 k_{1}}, F_{2}; ξ_{12 k_{1}})

for

Σ_{2}

defined as follows:

p_{1 k_{1}} H_{2 k_{1}}^{(1)} (W_{1 k_{1}}, F_{2}; ξ_{12 k_{1}}) ≐ \sum_{k_{2}} p_{k} H_{2 k_{2}} (W_{2 k_{2}}) .

(185b)

It represents the effective Hamiltonian of

Σ_{2}

under the condition that

Σ_{1}

is in

m_{1 k_{1}}

as indicated by the subscript. Again, its average is not a BI-microquantity of

Σ_{2}

due to its dependence on

F_{1}

. Similarly, we introduce an effective interaction Hamiltonian

E_{12 k_{1}}^{(1)} = H_{12 k_{1}}^{(1)} (W_{1 k_{1}}, F_{2}; ξ_{12 k_{1}})

under the condition that

Σ_{1}

is in

m_{1 k_{1}}

, as above:

p_{1 k_{1}} H_{12 k_{1}}^{(1)} (W_{1 k_{1}}, F_{2}; ξ_{12 k_{1}}) ≐ \sum_{k_{2}} p_{k} H_{12 k} (W_{1 k_{1}}, W_{2 k_{2}}) .

(185c)

The effect of summing over

m_{2 k_{2}}

results in the two effective Hamiltonians depending explicitly on the microstate

m_{1 k_{1}}

, just as

H_{1 k_{1}} (W_{1 k_{1}})

.

As

p_{1 k_{1}} (\{Z_{1 k_{1}}, F_{1}\}, F_{2}; ξ_{12 k_{1}})

appears as a common factor in all three Hamiltonians, we can identify it as the effective probability that

Σ_{1}

is in the microstate

m_{1 k_{1}}

, irrespective of

m_{2 k_{2}}

, and

\begin{matrix} H_{k_{1}} (W_{1 k_{1}}, F_{2}, ξ_{12 k}) & = H_{1 k_{1}} (W_{1 k_{1}}) + \\ H_{2 k_{1}}^{(1)} (W_{1 k_{1}}, F_{2}; ξ_{12 k_{1}}) + \\ H_{12 k_{1}}^{(1)} (W_{1 k_{1}}, F_{2}; ξ_{12 k_{1}}) \end{matrix}

as its effective conditional Hamiltonian under this condition. The benefit of this rigorous approach is that it allows us to treat

Σ

as if it is in the

m_{1 k_{1}}

with probability

p_{1 k_{1}} (\{Z_{1 k_{1}}, F_{1}\}, F_{2}; ξ_{12 k_{1}})

. The reduction gives

\begin{matrix} E_{k_{1}} & = E_{1 k_{1}} + E_{2 k_{1}}^{(1)} + E_{12 k_{1}}^{(1)}, \\ ξ_{12 k_{1}} & = E_{1 k_{1}} + E_{2 k_{1}}^{(1)} - E_{12 k_{1}}^{(1)} . \end{matrix}

The average energy E of

Σ

is simply given by

E ≐ \sum_{k} p_{1 k_{1}} (\{Z_{1 k_{1}}, F_{1}\}, F_{2}; ξ_{12 k_{1}}) E_{k_{1}} .

The above discussion is valid regardless of the sizes of

Σ_{1}

and

Σ_{2}

. Thus, it is also applicable to small systems and was used recently by us to study the Brownian motion [157].

7.3. Quasi-Independence

We now turn to the consideration of

Σ, \tilde{Σ}

, and

Σ_{0}

by identifying them with

Σ_{1}, Σ_{2}

, and

Σ

, respectively. This requires some changes in the notation; in particular,

ξ_{12 k_{1}}

is replaced by

ξ_{int k}

, and

F_{2}

is replaced by

\tilde{f} ≐ (- \tilde{T}, f_{w})

as

\tilde{Σ}

being in EQ has

\tilde{A} = 0

. The drawback of the above rigorous discussion is that the marginal

p_{k} (\{Z_{k}, F\}, \tilde{f}; ξ_{int k})

of

m_{k}

is not a BI-probability of

Σ

because of the presence of

\tilde{f}

and

ξ_{int k}

, a reflection of the interaction, which has been taken into account exactly. Rather, it is an effective non-SI-probability controlled by the entire system

Σ

. Without any approximation of the correlation induced by

\tilde{Σ}

, the most appropriate and exact thermodynamic discussion can only be obtained for

Σ_{0}

.

If we wish to obtain a SI description of

Σ

, we need to focus only on it and not on

Σ_{0}

. The simplest way to accomplish this is to assume that the interaction part

H_{int}

is nonzero but insignificant compared to

H

and

\tilde{H}

so that

Σ

and

\tilde{Σ}

can be treated as quasi-independent (see Claim 7), the requirements for which have been discuss earlier [148]. This assumption is central to having the approximate entropy additivity discussed in Equation (122b), so it is very common in the field. We will now assume quasi-independence.

Definition 28.

By definition, quasi-independence implies Equation (33) as an approximate equality; the correlation due to weak interaction

H_{int}

has been neglected with the effect that the two systems become almost independent.

The joint probability therefore becomes approximately

p_{0 k_{0}} = p_{k} {\tilde{p}}_{\tilde{k}} .

As we are not interested in the microstate

{\tilde{m}}_{\tilde{k}}

, we must “sum” over all states

{\tilde{m}}_{\tilde{k}}

. As a consequence, it is easy to verify that Equations (185b) and (185c) reduce to

{\tilde{H}}_{k} = \tilde{E} (\tilde{w}), H_{int k} \approx 0,

where

\tilde{E} (\tilde{w})

is the average energy of

\tilde{Σ}

, and the mutual interaction is negligible.

Claim 14.

For any microstate

m_{k}

of the system Σ, the energy of the medium

\tilde{Σ}

is given by its macroenergy

\tilde{E} (\tilde{w})

. Thus,

\tilde{Σ}

exerts its macroforce

{\tilde{f}}_{w} (\tilde{w}) ≐ - \partial \tilde{E} (\tilde{w}) / \partial \tilde{w}

for all microstates

m_{k}

of Σ; compare with Equation (73a).

The above claim follows from Theorem 7, which generalizes it.

The most common situation in statistical mechanics is to consider these interactions to be so weak that we can sensibly talk about the behavior of the system alone to a high degree of accuracy. An idealization of the situation is when the interactions are completely absent, so that the system is isolated from the medium. The conservation laws usually refer to a certain measurable quantity of this system, which is then supposed to have a fixed value as the system evolves. For example, the linear momentum is conserved due to the homogeneity of the space, while the angular momentum is conserved due to the isotropy of space. For discrete symmetries, the parity associated with the symmetry remains conserved. This simplifies the discussion appreciably. If the interactions of the system with the medium are too strong to be neglected, there is no sense in talking about the system alone. In this case, one must consider the combined isolated system

Σ_{0} = Σ \cup \tilde{Σ}

as the system. Accordingly, we will usually consider an isolated system [162] if the interest is in such conserved quantities.

7.4. Reduction

The issue of conditional probabilities was considered in Ref. [157]. Let us consider the set of microstates

\{m_{k}\}, \{{\tilde{m}}_{\tilde{k}}\}

, and

\{m_{0, k_{0}}\}

of

Σ, \tilde{Σ}

, and

Σ_{0}

, respectively, and

[q], q \in χ

. Here,

\tilde{k}

and

k_{0}

index the countable sets

\{{\tilde{m}}_{\tilde{k}}\}

and

\{m_{0, k_{0}}\}

, respectively. The three sets are related in that

m_{0, k_{0}} ≐ m_{k} \otimes {\tilde{m}}_{\tilde{k}},

which follows from the additivity in Equation (168).

Let us introduce conditional probabilities

p (\tilde{k} ∣ k)

of

\tilde{k}

given k, and

p (k ∣ \tilde{k})

for k given

\tilde{k}

. In terms of them, Equation (183) turns into Equation (29). We use them to determine microquantities q

_{0 k}

and

{\tilde{q}}_{k}

of

Σ_{0}

and

\tilde{Σ}

, respectively, that can be associated with

m_{k}

. Following Equation (31), we have

\begin{matrix} q_{0 k} & ≐ \sum_{\tilde{k}} p (\tilde{k}| k) q_{0 k_{0}}, q_{0} = \sum_{k} p_{k} q_{0 k}, \end{matrix}

(186a)

\begin{matrix} {\tilde{q}}_{k} & ≐ \sum_{\tilde{k}} p (\tilde{k}| k) {\tilde{q}}_{\tilde{k}}, \tilde{q} = \sum_{k} p_{k} {\tilde{q}}_{k} . \end{matrix}

(186b)

We thus define the medium microenergy

{\tilde{E}}_{k}

under the condition that

Σ

is in

m_{k}

:

{\tilde{E}}_{k} ≐ \sum_{k} p (\tilde{k}| k) {\tilde{E}}_{\tilde{k}},

(187)

which satisfies the obvious identity

\tilde{E} ≐ \sum_{\tilde{k}} p_{\tilde{k}} {\tilde{E}}_{\tilde{k}} = \sum_{k} p_{k} {\tilde{E}}_{k} .

(188)

For the two MI-microfields defined by

{\tilde{F}}_{w \tilde{k}} ≐ - \partial {\tilde{E}}_{\tilde{k}} / \partial \tilde{W}, {\tilde{F}}_{w k} ≐ - \partial {\tilde{E}}_{k} / \partial \tilde{W},

(189)

we have the identity

{\tilde{f}}_{w} ≐ \sum_{\tilde{k}} p_{\tilde{k}} {\tilde{f}}_{w \tilde{k}} = \sum_{k} p_{k} {\tilde{f}}_{w k} .

(190)

It is clear that

{\tilde{F}}_{w k}

is obtained after reduction from

{\tilde{F}}_{w \tilde{k}}

.

The process of reduction can also be carried out for

[d θ]

for any body following the same steps as above by replacing q by

d θ

in Equation (186).

7.5. Reduction under Quasi-Independence for $m_{k}$

What makes SI-microquantities for a system

Σ

so important in the

μ

NEQT is the fact that they are unaffected by the presence of other objects in the surroundings such as a medium

\tilde{Σ}

; see Figure 1 and Figure 2. For simplicity, we again focus on Figure 1. All MI-microquantities carry the suffix

\tilde{k}

of

{\tilde{m}}_{\tilde{k}}

so they cannot be directly associated with

m_{k}

. Similarly, the microquantities of

Σ_{0}

carry the suffix

k_{0}

. Thus, we need to carry out reduction to

\{m_{k}\}

as prescribed in Equation (186) under the condition of quasi-additivity and quasi-independence. We now wish to discuss this reduction.

We first consider the microquantities associated with

\tilde{Σ}

, and prove the following important theorem that plays a central role in the

μ

NEQT. It is stated slightly differently than Theorem 1 quoted earlier and is proved here by justifying Remark 16.

Theorem 7.

Under the assumption of quasi-additivity and quasi-independence, reduced or conditional MI-microquantities given that Σ is in the microstate

m_{k}

are not fluctuating quantities in that they are the same for all k, i.e., they are NFl-macroquantities.

Proof.

Let us first consider

{\tilde{E}}_{k}

introduced in Equation (187). Using Equation (33), we find

{\tilde{E}}_{k} = \sum_{\tilde{k}} p_{\tilde{k}} {\tilde{E}}_{\tilde{k}} = \tilde{E}, \forall k .

(191)

Similarly, using Equation (186a) for

E_{0 k}

, the reduced microenergy of

Σ_{0}

, given that Σ is in

m_{k}

, we find that

E_{0 k} ≐ \sum_{\tilde{k}} p (\tilde{k}| k) E_{0 k_{0}} = \sum_{\tilde{k}} p_{\tilde{k}} (E_{k} + {\tilde{E}}_{\tilde{k}}) .

where we have used quasi-additivity in Equation (123a). We immediately see that

E_{0 k} = E_{k} + \tilde{E}, \forall k .

(192)

We can carry out a similar calculation for

{\tilde{F}}_{w k} ≐ - \partial {\tilde{E}}_{k} / \partial \tilde{W}

, the microforce generated by

\tilde{Σ}

under the condition that Σ is in

m_{k}

following the above reduction on

{\tilde{F}}_{w \tilde{k}} ≐ - \partial {\tilde{E}}_{\tilde{k}} / \partial \tilde{W}

from Equation (17a) with a similar conclusion that

{\tilde{F}}_{w k} = \sum_{\tilde{k}} p_{\tilde{k}} {\tilde{F}}_{w \tilde{k}} = {\tilde{F}}_{w} = F_{0 w}, \forall k,

(193a)

where

F_{0 w}

refers to

Σ_{0}

. However, it should be remarked that the affinity

\tilde{A} = 0

as

\tilde{Σ}

is taken to be in equilibrium. Moreover, as

{\tilde{f}}_{s} = - \tilde{T} = T_{0}

is NFl-field, it is also the same for all

m_{k}

. It thus follows from the discussion that

{\tilde{F}}_{k} = \tilde{F} = ({\tilde{f}}_{s}, {\tilde{F}}_{w}), \forall k .

(193b)

The discussion is easily extended to

d {\tilde{θ}}_{k}

using the same reasoning (shown explicitly below). Thus,

d {\tilde{θ}}_{k} = d \tilde{θ} = d_{e} \tilde{θ} = - d_{e} θ, \forall k .

(193c)

This proves the theorem. □

Equation (193c) is the simplified version of Equation (62b) under quasi-independence, and justifies Equation (64a). The above theorem provides a theoretical justification for Remark 28. In particular, it shows that

\begin{matrix} d_{e} {\tilde{W}}_{k} & = d {\tilde{W}}_{k} = d_{e} \tilde{W}, \end{matrix}

(194a)

\begin{matrix} d_{e} W_{k} & = - d {\tilde{W}}_{k} = - d_{e} \tilde{W} = d_{e} W \end{matrix}

(194b)

in Equation (173). In addition, it also shows that

d_{i} {\tilde{W}}_{k} = d {\tilde{W}}_{k} - d_{e} {\tilde{W}}_{k} = 0,

(195)

as expected.

By replacing the infinitesimals

d_{α}

by the accumulation

Δ_{α}

, properly defined in Section 13, we can obtain similar relations for

Δ_{α} W_{k}

and

Δ_{α} {\tilde{W}}_{k}

.

Remark 51.

The above theorem has a profound implication for what was noted earlier as the conjecture in Equation (7) in Section 1.2. We observe that the right side there is a Fl microquantity, while the left side is a NFl microquantity so they cannot be equated.

Corollary 5.

For

Σ_{0} = Σ \cup \tilde{Σ}

satisfying quasi-additivity and quasi-independence, we have

\begin{matrix} q_{0 k} & = q_{k} + \tilde{q}, \\ F_{0 w k} & = F_{w k} + \tilde{F}, \\ d θ_{0 k} & = d θ_{k} + d \tilde{θ}; \end{matrix}

(196)

for a NFl-

q

, we simply have

q_{0} = q + \tilde{q}

as expected.

Claim 15.

For a system in

m_{k}

,

q_{k}

for a Fl-

q,

F_{k}

, and

d θ_{k}

are random variables, so they are Fl-microquantities.

It follows from the above theorem that the medium is seen only in its average manifestation, and is the hallmark of classical thermodynamics in which exchange quantities become central because of this manifestation. The above conclusions simplify the statistical mechanical formulation of the system in which the microstates of the medium play no interesting role; all SI-quantities pertaining to

m_{k}

are ubiquitous fluctuating microquantities. Examples are

E_{k}, d W_{k}, d Q_{k}, d S_{k}

, etc.

7.6. Clarifying Examples

We now clarify the importance of the theorem and corollary by some simple examples. We focus on the composite system

Σ_{0} = Σ \cup \tilde{Σ}

, and consider microquantities associated with it. We first treat

d_{i} W_{0 k}

by using Equation (164), which yields

d W_{0 k_{0}} \equiv d_{i} W_{0, k_{0}} = d W_{k} + d {\tilde{W}}_{\tilde{k}},

(197a)

which requires reduction to obtain

d W_{0 k}

:

d W_{0 k} \equiv d_{i} W_{0 k} = d W_{k} + d \tilde{W},

where we have replaced

d {\tilde{W}}_{\tilde{k}}

with

d {\tilde{W}}_{k} = d \tilde{W} = {\tilde{F}}_{w} \cdot d \tilde{W} = d \tilde{W}

in view of Equation (193a) after averaging over

{\tilde{m}}_{\tilde{k}}

using Equation (186a) and

p (\tilde{k}| k) = {\tilde{p}}_{\tilde{k}}

. We obtain the internal microwork in

Σ_{0}

, given

Σ

is in

m_{k}

:

d_{i} W_{0, k} = d W_{k} + d \tilde{W},

(198)

This is in accordance with the third equation in Equation (196). We can now average over

m_{k}

to finally obtain the irreversible macrowork

d W_{0} \equiv d_{i} W_{0} = d W + d \tilde{W} = d_{i} W \geq 0,

(199)

where we have used Equation (143) for the inequality.

We also see how the second equation in Equation (173) reduces to Equation (194b). Of course the first equation reduces to

d_{e} W_{0, k} = 0 .

Similarly, after reducing

d_{i} {\tilde{W}}_{\tilde{k}} = d {\tilde{W}}_{\tilde{k}} - d_{e} {\tilde{W}}_{\tilde{k}}

, it is replaced by

d_{i} {\tilde{W}}_{k} = d {\tilde{W}}_{k} - d_{e} {\tilde{W}}_{k},

which in conjunction with Equation (194a), shows that

d_{i} {\tilde{W}}_{k} = d_{i} \tilde{W} = 0,

which simply shows the EQ nature of

\tilde{Σ}

. As

d W_{k} = - d E_{k}

for any body, all above results can be easily applied to microenergies. In particular,

d_{e} E_{k} = d_{e} \tilde{W}, d_{i} E_{0 k} = d E_{k} - d_{e} E = d_{i} E_{k} .

A similar discussion can be carried out for the microheats, following the same arguments as above. We simply quote the results:

\begin{matrix} d Q_{0 k_{0}} & \equiv d_{i} Q_{0, k_{0}} = d Q_{k} + d {\tilde{Q}}_{k}, \end{matrix}

(200a)

\begin{matrix} d_{i} Q_{0, k} & = d Q_{k} + d \tilde{Q} = d_{i} Q_{k}, \end{matrix}

(200b)

\begin{matrix} d_{e} Q_{k} & = d_{e} Q = - d \tilde{Q} = - d_{e} \tilde{Q} . \end{matrix}

(200c)

Using

d {\tilde{Q}}_{\tilde{k}} = T_{0} d {\tilde{\bar{S}}}_{\tilde{k}}

, we have

d {\tilde{Q}}_{k} = T_{0} d {\tilde{\bar{S}}}_{k} = d \tilde{Q} = T_{0} d \tilde{S}

; see Remark 8 and Section 10.1 for the definition of

d {\tilde{\bar{S}}}_{\tilde{k}}

:

d \tilde{\bar{S}} \equiv \sum_{k} {\tilde{p}}_{\tilde{k}} d {\tilde{\bar{S}}}_{\tilde{k}} .

Thus, under the same reduction, we have

d {\tilde{\bar{S}}}_{\tilde{k}} = d \tilde{\bar{S}} = d \tilde{S}

, which is consistent with Theorem 7.

Remark 52.

It should be stressed again that vanishing of NFl

d_{i} {\tilde{W}}_{k}

and

d_{i} {\tilde{Q}}_{k}

does not imply vanishing of Fl

d_{i} {\tilde{W}}_{\tilde{k}}

and

d_{i} {\tilde{Q}}_{\tilde{k}}

, respectively.

For microentropies, we have

\begin{matrix} d {\bar{S}}_{0 k_{0}} & \equiv d_{i} {\bar{S}}_{0, k_{0}} = d {\bar{S}}_{k} + d {\tilde{\bar{S}}}_{\tilde{k}}, \end{matrix}

(201a)

\begin{matrix} d_{i} {\bar{S}}_{0, k} & = d {\bar{S}}_{k} + d \tilde{\bar{S}} = d_{i} {\bar{S}}_{k}, \end{matrix}

(201b)

\begin{matrix} d_{e} {\bar{S}}_{k} & = d_{e} \bar{S} = - d \tilde{\bar{S}} = - d_{e} \tilde{\bar{S}} . \end{matrix}

(201c)

We consider two simple examples. The first one is of an EQ

Σ

at temperature T in a medium

\tilde{Σ}

at temperature

T_{0}

. The only irreversibility is due to the macroheat flow, which results in nonzero

d_{i} S

and no

d_{i} Q

. We will verify these well-known results in our approach.

We have, from the above,

d \tilde{Q} = d_{e} \tilde{Q} = T_{0} d \tilde{S} = T_{0} d_{e} \tilde{S}

(

d_{i} \tilde{S} = 0

as

\tilde{Σ}

is in EQ) so that

d_{e} Q_{k} = - d \tilde{Q} = - T_{0} d_{e} \tilde{S} = T_{0} d_{e} S,

which now justifies Equation (46). To determine

d_{i} {\bar{S}}_{k}

, we use Equation (201b) to obtain

d_{i} {\bar{S}}_{k} = d {\bar{S}}_{k} - \frac{d Q}{T_{0}},

where we have set

d_{e} Q = d Q

as

Σ

is in EQ. Taking its average, we obtain

d_{i} S = (\frac{1}{T} - \frac{1}{T_{0}}) d Q \geq 0

(202)

We see that

d_{i} S \geq 0

is due to macroheat flow. It is easy to see that the inequality is always satisfied and explains why macroheat always flows from hot to cold. We also see that the first term above is

d S = d Q / T

(see the Clausius equality in Equation (45)), and the second term is

d_{e} S = d Q / T_{0} = d_{e} Q / T_{0}

, as first noted in Equation (46). Thus,

d_{i} Q = d Q - d_{e} Q = 0

as noted above. This shows that the physics of

d_{i} Q

is very different from

d_{i} S

. This is easily seen from Equation (47) or (142). As there is no mechanical work involved in this situation,

d_{i} W = d_{i} Q

vanishes. Despite this, we have nonzero

d_{i} S_{k}

and

d_{i} S

. This means that

d_{i} Q

and

d_{i} S

cannot be linearly related, as seen in Equations (47) and (142).

A similar discussion also applies to

d_{α} η_{k}

for various bodies. Assuming quasi-independence, we have

η_{0 k_{0}} = η_{k} + {\tilde{η}}_{\tilde{k}},

we have

d_{α} η_{0 k_{0}} = d_{α} η_{k} + d_{α} {\tilde{η}}_{\tilde{k}},

(203a)

which reduces to

d_{α} η_{0 k} = d_{α} η_{k} + d_{α} {\tilde{η}}_{k} = d_{α} η_{k},

(203b)

where we have used the fact

d_{α} {\tilde{η}}_{k} ≐ \sum_{\tilde{k}} {\tilde{p}}_{\tilde{k}} d_{α} {\tilde{η}}_{\tilde{k}} = \sum_{\tilde{k}} d_{α} {\tilde{p}}_{\tilde{k}} = 0

as will be established later in Theorem 11. In essence, Equation (203b) is no different from Equation (196), as

d_{α} \tilde{η} = 〈d_{α} \tilde{η}〉 = 0

.

We now use the above reduction to determine

d_{e} {\tilde{Q}}_{\tilde{k}} = {\tilde{E}}_{\tilde{k}} d_{e} {\tilde{η}}_{\tilde{k}}

as the microanalog of

d_{e} {\tilde{Q}}_{\tilde{k}}

following Equation (44b). This identification is shown later in Equation (240). Reducing the identity, we obtain

d_{e} {\tilde{Q}}_{k} = \sum_{\tilde{k}} {\tilde{p}}_{\tilde{k}} {\tilde{E}}_{\tilde{k}} d_{e} {\tilde{η}}_{\tilde{k}} = \sum_{\tilde{k}} {\tilde{E}}_{\tilde{k}} d_{e} {\tilde{p}}_{\tilde{k}} = d_{e} \tilde{Q}

, as seen from Equations (239) in which Equation (236) has to be used.

As a second example, we consider

Σ

to consist of a gas or a spring in a fluid, as shown in Figure 3. The medium exerts a pressure

P_{0}

on the gas or the external pulling force

F_{0}

pulling the spring. The deviation of the micropressure

P_{k}

exerted by the gas on the piston or the spring microforce

F_{w k}

induced in the spring from the external pressure

P_{0}

or the pulling force

F_{0}

, respectively, creates a micro- and macro-force imbalance

Δ F_{w k}

and

Δ F_{w}

, respectively. What is surprising is that

F_{0 w k_{0}} \neq 0, \forall k_{0}, even if F_{0 w} = 0

(204)

for

Σ_{0}

. After reduction,

F_{0 w k} = F_{w k} + \tilde{F} = Δ F_{w, k}

.

The distinction between the SI- and MI-quantities (

d W

and

d \tilde{W}

or

d W_{k}

and

d {\tilde{W}}_{\tilde{k}}

) clarifies the confusion about the meaning of work and heat in classical nonequilibrium thermodynamics [39,42] as is evident from the debate in the literature [145,178,179,180,181,182,183,184,185,186,187,188,189,190,191,192,193]. The debate has only recently been clarified [75,76,134,148,149,152,153] by properly making the distinction between the Fl SI-microwork

d W_{k}

, the work done by the Fl SI-microforce

F_{w k}

, and the NFl MI-work

d \tilde{W}

, the work done by the medium by the force

{\tilde{F}}_{w}

exerted by the medium on

m_{k}

. The confusion mentioned above is due to not differentiating the two quantities, even though

F_{w k}

and

{\tilde{F}}_{w}

are in general not equal and opposite. This we present as a claim [150] that is proved by this study; see the discussion following Claim 15, that

Claim 16.

At the microscopic level,

d {\tilde{W}}_{k} \equiv d \tilde{W}

and

d E_{k}

differ by

d_{i} E_{k} \equiv - d_{i} W_{k}

, whether we consider a purely mechanical or a thermodynamic process. This difference, which is ubiquitous, has nothing to do with stochasticity and is a purely mechanical consequence of a microforce imbalance (μFI)

Δ F_{w, k}

in Σ.

Proposition 2.

The contribution

d_{i} E_{k} \equiv - d_{i} W_{k}

is necessary but not sufficient to describe dissipation; see also Conclusion 3.

The same conclusion also applies to the accumulation

Δ_{i} E_{k} \neq 0

along a trajectory

γ_{k}

taken by

m_{k}

during an NEQ process

P

over a time interval

(0, τ)

; see Section 13 for the proper definition. The Proposition follows directly from Equation (204).

8. Properties of Entropy for $M (t)$

We follow Section 5.5 closely. The maximum possible value of

S (t)

for given

\bar{Z} = Z (t) \in S_{Z}

occurs when

m_{k}

are uniquely specified in

S_{Z}

. This makes

S (t)

a state function

S (\bar{Z})

of

\bar{Z}

with no explicit time dependence. Thus,

{S_{\max} (\bar{Z}, t)|}_{\bar{Z} fixed} = S (\bar{Z}) .

(205)

The simplest way to understand the physical meaning is as follows. Consider

\bar{Z}

at some time t. As

S (t)

may not be a unique function of

\bar{Z}

, we look at all possible entropy functions for this

\bar{Z}

. These entropies correspond to all possible sets of

\{p_{k} (t)\}

for a fixed

\bar{Z}

, and define different possible macrostates

\{M\}

. We pick that particular

\bar{M} \in \{M\}

among these that has the maximum possible value of the entropy, which we denote by

S (\bar{Z})

or

S (Z (t))

without any explicit t-dependence. This entropy is a state function

S (\bar{Z})

. For a macroscopic system, this occurs when the corresponding microstate probabilities for

\bar{M}

are equally probable (ep):

{\bar{p}}_{k} (t) \to p_{k}^{ep} = 1 / W (\bar{Z}) > 0, \forall {\bar{m}}_{k} \in Γ (\bar{Z}),

(206a)

so that

S (\bar{Z}) = ln W (\bar{Z}) .

(206b)

We wish to point out the presence of nonzero probabilities in Equation (206a) that explains the comment above of available microstates. Including microstates with zero probabilities will not correctly account for the number of microstates with given

\bar{Z}

.

Remark 53.

All microstates in

M_{ieq}

are equally probable as seen in Equation (206a), which makes

M_{ieq}

the most probable macrostate for the given

\bar{Z}

. Once in

M_{ieq}

, the body will have no memory of its original macrostate, which may not be in IEQ, from which it arises due to evolution in time.

There is an alternative to the above picture in which we can imagine the

Σ

with fixed

\bar{Z}

, which essentially “isolates”

Σ

and converts it into a

Σ_{0}

. Then, as t varies, its entropy increases until it reaches its maximum value

S (\bar{Z})

; see also Proposition 3.

Remark 54.

We emphasize that

\bar{Z} = (E, W)

so

p_{k}

above in Equation (206a) is determined by the average energy E and not by the microstate energy

E_{k}

, as derived later in Section (Section 12.2). The

p_{k}

in Equation (206a) replaces the actual probability distribution in Equation (275) by a flat distribution of height

1 / W (\bar{Z})

and width

W (\bar{Z})

, a common practice in the thermodynamic limit of statistical mechanics [33]. Therefore, there in no fluctuation in

\{p_{k}\}

. Despite this modification, the entropy has the same value for a macroscopic body so β and

F_{w}

are given by Equations (129) and (17b), respectively; see also Section 12.2.

Let us consider a different formulation of the entropy for a nonunique macrostate

M (t) \in

S_{X}

specified by some

\bar{X} = X (t) \subset Z

at some instance t. This macrostate provides a more incomplete specification than in

S_{Z}

. Applying the above formulation to

M \in

S_{X}

, and consisting of microstates

\{{\bar{m}}_{k}\},

forming the set

\bar{m} \equiv m (\bar{X}),

with probabilities

\{{\bar{p}}_{k} (t)\}

, we find that

S (\bar{X}, t) \equiv - \sum_{k = 1}^{W (\bar{X})} {\bar{p}}_{k} (t) ln {\bar{p}}_{k} (t),

(207)

is the entropy of

M

; here

W (\bar{X})

is the number of distinct microstates

{\bar{m}}_{k}

. It should be obvious that

W (\bar{X}) \equiv \sum_{ξ (t)} W (\bar{Z}) .

Again, under the equiprobable (ep) assumption

{\bar{p}}_{k} (t) \to {\bar{p}}_{k}^{ep} = 1 / W (\bar{X}), \forall {\bar{m}}_{k} \in Γ (\bar{X}),

Γ (\bar{X})

denoting the sample space spanned by

\bar{m} = \{{\bar{m}}_{k}\}

, the above entropy takes its maximum possible value

S_{\max} (\bar{X}, t) = S (\bar{X}) = ln W (\bar{X}),

(208)

which is the well-known value of the Boltzmann entropy for a body in equilibrium

S (\bar{X}) = ln W (\bar{X}),

(209)

and provides a statistical definition of, and hence connects it with, the thermodynamic entropy of the body proposed by Boltzmann [46,47,131]. The maximization again has the same implication as in Equation (205): For given

\bar{X}

, we look for the maximum entropy at all possible times. It is evident that

S (\bar{Z}, t) \leq S (\bar{Z}) \leq S (\bar{X}) .

(210)

Thus, the NEQ entropy

S (\bar{Z}, t)

as

t \to τ_{eq}

, the equilibration time, reduces to

S (\bar{X})

in EQ, as expected. Before equilibration,

S (\bar{Z})

in

S_{Z}

remains a nonstate function

S (\bar{X}, t)

in

S_{X}

, where we do not invoke

ξ

. It is the variation in

ξ

that is responsible for the time variation in

S (\bar{X}, t)

. A simple proof of this conclusion is given in Section 12.6; see Remark 48 also. We can summarize this conclusion as

Conclusion 4.

The variation in time in

S (\bar{X}, t)

in

S_{X}

is due to the missing set of internal variables ξ.

We now revert back to the standard use of

X

, and

Z

. Let us consider an isolated body

Σ_{b}

out of equilibrium so that its macrostate

M_{neq}

in

S_{X}

spontaneously relaxes towards

M_{eq}

at fixed

X

. Its entropy

S (X, t)

has an explicit time dependence, which continues to increase towards

S (X)

. For such NEQ macrostates, the explicit time dependence in

S (X, t)

is explained by introducing

ξ

to make their entropies a state function in an appropriately chosen larger state space

S_{Z}

[148] as explained later in Section 12. It is also shown there that an NIEQ macrostate with entropy

S (Z, t)

may be converted to an IEQ macrostate with a state function entropy

S (Z^{'})

by going to an appropriately chosen larger state space

S_{Z^{'}}

spanned by

Z^{'}

with

S_{Z}

its proper subspace. Therefore, in most cases of interest here, we would be dealing with a state function and usually write it as

S (Z)

, unless a choice for

Z

has been made based on the experimental setup, as discussed in Section 12. In that case, we must deal with a pre-determined state space

S_{Z}

so that some NEQ macrostates that lie outside

S_{Z}

have their entropy of the form

S (Z, t)

in

S_{Z}

as we cannot use the larger state space

S_{Z^{'}}

.

It should be clear now that the explicit time dependence in an NEQ macrostate in

S_{X}

with a nonstate function entropy

S_{neq} (t) ≐ S (X, t)

is due to additional state variables in

ξ

and that this NEQ macrostate may be converted into an IEQ macrostate with a state function entropy

S_{ieq} (Z)

by going from

S_{X}

to an appropriately chosen larger state space

S_{Z}

. Similarly, an NIEQ macrostate

M_{nieq}

in

S_{Z}

with a nonstate function entropy

S_{nieq} (t) ≐ S (Z, t)

is converted to

M_{ieq}^{'}

in an appropriately chosen larger state space

S_{Z^{'}}

with a state function entropy

S_{ieq} (Z^{'})

. The additional internal variables

ξ^{'}

in

Z^{'}

that are over and above

ξ

in

Z

give rise to additional entropy generation as they relax for fixed

Z

. This results in the following inequality:

S_{ieq} (Z) \geq S_{ieq} (Z^{'}) = S_{nieq} (Z, t) .

(211)

However, if the choice for

Z

has been made based on the experimental setup and the observation time

τ_{obs}

(see Section 12), we must restrict our discussion to

S_{Z}

so that we must consider

M_{nieq}

in

S_{Z}

the following. This will be done in Section 12.6; see Remarks 45 and 48.

8.1. System in a Medium and Quasi-Independence

The above formulation of

S (\bar{Z}, t)

can be applied to

Σ, \tilde{Σ}

, and

Σ_{0}

. We assume that

Σ

, and

\tilde{Σ}

are quasi-independent so that

S_{0} (t)

can be expressed as a sum of entropies

S (t)

and

\tilde{S} (t)

of

Σ

and

\tilde{Σ}

, respectively:

S_{0} (t) = S (t) + \tilde{S} (t) .

(212)

This follows immediately from Definition 28 and the observation that three entropies are given by the same formulation as in Equation (26a).

In the derivation of the above additivity (see [148]), we have neither assumed the medium nor the system to be in internal equilibrium; only quasi-independence is assumed. The above formulation of the additivity of statistical entropies will not remain valid if the two are not quasi-independent. From this, we also conclude that the entropy additivity will not be true in the absence of quasi-independence.

8.2. Second Law Postulate of NEQ Entropy S

The uniqueness issue about the NEQ macrostate says nothing about the entropy of an arbitrary (so it may be nonunique) macrostate

M : \{m_{k}, p_{k}\}

, which is always given by the Gibbs entropy in Equation (26a), as derived in Section 5.5; see also [72]. In the demonstration,

M

is not required to be uniquely identified. This entropy satisfies the law of increase of entropy, as is easily seen by the discussion by Landau and Lifshitz [33] for an NEQ ideal gas [194] in

S_{X}

to derive the equilibrium distribution. Thus, the form in Equation (26a) is not restricted to only uniquely identified

M

’s. We now enunciate the central theme of the NEQT, known as the Second Law.

Proposition 3.

The Second Law The NEQ Gibbs entropy

S_{0} (X_{0}, t)

of an isolated system

Σ_{0}

is bounded above by its equilibrium entropy

S_{0} (X_{0})

and continuously increases towards it so that [33]

d S_{0} (X_{0}, t) / d t \geq 0 .

(213)

This proposition is not a part of our axiomatic formulation so it needs to be justified within this formulation. We will do so below by two independent approaches. The second law in standard textbooks is usually stated to be applicable to the universe as a paradigm of an isolated system [195]. However, the universe here cannot represent the entire physical universe as this creates many unsolved issues [196]. Therefore, we will interpret the universe as a causally bounded region of space, which we treat as an isolated system [197], for which the above law applies; see also [162,195].

8.3. A Proof of the Second Law

The second law has been proven so far under different assumptions ([54,57,79,174,176], among others). Here, we provide a simple proof of it based on the postulate of the flat distribution; see Remark 54. The current proof is an extension of the proof given earlier; see ([79], Theorem 4). We consider an isolated system

Σ_{0}

for which the second law is expressed by Equation (213) so we must use the state space

S_{X_{0}}

. For simplicity, we suppress the suffix 0 from all the quantities in this section. As the law requires considering the instantaneous entropy as a function of time, we need to focus on the sample space at each instant to determine its entropy S as a function of time. At each instance, it is an ensemble average over the instantaneous sample space

Γ (t)

formed by the instantaneous set

m (t)

of available microstates in

S_{X}

; see Equation (26a) or (116). This should make it clear that our approach has nothing to do with ergodicity, which requires averaging any quantity defined for a single microstate at each instant over a very long time period; see Remark 36. The sample state

Γ_{ergo} (t)

in the ergodic hypothesis always contains a single microstate. Thus, the issue of any ensemble average at each instant does not arise. In addition, the ergodicity principle deals only limiting average over an extremely long time evolution over

Γ_{ergo} (t)

. In our approach, we are averaging over the set

m (t)

in

Γ (t)

of available microstates at each instant to determine the entropy

S (t)

as a function of time, which is what is required for the second law formulation in Equation (213). As we are only interested in the behavior of the entropy at each instant, we will use the flat distributions for the microstates at each instance (see Remark 54) so that the entropy is given by Equation (206b).

To prove the second law (see Proposition 3), we proceed in steps by considering a sequence of sample spaces belonging to

Γ

as follows [79,176]. At a given instant,

Σ

happens to be in some microstate. We start at

t = t_{1} = 0

, at which time it happens to be in a microstate, which we label

m_{1}

. It forms a sample space

Γ_{1}

containing

m_{1}

with probability

p_{1}^{(1)} = 1

, with the superscript denoting the sample space index. We have

S^{(1)} = 0

. At some

t = t_{2} > t_{1}

, the sample space is enlarged from

Γ_{1} = (m_{1})

to

Γ_{2} = (m_{1}, m_{2})

, which now contains two macrostates

m_{1}

and

m_{2}

, with probabilities

p_{1}^{(2)}

and

p_{2}^{(2)}

, respectively. The enlargement is due to the one-to-many mapping discussed in Section 1 and expressed in Equation (6). At

t_{2}

,

m_{1}

randomly evolves into a different

m_{2}

. As explained above, we need both microstates at

t_{2}

to determine the entropy. Using the flat distribution, the entropy is now

S^{(2)} = S_{\max}^{(2)} = ln 2

. At some

t = t_{3} > t_{2}

,

Γ_{2}

is enlarged to

Γ_{3} = (m_{1}, m_{2}, m_{3})

containing three distinct microstates

m_{1}, m_{2}

, and

m_{3}

so that the entropy becomes

S^{(3)} = ln 3

. At some

t = t_{3} > t_{2}

, the enlarged sample space will include three distinct microstates

m_{1}, m_{2}

, and

m_{3}

so that the entropy becomes

S^{(3)} = ln 3

. We just follow the system in a sequence of time so that at

t = t_{n}

, we have a sample space

Γ_{n} = (m_{1}, m_{2}, \dots, m_{n})

containing n distinct macrostates so that

S^{(n)} = ln n

. Continuing this until all microstates in

Γ

have appeared, we have

S_{\max} = ln W

.

We now discuss the significance of using flat distributions at each time t so we can apply Bolzmann’s formula

S (t) = ln W (t)

for the entropy, called Boltzmann’s principle [198] by Einstein; see Equations (206b) or (209). Their use means that we are neglecting fluctuations in the temporal entropy

S (t)

when the instantaneous distribution is not exactly a flat distribution. As fluctuations are overlooked in thermodynamics, use of this distribution gives the entropy of the most probable macrostate at each

t_{n}

, with

S_{n} \geq S_{n - 1}

. In contrast, Gibbs formulation provides the entropies of instantaneous macrostates with

\{p_{k}\}

that may be different from a flat distribution that occur during the period

(t_{n - 1}, t_{n})

. These macrostates give rise to fluctuations that happen between

S_{n - 1}

and

S_{n}

, and have been investigated earlier [79].

We now make a very important observation that shows how our proof differs from the approach involving the extremely special assumption of molecular chaos [93] made by Boltzmann to establish the H-theorem for the evolution of

M

to

M_{eq}

; see also Section 1 for a brief historical review. The theorem uses the Boltzmann kinetic gas equation for the single-particle distribution

f (r, p)

along with the molecular chaos assumption, a probabilistic concept. Boltzmann recognized that the assumption is central to derive irreversibility. To date, there has been no convincing argument to justify the assumption, which is not surprising as there are examples, such as the velocity inversion in spin-echo experiment or Zermelo’s paradox [92], where the assumption and the H-theorem fail. If that happens, it will not be possible to distinguish between reversible and irreversible processes, as argued by Prigogine [199]. Lanford [200] has shown that the H-theorem is valid not only under the molecular chaos assumption (no correlations), but also only in the limit of vanishing particle size and density. Kac [201] argued that the unjustifiable assumption must not be used for the derivation of the very general law of the increase in entropy. This is understandable as “... it has never been possible to extend Boltzmann’s argument to wider classes of systems. A quite different point of view thus has to be adopted...”, to quote Henin and Prigogine [202]. By investigating Kac’s ring model, Fernando [79,94] observed that the molecular chaos assumption is not unique for irreversibility to emerge, contradicting the above claim of Boltzmann about its centrality. It is important to emphasize that Boltzmann’s molecular chaos cannot handle many-particle interactions. Boltzmann seems to be completely unaware of these shortcomings. Considering all these limitations, we come to the following:

Claim 17.

The molecular chaos assumption can neither be taken seriously to prove the second law nor extended to all cases of interest such as to deterministic microstates that form the basis of the μNEQT.

The most common approach to overcome the above limitations is to assume master equations [54] to justify this theorem instead of assuming molecular chaos [54,55,56,57,58]. We avoid both of these assumptions, which are probabilistic in nature. It is important to emphasize that Boltzmann’s molecular chaos cannot handle many-particle interactions so such a concept is not applicable to the deterministic microstates (see Definition 4), which are our concern. Instead, we use the Boltzmann formulation, the Boltzmann principle [198], of the entropy in terms of just the number of distinct microstates not only at EQ (see Equation (209)) but at all times

t > 0

. As microstates

\{m_{k}\}

are determined by the deterministic Hamiltonian of the system including all of the inter-particle interactions, they are independent not only of each other, but also of

\{p_{k}\}

; see Definition 4. This means that as

Σ

probes more and more microstates, there is no correlation among them. Because of this, we are able to avoid the shortcomings of molecular chaos, which is avoided as said above in Claim 17. The microstates appear randomly, so which ones appear and the order of their appearance are also random. Despite this, the number

W (t)

is an integer, not a random variable, and determines the instantaneous microstate probabilities

\{p_{k}\}

of their frequency of appearance at t; see Equation (111).

Proposition 4.

The microstate number

W (t)

for the isolated system

Σ_{0}

is a pure number that increases monotonically with t, whether we start counting them from

t = 0

(

W (0) = 1

) or some time

t = t^{*} > 0

(

W (t^{*}) = 1

). It is oblivious to which ones arise and their order, which are required to determine

\{p_{k}\}

.

Proposition 5.

The number

W (t)

of distinct microstates passed by the system past

t = t^{*}

cannot ever decrease.

Remark 55.

Propositions 4 and 5 are self-evident.

The above proof of the second law is simply based on the idea of how microstates accumulate in time, as given in Proposition 4. In time, the system will pass through more and more microstates with a concomitant increase in the entropy

S (t)

, assuming flat distributions. Eventually, at

t = τ_{eq}

, all microstates will have appeared once, and their number

W_{0} = W (τ_{eq})

is the total number of distinct microstates of the isolated system. This results in the maximum entropy

S_{\max} = ln W_{0}

.

For a macroscopic system, the probability of a microstate repeating itself initially for

t < τ_{eq}

is negligible, being of the order of

1 / W_{0}

. Thus, initially all microstates are almost distinct and give rise to flat distributions

\{p_{k} = 1 / W (t)\}

at each t as used above. However, we note that during this period, there will be fluctuations in the entropy when we do not have a flat distribution. However, as we are not concerned with fluctuations in thermodynamics (they are important in statistical mechanics), the flat distribution is quite appropriate. For

t > τ_{eq}

, some microstates begin to occur more than once, and we will again have fluctuations, which we have disregarded in the proof. At

t = 2 τ_{eq}

, almost all microstates will have appeared twice but we still have

\{p_{k} = 1 / W_{0}\}

so that the entropy remains at its maximum value

S_{\max}

for all

t > τ_{eq}

.

We now have the following:

Theorem 8.

Under the assumption of flat distributions, Proposition 4 forms the basis of the second law of thermodynamics for the isolated system that

S (t)

is monotonically increasing until it reaches its maximum value

S_{\max} = ln W_{0}

at

t \geq τ_{eq}

.

Proof.

See the discussion above. □

The issue of fluctuations has been discussed at length elsewhere ([79], Figure 6 and its discussion), which shows that the second law is an average law having fluctuations that become insignificant as the size of the system becomes larger and larger. Thus, it is conceivable that in some isolated cases, the second law is violated and the entropy decreases over a finite period of time. But this will not happen in the majority of cases for a macroscopic system. In other words, in most of the experiments, the chance of observing a violation of the second law is extremely low, almost negligible, to the point that we would never observe such an event in our lifetime [203], which also shows a deep connection of the second law with causality.

We defer the critical discussion of this issue to the next section. Here, we only discuss its very small possibility. It should be noted that Maxwell [50] had proposed a device involving his famous demon that is capable of violating the second law. As the violation is not considered a physical reality, it is termed the demon paradox that needs to be explained. Various attempts have been made to clarify the paradox. Szilard [68] proposed the cost of information to clarify the paradox. Later, Brillouin [69] showed that the demon is not capable of violating the second law by carrying out a careful analysis by taking into account a light source to help the demon see and sort molecules. Without light, the demon cannot sort out molecules. Similarly, Smoluchowski [89] also argued that the demon cannot violate the second law by taking into account thermal fluctuations. More recently, we have also investigated the demon paradox and used internal variables [204] and probability arguments [205] to explain it. The investigation of the demon paradox and its successful explanation is clear evidence that any so-called violation of the second law is a consequence of an incomplete or improper analysis; see also Kostic [206] and Norton [207].

As part of our attempt to demonstrate temporal asymmetry or inhomogeneity, we need to show why this probability should be so small. As an example, we consider the demon paradox. Let

x = β ϵ = β ϵ

, where

ϵ

is the energy of a particle. Let a very small but nonzero positive quantity

δ \sim 10^{- 10} - 10^{- 11}

be the limit of the demon’s precision so that it treats all the particles with x in the window

(\bar{x} - δ, \bar{x} + δ)

as particles identified by

x_{mp}

as the particles with most probable energies (around the mean

\bar{x} = 3 / 2

, and the standard deviation

σ = \sqrt{3 / 2}

). It also treats particles with

x < \bar{x} - δ

as slow particles

x_{s}

, and particles with

x > \bar{x} + δ

as fast particles

x_{f}

, respectively. We consider

N = 10^{24}

, and

α = {(δ / σ)}^{2} / 2 = 10^{- 22}

. As the demon observes many slow and fast particles, we need to consider the probability distribution

\bar{f} (x)

of

x = \sum_{i = 1}^{N} x_{i} / N

of independent and identically distributed random variables

x_{i}

of the ith particle [205]. It is found that

{\bar{f}}_{b} (x_{s}) or {\bar{f}}_{b} (x_{f}) ≲ \sqrt{N} e^{- N α} = 10^{12} e^{- 100},

which is ≈

3.758 \times 10^{- 32} \approx 0

. Therefore, fast and slow particles have extremely low probabilities, and make no difference in determining the temperature, which is determined by

x_{mp}

alone. The example clearly shows that thermodynamics is governed by the most probable state, so the demon is not successful in creating a temperature difference. As

W (t)

cannot decrease with time, there is no possibility of observing a violation of the second law with appreciable probability. Indeed, we show in Section 9 that the violation will invalidate Axiom 4, which is the cornerstone of the stability observed in nature.

In any case, the probabilistic interpretation needs to be exploited, as we do here, for a proper understanding of the second law, which merely states that it is nothing but the reflection of the most probable event in probability theory [114]. To appreciate this, we note the Gibbs formulation [48,54,55,57] of the entropy

S (t)

in Equation (26a) for an isolated system. These probabilities are continuous functions of time and ensure that

S (t)

is a continuous function of t. How these probabilities are to be determined or defined has been analyzed earlier [79,176], where we have discussed two possible approaches, the ensemble-based and the temporal-based, to define these probabilities. Both are standard approaches [33] and their equivalence is needed for establishing ergodicity. Determining these probabilities is discussed in Section 12.2. As shown by Tolman ([54], Section 106, where Boltzmann’s

H = - S

is considered), Rice and Gray ([55], see Section 3.3), Rice ([57], Ch. 17), and several other authors, this entropy for an isolated system cannot decrease with time. This expected behavior, which is in accordance with the second law, is shown by the curve OA in Figure 4. If we perform time-reversibility operation (

|t| \to \bar{t} ≐ - |t|

) at

t = 0

, the entropy will follow OB, and not the continuation of AO to negative t. The increase along OB as

\bar{t}

decreases follows from the accumulation of microstates used above to prove the second law. If, instead, the time-reversibility (

|t - t_{0}| \to \bar{t} ≐ - |t - t_{0}|

) is performed at some instance

t = t_{0}

at O

_{0}

, then the entropy will follow O

_{0}

C; it most certainly does not follow O

_{0}

O, the continuation of AO

_{0}

for

t < t_{0}

. Thus, the second law shows temporal asymmetry.

For a reversible process, the entropy of each macrostate

M_{eq} (t) \in S_{X}

of a body along the process is a state function of

X (t)

, but not for an irreversible process for which

M_{neq} (t) \notin S_{X}

. Their entropies are written as

S (X (t), t)

[75,76] with an explicit time dependence. In general [33,75,76,79],

S (X (t), t) \leq S (X (t)); fixed X (t) .

(214)

The equilibrium values of various entropies are always denoted with no explicit time dependence, such as by

S_{0} (X_{0})

for

Σ_{0}

. These entropies represent the maximum possible values of the entropies of a body as it relaxes and comes to equilibrium for a given set of observables. Once in equilibrium, the body will have no memory of its original macrostate; compare with Remark 53. Being observables, the set

X_{0}

, which includes its energy

E_{0}

among others, remains constant for

Σ_{0}

as it relaxes. This notion is also extended to a body in internal equilibrium.

Thus, we have proven the second law in accordance with Proposition 3 without any unsubstantiated approximation.

8.4. Second Law as a Consequence of Stability

A careful reader should have noted by now that all we have done is to use inequalities resulting from the second law, but we have not postulated anything, either by itself or as a part of Axiom 2 in our axiomatic formulation of the

μ

NEQT and the MNEQT. We now wish to emphasize that there is no need to do this, which clarifies its absence. In this regard, we deviate from Callen [3] for MEQT, who uses it as part of his Postulate II. The reason is that, as demonstrated below in Theorem 9, it is a direct consequence of Axiom 4, which is an extension of Postulate I of Callen to NEQ macrostates

M_{neq}

.

To show this, we consider

Σ

embedded in

\tilde{Σ}

, the latter in EQ, so it is specified by its macrofields

T_{0}, P_{0}

, etc. We assume

Σ

not in EQ with

\tilde{Σ}

, so the differences in their fields are given by

Δ F

in Equation (76d). In view of Remark 45, we use

S_{Z}

in which

M_{nieq}

happens to be

M_{ieq}

. We now prove the following:

Theorem 9.

The second law is a direct consequence of the requirement of the Stable Equilibrium (Axiom 4) for a thermodynamic system.

Proof.

We recall Claim 9, and apply it to any

M_{ieq}

in

S_{Z}

. Using Axiom 4, we conclude that

M_{ieq}

must approach the stable EQ macrostate

M_{eq}

, which requires

Δ F \to 0,

which can be expanded to

T \to T_{0}, P \to P_{0}, μ \to μ_{0}, \dots, A \to 0 .

We now rewrite the second equation in Equation (142) in the following form:

d_{i} S = (β - β_{0}) d_{e} Q + β d_{i} W,

(215)

where we have used inverse temperatures, and

d_{i} W

is given explicitly in Equation (136), which we reproduce below:

d_{i} W = (P - P_{0}) d V - (μ - μ_{0}) d N + \dots + A \cdot d ξ \geq 0,

(216)

having various contributions in

S_{Z}

. The first two terms refer to irreversibility caused by exchanges with

\tilde{Σ}

, similar to the exchange macroheat term in Equation (215), and the last term refers to irreversibility caused by internal processes.

The first term in Equation (215) represents the stochastic contribution and the second term is the mechanical contribution. We analyze each term separately. Let us assume that

β_{0} > β

(

T > T_{0}

). For

T \to T_{0}

, Σ must lose energy in the form of exchange macroheat with

\tilde{Σ}

so

d_{e} Q < 0

, which means that the resulting irreversible entropy

d_{i} S^{Q} = (β - β_{0}) d_{e} Q \geq 0

. We now turn to the mechanical contribution in Equation (216), and consider various terms in it. For the first term

(P - P_{0}) d V

, we assume

P > P_{0}

. This means that the volume of Σ will increase in accordance with the laws of mechanics. This results in the corresponding irreversible entropy

d_{i} S^{V} = β (P - P_{0}) d V \geq 0 .

(217a)

We assume

μ > μ_{0}

for the second term. This means that

d N < 0

to bring μ closer to

μ_{0}

until

μ \to μ_{0}

. The corresponding irreversible entropy

d_{i} S^{N} = - β (μ - μ_{0}) d N \geq 0 .

(217b)

Similar arguments apply to missing terms in

d_{i} W

. This brings us to the last term in

d_{i} W

. To be specific, we consider the middle term

- V \cdot d P_{BP}

in Equation (137) as an example of this term; here,

d P_{BP} = F_{wBP} d t

is the change in the linear momentum of the Brownian particle experiencing a macroforce

F_{wBP}

, and

V

is its relative velocity with respect to the center of mass of the system [157]; see also Equation (320a) later. The stable EQ corresponds to a vanishing relative velocity so that there is no motion. For this to happen, the macroforce

F_{wBP}

must oppose this motion as happens in mechanics. Consequently, the corresponding irreversible entropy

d_{i} S^{BP} = - β V \cdot d P_{BP} \geq 0

. As a second example, we consider the macroaffinity

A_{V}

obtained in Equation (104c). It is given by

A_{V} = n_{1} n_{2} (P_{1} - P_{2}) .

With

d ξ_{V}

given in Equation (104a), and by a straightforward manipulation, we find that

A_{V} d ξ_{V} = (P_{1} - P_{2}) d V_{1},

(218)

which is precisely the first term in

d_{i} W

above so it is also nonnegative; see Equation (217a).

Claim 18.

The exercise to obtain Equation (218) also shows that the affinity term in

d_{i} W

in Equation (216) behaves identically to other mechanical terms under the condition of stability.

Finally, the sum

d_{i} S

of all these irreversible entropies follows the inequality

d_{i} S \geq 0,

(219)

which is the statement of the second law for an interacting system. For an isolated system, it reduces to Proposition 3, codified in Equation (213). □

Remark 56.

The form of the first two terms in

d_{i} W

in Equation (216) is not the most general form. From Equations (76b) and (76c), the most general form of the missing term is

(f_{w} - f_{0 w}) d_{e} w + f_{w} d_{i} w,

in which the first term is due to exchange displacement

d_{e} w

as the first two terms in Equation (216), and the second term is due to the irreversible internal displacement as the last term in Equation (216). It follows from Claim 18 that both terms above give a nonnegative irreversible entropy contribution, which makes Equation (219) a general result.

Conclusion 5.

The above theorem shows that there is no need to include the second law as an additional part of Axiom 4 in the axiomatic formulation of the MNEQT. In this sense, the second law is not a fundamental law in our formulation; it is merely a consequence of Axiom 4.

The above discussion now justifies that stability requires that the energy be a convex function upwards and the entropy a convex function downwards as shown in Equation (106) for Axiom 4.

9. Devastations Caused by Second Law Violation

As mentioned briefly in the previous section, we now wish to critically investigate the resulting thermodynamics if we dispose of the second law completely. We will call the resulting thermodynamics the violation thermodynamics and denote it by

\overset{ˇ}{M} NEQT

to draw attention to the this fact. The arbitrary macrostates in the

\overset{ˇ}{M} NEQT

will be denoted by

\overset{ˇ}{M}

in this section. A more detailed discussion will be presented elsewhere.

The violation of the second law in the

\overset{ˇ}{M} NEQT

will result in strict inequalities

d_{i} S \leq 0, d_{i} Q = d_{i} W \leq 0,

(220)

which only characterize the behavior of macrostates. Observe that we have included the equalities, which are also present in Equation (219). We will call systems with equalities to be in EQ, and call a system in NEQ when we have strict inequalities, using the same terminology as in the MNEQT. Thus, a system is allowed to be prepared in EQ. This is precisely how Maxwell had introduced his demon paradox, so the requirement is standard. This is also of technical and experimental importance when one of the systems happens to be a medium

\tilde{Σ}

, which is always taken to be in EQ. At the level of microstates, the second law plays no role; see Remark 1. Therefore, we will be contrasting the

\overset{ˇ}{M} NEQT

with the MNEQT. We first note what is common to both. The discussion involving the use of entropy in Equation (220) clearly implies that the existence of entropy in Axiom 2 must still be accepted. The derivation of entropy in Section 5.5 does not depend on whether the second law holds or not, so this form will still survive. Axioms of additivity 5, quasi-independence 6, and reduction 7 will survive as well. Similarly, the notion of internal variables based of the properties of the Hamiltonians of various bodies as discussed in Section 4, and the notion of partition in Equation (14a) also survive. This also means that the notion of macroheat and macrowork as developed later in Section 10.1 in

S_{Z}

, and which is based on the first law, survives in the

\overset{ˇ}{M} NEQT

. The identification of an IEQ macrostate

{\overset{ˇ}{M}}_{ieq}

as one whose entropy is a state function also survives in the

\overset{ˇ}{M} NEQT

. Accordingly, the Gibbs fundamental relation in Equation (131) does not change, along with the definition of

T, P

, etc. We thus see that there is a lot that is common to both, so most of the notations remain the same. We will only be considering entropies to be state functions here. The discussion can be easily extended to entropies that are not state functions by following the procedure of Section 5.9. We will not do that here.

Conclusion 6.

The entire discussion of the MNEQT and the μNEQT can be carried out verbatim except the second law inequalities used earlier must be replaced by Equation (220) to investigate the consequences of the violation.

The conclusion is useful, as it makes investigation of the violation extremely simple so that we can determine what other changes have to made now in the MNEQT. We proceed as follows.

We consider an isolated system

Σ_{0}

of energy

E_{0}

and consisting of two subsystems

Σ_{1}

and

Σ_{2}

, each having the same number of particles of the same kind. To simplify the discussion, we will consider a particular evolution of the microstate

{\overset{ˇ}{M}}_{0} (t)

of

Σ_{0}

, during which the subsystems are always in EQ macrostates

{\overset{ˇ}{M}}_{1} (t)

and

{\overset{ˇ}{M}}_{2} (t)

, respectively, at all times, but may or may not be in EQ with each other. We use

\overset{ˇ}{P}

to denote this special process, during which their entropies remain state functions in

S_{X}

at all times. We also assume that their initial temperatures are

T_{1} (0)

and

T_{2} (0) > T_{1} (0)

, with

Δ T (0) = T_{2} (0) - T_{1} (0) > 0, Δ β (0) = β_{2} (0) - β_{1} (0) < 0

, pressures are

P_{1} (0)

and

P_{2} (0) > P_{1} (0)

, with

Δ P (0) = P_{2} (0) - P_{1} (0) > 0

, etc. Thus, we will be considering the irreversibility produced by exchanges only, but the irreversibility in the

\overset{ˇ}{M} NEQT

results in the violation; there are no irreversible processes within each subsystem as they are always in EQ, which simplifies the discussion considerably. We only consider positive temperatures here. For simplicity, we will suppress t in the following, unless clarity is needed.

We consider two different situations involving (stochastic) macroheat and (mechanical) macrowork separately for the clarity of the investigation; see Definition 23, and also Section 10.2.

9.1. Macroheat Exchanges

We first assume that no macrowork is exchanged between the subsystems so their volumes do not change. As their temperatures are different, there is macroheat exchange, but

d_{e} Q_{0} = d_{e} Q_{1} + d_{e} Q_{2} = 0

at all times. We set

d Q = d Q_{1} = T_{1} d S_{1} = d_{e} Q_{1}

, and

d Q_{2} = d_{e} Q_{2} = T_{2} d S = T_{1} d S_{1}

. We have, using Axioms of quasi-independence 6,

d S_{0} = d_{i} S_{0}^{Q} = d Q (β_{1} - β_{2}) = Δ T d Q / T_{1} T_{2},

(221a)

which is identical to Equation (126) obtained as promised. It is also similar to Equation (202) derived earlier but for a system in a medium. For the violation to occur, we must impose

d_{i} S_{0}^{Q} < 0

for

Δ T > 0

, which requires that

d Q < 0,

(222)

implying that heat

d Q = d E_{1}

is flowing out of

Σ_{1}

at a lower temperature into

Σ_{2}

at a higher temperature. We observe that Equation (221a) is valid for all times t. As a consequence and in accordance with Axiom of additivity,

E_{1}

decreases and

E_{2}

increases. We introduce the energy difference

Δ E ≐ E_{2} - E_{1} = E_{0} - 2 E_{1} .

Because of the sign of

d Q

,

Δ E

increases, making

{\overset{ˇ}{M}}_{1}

and

{\overset{ˇ}{M}}_{2}

move farther apart, instead of getting closer, in their energies and temperatures, with no hope of them getting towards a stable EQ

{\overset{ˇ}{M}}_{0 eq}

, from which

Σ_{0}

will never leave and where

E_{10}

is the energy of

Σ_{1}

at absolute zero, which will play an important role in the analysis as

E_{1} (t) \geq E_{10}

.

The most important question we need to investigate now is if there exists a unique

{\overset{ˇ}{M}}_{0 eq}

for

{\overset{ˇ}{M}}_{0}

, and whether

Σ_{0}

will approach it in time. This requires investigating what happens to the temperatures defined by

T_{l} = \partial E_{l} / \partial S_{l}

for

Σ_{l}, l = 0, 1

, and 2. This issue was discussed in Section 5.7. That discussion can be carried out without affecting Equations (125a) and (125b) even under the violation. We need to investigate the following two possibilities.

9.1.1. $E_{l}$ Monotonically Increases with $T_{l}$

This means that both systems have positive heat capacities, which is a requirement of stability. It follows from this choice that

β_{1} (t)

is a monotonically increasing function, and

β_{2} (t)

a monotonically decreasing function of t, making

Δ β (t)

decrease with t. Thus,

Σ_{0}

will never approach

{\overset{ˇ}{M}}_{0 eq}

with time, which violates Axiom 4. This axiom is our first casualty of the violation, and has to be abandoned.

The above behavior of energy means that the entropy is also an increasing function of the temperature for both systems. Thus,

S_{1} (t)

decreases and

S_{2} (t)

increases in time, making their disparity also increase in time. Their sum, however, continues to decrease as a function of t because of the violation of the second law. We also see from Equation (221a) that

|d_{i} S_{0}^{Q} (t)| \propto |Δ β (t)|,

(223a)

implying that the degree of violation gets larger and larger. Let

E_{10}

denote the energy of

Σ_{1}

at absolute zero, which will play an important role in the analysis as

E_{1} (t) \geq E_{10}

. Thus,

Δ E (t) \leq E_{0} - 2 E_{10},

which puts a very important mechanical constraint on Equation (220) along

\overset{ˇ}{P}

. At some time

t = t_{term}

, we have

E_{1} \to E_{10}

(

E_{2} \to E_{0} - E_{10}

) so that the energy exchange will terminate and

T_{1} \to 0

as there is no more energy left for exchange to

Σ_{2}

. As

E_{2} (t_{term})

approaches a finite value,

T_{2} (t_{term})

also approaches a finite value. As the derivation of Equations (125a) and (125b) is also applicable in the

\overset{ˇ}{M} NEQT

, we have

T (t \to t_{term}) = 0, A (t \to t_{term}) = 1 .

(224a)

As

A \neq 0

, the terminal macrostate

{\overset{ˇ}{M}}_{0} (t_{term})

is not an EQ macrostate, which should be obvious due to the temperature inhomogeneity.

What about the entropy

S_{0}

of

Σ_{0}

. From Equation (221a), we observe that

|d_{i} S_{0}^{Q}| \to \infty,

(224b)

which results in an entropy catastrophe

S_{0} (t_{term}) \to - \infty,

(224c)

which is an impossibility in view of its Gibbs formulation in Equation (26a), and results in an internal inconsistency. It also makes the third law the second casualty of the violation, as it has to be abandoned. This is not surprising, as the third law is a consequence of mechanical stability of the ground state (

T = 0

) of a system [33] and

{\overset{ˇ}{M}}_{0} (t_{term})

at

T = 0

is not a uniform ground state.

Remark 57.

While satisfying the mechanical constraint at

t = t_{term}

cannot be denied, there is no such constraint on the value of the entropy of

{\overset{ˇ}{M}}_{0} (t_{term})

to become negative in the

\overset{ˇ}{M} NEQT

.

The above catastrophe is the result of an unstable situation that terminates in a catastrophic macrostate

{\overset{ˇ}{M}}_{0}^{cata}

with extreme temperature and energy inhomogeneities and a catastrophe in

S_{0} (t_{term})

but with fixed

E_{0}

. Therefore, the initial macrostate

{\overset{ˇ}{M}}_{0} (0)

must be identified as an unstable NEQ macrostate

{\overset{ˇ}{M}}_{0}^{unst}

, even though

{\overset{ˇ}{M}}_{1} (0)

and

{\overset{ˇ}{M}}_{2} (0)

are both EQ macrostates. This also means that even if both systems start in EQ with each other with

T_{1} (0) = T_{2} (0)

, and

E_{1} (0) = E_{2} (0)

, any fluctuation, no matter how small it is, will drive the system catastrophically towards

{\overset{ˇ}{M}}_{0}^{cata}

. Thus, this EQ macrostate must also be treated as unstable and should be denoted as

{\overset{ˇ}{M}}_{0 eq}^{unst}

.

Claim 19.

The violation leads to a paradoxical situation, in which a thermodynamically stable system will leave an unstable macrostate

{\overset{ˇ}{M}}_{0 eq}^{unst}

merely by a fluctuation, no matter how small it is, or an unstable NEQ macrostate

{\overset{ˇ}{M}}_{0}^{unst}

, and runs towards a catastrophic macrostate

{\overset{ˇ}{M}}_{0}^{cata}

satisfying Equation (224a), and in which the degree of violation measured by

|d_{i} S_{0}^{Q}|

becomes unbounded, as shown in Equation (224c).

The above catastrophe must not be confused with an explosion that happens in a runaway reaction in a stable system in a finite time, which obeys the second law. To conclude, a stable system undergoes an instability to

{\overset{ˇ}{M}}_{0}^{cata}

, causing another internal inconsistency due to the violation.

9.1.2. $E_{l}$ Monotonically Decreases with $T_{l}$

This means that both systems have negative heat capacities, which makes both systems unstable. This also means that

S_{l}

monotonically decreases with

T_{l}

. Thus, as

E_{1}

decreases and

E_{2}

increases,

β_{1} (t)

increases and

β_{2} (t)

decreases so that eventually they become equal (

Δ β (t) \to 0

). The disparities between the two energies and the entropies also vanish so

Σ_{0}

finally approaches a stable EQ macrostate

{\overset{ˇ}{M}}_{0 eq}^{st}

in which

|d_{i} S_{0}^{Q}| \to 0,

(225)

to justify

{\overset{ˇ}{M}}_{0 eq}^{st}

as an EQ macrostate. Once it is there,

Σ_{0}

will never leave

{\overset{ˇ}{M}}_{0 eq}^{st}

. The same is also true if

Σ_{0}

happens in

{\overset{ˇ}{M}}_{0 eq}^{st}

initially. Thus, the two systems come to EQ in time, thus supporting our Axiom 4 but for unstable systems. This is again paradoxical, and causes an internal inconsistency due to the violation. We summarize the conclusion as follows:

Claim 20.

The violation leads to an internal inconsistency, in which an unstable system

Σ_{0}

either does not leave a stable EQ macrostate

{\overset{ˇ}{M}}_{0 eq}^{st}

, or approaches it if not there already so that the irreversible entropy generation vanishes, as seen in Equation (225).

We combine the above two claims in the following:

Conclusion 7.

A thermodynamically stable system will undergo a catastrophe and will end in

{\overset{ˇ}{M}}_{0}^{cata}

, thus forcing us to abandon Axiom 4 or the third law, or a thermodynamically unstable system will end in a stable EQ macrostate

{\overset{ˇ}{M}}_{0 eq}^{st}

. Both alternatives are too unacceptable due to the internal inconsistencies they generate to safely conclude that the violation must be treated as mere curiosity and nothing more.

9.2. Macrowork Exchanges

We recall that macroworks are isentropic quantities, so they are mechanical in nature, as opposed to macroheats above, which are stochastic and are determined by entropy changes. Therefore, we now turn our attention to the irreversible macrowork to see what changes must be allowed in the MNEQT to obtain the

\overset{˚}{M} NEQT

by focusing on its mechanical aspect. According to Theorem 3, this macrowork is related to the irreversible macroheat; see Equation (95). Through this connection, it is indirectly related to

d_{i} S_{0}^{V}

by the following equation:

d_{i} S_{0} = β_{0} d_{i} W_{0},

(226)

obtained from Equation (215), where we have set

β_{1} = β_{2} = β_{0}

for

Σ_{0}

with its energy

E_{0} (N_{0}, V_{0}, T_{0})

fixed so its temperature remains fixed during

\overset{ˇ}{P}

. The irreversible macrowork

d_{i} W_{0}

is given by

d_{i} W_{0} = (P_{1} - P_{2}) d V - (μ_{1} - μ_{2}) d N + \dots + A \cdot d ξ \geq 0;

(227)

compare with Equation (216) above. We have also set

d V = d V_{1}, d N = d N_{1}

, etc. Our goal now is to follow the consequences of each of the contributions on the right side to

d_{i} S_{0}^{V}

as was done in Section 8.4. We first consider the consequences of the pressure work term

d_{i} S_{0}^{V} = β_{0} (P_{1} - P_{2}) d V < 0 .

(228)

We assume

Δ P = P_{2} - P_{1} > 0

, which makes the initial macrostate

{\overset{ˇ}{M}}_{0}

an NEQ macrostate, in which the force exerted by

Σ_{2}

on

Σ_{1}

is stronger than the other way around. Therefore, on purely mechanical grounds, we expect the volume

V_{1}

of

Σ_{1}

to decrease and

V_{2}

of

Σ_{2}

to increase, keeping their sum

V_{0}

fixed. However for

d_{i} S_{0}^{V} < 0

, we require

d V > 0

, which means that

V_{1}

expands, while

V_{2}

shrinks. This contradicts the purely mechanical understanding of forces in general.

Claim 21.

The first consequence of violation on the pressure work is the rejection of mechanics, which becomes the third casualty in the

\overset{˚}{M} NEQT

.

We pursue this further. Consider the volume difference

Δ V ≐ V_{2} - V_{1} = V_{0} - 2 V_{1} .

Because of the sign of

d V

,

Δ V (t)

continues to decrease with time as long as

Δ P (t) > 0

. To proceed further, we need to unravel the behavior of

V_{l}

as a function of

P_{l} = - \partial E_{l} / \partial V_{l}

, and the compressibility

K_{l} ≐ - (\partial V_{l} / \partial P_{l}} / V_{l}

.

9.2.1. $V_{l}$ Monotonically Decreases with $P_{l}$

This case corresponds to

K_{l} > 0

, which is a requirement of the stability. As

V_{1}

increases,

P_{1}

decreases. We also have

P_{2}

increasing so

Δ P (t)

keeps on increasing. Consequently,

Δ V (t)

keeps on decreasing, until finally

V_{1} \to V_{0}, V_{2} \to 0,

along with

P_{2} \to \infty

. Note the similarity with the behavior of

T_{1} (t)

above. We thus conclude that the stable

Σ_{0}

runs away towards a catastrophic macrostate

{\overset{ˇ}{M}}_{0}^{cata}

, which satisfies Equations (224b) and (224c) as seen from Equation (228). This means that

{\overset{ˇ}{M}}_{0}

must be identified as an unstable macrostate

{\overset{ˇ}{M}}_{0}^{unst}

as above, even though

Σ_{0}

is stable because of positive compressibility. As above, we also recognize that even if both systems start in an unstable EQ macrostate

{\overset{ˇ}{M}}_{0 eq}^{unst}

with

P_{1} (0) = P_{2} (0)

, and

V_{1} (0) = V_{2} (0)

at

t = 0

, any fluctuation, no matter how small it is, will drive the system catastrophically towards

{\overset{ˇ}{M}}_{0}^{cata}

. Recalling that

P = T (\partial S / \partial V)

, we have

S (P)

, an increasing function of P for both systems. Thus,

S_{1} (t)

decreases and

S_{2} (t)

increases in time, making their disparity also increase in time. Their sum, however, continues to decrease as a function of t and has the terminal value given in Equation (224c).

The conclusion is that a stable system faces a catastrophic instability to

{\overset{ˇ}{M}}_{0}^{cata}

.

9.2.2. $V_{l}$ Monotonically Increases with $P_{l}$

This case corresponds to

K_{l} < 0

, which makes the system unstable. It follows from this that

V_{2} (0) - V_{1} (0) > 0

. As

V_{1}

increases,

P_{1}

increases, and as

V_{2}

decreases,

P_{2}

decreases. Therefore,

Δ P (t)

decreases so that eventually

P_{1} \to P_{2}

, and

V_{1} \to V_{2}

, which represents a stable EQ macrostate

M_{eq}^{st}

, as is seen clearly from Equation (228), which satisfies Equation (225). This contradicts the unstable nature of the system, which should result in some instability in the system, but it does not happen.

It is easy to verify that each of the other terms in

d_{i} W

gives rise to a similar internal inconsistency.

It should be evident from the above discussion that Conclusion 7 also holds here. As all physical systems form stable systems, we do not need to be concerned with unstable systems.

Remark 58.

It should be noted that there are many examples of thermodynamic instabilities that arise in approximate calculations. A well-known example is the van der Waals equation in which there is a well-defined portion of the equation of state in which compressibility becomes negative. Many examples arise in calculations of the mean field type. In all these cases, the relevant free energies are not globally minimum, so thermodynamics comes to the rescue to allow such portions to be removed from consideration. Exact or rigorous calculations will never result in such instabilities.

10. Microworks, Microheats, and Commutator

To discuss process quantities (see Section 2.9) we need to be extremely careful in distinguishing the order of infinitesimal change operators (denoted by

d_{α}

) and the ensemble average

\hat{A}

(denoted by

〈 〉

) as the two operations do not commute. In other words, for a state quantity

χ ≐ \{S, Z\}

for any body

Σ_{b}

, we will demonstrate that the commutator

{\hat{C}}_{α}

acting on the microquantity

χ

{\hat{C}}_{α} χ ≐ (d_{α} \hat{A} - \hat{A} d_{α}) χ = d_{α} 〈χ〉 - 〈d_{α} χ〉 \neq 0,

(229)

with

\hat{A}

introduced in Definition 7; see also Remark 18. As a result, the microquantities corresponding to

d_{α} 〈χ〉

and

〈d_{α} χ〉

must be carefully distinguished as their difference does not vanish; see Equation (36c). It has been an accepted practice to denote

d_{α} 〈χ〉

by simply

d_{α} χ

as discussed in Section 2.4. As we will show, not recognizing this subtle difference in the orders of the two operations has resulted in some confusion. Thus, in the statistical mechanical formulation, it is useful to not simply use

χ

to denote

〈χ〉

such as E and S for

〈E〉

and

〈S〉

, respectively. While the microquantity associated with

\hat{A} d_{α} χ = 〈d_{α} χ〉

will be denoted by

d_{α} χ_{k}

, so that using Equation (88a), we have

d_{α} χ_{m} ≐ \hat{A} d_{α} χ = 〈d_{α} χ〉 = \sum_{k} p_{k} (d_{α} χ_{k}),

(230)

we will use the notation

{(d_{α} 〈χ〉)}_{k}

or

{(d_{α} \bar{χ})}_{k}

for the microquantity of

d_{α} \hat{A} χ

(see Equation (36c)), so that

d_{α} \hat{A} χ ≐ d_{α} \bar{χ} ≐ d_{α} 〈χ〉 = \sum_{k} p_{k} {(d_{α} \bar{χ})}_{k} .

(231)

The reason for the notation

d_{α} χ_{m}

is given in Definition 23, and is further expanded below. For convenience, we will simply use

d_{α} χ_{k}

and

d_{α} {\bar{χ}}_{k}

without the parentheses for the microquantities, as there cannot be any confusion, since

\bar{χ}

, being an average, has no suffix k, so the latter must be associated with the quantity

d_{α} \bar{χ}

; see also Equation (36c). We now explain this subtle difference and the importance of making a clear distinction between the two microquantities. We will mostly focus on E and S for a body

Σ_{b}

for concreteness so the discussion is valid for any of the three systems.

10.1. Digression on Ensemble Averages

A macroquantity for

Σ_{b}

is an average of microquantities in

M

over all distinct microstates using arbitrary

p_{k}

, as discussed in Section 5. The macroenergy E is the ensemble average

\bar{E}

(see Equation (12)),

E ≐ 〈E〉 = \sum_{k} p_{k} E_{k},

(232)

while

S = \bar{S}

of

S_{k} = - η_{k}

is the ensemble average given in Equation (26a).

There are two kinds of thermodynamic averages we need to consider in thermodynamics. One of them is the instantaneous state-average, such as E and S, of state variable

χ

of a macrostate. From this average, one can construct the differential

d χ

between two neighboring macrostates; the differential does not depend on the path connecting the macrostates. The other one is the process-average of process quantities such as

d W, d Q

, etc., between two neighboring macrostates, but they depend on the path connecting them in an NEQ process

P

; see Definitions 19 and 20.

We first consider

E, S \in χ

and the case of a NFl-

W

. The only fluctuating microquantities are

E_{k}

and

S_{k}

.

The differential

d_{α} E \equiv d_{α} 〈E〉 = \sum_{k} p_{k} d_{α} E_{k} + \sum_{k} E_{k} d_{α} p_{k}

(233)

is a sum to two independent contributions. The term on the left is the first term

d_{α} E

in Equation (229), and the first term on the right in the last equation above is

d E_{m} ≐ 〈d_{α} E〉

in Equation (230). Thus, the second term there is the commutator

{\hat{C}}_{α} E

, and is not zero identically. We also observe that

d E_{m} ≐ \sum_{k} p_{k} d E_{k} = - \sum_{k} p_{k} F_{w k} \cdot d W = 〈d E〉 ≐ - d W;

(234a)

see Equation (17a). The sum is carried out at fixed

\{p_{k}\}

, i.e., at fixed S, so it represents an isentropic contribution to

d E

, i.e., it is a purely mechanical contribution due to microwork

d W_{k}

; see Equation (37a). Thus, we denote this contribution simply as

d E_{m}

(in general

d χ_{m}

) to highlight its mechanical nature; see Equation (92). It is also easy to see that

d_{α} E_{m} = - d_{α} W .

(235)

This is true of any quantity in Equation (230); see also Remark 20.

The second contribution (see Equation (88b)),

d_{α} E_{s} ≐ \sum_{k} E_{k} d_{α} p_{k} = 〈E d_{α} η〉,

(236)

is

{\hat{C}}_{α} E

; see Equation (92). It is a sum involving changes

d_{α} p_{k}

at fixed

E_{k}

. These changes result in the entropy change

d_{α} S

so it is not a mechanical contribution. We will identify it as a purely stochastic contribution involving an entropy change, with the suffix a reminder of its stochastic nature, as discussed in Definition 23. Its microanalog is

d_{α} E_{s, k} ≐ E_{k} d_{α} η_{k} .

(237)

The presence of

d η_{k}

in the microanalog signifies a stochastic average.

We thus have

d_{α} E = d_{α} E_{s} + d_{α} E_{m},

(238)

which should be compared with Equation (91); see Remark 30. Thus,

d_{α} Q \equiv d_{α} E_{s} .

(239)

Using this identity, various microheats can be identified as

d_{α} Q_{k} = d E_{s, k} = E_{k} d_{α} η_{k} .

(240)

The above discussion can be easily extended to any of the state variables in

χ

to justify the existence of the commutator

{\hat{C}}_{α} χ = d_{α} χ_{s}

for any body

Σ_{b}

. We also have

d_{α} χ_{m} = 〈d_{α} χ〉

. For a deterministic system (see Remark 34), for which

p_{k}

’s do not change in any mechanical process,

{\hat{C}}_{α} χ \equiv 0

. Thus, we are able to draw the following important conclusion (see also Remark 59):

Conclusion 8.

The existence of

{\hat{C}}_{α} χ

is related to the existence of stochasticity in a statistical body, and we have

d_{α} χ \equiv d χ_{m} + {\hat{C}}_{α} χ .

(241)

We consider the general relation in Equation (238) for d in place of

d_{α}

. This is simply the first law

d E = d Q - d W,

where we have used Equations (234a) and (239) for the arbitrary macrostate

M

in

S_{Z}, Z = (E, W)

. The entropy for such a state is not a state function and is written as

S (Z, t)

, for which

d S

is given in Equation (138a). While

d W

is defined in the state space

S_{W}

,

d Q = d E + d W

is uniquely defined in a state space

S_{Z^{'}}

requiring additional hidden internal variable

ξ^{'}

, as discussed in Section 5.9. The latter forms a state space orthogonal to

S_{Z}

in

S_{Z^{'}}

.

The entropy becomes uniquely defined as a state function in

S_{Z^{'}}

. However, to discuss the first law, it is convenient to think of E as a state function in the state space

S_{ζ^{'}} ≐ S_{W^{'}} \cup S_{S}, ζ^{'} ≐ (S, W^{'})

to describe

M

as a unique macrostate

M_{ieq}

in

S_{ζ^{'}}

. In this state space, we treat E as a unique function of S and

W^{'}

; here,

S \in S_{S}

is the direction for stochasticity, while

S_{W^{'}}

is the subspace controlling the deterministic mechanical changes.

We now restrict our discussion to

M_{ieq}

in

S_{ζ}, ζ ≐ (S, W)

. As both

d E_{s}

and

d S

are BI-extensive quantities, we expect a linear relationship between them, with the constant of proportionality some BI-intensive quantity T, as established below:

d E_{s} = T d S = T d 〈S 〉 ≐ d Q .

(242)

Comparing with the Clausius equality (see Equation (45)), we find that

T

above is nothing but the thermodynamic temperature in Equation (1) for the body.

Remark 59.

The commutator

{\hat{C}}_{α} E

is the source of micro- and macroheat in a body so it plays an important role in the μNEQT. In a deterministic body, it does not exist.

We now follow the consequence of

{\hat{C}}_{α} E

and the need of distinguishing

d_{α} E_{k}

with

d_{α} {\bar{E}}_{k}

(we use the above notation

d_{α} {\bar{χ}}_{k}

). For this, we identify the microanalog

d_{α} {\bar{E}}_{k}

of

d_{α} E = d_{α} 〈E〉

:

d_{α} {\bar{E}}_{k} = E_{k} d_{α} η_{k} + d_{α} E_{k} ≐ d_{α} Q_{k} - d_{α} W_{k} = d_{α} E_{k} + d_{α} Q_{k},

(243)

which satisfies

d_{α} E = 〈d_{α} \bar{E}〉 ≐ \sum_{k} p_{k} d_{α} {\bar{E}}_{k} .

(244)

We thus see that

d_{α} {\bar{E}}_{k} - d_{α} E_{k} = d_{α} E_{s, k} \equiv d_{α} Q_{k} = {\hat{C}}_{α} χ_{k},

(245)

where

{\hat{C}}_{α} χ_{k}

is the microanalog of

{\hat{C}}_{α} χ = d_{α} Q

. Recall that

d_{α} Q_{k}

is a mixed microquantity (Remarks 11 and 14). This is also evident from the above identity as

d_{α} {\bar{E}}_{k}

is also a mixed microquantity as seen in Equation (243). In contrast,

d E_{k}

is a microquantity. Thus,

d {\bar{E}}_{k}

and

d E_{k}

are distinct. Therefore,

Remark 60.

Care must be exercised in distinguishing

d {\bar{E}}_{k}

and

d E_{k}

, with their difference being the mixed microquantity

d_{α} Q_{k}

, as noted above.

Remark 61.

We should remark that while

d_{α} E_{k}

does not satisfy any first law for

m_{k}

,

d_{α} {\bar{E}}_{k} = d_{α} Q_{k} - d_{α} W_{k}

appears to have an interpretation of a first law for

m_{k}

. But this is a misleading interpretation as

d_{α} {\bar{E}}_{k}

is not a genuine microquantity; see Remarks 60 and 14. While

d_{α} W_{k}

is a microquantity,

d_{α} Q_{k}

is also indirectly controlled by the macrostate

M

. Therefore, Equation (243) should not be taken as a first law for

m_{k}

. In this respect, there is a difference between the μNEQT and the

\overset{˚}{μ}

NEQT as is evident from the work of Sekimoto [140] and Crooks [141], where a first law for each

m_{k}

is proposed without any consideration of

M

.

For completeness, we consider the case of Fl-

W

, for which we have

d_{α} {\bar{W}}_{k} - d_{α} W_{k} = d W_{s, k};

(246)

the right-hand side represents the stochastic contribution in

d {\bar{W}}_{k}

, with

d W_{k}

, the mechanical contribution.

So far, we have not discussed the entropy, which is a stochastic quantity. It follows that

d_{α} S \equiv d 〈S〉 = \sum_{k} p_{k} d_{α} S_{k} + \sum_{k} S_{k} d_{α} p_{k},

(247a)

in which its two contributions

d_{α} S_{s}^{'} ≐ 〈d_{α} S〉 = \sum_{k} p_{k} d_{α} S_{k}, d_{α} S_{s} ≐ 〈S d_{α} η〉 = \sum_{k} S_{k} d_{α} p_{k},

(247b)

are both stochastic in nature; see Equation (86b) for

d_{α} η

. There is no mechanical contribution as in

Z_{k}

. The corresponding microanalogs of the two stochastic contributions are

d_{α} S_{s k}^{'} = - d_{α} η_{k}, d_{α} S_{s k} = - η_{k} d_{α} η_{k} .

As

d_{α} S_{s}^{'} = - \sum_{k} d_{α} p_{k} \equiv 0,

(248)

which follows from Theorem 11, we have

d_{α} S \equiv d_{α} S_{s},

(249)

which is purely stochastic in nature, as it must be. This clearly shows the difference between

d_{α} S

and, for example,

d_{α} E

. From

d_{α} S = - 〈\hat{η} d_{α} η〉

(250)

(see Equation (27c) for

\hat{η}

), we identify the microanalog

d_{α} {\bar{S}}_{k}

of

d_{α} S = d_{α} \bar{S}

d_{α} {\bar{S}}_{k} = - {\hat{η}}_{k} d_{α} η_{k},

(251)

in accordance with Equation (249). This also makes

d_{α} S_{k} = - d_{α} η_{k} = d_{α} S_{s k}^{'}

different from

d_{α} {\bar{S}}_{k}

, just as

d_{α} E_{k}

is different from

d_{α} {\bar{E}}_{k}

. We have for the commutator microanalog

d_{α} {\bar{S}}_{k} - d_{α} S_{k} = d_{α} S_{s k} \neq 0 .

(252)

As

d {\bar{S}}_{k} \neq d S_{k}

, our

μ

NEQT is different from the current microstate approaches to NEQT [145,178,179,190]. The issue has been discussed elsewhere [155], where the relevance of the above distinction between

d S_{s k}^{'}

and

d {\bar{S}}_{k}

is first pointed out. We summarize this as follows:

Conclusion 9.

For Fl-state variables

Z

and S, we have

d_{α} Z = d_{α} \bar{Z} \neq 〈d_{α} Z〉, d_{α} S = d_{α} 〈S〉 \neq 〈d_{α} S〉,

(253)

so care must be exercised in keeping their distinction clear.

The microenergy

E_{k}

changes isentropically as

W

changes without changing

p_{k}

[150]. Accordingly, the generalized microwork

d_{α} W_{k}

does not generate any stochasticity. The latter is brought about by the generalized microheat

d_{α} Q_{k}

, which changes

p_{k}

but without changing

E_{k}

. We summarize these important observations here as the following conclusion:

Conclusion 10.

The change

d_{α} E

for any arbitrary

M

of a body consists of two distinct and independent contributions—an isentropic mechanical change

d_{α} E_{m} = - d_{α} W

, the macrowork, and a stochastic change

d_{α} E_{s} = d_{α} Q

, the macroheat. The entropy change

d_{α} S

is also a purely stochastic change.

We now consider the microscopic analog

d_{α} Q_{k}

of

d_{α} Q

. Naively identifying it using Equation (240) as

d_{α} Q_{k}^{'} ≐ E_{k} d_{α} η_{k}

(254)

does not give a unique microquantity for the following reason. While its average

d_{α} Q

identified by

d_{α} E_{s}

in Equation (242) is uniquely defined,

d Q_{k}^{'}

is not as constant a shift of the origin of

E_{k}, \forall k

, by c it changes

d_{α} Q_{k}

but not

d_{α} Q

, as follows from Equation (268). Therefore, we will uniquely determine

d_{α} Q_{k}

as follows. We first recognizing the Clausius relation in Equation (45) for

d Q

so that

d Q_{k} ≐ T d {\bar{S}}_{k} = - T {\hat{η}}_{k} d η_{k};

(255)

see Equation (251). This justifies Equation (44a). We then determine

d_{e} Q_{k}

by

d_{e} {\tilde{Q}}_{k}

:

d_{e} Q_{k} = - T_{0} {\hat{η}}_{k} d_{e} η_{k} .

(256a)

We now determine

d_{i} Q_{k}

by the difference

d Q_{k} - d_{e} Q_{k}

:

d_{i} Q_{k} = - (T - T_{0}) {\hat{η}}_{k} d η_{k} - T_{0} {\hat{η}}_{k} d_{i} η .

(256b)

This completes the discussion of

d_{α} Q_{k}

.

The generalized heat

d_{α} Q

and

d_{α} Q_{k}

only change

p_{k}

’s, but not

E_{k}

’s. Therefore, the following aspects of the generalized quantities are central in the

μ

NEQT, which we present as three conclusions:

Conclusion 11.

The index k of

m_{k}

is not allowed to change under mechanical work; only

E_{k} (W)

changes. Thus, a purely mechanical approach can be used for microwork. The microwork

d_{α} W_{k}

changes

E_{k}

without changing

p_{k}

. The effect of microheat is to change

p_{k}

but not

E_{k}

so it is microheat that makes a thermodynamic process stochastic by changing

p_{k}

. They occur in two independent state subspaces

S_{W}

and

S_{S}

, respectively, which makes them independent variations; compare with Conclusion 10.

Conclusion 12.

While the microheat

d_{α} Q_{k}

does not change

E_{k}

, it does contribute to the energy change

d_{α} E

through

d_{α} Q = \sum_{k} E_{k} d_{α} p_{k}

as

p_{k}

’s change.

As

d W_{k}

and

d Q_{k}

are independent, any infinitesimal process

δ P

can be treated as a process involving two independent step, a step

δ W

in

S_{W}

and a step

δ S

in

S_{S}

. This makes them independent variations. Their independence is the outcome of using the BI-quantities. This feature is not possible in the

\overset{˚}{M} NEQT

and

\overset{˚}{μ}

NEQT, which shows the superiority of using the BI-quantities. The

μ

NEQT provides a new way to express the macrowork irreversibility of the process [150,151] in terms of the microforce imbalance between external and internal mechanical forces that results in the internal microwork

d_{i} W_{k} = - d_{i} E_{k}

. The internal microwork has no particular sign, even though the corresponding macrowork satisfies the second law and has a particular sign:

d_{i} W = - 〈d_{i} E〉 \geq 0

. Similarly, the macroheat irreversibility is expressed in terms of entropy change

d S

in the probability and is given by

d Q = T d S \geq 0

. We will come back to this issue in Section 13.

10.2. Statistical Significance of $d W$ and $d Q$

Before proceeding further, let us see how the generalized macrowork and macroheat could be understood from a statistical point of view for any arbitrary

M

so that we can identify them using the Hamiltonian. We have already made progress in this direction in the earlier sections so this section basically summarizes this understanding and then extends it a bit. We now prove

Theorem 10.

E (t)

is a state function of

W (t)

and

S (t)

for any

M_{ieq}

in the state space

S_{ζ}

, even though

E_{k} [W (t)]

’s are functions of

W (t)

only.

Proof.

We consider Equation (233) for

d_{α} = d

. As

p_{k} (t)

’s are unchanged in the first sum

d E_{m}

, it is evaluated at constant entropy. It is a function of

W (t)

as is seen clearly in Equation (147). The second contribution is at fixed

E_{k}

’s so

W (t)

is held fixed; see Equation (147). It is the stochastic contribution

d E_{s}

. The changes

\{d_{α} p_{k} (t)\}

result in

d S

. It follows from Equation (242) that

E (t)

is a function of

S (t)

and

W (t)

in general for any

M

. □

The theorem explains why

E (t)

has an additional dependence only on the average S (and not on any complicated functions of

\{p_{k}\}

) in addition to its dependence on

W

for any

M

. The fact is well-known for

M_{eq}

.

We emphasize that the above theorem holds only for

M_{ieq}

for which

W

is the complete set of work parameters. If the set is not complete, we are dealing with an arbitrary macrostate

M

. In this case, we need some hidden internal variable

ξ^{'}

as discussed in Section 5.9 to convert

M

into

M_{ieq}

in a more extended state space

S_{ζ^{'}}

. The hidden internal variable will provide an explicit time dependence in E, which we now write as

E (S, W, t)

. The explicit time dependence gives an additional contribution

d_{i} W^{hid}

given in Equation (139b).

The linear proportionality in Equation (242) between

d Q = d E_{s}

and

d S

for

M

results in

d Q (t) / d S (t) = T_{arb}^{alt} (t),

(257)

see Equation (140b), which extends the statistical proof of the identity in Equation (45) relating

d Q (t)

and

d S (t)

for

M

. We also note that the ratio

T_{arb}^{alt} (t)

is related to the ratio of two SI-macroquantities. Thus, it can be used to characterize the instantaneous macrostate

M

, although not uniquely, as it depends on

d_{i} W^{hid}

; see Equation (139b). This should be contrasted with the

\overset{˚}{M} NEQT

, in which the ratio

d_{e} Q (t) / d_{e} S (t) = T_{0}

(258)

does not characterize the instantaneous macrostate

M

. For

M_{ieq}

,

T_{arb}^{alt} (t)

reduces to

T (t)

.

We should point out that, with

W (t)

as a NFl-parameter,

d W (t)

is the same for all microstates. The statistical nature of

d E_{m}

is reflected in the statistical nature of

F_{w} (t)

,such as

P_{k} (t)

and

A_{k} (t)

, of the body. Thus, the BI-fields

F_{w k} (t)

are fluctuating quantities from microstate to microstate, as expected in any averaging process.

The above discussion proves that the definition of macroheat and macrowork in terms of

d E_{s}

and

d E_{m}

, respectively, is valid for any

M

. But the relationship of

d Q

with

d S

works only for a

M_{ieq}

. It is useful to compare the above approach with the traditional formulation of the first law in terms of

d_{e} Q (t)

and

d_{e} W (t)

: both formulations are valid in all cases. It should be mentioned that the former identification is well-known in equilibrium statistical mechanics, but its extension to irreversible processes and our interpretation are, to the best of our knowledge, novel. While the instantaneous average

F_{w} (t)

, such as the pressure

P (t)

, is mechanically defined under all circumstances, it will only be identified with the thermodynamic definition of the instantaneous pressure

P (t) = - (\partial E / \partial V)

(259)

for a uniquely identified macrostate

M_{ieq}

in

S_{Z}

.

It follows from Conclusion 10 that

d E

consists of two independent and unique contributions—an isentropic mechanical change

d E_{m} = - d W

, and an stochastic change

d E_{s} = d Q

. On the other hand, the MI-macroheat and the MI-macrowork suffer from ambiguities; see, for example, Kestin [42]. The independent partition of

d E

for an arbitrary macrostate

M

plays a central role in developing our NEQ statistical mechanics. Therefore, if we focus on the state space

S_{ζ}

to describe

M

, we must treat E in general as a nonstate function of

S, W

and t. Let us focus on a thermodynamic process

P

in

S_{ζ}

between two IEQ macrostates

M_{ieq}^{(in)}

and

M_{ieq}^{(fin)}

. If all intermediate macrostates

{\{M\}}_{int}

in

P

remain IEQ macrostates in

S_{ζ}

, we denote this

P

by

P_{ieq}

. This is like following a reversible process between two EQ macrostates. In this case,

E (ζ)

has no explicit time dependence as noted above. If some of the intermediate

M_{int}

do not remain in

S_{ζ}

, we need to consider

E (ζ, t)

with an explicit time dependence. This is like following an irreversible process between two EQ macrostates. Let

S_{ζ}^{\max}

denote the largest state space required in which all the macrostates in

M_{ieq}^{(in)}, {\{M\}}_{int}

, and

M_{ieq}^{(fin)}

can be treated as in IEQ (some may have some of the affinities vanishing so the corresponding internal variables become equilibrated; see Section 12.1 for detail). We denote such a

P

by

P_{nieq}

.

The choice of

P

is governed by how far

P_{nieq}

is from

P_{ieq}

. The farther it is, the larger

S_{ζ}^{\max}

is relative to

S_{ζ}

. We put no restriction on their choices for how to do the computation. We will simply use

P

to denote both processes and

S_{ζ}

for both state spaces in the following. In this state space

S_{ζ}

,

d E_{m}

and

d E_{s}

are variations in its orthogonal subspaces

S_{W}

and

S_{S}

, respectively, between two neighboring IEQ macrostates

M_{ieq}^{'}

and

M_{ieq}^{″}

along

P

. This makes the determination of

d E

convenient in any infinitesimal process

δ P \in P

by breaking it into two parts

δ P_{m} \in S_{W}

and

δ P_{s} \in S_{S}

between

M_{ieq}^{'}

and

M_{ieq}^{″}

, with only

\{E_{k}\}

changing along

δ P_{m}

and only

\{p_{k}\}

changing along

δ P_{s}

. We take the changed parameters and probabilities of

M_{ieq}^{^{″}}

to use for the next

δ P

between

M_{ieq}^{″}

and

M_{ieq}^{‴}

, and so on.

Remark 62.

It is clear from the above discussion that it is the macroheat and not the macrowork that causes

p_{k} (t)

, and therefore the entropy, to change. This is the essence of the common wisdom that heat is random motion. But we now have a mathematical definition: Macroheat is the isometric part

d E_{s} (t)

that is directly related to the change in the entropy through changes in

p_{k} (t)

. Macrowork is that part of the energy change

d E_{m} (t)

caused by isentropic variations in the “mechanical" state variables

W (t)

. This is true no matter how far the body is from equilibrium or the internal equilibrium process. Thus, our formulation of the first law and the identification of the two terms is the most general one, and applicable to any

M

by identifying

S_{ζ}^{\max}

.

Remark 63.

The relationship between the macroheat and the entropy becomes simple only when

M

happens to be in internal equilibrium (see Section 5.7), in which case

T_{arb}^{alt} (t)

(see Equation (140b)) is replaced by

T (t)

, which has a thermodynamic significance (see Equations (24) and (129)), and we have the thermodynamic identity, called the Clausius Equality in Equation (45)

d Q (t) = T (t) d S (t)

for

M_{ieq}

, which is very interesting in that it turns the well-known Clausius inequality

d_{e} Q = T_{0} d_{e} S \leq

T_{0} d S

into an equality.

For the sake of completeness, we briefly discuss the various attempts at the study of the microanalogs

d W_{k}

and

d Q_{k}

of the

d W

and

d Q

, respectively, which has flourished into an active field in diverse branches of NEQT at diverse length scales, from mesoscopic to macroscopic lengths [99,135,136,137,138,139,140,141,142,143,144,145,146,147]; see also some recent reviews [178,179,190]. Unfortunately, this endeavor is apparently far from complete [42,99,103,104,105,106,107,135,136,137,138,139,140,141,142,143,144,145,146,147,156,178,179,180,181,182,183,184,185,186,187,188,189,190,191,192,193,208,209,210,211,212,213,214,215,216,217,218]. This is because of the confusion about the meaning of macrowork and macroheat even in classical NEQT [39,42] involving SI- or MI- description, which has only recently been clarified [75,76,134,148,149,150,151,152,153,154,156,157] in the MNEQT, where a clear distinction is made between

d W

(

d Q

) and

d_{e} W

(

d_{e} Q

). In an EQ process, both macroworks (macroheats) have the same magnitude, but not in an NEQ process, where the difference determines

d_{i} W \geq 0

(

d_{i} Q \geq 0

).

10.3. Medium $\tilde{Σ}$

The above discussion can be easily extended to the medium (the suffix

\tilde{k}

denotes its microstates) with the following results:

\begin{matrix} d \tilde{W} (t) & = - d {\tilde{E}}_{m} \equiv - \sum_{\tilde{k}} {\tilde{p}}_{\tilde{k}} \frac{\partial {\tilde{E}}_{\tilde{k}}}{\partial \tilde{w}} \cdot d \tilde{w} \\ = {\tilde{f}}_{w} \cdot d_{e} w = - d_{e} W, \\ d \tilde{Q} (t) & = d {\tilde{E}}_{s} \equiv \sum_{\tilde{k}} {\tilde{E}}_{\tilde{k}} d {\tilde{p}}_{\tilde{k}} = - d_{e} Q, \end{matrix}

(260)

where all the quantities including

\tilde{k}

refer to the medium, except

d_{e} W

and

d_{e} Q

, and have their standard meaning. Here, we have used Equation (72b) for

d \tilde{W} = d_{e} \tilde{W}

. The analog of Equation (257) is

d \tilde{Q /} d \tilde{S} = T_{0}

as expected; see Equation (258). We clearly see that

d W_{0} ≐ d W + d \tilde{W} = d_{i} W \geq 0 .

(261a)

We also have

d Q_{0} ≐ d Q + d \tilde{Q} = d_{i} Q \geq 0,

(261b)

with

d W_{0} = d Q_{0}

in view of Equation (95). We can also express

d_{i} W (t)

and

d_{i} Q (t)

as follows:

d_{i} W (t) \equiv - (d E_{m} + d {\tilde{E}}_{m}), d_{i} Q (t) \equiv (d E_{s} + d {\tilde{E}}_{s}) .

(262)

In a finite process

P

, all infinitesimal quantities are replaced by their net changes

Δ W_{0} ≐ Δ W + Δ \tilde{W} = Δ Q_{0} = Δ_{i} W \geq 0,

(263)

where

Δ_{i} W

is obtained by integrating

d_{i} W

in Equation (75) over

P

; see Equation (303a).

11. External and Internal Variations of ${dp}_{k} (t)$

We now introduce the concept of

d_{α} p_{k}

, which we will focus on in this section. We recall the number

N

of replicas and its partition

\{N_{k}\}

that were introduced in Section 5.3 and Section 5.5. We partition the change

d N_{k}

in accordance with the micropartition rule; see Definition 22. We take

N

to be fixed. In a given process

P

,

N_{k}

is the change without altering

N

. We denote these changes by

d N_{k}

, and define the change in the probabilities by

d p_{k} ≐ d N_{k} / N, N \to \infty .

This ensures that

\sum_{k} d p_{k} = 0,

(264)

as the total probability is conserved. We apply

d_{α}

on

N_{k}

and

p_{k}

with the result

\begin{matrix} d N_{k} & = d_{e} N_{k} + d_{i} N_{k}, \end{matrix}

(265)

\begin{matrix} d p_{k} & = d_{e} p_{k} + d_{i} p_{k}, \end{matrix}

(266)

where

d_{α} p_{k} ≐ d_{α} N_{k} / N

. As usual,

d_{e} p_{k}

is the change due to exchanges with the medium and

d_{i} p_{k}

the change due to internal processes.

It immediately follows from Equation (240) that

d_{e} Q (t) \equiv \sum_{k} E_{k} d_{e} p_{k} (t), d_{i} Q (t) \equiv \sum_{k} E_{k} d_{i} p_{k} (t) .

(267)

Theorem 11.

For any body,

\sum_{k} d_{α} p_{k} (t) = 0, \forall α,

(268)

which puts a limitation on the possible variations

d_{α} p_{k}

.

Proof.

As

d_{α} Q (t)

are thermodynamic quantities, they must not change their values if we change

E_{k}

by adding a constant to

H

. This requires Equation (268) to hold. □

Proof of $d_{i} E = 0$ Even If $d_{i} E_{k}$ ’s Are Not

Using Equation (15) in Equation (233), we have

\begin{matrix} d_{i} E & ≐ \sum_{k} p_{k} d_{i} E_{k} + \sum_{k} E_{k} d_{i} p_{k} = 0, \end{matrix}

(269a)

\begin{matrix} d E & = d_{e} E ≐ \sum_{k} p_{k} d_{e} E_{k} + \sum_{k} E_{k} d_{e} p_{k}, \end{matrix}

(269b)

where we have used the identity

d_{i} W = d_{i} Q

from Equation (95) in the top equation to show consistency of the above approach with the important identity in Equation (96); the first term here represents

(- d_{i} W)

and the second term stands for

d_{i} Q

.

Claim 22.

The most important conclusion of our approach is to establish that even if

d_{i} E_{k} \neq 0

,

d_{i} E = 0

as is well-known; see Equation (53a). This is consistent with

\{d_{i} E_{k}\}

being the outcome of the random variable

d_{i} E

, just as

\{d E_{k}\}

is the outcome of the random variable

d E

. In contrast,

\{d_{e} E_{k}\} = d_{e} E

is constant.

As Equation (269b) reproduces Equation (94), our approach is consistent with the MNEQT.

Even if

d_{i} E_{k} \neq 0

,

d_{i} E = 0

; thus, E cannot change by internal processes as is well-known. The second equation gives the conventional form of the first law in terms of the exchange quantities:

d E = d_{e} E \equiv d_{e} Q - d_{e} W

.

12. Extended State Space, $M_{ieq}$ and $M_{nieq}$

This section forms the central core of the review as it deals with identifying the state space

S_{Z}

based on the experimental setup in which the macrostate is uniquely described in terms of

Z

. This uniqueness of

M_{ieq}

then immediately leads to the unique microstate probabilities

\{p_{k}^{ieq}\}

without any additional requirement or approximation. It is this aspect of uniqueness that distinguishes the

μ

NEQT from other contemporary attempts in the

\overset{˚}{μ}

NEQT, where the determination of

\{p_{k}\}

requires additional ingredients such as the Fokker–Planck equation or the Markov process.

12.1. Choice of $Z$ for $M_{ieq}$ in $S_{Z}$

We come to the very important issue of identifying

S_{Z}

in a given experimental setup. We have recently reported this in [160]. But because of its importance and to provide continuity, we briefly revisit this issue in this section.

We will see from Equations (275) and (286) that the statistical mechanics will be different in the two approaches depending on the choice of the parameter:

W

vs.

F_{w}

. The former has fluctuations in the microforces

F_{w k}

, while the latter has fluctuations in

W_{k}

, as we have already discussed. For the moment, we consider the NFl-

W

as the parameter on which to focus our attention.

We now discuss how to choose a particular state space for a unique description of a macrostate

M_{neq}

depending on the experimental setup. To understand the procedure for this, we begin by considering a set

ξ_{n}

of internal variables

(ξ_{1}, ξ_{2}, \dots, ξ_{n})

and

Z_{n} ≐ X \cup ξ_{n}

to form a sequence of state spaces

S_{Z}^{(n)}

. In general, one may need many internal variables, with the value of n increasing as

M_{neq}

is more and more out of EQ [160] relative to

M_{eq}

. We will take

n^{*}

to be the maximum n in this study, as discussed in Section 4, even though n

< < n^{*}

, needed for

S_{Z}^{(n)}

, will usually be a small number in most cases, which is determined by the experimental setup. The two most important but distinct time scales are

τ_{obs}

, the time to make observations, and

τ_{eq}

, the equilibration time for a macrostate

M_{neq}

to turn into

M_{eq}

. For

τ_{obs} < τ_{eq}

, the system will be in an NEQ macrostate. Let

τ_{i}

denote the relaxation time of

ξ_{i}

needed to come to its equilibrium value so that its affinity

A_{i} \to 0

[12,13,51,160,169,170,171,172,173]. For convenience, we order

ξ_{i}

so that

τ_{1} > τ_{2} > \dots;

(270a)

we assume distinct

τ_{i}

’s for simplicity without affecting our conclusions. For

τ_{1} < τ_{obs}

, all internal variables have equilibrated so they play no role in equilibration, except thermodynamic forces

T - T_{0}, P - P_{0}

, etc., associated with

X

that still drive the system towards EQ. We introduce the relaxation window

Δ_{n} τ

satisfying

Δ_{n} τ ≐ τ_{n} > τ_{obs} > τ_{n + 1}

(270b)

to identify n so that all of

ξ_{1}, ξ_{2}, \dots, ξ_{n}

have not equilibrated (their affinities are nonzero). They play an important role in the NEQT, while

ξ_{n + 1}, ξ_{n + 2}, \dots

need not be considered as they have all equilibrated. This specifies

M_{neq}

uniquely in

S_{Z}^{(n)}

, which was earlier identified as in IEQ.

Note that NEQ macrostates with

τ_{n + 1} > τ_{obs} > τ_{n + 2}

are not uniquely identifiable in

S_{Z}^{(n)}

, even though they are uniquely identifiable in

S_{Z}^{(n + 1)}

. Thus, there are many NEQ macrostates that are not unique in

S_{Z}^{(n)}

. The unique macrostates

M_{ieq}

are special in that its Gibbs entropy

S (Z_{n})

is a state function of

Z_{n}

in

S_{Z}^{(n)}

. Thus, given

τ_{obs}

, we look for the window

τ_{n} > τ_{obs} > τ_{n + 1}

to choose the particular value of n. This then determines

S_{Z}^{(n)}

in which the macrostates are in IEQ. From now onward, we assume that n has been found and

S_{Z}^{(n)}

has been identified. We now suppress n and simply use

S_{Z}

below.

Remark 64.

The linear sizes of various subsystems introduced in Section 5.6 must be larger than the correlation length

λ_{corr}

as discussed elsewhere [148] for the first time. In addition, quasi-independence discussed in Section 7.3 is required to ensure entropy additivity. Therefore, it is usually sufficient to take the linear size of Σ to be a small multiple (for example, 10 to 20) of the correlation length to obtain a proper thermodynamics, which is extensive. This means that we will usually need a theoretically manageable but small number of internal variables n that is controlled by the experimental setup.

Remark 65.

The most direct way to determine n is to begin with a model to describe nonuniformity of a system, determining the number of required internal variables, as described in Section 4, to determine

S_{Z}^{(n)}

. The consequences of the resulting thermodynamics of

M_{ieq}

should be compared with what is observed in experiments performed on the system to verify if the model is appropriate to describe the experiment. This trial and error method is the price that we pay to study NEQ macrostates, of which there are many in

S_{X}

. A simple example of such a modeling is considered in Section 16.

12.2. Microstate Probabilities for $M_{ieq}$ : NFl- $W$

The time dependence in some or all components in

W

during

P

gives rise to time dependence in the Hamiltonian

H (x| W)

; the dynamical variable

x

plays no role as we show in Equations (146) and (148). The time dependence of

W

gives rise to time dependence in

E_{k} (W)

; we will usually suppress the

W

-dependence unless necessary for clarity. The microstate

m_{k}

appears with probability

p_{k}

in the statistical ensemble. The set

\{p_{k}\}

determines the stochasticity in the ensemble. Accordingly, it determines the nature of the macrostate (EQ vs. NEQ) but the sets

\{E_{k}\}

and

\{m_{k}\}

are independent of

\{p_{k}\}

, as they are deterministic.

We take

Σ

to be in an internal equilibrium with temperature T and macroforce

F_{w}

. As

η_{k}

is extensive, it must be a linear combination of extensive quantities specifying

m_{k}

; they are

E_{k}

and

F_{w k}

. Therefore, we express

η_{k}

as

η_{k} = a + b E_{k} + c_{w} \cdot F_{w k},

(271)

where

a, b

and

c_{w}

are unknown quantities that have to be determined by probability normalization and evaluating S using Equation (26a), and comparing with

d S

in Equation (128a) in the MNEQT. Another way to determine

p_{k}

is to use the Lagrange multiplier technique to maximize the entropy in Equation (26a) under the constraints.

As

M_{ieq}

is unique in

S_{Z}

, we need to identify the unique set

\{p_{k}\}

. We recall that

Z = (E, W)

is used in identifying

S_{Z}

, and

W

appears as a parameter in the Hamiltonian

H (W)

so it also appears as a parameter in

M_{ieq} (Z)

. As a consequence (see Definition 12),

F_{w k}

are fluctuating microforces in

S_{Z}

. In addition, we have microstate energies

E_{k}

always fluctuating, with the corresponding “macroforce” inverse temperature

β

fixed. We need to maximize the entropy

S (Z)

([33,175], for example) at fixed

1 = \sum_{k} p_{k}, E = \sum_{k} p_{k} E_{k}, F_{w} = \sum_{k} p_{k} F_{w k}

(272)

by varying

p_{k}

. We now give two different methods to determine the unique entropy.

12.3. Lagrange Multiplier Method: NFl $W$

Using the well-known Lagrange multiplier technique [33], it is easy to show that the conditions in Equation (272) require three Lagrange multipliers

λ_{1}, λ_{2}

, and

λ_{3}

to yield

η_{k} = λ_{1} + λ_{2} E_{k} + λ_{3} \cdot F_{w k} .

(273)

Also, compare with Equation (271), from which follows

E_{k} (W) = (1 / λ_{2}) (- λ_{1} + η_{k} - λ_{3} \cdot F_{w k}) .

Recalling the definition of

F_{w k}

from Equation (149), we identify

λ_{3} / λ_{2} \equiv W

. Taking the ensemble average of

E_{k}

, we find

E (S, W) = (1 / λ_{2}) (- λ_{1} - S - λ_{2} W \cdot F_{w}),

as a function of S and

W

so that we finally identify

λ_{2} = - β, λ_{3} = - β W

. Thus,

S = - λ_{1} + β E + β W \cdot F_{w},

(274a)

and

d S = β d E + β F_{w} \cdot d W,

(274b)

as we vary E and

W

. We finally have

p_{k}^{ieq} (β, E_{k}, F_{w k}, W) = exp [β (G_{Z}^{ieq} - E_{k} - W \cdot F_{w k})],

(275)

where

β G_{Z}^{ieq} ≐

λ_{1}

is easily identified by taking the average of Equation (273) and using Equation (26a). We thus see that the thermodynamic potential

G_{Z}^{ieq}

is a BI-potential given by

G_{Z}^{ieq} (F) = G_{Z}^{ieq} (T, F_{w}) = E^{L} (S, F_{w}) - T S,

(276)

where

E^{L} (S, F_{w})

is the SI-Legendre transform of

E (S, W)

introduced in Equation (158); see Section 6.3. We also have

\forall k, G_{Z}^{ieq} (F) \equiv G_{Z k}^{ieq} ≐ E_{k}^{L} (F_{w k}) - T S_{k},

(277)

where

G_{Z k}^{ieq}

is a micropotential corresponding to

G_{Z}^{ieq} (F)

but does not fluctuate over

\{m_{k}\}

, and

E_{k}^{L} (F_{w k})

is introduced in Equation (160); see also Equation (161). We now see that

p_{k}^{ieq} (β, F_{w k}, E_{k}, W) \equiv exp (- S_{k}),

(278)

which is consistent with the general definition of

S_{k}

; see Equation (27a).

We emphasize that the SI-potential

G_{Z}^{ieq} (F)

is not a function of S and

W

as is easily checked since

d G_{Z}^{ieq} = - S d T + W \cdot d F_{w}

. Moreover, it is determined by the ensemble so it is not a microquantity; see Remark 14. From Equation (277), we find that

Δ_{k} S ≐ S_{k} - S = β (E_{k}^{L} - E^{L})

(279)

is fluctuating over

\{m_{k}\}

; here, we have introduced the notation

Δ_{k} χ ≐ χ_{k} - \bar{χ}

(280)

that describes the fluctuation of

χ

over

\{m_{k}\}

about its average

\bar{χ}

; note the difference in the definition with thermodynamic forces in Equation (76a) that refer to the deviation from the fields of the medium. We see that the fluctuation

Δ_{k} S

has two components. The first part

Δ_{k} S^{E}

is related to microenergy fluctuation

Δ_{k} E_{k}

, and the second one

Δ_{k} S^{F_{w}}

is related to microforce fluctuation

Δ_{k} F_{w}

. We see that there is a very close similarity with the first law and the above result, which we rewrite as

Δ_{k} E^{L} \equiv T Δ_{k} S,

(281)

except that this law relates to fluctuations over microstates and not any transfer.

If we neglect the fluctuations

Δ_{k} E

and

Δ_{k} F_{w}

or replacing

E_{k}

by E and

F_{w k}

by

F_{w}

by considering only those microstates with

E_{k} =

E and

F_{w k} = F_{w}

, then

p_{k}^{ieq}

reduces to the flat distribution

p_{k}^{ieq, ep} = \frac{1}{W (Z)} = exp [β (G_{Z}^{ieq} - E^{L})] = exp (- S);

(282)

see Remark 54, which can be identified as the microstate probability in the NEQ microcanonical ensemble in which

Δ_{k} S = 0

. It should be stressed that this is consistent with the well-known fact that EQ thermodynamics does not describe fluctuations; the latter require statistical mechanics [33]. The same also holds for

M_{ieq} (Z)

and is captured by

p_{k}^{ieq, ep}

above.

The normalization constant

G_{Z}^{ieq} (F)

defines an NEQ partition function

Z_{Z}^{ieq} (F) ≐ exp (- β G_{Z}^{ieq}) \equiv \sum_{k} exp (- β E_{k}^{L}) .

(283)

It should be remarked that the Lagrange multipliers in

p_{k}^{ieq}

are determined by comparing the resulting entropy to match exactly the Gibbs fundamental relation, a thermodynamic relation. This then proves that the statistical entropy is the same as the thermodynamic entropy S up to a constant [76], which can be fixed by appealing to the third law, according to which S vanishes or takes a universal constant at absolute zero; see also [160,176]. The

p_{k}^{ieq}

above clearly shows the effect of irreversibility and is very different from its equilibrium analog

p_{k}^{eq}

p_{k}^{eq} = exp [β_{0} (G_{X} (T_{0}, f_{0 w}) - E_{k} - w \cdot f_{0 w k})];

see Equation (40), obtained by replacing

W

by

w

,

F_{w k}

by

f_{0 w k}

, and

β

by

β_{0}

associated with the medium

\tilde{Σ}

. The fluctuating

E_{k}, f_{0 w k}

satisfy

E = \sum_{k} E_{k} p_{k}^{eq}, f_{0 w} = \sum_{k} f_{0 w k} p_{k}^{eq} .

The observation time

τ_{obs}

is determined by the way T and

W

are changed during a process. Thus, during each change,

τ_{obs}

must be compared with the time needed for

Σ

to come to the next IEQ macrostate, and for the microstate probabilities to be given by Equation (275) with the new values of T and

W

.

12.4. Extensivity Method

We now introduce the second method. For this, we make an important point about the condition extensivity imposes on

η_{k} = ln p_{k}^{ieq}

; see Remark 14. This requirement requires that

η_{k}

must be a linear combination of SI extensive quantities, which in the present case can be written as

λ_{1} + λ_{2} E_{k} + λ_{3} \cdot F_{w k},

(284)

in addition to an extra dependence on the macrostate, which in the present case is

λ_{1}

to be determined by the normalization required by Equation (13). This is nothing but the expression in Equation (271). We now determine

λ_{1}, λ_{2}

, and

λ_{3}

by determining S using Equation (284), from which we obtain

d S

, which is then compared to

d S

in Equation (274b) to determine them. The result is exactly what we found above. Thus, the extensive method is quite useful in obtaining

p_{k}^{ieq}

directly in

S_{Z}

.

12.5. Fluctuating $\{W_{k}\}$

We now obtain

p_{k}^{ieq}

when

F_{w}

is used as a NFl parameter instead of

W

to describe the same macrostate

M_{ieq}

. In this case,

\{W_{k}\}

are fluctuating work variables in

S_{Z}

. We can use the customary Lagrange multiplier technique to maximize the entropy

S (Z)

at fixed

E = \sum_{k} p_{k} E_{k}, W = \sum_{k} p_{k} W_{k},

or use the following linear combination of extensive SI macroquantities

η_{k} = ρ_{1} + ρ_{2} E_{k} + ρ_{3} \cdot W_{k},

and apply the aforementioned extensivity argument. Either way, we obtain

E_{k} (W_{k}) = (1 / ρ_{2}) (- ρ_{1} + η_{k} - ρ_{3} \cdot W_{k}) .

From Equation (18), we identify definition

ρ_{3} / ρ_{2} \equiv F_{w}

. Taking the ensemble average of

E_{k}

, we find

E (S, W) = (1 / ρ_{2}) (- ρ_{1} - S - ρ_{2} W \cdot F_{w}),

as a function of S and

W

so that we finally identify

ρ_{2} = - β, ρ_{3} = - β W

. Thus,

S (E, W) = - ρ_{1} + β E + β W \cdot F_{w},

showing that

ρ_{1} = β G_{Z}

with

G_{Z}

as a normalization constant, which defines a different representation of the NEQ partition function

Z_{Z}^{ieq} ≐ exp (- β G_{Z}^{ieq}) \equiv \sum_{k} exp (- β E_{k}^{L} (F_{w})),

(285)

with

G_{Z}^{ieq}

given in Equation (276), and

E_{k}^{L} (F_{w})

given in Equation (153). It should come as no surprise that the state functions

E (S, W), S (E, W)

, and

G_{Z}^{ieq} (F)

have the same form whether

W

is treated as a parameter or

F_{w}

. The existence of

E (S, W)

or

S (E, W)

for

M_{ieq} (Z)

can now be used to obtain the MNEQT for

M_{ieq} (Z)

.

We finally have

p_{k}^{ieq} (β, E_{k}, F_{w}, W_{k}) = exp [β (G_{Z}^{ieq} (F) - E_{k}^{L} (F_{w}))],

(286)

which shows that these probabilities are different because the microstates are now

m_{k} (W_{k})

, while they were

m_{k}

in Equation (275). In both cases, we have the same macrostate

M_{ieq} (Z)

, and the same entropy; they possess the same MNEQT, but the

μ

NEQT are different for both cases.

Remark 66.

Comparing the forms of

p_{k}^{ieq} (β, F_{w k})

and

p_{k}^{ieq} (β, W_{k})

above, we see that they differ only in the presence of

F_{w k}

(parameter

W

) in the former and

W_{k}

(parameter

F_{w}

) in the latter. Therefore, all we have to do to construct the latter from the former is to simply replace

W \cdot F_{w k}

by

W_{k} \cdot F_{w}

, i.e., to remove the suffix k from

F_{w}

to

W

. In other words, we interchange the fluctuating form (

F_{w k}

) with its nonfluctuating form (

F_{w}

), and vice versa (nonfluctuating form

W

by its fluctuating form

W_{k}

).

Let us now consider a slight modification of the case above for

M_{ieq} (E, W)

, which we now describe. Let us divide

W

into two disjoint groups

W^{NF}

and

W^{F} :

W ≐ W^{NF} \cup W^{F} .

(287)

We now use

W^{NF}

as the parameter, but use

F_{w}^{F}

corresponding to

W^{F}

as another parameter.

Remark 67.

The last statement in Remark 66 also applies to the above modified case in Equation (287), where

W \cdot F_{w k}

in

p_{k}^{ieq} (β, F_{w k})

is replaced by

W^{NF} \cdot F_{w k} + W^{F} \cdot F_{w k}

so that

p_{k}^{ieq} (β, W^{NF}, W^{F}) = exp [β (G_{Z}^{ieq} (F) - E_{k}^{L} (F_{w k}^{NF}, F_{w}^{F}))],

(288)

with

E_{k}^{L} (F_{w k}^{NF}, F_{w}^{F}) ≐ E_{k} + W^{NF} \cdot F_{w k}^{NF} + W^{F} \cdot F_{w}^{F},

(289)

which can be used to define the corresponding NEQ partition function

Z_{Z}^{ieq} ≐ \sum_{k} exp (- β E_{k}^{L} (F_{w k}^{NF}, F_{w}^{F})) .

(290)

Remark 68.

Because of the above two remarks, we will now focus mostly on using NFl-

W

as the parameter in the rest of the review. Changing some of the work parameters to fluctuate can be simply obtained by the results as described above.

12.6. $M_{nieq} (Z)$ and Its Microstate Probabilities

We now focus on a non-unique macrostate

M_{nieq} (S_{Z})

in

S_{Z}

. We will have to confront such macrostates if

τ_{obs}

is reduced to make the process faster so that instead of falling in the window (

τ_{n}, τ_{n + 1}

), it now falls in a higher window such as (

τ_{n + 1}, τ_{n + 2}

). As said above,

M

can now be treated as a unique macrostate in a larger state space

S_{Z^{'}} \supset S_{Z}

, where

n^{'} > n

is the number of internal variables in

S_{Z^{'}}

. Let

ξ^{'} (t)

denote the set of additional internal variables needed over

S_{Z}

so that

Z^{'} (t) = (Z (t), ξ^{'} (t)) .

(291)

The relaxation times of the internal variables in

ξ^{'}

are arranged as in Equation (270a):

τ_{n + 1} > τ_{n + 2} > \dots > τ_{n^{'}} > τ_{obs};

(292)

We can now treat the above

M_{nieq} (S_{Z})

as an IEQ macrostate

M_{ieq} (S_{Z^{'}})

in

S_{Z^{'}} = S_{Z} \cup S_{ξ^{'}}

. In the latter state space, we will have additional macroforce

F_{w k}^{'} ≐ A_{k}^{'}

, the microaffinity associated with

ξ^{'}

in

S_{ξ^{'}}

. Thus, we can always find the state space by identifying the window (

τ_{n^{'}}, τ_{n^{'} + 1}

) in which

τ_{obs}

falls.

Recognizing

M_{nieq} (Z)

becomes

M_{ieq} (Z^{'})

in

S_{Z^{'}}

, we have from Equation (291)

p_{k}^{ieq} (β, E_{k}^{L}, F_{w k}, W, A_{k}^{'}, ξ^{'}) = exp [β (G_{Z^{'}}^{ieq} - E_{k}^{' L} (F_{w k}, A_{k}^{'}))],

(293a)

where

E_{k}^{' L} (F_{w k}, A_{w k}^{'}) ≐ E_{k}^{L} (F_{w k}) + ξ^{'} \cdot A_{k}^{'},

(293b)

in which we have separated out the term, which is the contribution from

S_{ξ^{'}}

, from the Legendre-transformed microenergy in

S_{Z}

. The situation now is no different than that of

M_{ieq} (Z)

studied above, except that

Z

and

S_{Z}

must be replaced by

Z^{'}

and

S_{Z^{'}}

, respectively. This explains the similarity in the form of

p_{k}^{ieq}

in Equation (293a) with that in Equation (275), except that

G_{Z^{'}}^{ieq}

is different from

G_{Z}^{ieq}

.

Let us now pursue what happens to the above

p_{k}^{ieq}

when we wish to describe

M_{ieq} (Z^{'})

in

S_{Z}

, where

ξ^{'}

does not exists. Then,

ξ^{'} \cdot A_{w k}^{'}

, which is defined in orthogonal subspace

S_{ξ^{'}}

, cannot be treated as a function of

Z

. Thus, it must only be considered as an explicit function of t in

S_{Z}

so we introduce a new function

Φ_{k}^{'} (t) ≐ ξ^{'} \cdot A_{k}^{'} .

(294)

In terms of this function, the microstate probability of

M_{nieq} (Z)

becomes

p_{k}^{nieq} (β, F_{w k}, E_{k}^{L}, W, t) = exp [β (G_{Z}^{nieq} - E_{k}^{L} (F_{w k}, t))],

(295a)

with

G_{Z}^{nieq} ≐ G_{Z^{'}}^{ieq}

, and

E_{k}^{L} (F_{w k}, t) = E_{k}^{L} (F_{w k}) + Φ_{k}^{'} (t)

(295b)

with an explicit time dependence coming from

Φ_{k}^{'} (t)

. We now extend the definition of the above partition functions to the current situation of an arbitrary macrostate

M

with a possible explicit time dependence:

Z_{Z} (F, t) ≐ exp (- β G_{Z} (F, t)) \equiv \sum_{k} exp (- β E_{k}^{L} (F_{w k}, t)),

(296)

which covers all possible macrostates; we have eliminated the suffix “nieq” as it is no longer necessary. The extension to the Fl

\{W_{k}\}

is trivial and will not be given.

In this general form,

Z_{Z} (F, t)

includes the above three partition functions.

We have seen above from Equations (275) and (286) that the statistical mechanics will be different in the two approaches depending on the choice of the parameter:

W

vs.

F_{w}

. The former has fluctuating microforces

F_{w k}

, while the latter has fluctuating microworks

W_{k}

.

From

p_{k}^{eq} (β_{0}, w_{0}), p_{k}^{ieq} (β, W)

, and

p_{k}^{nieq} (β, W, t)

, we have a complete description of the microstate probability

p_{k}

for any arbitrary macrostate. This completes the discussion of the most important quantity in the

μ

NEQT.

12.7. Common EQ Ensembles

We make a few comments about some common EQ ensembles in statistical mechanics. They are defined in different state spaces. The microcanonical ensemble correspond to a system at fixed (NFl) E, while a canonical ensemble corresponds to a Fl

\{E_{k}\}

due to a NFl temperature T. In both ensembles, the state variable

w

becomes irrelevant so the entropy

S (E)

only depends on E. This means that

f_{w} \equiv 0

so

f_{w} \cdot w \equiv 0

. This makes

E^{L} = E

for both ensembles. Therefore, both ensembles are defined in the state space

S_{E}

. As there are no fluctuations (besides the unimportant width of the energy shell around E in the phase space) in the microcanonical ensemble, its probability distribution must be given by the flat distribution in Equation (282) over all microstates

p_{k}^{micro} = 1 / W (E, w) = exp (- S);

(297)

see Equation (282). For the canonical ensemble, the microstate probability becomes

\begin{matrix} p_{k}^{can} (β, E_{k}) & = exp (- β_{0} E_{k}) / Z^{eq} (β_{0}), \end{matrix}

(298a)

\begin{matrix} Z^{eq} (β_{0}) & ≐ \sum_{k} exp (- β_{0} E_{k}), \end{matrix}

(298b)

where we have used the standard symbols for the EQ partition function

Z^{eq} (β_{0}) = exp (- β_{0} F (β_{0}))

and the Helmholtz free energy

F (β_{0})

.

In the grand canonical distribution, we allow Fl particle number N, which is controlled by a NFl chemical potential

μ_{0}

, in addition to Fl energy E, so the distribution is defined in the state space

S_{E, N}

. It is written as the

V T μ

-ensemble. By extending the discussion above to now include N and use

S_{E, N}

, we have

\begin{matrix} p_{k}^{gr - can} (β_{0}, μ_{0}, E_{k}, N_{k}) & = exp (- β_{0} E_{k}^{L}) / Z^{eq} (β_{0}, μ_{0}), \end{matrix}

(299a)

\begin{matrix} Z^{eq} (β_{0}, μ_{0}) & ≐ \sum_{k} exp (- β_{0} E_{k}^{L}), \end{matrix}

(299b)

where

E_{k}^{L} (μ_{0}) ≐ E_{k} - μ_{0} N_{k},

(300)

and

Z^{eq} (β_{0}, μ_{0}) ≐ exp (- β_{0} Φ)

is the grand canonical partition function. For the

N T P

-ensemble, we have FL E and V so we find

E_{k}^{L} (P_{0}) ≐ E_{k} (V_{k}) - P_{0} V_{k}

, which is used to obtain the microstate probability and the partition function. Other ensembles can also be considered following the discussion given above.

All these ensembles are easy to extend to NEQ ensembles by introducing the internal variables in

\ddot{ξ} \subseteq ξ

and affinity

{\ddot{A}}_{k}

in

E_{k}^{L}

and the NEQ partition function

Z_{Z} ≐ exp (- β G_{Z}) \equiv \sum_{k} exp (- β E_{k}^{L}) .

With proper change in

E_{k}^{L}

, we can also consider Fl-

\ddot{ξ}

and NFl-

\ddot{A}

, or a combination of them, to be used in the

Z_{Z}

above.

13. Ensembles of Process Trajectories

With

W

(whether NFl or Fl) as the macrowork parameter, the variation

d Z (t) ≐ (d E (t), d W (t))

in

S_{Z}

defines not only the microwork

\{d W_{k}\}

, but also a thermodynamic process

P

. The trajectory

γ_{k}

in

S_{Z}

followed by

m_{k}

as a function of time will be called the Hamiltonian trajectory during which

W

varies from its initial (in) value

W_{in}

to its final (fin) value

W_{fin}

during

P

, the path

γ_{P}

denoting the path the macrostate follows during this process; see Definition 20. The variation produces the generalized microwork

d W_{k}

. As

p_{k}

plays no role in

d W_{k}

, its determination is simplified considerably in the

μ

NEQT. The microwork

d W_{k}

also does not change the index k of

m_{k}

, as said above. The ensemble average of

F_{w k}

is

F_{w}

(see Equation (113)), and that of

d W_{k}

is

d W

(see Equation (39)). This notion of micro- and macrowork is consistent with using the mechanical definition of work (force times displacement).

Here, we discuss a process

P

in terms of trajectories

\{γ_{k}\}

followed by microstate

\{m_{k}\}

. We assume that the trajectories are uniquely specified. For an NEQ process, this requires defining these trajectories in the state space

S_{Z}

, as discussed in Section 12. Trajectories are useful only when the number

|m|

of microstates remain the same during the process:

{|m|}_{in} = {|m|}_{fin}

. If they are different, such as when

{|m|}_{in} < {|m|}_{fin}

, we need to add in

{|m|}_{in}

the missing microstates that have vanishing microstate probabilities initially during

P

. This is a common situation in the expansion of a classical gas, which we discuss in Section 16. The converse happens in the contraction, where missing microstates are added in

{|m|}_{fin}

but again with vanishing microstate probabilities. We will assume in this section that we have ensured that

{|m|}_{in} = {|m|}_{fin}

. Thus, the number of unique Hamiltonian trajectories in

P

is exactly

{|m|}_{in} = {|m|}_{fin}

at all times during

P

.

13.1. Trajectory Ensemble

A central aspect of the

μ

NEQT is the fundamental distinction between process microquantities like

d W_{k}

and

d Q_{k}

along a Hamiltonian trajectory

γ_{k}

. As we have seen in Conclusion 11,

d W_{k}

and

d Q_{k}

are defined in independent state subspaces

S_{W}

and

S_{S}

, respectively, which makes them independent process microquantities. To introduce a trajectory ensemble requires a collection

γ

of all Hamiltonian trajectories

\{γ_{k}\}

and the probabilities, which will be used to introduce the trajectory ensemble average (TEA). The uniqueness inherent in Equation (110) that there is a well-defined

p_{k}

for each uniquely specified

m_{k}

will not usually hold for the TEA

{〈•〉}_{TE}

. We will see that there are many possible probabilities depending on the microquantities

\{χ_{k}\}

being averaged over

γ

. Thus, we need to introduce a trajectory probability

p_{γ_{k}}^{(q, α)}

for accumulation

Δ_{α} χ (P),

χ \in χ ≐ \{S, E, W\}

, with

χ

defined as the average in Equation (12):

Δ_{α} χ (P) = \int_{P} d_{α} χ = \sum_{k} \int_{γ_{k}} (p_{k} d_{α} χ_{k} + χ_{k} d_{α} p_{k})

(301)

The accumulation in the first equation is defined as the integration (summation) of

d_{α} χ

over

P

, which can be expressed as a sum over the Hamiltonian trajectories

\{γ_{k}\}

followed by

\{m_{k}\}

over

P

, as shown in the second equation and briefly discussed recently [156]; see also Equation (307b).

In order for the resulting theory to remain consistent with classical thermodynamics and, in particular, with the second law, we must ensure that

{〈•〉}_{TE}

remains consistent with thermodynamic averages in the MNEQT described in the previous sections. This is not a major issue for the consideration of various microworks in

S_{W}

, as we show now.

13.2. Trajectory Quantities

Let us consider some infinitesimal microquantity

d_{α} θ_{k}

for a body

Σ_{b}

(see Equation (10b)) and the ensemble average

〈d_{α} θ〉 = \sum_{k} p_{k} d_{α} θ_{k}

for a body

Σ_{b}

. We now introduce a trajectory quantity

Δ_{α} θ_{k}

and a path quantity

〈Δ_{α} θ〉

obtained by accumulating them along

γ_{k}

and

P

by simply integrating

d_{α} θ_{k}

and

〈d_{α} θ〉

, respectively:

Δ_{α} θ_{k} ≐ \int_{γ_{k}} d_{α} θ_{k}, 〈Δ_{α} θ〉 ≐ \int_{P} 〈d_{α} θ〉 .

(302)

We give the explicit forms of the most important trajectory quantities below:

\begin{matrix} Δ_{α} W_{k} & ≐ \int_{γ_{k}} d_{α} W_{k}, Δ_{α} {\tilde{W}}_{k} ≐ \int_{γ_{k}} d_{α} {\tilde{W}}_{k}, \end{matrix}

(303a)

\begin{matrix} Δ_{α} Q_{k} & ≐ \int_{γ_{k}} d_{α} Q_{k}, Δ_{α} E_{k} ≐ \int_{γ_{k}} d_{α} E_{k}, \end{matrix}

(303b)

\begin{matrix} Δ_{α} S_{k} & ≐ \int_{γ_{k}} d_{α} {\bar{S}}_{k}, \end{matrix}

(303c)

using which we finally obtain

\begin{matrix} 〈Δ_{α} W〉 & = Δ_{α} W = - \sum_{k} \int_{γ_{k}} p_{k} d_{α} E_{k}, \\ 〈Δ_{α} Q〉 & = Δ_{α} Q = \sum_{k} \int_{γ_{k}} E_{k} d_{α} p_{k}, \\ Δ_{α} S & = - \sum_{k} \int_{γ_{k}} η_{k} d_{α} p_{k} . \end{matrix}

(304)

We must also recall Theorem 3. The equation for

Δ_{α} W_{k}, Δ_{α} W = 〈Δ_{α} W〉

above and their differential forms

d_{α} W_{k}, d_{α} W = 〈d_{α} W〉

provide the correct identification at the microscopic level in terms of the SI-quantities

d_{α} E_{k}, 〈d_{α} E〉

, and must be used to account for irreversibility. So far, we have had no need to introduce the concept of temperature in the discussion so the discussion is valid for all possible processes. In terms of

T (t)

of the body

Σ_{b}

, we can use an alternate expression for

Δ_{α} Q_{k}

and

Δ_{α} Q

as follows:

\begin{matrix} Δ Q_{k} & ≐ \int_{γ_{k}} T (t) d {\bar{S}}_{k}, Δ Q = \sum_{k} \int_{γ_{k}} T (t) d {\bar{S}}_{k}, \\ Δ_{e} Q_{k} & ≐ \int_{γ_{k}} T_{0} (t) d_{e} \bar{S}, Δ_{e} Q = \sum_{k} \int_{γ_{k}} T_{0} (t) d_{e} \bar{S}, \\ Δ_{i} Q_{k} & ≐ Δ Q_{k} - Δ_{e} Q, Δ_{i} Q = \sum_{k} Δ_{i} Q, \end{matrix}

(305)

where we have used the fact that exchange microquantities are NFl; see Theorem 7 and Equation (193c). The instantaneous temperature

T (t)

does not necessarily equal the instantaneous temperature

T_{0} (t)

of the medium

{\tilde{Σ}}_{h}

; see Equation (144). Only if the process is isothermal (we now require the temperature of

{\tilde{Σ}}_{h}

to remain constant at

T_{0}

throughout the process) do we have

Δ_{i} W = T_{0} Δ_{i} S;

(306)

13.3. Trajectory Averages and Probabilities

We consider the expanded form of

Δ_{α} χ

in Equation (301) obtained by interchanging the integration and summation and changing the integration to over the Hamiltonian trajectory

γ_{k}

. Using the mean-value theorem of calculus, we can rewrite it as

Δ_{α} χ (P) = \sum_{k} (p_{γ_{k}}^{(χ α)} Δ_{α} χ_{k} + χ_{γ_{k}}^{(α)} Δ_{α} p_{k})

(307a)

in terms of the mean values

p_{γ_{k}}

and

χ_{γ_{k}}

of the two integrals, respectively,

p_{γ_{k}}^{(χ α)} ≐ \frac{\int_{γ_{k}} p_{k} d_{α} χ_{k}}{Δ_{α} χ_{k}}, χ_{γ_{k}}^{(α)} ≐ \frac{\int_{γ_{k}} χ_{k} d_{α} p_{k}}{Δ_{α} p_{k}};

(307b)

here,

Δ_{α} χ_{k}

is defined in Equation (302), and

Δ_{α} p_{k} ≐ \int_{γ_{k}} d_{α} p_{k} (t) .

(308)

It is well-known that

p_{γ_{k}}^{(χ α)}

is the value of

p_{k}

at some point along

γ_{k}

; similarly,

χ_{γ_{k}}^{(α)}

is the value of

χ_{k}

also at some point along

γ_{k}

; the two points need not be the same. It is easy to verify that

\sum_{k} p_{γ_{k}}^{(χ α)} = 1

(309)

as expected of a probability measure.

We see that the sum of the first term in Equation (307a) is identical in form to the accumulated thermodynamic average

〈Δ_{α} θ〉

given in Equation (302), and is expressed in terms of trajectories as follows:

〈Δ_{α} θ〉 ≐ \sum_{k} \int_{γ_{k}} p_{k} d_{α} θ_{k},

(310)

where

p_{k} (t)

usually cannot be taken out of the integral sign; see Equation (303). We now express

〈Δ_{α} θ〉

in a form suitable for using trajectory quantities. We require that the trajectory average

{〈Δ_{α} θ〉}_{TE}

reproduce the thermodynamic average

〈Δ_{α} θ〉

over

P

. In that case, there is no reason to explicitly show the suffix TE unless the requirement is not met. Using Equation (302) for

〈Δ_{α} θ〉

, we define

{〈Δ_{α} θ〉}_{TE} ≐ \sum_{k} p_{γ_{k}}^{(θ α)} Δ_{α} θ_{k}

(311)

where we have introduced the trajectory probability

p_{γ_{k}} = p_{γ_{k}}^{(θ α)}

in terms of

d_{α} θ_{k}

:

p_{γ_{k}}^{(θ α)} ≐ \int_{γ_{k}} p_{k} (t) d x_{k}^{(θ α)} (t);

(312)

see Equation (307b). Here

d x_{k}^{(θ α)} (t) ≐ d_{α} θ_{k} (t) / Δ_{α} θ_{k} .

We note from Equation (311) that

Δ_{α} θ

is the trajectory average with respect to

{p_{γ_{k}}^{(θ_{α})}}

. The procedure ensures that

{〈Δ_{α} θ〉}_{TE} = 〈Δ_{α} θ〉

per our requirement. Using

d {\tilde{W}}_{k} (t) \equiv - d_{e} W_{k} (t)

and

d W_{k} (t)

for

d_{α} θ_{k} (t)

, we obtain the average accumulated work

Δ \tilde{W} \equiv - Δ_{e} W

done on and

Δ W

done by the system, respectively, in terms of the respective trajectory probabilities:

\begin{matrix} Δ \tilde{W} & ≐ \sum_{k} p_{γ_{k}}^{(We)} Δ {\tilde{W}}_{k} = - Δ_{e} W, \end{matrix}

(313a)

\begin{matrix} Δ W & ≐ \sum_{k} p_{γ_{k}}^{(W)} Δ W_{k}; \end{matrix}

(313b)

the probabilities are determined by

d x_{k}^{(We)} (t) = d {\tilde{W}}_{k} (t) / Δ {\tilde{W}}_{k},

and

d x_{k}^{(W)} (t) ≐ d W_{k} (t) / Δ W_{k},

respectively, over

P

, in Equation (312). The irreversible macrowork is obtained as

Δ_{i} W = Δ W_{0} = Δ W + Δ \tilde{W}

.

One can similarly define another possible trajectory probability

p_{γ_{k}}^{(I)}

by using

d x_{k}^{(t)} (t) ≐ d t / τ

over

γ_{k}

:

p_{γ_{k}}^{(I)} ≐ \int_{γ_{k}} p_{k} (t) d x_{k}^{(t)} (t);

here

τ

is the duration of

P

. This trajectory probability is determined by

γ_{k} (t)

alone (it is not affected by

d_{α} θ_{k} (t)

]) and can be identified as an intrinsic (I) trajectory probability.

It should be evident that the three trajectory probabilities are not the same. In other words, there is no unique trajectory probability

p_{γ_{k}}

as said earlier. This justifies the use of the identifying superscript in each of the trajectory probabilities above.

As suggested above, use of

γ_{k}

allows us to determine thermodynamic macroworks

Δ W

or

Δ \tilde{W}

in a straightforward manner. The determination of

Δ \tilde{W} = - Δ_{e} W

is simplified by the use of Theorem 7 as we show below. We first use

d {\tilde{W}}_{k} = d \tilde{W} = d_{e} \tilde{W}

to obtain

Δ_{e} {\tilde{W}}_{k} ≐ \int_{γ_{k}} d_{e} \tilde{W} = Δ_{e} \tilde{W} = - Δ_{e} W

so that

Δ \tilde{W} = - Δ_{e} W \sum_{k} p_{γ_{k}}^{(I)} = - Δ_{e} W

as expected.

We now turn to the sum of the second term in Equation (307a), which is simply

Δ_{α} χ_{s} (P) ≐ \int_{P} d_{α} χ_{s} ≐ \sum_{k} \int_{γ_{k}} χ_{k} d_{α} p_{k} .

For example, the cumulative macroheat

Δ Q = Δ E_{s}

is the ensemble sum over k of the integral of

E_{k} d p_{k}

or ensemble average of

E_{k} d η_{k}

over

γ_{k}

[33]. We now express

Δ_{α} χ_{s}

in a form suitable for using trajectories by using the mean value theorem; see Equation (307b). This is precisely the above sum in Equation (307a). We can express

χ_{γ_{k}}^{(α)}

χ_{γ_{k}}^{(α)} ≐ \int_{γ_{k}} χ_{k} (t) d y_{k}^{(p α)} (t),

where

d y_{k}^{(p α)} (t) ≐ d_{α} p_{k} (t) / Δ_{α} p_{k} .

As an example, we show how the above discussion can be applied to

Δ_{e} Q

given in Equation (304). We have already observed that

d_{e} Q_{k} = E_{k} d_{e} η_{k} = d_{e} Q

. Thus,

Δ_{e} Q = \sum_{k} \int_{γ_{k}} p_{k} d_{e} Q = \int_{P} d_{e} Q \sum_{k} p_{k} = \int_{P} d_{e} Q,

an obvious result from the MNEQT. By replacing

d_{e} Q_{k}

by

d_{e} S_{k}

, and following the same argument, we find the same consistency with the MNEQT in that

Δ_{e} S = \sum_{k} \int_{γ_{k}} p_{k} d_{e} S = \int_{P} d_{e} S .

13.4. The MNEQT

We briefly review the MNEQT, which is described by considering SI-changes in

χ

over the process

P

. The corresponding change is

Δ χ ≐ \int_{P} d χ = χ_{fin} - χ_{in}

(314)

between the initial (in) and the final (fin) macrostates of the process

P

; here

d χ ≐ χ (t + d t) - χ (t)

. We also need to determine

Δ_{e} χ ≐ \int_{P} d_{e} χ, Δ_{i} χ ≐ \int_{P} d_{i} χ,

(315)

so that

Δ χ = Δ_{e} χ + Δ_{i} χ

, as expected. Similarly, for the process quantities, we have

Δ_{α} χ_{m} ≐ \int_{P} d_{α} χ_{m}, Δ_{α} χ_{s} ≐ \int_{P} d_{α} χ_{s} .

(316)

The determination of

\{χ_{k} (t)\}

at each instant t by following its Hamiltonian evolution

γ_{k}

is needed to formulate the

μ

NEQT. However, this is not the complete story, as the stochasticity requires a strategy to determine the set

\{p_{k} (t)\}

as well. The knowledge of both

\{χ_{k} (t)\}

and

\{p_{k} (t)\}

will completely determine the

μ

NEQT. The same knowledge also allows us to determine the average

χ

, which then determines the MNEQT of

M

. In some situation, it is also possible to derive the

μ

NEQT from first developing the MNEQT. In both cases, we must ensure that Condition 1 is fulfilled:

Condition 1.

A

μ

NEQT is required to remain consistent with the MNEQT. This should be equivalent to the justification of the MNEQT by a statistical procedure.

14. Mechanical Microfriction

Within the framework of the

μ

NEQT, we wish to uncover how microscopic friction (microfriction) that eventually results in frictional dissipation emerges in a system in the guise of an internal variable; see Equation (137). The application further gives an example of how a system Hamiltonian becomes dependent on internal variables, and how the system can be kept stationary despite motion of its parts. However, the most important aspect of this section is the emergence of the Langevin evolution due to the relative motion of its parts in the

μ

NEQT without introducing an ad hoc Langevin dynamics.

Remark 69.

This is an example of how internal variables in deterministic mechanics turn into stochastic (i.e., thermodynamic) variables with fluctuating dynamics. This is a consequence of the stochastic aspect in the μNEQT.

14.1. Piston–Gas System

14.1.1. Microdescription

We consider the traditional undergraduate example depicted in Figure 3a for this exercise. To describe it realistically, we need to treat the motion of the piston by including its momentum

P_{p}

in our discussion. The gas, the cylinder, and the piston constitute the system

Σ

. We have a gas of mass

M_{g}

in the cylindrical volume

V_{g}

, the piston of mass

M_{p}

, and the rigid cylinder (with its end opposite to the piston closed) of mass

M_{c}

. The Hamiltonian

H

of the system is the sum of

H_{g}

of the gas,

H_{c}

of the cylinder,

H_{p}

of the piston, and the interaction Hamiltonian

H_{int}

between the three subsystems

Σ_{g}, Σ_{c}

, and

Σ_{p}

that make up

Σ

, and the stochastic interaction Hamiltonian

H_{stoc}

between

Σ

and

\tilde{Σ}

. As is customary, we will neglect

H_{stoc}

here. We assume that the centers of mass of the composite subsystem

Σ_{gc} = Σ_{g} \cup Σ_{c}

and

Σ_{p}

are moving with respect to the medium with linear momenta

P_{gc}

and

P_{p}

, respectively. We do not allow any rotation for simplicity. We assume that

P_{gc} + P_{p} = 0,

(317)

so that

Σ

is at rest with respect to the medium. Thus,

H (x| V, P_{gc}, P_{p}) = \sum_{λ} H_{λ} (x_{λ}| V_{λ}, P_{λ}) + H_{int},

where

λ =

g,c,p,

x_{λ} = (r_{λ}, p_{λ})

a point in the phase space

Γ_{λ}

of

Σ_{λ}

, and

P_{g} + P_{c} = P_{gc}

;

V_{λ}

is the volume of

Σ_{λ}

, and

V = V_{g} + V_{c} + V_{p}

is the volume of

Σ

. We do not exhibit the number of particles

N_{g}, N_{c}, N_{p}

as we keep them fixed. We let

x

denote the collection (

x_{g}, x_{c}, x_{p}

). Thus, the system’s microstate energy

E_{k} = H (x_{k}| V, P_{gc}, P_{p})

and the average energy E depend on the parameters

V, P_{gc}, P_{p}

and the macrostate. We first consider

E_{k}

and introduce the microfields

P_{k} ≐ - \frac{\partial E_{k}}{\partial V}, V_{gc k} ≐ \frac{\partial E_{k}}{\partial P_{gc}}, V_{p k} ≐ \frac{\partial E_{k}}{\partial P_{p}} .

(318a)

In terms of these microfields, we have

d E_{k} = - P_{k} d V + V_{gc k} \cdot d P_{gc} + V_{p k} \cdot d P_{p} = - d W_{k} .

(318b)

Using Equation (317), we can rewrite this equation as

d E_{k} = - P_{k} d V + V_{k} \cdot d P_{p},

(319a)

in terms of the

r e l a t i v e

microvelocity

V_{k} ≐ V_{p k} - V_{gc k}

(319b)

of the piston with respect to

Σ_{gc}

in the microstate

m_{k}

. The corresponding macrofields are denoted by

P, V ≐ V_{p} - V_{gc}

that appear in the MNEQT, which has been investigated previously in [157]. We briefly summarize this MNEQT. The SI-first law becomes

d E = T d S - [P d V - V \cdot d P_{p}],

(320a)

where we have used the conjugate macrofields

T ≐ \partial E / \partial S, P ≐ - \partial E / \partial V, V ≐ \partial E / \partial P_{p},

(320b)

as shown elsewhere ([148], and references therein). The relative velocity

V

is commonly known as the drift velocity of the piston with respect to

Σ_{gc}

. In terms of the exchange quantities, we can also write the first law as

d E = T_{0} d_{e} S - P_{0} d V,

(321)

as the EQ value of

V

is

V_{0} = \tilde{V} = 0

.

We can cast the velocity term in a more useful form from the viewpoint of dynamics by using

V_{k} \cdot d P_{p} \equiv F_{p} \cdot d R_{k},

(322)

where

F_{p} ≐ d P_{p} / d t

is the NFl force and

d R_{k} = V_{k} d t

is the Fl relative microdisplacement of the piston in

m_{k}

:

d E_{k} = - P_{k} d V + F_{p} \cdot d R_{k} = - d W_{k} .

(323)

Thus,

E_{k}

depends on V and

R_{k}

as parameters, in which V is NFl and

R_{k}

is Fl. This makes the current mechanical description mixed with

P_{k}

Fl and

F_{p}

NFl; see Claim 5. It is also possible to treat

\{F_{p k}\}

Fl with

R_{k} = R

NFl,

\forall k

. In this case,

F_{p} \cdot d R_{k}

will be replaced by

F_{p k} \cdot d R

as discussed earlier in Claim 6. As this microwork is internal, we can use Equation (20) to obtain the same physics. Thus, the corresponding Fl internal microworks are identical:

d_{i} W_{f k} ≐ - F_{p} \cdot d R_{k} \equiv - F_{p k} \cdot d R .

(324)

It is later identified with the work done by the microfriction as is indicated by the additional suffix f.

In the following, we focus on a single piston, so we will use NFl

R_{p}

and Fl

F_{p k}

. Using Equation (176) with

d_{e} W_{k} = P_{0} d V

, we conclude that

d_{i} E_{k} = - d_{i} W_{k} = - (P_{k} - P_{0}) d V + F_{p k} \cdot d R,

(375)

with

d Q_{k}

given in Equation (44a), and

d_{e} Q_{k}

given in Equation (256a). Thus,

d_{e} Q_{k} = - T_{0} d_{e} {\bar{S}}_{k}, d_{i} Q_{k} = d_{i} W_{k} .

(326)

It should be evident that by treating the piston as a mesoscopic particle such as a pollen or a colloid, we can treat its thermodynamics using the above procedure. This allows us to finally make a connection with the system depicted in Figure 3b in which the particle (a pollen or a colloid) may be manipulated by an external force

F_{0}

. We now treat a mesoscopic Brownian particle (BP); we will use

F_{BP}

for

F_{p}

to emphasize this. The internal motions of the BP are not controlled by any external agent, so the relative motion described by the relative displacement

R_{k}

represents an internal variable [42,108]. Accordingly, the corresponding NFl affinity

F_{p 0} = 0

for

\tilde{Σ}

. Because of this, Equation (321) does not contain the relative displacement

R

. Therefore, the use of this MI-version of the first law will not directly reveal all the fluctuations encountered by the BP. The use of the SI-first law in Equation (320a) is perfectly suited for this purpose. This will be the case below.

Let us consider the BP initially at

R (0)

at

t = 0

, which changes to

R (t)

in time as we observe it at successive times

τ_{obs}, 2 τ_{obs}, 3 τ_{obs}, \dots

. In the ballistic regime seen for

t ⪆ 0

, the BP undergoes correlated motion so that

Δ R (t < τ_{obs}) ≐ R (t < τ_{obs}) -

R (0)

depends strongly on the history. In accordance with Equation (95), it takes a while (

t < t^{*}

), the crossover regime, for the memory to disappear so that, at

t = τ_{obs}

, the correlation disappears so that

Δ R (τ_{obs}) ≐ R (τ_{obs}) -

R (0)

has lost memory of the past. In other words,

R (τ_{obs})

has no memory of

R (0)

, a requirement of the BP being in some

M_{ieq}

. In another time period

τ_{obs}

, the macrostate changes into another

M_{ieq}

, and so on. We denote the corresponding microforces to get from one

M_{ieq}

into the next

M_{ieq}

, by

F_{i, BP} (i τ_{obs})

. The motion during each observation follows

m \ddot{R} (i τ_{obs}) = F_{i, BP} (t),

where m is the reduced mass ([157], Equation (31)). At different observation times, the sequence

{\{Δ R (i τ_{obs}) = R (i τ_{obs}) - R ((i - 1) τ_{obs})\}}_{i = 1, 2, \dots}

is a sequence of uncorrelated displacements. We now follow the original idea of Einstein to treat the net displacement

Δ R (t_{obs} = \bar{ı} τ_{obs}) = \sum_{i = 0}^{\bar{ı}} Δ R (i τ_{obs})

of a BP as a random walk, which gives Equation (40) in ([157], Equation (31)), as expected. Note that this is a temporal scan of the BP. As is customary in the

μ

EQT, we can scan the states of the BP at successive time

i τ_{obs}

as different microstates so

\{F_{i, BP}\}

represents the set of microforces [33]. With this interpretation, we can justify the ensemble average to be no different than the temporal average, which is consistent with our discussion of extending the ergodicity hypothesis to NEQ phenomena as discussed in Section 1.1; see also Remark 53.

14.1.2. Macrofriction

With the changeover in Equation (322), Equation (323) becomes

d E = T d S - (P d V - F_{p} \cdot d R)

, which was extensively used in [157] to study the dynamics of the piston. Comparing it with Equation (321), we immediately find

d_{i} W = (P - P_{0}) d V - F_{p} \cdot d R \geq 0,

which is in accordance with Theorem 4. Consequently, we must have

(P - P_{0}) d V \geq 0, V \cdot d P_{p} = F_{p} \cdot d R \leq 0 .

(327)

In equilibrium,

P \to P_{0}, and V \to 0 or F_{p} \to 0

(328)

as expected. The inequality

F_{p} \cdot d R \leq 0

shows that

F_{p}

and

d R

are antiparallel, which is what is expected of a frictional macroforce. This causes the piston to finally come to rest. As

F_{p}

and

V

vanish together, we can express this force as

F_{p} = - μ V f (V^{2}),

(329)

where

μ > 0

and f is an even function of

V

. The medium

\tilde{Σ}

is specified by

T = T_{0}, P = P_{0}

and

V_{0} = 0

or

F_{p} = 0

. We will take

F_{p}

and

d R

to be collinear and replace

F_{p} \cdot d R

by

- F_{f} d x

(

F_{f} d x \geq 0

), where the magnitude

F_{p}

is written as

F_{f}

as a reminder that this force is responsible for the frictional force and

d x

is the magnitude of the relative displacement

d R

. The sign convention is that

F_{f}

and increasing x point in the same direction. From Equation (320a), we obtain

d E = T d S - P d V - F_{f} d x .

(330)

The macrowork by friction is

d W_{f} = F_{f} d x .

(331)

The important point to note is that the friction term

F_{f} d x

properly belongs to

d W

. Thus,

d_{i} W_{f} = F_{f} d x \geq 0;

(332)

thus,

d_{e} W_{f} \equiv 0

. In other words, friction always results in dissipation; it never appears in a reversible process. Both contributions in

d_{i} W

are separately nonnegative; see Corollary 1.

We can determine the exchange heat

d_{e} Q = d Q - d_{i} W

d_{e} Q = T d S - (P - P_{0}) d V - F_{f} d x .

(333)

It should be emphasized that in the above discussion, we have not considered any other internal motion such as between different parts of the gas besides the relative motion between

Σ_{gc}

and

Σ_{p}

. These internal motions within

Σ_{g}

can be considered by following the approach outlined elsewhere [148]. We will not consider such a complication here.

Remark 70.

In the μNEQT, the microfriction work

d_{i} W_{f k}

in Equation (324) appears as part of the internal microwork

d_{i} W_{k}

. This contribution exactly balances a contribution to the internal microheat

d_{i} Q_{k}

due to the last identity in Equation (326). It should be recalled that despite the equality, internal microheat and internal microwork have two independent origins in the μNEQT as discussed in Section 10.2: the former arises from the change

d_{i} η_{k}

(see Equation (240)), while the latter arises from the change

d_{i} E_{k}

(see Equation (176)). It is interesting to observe that Sekimoto [140] treats the frictional work

F_{f k} d x

as microheat, which then allows him to identify it as the exchange microheat

d_{e} Q_{k}

. Then identifying the remainder of the SI work (

P_{k} d V

above) as the opposite of the external (medium) microwork in accordance with Equation (71b) allows him to write down an analog of the first law at the microstate level in the

\overset{˚}{μ}

NEQT; see also Crooks [141]. This makes these approaches different from ours.

14.1.3. Particle–Spring–Fluid System

We need to consider two additional forces

F_{s}

and

F_{f}

, both pointing in the same direction as increasing x; the latter is the frictional force induced by the presence of the fluid in which the particle is moving around. The analog of Equation (332) for this case becomes

d_{i} W = (F_{s} + F_{0}) d x + F_{f} d x ≐ F_{t} d x,

(334)

where

F_{t} = F_{s} + F_{0} + F_{f}

. The other two works are

d W = (F_{s} + F_{f}) d x

and

d \tilde{W} = F_{0} d x = - d_{e} W

. In EQ,

F_{f} = 0

and

F_{s} + F_{0} = 0

(

F_{0} \neq 0

) to ensure

d_{i} W = 0

. In this case,

d \tilde{W} = - d W = F_{0} d x

, but this will not be true for an NEQ state since

d_{i} W > 0

.

14.1.4. Particle–Fluid System

In the absence of a spring in the previous subsection, we must set

F_{s} = 0

so

d W = F_{f} d x, d \tilde{W} = F_{0} d x = - d_{e} W, d_{i} W = (F_{0} + F_{f}) d x .

(335)

This is the situation of a driven particle undergoing Langevin evolution with various works that have been identified. In EQ,

F_{0} + F_{f} = 0

so that

F_{f} = - F_{0}

. This means that in EQ, the particle’s nonzero terminal velocity is determined by

F_{0}

as expected. In this case,

d \tilde{W} = - d W = F_{0} d x

, but this will not be true for an NEQ state.

As the above works denote average works, we can identify their microscopic analogs by inspection:

d W_{k} = F_{f k}, d_{e} W = - F_{0} d x

and

d_{i} W_{k} = (F_{0} + F_{f k}) d x = d_{i} Q_{k}

.

15. An NEQ Microwork Fluctuation Theorem in $S_{Z}$

As an important application of the

μ

NEQT, we derive an NEQ microwork fluctuation theorem for an arbitrary macrostate

M

. This should be contrasted with the fluctuation theorem proposed by [142,143,144], which is restricted between two EQ macrostates. We will follow the method that we have proposed earlier [150,151]. As is usual, we take the set

\{m_{k}\}

to be countable infinite. We also consider

W

to be NFl, but the discussion is easily extended to Fl

\{W_{k}\}

. The Legendre-transformed microenergy

E_{k}^{L} (F_{w k}, t)

(see Equation (295b)) for an arbitrary macrostate

M

changes as

m_{k}

changes due to varying

W

during a process

P

between

M^{(in)} = \{m_{in k}, p_{in k}\}

and

M^{(fin)} = \{m_{fin k}, p_{fin k}\}

, but k does not change. The microenergy change along a trajectory

γ_{k} ≐ γ_{k} (m_{fin k} ∣ m_{in k})

during

P

between

t_{in} = 0

and

t_{fin} = τ

is related to the mechanical microwork

Δ W_{k}^{L} (m_{fin k}, m_{in k}) = - Δ E_{k}^{L} ≐ - (E_{fin k}^{L} - E_{in k}^{L});

(336)

we use Equation (160) for

E_{k}^{L}

. Being mechanical,

Δ E_{k}^{L}

is independent of

p_{k}

. By definition,

Δ E_{k}^{L} = Δ E_{k} + Δ Φ,

(337)

with

Φ

defined in Equation (23b). We finally conclude that

Conclusion 13.

If we are interested in knowing the cumulative change

Δ W_{k}^{L}

, we only need to determine

Δ E_{k}^{L}

by following the same

m_{k}

mechanically along

γ_{k}

during

P

. The probability plays no role as

Δ W_{k}^{L}

is a microstate function, i.e., is a difference between the Legendre-transformed microstate energies of the terminal microstates

m_{in k}

and

m_{fin k}

, and not of the actual trajectory

γ_{k}

; see Equation (336). Thus, it is not a process microquantity.

Remark 71.

It should be stated here that

Δ W_{k}^{L} (m_{fin k}, m_{in k})

is the same for all different processes

P ≐ P (M^{(in)} ∣ M^{(fin)})

’s between the same two arbitrary macrostates

M_{fin}

and

M_{in}

so that they all share the same set of trajectories

\{γ_{k}\}

between

E_{fin k}^{L}

and

E_{in k}^{L}

(see Definition 5), so

Δ W_{k}^{L} (m_{fink}, m_{in k}) = - Δ E_{k}^{L} (F_{w, fin}, F_{w, in}), \forall P,

(338a)

although Fl is not a process quantity. The internal microwork, which is a Fl process microquantity, is

Δ_{i} W_{k}^{L} = Δ W_{k}^{L} - Δ_{e} W^{L},

(338b)

with

Δ_{e} W^{L}

defined by

Δ_{e} W^{L} = - \int_{P} w \cdot d f_{0 w};

see Equation (157). The latter is also a process macroquantity, but is NFl as it is the same for all

m_{k}

’s.

What the above remark implies is the following. Different processes between the same two macrostates

M^{(in)}

and

M^{(fin)}

differ not in

\{Δ W_{k}^{L}\}

but in

\{p_{k}\}

so the Fl

\{Δ W_{k}^{L}\}

is the same for all processes involving

M_{eq}, M_{ieq}

, or

M_{nieq}

. This means that we can determine

\{Δ W_{k}^{L}\}

for some process between

M^{(in)}

and

M^{(fin)}

such as an EQ process between

M^{(in)} = M_{eq}^{(in)}

and

M^{(fin)} = M_{eq}^{(fin)}

. Then, the same

\{Δ W_{k}^{L}\}

will also describe any possible

P (M^{(in)} ∣ M^{(fin)})

. On the other hand,

Δ_{e} W^{L} (P)

is NFl (over

\{m_{k}\}

) but depends on the process, and will have to be determined for each one of them separately. This makes the generalized microwork

Δ_{e} W^{L}

or

Δ W_{k}

unique in that it does not depend on the nature of

P

so dealing with it is simpler. Despite this, as it is Fl, it contains the contribution of dissipation in it given by the average

Δ_{i} W_{k} (P) ≐ 〈Δ_{i} W (P)〉

, as we will demonstrate below.

Before demonstrating this, we make the following observation. The property of a quantum

m_{k}

maintaining its identity during

P

is because we have assumed

m_{k}

to be a singlet; see Remark 5. If

m_{k}

is degenerate, it can be, without any intervention from the medium, transformed into any of them without changing their microenergies. The important fact to remember is that transformations among degenerate microstates happens in both ways so they do not affect their probabilities. This is no different for a classical microstate

m_{k}

; see Definition 4. This microstate changes from

δ x_{k}

to

δ x_{l}, k \neq l

as it evolves in time following its Hamiltonian dynamics, both having the same microenergy so the dynamics relates microstates on the same energy shell just like the degenerate microstates above. The Hamiltonian dynamics also does not change

\{p_{k}\}

. In both mechanics, the deterministic dynamics causes no problem as the change

Δ W_{k}^{L} = - Δ E_{k}^{L}

is not affected by any stochasticity in the evolution. It only changes due to work variables; see Conclusions 11 and 12 for more details. In this case, introducing

E_{k}^{L} (τ) = E_{k}^{L} (δ x_{k} (τ)), E_{k}^{L} (0) = E_{k}^{L} (δ x_{k} (0),

we can write

Δ W_{k}^{L}

as in Equation (336). Thus, whether we are considering a classical system or a quantum system, we can always express

Δ W_{k}^{L}

as in Equation (336).

We now consider a process

P

taken by

Σ

between two arbitrary macrostates

M^{(in)}

and

M^{(fin)}

having

Z_{Z in} (β, F_{w, in}, t)

and

Z_{Z fin} (β, F_{w, fin}, t)

as respective NEQ partition functions (see Equation (283)), as the work parameter varies from

W_{in}

to

W_{fin}

. The inverse temperature in the terminal macrostates is

β

, which may be different from

β_{0}

of the medium. As a special case, the terminal macrostates can refer to EQ macrostates, so they are included in our analysis below. In this case,

F = F_{0}

in the terminal macrostates, and will be considered below.

We now introduce the following exponential microwork average:

W_{in} (β| \{Δ W_{k}^{L}\}) ≐ {〈e^{β Δ W^{L}}〉}_{in} = \sum_{k} p_{k in} e^{β Δ W_{k}}

(339)

involving Fl microworks

Δ W_{k}^{L}

; here,

{〈\cdot〉}_{in}

refers to a special averaging with respect to the initial probabilities given in Equation (295a) at time

t_{in}

:

p_{k in} ≐ e^{- β E_{k, in}^{L}} / Z_{Z in} (β, F_{w, in}, 0)

of the initial macrostate

M_{ieq}^{(in)}

. This particular averaging was first introduced by Jarzynski in deriving what is commonly known as the Jarzynski equality (JE) [142,143,144] in the

\overset{˚}{μ}

NEQT. We will return to the equality latter; see the discussion leading to Equation (345).

Let us evaluate the particular average

{〈e^{β Δ W_{k}^{L}}〉}_{in}

in Equation (339) using Equation (336). We have

\begin{matrix} W_{in} (β| Δ W_{k}^{L}) & ≐ {〈e^{β Δ W^{L}}〉}_{in} ≐ \sum_{k} \frac{e^{- β E_{k, in}^{L}}}{Z_{Z in}} e^{β Δ W_{k}^{L}} \\ = \sum_{k} \frac{e^{- β E_{k, i}^{L}}}{Z_{Z in}} e^{- β (E_{k fin}^{L} - E_{k in}^{L})}, \end{matrix}

which leads to

W_{in} (β| \{Δ W_{k}^{L}\}) = \sum_{k} \frac{e^{- β E_{k, fin}^{L}}}{Z_{Z in}} = \frac{Z_{Z in} (β, F_{w, in}, τ)}{Z_{Z in} (β, F_{w, in}, 0)},

where

E_{k, fin}^{L}

and

Z_{Z in}

are final Legendre-transformed energy and the NEQ partition function for

Σ

. Introducing the thermodynamic potential energy difference

Δ G_{Z} (F, τ) ≐ G_{Z fin} (F_{fin}, τ) - G_{Z fin} (F_{in}, 0)

, we finally have

W_{in} (β| \{Δ W_{k}^{L}\}) = {〈e^{β Δ W^{L}}〉}_{0} = e^{- β Δ G_{Z}} .

(340a)

This is our new microwork theorem involving Legendre-transformed microworks

Δ W_{k}^{L}

. We can re-express the above equation in the following form:

{〈e^{β_{0} (Δ W^{L} + Δ G_{Z})}〉}_{in} = 1 .

(340b)

Recall that

Δ W_{k}^{L} + Δ G_{Z}

in the exponent on the left is nothing but

[- (Δ E_{k}^{L} - Δ G_{Z})] = [- (Δ E_{k} - Δ E) - (Δ Φ_{k} - Δ Φ) + T^{*} Δ S]

, where we have introduced a temperature-like quantity

T^{*}

by the following relation

T^{*} ≐ \frac{\int_{P} T d S}{Δ S} .

We thus see that the exponent on the left contains information about the entropy change

Δ S

, so

W_{in}

contains information about

Δ S

.

Remark 72.

The macrostates between

t_{in}

and

t_{fin}

in

P

used above need not belong to the state space

S_{Z}

.

Instead of an NEQ process between arbitrary macrostates, we now focus on an arbitrary process between

M_{eq}^{(in)}

and

M_{eq}^{(fin)}

, each in a canonical ensemble discussed in Section 12.7. In this case, we need to set

β = β_{0}

for the terminal macrostates, and use

p_{k in}^{can} ≐ e^{- β_{0} E_{k, in}} / Z_{in} (β_{0});

given in Equation (298); here,

Z_{in} (β_{0})

is the initial equilibrium partition function for the system at inverse temperature

β_{0}

, and

E_{k, in}

is the initial EQ microstate energy in

M_{eq}^{(in)}

. As we are in space

S_{X}

, we must set

Φ = 0

so

[E^{L}]

reduces to

[E]

for the terminal macrostates; see Equation (11a). In essence, this means that we do not need to consider any Legendre-transformed quantity in our discussion. Thus,

W_{in}

in Equation (339) is replaced by

W_{in}^{can} (β_{0}| \{Δ_{e} W_{k}\}) ≐ {〈e^{β_{0} Δ W}〉}_{in} = \sum_{k} p_{k in}^{can} e^{β_{0} Δ W_{k}}

(341)

in terms of the microworks

Δ W_{k}

. It is easy to see that Equation (340a) is replaced by

W_{in}^{can} (β_{0}| \{Δ_{e} W_{k}\}) = \frac{Z_{fin} (β_{0})}{Z_{in} (β_{0})} = e^{- β_{0} Δ F},

(342)

in terms of the free energy difference

Δ F ≐ F_{fin} - F_{in}

. This is our new work theorem involving microworks in the canonical ensemble.

On the other hand, if following Jarzynski [142,143,144] we use

Δ_{e} W_{k} = Δ_{e} W

in place of

Δ W_{k}

in

W_{in}

and evaluate the microwork average (we now add another suffix “e” as a reminder of the exchange microworks) introduced by him, we find that

W_{in}^{(can, e)} (β_{0}| \{Δ_{e} W_{k}\}) ≐ {〈e^{β_{0} Δ_{e} W}〉}_{in}

(343)

simply reduces to

W_{in}^{(can, e)} (β_{0}| \{Δ_{e} W_{k}\}) = e^{β_{0} Δ_{e} W} 〈p_{k in}〉 \equiv e^{β_{0} Δ_{e} W},

(344)

which is a purely MI-quantity, so it provides no information about the possible irreversibility in the system. This conclusion is very different from that arrived at by Jarzynski, who derived the Jarzynski relation (we now add another suffix “J” as a reminder of his evaluation)

W_{in}^{(can, J)} (β_{0}| Δ_{e} W_{k}) = e^{- β_{0} Δ F}

(345)

by using the conjecture in Equation (7) mentioned at the end of Section 1.1. The conjecture and its consequence for the concept of NEQ work have generated fierce debate in the literature [156,180,181,182,183,184,185,186,187,188,189,191,192,193,208,209,210,211,212]. We invite the reader to consult these references. We have also discussed the conjecture elsewhere [150,151] so we will not pursue it here. However, we do wish to make the following important observation. Instead of using the initial probability

p_{k in}

, we can use the thermodynamic trajectory probability

p_{γ_{k}}^{(E)}

or

p_{γ_{k}}^{(χ e)}

(see Equation (307b)) or any arbitrary probability measure

p_{γ_{k}}

for each trajectory

γ_{k}

, and still satisfy

〈p_{γ_{k}}〉 = 1

as seen from Equation (309). Thus,

W^{(can, e)} (β_{0}| \{Δ_{e} W_{k}\}) = e^{β_{0} Δ_{e} W} 〈p_{γ_{k}}〉 \equiv e^{β_{0} Δ_{e} W} .

A thermodynamically consistent result can be obtained for

Δ_{e} W_{k}

, which overcomes all the objections raised by Cohen and Mauzerall [180,181]. Using the thermodynamic probability

p_{γ_{k}}^{(χ e)}

in Equation (343) for each trajectory instead of

p_{k in}

, we obtain a thermodynamically consistent NEQ identity

W^{(e)} (β| \{Δ_{e} W_{k}^{L}\}) = e^{β_{0} Δ_{e} W^{L}} 〈p_{γ_{k}}^{(χ e)}〉 \equiv e^{β_{0} Δ_{e} W},

(346)

where

M^{(in)}

and

M^{(fin)}

both have the same NEQ temperature

β

, but the temperature

β (t)

along the rest of

P (t)

does not have to be equal to

β

. Here, the missing suffix “in” in

W^{(e)} (β| \{Δ_{e} W_{k}^{L}\})

(see Equation (339)) implies that we are no longer using the initial

p_{k in}

, and the additional suffix “e” is because we are using the exchange microwork. The trajectory probabilities contain the correct thermodynamic temperature profile of

P (t)

through

p_{k} (t)

in Equation (307b). However, as

W^{(e)} (β| \{Δ_{e} W_{k}^{L}\})

is invariant under the change of

p_{γ_{k}}

, the result does not care if

P

is reversible or not. Therefore, it provides no information about any irreversibility. The identity in Equation (340a) is not a thermodynamic identity but does include irreversibility. Unfortunately, it is not clear how to extract this information from it.

16. The Free Expansion

We now show that the new microwork relations in Equations (342) and (344) work for an isolated system undergoing internal dissipation for which the external work

Δ {\tilde{W}}_{k} = Δ_{e} W_{k} = Δ_{e} W = 0

, but where the applicability of the JE derived in the

\overset{˚}{μ}

NEQT is disputed [183,184,191,192,209,211]. This again shows the superiority of the

μ

NEQT over the

\overset{˚}{μ}

NEQT. Consider the case of a free expansion (

P_{0} = 0

) of a gas in an isolated system of volume

V_{fin}

, divided by an impenetrable partition into the left (L) and the right (R) chambers, as shown in Figure 5a. Initially, all the N particles are in the left chamber of volume

V_{in}

in an equilibrium state at temperature

T_{0}

; there is a vacuum in the right chamber. At time

t = 0

, the partition is suddenly removed, shown by the broken partition in Figure 5b and the gas is allowed to undergo free expansion to the final volume

V_{fin}

during

P

. After the free expansion, the gas is in an NEQ state and is brought in contact with

{\tilde{Σ}}_{h}

during

\bar{P}

to come to equilibrium at the initial temperature

T_{0}

. This will complete the process

\overset{˚}{P}

. If the gas is ideal, there is no need to bring in

{\tilde{Σ}}_{h}

for re-equilibration; we can let the gas come to equilibrium by itself, as it is well-known that the temperature of the equilibrated gas after free expansion is also

T_{0}

. It is this case that we will study here as the system becomes isolated.

It should be stated, as is also evident from Figure 5b, that while the removal of the partition can be instantaneous, the actual process of gas expanding in the right chamber is continuous and gradual. Therefore, at each instant, it is possible to imagine a front of the expanding gas shown by the solid vertical line enclosing the largest among smallest possible volumes containing all the particles so that there are no particles to the right of it in the right chamber in all possible realizations of the expanding gas. By this we mean the following. We consider all possible realizations of the expanding gas at a particular time

t > 0

and locate the front corresponding to the smallest volume containing all the gas particles to its left. Then we choose among all these fronts that particular front that results in the smallest volume on its right or the largest volume on its left. In this sense, this front is an average concept and is shown in Figure 5b. We have identified the volume to its right as “vacuum” in the figure. This means that at each instant when there is a vacuum to the right of this front, the gas is expanding against zero pressure so that

d \tilde{W} = 0

. Despite this, as the expansion is an NEQ process,

d W = d_{i} W > 0

.

The description of the nonuniformity in Figure 5b is an example of modeling noted in Remark 65. Above, we have model nonuniformity by dividing the volume into two regions of different densities. As the region to the left of the solid front is still very nonuniform, we can divide into two different regions of different densities. Similarly, the volume to the right of the front can also be divided into two regions of different densities as this region is certainly not going to a pure vacuum. How good a modeling is required depends on how good the measurements can be made or are required. More nonuniform regions require more internal variables, and the computation will also become complicated.

16.1. Quantum Free Expansion

We now apply Equation (340a) to the free expansion of a one-dimensional ideal gas of classical particles, but treated quantum mechanically as a particle in a box with rigid walls, which has been previously studied [219]; see also Bender et al. [220]. We assume that the gas is thermalized initially at some temperature

T_{0} = 1 / β_{0}

and then isolated from the medium so that the free expansion occurs in an isolated system. After the free expansion from the box size

L_{in}

to

L_{fin} > L_{in}

, the box is left to thermalize as it comes to equilibrium at the same temperature

T_{0}

. The role of V is played by the length L of the box. The discussion here will also set the stage for the classical treatment later.

For an isolated system, as discussed earlier, the Fl

Δ W_{k} \neq 0

, even though

Δ {\tilde{W}}_{k} = Δ \tilde{W} = - Δ_{e} W = 0

. Since we are dealing with an ideal quantum gas, we do not need to bring

{\tilde{Σ}}_{h}

, as said above (see below also), so we treat the system as isolated. As there is no inter-particle interaction, we can focus on a single particle for our discussion; its energy levels are in appropriate units

E_{k} = k^{2} / L^{2},

where L is the length of the box. During the free expansion, we have

Δ_{e} Q = Δ_{e} W = 0

(but

Δ_{i} Q = Δ_{i} W \neq 0

) so that

Δ E_{free} (L_{fin}, L_{in}) = 0

; see Equation (94). After the free expansion from the box size

L_{in}

to

L_{fin} > L_{in}

, the box is allowed to come to equilibrium in isolation so that we have

Δ E_{reeq} (L_{fin}) = 0

. Accordingly,

Δ E_{eq} (L_{fin}, L_{in}) = 0

after reequilibration.

The initial partition function is given by

Z_{in} (β_{0}, L) = \sum_{k} e^{- β_{0} E_{k, in}} .

Approximating the sum by an integration over k, as is common, we can evaluate

Z_{in} (β_{0}, L)

, from which we find that the free energy

F_{eq}

and the average energy

E_{eq}

are given by

β_{0} F_{eq} = - (1 / 2) ln (L^{2} π / 4 β_{0}), E_{eq} = 1 / 2 β_{0};

while

F_{eq}

depends on

β_{0}

and L, and

E_{eq}

depends only on

β_{0}

but not on L so that

E_{eq}

has the same value in the final EQ state. This means that the final equilibrium state has the same temperature

T_{0}

. This explains why we did not need to bring

{\tilde{Σ}}_{h}

in play for re-equilibration, as assumed above.

As we have discussed in reference to Equation (233) and concluded in Conclusions 11, 12, and 13, and summarized in Remark 71,

Δ E_{k} = -

Δ W_{k}

regardless of whether the process is irreversible or not. Below we will show by explicit calculation that we are dealing with an irreversible

\overset{˚}{P}

. The energy change

Δ E_{k}

for

m_{k}

is

Δ E_{k} = k^{2} (1 / L_{fin}^{2} - 1 / L_{in}^{2}) .

Let us determine the microwork done to take the microstate from the initial to the final state by using the internal pressure

P_{k} = - \partial E_{k} / \partial L = 2 E_{k} / L \neq 0

(347)

in

Δ W_{k} = \int_{L_{in}}^{L_{fin}} P_{k} d L .

(348)

It is easy to see that this microwork is precisely equal to

- Δ E_{k}

in accordance with Theorem 6, as expected. It is also evident from Equation (347) that for each L between

L_{in}

and

L_{fin}

,

P = \sum_{k} p_{k} P_{k} = 2 E / L \neq 0 .

We can use this average pressure to calculate the thermodynamic work

Δ W = \int_{L_{in}}^{L_{fin}} P d L = 2 \sum_{k} \int_{L_{in}}^{L_{fin}} p_{k} E_{k} d L / L \neq 0,

as expected. As

Δ E = 0

, this means that

Δ Q = Δ W \neq 0

, which really means

Δ_{i} Q = Δ_{i} W \neq 0

in this case. This establishes that the expansion we are studying is irreversible. This is also evident from the observation that

P \neq P_{0} = 0

.

Despite this,

Δ W_{k}

is always equal to the same

(- Δ E_{k})

regardless of the nature of irreversibility of

\overset{˚}{P}

, which is consistent with Conclusion 13 and Remark 71. The same

Δ W_{k}

will also apply to a reversible

P_{eq}

as we are considering the energy change between the same two states. The only difference is that now

Δ Q = Δ W \neq 0

will mean

Δ_{e} Q = Δ_{e} W \neq 0

. It is trivially seen that Equation (340a) is satisfied for all

\overset{˚}{P}

, not just the free expansion.

As

P_{0} = 0

, there is no difference between the exclusive Hamiltonian and the inclusive Hamiltonian. Thus, the discussion above is also valid for the inclusive Hamiltonian and Equation (344) with

E_{k} = E_{k}

and

F^{'} = F

.

16.2. Classical Free Expansion

We now consider the free expansion of an isolated classical gas in a vacuum (

P_{0} = 0

); see Figure 5. We set

V_{in} = V_{0}

and

V_{fin} = 2 V_{0}

for simplicity. The initial phase space is denoted by the interior of the solid red ellipse

Γ_{in}

on the left side in Figure 6. The final phase space is shown by the interior of the broken red ellipse on the left and the solid red ellipse

Γ_{fin}

on the right in Figure 6. The gas is in a “restricted (i.e., being confined in the left chamber)” equilibrium state with equilibrium microstate probability (with a slight notational change that we find convenient here) in

S_{X}

:

f_{0} (δ z_{0}) = e^{- β_{0} E (z_{0})} / Z_{in} (β_{0}, V_{in})

(349)

at

t = 0

; here, the initial partition function in the initial volume

V_{in}

is

Z_{in} (β_{0}, V_{in}) ≐ \sum_{δ z_{0} \in Γ_{in}} e^{- β_{0} E (z_{0})} .

(350)

We consider the set of microstates in the final phase space

Γ_{fin}

and pick two microstates

δ z_{0}

and

δ z

associated with

z_{0} \in Γ_{in}

and

z \in \bar{Γ} ≐ Γ_{fin} ∖ Γ_{in}

; here,

\bar{Γ} ≐ Γ_{fin} ∖ Γ_{in}

denotes the difference set of

Γ_{fin}

and

Γ_{in}

. We use the notation

{\bar{z}}_{0} ≐ (z_{0}, z)

to denote the two points. Let us identify

(z_{γ}, z_{γ}^{'})

as the unique 1-to-1 phase points obtained by the deterministic Hamiltonian evolution of

(z_{0}, z)

along the deterministic or mechanical trajectories

γ = γ (z_{0})

and

γ^{'} = γ^{'} (z)

corresponding to a given work protocol

\overset{˚}{P}

; see Figure 6. The probabilities of the two paths are irrelevant for the microworks

\begin{matrix} Δ W_{γ} (z_{0}) = - (E (z_{γ}) - E (z_{0})), \\ Δ W_{γ^{'}} (z) = - (E (z_{γ}^{'}) - E (z)); \end{matrix}

(351)

see Conclusions 11 and 12.

Figure 6. The evolution of a microstate

z_{0} \in Γ_{in}, z \in Γ_{fin} ∖ Γ_{in}

following microwork (green arrows) into

z_{γ}

and

z_{γ}^{'}

, respectively. The initial and final phase spaces are

Γ_{in}

and

Γ_{fin}

, shown by the interiors of the red ellipses.

Figure 6. The evolution of a microstate

z_{0} \in Γ_{in}, z \in Γ_{fin} ∖ Γ_{in}

following microwork (green arrows) into

z_{γ}

and

z_{γ}^{'}

, respectively. The initial and final phase spaces are

Γ_{in}

and

Γ_{fin}

, shown by the interiors of the red ellipses.

While the initial EQ probability distribution

f_{0} (δ z_{0})

is nonzero for

δ z_{0} \in Γ_{in}

, it is common to think of

f_{0} (δ z) = 0

for

z \in \bar{Γ}

. This is an ideal situation and requires taking the energy

E (z) = \infty

, but in reality,

f_{0} (δ z)

falls rapidly as we move into the right chamber away from the left one in the initial macrostate. Moreover, during free expansion,

f (δ z)

at

t > 0

is not going to remain zero. Therefore, we formally assume that the initial probability distribution

f_{0} (δ z)

is infinitesimally small by assigning to it a very large positive energy

E (z) = e (z) / ε > 0, z \in \bar{Γ} at t = 0

(352)

by introducing an infinitesimal positive quantity

ε

. At the end of the calculation, we will take the limit

ε \to 0^{+}

, which simply means

ε \to 0

from the positive site. Under this limit, the contribution from

e^{- β_{0} E (z)}

will vanish:

e^{- β_{0} E (z)} \overset{ε \to 0^{+}}{\to} 0 .

This allows us to recast the initial partition function as a sum over all microstates

\bar{z} \in Γ_{fin}

:

lim_{ε \to 0^{+}} Z_{in}^{'} (β_{0}, V_{fin}, ε) ≐ lim_{ε \to 0^{+}} \sum_{δ \bar{z} \in Γ_{fin}} e^{- β_{0} E (\bar{z})} = Z_{in} (β_{0}, V_{in});

(353)

Thus, we can focus on

Γ_{fin}

as the phase space to consider during any work protocol

\overset{˚}{P}

instead of

Γ_{in}

. This allows us to basically use a 1-to-1 mapping between initial microstates

{\bar{z}}_{0} ≐ (z_{0}, z)

and final microstates

{\bar{z}}_{γ} ≐ (z_{γ}, z_{γ}^{'})

discussed above.

We simply denote

z_{0}

or

z

by

\bar{z} \in Γ_{fin}

or

{\bar{z}}_{γ} \in Γ_{fin}

for the Hamiltonian evolution of

\bar{z}

along the microwork protocol from now on. We consider the average

W_{0} (β_{0}| Δ W_{k})

of the exponential work in Equation (340a) for the exclusive Hamiltonian and write it as

lim_{ε \to 0^{+}} {〈e^{β_{0} Δ W}〉}_{0} = lim_{ε \to 0^{+}} \frac{\sum_{δ \bar{z} \in Γ_{fin}} e^{- β_{0} E (\bar{z})} e^{- β_{0} [E ({\bar{z}}_{γ}) - E (\bar{z})]}}{Z_{in} (β_{0}, V_{fin}, ε)},

(354)

where we have used

Δ W_{γ} (\bar{z}) = - (E ({\bar{z}}_{γ}) - E (\bar{z}))

in accordance with Equation (351). Because of the 1-to-1 mapping to

{\bar{z}}_{γ}

, we can replace the sum with a sum over

{\bar{z}}_{γ}

, and at the same time cancel the initial energy

E (\bar{z})

in the exponent; the cancellation is exact even for

\bar{z} = z

for which

E (z) \to + \infty

in the limit

ε \to 0^{+}

. Because of this, the lim operation has no effect on the numerator. The partition function in the denominator reduces to

Z_{in} (β_{0}, V_{in})

as shown in Equation (353). We finally find

W_{0} (β_{0}| Δ W_{k}) = \frac{\sum_{δ z_{γ} \in Γ_{fin}} e^{- β_{0} E (z_{γ})}}{Z_{in} (β_{0}, V_{in})} = \frac{Z_{fin} (β_{0}, V_{fin})}{Z_{in} (β_{0}, V_{in})},

(355)

which is precisely what we wish to prove in Equation (340a).

The situation with the inclusive Hamiltonian is the same as

H^{'} = H

as before. This allows us to also prove Equation (344). Moreover, as said in the previous section, the demonstration of Equations (340a) and (344) is valid for any arbitrary process, not just the free expansion.

It should be emphasized that allowing for a negligible probability is a common practice even in EQ statistical mechanics where we evaluate the partition function by considering all microstates, regardless of how negligibly small the corresponding statistical weight is. This probability could even be zero. The only difference is that the microstate is defined over the volume of the system and not outside. We have allowed microstates in deriving Equations (340a) and (344) with vanishing small or zero probabilities. Here, we are considering microstates outside the volume of the system, but mathematically, there is no difference.

By allowing such microstates in

\bar{Γ}

, we have shown that Equations (340a) and (344) hold even for free expansion of a classical or quantum gas.

17. Brief Discussion and Summary

The present review is motivated by a desire to introduce a recently developed statistical mechanics (

μ

NEQT) as an extension of the EQ statistical mechanics to an NEQ body to a wider audience as the approach has been successfully applied to understand some common problems of interest at the microstate level, so it should useful in other applications. The development of the

μ

NEQT follows two distinct and independent stages. The first stage directly deals with deterministic mechanical evolution of microstates due to the Hamiltonian dynamics, which is then followed in the second stage by its stochastic modification. The division in the two stages is of central importance to the

μ

NEQT and the MNEQT. During the first stage, the second law has no meaning. This allows us to develop the

μ

NEQT by not even imposing the second law; see Remark 1. In the second step, the stochasticity is used to perform various ensemble averages using

\hat{A}

to obtain the MNEQT, in which the stability (see Axiom 4) requires thermodynamic force

Δ F

to vanish in EQ (76d). We show in Section 8.4 that the second law is a direct consequence of the stability requirement in the system, which allows us to impose the second law inequalities

d_{i} S \geq 0, d_{i} Q \geq 0

, and

d_{i} W \geq 0

in the MNEQT in conformity with the second law.

At the center of the

μ

NEQT is the above separation of mechanical and stochastic aspects of a statistical body, and it contains the following four important ingredients:

1. all averages are ensemble averages (

\hat{A}

) as temporal averages are not meaningful;

2. its use of an extended state space

S_{Z}

in which the NEQ macrostate

M

is uniquely identified so that the

μ

NEQT provides not only a straightforward extension of the well-established EQ statistical mechanics, but also of the concept of EQ ergodicity hypothesis;

3. the need to distinguish three different infinitesimals (

d_{α}

) to describe intrinsic, exchange, and internal (or irreversible) quantities in an NEQ process;

4. its use of fluctuating BI-microquantities that are either mechanical in that they are determined by the Hamiltonian of the body or stochastic in that they are governed by microstate probabilities that add the required statistical nature to the mechanical model of the body. The commutator

{\hat{C}}_{α} ≐ d_{α} \hat{A} - \hat{A} d_{α}

is at the root of stochasticity, with

{\hat{C}}_{α} E

denoting the various heats. In its absence, the body behaves purely mechanically.

The formulation of the

μ

NEQT is contingent on identifying the extended state space

S_{Z}

in terms of a set of internal variables that is dictated by the process under investigation, as discussed in Section 12. It should be emphasized that internal variables also appear in a purely mechanical body, with its Hamiltonian written as

H (W) ≐ H (w, ξ)

, as discussed in Section 4. The latter can be equivalently specified by the set of microstates

m_{k}

and their energies

E_{k} (W)

. As there is no stochasticity associated with

H (W)

, the temporal behavior of

ξ

, if any, must be periodic, as follows from Poincaré’s recurrence theorem [84,92,93]. However, stochasticity changes this behavior dramatically [221,222], and endows each of them with a certain relaxation time, whose interplay with the observational time scale

τ_{obs}

determines if a particular internal variable has equilibrated during

τ_{obs}

or not. By ordering the internal variables as in Equation (270a), we determine the window

Δ_{n} τ

introduced in Equation (270b) to eventually identify

S_{Z}

in which

M = \{m_{k}, p_{k}\}

is uniquely specified as

M_{ieq} = \{m_{k}, p_{k}^{ieq}\}

. The uniqueness issue is discussed in Section 12. The situation is not very different from the EQ statistical mechanics, the

μ

EQT. Therefore, it should not come as a surprise that the NEQ macrostate is identified as being in internal equilibrium, a concept that is an extension of the equilibrium. Because of this deep connection between the

μ

NEQT and the

μ

EQT, the basic axioms in the

μ

NEQT include all of the axioms of the

μ

EQT, except for the maximization of the entropy in

M_{eq}

that is part of Postulate II [3]. However, there are also additional axioms of quasi-independence and reduction that play important roles in formulating the

μ

NEQT. The former restricts the sizes of various sub-bodies to be at least as large as their correlation lengths for entropy additivity. The axiom of reduction allows the microquantities associated with any body to be reduced to microquantities associated with another body interacting with the former. However, we only consider reducing microquantities

{\tilde{q}}_{\tilde{k}}

and q

_{0 k_{0}}

associated with

\tilde{Σ}

and

Σ_{0}

, respectively, to

{\tilde{q}}_{k}

and q

_{0 k}

for

Σ

that is interacting with them.

In Section 8, we discuss the properties of the unique entropy

S_{ieq}

for

M_{ieq}

in

S_{Z}

, and discuss its approximate formulation as a flat distribution that is commonly used in EQ statistical mechanics. This distribution neglects any fluctuations in the entropy, which are always present in the body. Despite this, it correctly gives the entropy so it can always be used to determine it as it simplifies the calculation. We show that the entropy additivity requires quasi-independence in Section 8.1, so the latter should not be confused with the principle of additivity for

W

.

The goal of the present study is summarized in Section 1.1 and in Proposition 2. In particular, we have focused on and clarified in this study five important and new but not well-understood concepts of the

μ

NEQT that are also used extensively in the modern approach to fluctuation theorems in the

\overset{˚}{μ}

NEQT [26,158,159]. As many of these concepts are counter-intuitive and not well-understood, we have made the entire study as pedagogical as possible, as noted earlier in Section 1.3, to reach even an untrained reader by extensively exploiting examples that are taught at an undergraduate level to bring forth these concepts in as simple a way as possible. This has made the presentation lengthy. Some may find the presentation too simple and wordy, while others may need to go back and forth to grasp the concepts as they are inter-related and a challenge to old preconceived ideas. This is a risk we have taken and hope that the reader is going to be patient. Their existence has been well-known in the

\overset{˚}{M} NEQT

but not well-understood. This resulted in their applications at the microstate level generating much confusion in the

\overset{˚}{μ}

NEQT, sometimes because the distinction between concepts remained completely forgotten. This is the situation with the distinction between Fl

d W_{k}

and NFl

d {\tilde{W}}_{k} = d \tilde{W} = - d_{e} W

. The other one is the ubiquitous microforce imbalance (

μ

FI) such as the pressure fluctuation

Δ_{k} P = P_{k} - P

within the body that is present even in

μ

EQT (see the discussion below Equation (178)), but its relevance becomes apparent when considering its contribution to internal microenergy change

d_{i} E_{k}

. They remain an integral part of the

μ

NEQT, but are not included in the

\overset{˚}{μ}

NEQT, which only deals with exchange quantities.

We now briefly summarize and discuss some important aspects of the

μ

NEQT below.

1. Second Law and its Violation.An arbitrary stochasticity described by

\{p_{k}\}

in the second stage has nothing to do with the second law or the maximum entropy principle [3]. The latter will emerge only if

\{p_{k}\}

is constrained appropriately such as the flat distribution or the most probable distribution. For thermodynamics to be able to satisfy the maximum entropy principle, Callen [3] adopts it as part of his Postulate II, but it says nothing about the second law as the law of increase in entropy in Proposition 3. For that, we either postulate the second law as part of the axiomatic formalism or prove it. The second law in Equation (213) has not been included in our axiomatic formulation described in Section 5. We therefore need to prove it, which we do in Section 8.3 within this formulation by two independent methods. In a direct proof in Section 8.3, we count the number

W (t)

of distinct microstates that the system passes through in time to result in

M_{arb}

in

S_{Z}

. This number only continues to increase in time, but can never decrease; see Propositions 4 and 5. It is this feature that is responsible for the second law as seen from the Boltzmann principle

S (t) = ln W (t)

; see Equations (209) and (206b). This proof of Theorem 8 is for a general macrostate. The method of proof avoids the molecular chaos assumption of Boltzmann because of its several pitfalls, many of which Boltzmann seems to be completely unaware of, that are discussed in Section 8.3 and summarized in Claim 17. We provide another proof by showing that the second law is a direct consequence of the stability [4] (see Axiom 4) of the system in Section 8.4.

As the second law is not part of the

μ

NEQT, we can use the latter even if the law is violated in the violation thermodynamics

\overset{ˇ}{M}

NEQT by properly modifying the averaging in the second stage to obtain the inequalities in Equation (220). Thus, we are able to investigate the catastrophic consequences of violating the second law in Section 9; see Conclusion 7. From this, we conclude that a violation of the second law in the

\overset{ˇ}{M}

NEQT can only happen for an unstable system, which is not found in nature. All physical systems form stable systems, even though instabilities arise in approximate calculations such as van der Waals equations or mean field, but they are removed from consideration; see Remark 58. The only credible violation is the demon paradox of Maxwell [50] or its various variants, all of which have been shown to be consistent with the second law after careful consideration, as discussed in Section 8.3. All the so-called violations ([223], for example) have been observed to occur in stable systems so they must be caused due to incomplete or incorrect analyses, as they contradict Conclusion 7, the demon paradox being one of them. Because of this, we have always assumed that we are dealing with a stable system for which the law is always valid, as noted in Section 1.

2. Issue of Uniqueness and

S_{Z}

. Planck [224] seems to be the first one to suggest that the concept of entropy must be just as applicable to NEQ macrostates

M

as to EQ macrostates

M_{eq}

. He also advocated the same for the temperature for any

M

. Landau [225] seems to be the first one to successfully introduce an NEQ temperature. We have taken the dream of Planck seriously and have attempted to provide a methodology to introduce a unique NEQ entropy. The experimental setup that produces the macrostate

M (t)

of the body during the process

P

also dictates how to uniquely describe that macrostate, as discussed in Section 12, by identifying the particular window

Δ_{n} τ

introduced in Equation (270b). This then identifies the needed state space

S_{Z}

in which

M = \{m_{k}, p_{k}\}

becomes

M_{ieq} = \{m_{k}, p_{k}^{ieq}\}

. The setup also determines

W^{F}

and

W^{NF}

, so

p_{k}^{ieq}

’s are also uniquely determined in

S_{Z}

. Thus, the setup not only uniquely identifies

S_{Z}

but also dictates the complete statistical mechanics, the

μ

NEQT. The relaxation times change as the macrostate changes during

P

so the index n in

Δ_{n} τ

may also change even for a fixed observational time

τ_{obs}

, probably resulting in different state spaces during

P

, as discussed earlier. Despite this, as Remark 46 shows, we can continue to use the same state space

S_{Z}

over the entire process

P

by including the hidden entropy generation and irreversible macrowork discussed in Section 5.9, as need be. In the absence of hidden macroquantities, the thermodynamic entropy of

M_{ieq}

remains a state function in each of the state spaces along

P

, and has a unique value that is no different than the statistical entropy. The statistical formulation of entropy in Equation (116) generalizes Gibbs EQ entropy formulation [48] to any arbitrary macrostate

M

by including hidden macroquantities to justify Axiom 3, whose validity for any

M (t)

requires quasi-independence to make the entropy quasi-additive; see Remark 41.

There have been several attempts since Landau [225] to introduce NEQ temperature by several authors. It is not possible to list all of them here. So, we have selected a few of these attempts [13,18,23,226,227,228,229,230,231,232] to show how our approach is different from all of them, without casting any aspersions on those that are omitted. Our thermodynamic definition in Equation (1) refers to the entire body, so it is not local. The inhomogeneity of the body is captured by the presence of internal variables. Including them allows us to treat the body as a black-box with a unique temperature that obeys Clausius’s heat theorem that heat flows from hot to cold, as discussed earlier.

The identification of this thermodynamic definition has the following surprising consequence. For any arbitrary macrostate

M

, the Clausius equality

d Q = T d S

in Equation (45) (see Remark 49) and Theorem 4 always hold. These are the two most important aspects of the use of the BI-quantities in the formulation of the MNEQT, to which we now turn.

Before doing that, however, we make the following comment. By replacing

d S

for

M_{ieq} \in S_{Z}

by

d S

in Equations (138a) and (141), as the case may be, all results for

M_{ieq}

can be directly taken to be valid for

M_{nieq}

.

3. The Importance of BI-quantities. Thermodynamic quantities can be classified into SI- and MI-quantities, which are independent of each other, so that an SI-quantity can be equated only with another SI-quantity; the same is also true of MI-quantities. As emphasized here, the SI-quantities are directly related to the Hamiltonian of the system so they can be generalized to BI-quantities for a body

Σ_{b}

. Their use proves crucial in identifying the state space

S_{Z}

, which then uniquely determines the

μ

NEQT for any

M_{ieq} \in S_{Z}

as the corresponding

p_{k}^{ieq}

’s are uniquely determined in

S_{Z}

. In particular, they allow us to express the first law in the MNEQT (see Equation (93a)) in a form in which the generalized heat

d Q

, which is proportional to

d S

, and the generalized work

d W = - d E_{m}

, which is an isentropic change in the energy E due to work variable

W

, are BI-quantities (although they are process quantities), as is

d E

. This follows immediately and directly from the form

E = E (S, W)

, which follows from Theorem 10 for any body. As

d Q

and

d W

originate from independent variations of S and

W

, respectively, the two cannot be confused; see Conclusion 10. Their independence also simplifies the

μ

NEQT considerably. A consequence of this is the following simplification: We need not consider any effect of the microheat

d Q_{k}

while considering the microwork

d W_{k}

; see Conclusion 11. This is consistent with treating a microstate as a mechanical system during microwork for which we have the identity

Δ E_{k} = - Δ W_{k} ≐ - \int_{γ_{k}} d W_{k},

which is independent not only of

p_{k}

along

γ_{k}

but also

γ_{k}

. In other words,

Δ W_{k}

only depends on the terminal microstates

m_{in k}

and

m_{fin k}

that are the same for all processes between the same macrostates

M^{(in)}

and

M^{(fin)}

; see Remark 71. It is not a process microquantity. Thus,

Δ W_{k} = - Δ E_{k}

is a microstate function but is Fl. This shows the necessity of distinguishing process and Fl-NFl quantities. For example,

Δ_{e} W_{k}

is a NFl-process quantity. We should contrast this with E being a state function, which is NFl, as it is a macroquantity, but is not a process quantity; see Conclusion 13. We should also recall that

Δ W

(and

Δ Q

) is a process (macro)quantity. It also follows from the same remark that

Δ_{i} W_{k}

varies over

γ_{k}

, so it is a process microquantity because of the presence of

d_{e} W (P)

in the definition, but is Fl. Its average results in the dissipation

Δ_{i} W = \int_{γ} d_{i} W \geq 0

, which is also a process macroquantity.

As

Δ_{i} W_{k}

is Fl,

Δ_{i} W_{k} \neq 0

in almost all cases, so it must be so even in

M_{eq}

, even though

Δ_{i} W = 0

. It is clear from Proposition 2 that the presence of a nonzero force imbalance is necessary (but not sufficient) for dissipation in the system; see also Remark 32 and Conclusion 3. The force imbalance is what gives rise to thermodynamic forces, whose importance does not seem to have been acknowledged to date by scientists who consistently use the

\overset{˚}{μ}

NEQT, a hallmark of which is the conjecture

Δ {\tilde{W}}_{k} = Δ \tilde{W} = Δ E_{k}

; see Equation (7). This amounts to the unintentional consequence that

Δ_{i} W = 0

.

The ubiquitous existence of the

μ

FI

F_{t k}

, which immediately follows from Proposition 2, is one of the most surprising results of our approach, which appears almost counter-intuitive and has remained hitherto unrecognized in the field because of it. It is presumably so because it is well-known that

Δ_{i} E = \int_{γ} d_{i} E = 0

, which follows Equation (53a). Thus, allowing

Δ_{i} E_{k}

to be nonzero seems to contradict

Δ_{i} E = 0

. However, we have shown (see Claim 22) that even if

d_{i} E_{k} \neq 0

,

d_{i} E

always vanishes and so does

Δ_{i} E = 0

. The root cause of

Δ_{i} E_{k} \neq 0

or

Δ_{i} W_{k} \neq 0

is the ubiquitous nature of the FI

F_{t k}

. Thus, these three quantities are interrelated.

From the examples given in the review, there can be no doubt that the ubiquitous existence of

F_{t k}

is purely mechanical and does not require any thermodynamic consideration. This has been examined carefully in Section 6.4. However, its thermodynamic average

〈F_{t}〉

, known as the thermodynamic force, may or may not be zero. It may vanish even if

F_{t k}

is not identically zero. In this case, we are dealing with a reversible process. The temporal variation in

P

should be slow compared to

τ_{eq}

so that the system has enough time to equilibrate during the process. Indeed, is a well-known result from EQ statistical mechanics that the fluctuations in

F_{t k}

cannot be identically zero, except at absolute zero. Thus, even in a reversible process,

F_{t k}

is not identically zero for

\forall k

. For an NEQ process during which the temporal variation is not slow compared to

τ_{eq}

, the system does not have enough time to equilibrate, so

〈F_{t}〉 \neq 0

. Therefore, having a nonzero

F_{t k}

is necessary but not sufficient for irreversibility. However, its ubiquitous nature must be accounted for, as we do in the

μ

NEQT.

The above discussion was related to the microscopic work–energy relation, but the notion of microheat is just as different between the two microscopic NEQ thermodynamics; see Remark 70. We have mentioned that the microheat in the Langevin evolution proposed by Sekimoto [140] to obtain the first law for a microstate (a realization of the Langevin process) in the

\overset{˚}{μ}

NEQT is nothing but the irreversible microwork

d_{i} W_{k}

in the

μ

NEQT cast as the exchange microheat

d_{e} Q_{k}

, which makes Sekimoto’s stochastic energetics very different from that in the

μ

NEQT; see Remark 61. Crooks [141] also follows the same identification for the exchange heat. The microwork in the

μ

NEQT is isentropic so no heat exchange with the heat bath that Sekimoto includes will change the microenergy

E_{k}

; the heat exchange only affects

p_{k}

. It appears that the two workers are really considering the energy change

d_{α} {\bar{E}}_{k}

and not

d_{α} E_{k}

(see Equation (243)), but

{\bar{E}}_{k}

is not a genuine microquantity; rather, it is a mixed microquantity, as discussed in Remark 60.

4. NFl-exchange quantities. Assuming quasi-additivity and quasi-independence, both commonly accepted in the field, we have proved (see Theorem 7) that quantities

{\tilde{q}}_{k} = \tilde{q}

for

\forall k

so that

{\tilde{q}}_{k}

is NFl, a surprising and novel result despite

{\tilde{q}}_{\tilde{k}}

being Fl. Its significance has not been appreciated to date by workers in microscopic NEQT. To appreciate this fact, we consider some exchange quantity

q \in θ

, for which we have

d {\tilde{q}}_{k} = d \tilde{q} = d_{e} \tilde{q} = - d_{e} q for \forall k;

see Equation (193c) over some infinitesimal process

d P

between two neighboring macrostates; see Notation 3. As

\{d {\tilde{q}}_{k}\}

and

\{p_{k}\}

are independent,

\{d {\tilde{q}}_{k}\}

is the same for all

d P

’s between the same two neighboring macrostates, and so is

d \tilde{q} = d_{e} \tilde{q} .

As a consequence, the exchange quantity

d_{e} q

is also the same for all such

d P

’s. It is determined only averaging over all microstates of the medium so it is a genuine MI-macroquantity. Thus, it is easily determined by knowing the properties of the medium that is in EQ. This is a well-known fact of classical thermodynamics, and explains why the

\overset{˚}{μ}

NEQT is so easy to implement. Therefore, it is surprising that the above fact has not been appreciated in the

\overset{˚}{μ}

NEQT including stochastic thermodynamics. Unfortunately, because of Theorem 7, a proper application of the

\overset{˚}{μ}

NEQT cannot capture any statistical fluctuations unless

d_{e} {\tilde{q}}_{k}

is improperly treated as a Fl-quantity.

5. Heat-Work Equivalence. As soon as

S_{Z}

has been identified in terms of BI-quantities specified by the nature of the process

d P

, the problem of a unique statistical mechanical description of

d P

is completely solved in that

p_{k}^{ieq}

are uniquely specified in

S_{Z}

; see Equation (275). This then uniquely specifies

M = M_{ieq}

at each instant along

P

. The identification of

M_{ieq}

is only possible because of the use of BI-quantities that properly capture fluctuations in a statistical body. Their usage justifies the version of the first law (see Equation (93a)) in terms of generalized macrowork

d W

and macroheat

d Q

that refer to the body; the former is an isentropic quantity, while the latter is an entropic quantity being directly related to entropy change. Therefore, they can be varied independently, which means that there is no constraint on

d E

in general. As a consequence, there cannot be any equivalence between them. These macroquantities differ from their exchange counterparts

d_{e} W

and macroheat

d_{e} Q

by their irreversible counterparts

d_{i} W \equiv d_{i} Q = (T - T_{0}) d_{e} S + T d_{i} S \geq 0;

(356a)

see Equation (95). It is a very important consequence in the MNEQT due to

d_{i} E = 0

as a general rule. Thus, there equivalence is a general rule in the MNEQT, and it provides not only a theoretical support for the well-known conclusion by Count Rumford [165] about the so-called equivalence of the irreversible macrowork and macroheat (see the discussion just above Equation (97)) but also generalizes it, so it clarifies its significance due to

d_{i} E = 0

. Indeed, Count Rumford had taken precautions to ensure no macroheat exchange with the medium, so his observation was for irreversible macroquantities. In his experiment, the first term on the right side vanishes and we obtain

d_{i} W \equiv d_{i} Q = T d_{i} S \geq 0,

(356b)

a well-known result, also known as the Gouy-Stodola theorem, in classical thermodynamics for the dissipated work; see for example [33,233,234]. Comparing with Equation (356a) derived in the MNEQT, it becomes clear that the above theorem is valid only when the system and the medium have the same temperature to ensure no macroheat exchange, similar to the conditions imposed by Count Rumford. But his observations leave out the situation of a possible heat exchange, so it is not clear what is meant by macroheat converting into macrowork in his statement. Thus, Equation (356a) extends the theorem to a more general situation, where the meanings of

d_{i} W

and

d_{i} Q

are clear in the MNEQT.

Moreover, the above equivalence is also extended in the

μ

NEQT between internal microwork

d_{i} W_{k}

and microheat

d_{i} Q_{k}

, which has not been hitherto recognized. What is remarkable about the equality is that it relates a purely mechanical quantity

d_{i} W_{k} = - d_{i} E_{k}

with a purely stochastic quantity

d_{i} Q_{k} = - (T - T_{0}) {\hat{η}}_{k} d_{e} η_{k} - T {\hat{η}}_{k} d_{i} η_{k},

which is easily derivable from Equation (256b). This is what makes the

μ

NEQT so useful, and a promising alternative to widely used current approaches [10,12,13,17,18,19,20,21,24,25,26,27,28,99,135,136,137,138,139,140,141,142,143,144,145,146,147] that are primarily based on the nonfluctuating exchange quantities as remarked above.

6. Work–Energy Theorem.Microwork

d_{α} W_{k}

in the

μ

NEQT is purely mechanical in that it is not influenced by

p_{k}

, while microheat

d_{α} Q_{k}

is stochastic in that it is determined by

d_{α} p_{k}

. Thus,

d_{α} W_{k}

and

d_{α} Q_{k}

originate from different sources. From the Work–Energy Theorem 6, we have

d_{α} E_{k} = - d_{α} W_{k}

. As

E_{k}

for any body is a function of

W

only, there is no

d Q_{k}

in

d E_{k}

. A comparison with the first law

d_{α} E = d_{α} Q - d_{α} W

, Equation (91) in Remark 30, clearly shows that there is no analog of this law for a microstate in the

μ

NEQT. This fact should not be confused with Equation (243), which deals with

{\bar{E}}_{k}

and not with

E_{k}

or with Equation (281); the latter refers to the microstate energy fluctuation within the body

Δ_{k} E = E_{k} - E

over its microstates. The physical implication of this first-law-looking Equation (281) has been discussed in Section 12.2, and merely reflects the fact that the BI-combination

G_{Z k}^{ieq} (T, W)

in Equation (277) is NFl over

m_{k}

, but that there are no exchange analogs of the two terms on the right side of Equation (281), and has nothing to do with any first law for

m_{k}

as summarized in Conclusion 61. In contrast, there is an analog of the microscopic first law in the

\overset{˚}{μ}

NEQT; see Remark 70.

Before we end the review, we wish to briefly point out some of the major differences between the

μ

NEQT based on the SI-quantities and other current theories that are formulated in terms of the MI-quantities representing exchanges with the medium [99,135,136,137,138,139,140,141,142,143,144,145,146,147]; see also Section 1.2. Because of the use of exchange quantities, they all belong to the

\overset{˚}{μ}

NEQT.

The use of the SI-quantities in the $μ$ NEQT allows us to uniquely identify all SI-macrofields such as the unique NEQ SI-temperature T of a body; see Equation (129). But this is not possible in the $\overset{˚}{μ}$ NEQT, where it has been defined in several ways, not all different for any $M_{neq}$ . This issue has been discussed elsewhere [76,77].
The use of SI-quantities $[d q]$ in the $μ$ NEQT has the following important consequence. It can be directly applied to an isolated system for which $[d_{e} q_{0}] \equiv 0$ so that $[d q_{0}] \equiv [d_{i} q_{0}]$ captures the contributions from all internal processes unambiguously. But $[d q]$ is not even defined in the $\overset{˚}{μ}$ NEQT, except for a state variable q, so knowing $[d_{e} q]$ does not allow for determining $[d_{i} q]$ directly and unambiguously. They are determined indirectly. As an example, the lost macrowork due to irreversibility in classical thermodynamics (also belonging to the $\overset{˚}{μ}$ NEQT) is defined as ${\overset{˚}{d}}_{lost} W = {\overset{˚}{d}}_{rev} W - {\overset{˚}{d}}_{irr} W \geq 0$ , where various ${\overset{˚}{d}}_{rev} W$ and ${\overset{˚}{d}}_{irr} W$ refer to the exchange macroworks along two distinct processes: a reversible and an irreversible. It is easy to see that ${\overset{˚}{d}}_{lost} W$ is precisely the irreversible macrowork $d_{i} W$ , which is determined by the actual process. While $d_{i} Q$ is defined in the $μ$ NEQT, ${\overset{˚}{d}}_{lost} Q$ is never defined in the $\overset{˚}{μ}$ NEQT.
In the $μ$ NEQT, the exchange microwork $d_{e} W_{k}$ is NFl as $d_{e} W_{k} = d_{e} W, \forall k$ . In contrast, $d_{e} W_{k} = - d_{e} E_{k}$ in accordance with the conjecture in Equation (7) is Fl in the $\overset{˚}{μ}$ NEQT.
In the $μ$ NEQT, due to the use of SI-microquantities $\{d q_{k}\}$ that are by nature Fl, the fluctuations are incorporated in this statistical mechanics. In contrast, $\{d_{e} q_{k}\}$ are NFL, some of which, such as $d_{e} E_{k} = - d_{e} W_{k} = - d_{e} W$ and $d_{e} S_{k} = d_{e} S$ , are also used in the $\overset{˚}{μ}$ NEQT. Therefore, additional justification is required to capture fluctuations in the $\overset{˚}{μ}$ NEQT. The most common justification is to use the conjecture in Equation (7) that equates $d_{e} W_{k}$ with ( $- d E_{k}$ ) to make it Fl; see the discussion of Equation (345). The conjecture seems to have a wider usage including stochastic and quantum thermodynamics [99,135,136,137,138,139,140,141,142,143,144,145,146,147], which all use the $\overset{˚}{μ}$ NEQT; see Remarks 61 and 70.
Microstate probabilities $\{p_{k}\}$ are uniquely determined in the $μ$ NEQT because of the use of SI-microquantities. For example, the macroheats in the $μ$ NEQT are ensemble averages over microstates with $\{p_{k}\}$ as in Equations (236) and (239). We do not need to invoke any master equation or the Fokker–Planck equation to determine them. As $\{p_{k}\}$ cannot be uniquely determined in the $\overset{˚}{μ}$ NEQT, a master equation or a Fokker–Planck equation is required to determine them. For example, the use of a master equation allows the identification of exchange macroheat in terms of transitions between microstates [235].
The use of SI-quantities allows for the introduction of partition functions in the $μ$ NEQT but cannot be defined in the $\overset{˚}{μ}$ NEQT.
There is no analog of the first law for a microstate in the $μ$ NEQT. However, there is such an analog in the $\overset{˚}{μ}$ NEQT proposed by Sekimoto [146].

A major open problem in the

μ

NEQT is to provide a strong justification for Proposition 1 to ensure that the

μ

NEQT is applicable to any arbitrary macrostate

M

. At present, it is merely a proposition, although a very convincing one. According to this proposition, any arbitrary macrostate

M

can be always identified as

M_{ieq}

with no explicit time dependence in an appropriate state space

S_{Z^{'}}

. In a smaller state space

S_{Z} \subset S_{Z^{'}}

,

M

will have hidden entropy generation

d_{i} S^{hid} (t)

(see Equation (139a)) due to this explicit time dependence, which puts a very strong limitation on the possible explicit time dependence that it must give rise to

d_{i} S^{hid} (t)

, as discussed in Section 5.9. It is only this restricted form of explicit time dependence in

M

or

\{p_{k}\}

in the

μ

NEQT that remains consistent with the second law. Therefore, it will be interesting to investigate if any arbitrary form of explicit time dependence in

M

or

\{p_{k}\}

can be shown to satisfy the second law.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

Acronyms
BI, MI, SI	body-, medium-, system-intrinsic
EQ, EQT	equilibrium, equilibrium thermodynamics
FI, $μ$ FI	force imbalance, microforce imbalance
Fl, NFl	fluctuating, nonfluctuating
IEQ, NEQ, NIEQ	internal EQ, non EQ, non IEQ
MNEQT, $\overset{˚}{M} NEQT$	macroscopic NEQT, macroscopic (with exchanges) NEQT
$μ$ NEQT, $\overset{˚}{μ}$ NEQT	microscopic NEQT, microscopic (with exchanges) NEQT
NEQ, NEQT	nonequilibrium, nonequilibrium thermodynamics
More Often Used Symbols
$m_{k}, M$	body-, micro-, macro state
$[q], [d_{α} q]$	Notation 2
$[X]$ , $[Z]$ , $[ζ], [χ]$	body’s micro-macro state variable
$[w]$ , $[W]$	body’s work parameter
$d_{α} θ$	Notation 1
$[q], [d_{α} q]$	Notation 2
$[F_{w}], [f_{w}]$	Claim 3, Section 2.9
$d W_{k}$	Claim 6, Definition 17
$η_{k}, {\hat{η}}_{k}, d_{α} η_{k}$	Equation (27b), Equation (27c), Equation (87b)
$d Q_{k}, d Q$	Equation (44a), Equation (44b)
$Δ F_{w k}, Δ F_{w}$	Equation (76a), Equation (76c)
$[(d_{e} w, d ξ), d_{i} w]$	Equation (76b)

References

Caratheodory, C. Untersuchungen über die Grundlagen der Thermodynamik. Math. Ann. 1909, 67, 355. [Google Scholar] [CrossRef]
Landsberg, P. Thermodynamics; Interscience: New York, NY, USA, 1961. [Google Scholar]
Callen, H.B. Thermodynamics and an Introduction to Thermostatistics, 2nd ed.; John Wiley & Sons: New York, NY, USA, 1985. [Google Scholar]
We discuss in Section 8.4 that the second law is a direct consequence of the stability of the system. Because of this, there is no need to adopt the law as an additional axiom.
Magie, W.F. (Ed.) The Second Law of Thermodynamics: Memoirs by Carnot, Clausius, and Thomson; Harper and Brothers Publishers: New York, NY, USA, 1899. [Google Scholar]
Kestin, J. The Second law of thermodynamics; Dowden, Hutchinson & Ross: Stroudsburg, PA, USA, 1976. [Google Scholar]
Grandy, W.T. Entropy and the Time Evolution of Macroscopic Systems; Oxford University Press: Oxford, UK, 2008. [Google Scholar]
Schrodinger, E. What Is Life? Cambridge University Press: Cambridge, UK, 1944. [Google Scholar]
Schneider, E.D.; Kay, J.J. Life as a Manifestation of the Second Law of Thermodynamics. Math. Comput. Model. 1994, 19, 25–48. [Google Scholar] [CrossRef]
De Donder, T.; Rysselberghe, P.V. Thermodynamic Theory of Affinity: A Book of Principles; Oxford University Press: Oxford, UK, 1936. [Google Scholar]
Becker, R. Theory of Heat, 2nd ed.; Revised Leibfried, G., Ed.; Springer: New York, NY, USA, 1967. [Google Scholar]
Prigogine, I. Thermodynamics of Irreversible Processes; Wiley-Interscience: New York, NY, USA, 1971. [Google Scholar]
de Groot, S.R.; Mazur, P. Nonequilibrium Thermodynamics, 1st ed.; Dover: New York, NY, USA, 1984. [Google Scholar]
Muschik, W. Aspects of Nonequilibrium Thermodynamics; World Scientific: Singapore, 1990. [Google Scholar]
Muschik, W.; Brunk, G. A concept of non-equilibrium temperature. Int. J. Engng Sci. 1977, 15, 377–389. [Google Scholar] [CrossRef]
Muschik, W.; Papenfuss, C.; Ehrentraut, H. A sketch of continuum thermodynamics. J. Non-Newton. Fluid Mech. 2001, 96, 255–290. [Google Scholar] [CrossRef]
Eu, B.G. Kinetic Theory and Irreversible Thermodynamics; John Wiley: New York, NY, USA, 1992. [Google Scholar]
Maugin, G.A. The Thermodynamics of Nonlinear Irreversible Behaviors: An Introduction; World Scientific: Singapore, 1999. [Google Scholar]
Eu, B.C. Kinetic Theory of Nonequilibrium Ensembles, Irreversible Thermodynamics, and Generalized Hydrodynamics; Springer International Publishing: Cham, Switzerland, 2016; Volume 1. [Google Scholar]
Kuiken, G.D.C. Thermodynamics of Irreversible Processes; John Wiley: Chichester, UK, 1994. [Google Scholar]
Jou, D.; Casas-Vázquez, J.; Lebon, G. Extended Irreversible Thermodynamics; Springer: Berlin, Germany, 1996. [Google Scholar]
Førland, K.S.; Førland, T.; Kjelstrup, S. Irreversible Thermodynamics: Theory and Application, 3rd ed.; Tapir: Trondheim, Norway, 2001. [Google Scholar]
Casas-Vázquez, J.; Jou, D. Temperature in non-equilibrium states: A review of open problems and current proposals. Rep. Prog. Phys. 2003, 66, 1937. [Google Scholar] [CrossRef]
Ottinger, H.C. Beyond Equilibrium Thermodynamics; Wiley: Hoboken, NJ, USA, 2005. [Google Scholar]
Kjelstrum, S.; Bedeaux, D. Nonequilibrium Thermodynamics of Heterogeneous Systems; World-Scientific: Singapore, 2008. [Google Scholar]
Evans, D.J.; Morriss, G. Statistical Mechanics of Nonequilibrium Liquids, 2nd ed.; Cambridge University Press: Cambridge, UK, 2008. [Google Scholar]
Koleva, M.K. Boundedness and Self-Organized Semantics: Theory and Applications; Information Science Ref.: Hershey, PA, USA, 2012. [Google Scholar]
Pokrovskii, V.N. Thermodynamics of Complex Systems; IOP Publishing Ltd.: Bristol, UK, 2020. [Google Scholar]
Muschik, W. Why so many “schools” of thermodynamics? Forsch Ingenieurwes 2007, 71, 149. [Google Scholar] [CrossRef]
Clausius, R. Über die Wärmeleitung gasförmiger Körper. Ann. Phys. 1862, 115, 1–57. [Google Scholar] [CrossRef]
Clausius, R. The Mechanical Theory of Heat; Macmillan and Co.: London, UK, 1879. [Google Scholar]
Thompson, C.J. Mathematical Statistical Mechanics; Princeton University: Princeton, NJ, USA, 1979. [Google Scholar]
Landau, L.D.; Lifshitz, E.M. Statistical Physics, 3rd ed.; Pergamon Press: Oxford, UK, 1986; Volume 1. [Google Scholar]
Huang, K. Statistical Mechanics, 2nd ed.; John Wiley and Sons: New York, NY, USA, 1987. [Google Scholar]
Hoover, W.G. Computational Statistical Mechanics; Elsevier: Amsterdam, The Netherlands, 1991. [Google Scholar]
Sethna, J.P. Statistical Mechanics: Entropy, Order Parameters and Complexity; Oxford University Press: New York, NY, USA, 2006. [Google Scholar]
Kardar, M. Statistical Physics of Particles; Cambridge University Press: Cambridge, UK, 2007. [Google Scholar]
Kardar, M. Statistical Physics of Fields; Cambridge University Press: Cambridge, UK, 2007. [Google Scholar]
Fermi, E. Thermodynamics; Dover: New York, NY, USA, 1956. [Google Scholar]
Reif, F. Fundamentals of Statistical and Thermal Physics; McGraw-Hill, Inc.: New York, NY, USA, 1965. [Google Scholar]
Woods, L.C. The Thermodynamics of Fluids Systems; Oxford University Press: Oxford, UK, 1975. [Google Scholar]
Kestin, J. A Course in Thermodynamics; Volumes 1 & 2, Revised Printing; McGraw-Hill Book Company: New York, NY, USA, 1979. [Google Scholar]
Waldram, J.R. The Theory of Thermodynamics; Cambridge University: Cambridge, UK, 1985. [Google Scholar]
Balian, R. From Microphysics to Macrophysics; Springer: Berlin, Germany, 1991; Volume 1. [Google Scholar]
ter Haar, D. Foundations of Statistical Mechanics. Rev. Mod. Phys. 1953, 27, 289–338. [Google Scholar] [CrossRef]
Boltzmann, L. Lectures on Gas Theory; University of California Press: Berkeley, CA, USA, 1964. [Google Scholar]
Boltzmann, L. On the relation between the second law of thermodynamics and the probability calculations of the principles of thermal equilibrium. Wien Ber. 1877, 76, 373–435. [Google Scholar]
Gibbs, J.W. Elementary Principles in Statistical Mechanics; Charles Scribner’s Sons: New York, NY, USA, 1902. [Google Scholar]
Maxwell, J.C. On the Dynamical Evidence of the Molecular Constitution of Bodies. J. Chem. Soc. 1875, 28, 493. [Google Scholar] [CrossRef]
Maxwell, J.C. Theory of Heat; Longmans, Green, and Co.: London, UK, 1902. [Google Scholar]
Kondepudi, D.; Prigogine, I. Modern Thermodynamics; John Wiley and Sons: West Sussex, UK, 1998. [Google Scholar]
Evans, D.J.; Cohen, E.G.D.; Morriss, G.P. Probability of second law violations in shearing steady states. Phys. Rev. Lett. 1993, 71, 2401. [Google Scholar] [CrossRef]
Searls, D.J.; Evans, D.J. Fluctuations Relations for Nonequilibrium Systems. Aust. J. Chem. 2004, 57, 1119. [Google Scholar] [CrossRef]
Tolman, R.C. The Principles of Statistical Mechanics; Oxford University: London, UK, 1959. [Google Scholar]
Rice, S.A.; Gray, P. The Statistical Mechanics of Simple Liquids; John Wiley & Sons: New York, NY, USA, 1965. [Google Scholar]
Brush, S.G. Kinetic-Theory; Irreversible-Processes, Pergamon Press: Oxford, UK, 1966; Volume 2. [Google Scholar]
Rice, O.K. Statistical Mechanics, Thermodynamics and Kinetics; W.H. Freeman: San Francisco, CA, USA, 1967. [Google Scholar]
van Kampen, N.G. Stochastic Processes in Physics and Chemistry, 3rd ed.; Elsevier: Amsterdam, The Netherlands, 2007. [Google Scholar]
Landau, L.D.; Lifshitz, E.M. Quantum Mechanics, 3rd ed.; Pergamon Press: Oxford, UK, 1977. [Google Scholar]
von Neumann, J. Mathematical Foundations of Quantum Mechanics; Princeton University Press: Princeton, NJ, USA, 1996. [Google Scholar]
Partovi, M.H. Entropic Formulation of Uncertainty for Quantum Measurements. Phys. Rev. Lett. 1983, 50, 1883. [Google Scholar] [CrossRef]
Bekenstein, J.D. Black Holes and Entropy. Phys. Rev. D 1973, 7, 2333. [Google Scholar] [CrossRef]
Bekenstein, J.D. Statistical black-hole thermodynamics. Phys. Rev. D 1975, 12, 3077. [Google Scholar] [CrossRef]
Hawking, S.W. Particle Creation by Black Holes. Commun. Math. Phys. 1975, 43, 199. [Google Scholar] [CrossRef]
Schumacher, B. Quantum coding. Phys. Rev. A 1995, 51, 2738. [Google Scholar] [CrossRef]
Landauer, R. Irreversibility and Heat Generation in the Computing Process. IBM J. Res. Dev. 1961, 5, 183. [Google Scholar] [CrossRef]
Bennet, C.H. The thermodynamics of computation—A review. Int. J. Theor. Phys. 1982, 21, 905. [Google Scholar] [CrossRef]
Szilard, L. On the decrease of entropy in a thermodynamic system by the intervention of intelligent beings. Z. Phys. 1929, 53, 840. [Google Scholar] [CrossRef]
Brillouin, L. Maxwell’s Demon Cannot Operate: Information and Entropy. I. J. Appl. Phys. 1951, 22, 334–337. [Google Scholar] [CrossRef]
Leff, H.S.; Rex, A.F. (Eds.) Maxwell’s Demon 2: Entropy, Classical and Quantum Information, Computing; CRT Press: Boca Raton, FL, USA, 2018. [Google Scholar]
Wiener, N. Cybernetics, or Control and Communication in the Animal and the Machine; John Wiley and Sons: New York, NY, USA, 1948. [Google Scholar]
Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379, 623–656. [Google Scholar] [CrossRef]
Lieb, E.H.; Yngvason, J. The physics and mathematics of the second law of thermodynamics. Phys. Rep. 1999, 310, 1. [Google Scholar] [CrossRef]
Ruelle, D. What physical quantities make sense in nonequilibrium statistical mechanics. In Boltzmann’s Legacy; Gallavotti, G., Reiter, W.L., Yngvason, Eds.; European Mathematical Society: Zürich, Switzerland, 2008. [Google Scholar]
Gujrati, P.D. Nonequilibrium Entropy. arXiv 2013, arXiv:1304.3768. [Google Scholar]
Gujrati, P.D. On Equivalence of Nonequilibrium Thermodynamic and Statistical Entropies. Entropy 2015, 17, 710. [Google Scholar] [CrossRef]
Gujrati, P.D. A Review of the System-Intrinsic Nonequilibrium Thermodynamics in Extended Space (MNEQT) with Applications. Entropy 2021, 23, 1584. [Google Scholar] [CrossRef]
Gujrati, P.D. Nonequilibrium Entropy in an Extended State Space. In Frontiers in Entropy across the Disciplines; Freeden, W., Nashed, M.Z., Eds.; World Scientific: Hackensack, NJ, USA, 2022. [Google Scholar]
Gujrati, P.D. Loss of Temporal Homogeneity and Symmetry in Statistical Systems: Deterministic Versus Stochastic Dynamics. Symmetry 2010, 2, 1201. [Google Scholar] [CrossRef]
ter Haar, D.; Green, C.D. The statistical aspect of Bolzmann’s H-Theorem. Proc. Phys. Soc. A 1953, 66, 153–159. [Google Scholar] [CrossRef]
Kröning, A. Grundziuge einer Theorie der Gase. Ann. Phys. 1856, 99, 315–322. [Google Scholar] [CrossRef]
Loschmidt, J. Sitznngsberichte der Akademie der Wissenschaften. Wien Ber. 1876, 73, 128–142. [Google Scholar]
Burbury, S.H. Boltzmann’s minimum function. Nature 1895, 52, 104–105. [Google Scholar] [CrossRef]
Poincaré, H. Sur le problème des trois corps et les équations de la dynamique. Acta Math. 1890, 13, 1–270. [Google Scholar]
Zermelo, E. On a Theorem of Dynamics and the Mechanical Theory of Heat. Ann. Physik 1896, 57, 485–494. [Google Scholar] [CrossRef]
Zermelo, E. On the Mechanical explanation of Irreversible Processes. Ann. Physik 1896, 59, 793–801. [Google Scholar] [CrossRef]
Boltzmann, L. Reply to Zermelo’s Remarks on the Theory of Heat. Ann. Phys. 1896, 57, 773–784. [Google Scholar] [CrossRef]
Boltzmann, L. On Zermelo’s Paper: On the Mechanical Explanation of Irreversible Processes. Ann. Phys. 1897, 60, 392–398. [Google Scholar] [CrossRef]
von Smoluchowski, M. Experimetell nachweisbare der ublichen Thermodynamik widersprechende Molekularphanomene. Physik. Z. 1912, 13, 1069. [Google Scholar]
Sklar, L. Physics and Chance; Cambridge University Press: Cambridge, UK, 1993. [Google Scholar]
Lebowitz, J.L. Statistical mechanics: A selective review of two central issues. Rev. Mod. Phys. 1999, 71, S346–S357. [Google Scholar] [CrossRef]
Gujrati, P.D. Poincare Recurrence, Zermelo’s Second Law Paradox, and Probabilistic Origin in Statistical Mechanics. arXiv 2008, arXiv:0803.0983v1. [Google Scholar]
Gujrati, P.D. Irreversibility, Molecular Chaos, and A Simple Proof of the Second Law. arXiv 2008, arXiv:0803.1099. [Google Scholar]
Fernando, P. Lack of Molecular Chaos and the Role of Stochasticity in Kac’s Ring Model. Master’s Thesis, The University of Akron, Akron, OH, USA, 2009. [Google Scholar]
Gautam, M. The Role of Walls’ Stochastic Forces in Statistical Mechanics: Phenomenon of Time Irreversibility. Master’s Thesis, The University of Akron, Akron, OH, USA, 2009. [Google Scholar]
Myrvold, W.C. Philosophical Issues in Thermal Physics. Oxf. Res. Encycl. Phys. 2022. [Google Scholar] [CrossRef]
Kac, M. Some remarks on the use of probability in classical statistical mechanics. Bull. Acad. R. Belg. 1956, 42, 356–361. [Google Scholar] [CrossRef]
Henin, F. Entropy, dynamics and molecular chaos, Kac’s model. Physica 1974, 77, 220–246. [Google Scholar] [CrossRef]
Keizer, J. Statistical Thermodynamics of Nonequilibrium Processes; Springer: New York, NY, USA, 1987. [Google Scholar]
Stratonovich, R.L. Nonlinear Nonequilibrium Thermodynamics I; Springer: Berlin, Germany, 1992. [Google Scholar]
Schuss, Z. Theory and Applications of Stochastic Processes: An Analytical Approach; Springer: New York, NY, USA, 2010. [Google Scholar]
Coffey, W.T.; Kalmykov, Y.P. The Langevin Equation, 4th ed.; World Scientific: Singapore, 2017. [Google Scholar]
Bauman, R.P. Work of Compressing an Ideal Gas. J. Chem. Educ. 1964, 41, 102–104. [Google Scholar] [CrossRef]
Bauman, R.P. Maximum Work Revisited (letter). J. Chem. Educ. 1964, 41, 676–677. [Google Scholar] [CrossRef]
Kivelson, D.; Oppenheim, I. Work in Irreversible Expansions. J. Chem. Educ. 1966, 43, 233–235. [Google Scholar] [CrossRef]
Bertrand, G.L. Thermodynamic Calculation of Work for Some Irreversible Processes. J. Chem. Educ. 2005, 82, 874–877. [Google Scholar] [CrossRef]
Gislason, E.A.; Craig, N.C. Pressure–Volume Integral Expressions for Work in Irreversible Processes. J. Chem. Educ. 2007, 84, 499. [Google Scholar] [CrossRef]
Coleman, B.D. Thermodynamics with Internal State Variables. J. Chem. Phys. 1967, 47, 597. [Google Scholar] [CrossRef]
The reader should pause to guess about our motivation to italicize container. During the process of expansion or ontraction or at any other time, the gas molecules are always experiencing the walls of the container. Later, we will see that the presence of walls becomes a central concept for breaking the temporal symmetry. Their presence gives rise to stochastic boundary conditions for the collisions of gas particles with the walls in any of its possible states; see Section 7. These collisions are not described by unique or deterministic potentials that are part of the Hamiltonian of the system, and destroy the temporal symmetry, just like the presence of walls destroys the homogeneity of space.
When we consider many particles, it is convenient to introduce the concept of a phase space Γ(x) in which a point x represents the collections of particles’ coordinates and momenta. Thus, each point in the phase space represents a state of the system. A microstate of the system is represented not by a point, but by a volume element h^3N, where h is Planck’s constant; see Definition 4 for more details.
This assumption simplifies the present discussion using deterministic dynamics as there is no internal deformation, which can cause dissipation and irreversibility that will be carefully treated later. Even the collisions are deterministic in such a system.
de Hemptinne, X. Nonequilibrium Statistical Thermodynamics; World Scientific: Singapore, 1992. [Google Scholar]
We are not considering weak interactions where this symmetry is not exact.
Giri, N.C. Introduction to Probability and Statistics, 2nd ed.; Marcel Dekker, Inc.: New York, NY, USA, 1993. [Google Scholar]
Chandrasekhar, S. Brownian Motion, Dynamical Friction, and Stellar Dynamics. Rev. Mod. Phys. 1949, 21, 383. [Google Scholar] [CrossRef]
Ehrenfest, P.; Ehrenfest, T. The Conceptual Foundations of the Statistical Approach in Mechanics; Translated by Moravcsik, M.; Cornell University Press: Ithaca, NY, USA, 1959. [Google Scholar]
Daub, E.E. Probability and thermodynamics: The reduction of the second law. Isis 1969, 60, 318–330. [Google Scholar] [CrossRef]
Davies, P.C.W. The Physics of Time Asymmetry; University of California Press: Berkeley, CA, USA, 1977. [Google Scholar]
Hawking, S.W. Arrow of time in cosmology. Phys. Rev. D 1985, 32, 2489–2495. [Google Scholar] [CrossRef] [PubMed]
Coveney, P.V. The second law of thermodynamics: Entropy, irreversibility and dynamics. Nature 1988, 333, 409–415. [Google Scholar] [CrossRef]
Zak, M. Irreversibility in Thermodynamics. Int. J. Theor. Phys. 1996, 35, 347–382. [Google Scholar] [CrossRef]
Price, H. Time’s Arrow and Archimedes’ Point: New Directions for the Physics of Time; Oxford University Press: New York, NY, USA, 1996. [Google Scholar]
Uffink, J. Irreversibility and the second law of thermodynamics. Entropy 2003, 121–146. [Google Scholar]
Price, H. The thermodynamic arrow: Puzzles and pseudo-puzzles. arXiv 2004, arXiv:physics/0402040. [Google Scholar]
Prigogine, I.; Grecos, A.; George, C. On the relation of dynamics to statistical mechanics. Celes. Mech. 1977, 16, 489–507. [Google Scholar] [CrossRef]
Poincaré, H. Mathematics and Science: Last Essays; Dover Publications, Inc.: New York, NY, USA, 1963. [Google Scholar]
A system with unique trajectories requiring an invertible one-to-one mapping in Equation (4) is what we call a deterministic system in this work. A Hamiltonian system is deterministic in this sense.
Landau, L. Das Daempfungsproblem in der Wellenmechanik. Z. Phys. 1927, 45, 430–464. [Google Scholar] [CrossRef]
Von Neumann, J. Mathematische Grundlagen der Quantenmechanik; Springer: Berlin, Germany, 1932; ISBN 3-540-59207-5. [Google Scholar]
Boltzmann, L. Über die mechanische Bedeutung des Zweiten Hauptsatzes der... Wärmegleichgewicht. Wien. Ber. 1877, 76, 373–435. [Google Scholar]
Boltzmann, L. Lectures On Gas Theory; (Translated from the original German by Stephen G. Brush); Dover Publications: New York, NY, USA, 1964. [Google Scholar]
Einstein, A. Investigations on the Theory of the Brownian Movement. Ann. Phys. 1905, 17, 549. [Google Scholar] [CrossRef]
Langevin, P. Sur la théorie du mouvement brownien. C. R. Acad. Sci. 1908, 146, 530. [Google Scholar]
Gujrati, P.D. Nonequilibrium thermodynamics: Structural relaxation, fictive temperature, and Tool-Narayanaswamy phenomenology in glasses. Phys. Rev. E 2010, 81, 051130. [Google Scholar] [CrossRef] [PubMed]
Spohn, H.; Lebowitz, J.L. Irreversible Thermodynamics for Quantum Systems Weakly Coupled to Thermal Reservoirs. Adv. Chem. Phys. 1978, 38, 109. [Google Scholar]
Bochkov, G.N.; Kuzovlev, Y.E. General theory of thermal fluctuations in nonlinear systems. Sov. Phys. JETP 1977, 45, 125. [Google Scholar]
Bochkov, G.N.; Kuzovlev, Y.E. Fluctuation-dissipation relations for nonequilibrium processes in open systems. Sov. Phys. JETP 1979, 49, 543. [Google Scholar]
Alicki, R. The quantum open system as a model of the heat Engine. J. Phys. A 1979, 12, L103. [Google Scholar] [CrossRef]
Gallavotti, G.; Cohen, E.G.D. Dynamical Ensembles in Nonequilibrium Statistical Mechanics. Phys. Rev. Lett. 1995, 74, 2694. [Google Scholar] [CrossRef]
Sekimoto, K. Kinetic Characterization of Heat Bath and the Energetics of Thermal Ratchet Models. J. Phys. Soc. Japan 1997, 66, 1234. [Google Scholar] [CrossRef]
Crooks, G.E. Entropy production fluctuation theorem and the nonequilibrium work relation for free energy differences. Phys. Rev. E 1999, 60, 2721. [Google Scholar] [CrossRef]
Jarzynski, C. Nonequilibrium Equality for Free Energy Differences. Phys. Rev. Lett. 1997, 78, 2690. [Google Scholar] [CrossRef]
Jarzynski, C. Equilibrium free-energy differences from nonequilibrium measurements: A master-equation approach. Phys. Rev. E 1997, 56, 5018. [Google Scholar] [CrossRef]
Jarzynski, C. Comparison of far-from-equilibrium work relations. C. R. Phys. 2007, 8, 495. [Google Scholar] [CrossRef]
Seifert, U. Entropy Production along a Stochastic Trajectory and an Integral Fluctuation Theorem. Phy. Rev. Lett. 2005, 95, 040602. [Google Scholar] [CrossRef]
Sekimoto, K. Stochastic Energetics; Springer: Berlin, Germany, 2010. [Google Scholar]
Pitaevskii, L.P. Rigorous results of nonequilibrium statistical physics and their experimental verification. Phys.-Uspekhi 2011, 54, 625. [Google Scholar] [CrossRef]
Gujrati, P.D. Nonequilibrium thermodynamics. II. Application to inhomogeneous systems. Phys. Rev. E 2012, 85, 041128. [Google Scholar] [CrossRef]
Gujrati, P.D.; Aung, P.P. Nonequilibrium thermodynamics. III. Generalization of Maxwell, Clausius-Clapeyron, and response function relations, and the Prigogine-Defay ratio for systems in internal equilibrium. Phys. Rev. E 2012, 85, 041129. [Google Scholar] [CrossRef]
Gujrati, P.D. Nonequilibrium Work and its Hamiltonian Connection for a Microstate in Nonequilibrium Statistical Thermodynamics: A Case of Mistaken Identity. arXiv 2017, arXiv:1702.00455. [Google Scholar]
Gujrati, P.D. Correcting the Mistaken Identification of Nonequilibrium Microscopic Work. arXiv 2018, arXiv:1808.04725. [Google Scholar]
Gujrati, P.D. Generalized Non-equilibrium Heat and Work and the Fate of the Clausius Inequality. arXiv 2011, arXiv:1105.5549. [Google Scholar]
Gujrati, P.D. Nonequilibrium Thermodynamics. Symmetric and Unique Formulation of the First Law, Statistical Definition of Heat and Work, Adiabatic Theorem and the Fate of the Clausius Inequality: A Microscopic View. arXiv 2012, arXiv:1206.0702. [Google Scholar]
Gujrati, P.D. A Novel Trick to Overcome the Phase Space Volume Change and the Use of Hamiltonian Trajectories with an emphasis on the Free Expansion. arXiv 2021, arXiv:2102.06122. [Google Scholar]
Gujrati, P.D. Jarzynski Equality and its Special Trajectory Ensemble Average Demystified. arXiv 2018, arXiv:1802.08084. [Google Scholar]
Gujrati, P.D. Jensen inequality and the second law. Phys. Lett. A 2020, 384, 126460. [Google Scholar] [CrossRef]
Gujrati, P.D. First-principles nonequilibrium deterministic equation of motion of a Brownian particle and microscopic viscous drag. Phys. Rev. E 2020, 102, 012140. [Google Scholar] [CrossRef]
Kurchan, J. Non-equilibrium work relations. J. Stat. Mech. 2007, P07005. [Google Scholar] [CrossRef]
Evans, D.J.; Searles, D.J. The fluctuation theorem. Adv. Phys. 2002, 51, 1529. [Google Scholar] [CrossRef]
Gujrati, P.D. Hierarchy of Relaxation Times and Residual Entropy: A Nonequilibrium Approach. Entropy 2018, 20, 149. [Google Scholar] [CrossRef]
Capek, V.; Sheehan, D. Challenges to the Second Law of Thermodynamics. Theory and Experiment; Springer: Berlin, Germany, 2005. [Google Scholar]
A truly isolated system is really an idealization and will not correctly represent a physical system, as noted in the previous footnote. For a correct representation, the description requires a probabilistic approach, which follows from the loss of temporal inhomogeneity; see the discussion leading to Equation (6).
The division in cells is to ensure that the number of microstates does not become infinite even for a finite system (finite N, E and V)
Landau, L.D.; Lifshitz, E.M. Mechanics, 3rd ed.; Pergamon Press: Oxford, UK, 1976. [Google Scholar]
Thompson, B. An Inquiry concerning the Source of the Heat which is excited by Fricton. Philos. Trans. 1798, 18, 286. [Google Scholar]
Bouchbinder, E.; Langer, J.S. Nonequilibrium thermodynamics of driven amorphous materials. I. Internal degrees of freedom and volume deformation. Phys. Rev. E 2009, 80, 031131. [Google Scholar] [CrossRef]
Pokrovskii, V.N. A Derivation of the Main Relations of Nonequilibrium Thermodynamics. ISRN Thermodyn. 2013, 906136. [Google Scholar] [CrossRef]
Vilar, J.M.G.; Rubi, J.M. Thermodynamics “beyond” local Equilibrium. Proc. Natl. Acad. Sci. USA 2001, 98, 11081. [Google Scholar] [CrossRef]
Davies, R.O.; Jones, G.O. Thermodynamic and kinetic properties of glasses. Adv. Phys. 1953, 2, 370. [Google Scholar] [CrossRef]
The Glass Transition and the Nature of the Glassy State; Goldstein, M.; Simha, R. (Eds.) N.Y. Academy of Sciences: New York, NY, USA, 1976. [Google Scholar]
Edwards, S.F.; Grinev, D.V. Granular materials: Towards the statistical mechanics of jammed configurations (Review). Adv. Phys. 2002, 51, 1669. [Google Scholar] [CrossRef]
Gutzow, I.S.; Schmelzer, J.W.P. The Vitreous State: Thermodynamics, Structure, Rheology, and Crystallization, 2nd ed.; Springer: Berlin, Germany, 2013. [Google Scholar]
Nemilov, S.V. Thermodynamic and Kinetic Aspects of the Vitreous State; CRC Press: Boca Raton, FL, USA, 2018. [Google Scholar]
Jaynes, E.T. Papers on Probability, Statistics and Statistical Physics; Resenkrantz, R.D., Ed.; Reidel Publishing: Dordrecht, Holland, 1983. [Google Scholar]
Jaynes, E.T. Information Theory and Statistical Mechanics. Phys. Rev. 1957, 106, 620. [Google Scholar] [CrossRef]
Gujrati, P.D. Where is the residual entropy of a glass hiding? arXiv 2009, arXiv:0908.1075. [Google Scholar]
We are assuming that there is only one species of stable particles, whose number N is an observable, and is held fixed to fix the size of ∑. We can list N in X if we keep another observable such as V fixed to fix the size of the system. Here, we will keep N fixed for the size. If there are several species k = 1,2,⋯, r of particles that undergo l distinct chemical reactions among themselves, then the individual numbers N_k, k ∈ {1,2,⋯, r} of the species are not constant, only their total N remains constant. In this case, we need distinct l′ = l − 1 extents of reaction [13,51] as internal variables in Z as has been discussed later. If the species do not undergo chemical reactions among themselves, then N_k’s are individually observables. In this case, we can choose l0 independent numbers that are contained in X. In this review, we only consider a single species for simplicity.
Seifert, U. Stochastic thermodynamics, fluctuation theorems and molecular machines. Rep. Prog. Phys. 2012, 75, 126001. [Google Scholar] [CrossRef]
Maruyama, K.; Nori, F.; Vedral, V. The physics of Maxwell’s demon and information. Rev. Mod. Phys. 2009, 81, 1. [Google Scholar] [CrossRef]
Cohen, E.G.D.; Mauzerall, D. A note on the Jarzynski equality. J. Stat. Mech. 2004, P07006. [Google Scholar] [CrossRef]
Cohen, E.G.D.; Mauzerall, D. The Jarzynski equality and the Boltzmann factor. Mol. Phys. 2005, 103, 2923. [Google Scholar] [CrossRef]
Jarzynski, C. Nonequilibrium work theorem for a system strongly coupled to a thermal environment. J. Stat. Mech. 2004, P09005. [Google Scholar] [CrossRef]
Sung, J. Validity condition of the Jarzynski relation for a classical mechanical system. arXiv 2005, arXiv:cond-mat/0506214v4. [Google Scholar]
Gross, D.H.E. Flaw of Jarzynski’s equality when applied to systems with several degrees of freedom. arXiv 2005, arXiv:condmat/0508721v1. [Google Scholar]
Peliti, L. On the work–Hamiltonian connection in manipulated Systems. J. Stat. Mech. 2008, P05002. [Google Scholar] [CrossRef]
Vilar, J.M.G.; Rubi, J.M. Failure of the Work-Hamiltonian Connection for Free-Energy Calculations. Phys. Rev. Lett. 2008, 101, 020601. [Google Scholar] [CrossRef] [PubMed]
Horowitz, J.; Jarzynski, C. Comment on “Failure of the Work-Hamiltonian Connection for Free-Energy Calculations”. Phys. Rev. Lett. 2008, 101, 098901. [Google Scholar] [CrossRef]
Vilar, J.M.G.; Rubi, J.M. Vilar and Rubi Reply. Phys. Rev. Lett. 2008, 101, 098902. [Google Scholar] [CrossRef]
Peliti, L. Comment on “Failure of the Work-Hamiltonian Connection for Free-Energy Calculations”. Phys. Rev. Lett. 2008, 100, 098903. [Google Scholar] [CrossRef]
Van den Broeck, C.; Esposito, M. Ensemble and trajectory thermodynamics: A brief introduction. Physica A 2015, 418, 6. [Google Scholar] [CrossRef]
Sung, J. Breakdown of the Jarzynski relation for an adiabatic stretching of an isotropic spring. arXiv 2005, arXiv:cond-mat/0510119. [Google Scholar]
Jarzynski, C. Reply to comments by D.H.E. Gross. arXiv 2005, arXiv:cond-mat/0509344v1. [Google Scholar]
Vilar, J.M.G.; Rubi, J.M. Vilar and Rubi Reply. Phys. Rev. Lett. 2008, 100, 098904. [Google Scholar] [CrossRef]
See Equation (7.9) in [33], which shows how this entropy formulation emerges in statistical physics. It is applicable to both EQ and NEQ macrostates as is clear from Section 40 (see Equation (40.7) in particular) dealing with NEQ ideal gas.
As the system is no longer isolated because of its interaction with the environment, E, N, V need not remain constant and may fluctuate. However, as long as we are dealing with very weak environmental noise, we can safely treat the system as quasi-isolated in that the widths of their spread can be neglected.
Whether the entire universe satisfies the second law is an unsettled problem at present. To verify it requires making measurement of some sort on different parts of an ever-expanding universe at the same instant. It is not clear whether it is possible to send signals to distant receding parts of our expanding universe to be able to make this measurement; most of these parts are probably causally disconnected from us. The idea of an isolated system is based on an exterior from which it is isolated. To test the isolation, we need to perform some sort of test from outside the isolated system. We need to know if we live in a universe or a multiverse. Also, is there a physical boundary to our universe isolating it from outside? By physical, we mean it to be composed of matter and energy. What is outside this boundary, and how can we test or know what is outside, while remaining inside the isolated universe? If there is a physical boundary, does it contain all the matter and energy within it or is there energy outside it? Are dark matter and dark energy confined within this boundary or do they also exist outside it? If it is vacuum outside, does it have any vacuum energy, which is then absorbed by the expanding universe? At present, we do not know answers to these questions. It is highly likely that there is no physical boundary to the universe that we can detect. Everything that we observe is causally connected to us and lies within the universe. Therefore, we cannot see its boundary, which is causally disconnected from us. For all practical purposes, the universe appears to be “unbounded” to us. The only sensible thing we can speak of is a part (within the causally connected observable universe) of the universe, finite in extent within this “unbounded” universe. The surrounding medium of the observable universe and the 3K radiation generate stochasticity and ensure that the observable universe satisfies the second law. In our opinion, causally disconnected parts of the universe have no bearing on the second law. Therefore, we will not worry about this issue here.
This is impossible at least due to the presence of the remanent 3 K radiation from the big bang that permeates the entire universe. We will neglect this radiation and other thermal radiation from the walls and other external bodies when we consider a deterministic dynamics. They will become an integral part of the discussion when we deal with stochastic dynamics.
Einstein, A. Uber einen die Erzeugung und Verwandlung des Lichtes betreffenden heuristischen Gesichtspunkt. Ann. Der Phys. 1905, 17, 132. [Google Scholar] [CrossRef]
Prigogine, I. The Boltzmann Equation, Theory and Applications; Cohen, E.G.D., Thirring, W., Eds.; Springer: Vienna, Austria, 1973; pp. 401–450. [Google Scholar]
Lanford, O.E. On a Derivation of the Boltzmann Equation, in Nonequilibrium Phenomena 1: The Boltzmann Equation; Lebowitz, J.L., Montroll, E.W., Eds.; North-Holland: Amsterdam, The Netherlands, 1983. [Google Scholar]
Kac, M. Probability and Related Topics in Physical Sciences; Interscience Publishers: London, UK, 1959. [Google Scholar]
Henin, F.; Prigogine, I. Entropy, Dynamics, and Molecular Chaos. Proc. Nat. Acad. Sci. USA 1974, 71, 2618–2622. [Google Scholar] [CrossRef] [PubMed]
Evans, D.J.; Searles, D. Causality, response theory, and the second law of thermodynamics. Phys. Rev. 1996, 53, 5808. [Google Scholar] [CrossRef] [PubMed]
Gujrati, P.D. Maxwell’s Demon must remain sebservient to Clausius’s statement. arXiv 2021, arXiv:2112.12300v2. [Google Scholar]
Gujrati, P.D. Maxwell’s Conjecture of the Demon creating a Temperature Difference is False. arXiv 2022, arXiv:2205.02313v2. [Google Scholar]
Kostic, M.M. The Second Law and Entropy Misconceptions Demystified. Entropy 2020, 22, 648. [Google Scholar] [CrossRef]
Earman, J.; Norton, J.D. Exorcist XIV: The Wrath of Maxwell’s Demon. Part I. From Maxwell to Szilard. Stud. Hist. Philos. Sci. B Stud. Hist. Philos. Mod. Phys. 1998, 29, 435. [Google Scholar] [CrossRef]
Liphardt, J.; Dumont, S.; Smith, S.; Tinoco, I.; Bustamante, C. Equilibrium information from nonequilibrium measurements in an experimental test of Jarzynski’s equality. Science 2002, 296, 1833. [Google Scholar] [CrossRef]
Sung, J. Reply to Note on cond-mat/0510270: Jarzynski equation for adiabatically stretched rotor. arXiv 2005, arXiv:condmat/0512250. [Google Scholar]
Sung, J. Application range of Jarzynski’s equation for boundary-switching processes. Phys. Rev. E 2008, 77, 042101. [Google Scholar] [CrossRef] [PubMed]
Bier, M. Note on cond-mat/0510119: Jarzynski equation for adiabatically stretched rotor. arXiv 2005, arXiv:cond-mat/0510270. [Google Scholar]
Sung, J. Theoretical test of Jarzynski’s equality for reversible volume-switching processes of an ideal gas system. Phys. Rev. E 2007, 76, 012101. [Google Scholar] [CrossRef] [PubMed]
Nieuwenhuizen, T.M. Thermodynamics of the Glassy State: Effective Temperature as an Additional System Parameter. Phys. Rev. Lett. 1998, 80, 5580. [Google Scholar] [CrossRef]
Allahverdyan, A.E.; Nieuwenhuizen, T.M. Steady adiabatic state: Its thermodynamics, entropy production, energy dissipation, and violation of Onsager relations. Phys. Rev. E 2000, 62, 845. [Google Scholar] [CrossRef]
Amotz, D.B.; Honig, J.M. Rectification of thermodynamic inequalities. J. Chem. Phys. 2003, 118, 5932. [Google Scholar] [CrossRef]
Amotz, D.B.; Honig, J.M. Average Entropy Dissipation in Irreversible Mesoscopic Processes. Phys. Rev. Lett. 2006, 96, 020602. [Google Scholar] [CrossRef]
Honig, J.M. Thermodynamics, 4th ed.; Academic Press: Oxford, UK, 2014. [Google Scholar]
Bizarro, J.P.S. Entropy production in irreversible processes with friction. Phys. Rev. E 2008, 78, 021137, Erratum: Entropy production in irreversible processes with friction. Phys. Rev. E 2008, 78, 059903. [Google Scholar] [CrossRef]
Gujrati, P.D. Iakov Boyko and Tyler Johnson, Determination of Nonequilibrium Temperature and Pressure using Clausius Equality in a State with Memory: A Simple Model Calculation. arXiv 2015, arXiv:1512.08744. [Google Scholar]
Bender, C.M.; Brody, D.C.; Meister, B.K. Quantum mechanical Carnot engine. J. Phys. A 2000, 33, 4427. [Google Scholar] [CrossRef]
Indeed, for a macroscopic system, the probability to come back to a previously generated microstate will be almost negligible.
It is found that for even glasses, the entire phase space with 2^N microstates is broken into disjoint components, so that the initial microstate in a given component evolves into microstates belonging to this component alone; no microstates from other components occur in the evolution. Again, the probaility of recurrence in each component will be almost negligible.
Lee, J.W. Energy Renewal: Isothermal Use of Environmental Heat Energy with Asymmetric Structures. Entropy 2021, 23, 665. [Google Scholar] [CrossRef]
Planck, M. Über die mechanische Boeutung der Temperatur und der Entropie. In Festschrift Ludwig Boltzmann; J.A. Barth: Leipzig, Germany, 1904; p. 113. [Google Scholar]
Landau, L.D. Kinetic equation for the Coulomb effect. Zh. Eksp. Teor. Fiz. 1937, 7, 203, reprinted in Collected papers of L.D. Landau; ter Haar, D., Ed.; Gordon and Breach: New York, NY, USA, 1965; p. 169. [Google Scholar]
Ramsey, N.F. Thermodynamics and Statistical Mechanics at Negative Absolute Temperatures. Phys. Rev. 1956, 103, 20. [Google Scholar] [CrossRef]
Coleman, B.D. Thermodynamics of materials with memory. Arch. Rat. Mech. Anal. 1964, 17, 1–64. [Google Scholar] [CrossRef]
Keizer, J. Heat, work, and the thermodynamic temperature at nonequilibrium steady states. J. Chem. Phys. 1985, 82, 2751. [Google Scholar] [CrossRef]
Eu, B.C.; Garcia-Colin, L.S. Irreversible processes and Temperature. Phys. Rev. E 1996, 54, 2501. [Google Scholar] [CrossRef] [PubMed]
Morris, G.P.; Rondoni, L. Definition of temperature in equilibrium and nonequilibrium systems. Phys. Rev. E 1999, 59, R5. [Google Scholar] [CrossRef]
Hoover, W.G.; Hoover, C.G. Nonequilibrium temperature and thermometry in heat-conducting ϕ⁴ models. Phys. Rev. E 2008, 77, 041104. [Google Scholar] [CrossRef]
Lucia, U.; Grisolia, G. Nonequilibrium Temperature: An Approach from Irreversibility. Materials 2021, 14, 2004. [Google Scholar] [CrossRef]
Bejan, A. Applied Engineering Thermodynamics, 3rd ed.; John Wiley: New York, NY, USA, 2006. [Google Scholar]
Bejan, A. Entropy generation minimization: The new thermodynamics of finite-size devices and finite-time processes. J. Appl. Phys. 1996, 79, 1191. [Google Scholar] [CrossRef]
Crooks, G.E. Nonequilibrium Measurements of Free Energy Differences for Microscopically Reversible Markovian Systems. J. Stat. Phys. 1998, 90, 1481. [Google Scholar] [CrossRef]

Figure 1. (a) An isolated nonequilibrium system

Σ_{0}

with internally generated

d_{i} Z

driving it towards equilibrium, during which its SI-fields

T (t), P (t), \dots, A (t)

continue to change to their equilibrium values;

d_{i} Z_{k}

denote the microanalog of

d_{i} Z

. The sign of

d_{i} Z

is determined by the second law. (b) A nonequilibrium systen

Σ

in a surrounding medium

\tilde{Σ}

, both forming the isolated system

Σ_{0}

. The macrostates of the medium and the system are characterized by their fields

T_{0}, P_{0}, . . ., A_{0} = 0

and

T (t), P (t), \dots, A (t)

, respectively, which are different when the two are out of equilibrium. Exchange quantities (

d_{e} Z

) carry a suffix “e” and irreversibly generated quantities (

d_{i} Z

) within the system are denoted by a suffix “i” by extending the Prigogine notation. Their sum

d_{e} Z + d_{i} Z

is denoted by

d Z

, which is a system-intrinsic quantity (see text). In a nonequilibrium system, the nonzero differences

F_{t}^{h} = T_{0} - T

and

Δ F_{w} = (P - P_{0}, \dots, A)

denote the set of thermodynamic forces, where we have also included the affinity

A

for internal variables

ξ

; see text. A microstate

m_{k}

of

Σ

is specified by appending a subscript k to

Δ F_{w}

so that

Δ F_{w k} = (P_{k} - P_{0}, \dots, A_{k})

, as explained in the text.

Figure 1. (a) An isolated nonequilibrium system

Σ_{0}

with internally generated

d_{i} Z

driving it towards equilibrium, during which its SI-fields

T (t), P (t), \dots, A (t)

continue to change to their equilibrium values;

d_{i} Z_{k}

denote the microanalog of

d_{i} Z

. The sign of

d_{i} Z

is determined by the second law. (b) A nonequilibrium systen

Σ

in a surrounding medium

\tilde{Σ}

, both forming the isolated system

Σ_{0}

. The macrostates of the medium and the system are characterized by their fields

T_{0}, P_{0}, . . ., A_{0} = 0

and

T (t), P (t), \dots, A (t)

, respectively, which are different when the two are out of equilibrium. Exchange quantities (

d_{e} Z

) carry a suffix “e” and irreversibly generated quantities (

d_{i} Z

) within the system are denoted by a suffix “i” by extending the Prigogine notation. Their sum

d_{e} Z + d_{i} Z

is denoted by

d Z

, which is a system-intrinsic quantity (see text). In a nonequilibrium system, the nonzero differences

F_{t}^{h} = T_{0} - T

and

Δ F_{w} = (P - P_{0}, \dots, A)

denote the set of thermodynamic forces, where we have also included the affinity

A

for internal variables

ξ

; see text. A microstate

m_{k}

of

Σ

is specified by appending a subscript k to

Δ F_{w}

so that

Δ F_{w k} = (P_{k} - P_{0}, \dots, A_{k})

, as explained in the text.

Figure 2. A system driven between two sources that are different in their fields; see Figure 1. If they are the same, the situation reduces to that in Figure 1a.

Figure 3. We schematically show a system of (a) gas in a cylinder with a movable piston under an external pressure

P_{0}

controlling the volume V of the gas, and (b) a particle attached to a spring in a fluid being pulled by an external force

F_{0}

, which causes the spring to stretch or compress depending on its direction. In an irreversible process, the internal pressure P (the spring force

F_{s}

) is different in magnitude from the external pressure

P_{0}

(external force

F_{0}

).

Figure 3. We schematically show a system of (a) gas in a cylinder with a movable piston under an external pressure

P_{0}

controlling the volume V of the gas, and (b) a particle attached to a spring in a fluid being pulled by an external force

F_{0}

, which causes the spring to stretch or compress depending on its direction. In an irreversible process, the internal pressure P (the spring force

F_{s}

) is different in magnitude from the external pressure

P_{0}

(external force

F_{0}

).

Figure 4. Schematic behavior of

S (t)

as a function of time t. Starting at O (

t = 0

), OA and OB show the symmetric growth of

S (t)

in future and under time reversal at

t = 0

. If we reverse time later at

t = t_{0} + t^{'}

by setting

t^{'} \to - t^{'},

then O

_{0}

C shows the growth of the entropy above its value

S (t_{0})

at

t = t_{0}

; the entropy does not retrace O

_{0}

O, as would be required by time-reversal invariance.

Figure 4. Schematic behavior of

S (t)

as a function of time t. Starting at O (

t = 0

), OA and OB show the symmetric growth of

S (t)

in future and under time reversal at

t = 0

. If we reverse time later at

t = t_{0} + t^{'}

by setting

t^{'} \to - t^{'},

then O

_{0}

C shows the growth of the entropy above its value

S (t_{0})

at

t = t_{0}

; the entropy does not retrace O

_{0}

O, as would be required by time-reversal invariance.

Figure 5. Free expansion of a gas. The gas is confined to the left chamber, which is separated by a partition (shown by a solid black vertical line) from the vacuum as shown in (a). At time

t = 0

, the partition is removed abruptly as shown by the broken line in its original place in (b). The gas expands in the empty space on the right but the expansion is gradual as shown by the solid front, which separates it from the vacuum on its right.

Figure 5. Free expansion of a gas. The gas is confined to the left chamber, which is separated by a partition (shown by a solid black vertical line) from the vacuum as shown in (a). At time

t = 0

, the partition is removed abruptly as shown by the broken line in its original place in (b). The gas expands in the empty space on the right but the expansion is gradual as shown by the solid front, which separates it from the vacuum on its right.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gujrati, P.D. Foundations of Nonequilibrium Statistical Mechanics in Extended State Space. Foundations 2023, 3, 419-548. https://doi.org/10.3390/foundations3030030

AMA Style

Gujrati PD. Foundations of Nonequilibrium Statistical Mechanics in Extended State Space. Foundations. 2023; 3(3):419-548. https://doi.org/10.3390/foundations3030030

Chicago/Turabian Style

Gujrati, Purushottam Das. 2023. "Foundations of Nonequilibrium Statistical Mechanics in Extended State Space" Foundations 3, no. 3: 419-548. https://doi.org/10.3390/foundations3030030

APA Style

Gujrati, P. D. (2023). Foundations of Nonequilibrium Statistical Mechanics in Extended State Space. Foundations, 3(3), 419-548. https://doi.org/10.3390/foundations3030030

Article Menu

Foundations of Nonequilibrium Statistical Mechanics in Extended State Space

Abstract

1. Introduction

1.1. Scope of the Review

1.2. System-Intrinsic and Medium-Intrinsic Thermodynamics

1.3. Main Results

1.4. Layout

2. Notation, Definitions and New Concepts

2.1. Systems and State Variables

2.2. Microstates and Macrostates

2.3. Micro–Macro Variables

2.4. Random Variable and Average

2.5. Different States in NEQT

2.6. Mechanical Description

2.7. Entropy and Stochastic Description

2.8. Reduction

2.9. Process Quantities

2.10. Σ 0 (Isolated Body) and Σ ˜ (Medium)

3. Mathematical Digression on d α

3.1. Generalizing d ≡ d e + d i

3.2. Consequences of Theorem 60

3.3. Some Simple Examples

3.4. Manipulations with d α

4. Internal Variables

5. Fundamentals of the μ NEQT

5.1. Fundamental Axiom

5.2. Parameter Description

5.3. Ensemble of Replicas

5.4. Concept of Probability

5.5. Statistical Entropy for M ( t )

5.6. Principle of Additivity

5.6.1. Additivity

5.6.2. Quasi-Additivity

5.7. Σ in Internal EQ (IEQ)

5.8. Gibbs Fundamental Relations for M ieq ( Z ) in S Z and S ζ

5.9. Time-Dependent Gibbs Fundamental Relations for M nieq ( Z ) in S Z

5.10. Consequences of the Second Law

5.11. Assumptions

5.11.1. N Fixed for Σ

5.11.2. Σ ˜ Always in EQ

6. Mechanical Aspects

6.1. Microstate Evolution in S Z

6.2. SI-Microwork in S Z

6.3. SI-Legendre Transform

6.4. Mechanical Force Imbalance (FI)

6.5. Work–Energy Principle

7. Stochastic Aspects

7.1. Origin of Stochasticity

7.2. Process of Reduction

7.3. Quasi-Independence

7.4. Reduction

7.5. Reduction under Quasi-Independence for m k

7.6. Clarifying Examples

8. Properties of Entropy for M ( t )

8.1. System in a Medium and Quasi-Independence

8.2. Second Law Postulate of NEQ Entropy S

8.3. A Proof of the Second Law

8.4. Second Law as a Consequence of Stability

9. Devastations Caused by Second Law Violation

9.1. Macroheat Exchanges

9.1.1. E l Monotonically Increases with T l

9.1.2. E l Monotonically Decreases with T l

9.2. Macrowork Exchanges

9.2.1. V l Monotonically Decreases with P l

9.2.2. V l Monotonically Increases with P l

10. Microworks, Microheats, and Commutator

10.1. Digression on Ensemble Averages

10.2. Statistical Significance of d W and d Q

10.3. Medium Σ ˜

11. External and Internal Variations of dp k ( t )

Proof of d i E = 0 Even If d i E k ’s Are Not

12. Extended State Space, M ieq and M nieq

12.1. Choice of Z for M ieq in S Z

12.2. Microstate Probabilities for M ieq : NFl- W

12.3. Lagrange Multiplier Method: NFl W

12.4. Extensivity Method

12.5. Fluctuating W k

12.6. M nieq ( Z ) and Its Microstate Probabilities

12.7. Common EQ Ensembles

2.10. $Σ_{0}$ (Isolated Body) and $\tilde{Σ}$ (Medium)

3. Mathematical Digression on $\{d_{α}\}$

3.1. Generalizing $d \equiv d_{e} + d_{i}$

3.4. Manipulations with $d_{α}$

5. Fundamentals of the $μ$ NEQT

5.5. Statistical Entropy for $M (t)$

5.7. $Σ$ in Internal EQ (IEQ)

5.8. Gibbs Fundamental Relations for $M_{ieq} (Z)$ in $S_{Z}$ and $S_{ζ}$

5.9. Time-Dependent Gibbs Fundamental Relations for $M_{nieq} (Z)$ in $S_{Z}$

5.11.1. N Fixed for $Σ$

5.11.2. $\tilde{Σ}$ Always in EQ

6.1. Microstate Evolution in $S_{Z}$

6.2. SI-Microwork in $S_{Z}$

7.5. Reduction under Quasi-Independence for $m_{k}$

8. Properties of Entropy for $M (t)$

9.1.1. $E_{l}$ Monotonically Increases with $T_{l}$

9.1.2. $E_{l}$ Monotonically Decreases with $T_{l}$

9.2.1. $V_{l}$ Monotonically Decreases with $P_{l}$

9.2.2. $V_{l}$ Monotonically Increases with $P_{l}$

10.2. Statistical Significance of $d W$ and $d Q$

10.3. Medium $\tilde{Σ}$

11. External and Internal Variations of ${dp}_{k} (t)$

Proof of $d_{i} E = 0$ Even If $d_{i} E_{k}$ ’s Are Not

12. Extended State Space, $M_{ieq}$ and $M_{nieq}$

12.1. Choice of $Z$ for $M_{ieq}$ in $S_{Z}$

12.2. Microstate Probabilities for $M_{ieq}$ : NFl- $W$

12.3. Lagrange Multiplier Method: NFl $W$

12.5. Fluctuating $\{W_{k}\}$

12.6. $M_{nieq} (Z)$ and Its Microstate Probabilities

15. An NEQ Microwork Fluctuation Theorem in $S_{Z}$