Next Article in Journal
Comparison of Harmonic Oscillator Model in Classical and Quantum Theories of Light-Matter Interaction
Previous Article in Journal
Entropy Generation and Control: Criteria to Calculate Flow Optimization in Biological Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Foundations of Nonequilibrium Statistical Mechanics in Extended State Space

by
Purushottam Das Gujrati
Department of Physics, School of Polymer Science and Polymer Engineering, The University of Akron, Akron, OH 44325, USA
Foundations 2023, 3(3), 419-548; https://doi.org/10.3390/foundations3030030
Submission received: 21 May 2023 / Revised: 10 August 2023 / Accepted: 14 August 2023 / Published: 23 August 2023
(This article belongs to the Section Physical Sciences)

Abstract

:
The review provides a pedagogical but comprehensive introduction to the foundations of a recently proposed statistical mechanics ( μ NEQT) of a stable nonequilibrium thermodynamic body, which may be either isolated or interacting. It is an extension of the well-established equilibrium statistical mechanics by considering microstates m k in an extended state space in which macrostates (obtained by ensemble averaging A ^ ) are uniquely specified so they share many properties of stable equilibrium macrostates. The extension requires an appropriate extended state space, three distinct infinitessimals d α = ( d , d e , d i ) operating on various quantities q during a process, and the concept of reduction. The mechanical process quantities (no stochasticity) like macrowork are given by A ^ d α q , but the stochastic quantities C ^ α q like macroheat emerge from the commutator C ^ α of d α and A ^ . Under the very common assumptions of quasi-additivity and quasi-independence, exchange microquantities d e q k such as exchange microwork and microheat become nonfluctuating over m k as will be explained, a fact that does not seem to have been appreciated so far in diverse branches of modern statistical thermodynamics (fluctuation theorems, quantum thermodynamics, stochastic thermodynamics, etc.) that all use exchange quantities. In contrast, dq k and d i q k are always fluctuating. There is no analog of the first law for a microstate as the latter is a purely mechanical construct. The second law emerges as a consequence of the stability of the system, and cannot be violated unless stability is abandoned. There is also an important thermodynamic identity  d i Q d i W     0 with important physical implications as it generalizes the well-known result of Count Rumford and the Gouy-Stodola theorem of classical thermodynamics. The μ NEQT has far-reaching consequences with new results, and presents a new understanding of thermodynamics even of an isolated system at the microstate level, which has been an unsolved problem. We end the review by applying it to three different problems of fundamental interest.

1. Introduction

Thermodynamics is the study of physical systems in nature that eventually evolve in time to stationary macrostates, in which any disturbance generates restoring forces to bring them back to the stationary macrostates [1,2,3], which makes them stable macrostates, usually called equilibrium (EQ) macrostate  M eq , satisfying certain stability conditions. Any disturbance to modify macrostates of the system invariably results in nonequilibrium (NEQ) processes so that they abound in nature and obey the well-established second law [4,5,6,7]. The law is also obeyed by biological systems [8,9]. However, NEQ processes are not well-understood, as the corresponding thermodynamics (NEQT) is not yet fully developed, despite it having a long history of various competing schools [10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28], among which are the most widely known schools of local equilibrium thermodynamics, rational thermodynamics, extended thermodynamics, and GENERIC thermodynamics [21,29]. They mostly deal with the time evolution of macroscopic quantities only; the latter emerge as instantaneous averages over microstates in a more fundamental and statistical approach, and are used to characterize any thermodynamic process and the resulting nonnegative entropy generation Δ i S 0 , as first proposed by Clausius [30,31]. In contrast, the equilibrium (EQ) thermodynamics (EQT) in which Δ i S 0 is based on the original ideas of Carnot, Clapeyron, Clausius, Thomson, Maxwell, and many others [3,12,32,33,34,35,36,37,38,39,40,41,42,43,44,45], and has by now been firmly established in statistical physics, thanks to Boltzmann [46,47] and Gibbs [48], who established that classical EQ thermodynamics is a direct consequence of the EQ statistical mechanics [33,34,37,38] that deals directly with microstates m k of the Hamiltonian H of the system, and their equilibrium probabilities p k eq that together specify the EQ macrostate  M eq . In contrast, EQT deals directly with M eq without any need to know m k and p k eq .
In general, the collection m k , p k of microstates and their probabilities is used in a statistical description of a macrostate M of the system Σ that may be isolated or interacting with a medium Σ ˜ , as shown in Figure 1. The same microstate set m k determines different macrostates depending on the probabilities p k = p k ( M ) with which m k appears in M . As H is by definition deterministic, m k is also deterministic. Thus, it is independent of p k , but is specified by its energies E k and the parameters defining H . Because of this, m k and E k are the same for any of its possible macrostates M including M eq . This allows E k to be treated as purely mechanical, which is then supplemented by p k to add stochasticity to the mechanical system. Such a description has proven very useful in EQ statistical mechanics [33,34], where the concepts of the entropy S eq = S ( M eq ) that was first introduced by Clausius [30,31] as a state function of M eq , and the temperature T are the new concepts that play a central role in the resulting EQ thermodynamics of Σ . As such, it is very common to use them to distinguish a thermodynamic system from a mechanical system by recognizing that the concept of heat (a consequence of a particular commutator as described later) is novel to thermodynamics but is not applicable to a mechanical system, which is traditionally taken to be described by a purely conservative Hamiltonian H . We use X ( N , E , V , ) to collectively denote the number of particles N, their energy E, the volume V occupied by them, etc., as representing the common thermodynamic extensive state variables that determine M eq M ( X ) in the state space S X spanned by X . We call them observables. As observables, these variables can be controlled from the outside of the system. We will allow X = X ( t ) to have time dependence in this work; here, t denotes the time. For the moment, we suppress the suffix “eq” for notational simplicity unless necessary as we are dealing with S X . To be useful, S and T must uniquely refer to the thermodynamic state M ( X ) . This unique relationship is what is meant by S being a state function of X , which when inverted gives E a state function of ζ ( S , w ) , w X E , where E stands for deleting E from the set preceding it. Being functions of M of Σ , S and T, an intensive field, must be interrelated in some fashion such as
1 / T = S / E ,
(see Equation (129)) so only one of them can be treated as a primitive concept, which we take to be the entropy. The goal of NEQT is to then specify it in terms of X in the state space S X . In this respect, having S a state function considerably simplifies the study as we then deal with M eq . When this cannot be done, we must go beyond S X to an extended state space in which the NEQ entropy also becomes a state function, which is the central theme of this review. In this space, a uniform global temperature of the body is defined as its unique field by the above derivative in the extended state space. Thus, our goal will be to identify the NEQ entropy in this space.
Although S plays important roles in diverse fields ranging from classical thermodynamics of Clausius [3,10,12,13,17,20,24,25,30,33,39,40,41,42,43,44,46,47,48,49,50,51,52,53,54,55,56,57,58], quantum mechanics and uncertainty [59,60,61], black holes [62,63,64], coding and computation [65,66,67], to information technology [68,69,70,71,72], it does not seem to have a standard definition in all cases, even though it is well-defined under EQ conditions, as extensively discussed in the literature; see, for example, [46,47,48,73,74,75,76,77,78]. As  S eq is uniquely determined by p k eq as a state function, p k eq ’s must be unique functions in S X , as is well-known [33]. Requiring this uniqueness will be a guiding force in our endeavor to formulate the NEQ statistical mechanics. Whether S has any physical significance in a NEQ macrostate M has been a topic of extensive debate; see for example [73,74,75,76,77] and references therein. The  problem arises because it is not clear if, and how, M can be uniquely identified. Because of the lack of uniqueness, introducing S ( M ) as a state function becomes nontrivial. The same concern also applies to p k .
Recently, we have been able to extend the classical concept of Clausius entropy from EQ states to NEQ states where irreversible entropy is generated [75,76,77,78]. That approach is an outgrowth of an earlier review [79] in this journal about a possible source of stochasticity that is required in a thermodynamic system, even though its mechanics is completely deterministic due to its Hamiltonian dynamics so that heat and temperature have no mechanical analogs. Not appreciating that the source of stochasticity is independent from the deterministic (mechanical) aspect has been a source of bitter debate between Boltzmann, Zermelo, Poincare, and many others ([45,56,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96], and references cited in there). The dispute required Boltzmann to propose the ideas of molecular chaos and of the ergodicity hypothesis [91] that have played a major role in EQ statistical mechanics. We discuss these important ansatze in [92,93,94,95] with an emphasis on Kac’s ring model [97,98] in more detail, where we find that they are not fulfilled in a deterministic dynamics. We infer, as is commonly believed, that one needs a stochastic dynamics for the ansatz to be satisfied. Both these ideas can only be supported by a stochastic dynamics as discussed in these references [99,100,101,102].
It is clear that we need to supplement a purely mechanical approach by supplementing it with stochasticity. We accomplish treating both aspects separately but unifying them together and enabling uniqueness by using an extended state space S Z spanned by extensive state variables (compactly denoted by Z = Z ( t ) = X ( t ) ξ ( t ) as an extension of X ( t ) in this review) to obtain a state function S. In general, Z includes the observables but possibly some more independent variables, compactly denoted by ξ ( t ) required for an NEQ situation, as will become clear later. The additional state variable ξ , when properly chosen as will be described later, allows for a unique description of the macrostate M ( Z ) in S Z . Once such a state space has been uncovered for M ( Z ) , its entropy S ( M ) also become a state function S ( Z ) in S Z . This again requires its p k ’s to be unique functions in S Z , just as p k eq are in S X . Thus, the identification of an appropriate S Z immediately solves the problem of obtaining a unique statistical mechanics of an NEQ system as it directly leads to p k as a unique function of m k and M ( Z ) in S Z .
In order for such an approach to work, and in particular for S ( Z ) , which itself is a system quantity, it is crucial that we deal with only system-intrinsic (SI) quantities (they are determined by the system), and not medium-intrinsic (MI) quantities (they are primarily determined by the medium) for the simple reason that utilizing m k requires their specification by the Hamiltonian of the system and so require SI-quantities for its specification. (We will use body to refer to Σ , Σ ˜ , and  Σ 0 , and BI-quantities to refer to quantities of a body.) As will become clear in the following, these quantities capture the internal processes going on within the system. They cannot be fully captured by the MI-quantities, even though they have been traditionally used in thermodynamics, for the simple reason that they retain the memory of the medium and can depend on the system only weakly. Thus, they will require additional steps to study internal processes. There has been a long debate about the relevance and significance of the two kinds of quantities that ensued from a very different perspective [103,104,105,106,107], but did not capture the importance these quantities acquire in our approach.
The SI-quantities allow us to develop our NEQ statistical mechanics, which for brevity is identified as the μ NEQT, with  μ referring to the microstates m k , in which we directly capture internal processes that are responsible for irreversibility. As the collection m k is the central object in the μ NEQT, the latter deals with quantities such as E k , p k . At the microstate level, there are fluctuations that are essential in a statistical treatment, and are properly captured in the μ NEQT through the fluctuations in E k and p k over m k . In contrast, the use of the MI-quantities does not directly describe m k so it cannot properly yield a statistical mechanical description of an NEQ process in a system. This is one of our most important conclusions. In particular, an important consequence of the μ NEQT as will be shown later is that MI-quantities, after  reduction (being averaged over the microstates of the medium) under commonly accepted conditions of quasi-additivity and quasi-independence, do not exhibit any fluctuations. This explains why they are not suitable in developing the statistical mechanics. We call the resulting version of the microstate NEQT the μ ˚ NEQT; the circle on μ is a reminder for the use of “exchange” microquantities derived from the MI-quantities in its formulation. The most prominent are the exchange (also called external) microwork d e W k = d e W , k , and the exchange (also called external) microheat d e Q k = d e Q , k , thus, explicitly exhibiting that they have no fluctuations. Because of this, it does not directly capture internal processes at the microstate level, which require additional steps to describe irreversibility as mentioned above. The  corresponding macroscopic NEQT from the two approaches are called the MNEQT and the M ˚ NEQT , respectively; here, M stands for the macroscopic description in terms of macrostates, the circle again having the same connotation as above. There are no fluctuations in these theories, as is well-known. The  M ˚ NEQT is the standard formulation of classical thermodynamics and has been discussed extensively by many prominent scientists [13,18,33,39,41,42,51,108], some including internal variables that play an important role in our approach.
It should be obvious from the above discussion that we need to make a clear distinction between fluctuating (Fl) and nonfluctuating (NFl) quantities. In  addition, we also recognize that there are many other macrostates in S Z for which neither S nor the corresponding p k ’s are unique functions in S Z , so S must be treated independently of Z . Our previous work did not consider such states, but they will be considered in this review. For this purpose, we will find it convenient to introduce the following state variable sets:
S , E , w X E , W Z E , Z , ζ = ( S , W ) , χ = ( S , Z ) ,
and the corresponding state spaces S S , S E , S W , S Z , S ζ S S S W , and  S S S S Z , where the suffix denotes the variable set forming the state space.
We should emphasize that internal variables also appear in mechanical systems. A simple example is that of two particles in a system, whose interior is hidden in the lab from us so that we cannot see where the particles are inside the system. From outside the system, we can only be aware of the position of the center of mass by observing its motion in the lab. However, there is no way to determine their separation within the system. This separation and the corresponding relative motion are examples of the internal variable and its motion, and play a role in the dynamics of the mechanical system. Thus, it should not come as a surprise that such internal variables will also be relevant in a thermodynamic system. Indeed, we will see later in Section 14 that this relative motion becomes the source of “microfriction”, resulting in friction, when we treat the system in thermodynamics.
To appreciate at a more fundamental level the distinction between a mechanical and a thermodynamic system, we first realize that both systems are usually separated from their surroundings Σ ˜ by some clear partitions, the most common being the walls between them; see Figure 1. We collectively call them containers or walls that contain the system [109]. In this review, we find it convenient to not include the container as part of the system, but use it to determine the boundary conditions for the equations of motion or as defining parameters in the Hamiltonian H of the system Σ . As  H plays the role of E, the parameters w and W are obtained by taking out E from X and Z , respectively:
w = ( V , ) , W = ( V , , ξ ) ;
where ⋯ refers to the rest of the elements in X besides V [110]. As will become evident below, these parameters denote the work parameter in the Hamiltonian, which we will denote by H ( x w ) or H ( x W ) , respectively. For simplicity in the following, we will always use W for the work parameter to refer to both cases and express the Hamiltonian as H ( x W ) . The parameter can be varied in a process with a concomitant change in H ( x W ) due to the work done by the system. This is in accordance with the work–energy theorem of mechanics that states that the change in the energy is due to work alone. Also, x = x ( t ) r ( t ) , p ( t ) is the dynamical variable and denotes the collection of coordinates r ( t ) and momenta p ( t ) of the N particles in the phase space Γ ( x W ) of Σ [110]. As internal variables play no role in EQ, W = w in EQ. For any x Γ ( x W ) , the  deterministic energy of Σ in the state specified by x is E x ( W ) = H ( x W ) , which need not be constant. However, there is no stochasticity so there is no concept of heat. Thus, the Hamiltonian itself cannot explain the fundamental difference between the two systems [79].
We elaborate further. Mechanics is a branch of the physical science to study the deterministic behavior of the system in the presence of known forces and radiation in time. The central concept is that of energy whose changes are governed by deterministic Hamiltonian equation of motion in Γ ( x w ) with deterministic boundary conditions such as at the walls confining the system Σ (see Figure 1a) that generate deterministic wall potentials acting on the particles. Accordingly, a point x ( t ) Γ ( x w ) is uniquely determined by x ( t 0 ) at some reference time t 0 . A central aspect of the equation is that it uniquely determines the properties of the system in the future ( t > t 0 ) as well as in the past ( t < t 0 ) [111,112]. We will assume in this review that the fundamental weak nuclear forces are not included in our discussion [113]. A movie of such a deterministic process in the future, when run backward for the past, will appear just as natural with no hint of the direction of the time flow. Thus, starting from x , which also identifies a microstate [110], at  t, the state undergoes a unique state transformation
x x
in the same interval Δ t for any Δ t . If we now consider an ensemble of the same mechanical system, each prepared in the state x at t = 0 , then at t = Δ t , each system will be in the same state x . In the language of probability theory [114], we say that x follows with certainty from x in Δ t 0 . (This will be useful later to associate the concept of a constant entropy to a mechanical system but not heat.)
But the above invariance is contrary to our daily experience as a rule [115,116,117,118,119,120,121,122,123,124]. For example, the initial state x 0 may be when all the gas particles are confined to a small portion of the container [109] located at the center of the container. We are not interested in particle momenta. As the gas expands spontaneously, it occupies the entire volume uniformly. However, once the gas has occupied the entire volume in the state x 1 , the reverse evolution is not seen in nature. Similarly, the  cream mixed in a cup of coffee does not ever unmix on its own. The smoke from a burning piece of wood only spreads out in the room, but never confines itself on its own. If we run the movies in any of these cases backward, we immediately realize that the backward movies do not represent physical phenomena that are consistent with our daily experience.
This lack of time-reversal invariance of the equations of motion is a natural fact of daily life where we deal with macroscopic systems [125] that eventually evolve in time to M eq . This is at the root of the second law of thermodynamics, and can be easily explained as follows. It happens here that each member of the above ensemble that was initially prepared in the same state x evolves during a fixed interval Δ t into different states x ( n ) = x ,   x ,   x , for different n = 1 , 2 , 3 ,
x x ( n ) , n = 1 , 2 , 3 , ,
then the certainty implied in Equation (4) is lost so that most often it would happen that the states of different members after Δ t would have no discernible pattern for x ( n ) and appear haphazard for the members. The result is ([126], pp. 1–14) a loss of physical determinism [127]. Thus, the mapping
x x
in (5) between x and one of its evolved states x is one-to-many, and the mapping becomes unpredictable, i.e., stochastic [114]. One possible explanation of the loss of certainty at the level of states lies in the presence of stochasticity in the system due to the uncontrollable interactions with the surroundings, as discussed elsewhere [79] and elaborated later in Section 7. This is the foundations of classical probability theory by Laplace, and used to formulate the idea of density matrix by Landau [59,128] and von Neumann [129]. In this case, the mapping (6) cannot be reversed, and we cannot perform time-reversal of the evolution anymore. It is the success of a probabilistic approach to nonequilibrium thermodynamics that prompted Maxwell [50] and Boltzmann [130,131] to promote the “ergodic hypothesis” to achieve EQ. One of our aims in this review is to follow the consequences of this stochasticity in the dynamics such as in the Brownian motion [132] and Langevin’s equation [133], and  extend the concept of ergodicity to a special class of NEQ states [134] that has been identified as internal equilibrium states; see Definition 9.

1.1. Scope of the Review

It should be obvious that the scope of the NEQ statistical mechanics, the  μ NEQT, is more general than that of the equilibrium statistical mechanics, to be denoted here simply as the μ EQT in short, in that the attempts are now mostly to deal with the most general time evolution of microscopic quantities in the former. The instantaneous averages of these quantities over microstates m k are used to specify the instantaneous macrostates M required to characterize any thermodynamic process P in time in the MNEQT. Thus, the tasks in the μ NEQT and the MNEQT are more difficult and their foundations less developed, which justifies the motivation of this review. The exception is the validity of the first law in terms of exchange (or external) work and heat between Σ and Σ ˜ in thermodynamics, which plays a central role in the M ˚ NEQT . These MI-quantities are determined uniquely by Σ ˜ regardless of P being reversible or irreversible, and are easily identified under generally acceptable conditions such as Σ ˜ always being in EQ, quasi-additivity and quasi-independence; see later. Some of the approaches in the M ˚ NEQT employ the enlarged state space S Z [18,21,42,108]. Being associated with an EQ Σ ˜ , the MI-quantities including the exchange (or external) entropy carry no information about irreversibilities going on within the system. In contrast, the MNEQT based on the use of the SI-quantities include, by definition, these irreversible contributions so they are directly obtainable. One of our goals, besides laying down the foundations of the μ NEQT, is to justify the MNEQT from the μ NEQT.
A system in EQ always has its observables uniformly distributed throughout the system so it is uniform in S X [33]. In  contrast, an NEQ system is not uniform and requires additional information about the nonuniformity to uniquely specify its states, which is provided by a proper choice of internal variables in ξ . The set ξ allows us to treat Σ as uniform in the state space S Z (see Section 5.7) so that there is a unique thermodynamic temperature and other fields for the entire system even though it is still nonuniform in S X . This is very useful to obtain a proper thermodynamics of the system. For example, the  single thermodynamic temperature T even for a nonuniform system satisfies Clausius’s theorem that heat flows from hot to cold. This is what makes the μ NEQT in the extended space S Z so useful and desirable.
Various microquantities associated with Σ (having microstates m k ), Σ ˜ (having microstates m ˜ k ˜ ), and Σ 0 (having microstates m 0 k 0 ) carry the suffix k , k ˜ , and  k 0 , respectively. However, we are only interested in microquantities associated with m k as our focus is on Σ . This means that microquantities of Σ ˜ and Σ 0 must be manipulated so that they can be associated with m k . To accomplish this, we introduce the principle of reduction, which accounts for the correlation introduced by mutual interactions between Σ and Σ ˜ . Under commonly accepted conditions about Σ ˜ , the principle shows that the effect of Σ ˜ on Σ can be incorporated by treating its microquantities in the form of exchange (or external) quantities having no fluctuations. This is what makes the MI-quantities play such an important role in classical thermodynamics, but makes them unsuitable to extract fluctuations in a statistical theory.
Our goal here is to provide a comprehensive and self-contained introduction to our recently developed NEQ statistical mechanics ( μ NEQT), in which we study deterministic time evolution of individual microstates in m k along Hamiltonian trajectories in γ k during P . When quantities associated with these trajectories are averaged over them using their probabilities, the result is the MNEQT, an extension of the equilibrium thermodynamics to describe NEQ processes. This consistency with the MNEQT is not only a check on the validity of the μ NEQT, but also a justification of the MNEQT by the μ NEQT. The use of the SI-quantities in the μ NEQT allows for directly obtaining quantities such as Δ i S after averaging. Thus, the  μ NEQT is an extension of the EQ statistical mechanics [33,34], the μ EQT, that was originally developed by Boltzmann [46,47] and Gibbs [48], and limited to Δ i S 0 .
We will follow deterministic trajectories γ k during P between two macrostates M in and M fin . Only the latter determine the trajectories so they are the same for all processes P between them. This makes γ k independent of the trajectory probabilities p γ k controlling various P ’s, which is similar to m k being independent of the microstate probabilities p k . The extended state space S Z is chosen appropriately to uniquely specify m k and γ k in it. This uniqueness is an important aspect of the μ NEQT and the MNEQT as it is missing in other contemporary NEQT theories [10,12,13,17,18,19,20,21,24,25,26,27,28,99,135,136,137,138,139,140,141,142,143,144,145,146,147]. The instantaneous E k along γ k can only change mechanically due to the variation in W . This variation is responsible for the net change Δ E k along γ k , and is only determined by M in and M fin and not by p γ k as noted above. To complete the formulation of the μ NEQT, we determine the unique  p γ k ( P ) for any P in S Z , which is another exceptional aspect of the μ NEQT. This way, the deterministic aspect of a process (the mechanical work) has been separated from the stochastic aspect (the heat) in thermodynamics in a unique way in the μ NEQT for any P , NEQ or not. With the unique probabilities in hand, all calculation can be carried out exactly in the μ NEQT, once S Z has been identified. In the μ ˚ NEQT, the trajectory probabilities need to be determined using additional steps such as using the master equation [54], Fokker–Planck equation [37,102], etc., which are phenomenological.
Being deterministic, microquantities associated with m k or γ k are not constrained by the second law, which is a macroscopic law based on stochasticity. This is not surprising, as the Hamiltonian dynamics has nothing to say about the second law. For the MNEQT, we need to determine various thermodynamic averages over γ k using p γ k . Thus, the development of the μ NEQT is carried out in two independent stages. First we determine mechanical quantities as if the system is a mechanical one following Hamiltonian dynamics. Its stochastic aspects are captured by p γ k ( P ) , which determine not only mechanical averages such as work but also the stochastic averages such as heat and entropy. It is the latter that finds itself manifested in the second law for appropriate choices of p k and p γ k . By simply modifying the second stage, we are able to investigate the catastrophic consequences of violating the second law. This proves the usefulness of our approach. With  Δ E k and p γ k ( P ) in hand, we now have a complete NEQ statistical mechanics to describe any process P . The division in the two distinct and independent stages is of central importance to the μ NEQT and the MNEQT [148,149,150,151,152,153,154,155,156,157].
We have successfully applied the μ NEQT recently to study free expansion [154], to provide a correct application of microwork and microheat [155,156] in the various modern fluctuation theorems [26,158,159], and to describe viscous dissipation [157] associated with the dynamics of a Brownian particle (BP) [115,132,133,140] in its medium by developing an alternative to the stochastic Langevin description [38,99]. The above applications clearly show the usefulness of the μ NEQT. However, our previous studies were mostly limited to microworks; microheats were not treated as extensively. One of our major incentives here is to overcome this limitation to determine the μ NEQT for which the central requirement is the unique microstate probability p k in the state space S Z . This ensures that M ( Z ) and S ( Z ) are uniquely identified in S Z . Such macrostates are said to be in internal equilibrium (IEQ) in S Z and written as M ieq or M ( Z ) , as opposed to EQ macrostates M eq = M ( X ) in S X . The  unique entropy S ( Z ) has the maximum possible value for a given Z so it has no memory of where the microstate has come from. Once M becomes uniquely specified as M ( Z ) in S Z , it satisfies the extension of the ergodic hypothesis for M ieq ; see Section 14 for an example.
But the applications so far of the μ NEQT have provided only a piecewise and incomplete description of the μ NEQT [148,149,150,151,152,153,154,155,156,157] that was restricted in scope to highlight its NEQ aspects in the limited context. This comprehensive review aims to overcome this limitation and provide a complete introduction to the foundation of the μ NEQT by assimilating and extending together the previous results and by including missing details and newer aspects that emerge from the use of the SI-quantities in the extended state space S Z , where m k and M are uniquely specified in an IEQ macrostate M ieq just as they are uniquely specified for an EQ macrostate M eq in the EQ state space S X . The μ NEQT has met with success, as we will describe in this review, so it is desirable to introduce it to a wider class of readers.
Due to its microscopic SI-nature, the  μ NEQT provides a more detailed description of fluctuations in a thermodynamic process that are hidden in the MNEQT. For this reason, therefore, the former is highly desirable from both a theoretical and experimental point of view. It is an extension of the MNEQT [77,78,134,148,149,152,153,160] to the microstate level, which brings about a very close parallel with μ EQT [32,33,34,36].
A microstate m k , k = 1 , 2 , , carries an index k; the set m k forms a countable set and is specified by its energy set E k ( W ) ; however, we will usually suppress W in m k and E k ( W ) , unless necessary. In a macrostate M , m k ’s appear with a probability p k ( M ) ; see Section 7 for details. For simplicity, we will also not explicitly show the argument M in p k ; the dependence is always implicit. In the rest of the review, all quantities pertaining to M are identified as macroquantities, while those pertaining to m k are identified as microquantities that always have the microstate index k of m k or of x k in H ( x k W ) ; see Definition 4. After statistical averaging over microstates using their probabilities p k (see Equation (12) for its proper definition), we obtain quantities without k or x .
A microquantity associated with m k will always carry the index k (see later). A macrostate M and a macroquantity associated with it do not carry the index k so it is always easy to distinguish the two kinds of quantities. We will continue to use “quantity” to stand for both microquantity and macroquantity, unless clarity is needed.

1.2. System-Intrinsic and Medium-Intrinsic Thermodynamics

As the medium is always taken to be in EQ, its properties do not change even if the system is out of equilibrium. This has made the choice of MI-description ( M ˚ NEQT ) very convenient to formulate classical thermodynamics [13,18,33,39,41,42,51,108], in which one uses the exchange macroheat Δ e Q = T 0 Δ e S in terms of the exchange entropy (see Equation (46)) and the exchange macrowork Δ e W (see Equation (135c)) such as Δ e W = P 0 Δ e V = P 0 Δ V for the PV-macrowork; see Equation (94) for the first law as an example. Here, T 0 and P 0 are the temperature and pressure of the medium (see Figure 1), which remain the same for all possible states of the system. This has made the M ˚ NEQT a highly desirable thermodynamic theory as it is applicable in all cases. The main problem with this theory is that it is not directly applicable to an isolated system in Figure 1b for which exchange quantities are identically zero, but which provides the most cogent formulation of the second law Δ S 0 0 ; see Equation (213) in Proposition 3. It is useful only for an interacting system in Figure 1a for which the second law is stated indirectly in terms of irreversible entropy generation Δ i S 0 ; see Equation (67c). Indeed, all irreversible quantities including irreversible macrowork are indirectly determined.
In contrast, the MNEQT provides an SI-description involving quantities associated with the system alone so it is applicable to both systems in Figure 1 by explicitly taking into account the EQ properties of the medium, when it is present. All irreversible quantities including macroworks and macroheats are contained in this approach so they are determined directly in the MNEQT.
We elaborate on the distinction between the MNEQT and the M ˚ NEQT . The  exchange quantities d e Z require the system Σ to be embedded in a medium Σ ˜ (see Figure 1a) and are controlled by Σ ˜ [154] so that d e Z = d e Z ˜ (see Section 2) and  are easy to handle and measure, as Σ ˜ is normally taken to be in equilibrium with no irreversibility ( d i Z ˜ = 0 ) so that d e Z ˜ = d Z ˜ . Thus, the exchange quantities do not directly provide any information about d i Z and any irreversibility as mentioned above. As an example, the lost macrowork due to irreversibility in the M ˚ NEQT is defined as
d ˚ lost W = d ˚ rev W d ˚ irr W 0 ,
where various d ˚ rev W and d ˚ irr W refer to the exchange macroworks along two distinct processes: a reversible and an irreversible. We have used a new notation d ˚ to ensure that any d ˚ W is not confused with d W in the MNEQT. It is easy to see that d ˚ lost W is precisely the irreversible macrowork d i W , which is determined by the actual process.
Similar distinctions can also be noted between the μ ˚ NEQT and the μ NEQT; they differ at least in the following important ways, with sweeping consequences, as we will see:
A.
The internal microwork Δ i W k has no analog in the former because it uses the following questionable conjecture:
Δ e W k = ? Δ E k ,
(see Section 15) which is often used in fluctuation theorems [99,135,136,137,138,139,140,141,142,143,144,145,146,147]; the use of = ? is a reminder of its possible questionable nature, which is justified later in Theorem 7. In these fluctuation theorems, one begins with the conventional form of the first law d E = d e Q d e W in terms of exchange macroquantities, but identifies
d e W = ? k p k d E k , d e Q = ? k E k d p k .
As a consequence of the above identification, no distinction can be made between fluctuating microwork
d W k d E k ,
which is an identity in accordance with the work–energy theorem (see Theorem 6) and nonfluctuating exchange microwork
d e W k = d e W , k ;
see Theorem 7. The distinction is always maintained in the latter, in which we also show (see Section 10.1) why the above identification cannot be rigorously justified. Similar conclusions as above are obtained by replacing infinitesimal d α by accumulation Δ α , properly defined in Section 13 along a process P .
B.
Consequently, the microforce imbalance ( μ FI) that results in fluctuating Δ i W k = Δ i E k , a ubiquitous quantity, is absent in the former in that Δ i W k = Δ W k Δ e W k 0 but is always present ( Δ i W k 0 ) in the latter.
C.
The former results in a first law of thermodynamics ( Δ E k = Δ e Q k Δ e W k ) for each m k , while the latter has it hold ( Δ E = Δ e Q Δ e W ) only for a M ; however, see Equation (243).
D.
The lost or dissipated macrowork Δ lost W measured by the average Δ i W k should be absent in the former due to its above conjecture, but is always present in the latter.
E.
The exchange microwork Δ e W k depends on the entire trajectory γ k in the former to make it fluctuating over γ k , while in the latter, Δ W k depends only on the terminal microstates of γ k , and  Δ e W k Δ e W is nonfluctuating (it is the same for all γ k ’s).

1.3. Main Results

The review emphasizes the very close parallel with EQ statistical mechanics ( μ EQT) that is clearly seen in the microstate probabilities and the existence of IEQ partition functions for M ieq . There are also major differences mainly in new concepts, some of which are very counter-intuitive, such as ubiquitous d i E k , microforce imbalance ( μ FI) and internal microwork d i W k resulting from it, etc., for any macrostates including M eq that have not been appreciated so far. They have been introduced previously [77,78,150,156,157] but now receive detailed explanation here. For example, it is a well-known fact that d i E = 0 [12] (see Equation (53a)) for any M ; yet d i E k is fluctuating and so can be different from zero, its average. The presentation here is simple enough to reach even an untrained reader. To accomplish this goal, we only focus on some examples borrowed from undergraduate physics so that a reader will not be lost; however, it does require an open mind to learn new concepts that are counter-intuitive and perplexing, as it is very hard to shake off old preconceptions.
Remark 1. 
As μEQT only deals with EQ processes, the second law plays no role here. However, the situation in the μNEQT is different, where we deal with NEQ processes. As the second law does not operate at the microstate level, our development of the μNEQT is not limited by this law. To make contact with thermodynamics, however, we will have to impose it at the level of macrostate. By investigating the internal inconsistencies that emerge if the second law is violated, we are able to conclude that the law cannot violated for a stable system. This is one of the most important benefits of our approach.
Throughout this review, we work in the enlarged state space S Z so we include at least one internal variable ξ as a prototype to make our discussion more realistic, as will become clear in Section 4 and Section 14. The main emphasis here will be to demonstrate the ubiquitous nature of internal changes such as d i E k , a new concept whose existence has not been previously appreciated in various fluctuation theorems [26,158,159]. Not recognizing its existence has resulted in the conjecture d e W k = d E k = d e E k (see Equation (7)), used extensively in the μ ˚ NEQT. This is contrary to a central result of the μ NEQT; see Theorem 6. It is the microforce imbalance ( μ FI) between the internal and external microforces, a hitherto unrecognized purely mechanical concept at the microstate level in EQ and NEQ thermodynamics, that generates d i E k and is present in all processes, whether they are thermodynamic or not, as we will demonstrate. This is the most important outcome of the our approach; see Proposition 2. It emphasizes the importance of SI-quantities (such as in d E k = d W k ) that are very different from the MI-quantities (such as in d e E k = d W ˜ k ) for any γ k , even if the trajectory belongs to a reversible process. The use of generalized work d W = d E m in Equation (234a) as isentropic change allows us to calculate microscopic work (microwork) d W k , which changes E k but not p k . This is because m k , whose concept is independent of p k , uniquely determines E k for a fixed work set W ; see Definition 5. Therefore, d E k is uniquely determined by d W and does not have any contribution from the change in p k . On the other hand, the generalized heat d Q allows us to introduce microscopic heat (microheat) d Q k , which does not change E k but changes p k . The above mutually exclusive nature of  d W k  and  d Q k  proves to be a great simplification and allows us to treat  d W k  and  d Q k  as purely a mechanical and a stochastic concept, respectively, in the development of the  μ NEQT. In addition, as  d E k  does not have any contribution from d p k , it has no microheat contribution, so there is no first law for m k in the μ NEQT.
As E k is fluctuating, d E k = d W k is also fluctuating and is uniquely determined as d E k E k ( W + d W ) E k ( W ) for m k ; the (slow or fast) nature of the process is irrelevant. The latter only controls p k . This provides a simplification in evaluating the cumulative change Δ W k , which is independent of the nature of P between two macrostates; see Remark 71 and the discussion following it. The  fluctuating microwork d W k is different from Δ W ˜ k = Δ W ˜ , which is the microwork done by the working medium on m k after reduction, and which depends strongly on the nature of P but is the same for all microstates for a given P .
The most important new results that emerge in the μ NEQT are the following:
  • a clear separation of different kinds of work and heat and their fluctuations that emerge from d α ;
  • additional thermodynamic forces for irreversibility due to internal variables;
  • stochasticity resulting from a nonvanishing commutator C ^ α d α A ^ A ^ d α ;
  • exchange microquantities are nonfluctuating, which makes them useless for directly obtaining fluctuations and irreversibility;
  • the fundamental identity Δ i W = Δ i Q between irreversible macrowork and macroheat generalizing the result of Count Rumford and the Gouy-Stodola theorem;
  • the origin of work dissipation Δ i W > 0 in an irreversible process;
  • the uniqueness of macrostates and microstate probabilities in the enlarged state space for M ( Z ) determined by the experimental setup;
  • the μ NEQT justifies the MNEQT as the μ EQT justifies the EQT.

1.4. Layout

The layout of the paper is the following. In the next section, we introduce our notation, definitions, and new concepts, which may be unfamiliar to many readers but are justified in the following sections. We describe here our basic approach that a thermodynamic description is equivalent to treating microquantities as purely mechanical without any consideration of stochasticity, to be followed by bringing in microstate probabilities to determine macroquantities, just as in EQ statistical mechanics. Microstate probabilities are not truly microquantities as they are not independent of each other. The stochasticity adds the dimension of entropy, without which we only have a mechanical description of an NEQ body in S Z . An arbitrary macrostate M arb is divided into an EQ macrostate M eq and an NEQ macrostate M neq ; the latter is further divided into an IEQ (internal equilibrium) macrostate M ieq and an NIEQ (non-internal equilibrium) macrostate M nieq . The IEQ macrostates share all the properties of EQ macrostates, except that the former have nonvanishing irreversible entropy generation Δ i S > 0 . The principle of reduction is also introduced here. In Section 3, we discuss the mathematical properties of and manipulations with the linear operators d α , and give some examples for clarification. The origin of internal variables is explained in Section 4, where we show that they also emerge in mechanical descriptions so that they are not unique to thermodynamics. This explains why we need the enlarged state space S Z for microscopic mechanical descriptions as well. We finally present the fundamentals of the μ NEQT in Section 5. This is a very important section, where we present various axioms and requirements of the μ NEQT. We then discuss stochasticity to derive a very general formulation of the entropy in terms of p k , which is then used to obtain the unique form of p k for M ieq . An important and surprising aspect of the μ NEQT is obtained in the equality of internal microwork (a mechanical microquantity) and microheat (a stochastic microquantity) even though they have distinct origins. At this stage, we have a complete and unique NEQ statistical mechanics (the μ NEQT) in S Z . We identify SI-macroquantities and use them to derive the MNEQT for M ieq exemplified by the Gibbs fundamental relation in S Z , which is then generalized to obtain the Gibbs fundamental relations for M nieq in S Z .
In Section 6 and Section 7, we begin to introduce the mechanical and stochastic aspects of the μ NEQT, respectively. In Section 6, we use W to identify microforces that operate in the mechanical formulation of the body so they are also present in its thermodynamic formulation. We use them to introduce the concept of microforce imbalance in Section 6.4, which captures the mechanical disparity between Σ and Σ ˜ . The imbalance is responsible for the internal microwork. In Section 6.5, we derive the extension of the work–energy theorem of mechanics in S Z . In Section 7, we revisit a previous proposal for the origin of stochasticity and extend it further by discussing the effect of correlations between Σ and Σ ˜ , and introducing the principle of reduction in Section 7.2. We then discuss quasi-independence in Section 7.3, and the simplification it brings about in thermodynamic considerations after reduction, especially with respect to the effects produced by Σ ˜ on Σ , which is discussed in Section 7.4 and Section 7.5. The discussion, which forms a very important part of the review, shows why classical thermodynamics works so well.
In Section 8, we discuss the properties of the unique entropy S ieq for M ieq in S Z , and discuss its approximate formulation as a flat distribution that is commonly used in EQ statistical mechanics. This distribution neglects any fluctuations in the entropy, which are always present in the body. Despite this, it correctly gives the entropy so it can always be used to determine it as it simplifies the calculation. We show that the entropy additivity requires quasi-independence in Section 8.1 so the latter should not be confused with the principle of additivity for W . Using this flat distribution, we provide a simple proof of the second law for M ieq in S Z in Section 8.3 by simply counting the number of distinct microstates as the system evolves in time, which can only increase with time; see Theorem 8. This direct proof is supplemented by Theorem 9 in Section 8.4 that the law is simply a direct consequence of the stability of the system so it does not need to be included as an additional part of Axiom 2 in the μ NEQT; see Section 5). In Section 9, we show that a violation [161] of the second law results in internally inconsistent thermodynamics for stable physical systems, and cannot be taken seriously (see Conclusion 7), even though thermodynamic instabilities arise in approximate calculations such as van der Waals equations or mean field, but are always removed from consideration; see Remark 58. Therefore, we will always assume that we are dealing with a stable system for which the law is always valid, as noted in Section 1, except in Section 9. In Section 10, we initiate the formulation of the μ NEQT by focusing on the two most important concepts, those of generalized or BI-microwork and microheat for Σ b . We show that various micro- and macroheats emerge from the nonvanishing commutator C ^ α introduced in Equation (229). For a fuller understanding, we first revisit in Section 10.1 the ensemble average of a fluctuating state variable, and its change in a process P . We show that for Z Z such as E belonging to S Z , its change d Z consists of two independent process contributions in orthogonal state spaces S Z and S S , a mechanical one d Z m at fixed p k in S Z , and a stochastic one d Z s at fixed Z k in S S . Thus, d Z d Z S χ . In contrast, the stochastic state variable S S S has only stochastic contributions belonging to S S . For E, d E m represents the negative of the generalized macrowork d W , and  d E s the generalized macroheat d Q in the body. Their statistical interpretation is covered in Section 10.2, where we show that d W k is purely mechanical, and  d Q k purely stochastic. In Section 11, we discuss how d e p k and d i p k are determined, and how they determine the forms of various microworks, microheats, and microentropies. We also give a general proof of the identity d i E 0 , even if d i E k 0 , k . This now completes the formulation of the unique NEQ statistical mechanics ( μ NEQT) in S Z .
The only thing remaining for a complete formulation of the μ NEQT is to identify the choice of S Z , which is discussed in Section 12. This is a very important section that describes how the choice of S Z is dictated by the way an experiment is performed, which must not come as a surprise for an NEQ process. This is because the observation and relaxation times play important roles here. By ordering various internal variables with their relaxation times in decreasing order, we show that only those internal variables have to considered whose relaxation times are greater than the observation time to uniquely specify the macrostate in S Z . We show how the unique microstate probability is identified. We consider the possibilities of fluctuating (Fl) and nonfluctuating (NFl) work parameter W . It will be convenient to take the parameters to be fixed so that they are the same for all microstates. We introduce the Legendre transform  E k L of the microenergy E k , which proves to be very useful in expressing p k . The discussion justifies that once S Z has been identified in which M becomes uniquely specified, the microstate probabilities are also uniquely specified. No auxiliary step is required to determine p k . This is what makes the μ NEQT so useful. The discussion is easily extended to consider a microstate that is not unique in S Z .
So far, we have provided a complete formulation of the μ NEQT for any M arb at each instant. To proceed further to extend the μ NEQT for any process, we need to introduce a trajectory ensemble and determination of various path and process quantities, which is taken over in Section 13. We show that different trajectory quantities have different trajectory probabilities (path microprobabilities), which has not been appreciated so far. This finally provides a complete description of the μ NEQT for any process.
We now turn to some of the applications of the μ NEQT in the next three sections. In Section 14, we use it to describe the origin of microfricton at the microstate level. A new NEQ work fluctuation theorem is derived in Section 15 between any two arbitrary macrostates. In Section 16, we use the μ NEQT to study the quantum and classical free expansion using our work fluctuation theorem. The final section provides a brief discussion of our conclusions and a summary.

2. Notation, Definitions and New Concepts

Before proceeding further, it is useful to introduce in this section our notation to describe various systems and their behavior, and new concepts for their understanding without much or any explanation (that will be offered later in the review where we discuss them). We also give various definitions and briefly discuss new concepts such as various forms of NEQ work and heat that need to be carefully distinguished for a precise formulation of the μ NEQT. Various important concepts are highlighted in the form of Remarks to draw the attention of the reader. It is the hope that a reader can always come back here to be refreshed in case of confusion. In this sense, this section plays an important role in the review for the purpose of bookkeeping.

2.1. Systems and State Variables

Definition 1. 
A system Σ is a collection of material particles and radiation enclosed in a region of space defined by some parameter W , and its Hamiltonian dynamics is determined by the Hamiltonian H ( x W ) . A system can be embedded in a medium Σ ˜ , which is extremely large compared to it, and with which it interacts. The combined system Σ 0 formed by Σ and Σ ˜ is commonly treated as an isolated system.
Remark 2. 
For convenience, we take the parameter W to be fixed so that it is the same for all microstates, even though it is not hard to take it to be unfixed so that it changes over the microstates. We will refer to them as nonfluctuating and fluctuating, respectively.
We draw attention to Figure 1 to introduce the notation. The  review mostly deals with statistical mechanics of macroscopically large systems Σ ; however, we will also digress a bit to discuss small systems. In both cases, Σ is extremely small compared to the medium Σ ˜ ; see Figure 1b. The medium Σ ˜ consists of two parts: a work source Σ ˜ w and a heat source Σ ˜ h , both of which can interact with Σ directly but not with each other. This separation allows us to study work and heat exchanges between Σ and Σ ˜ separately. We will continue to use Σ ˜ = Σ ˜ w Σ ˜ h to refer to both of them together. The collection Σ 0 = Σ Σ ˜ forms an isolated system, which we assume to be stationary. We remark that the concept of an isolated system in a laboratory is an important approximation [79,162] but extremely useful as no such system locally exists in reality. We need to always keep this in mind.
The system in Figure 1a is an isolated system, which we may not be able to divide into a medium and a system. Each medium in Figure 2, although not interacting with each other, has a similar relationship with Σ . In case they were mutually interacting, they can be treated as a single medium. The collection Σ 0 = Σ Σ ˜ 1 Σ ˜ 2 forms an isolated system. In the following, we will mostly focus on Figure 1 to introduce the notation, which can be easily extended to Figure 2 or to an extension with several mediums.
Definition 2. 
Observables X = ( E , V , N , ) of a system are extensive quantities that can be controlled from outside the system, and internal variables   ξ = ( ξ 1 , ξ 2 , ξ 3 , ) are extensive quantities that cannot be controlled from outside the system. Their collection Z = X ξ is called the set of extensive state variables of Σ forming S Z , which we may simply write as S when no confusion will arise. The set W or a subset of it may be fixed or may fluctuate over all microstates of Σ.
Definition 3. 
A system-intrinsic (SI) quantity is a quantity that pertains to the system Σ alone and can be used to characterize the system. A medium-intrinsic (MI) quantity is a quantity that is determined by the medium Σ ˜ alone and can be used to characterize it and also the exchanges between Σ and Σ ˜ . No external exchange is allowed for Σ 0 .
We use a suffix 0 to denote all quantities pertaining to Σ 0 , a tilde ( ˜ ) for all quantities pertaining to Σ ˜ , and no suffix for all quantities pertaining to Σ , even if it is isolated. Thus, the set of observables is denoted by X 0 , X ˜ , and X , respectively, and the set of state variables by Z 0 , Z ˜ , and Z , respectively, in the state space S Z ; the set of internal variables are ξ 0 , ξ ˜ , and ξ , respectively.
Remark 3. 
We will use the term “body” to refer to any of   Σ , Σ ˜ , and  Σ 0 in this review and use Σ b to denote it. However, to avoid notational complication, we will use the notation suitable for Σ for Σ b if no confusion would arise in the context. The mechanical aspect of a body is described by its Hamiltonian H ( x W ) , and we refer to all quantities pertaining to it as body-intrinsic (BI), which includes SI, MI, and ISI (for the isolated system) as the case may be.
The discussion below is mostly for a body Σ b , but the notation is suited for a system. Thus, it covers the three systems Σ , Σ ˜ , and  Σ 0 , unless mentioned otherwise.
Definition 4. 
A microstate of H ( x W ) represents the instantaneous deterministic state of Σ b . The quantum microstates are specified by a set of good quantum numbers, which we usually denote by k as a single quantum number for simplicity; we take k N , N denoting the set of natural numbers. In the classical case, we use a small cell δ x k of volume h 3 N around x k = x as the microstate m k [163]; the collection δ x k covers the entire phase space Γ. A microstate m k appears with probability p k that is central for statistical mechanics.
Below, we clarify the definition further.

2.2. Microstates and Macrostates

Remark 4. 
In order to obtain a microscopic understanding of thermodynamics, we need to focus on the countable set of microstates m k k = 1 , 2 , . Then
E k = H ( x k W )
denotes the microenergy of m k . In explicit form, the microenergy for m k of Σ will be expressed as E k ( W ) , for  m ˜ k ˜ of Σ ˜ it will be expressed as E ˜ k ˜ ( w ˜ ) (see Equation (28a)), and for m 0 k 0 of Σ 0 it will be expressed as E 0 k 0 ( W , w ˜ ) (see Equation (28b)).
Remark 5. 
For clarity and ease of presentation, we will assume each microstate to be nondegenerate, i.e., a singlet. Extending the discussion to degenerate microstates is trivial, as discussed in Section 15.
We now identify microstates m k . In quantum mechanics, they refer to the countable microstates of the Hamiltonian of a bounded body, with k denoting the set of quantum numbers. In classical mechanics, they are usually identified as follows. We will normally employ a discretization of the classical phase space Γ of a bounded system by dividing it into countable nonoverlapping cells δ x k , centered at x k and of some small size, commonly taken to be 2 π 3 N . The cells cover the entire phase space Γ . To  account for the identical nature of the particles, the number of cells and the volume of the phase space are assumed to be divided by N ! to count distinct microstates m k δ x k , indexed by k  = 1 , 2 , ; the center of δ x k is at x k . The energy and probability of these cells are denoted by E k , p k in which E k ( W ) is a function of W . The microstates obey deterministic evolution of the Hamiltonian H ( x W ) of the body. For  Σ b = Σ ˜ , m ˜ k ˜ appear with probabilities p ˜ k ˜ ; for Σ b = Σ 0 , m 0 , k 0 appear with probabilities p 0 k 0 .
With the discretization, we will use the same symbol Γ to denote the space occupied by microstates m k .
Claim 1. 
It is through the changes in microstate probabilities that a thermodynamic process P gets its stochastic nature. In contrast, constant p k ’s describe a mechanical process, which is deterministic.
A thermodynamic process P between any two arbitrary states (we will instead use P ˚ to denote a process between two equilibrium (EQ) terminal microstates) is understood in the context of the MNEQT [12,51] as a temporal sequence of macrostates M ( t ) of the body which keep changing during P due to changes in { m k ( t ) } and/or { p k ( t ) } . The rate of time variation (fast or slow compared to the equilibration time τ eq ) determines the (reversible or irreversible) nature of P .
Definition 5. 
At the microscopic level, the state of Σ b is specified by microstates set m k , their energy set E k , and their probability set p k . For the same set m k , E k , different choices of p k describe different macrostates M (see Definition 6), one of which, M e q , corresponding to p k e q , specifies an EQ macrostate having the maximum entropy; all other states have smaller entropies and are called nonequilibrium (NEQ) macrostates.
It is important to draw attention to the following important distinction between the Hamiltonian H required for a microstate and the average energy E of a macrostate. While the thermodynamic energy accounts for the stochasticity through microstate probabilities, the use of the Hamiltonian is going to be restricted to a particular microstate. In other words, the Hamiltonian depends on x and W but the energy depends on the entropy S and W . The energy E k of m k , on the other hand, depends only on W and denotes the value of H associated with m k ; see Equation (8). In the following, we will always treat Hamiltonians and microstate energies as equivalent descriptions, which does not depend on knowing { p k } ; the average energies depend on { p k } for their definition; see Equation (12) with q = E and q k = E k .
Definition 6. 
A macrostate M in S is a collection m k , p k of microstates m k and their probabilities p k , k = 1 , 2 , for a Σ b . Quantities that are the same for all microstates are called macroquantities as they refer to the macrostates M . Quantities that refer to microstates are called microquantities, and  carry the suffix k when associated with the microstate m k such as X k or Z k , which are the microanalogs of X or Z , respectively; however, see Remark 14. We will simply use “quantity” to refer to both of these quantities in short.
For example, we will refer to d W k as the microwork; similarly, we will refer to d W ˜ k as the external microwork, d e W k as the exchange microwork, and  d i W k as the internal microwork. The corresponding macroworks are denoted by d W , d W ˜ , d e W , and  d i W . We thus see that there are various possible notions of works in NEQT.
A macrostate M is usually described by the state variable Z in thermodynamics but functions of Z can also be used to characterize M . They are all macroquantities. In statistical mechanics, microstates of the Hamiltonian are used to describe M at the microstate level.
Remark 6. 
Microquantities can be divided into two kinds: pure and mixed. A pure microquantity such as E k is determined solely by m k but not by M . A mixed microquantity such as microheat and microentropy is one that is also determined by M . With this caveat in mind, we will call both kinds microquantities.
We find the shorthand notation [12,13,51]
d α = ( d , d e , d i )
quite useful in the following for the various infinitesimal contributions. Thus, d α E k = d E k , d e E k , d i E k will refer to microenergy change, exchange microenergy change, and internal microenergy change, respectively. We similarly use d α Q k = d Q k , d e Q k , d i Q k for various forms of microheats, and  d α S k = d S k , d e S k , d i S k for various forms of microentropies; see Equation (27a) and Remark 14. In particular, the random variable dq should not be confused with the differential of q, which may not even be defined; see Remark 20. We will refer to d Q k and d S k as microheat and microentropy, respectively. The corresponding macroquantities are denoted by d α E , d α Q , and  d α S , respectively, without the index k. The following notation generalizes the physics of various infinitesimals and their relationship.

2.3. Micro–Macro Variables

Notation 1. 
We introduce the sets of state variables
χ k S k , Z k , χ S , Z , ζ k S k , W k , ζ ( S , W ) ,
and infinitesimals
d α θ k d α χ k , d α W k , d α Q k , d α θ d α χ , d α W , d α Q .
Notation 2. 
We introduce a compact notation q for the collection q k , q :
q q k , q .
and d α q to cover all of the following quantities:
d α q d α θ d α χ , d α W , d α Q .
Thus, χ χ k , χ S χ , ζ ζ k , ζ S ζ , d α χ d α χ k , d α χ S χ , etc. For specificity, we use χ k j and χ j to refer to the jth element of χ k and χ , respectively. Similarly, we use ζ k j , ζ j for the jth element of ζ k and ζ , respectively, and d α θ k j , d α θ j for the jth element of d α θ k and d α θ , respectively.

2.4. Random Variable and Average

Remark 7. 
In the language of probability theory, M ( t )  can be thought of as a random variable with outcomes  m k  with probability  p k ( t ) . A microquantity  q k  associated with  m k  appears with probability  p k ( t )  at time t. Thus,  q k  denotes an outcome of a random variable q, and usually forms a fluctuating (Fl) microquantity.
Definition 7. 
The ensemble average for q k or of the random variable  q  is defined by
A ^ q ( t ) = q ( t ) or q ¯ ( t ) or q ( t ) k p k ( t ) q k
for a countable set p k ( t ) that satisfies the sum rule
k p k ( t ) = 1
due to the conservation of probability. We can also extend Equation (12) to q for which q k = q , k . We have used A ^ to denote the above averaging operator in Equation (12).
In thermodynamics, it is customary to use the simpler notation q for q = A ^ q , which we will also follow in this review, such as E , S , etc., for the average energy, entropy, etc. However, we will also use the notation A ^ q , q ¯ or q , when clarity is needed, as we will see in Section 10 that such a convention can lead to confusion if care is not exercised. We wish to emphasize that A ^ q = q does not imply that A ^ = 1 , except when q k = q , k .
Remark 8. 
To avoid confusion with the notation d α χ k , which can either mean d α ( χ k ) as d α acting on χ k , or  d α ( χ ) k denoting the microquantity associated with d α ( χ ) , we will continue to use d α χ k for the former, and  d α χ ¯ k for the latter, where χ = χ ¯ stands for the macroquantity associated with χ k ; see Section 10.1 for details. However, we will simply use d α χ k in d α θ k to simplify the notation, but we will always use the specific notation when clarity is needed.
In this review, we will not consider a constant random variable. Hence, a random variable will always have fluctuating outcomes.
Notation 3. 
We use modern notation [13,51] and its extension (see Figure 1), which will be extremely useful to understand the usefulness of our novel approach. Any infinitesimal and extensive Σ b -intrinsic quantity dq ( t ) (see Equation (11b)) during an arbitrary infinitesimal process d P can be partitioned as
d q ( t ) d e q ( t ) + d i q ( t ) ,
where d e q ( t ) is the change caused by exchange (“e”) with the surroundings such as the medium and d i q ( t ) is its change due to internal or irreversible (“i”) processes going on within Σ b . As mentioned earlier, the term external quantity will also be used for an exchange quantity to emphasize its external nature in this review. The partition also applies to the outcome d q k as follows:
d q k ( t ) d e q k ( t ) + d i q k ( t ) ,
As an example, we have (see Equation (27a) for the definition of S k )
d E k = d e E k + d i E k , d S k = d e S k + d i S k
for Σ b ; here d i E k or d i S k does not have to vanish or have a particular sign even though d i E = 0 (see Equation (53a)) or  d i S 0 (see Equation (67c)). We see that the linear operators d α satisfy
d d e + d i .
Claim 2. 
An extensive quantity of Σ b is additive over its various macroscopic parts, but the energy E is usually quasi-additive; see Section 5.6.
For the sake of clarity, we will take V as a symbolic representation of X , and a single ξ as an internal variable in many examples. Then, w = ( V ) , W = ( V , ξ ) , and  Z = ( E , V , ξ ) .

2.5. Different States in NEQT

Definition 8. 
An equilibrium (EQ) macrostate is a uniform macrostate having the maximum possible entropy in S X .
Definition 9. 
A nonequilibrium macrostate can be classified into two classes:
(a)
Internal-equilibrium macrostate (IEQ): The nonequilibrium entropy S ( X , t ) for such a macrostate is a state function S ( Z ) in the larger nonequilibrium state space S Z spanned by Z ; S X is a proper subspace of S Z : S X S Z . As there is no explicit time dependence, there is no memory of the initial macrostate in IEQ macrostates.
(b)
Non-internal-equilibrium macrostate (NIEQ): The nonequilibrium entropy for such a macrostate is not a state function of the state variable Z . Accordingly, we denote it by S ( Z , t ) with an explicit time dependence. The explicit time dependence gives rise to memory effects in these NEQ macrostates that lie outside the nonequilibrium state space S Z . An NIEQ macrostate in S Z becomes an IEQ macrostate in a larger state space S Z , Z Z , with a proper choice of Z .
Definition 10. 
An arbitrary macrostate M a r b of a system refers to all possible thermodynamic states, which include EQ macrostates, and NEQ macrostates with and without the memory of the initial macrostate. From now on, we denote an arbitrary macrostate by M , NEQ macrostates by M n e q , EQ macrostates by M e q , and IEQ macrostates by M i e q .
Different choices of p k for the same set m k , E k describe different macrostates for a given W , one of which corresponding to p k eq uniquely specifies the EQ macrostate M eq ; all other states are called NEQ macrostates M n eq . Among  M neq are some special macrostates M ieq that are said to be in internal equilibrium (IEQ); the rest are nonIEQ macrostates M nieq . An arbitrary macrostate M refers to either an EQ or an NEQ macrostate; the latter can be either M ieq or M nieq .

2.6. Mechanical Description

Claim 3. 
There are two distinct approaches to handling state variable W for a macrostate m k , p k of Σ b ; see Remark 2 and Definition 2.
  • Nonfluctuating (NFl) approach: It can be treated as a nonfluctuating (fixed) parameter in the Hamiltonian of Σ b so that it is the same for all of its microstates. If we alter W , it changes the same way for all m k ’s. We say that W is a NFl-parameter over m k ’s. This results in fluctuating generalized microforce
    F w k E k / W
    over m k ’s, with its ensemble average (see Equation (12)), given by the generalized macroforce
    F w k p k F w k = E / W .
    Even though F w k is a microvariable, we find it useful conceptually to think of it as the outcome of a random variable F w on m k . We use the notation W , F w k to compactly refer to this case.
  • Fluctuating (Fl) approach: Alternatively, we let W fluctuate over m k ’s and think of it conceptually as a random variable W with outcomes W k , even though W k is a microvariable. To be consistent with the NFl-approach (see below), we require that F w becomes nonfluctuating (fixed) defined by
    k , E k / W k = E / W = F w .
    In this view, the macroforce F w is fixed (so it is the same for all macrostates) with the result that m k ( W k ) is determined by the fluctuating random variable W over m k ’s, with its average (see Equation (112)) given by
    W k p k W k .
    We use the notation W k , F w to compactly refer to this case.
The same two approaches apply as well if we replace W by w , and  F w by f w in the above equations.
Claim 4. 
The presence of a parameter in the Hamiltonian H ( x W ) of the body brings forth the Legendre-transformed Hamiltonian H L ( x W L ) as the most important quantity to consider, where W w L is the work parameter in H L ; see Section 6.3.
Claim 5. 
The nonfluctuating (NFl) parameter W results in fluctuating (Fl) microfield F w k that plays the role of W L , and fluctuating W k results in a NFl workfield F w that plays the role of W L . As noted in Remark 2, we find it convenient to take W and F w as the parameters, respectively, as will become clear later in the review.
We provide an intuitive understanding of the two approaches. For the NFl- W , we use the microstates m k of H ( x W ) so that every microstate is specified by the same W . If we use the same Hamiltonian H ( x W ) for the Fl- W case, this will require considering different Hamiltonian H ( x k W k ) for different microstates so that their slopes are all equal to ( F w ); see Equation (18). This is quite cumbersome. It is well-known that in this case, it is most convenient to consider H L ( x F w ) with W L = F w so that every microstate is specified by the same W L , which plays the role of the work-parameter in H L ( x F w ) ; see Section 6.3.
Remark 9. 
We now explain the concept of consistency noted above. Consider E k ( W ) for some microstate m k in the NFl-approach, and determine F w k at some W . Using the variation d W , we determine the change d E k N F l = F w k · d W . In the Fl-approach, we choose that particular value W k at which E k has the NFl slope F w as shown in Equation (18). We emphasize that only the particular W k is considered that satisfies Equation (18). We then determine the variation d W k so that d E k F l = F w · d W k , as follows from Equations (17a), has exactly the same value as d E k N F l . Therefore, we do not have to distinguish between d E k N F l and d E k F l , and use the simpler notation d E k for both of them. As a consequence, we can make the following:
Claim 6. 
We have the same microwork in both approaches:
d W k = F w k · d W = F w · d W k = d E k .
Remark 10. 
In the NFl approach, we introduce the Legendre transform
E k L , N F l ( F w k ) = E k ( W ) + F w k · W ,
as a function of F w k with
W = E k N F l ( F w k ) / F w k .
In the Fl approach, we introduce the Legendre transform
E k L , F l ( F w ) = E k ( W k ) + F w · W k ,
as a function of F w with
W k = E k N F l ( F w ) / F w .
We see that the above definitions of the Legendre transform E k L of E k in the two approaches can be compactly denoted by
E k L ( b ) = E k ( a ) + Φ ( a , b )
in terms of a scalar function
Φ ( a , b ) a · b ;
see also Section 6.3. It is clear from Equation (23a) that it is sufficient to investigate the behavior of E k ; the behavior of E k L is easily obtained from it. Therefore, we will mostly focus on E k in the review.
Remark 11. 
As microstates m k play the central and important role in our approach involving the Hamiltonian, the microstate energies E k represent the outcomes of a random variable E  over the microstates. Thus, we always deal with a fluctuating microstate energy. Consequently, the corresponding “macroforce”
f s E / S = T
(see Equation (1) or equivalently Equation (129)) always appears as a NFl-parameter for Σ b , which can be combined with f w and F w as
f f s , f w , F f s , F w
to represent the relevant macroforces.
Remark 12. 
It follows from the above Remark that we can either consider the case W , F w k or W k , F w . In  both cases, we obtain the same thermodynamics. A NFl parameter can be treated as a deterministic parameter for whichq k = q , k , as the probability of q  is unity (certainty).
Remark 13. 
The work parameter may be a function of time t. By taking one of the components of the work parameter w to be simply t, it is also possible to include t as a separate parameter in H ( x t , W ) as is common in mechanics [164].
Definition 11. 
In general, p k are functions of the microquantity X k or Z k in S X or S Z , respectively, and are implicit functions of t through the latter; they may also depend explicitly on time t if not unique in the state space. For an EQ or an IEQ macrostate, p k have no explicit dependence on t; see Section 12 for details. As  p k always satisfies the sum rule (see Equation (13)) over any M , it is also an ensemble quantity because of this, and should be treated as a mixed microquantity; it is not determined by m k alone so it is not a true microquantity.
Definition 12. 
The collection m k , p k provides a complete microscopic or statistical mechanical description of thermodynamics of any arbitrary macrostate M in some state space S in which one deals with macroscopic or ensemble averages using p k (see Definition 7) over  m k of microstate variables.

2.7. Entropy and Stochastic Description

Definition 13. 
A state function entropy S for M e q or M i e q is defined thermodynamically by the Gibbs fundamental relation up to a constant.
Definition 14. 
Statistical entropy S, often called the Gibbs entropy, for  M is defined by its microstates by the Gibbs formulation (see Equation (116)),
S S = k p k S k = k p k ln p k ,
with its differential given by
d S = d S = k ( η k + 1 ) d p k k η ^ k d p k
where  S k is defined by
S k η k ln p k ;
in terms of Gibbs’ index of probability ([48], p. 16)
η k ln p k ,
and where we have also introduced
η ^ k η k + 1 .
Remark 14. 
The quantity S k and any deterministic function of it are mixed microquantities for the simple reason that p k satisfies the sum rule in Equation (13), which requires considering all the microstates; see also Definition 11. However, S is a macroquantity that is also a state variable.
This property of S k should not be forgotten.
Remark 15. 
Being additive, S is extensive. As a consequence, S k must be extensive.
As Σ ˜ is taken to be in EQ, its Hamiltonian is defined by its observable X ˜ ; the internal variable ξ ˜ plays no role. Thus, we will express its Hamiltonian as
H ˜ ( x ˜ w ˜ ) .
We will also assume that Σ ˜ is weakly interacting with Σ , a point discussed carefully in Section 5.6. By neglecting their mutual interaction, we have quasi-additivity of their Hamiltonians to determine the Hamiltonian of Σ 0 :
H 0 ( x 0 W , w ˜ ) H ( x W ) + H ˜ ( x ˜ w ˜ ) ,
and states the quasi-additivity of the microstate energies; see Equation (119). We also assume the following additivity in this case:
W 0 W + w ˜ ;
see also Equation (118a).

2.8. Reduction

Very often, we need to define an ensemble average over a composite system such as Σ 0 formed by two or more systems. We focus on Σ 0 = Σ Σ ˜ . A microquantity q 0 k 0 associated with Σ 0 may also refer to a microquantity q k associated with Σ , or a microquantity q ˜ k ˜ associated with Σ ˜ .
Definition 15. 
The ensemble average over m 0 k 0 of a composite microquantity  q 0 k 0  of  Σ 0 is given by the joint probability
p 0 k 0 p ( k k ˜ ) p k ˜ = p k p ( k ˜ k )
to be used in the following two equivalent ways:
q 0 = k p k k ˜ p ( k k ˜ ) p k p k ˜ q 0 k 0
= k p k k ˜ p ( k ˜ k ) q 0 k 0 .
This averaging is properly discussed in Section 7. The  conditional probabilities  p ( k k ˜ ) and p ( k ˜ k ) contain all the information about the correlation between Σ and Σ ˜ due to their mutual interaction, which will be considered in detail in Section 5.6 and Section 7. Here, we use the above definition to define the conditional microquantity q 0 k given that Σ is in the microstate m k .
Definition 16. 
The reduction of the composite microquantity  q 0 k 0 to a conditional Σ 0 -microquantity q 0 k  is defined by
q 0 k k ˜ p ( k k ˜ ) p k p k ˜ q 0 k 0 = k ˜ p ( k ˜ k ) q 0 k 0 .
Here, the conditional microquantity q 0 k associated with Σ 0 carries the suffix k and not k 0 , and is obtained under the condition that Σ is in the microstate m k , and requires conditionally averaging over all the microstates m ˜ k ˜ of Σ ˜ using the reduced or conditional probability p ( k k ˜ ) / p k ; see Section 7 for details.
It is evident, but also easily verified, that the conditional microquantity associated with Σ is the same as q k . For  q ˜ k ˜ , we find that
q ˜ k k ˜ p ( k ˜ k ) q ˜ k ˜ ,
and can be very different from q ˜ k ˜ .
Claim 7. 
When the two bodies in the above definition are quasi-independent (see Definition 28 and Section 7.3 for full details), then
p ( k k ˜ ) p k , p ( k ˜ k ) p k ˜ .
Remark 16. 
A composite microquantity χ 0 k 0 j and a medium microquantity χ ˜ k ˜ j are easily reduced to the conditional microquantities χ 0 k j and χ ˜ k j ascribed to m k , respectively, by  using quasi-independence condition as
χ 0 k j k ˜ p k ˜ χ 0 k 0 j , χ ˜ k j k ˜ p k ˜ χ ˜ k ˜ j = χ ˜ j ,
so that their averages following Equation (12) finally give χ ¯ 0 j and χ ˜ j approximately compared to its exact formulation in Equation (31). Here, the  conditional quantities χ 0 k j and χ ˜ k j require conditional averaging over all the microstates m ˜ k ˜ with their probabilities p k ˜ , given that Σ is in the microstate m k ; the reduced or conditional probability approximately becomes unity due to Equation (33). The last equation in Equation (34) follows from Theorem 1.
Remark 17. 
The above reduction plays a very important role in the formulation of the NEQ statistical mechanics (μNEQT) of the system Σ by reducing all microquantities in Σ 0 to conditional microquantities under the condition that Σ is in microstate m k .
For a medium microquantity q ˜ k ˜ , we obtain a very important result, which we quote as a Theorem because of its extreme importance.
Theorem 1. 
Under quasi-independence approximation, the conditional q ˜ k is simply given by the macroquantity q ˜ :
q ˜ k q ˜ , k .
Proof. 
By replacing χ ˜ k ˜ j by q ˜ k ˜ and χ ˜ k j by q ˜ k in Equation (34), we obtain the ensemble average on the right side, which proves the theorem. ☐
The application and general proof of this important theorem is deferred to Section 7.5, where it is restated slightly differently as Theorem 7, where we justify Remark 16, which is used in the simple proof given above.

2.9. Process Quantities

Remark 18. 
For a state variable  q S , Z for Σ b , its microstate analog  q k is trivially identified as the microstate value  q  takes on m k , and appears as the coefficient of p k in the right-hand side of Equation (12), the ensemble average. We now consider the process quantity d q k and consider its ensemble average
d q A ^ d q k k p k d q k
if we follow the convention adopted in Equation (12). However, d q above is not the same as
d A ^ q d q or d q ¯ ( t ) or d q k p k d q ¯ k d ( k p k q k ) ;
we have also introduced the microstate analog d q ¯ k for dq  or d q to make sure that we distinguish dq k and
d q ¯ k = d A ^ q k d q k + q k d η k ,
so that d q = A ^ d q ¯ . This distinction becomes very important for  q = E and S, as we will see in Section 10.1; see also Definition 23.
Definition 17. 
In mechanics, the generalized or BI-microwork by Σ b with parameter W is defined as the microwork done by the fluctuating microforce F w k
d W k F w k · d W = ( E k / W ) · d W = d E k ,
with F w k in its component form is given by
F w k = ( P k , . . . , A k ) = ( f w k , A k ) ;
here, ⋯ denotes microfields corresponding to the rest of the state variables in w besides V, and
f w k E k / w , A k E k / ξ ,
with A k representing the microaffinity.
With fluctuating W k , it is defined as the microwork done over the fluctuating generalized displacement d W k
d W k F w · d W k = ( E k / W k ) · d W k ;
see Claim 6. The generalized or BI-macrowork done by Σ b after ensemble averaging (see Equation (19)) in both approaches are the same:
d W = d W = F w · d W
Explicitly, we express F w in its component form as
F w = ( P ( t ) , . . . , A ( t ) ) = ( f w ( t ) , A ( t ) ) ;
see Figure 1. Here, ⋯ denotes the macrofields corresponding to the rest of the state variables in w besides V, and
f w E / w .
The SI-affinity
A E / ξ
corresponding to ξ [12,51] is nonzero, except in EQ, when it vanishes: A eq A 0 = 0 = 0 [13,51].
The SI-macrowork d W ξ done by Σ as the internal variable ξ varies is
d W ξ d i W ξ A · d ξ 0 .
Even for an isolated NEQ system, d W ξ will not vanish; it vanishes only in EQ, since ξ does no work when A 0 = 0 . Because of this, d e W ξ 0 so that d W ξ d i W ξ . However, f w , d W ˜ and d e W are unaffected by the presence of ξ .
Definition 18. 
In statistical mechanics, generalized or SI-microheat for Σ b is defined as
d Q k T ( η k + 1 ) d η k T η ^ k d η k ;
see Equation (255). The average of d Q k is the generalized or SI-macroheat
d Q k E k d p k T d S .
Remark 19. 
We will use “generalized” or “SI” interchageably in this review.
Conclusion 1. 
The SI-macroheat or the generalized macroheat d Q
d Q T d S ,
is identified as the Clausius equality; see Remark 42.
This interesting equality should be distinguished from the well-known Clausius inequality
d e Q T 0 d e S T 0 d S .
Thus, Equation (93a) allows us to uniquely identify generalized heat and work as independent of each other.
Remark 20. 
The d in d W , d Q , d W k , and  d Q k does not denote any differential operator on some quantity W , Q , W k , and  Q k , respectively. Conventionally, one uses a symbol đ or some other symbols in thermodynamics to emphasize this distinction. However, we follow the standard notation of mechanics for d W and d W k to emphasize these mechanical concept of work. We also use the same symbol for d Q and d Q k . If we extend Equation (36a) to also include d W k and d Q k , then we could also use d W ¯ and d Q ¯ for d W and d Q , respectively, but we will use the simpler notation d W and d Q . This should not cause any confusion.
It follows from Equations (45) and (46) that the irreversible macroheat is
d i Q = ( T T 0 ) d S + T 0 d i S ( T T 0 ) d e S + T d i S ;
see also Equation (142).
Definition 19. 
Changes in quantities such as S , E , V , in an infinitesimal process δ P are denoted by d α S , d α E , d α V , ; changes during a finite process P are denoted by Δ α S , Δ α E , Δ α V , . All of these are process quantities, which also include d α W , d α Q , Δ α W , and  Δ α Q .
Definition 20. 
The path γ P of a macrostate M is the path it takes in S Z during a process P . The trajectory γ k is the trajectory a microstate m k takes in time in S during the process P . The path and the trajectories are uniquely specified in S Z if m k are uniquely specified in it.      

2.10. Σ 0 (Isolated Body) and Σ ˜ (Medium)

Remark 21. 
As an isolated body cannot exchange anything with its surroundings, we must always have
d e θ 0 ( t ) 0 , d e θ 0 k 0 ( t ) 0 , d e θ 0 k ( t ) 0 , k 0 , k ;
see Definition 3. The last equality emerges from reduction; see Remark 16.
Remark 22. 
For a medium Σ ˜ , which is assumed to be in EQ and weakly interacting with and quasi-independent of the system Σ in microstate m k , we must have
d i θ ˜ ( t ) 0 , d i θ ˜ k ˜ ( t ) 0 , d i θ ˜ k ( t ) = 0
after reduction of d i θ ˜ k ˜ ( t ) from k ˜ to k. The last equality follows from Theorem 1 by replacing q ˜ k by d i θ ˜ k = d i θ ˜ , a NFl macroquantity, even though d i θ ˜ k ˜ is a Fl microquantity over m ˜ k ˜ .
Remark 23. 
As we always use microstates m k ’s with fluctuating energies E k , we find it useful and simple to use the notation in which W is fluctuating with W k over microstates. This means that we will consider the state variable Z fluctuating with Z k over microstates as if we are dealing with the fixed field approach with Z given by the extension of Equation (19)
Z = k p k Z k .
The approach also covers the fluctuating workfield approach if we simply replace each W k by a fixed W .

3. Mathematical Digression on d α

In NEQT, there are various forms of work and heat d W and d Q . Therefore, it is necessary to distinguish between them. Let us consider the Clausius equality in Equation (45) relating the SI-macroheat d Q and the entropy change d S . It would be naïve to take this equality to conjecture that
d α Q = T d α S ,
for the simple reason that the exchange macroheat d e Q is a MI-quantity so it must be determined by the medium alone. The presence of T in the above conjecture d e Q = T d e S raises doubts about the conjecture as T has nothing to do with the medium. Therefore, it is important to understand the role of the operators d α , which is explained in this section. This makes this section extremely important in the review.

3.1. Generalizing d d e + d i

The linear operators d α satisfy not only the identities in Equations (14a) and (14b), but also the following identities:
d α ( a q 1 + b q 2 ) = a d α q 1 + b d α q 2 , d α ( q 1 q 2 ) = q 1 d α q 2 + ( d α q 2 ) q 2 ;
here q 1 and q 2 are two extensive random variables, and a and b are two pure numbers.
The generalization of de Groot–Prigogine notation in Notation 3 provides a very compact description of NEQ processes in the μ NEQT. The original notation [13,51] is restricted to the entropy, particle number, energy, and volume changes d S , d N , d E , and d V , respectively, for Σ b ; see Figure 1 for d Z d X = d S , d N , d E and d V :
d S d e S + d i S ,
d N d e N + d i N ,
d E d e E + d i E ,
d V d e V + d i V ,
As no internal process can change the energy [12], we have
d i E 0 .
The surprising fact is that d i E k 0 , as we will establish below; see Theorem (6). Similarly,
d i V = 0 .
We have also assumed that d i N = 0 , but this is no consequence as we are assuming no chemical reaction in the review. We should emphasize that the partitions above have nothing to do with the partitions in Equations (238) and (247a), respectively. The original partition in Equation (52b) is not relevant in the review as we do not consider any chemical reaction, so d N d e N . Observe that the above partitions are defined only for macroscopic extensive observables for a body. We have extended the notation to not only all extensive state variables in χ but for d α W , d α Q for any body Σ b . We thus have
d W k = d e W k + d i W k , d Q k = d e Q k + d i Q k ,
d W = d e W + d i W , d Q = d e Q + d i Q ;
For Σ b an isolated system Σ 0 , it follows from Equation (48a) that
d e W 0 0 , d e Q 0 0 .
For Σ b a medium Σ ˜ , it follows from Equation (49) that
d i W ˜ 0 , d i Q ˜ 0 .
Note that d W , d Q , etc., do not represent changes in any SI-macrovariable; see Remark 20.
Remark 24. 
We mostly focus on q k or d α q k in the μNEQT, from which we obtain the information about the corresponding macroquantity  q  or d α q,  respectively, by ensemble averaging. The approach in this sense is to effectively discuss q or d α q , without explicitly showing the suffix k, unless clarity is needed. We will, however, use q k or d α q k when we consider specific cases.
We now consider the three systems separately for clarity below so we need q , d α q , q ˜ , d α q ˜ , and  q 0 , d α q 0 for Σ , Σ ˜ (not necessarily in EQ) and Σ 0 , respectively, which satisfy additivity for Σ 0 so that
q 0 k 0 = q k + q ˜ k ˜ , d α q 0 k 0 = d α q k + d α q ˜ k ˜ ,
where we have explicitly shown microstate indices for Σ , Σ ˜ , and Σ 0 ; here and in the following, q χ , and  d α q d α θ . For these equations to hold, we need to assume that Σ and Σ ˜ interact so weakly that their interactions can be neglected (recall that E is one of the possible q ) and that Σ and Σ ˜ are quasi-independent [148]; see Section 7.3. We also consider their partitions as shown in Equation (14a).
Remark 25. 
The medium Σ ˜ in Equation (56) need not be in EQ, so Equation (56) also applies to a system Σ consisting of two subsystems Σ 1 and Σ 2 interacting with each other satisfying quasi-additivity and quasi-independence. All we need to do is to take Σ 0 Σ , Σ Σ 1 , and  Σ ˜ Σ 2 . We can also have Σ embedded in a medium Σ ˜ , distinct from the previous Σ ˜ . It follows from Equation (56) that
q k = q k 1 + q k 2 , d α q k 0 = d α q k 1 + d α q k 2 ,
and
q = q 1 + q 2 , d α q = d α q 1 + d α q 2 .
Explicitly, we have
d q = d q 1 + d q 2 , d q k = d q 1 k 1 + d q 2 k 2 , d e q = d e q 1 + d e q 2 , d e q k = d e q 1 k 1 + d e q 2 k 2 , d i q = d i q 1 + d i q 2 , d i q k = d i q 1 k 1 + d i q 2 k 2 .
in which we must treat d e q j , d e q j k j , j = 1 , 2 , carefully. As usual, d e q is the exchange with Σ ˜ , but  d e q 1 , d e q 2 each have two exchanges; one exchange involving the suffix m is with Σ ˜ , and the other exchange is with the other subsystem. Thus, we have
d e q 1 = d e q 1 m + d e q 12 , d e q 2 = d e q 2 m + d e q 21 ,
in which d e q 12 , d e q 21 stand for mutual exchanges between the subsystems.
Remark 26. 
For an isolated Σ in Equation (58), we must have d e q k = d e q 1 m k 1 = d e q 1 m k 2 = 0 (see Remark 21), so
d e q 1 k 1 = d e q 2 k 2 .
Remark 27. 
It follows from Remark 26 that
d e W k = 0 , d e W 1 k 1 = d e W 2 k 2 .
We now turn back to discussing a system embedded in a medium as above, and prove the following important theorem.
Theorem 2. 
We consider the system Σ and the medium Σ ˜ (not necessarily in EQ) forming the isolated system Σ 0 . We prove two important identities that are extremely useful in the μNEQT:
d e q k d e q ˜ k ˜ = d q ˜ k ˜ + d i q ˜ k ˜ ,
d q 0 k 0 d q k + d q ˜ k ˜ = d i q 0 k 0 = d i q k + d i q ˜ k ˜ .
Proof. 
As Σ 0 is isolated, there cannot be any exchange quantity, so d e q 0 0 . It follows from Equation (60) that
d e q 0 k 0 = d e q k + d e q ˜ k ˜ 0 .
The identity in Equation (62a) immediately follows. Again using the second equation in Equation (56) for d α = d , and using d e q 0 k 0 = 0 proves the second identity, after using Equations (14a) and (14b) in d q 0 k 0 . This case is appropriate for treating Σ ˜ as another system. ☐
For Σ ˜ in EQ, d i q ˜ = 0 but not d i q ˜ k ˜ , as it is an outcome of a random variable d i q ˜ ; see Remark 22. Thus,
d q ˜ k ˜ = d e q ˜ k ˜ + d i q ˜ k ˜ ; d α q 0 k 0 = d α q k + d α q ˜ k ˜ ,
which should undergo reduction as our interest is to investigate Σ in m k . This is done in Section 7.5, where we find that
d e q k = d e q ˜ k = d e q ˜ = d e q , k ,
showing that exchange microquantities are not random variables; see Theorem 7. For the macrostate, we have
d q ˜ = d e q ˜ = d e q , d q 0 = d i q .
For q = Z and Σ ˜ in EQ, we have from Equation (62b) and the general additivity
Z 0 k 0 Z k + Z ˜ k ˜ ,
obtained by extending Equation (28c), the identity
d Z 0 = d e Z + d i Z + d Z ˜ = d i Z ,
which shows that
d e Z = d Z ˜ = d e Z ˜
in accordance with Equation (62a). We thus have
d Z 0 k = d i Z k , d Z 0 = d i Z ,
where all quantities pertaining to m 0 k 0 have been reduced (see Definition 4) and we have used the fact that after reduction (see Remark 22),
d i Z ˜ k = d i Z ˜ = 0 , k .
For q = E for a macrostate M , we have
d E 0 k = d i E k , d E 0 = d i E 0 = d i E = 0 ;
the last equation follows from Equation (53a).
For q = S for a macrostate M , we have the standard result
d S 0 d i S ,
from which we obtain
d S 0 k d i S k
giving the internal entropy generation, which has no particular sign, and
d S 0 = d i S 0
for the irreversible entropy generation. We similarly have
d W 0 k = d i W k , d Q 0 k = d i Q k ,
after reducing all quantities pertaining to m 0 k 0 . For a macrostate M ,
d W 0 = d i W 0 ; d Q 0 = d i Q 0 ;
see Equation (145). Here, d i W and d i Q are the irreversible macrowork done by and macroheat generation due to internal processes in Σ ; see Theorem 4.
Claim 8. 
The nonnegative inequalities for macroquantities d i q in the above equations are in accordance with the second law, where ensemble averaging at each instant plays a central role. Because of this relationship with the second law, we call these quantities irreversible. There is no sign requirement for corresponding microquantities d i q k that do not require such averaging. To make this clear distinction, we call these microquantities simply internal.
The discussion above finally justifies Conclusion 2 of several micro- and macroworks that are distinct in nature. Intuitively, the generalized microwork d W k denotes the mechanical work done by the system, a part
d e W k = d e W = d e W ˜ = d W ˜
of which is transferred to Σ ˜ w through exchange and d i W k is internally spent to overcome internal processes due to the microforce force imbalance ( μ FI) within Σ . Of the three, only d W k and d i W k are the outcomes of random variables d W and d i W , respectively.
Similarly, there are several micro- and macroheats that are distinct in nature. Of  d Q k , a part
d e Q k = d e Q = d e Q ˜ = d Q ˜
is transferred from Σ ˜ h through exchange and d i Q k is internally generated by internal processes within Σ . Of the three, only d Q k and d i Q k are the outcomes of random variables d Q and d i Q , respectively. Similar comments also apply to d S k , d e S k , and d i S k .
What has been said above can be summarized as follows (also see Claim 15):
Summary 1. 
dq k = ( d S k , d E k , d W k , d Q k )  and  d i q k = ( d i S k , d i , E k , d i W k , d i Q k ) are random variables and fluctuate around their respective averagesdq and d i q, so they have values on both sides of their averages.
This justifies Remark 22.

3.2. Consequences of Theorem 60

We show the importance of the above theorem about exchange microquantities, which is why they have been extensively exploited in modern NEQ statistical mechanics ( μ ˚ NEQT). We will only consider the case of fixed work parameter so we have fluctuating microforces associated with the random variable F w . The discussion is easily extended to fluctuating work parameter. We first consider an NEQ Σ in the microstate m k . The microwork d W k is given in Equation (37a); see also Equation (40). The same equation, applied to Σ ˜ in the microstate m ˜ k ˜ , gives
d W ˜ k ˜ f ˜ w k ˜ · d w ˜ = f ˜ w k ˜ · d e w ˜ = f ˜ w k ˜ · d e w ,
where we have used the fact that Σ ˜ is in EQ so E ˜ k ˜ does not depend on the internal variable ξ ˜ (see Equation (28a)), so that F ˜ w k ˜ f ˜ w k ˜ , d W ˜ d w ˜ = d e w ˜ , and  d i w ˜ = 0 . We have also used Equation (65b) to set d w ˜ = d e w in the last equation. Thus,
d W ˜ k ˜ = d e W ˜ k ˜ = d e W k ,
where we have also used Equation (70a) to derive the last equation.
Remark 28. 
A careful reader will notice that we have an equality between two quantities having different and independent suffixes k ˜ and k. This implies that we can change one index, say k, and not change k ˜ . As the equality again remains valid, both sides must be independent of the suffixes. It will be justified later in Section 7.5 in a different way.
Thus,
d W ˜ k ˜ = d W ˜ = d e W ˜ , d e W k = d e W ,
and
d e W = d W ˜ = d e W ˜ .
This is consistent with Equation (64b) as expected. Explicitly, we have
d e W ˜ k = f ˜ w · d e w = d W ˜ , f ˜ w k ˜ p k ˜ f ˜ w k ˜ = f 0 w , k ,
where f 0 w refers to Σ 0 . Thus, the exchange microwork is
d e W k f 0 w · d e w = d e W , k .
We now identify the internal microwork d i W k :
d W 0 k = d W k + d W ˜ k = d W k d e W = d i W k ,
and is explicitly given by
d i W k = ( f w k f 0 w ) · d e w + f w k · d i w + A k · d ξ ,
where we have allowed the possibility of an internal change d i w = d w d e w , similar to d i ξ = d ξ . Such a situation arises if w refers to polarization or magnetization, which can change due to internal processes.
We now turn to the physical significance of the three different terms in the internal microwork d i W k :
  • The first term is the internal microwork due to force imbalance f w k f 0 w between the SI-microforce of Σ , and the MI-macroforce of Σ ˜ .
  • The second term is the internal microwork due to the internal displacement d i w by the SI-microforce f w k of Σ .
  • The last term is due to the internal variable displacement by the SI-microaffinity A k .
We introduce the internal microforce imbalance ( μ FI, μ for micro) between Σ and Σ ˜ , and the internal SI-microforce
Δ F w k ( f w k f 0 w , A k ) , f w k ,
respectively, and the corresponding displacements
( d e w , d ξ ) , d i w
to reproduce Equation (75).
The corresponding macroforce imbalance and the internal macroforce are given by
Δ F w = ( f w f 0 w , A ) , f w ,
with the same displacements as above. Here, we will take a more general view of A , and also extend its definition to X . For  w , this means that we can treat f w f 0 w also as an affinity. By including Δ F h T 0 T also as an affinity [134], we can include it with Δ F w to form an extended set of thermodynamic forces or macroforce imbalances [51]:
Δ F ( T 0 T , f w f 0 w , A ) .
Claim 9. 
The extended set Δ F of thermodynamic forces in Equation (76d) must vanish in EQ. However, Δ F w k need not vanish even in EQ.      

3.3. Some Simple Examples

As an example, we focus on the case with W = ( V , ξ ) , w ˜ = ( V ˜ ) . The corresponding f 0 w is replaced by P 0 of Σ ˜ so that (setting d i V = 0 )
d W k = P k d V + A k d ξ ,
d e W k = P 0 d V , k , d e W = P 0 d V ;
d i W k = ( P k P 0 ) d V + A k d ξ , k ;
we can identify the two internal parts
d i W k V = ( P k P 0 ) d V , d i W k ξ = A k d ξ
that make up d i W k . The corresponding macroworks are given by
d W = P d V + A d ξ , d e W = P 0 d V ,
d i W = ( P P 0 ) d V + A d ξ .
The results d e W = P 0 d V , d i W = ( P P 0 ) d V in the absence of ξ are well-known in classical thermodynamics [51]. We identify the irreversible macrowork d W V and d W ξ due to V and ξ , respectively, from Equation (78):
d i W V = ( P P 0 ) d V 0 , d i W ξ = A d ξ 0 .
The above example describes a possible NEQ situation in Figure 3a of a gas of volume V in a cylinder with a movable piston forming the system Σ described by W = ( V , ξ ) by considering its microstate M very close to M eq so that only one internal variable is sufficient to describe it uniquely by treating M = M ieq . A possible choice of ξ can be rationalized as follows. We imagine the gas to be divided into two parts of volumes V 1 , V 2 and uniform number densities n 1 , n 2 , respectively, by an imaginary wall, with the region next to the piston designated as V 1 . The entire volume is not uniform if n 1 n 2 , which we assume. We now define
ξ V 1 / n 1 V 2 / n 2 ,
recalling that V = V 1 + V 2 ; see Section 4 for a generalization to describe M = M ieq far away from M eq that will require many internal variables. In a given microstate m k , the SI-pressure P k and the affinity A k form the corresponding microforce F w k = ( P k , A k ) (see Equation (17a)) with
P k = E k / V , A k = E k / ξ .
The corresponding generalized microwork d W k is given in Equation (77a). For the medium, the generalized microforce f ˜ w k ˜ = P k ˜ determines the generalized microwork d W ˜ k ˜ = P k ˜ d V ˜ = P k ˜ d V in Equation (71a). The conditional microforce f ˜ w k from Theorem 1 is equal to f ˜ w = P 0 , k ; here, P 0 is the external pressure on the piston. Thus, the conditional microwork is d W ˜ k = d W ˜ = P 0 d V so that d e W k = d W ˜ k = P 0 d V = d e W given in Equation (77b). The internal microwork in the gas is d i W k in Equation (77a).
The irreversible macrowork d i W = ( P P 0 ) d V + A d ξ must be nonnegative as we prove in Theorem 4. For  this to be true, each term must be nonnegative; see Equation (80). Indeed, it is easy to verify that d i W V ( P P 0 ) d V 0 . For  P > P 0 , the gas must expand so d V > 0 . For  P < P 0 , the gas must contract so d V < 0 . In both cases, the product satisfies the inequality.
This will become more clear by the following example of a spring discussed below.
The pressure difference Δ P = P P 0 (see Figure 3a) plays an important role as a macroforce imbalance in capturing dissipation. Only under mechanical equilibrium do we have the imbalance vanish ( Δ P = 0 ). This imbalance is a general feature but its importance at the microstate level in NEQ statistical mechanics has not been recognized. The following examples will make it abundantly clear that a nonzero microforce imbalance like Δ P k = P k P 0 is just as common even in classical mechanics whenever there is absence of mechanical equilibrium as in thermodynamics, EQ or otherwise. This is because the determination of various microforces and microworks are oblivious to any stochasticity; see Remark 30 for d α W = d α E w . As a consequence, there are no restrictions on the sign of d i W k as it is purely a mechanical quantity. Therefore, our second example below covers classical mechanics as well as thermodynamics; see also Conclusion 3.
Consider a general but purely classical mechanical one-dimensional massless spring of arbitrary Hamiltonian H ( x ) with one end fixed at an immobile wall on the left and the other end with a mass m free to move; see Figure 3b with vacuum and no fluid filling the cylinder. We consider a particular microstate m k of energy E k given by H . The center of mass of m is located at x from the left wall. The free end is pulled mechanically by an external force (not necessarily a constant) F 0 applied at time t = 0 and changes x; thus, x acts as a work parameter. We do not show the center-of-mass momentum p, as it plays no role in determining work.
Initially the spring is undisturbed and has zero SI restoring spring microforce F w k = E k / x ; see Equation (17a). The microwork done by F w k is the SI-work d W k = F w k d x as given in Equation (37a). The total microforce
F t k = F 0 + F w k
represents the microforce imbalance ( μ FI) F t k 0 as discussed later in Section 6.4; recall that F 0 and F w k point in opposite directions so F t k is a difference Δ F w k = F w k F 0 . There is no mechanical equilibrium unless Δ F w k = 0 and the spring continues to stretch or contract, thereby giving rise to an oscillatory motion that will go on forever. During each oscillation, Δ F w k is almost always nonzero, except when the mass is momentarily at the equilibrium (mechanical) position of the spring where Δ F w k = 0 . The SI-microwork done by F w k is the spring work (see Equation (37a))
d W k = d E k ,
while the microwork performed by the external source is d e W = F 0 d x . Being a purely mechanical example, there is no dissipation. Despite this, we can introduce using our notation
d i W k d W k d e W Δ F w k d x ;
this microwork can be of either sign (no second law here) and represents the work done by the μ FI F t k = Δ F w k . Thus,
Conclusion 2. 
d W k , d e W k and d i W k represent different kinds of mechanical work, a result that has nothing to do with dissipation but only with the microforce imbalance; among these, only the generalized work d W k is a SI microwork.

3.4. Manipulations with d α

As introduced in Equation (9), d α can be applied to micro- and macroquantities in the collection such as d α E k , d α W k , d α Q k , etc., and d α E , d α W , d α Q , etc.
Definition 21. 
Micropartition: The micropartition of the BI- d θ k ( t ) for Σ b is given in Equation (14b), in which d e θ k ( t ) is the change due to exchange with its surroundings, and  d i θ k ( t ) ( t ) is the internal change within Σ b .
The corresponding partitions for d E k and d S k are given in Equation (15), and those for d W k and d Q k in Equation (54a). For a Fl- W , we have
d W k d e W k + d i W k .
The micropartition also applies to d p k :
d p k d e p k + d i p k ,
We define
d α η k d α p k p k .
Definition 22. 
Macropartition: The macropartition of d θ ( t ) for Σ b is given in Equation (14a). It consists of two parts; the exchange d e θ is the change due to exchange with its surroundings, and d i θ is the irreversible change occurring within Σ b .
For the average in Equation (19) or for a NFl- W , we have
d W d e W + d i W .
In a process, χ undergoes infinitesimal changes d α χ k at fixed p k , or infinitesimal changes d α p k at fixed χ k . The changes result in two distinct ensemble averages or process quantities.
Definition 23. 
Infinitesimal macroquantities d α q , q χ = S , Z are ensemble averages
d α q m = d α q = A ^ d α q k k p k d α q k ,
at fixed p k so they are isentropic. They generalize the earlier definition in Equation (36a). We identify them as mechanical macroquantity and write them as d α q m for brevity. Infinitesimal macroquantities
d α q s q d α η k q k d α p k ,
which are ensemble averages involving d α p k with a concomitant change d S in the entropy. We identify them as stochastic macroquantities and write them as d α q s for brevity. Together, they determine the change d α q:
d α q d α q ¯ d α q w + d α q s , q S , Z .
Remark 29. 
The above equation shows that we must carefully distinguish  d α  q  = d α q ¯  and  d α  q  w = d α q ¯ ; their difference, the commutator C ^ α q, is the stochastic quantity d α q s , discussed in Section 10:
C ^ α q = d α q d α q m ;
see Equation (229).
Remark 30. 
For E, the above distinction is the content of the extension of the first law or the law of the conservation of energy
d α E = d α Q d α W .
We immediately identify that
d α Q = d α E s , d α W = d α E w .
For d α = d , d e , we have the SI- and MI-formulation of the first law given by (recall that d E d e E as d i E 0 )
d E = d Q d W ,
d e E = d e Q d e W .
Remark 31. 
The SI-formulation of the first law in Equation (93a) shows that d E can be uniquely partitioned into a stochastic component d Q determined by d S and a mechanical component d W determined by d W , which have independent origins.      
Traditionally, the first law is expressed in terms of the change in the energy caused by exchange quantities and is written as
d E = d e Q d e W .
As the exchange form of d E is written as d e E (see Equation (52c)), this is equivalent to the first law in Equation (93b).
We now prove the following important thermodynamic identity as a theorem [75,76,134,148,149].
Theorem 3. 
For any NEQ process P ,
d i Q d i W 0 .
Proof. 
For d α = d i in Equation (91), and using Equation (53a), we have
d i E = d i Q d i W = 0 ,
from which follows the following important thermodynamic identity d i Q d i W . We defer the proof of the inequality to a later part of the review. ☐
The above equality emphasizes the well-known fact (first discovered in 1798 by Count Rumford of Bavaria [165]) that the irreversible macrowork is always equal in its value but not in its cause (see later) to the irreversible macroheat. The inequality is governed by the second law. The analysis also demonstrates the important fact that the first law in Equation (93a) can be applied either to an exchange process in Equation (93b) or to an interior process in Equation (96). Indeed, in the last formulation, the law is also applicable to an isolated system for which it is replaced by
d E 0 = d Q 0 d W 0 = 0 .
Definition 24. 
For any body Σ b , we simply refer to d W k and d Q k as generalized or BI-microwork and generalized or BI-microheat or simply microwork and microheat, respectively. Similarly, we refer to d W and d Q as generalized or BI-macrowork and generalized or BI-macroheat or simply macrowork and macroheat, respectively. We will always refer to d e W k and d e Q k as exchange microwork and exchange microheat, respectively. We use exchange macrowork and exchange macroheat for  d e W and d e Q , respectively. As there is no irreversibility in mechanics, we use internal microheat for d i Q k and internal microwork for d i W k , respectively; see Claim 8. We never the use the prefix irreversible for these or other internal microquantities. We use irreversible macroheat for d i Q and irreversible macrowork for d i W , respectively.
As the system Σ is of primary interest in the μ NEQT, we will always reduce any microquantity associated with Σ ˜ and Σ 0 to refer to the microstate m k . Thus, all microquantities for any Σ b will carry the suffix k of m k . We will usually refer to d W ˜ k as the external microwork to distinguish it from the microwork d W ˜ k ˜ done by Σ ˜ in its microstate m ˜ k ˜ . We will use microenergy change, exchange microenergy change, and internal microenergy change for d E k , d e E k , and d i E k . We will refer to d α S k as microentropy change, even though both d α S k and d α Q k are mixed microquantities; see Remark 14.

4. Internal Variables

Let us consider two noninteracting mechanical systems Σ 1 and Σ 2 that form a composite system Σ , which we take to be isolated. We assume that both Σ 1 and Σ 2 are physically “similar” in that each requires the same set of NFl-state variable W having r components, so separately they are described by Hamiltonians E 1 k 1 = H 1 k 1 ( W 1 ) and E 2 k 2 = H 2 k 2 ( W 2 ) for m 1 k 1 and m 2 k 2 of Σ 1 and Σ 2 , respectively. We assume that the number of particles N 1 W 1 ( w 1 , ξ 1 ) and N 2 W 2 ( w 2 , ξ 2 ) are kept fixed in the two microstates so their total N is also fixed for each microstate m k of Σ given by
m k = m 1 k 1 m 2 k 2 .
As the particle numbers are fixed, we do not consider them to be part of the work sets anymore. We choose to express the combined Hamiltonian as
E k H k ( Z 1 , W 2 ) = H 1 k 1 ( W 1 ) + H 2 k 2 ( W 2 )
of m k , which is a function of 2 r + 2 state variables (which includes the microenergies E 1 k 1 and E 2 k 2 of Σ 1 and Σ 2 , respectively), from which we construct the following independent combinations:
Z Z 1 + Z 2 , ξ ^ Z 1 / n 1 Z 2 / n 2 ,
so that we can equivalently express H k ( Z 1 , W 2 ) as H k ( W ^ , ξ ) of 2 ( r + 1 ) variables, which excludes E k as explained below; here, n 1 = N 1 / N and n 2 = N 2 / N ,
W ^ W 1 + W 2 = ( w 1 + w 2 , ξ 1 + ξ 2 )
is the total initial work variable set, and  ξ is the new set of internal variables beyond those included in Z 1 and Z 2 . In addition, the excluded E k E 1 k 1 + E 2 k 2 is the microenergy of m k , and carries the suffix k. The choice of new arguments for H k ( W 1 , W 2 ) is convenient as it allows it to be expressed as H k ( W ) in terms of the set formed by 2 r + 1 variables
W ( W ^ , ξ ^ )
of the composite system Σ , as is also done for Σ 1 and Σ 2 . The set of internal variables
ξ ( ξ 1 + ξ 2 , ξ ^ )
denotes the set of internal variables for Σ .
Manipulating W will change the energy E k of Σ . Thus,
d E k = E 1 k 1 W 1 · d W 1 + E 2 k 2 W 2 · d W 2 .
It is easy to check that d E k is also given by
d E k = E k W · d W = E k W ^ · d W ^ + E k ξ · d ξ ^ ,
so both representations of H k are equivalent in all ways.
The choice of ξ ^ in terms of n 1 and n 2 ensures that it vanishes if the two systems form a uniform system Σ for which we must have Z 1 / N 1 = Z 2 / N 2 . However, other choices for ξ ^ can also be made as long as ξ ^ remains independent of W ^ .
Let us consider a simple example in which we only allow the energy E and volume V for each each system ( r = 1 ). We have ξ V , ξ E k as work variables in forming W . In this case, we have E k = E 1 k 1 + E 2 k 2 for the microstate energy and V = V 1 + V 2 for the total volume. By definition,
ξ E k = E 1 k 1 / n 1 E 2 k 2 / n 2 , ξ V = V 1 / n 1 V 2 / n 2 .
The microstate energy
E k ( V , ξ V , ξ E k ) = E 1 k 1 ( V 1 ) + E 2 k 2 ( V 2 )
is a function of three ( 2 r + 1 ) variables. We first consider ξ V . We have for P k , using Equation (81),
P k = n 1 P 1 k 1 + n 2 P 2 k 2 , A V k = n 1 n 2 ( P 1 k 1 P 2 k 2 ) ,
where we have used V 1 = n 1 V + n 1 n 2 ξ V and V 2 = n 2 V n 1 n 2 ξ V . As V is NFl, P k is Fl over m k , as we have learned.
We now use ξ E k to express E k 1 = n 1 E k + n 1 n 2 ξ E k and E k 2 = n 1 E k n 1 n 2 ξ E k . Differentiating Equation (104b) with respect to E k and ξ E k , respectively, and using Equation (42), we obtain
1 = n 1 + n 2 , A E = 0 ,
where A E (see Equation (18)) is NFl, so it has no suffix k.
As Σ is an isolated system, it is deterministic. So the observables ( E k 1 , E k 2 , V 1 , V 2 ) remain constant, which means that E k and ξ E k , k , will remain constant in time. If we allow a mutual interaction so that there is a possible energy (or volume) transfer between Σ 1 and Σ 2 , then this will be characterized by oscillating ξ E k and ξ V due to energy and volume transfers, respectively, back and forth between the two systems. On the other hand, if the interacting Σ 0 become stochastic, as discussed in Section 7, it will obey the second law and ξ E k and ξ V will eventually vanish. This case is studied later, where it is shown that macroheat flows from hot to cold.
The above discussion can be easily extended to a composite system composed of m > 2 subsystems by the trick proposed by Gujrati in ([77], Section 3). The trick is very simple. We use the collection W = ( W ^ , ξ ^ ) introduced above for the composite system. We consider two such composite systems, and introduce their work parameters W 1 and W 2 , which are used in Equation (103b) for each one of them. We now treat each as a system so that we have two new systems Σ 1 and Σ 2 that form a new composite system Σ . We use W 1 and W 2 to obtain the new collection of ( W ^ , ξ ^ ) as introduced above. This set defines a new W = ( W ^ , ξ ^ ) for the new composite system, which now has m = 4 subsystems. We then treat two such composite systems and treat each as a system to form another new composite system with m = 8 , and so on to finally consider a composite system formed of m subsystems. We thus claim the following:
Claim 10. 
The internal energy E k of the microstate m k of a composite system of m subsystems is a function of the work set W 1 , W 2 , , W m composed of their work parameters, and can be expressed as a function of
W ^ W 1 + W 2 + + W m
and a set ξ ^ of internal variables [77]; together, they form the set W for the composite system, as shown in Equation (102a).
Claim 11. 
We see that the new combination ξ is the set of internal variables, which also plays an important role in the unique description of the composite system. As the uniqueness is just as important in a thermodynamic consideration, which will be taken up in the following sections, internal variables will play just as important a role there as here.
The above discussion is for a mechanical system with no interaction, but is easily extended to the case in which the two systems are interacting, as will be done in the following sections. The internal variables discussed above relate to a particular microstate m k so some of them may carry the suffix k, and should be denoted as a internal microvariable ξ k . To see this, we recall that the microenergy E k carries the suffix k so any internal variable formed from microenergies of Σ 1 and Σ 2 will carry it as was the case for ξ E k constructed above. The discussion is also easily extended to include thermodynamics, where the internal macrovariable ξ obeys the restrictions imposed by the second law; see Equation (43) and Corollary 1. In this case, W k l of the lth subsystem will also include the internal variable ξ k l , not to be confused with ξ k for the system. It is clear that the complications due to ξ k l are avoided if each subsystem is in EQ so that ξ k k ’s do not exist, as was the simple example considered above. Then there is a maximum number n * of internal macrovariables in ξ that is determined by m. This has been discussed in recent publications [77,78], to which we refer the reader. By the addition of the suffix, it should be obvious that the above discussion is easily extended to Fl work parameter, such as Fl volume V k for m k , so that all microstates experience the same pressure P; see Equation (18). Thus, the above concept of internal variables is quite general. However, for the notational simplicity, we will not add the suffix to W and ξ unless needed for clarity by clearly specifying the situation.

5. Fundamentals of the μ NEQT

In this section, we will usually talk about a system, but the discussion is valid for any body Σ b . The most convenient and most common framework of describing a thermodynamic system Σ is in terms of the SI-set X = ( E , V , N , ) of its extensive macroscopic observables, which results in the SI-set f of the generalized macroforces (see Equation (25)) and the state space S X that is sufficient to uniquely describe the EQ system and its macrostate M eq . A very important SI quantity in thermodynamics is the entropy S that in EQ is uniquely determined by X so that S eq S ( X ) is a state function of M eq . For an NEQ macrostate M , S will not be a state function in S X , so it will depend explicitly on time. In this case, X no longer forms the set of state variables to uniquely describe M in S X , and both M and S have an explicit t-dependence; see Equation (141) for the latter. This is true whether the system is noninteracting (i.e., isolated) or interacting (i.e., interacts with a medium Σ ˜ , which is external to the system Σ ); see Figure 1.
With respect to microstates m k , the interaction between Σ and Σ ˜ causes MI-exchange d e X , which is then used to identify d i X d X d e X ; see Notation (11a). In general, the  SI-change d q can be partitioned into d e q and d i q in accordance with Equations (14b) and (14a), respectively, in which the MI-exchange between Σ and Σ ˜ is caused by their interaction and d i q is the change brought about by internal processes within Σ . In particular, d i q k represents the internal microchange, while d i q the irreversible macrochange. The  SI-force corresponding to w is [ f w ] ; see Equation (37c). There is no microanalog of f s introduced in Equation (24).
The above discussion is restricted to any M eq that is uniquely specified in S X , X = ( E , w ) . In an NEQ macrostate M neq , S X is no longer a convenient state space as it cannot specify M neq NEQ macrostate uniquely. This loss of uniqueness for M neq has been a major obstacle in formulating an NEQ thermodynamics that can be as robust and complete as the classical EQ thermodynamics. All competing NEQT approaches belong to M ˚ NEQT as discussed in Section 1 and deal only with exchange quantities that can be uniquely described in S X , as the medium Σ ˜ is always taken to be in EQ. Thus, they cannot offer any help to overcome the nonuniqueness of M neq .
We consider this loss of uniqueness to be the main issue in improving our current incomplete understanding of NEQ processes. Our approach to overcome this loss is to describe M neq in an appropriately enlarged state space to S Z by including internal variable set [12,13,18,42,51,108,134,148,166,167,168] ξ and identifying Z X ξ as the set of state variables to uniquely specify M neq . The internal variables also play a very dominant role in glassy and granular materials [169,170,171,172,173]. In all previous theories involving internal variables, they are introduced almost in an ad hoc manner without providing any physical insight into their origin. In contrast, our approach to introduce them differs from other approaches by providing a very clear and physical prescription, as discussed in Section 4. As  M eq describes a uniform system [33], M neq invariably requires some sort of nonuniformity, as in a composite system Σ = i Σ i composed of various subsystems Σ i . At the mechanical level, this nonuniformity is captured by the parameters of the SI-Hamiltonians of Σ i , as was the case with two subsystems in Equation (99). The internal variables as they appear in Equation (100) are mathematically required to ensure that the number of independent variables on both sides in Equation (99) are exactly the same. While their forms may not be unique, they must be independent. In terms of Z , we now have a complete SI-specification of m k of Σ , assuming a certain choice of ξ . This is the uniqueness we are looking for to develop the NEQ statistical mechanics. As  discussed in Section 4, ξ cannot be controlled from the outside of Σ . Therefore, its variation is due to internal processes only and may be controlled by the second law. It should be obvious from the discussion in Section 4 that ξ for a purely mechanical system such as m k cannot have any connection with the second law. Only in the presence of stochasticity required for a thermodynamic system will its average behavior be governed by the second law, so it also plays an important role in our approach. However, the requirement of including internal variables for a complete specification is a mechanical necessity due to nonuniformity, but becomes critical in the NEQ statistical mechanics. We direct readers to Section 5.7 for a simple example that clarifies its importance.
In the following, we will be considering the state space S Z in which the entropy is a state function S ( Z ) so that we will be dealing with M ieq ; see Definition 13. This means that p k are uniquely defined to specify M ieq . However, m k themselves are independent of this particular choice of p k , simply because m k are determined by the deterministic Hamiltonian of Σ as discussed in Section 1, so they remain oblivious to their probabilities. It is this independence of m k and p k that allows us to develop the μ NEQT as a mechanical theory that is modified by stochasticity by extending the conventional similar approach in the μ EQT [33,54].
Let us consider an infinitesimal change d Z in Z that takes M ieq = M ieq ( Z ) to M ieq = M ieq ( Z + d Z ) both belonging to S Z . If the system always stays within S Z during this change, then the change is carried out along an IEQ process in S Z . It is M ieq during this change so that d W k = d E k . If intermediate macrostates leave S Z during this change, then the change is not carried out along an IEQ process in S Z . Nevertheless, the microenergy change d E k = d W k between M ieq and M ieq is the same in both situations. In other words, d E k = d W k is the same between M ieq and M ieq , regardless of the nature of the process.
We will focus on an isolated composite system Σ in microstate m k made of two subsystems Σ 1 in microstate m k 1 and Σ 2 in microstate m k 2 ; recall Remark 25. Following from Remarks 21 and 26, we now conclude that
d q k d i q k = d q k 1 + d q k 2 .
In particular, we have
d i W k = d W k 1 + d W k 2 ;
we can use Equation (37a) for NFl W l and Equation (38) for Fl W k l , l = 1 , 2 , to determine d W k l , l = 1 , 2 .
Let us consider one of the above three bodies and focus on its W . For NFl W , the corresponding generalized microforce F w k is Fl as shown in Equation (17a). For Fl W k , the corresponding generalized microforce F w is NFl, as shown in Equation (18). Including E k , which is always FL, we see that Z for the body is Fl in the latter case.
As shown in Equation (20), the BI-microwork d W k = F w k · d W and d W k = F w · d W k defined mechanically as force × displacement in the two cases are the same, and are fluctuating over m k as expected due to the ubiquitous Fl microforce and Fl work parameter, respectively. The mechanically defined macrowork d W in each case will result in the irreversible macrowork d i W 0 in accordance with the second law. It follows from Equation (105) that each side represents a mechanical microwork, showing that even d i W k is a mechanical quantity. It follows from Theorem 6 that d i E k = d i W k , again emphasizing that d i W k has a mechanical origin. However, the second law puts no restriction on the Fl mechanical microanalog d i W . For the example of the spring with the force imbalance given in Equation (82) with NFl x, the internal microwork is given in Equation (84) and can be of any sign according to the signature of the internal microforce imbalance Δ F w k . In the presence of any microforce imbalance (see Conclusion 2) in an NEQ system, d i W k will not vanish, even if its average does. The following Remark emphasizes these points.
Remark 32. 
The internal microwork d i W k within an isolated Σ due to Fl internal microforces or Fl work parameter is ubiquitous. Its presence has a purely mechanical origin, as seen in Equation (84) or in Equation (78) for NFl W . For Fl W k , because of their mechanical nature, different additive parts of d i W k given in (78) are independent of p k in that they remain the same between M ieq and M ieq , both belonging to S Z , regardless of the processes between them. Despite this, the macroscopic analogs of each of these parts and d i W are controlled by the second law; see Corollary 1. It follows that in general, determining d i W k from SI- d W k will be a convenient way to discuss the statistical mechanics of NEQ systems; see Section 2.
We now put down the set of axioms for the formulation of the μ NEQT that are in addition to the axioms put forward by Callen [3]. Callen only discusses a system in equilibrium, so his two most important axioms are about the existence of the entropy function and of the stable equilibrium for EQ macrostates. We extend these axioms to NEQ macrostates below.
Axiom 1. 
Fundamental Axiom The thermodynamic behavior of a system is not the behavior of a single sample, but the average behavior of a large number of independent samples, prepared identically under the same macroscopic conditions at time  t = 0 .
Axiom 2. 
Axiom of Entropy Function Existence There exists an entropy function  S ( M )  for  M  in any state space, which may be a function of the state variables in that state space and time t.
Axiom 3. 
Axiom of IEQ Any  M neq  in  S Z  can always be turned into a unique  M ieq  in a suitably enlarged state space  S Z S Z , Z = Z ξ  so the thermodynamic and statistical entropies are identical; see Proposition 1 and Section 12.6 for details.
Axiom 4. 
Axiom of Stability The unique macrostate  M ieq  for a given  Z  is stable in  S Z  in that the system does not leave it if already there or returns to it if disturbed. A stable macrostate satisfies the stability conditions
d 2 S < 0 , d 2 E > 0 .
If we consider the matrix  J  formed by  2 S / Z j Z j or the matrix  K  formed by  2 E / ζ j ζ j then all the principle minors of the determinant of  J  must be strictly negativeor the determinant of  K  must be strictly positiveBy allowing  Z  to vary,  M ieq  moves to the most stable macrostate  M eq in which all thermodynamic forces (see Equation (76d)) vanish.
We do not consider the stability border d 2 S 0 , d 2 E 0 in the review.
It is an observed fact that nature, in her inorganic as well as organic forms, is driven towards greater stability. This tendency is just as ubiquitous in physics as it is in biology. Anything in nature that is capable of changing always changes eventually into an unchanging stable form, even in an explosion. This is also true of the Belusov reaction [51], undergoing oscillations initially but eventually ending into a stable macrostate.
Axiom 5. 
Axiom of quasi-additivity Any quantity  q  satisfies the principle of quasi-additivity
q j q j .
The above axiom also applies to S , the entropy, but requires the following additional axiom of quasi-independence, to be discussed later in Section 7.3.
Axiom 6. 
Axiom of Quasi-independence For entropy to be quasi-additive, as
S j S j ,
requires the property of quasi-independence (see Claim 7) between different parts of the system.
Axiom 7. 
Axiom of Reduction All microquantities carrying the suffix  k ˜  and  k 0 and associated with  Σ ˜  and  Σ 0 respectively, must be reduced to microquantities carrying the suffix k under the condition that  Σ  is in the microstate  m k  in order to assess their influence on  m k .
The discussion of the rules for reduction is postponed to Section 7.4.

5.1. Fundamental Axiom

To avoid any influence of the possible changes in the system brought about by measurements, we instead prepare a large number N S of samples or replicas under identical macroscopic conditions. The replicas are otherwise independent of each other in that they evolve independently in time. This is consistent with the requirement that different measurements should not influence each other. In the rest of this review, we will use the same term ensemble to collectively represent the samples. The average over these samples of some thermodynamic quantity then determines the thermodynamic property of the system. As the replica approach plays a central role in our formalism, we state its importance as Axiom 1, which was first proposed in [79].
Such an approach is standard in equilibrium statistical mechanics [11,33,34,36,54], but it must also apply to systems not in equilibrium. For the latter, this averaging must be carried out by ensuring that all samples have identical history, i.e., prepared at the same time t = 0 . This is obviously not an issue for systems in equilibrium. We refer the reader to a great discussion about the status of statistical mechanics and its statistical nature by Tolman ([54], Section 25), where he clearly puts down this viewpoint of statistical mechanics as follows. We quote from p. 65:
“The methods are essentially statistical in character and only purport to give results that may be expected on the average rather than precisely expected for any particular system.....The methods being statistical in character have to be based on some hypothesis as to a priori probabilities, and the hypothesis chosen is the only postulate that can be introduced without proceeding in an arbitrary manner....”
Tolman [54] then goes on to argue on p. 67 that what statistical mechanics should strive for is to ensure
“...that the averages obtained on successive trials of the same experiment will agree with the ensemble average, thus permitting any particular individual system to exhibit a behavior in time very different from the average;”
see also the last paragraph on p. 106 in Jaynes [174].

5.2. Parameter Description

As said earlier, E is always treated as a random variable E taking the values E k that fluctuate over m k , regardless of how W is treated. The most convenient description of a system is to use the NFl- W so it is the same for all m k . Per Claim 3, this results in a random SI-variable F w with its outcome F w k (see Equation (17a)) fluctuating over m k so its ensemble average is the generalized (mechanical) macroforce F w ; see Equation (17b). In contrast, the conjugate field β = 1 / T for Fl-E is fixed.
It is possible to use a mixed parameter approach. We consider W having two nonoverlapping subsets W 1 and W 2 , with  W 1 a NFl-parameter W NF . The remaining subset W 2 is Fl-parameter set W F taking the values W k F over m k . We impose the consistency condition on W k F (see Claims 3 and 5) so that the corresponding field F w k F = E k / W k F = F w F , k ; see Equation (18). For a null set W NF , we retrieve the field-parameter description in Claim 3. As before, the consistency requires obtaining the same MNEQT, so we must have
W F = W 2 , F w NF = F w 1 ;
see Condition 1.
To clarify the above distinction, we consider the simpler case of NFl- W = ( V , ξ ) for a system. The energy E is a random variable E taking Fl-values E k ; their average value is determined by a fixed f s = T ; see Equation (24) and Claim 5. In this ensemble, T , V and ξ are fixed so we can also call it a ( T -V- ξ ) -ensemble. In this case, E k , P k , and  A k are fluctuating over m k . If we take W NF = ( ξ ) and W F = ( V ) , then E k , V k , and  A k are fluctuating over m k with T , P and ξ kept fixed in this ensemble, which we can call a ( T -P- ξ ) -ensemble. We can also consider an ensemble with W NF = ( V ) and W F = ( ξ ) . In this ensemble, E k , P k , and  ξ k are fluctuating over m k ; T , V and A are kept fixed so we can call it a ( T -V- A ) -ensemble. For these ensembles to represent the same physical system thermodynamically, we must have V = V , P = P , ξ = ξ , and  A = A in accordance with Equation (109).
Remark 33. 
An NEQ ensemble is specified by the set of its NFl quantities W NF and F w F .

5.3. Ensemble of Replicas

The discussion here provides an extension of the ideas valid for thermodynamic equilibrium macrostates M eq to not only nonequilibrium macrostates M neq but also to macrostates M det . The latter are governed by deterministic dynamics in which microstate probabilities remain constant, as will be justified below; see Claim 12. The premise of the extension to M neq is that these ideas must be just as valid for them, as they are based on thermodynamics being an experimental science [79]. Thermodynamics (equilibrium and nonequilibrium) requires verification by performing the experiment many times over. The same premise also applies to M det . Therefore, we consider all these macrostates in the following, and simply use M ¯ to stand for all these states. We must prepare many copies or replicas  N > > 1 of the system at the same time t under identical conditions specified by the set of extensive variables Z ( t ) that can be used to also study how the system evolves in time. We identify a replica as simply representing an “instantaneous state” of the system, i.e., one of the microstates m k . The collection of all replicas at each instant t is the ensemble, which is specified by the set Z ( t ) and N . The ensemble then becomes the representation of the macrostate M ¯ . Any quantity q ( Z , t ) of interest associated with M ¯ is then identified as an instantaneous average over these replicas or samples, and is an explicit function of the set Z and possibly t. For simplicity, we will usually suppress Z and only exhibit the explicit dependence on t in q. By definition, the ensemble average is given by
q ( t ) or q ¯ ( t ) or < q > ( t ) 1 N k = 1 W N k ( t ) q k ,
where q k is the value of q in the kth microstate m k , N k ( t ) denotes the number of samples in the kth microstate m k at time t, and W is the total number of distinct microstates, which we assume is finite right now. We also assume N k ( t ) to be a countable set. It should be obvious that N > > W for the above definition to make sense. The overbar on or the angular bracket around q in Equation (110) are used to indicate the average q, which is also represented simply as q, following the acceptable tradition in thermodynamics. We will use all three notations to indicate the average in this review as need be.

5.4. Concept of Probability

We now introduce the concept of ensemble probability
p k ( t ) lim N N k ( t ) / N , k = 1 W p k ( t ) 1 ,
which is valid even if W . As is well-known [114], the probabilities require the formal limit N , which is going to be implicit in the following. This justifies Equation (12).
It should be stressed that the concept of probability introduced in Equation (111) is also valid for a Hamiltonian system with deterministic dynamics. All one needs to do is to prepare an ensemble with a given number N k of replicas. As these numbers will not change because the dynamics is deterministic, p k will not change.
It should be noted that m k , and hence the value q k on it, depend on Z ( t ) explicitly, but may also depend on t explicitly. In general, p k ( t ) will be time-dependent as determined by the history of the process. They become history-independent and constant in time t for M eq . As we will soon see, they remain constant in a mechanical evolution of M det . In this sense, there is a close parallel between M eq and M det , as discussed below.
The average of the state variable Z , using the tradition in thermodynamics, is simply written as Z (see Equation (110)):
Z k = 1 W p k ( t ) Z k ;
here Z k is the value of Z in m k . We will also extend this tradition to F w in Equation (40) so that
F w k = 1 W p k ( t ) F w k ,
where, as usual, F w k is the value of F w in m k .
Claim 12. 
The p k defined above in Equation (111) remains a constant of motion for a deterministic system.
This is easy to rationalize as follows. Consider a collection of microstates m k of a system with N k copies at some initial time t = 0 . In a deterministic evolution, N k ’s do not change, which justifies the above claim.
Definition 25. 
To distinguish the usage of constant probabilities for deterministic systems with the usage of probabilities for thermodynamic systems, where they may change spontaneously without any external intervention, we will use the term stochastic for this aspect of probabilistic behavior in M , but not in M det .
We clarify this point further. Consider an isolate system that is not in EQ. This means that, according to the Boltzmann principle, not all microstates are equally probable. In time, the system will come to equilibrium by ensuring that all microstates become equally probable. This shows how a thermodynamic system behaves in a way that allows p k to change in time even without any external intervention. For a deterministic system such as a loaded die, this will never happen even if it is disturbed by the performance of mechanical work, like throwing, an external intervention.
For a thermodynamic system in EQ, p k remains invariant (constant) in time. In this regard, such a system is identical to a deterministic system that obeys Liouville theorem [164], since it is well-known that an EQ system also obeys the theorem [33]. The reason is very simple. The various members of the above ensemble in EQ occupy various microstates with equal probability with the maximum entropy as shown in Section 5.5. This entropy remains a constant of motion for the EQ system.
Remark 34. 
An EQ macrostate M eq under fixed conditions of the surroundings so p k ’s do not change is no different than a deterministic macrostate M det , except that the former has a well-defined notion of temperature but the latter has no such notion.

5.5. Statistical Entropy for M ( t )

We provide a very general statistical formulation of S for a general system Σ that is applicable to mechanical as well as thermodynamic systems. It will be shown to be identical to the thermodynamic entropy S by appealing to the third law. Our derivation demonstrates that the concept of entropy in general is of a statistical nature. We consider a state M ( t ) M ( Z ( t ) , t ) of Σ at a given instant t. We focus on a macrostate M ( t ) of Σ at a given instant t, which refers to the sets m = m k and p = p k of microstates and their probabilities, respectively. We consider Fl- W but the discussion is also valid for NFl- W by simply setting W k = W , k . The microstates are specified by ( E k ( t ) , W k ( t ) ) , which along with p need not uniquely specify the macrostate M ( t ) . In the following, we will use the set Z ( t ) for m for simplicity. We will also denote Z ( t ) by Z ¯ so that we can separate out the explicit variation due to t in addition to the implicit variation in t due to Z ¯ , if any. For simplicity, we suppress t in M in the following. For the computation of combinatorics, the  probabilities are handled as described in Section 5.4. We follow the notation used there, choosing N = C W ( Z ¯ ) with C some large integer constant, and  W ( Z ¯ ) the number of distinct microstates m k in the ensemble or the sample space Γ ( Z ¯ ) spanned by m k . We will see that W ( Z ¯ ) is determined by m k ’s having nonzero probabilities [79]. We will call them available microstates.
The ensemble Γ ( Z ¯ ) above is a generalization of the ensemble introduced by Gibbs, except that the latter is restricted to an equilibrium system, whereas Γ ( Z ¯ ) refers to the system in any arbitrary macrostate so that p k in Equation (111) may be time-dependent, and may not be unique. The samples are, by definition, independent of each other so that there are no correlations among them. Because of this, we can treat the samples in Γ ( Z ¯ ) to be the outcomes of some random variable, the macrostate M ( t ) . This independence property of the outcomes is crucial in the following. Each sample of M ( t ) is one of a microstate in Γ ( Z ¯ ) . They may be equiprobable but not necessarily. The number of ways W to arrange the N samples into W ( Z ¯ ) distinct microstates is
W N ! / k N k ( t ) ! .
Taking its natural log, as proposed by Boltzmann, to obtain an additive quantity per sample as described in Section 5.6 (see also Axiom 6), we obtain
S ( 1 / N ) ln W ,
and using Stirling’s approximation, we see easily that it can be written as the ensemble average (see Equations (12) and (26a)),
S ( Z ¯ , t ) η ( t ) k = 1 W ( Z ¯ ) p k ( t ) ln p k ( t ) ,
of the negative of Gibbs’ index of probability ([48], p. 16)
η k ( t ) ln p k ( t ) .
We have shown an explicit time dependence in S, which is distinct from the implicit time dependence in Z ¯ , to merely express the fact that it may not be a state function in S Z ¯ , i.e., that M may not be uniquely specified in S Z ¯ . The above derivation clearly shows that Equation (116), which is identical in form to Equation (26a), justifies the latter for an arbitrary M .
The identification of entropy in Equation (116) with the Gibbs formulation of entropy is a time-honored practice for nonequilibrium states since the days of Gibbs ([48] see, in particular, chapters 11 and 12, where time dependence is discussed), and has been discussed by Tolman ([54], Ch. 13, and in particular pp. 538–539), Jaynes [174], Rice and Gray [55], and Rice [57], to name a few. There is no restriction on p j ( t ) . In particular, they do not have to be given by probabilities valid for equilibrium states; see also Sethna ([36], Section 5.3.1). The definition merely follows from the observation that the index of probability is an additive quantity for independent replicas (see Fundamental Axiom) and that the entropy is merely its average value (with a negative sign). Tolman takes great care in establishing that this formulation of the entropy satisfies the second law ([54], Section 130). Tolman also shows that the Boltzmann definition of entropy is a special case of the general formulation due to Gibbs ([54], see the derivation of Equation (131.2)), just as we have argued; see Equation (208).
The identification of the entropy with the negative of the Boltzmann H-function ([54], see p. 561), the latter describing a nonequilibrium state, should leave no doubt in anyone’s mind that the Gibbs formulation of the entropy can be applied equally well to an equilibrium or a nonequilibrium system. Nevertheless, we should point out that not all subscribe to this viewpoint of ours about the Gibbs formulation of entropy, because they insist that the Gibbs entropy is a constant of motion [135]. This constancy follows immediately from the application of Liouville’s theorem in classical mechanics [32,33,34,36,54], valid for a system described by a Hamiltonian, as discussed above and as we have already discussed in Section 5.4. We thus see that our formulation of the entropy in EQ is consistent with this theorem.
The above derivation is based on fundamental principles of combinatorics and additivity, and does not require the notion of equilibrium or nonequilibrium in the system; therefore, it is always applicable for any arbitrary macrostate M ( t ) including that of a determining system; see Claim 12. To the best of our knowledge, even though such an expression has been extensively used in the literature for NEQ entropy, it has been used by simply appealing to the information entropy [72,175]. Thus, Equation (116) is a generalization of Equation (26a) to the general case, and thus justifies it for M ( t ) . We now generalize Claim 12 as follows:
Claim 13. 
The probability p k and the Gibbs entropy (see Equation (26a)) is easy to define for a M ( t ) including that of a deterministic Hamiltonian system. As the probability and the entropy for M det do not change as a function of time, we show in Section 10.1 that the concepts of microheat and macroheat cannot be associated with a Hamiltonian system, although the concepts of microwork and macrowork are defined.
The distinction between the Gibbs’ statistical entropy and the thermodynamic entropy should be emphasized. The latter appears in the Gibbs fundamental relation that relates the energy change d E with the entropy change d S , as is well-known in classical thermodynamics, and as we will also demonstrate below; see also Equation (93a). The concept of microstates is irrelevant for this, as it is a purely thermodynamic relation. On the other hand, the Gibbs’ statistical entropy is solely determined by m k , so it is a statistical quantity. It then becomes imperative to show their equivalence, mainly because the statistical entropy is based on the Boltzmann idea. This equivalence has been justified elsewhere [75,76], and is summarized in the following Remark.
Remark 35. 
Because of this equivalence, we will no longer make any distinction between the statistical Gibbs entropy and the thermodynamic entropy and will use the standard notation S for both of them for a macrostate M ieq , of which M eq is a special case.
Remark 36. 
The Gibbs entropy appears as an instantaneous ensemble average; see Definition 7. This average should be contrasted with a temporal average in which a macroquantity q is considered as the average over a long period τ 0 of time
q = 1 τ 0 0 τ 0 q ( t ) d t ,
where q ( t )  is the value of q  at time t [33]. For an EQ macrostate M eq , both definitions give the same result provided ergodicity holds. The physics of this average is that q ( t ) at t represents a microstate of M eq . As M eq is invariant in time, these microstates belong to M eq , and the time average is the same as the ensemble average if ergodicity holds. However, for an NEQ macrostate M neq ( t ) , which continuously changes with time, the  temporal average is not physically meaningful as the microstate at time t corresponds to M neq ( t ) and not to M neq ( t = 0 ) in that the probabilities and Z are different in the two macrostates. Only the ensemble average makes any sense at any time t, as discussed in [176]. Because of this, we only consider ensemble averages in this review.
A word of caution must be offered. If S is not a state function, it cannot be measured or computed. Thus, while the statistical entropy can be computed in principle in all cases if p k is known, there is no way to compare its value with thermodynamic entropy in all cases. Thus, no comment can be made about their relationship in general for an arbitrary M ( t ) . We have only established their equivalence for M ieq for which the two entropies are the same.
Remark 37. 
We have summarized our approach for an arbitrary macrostate in Axiom 3, which allows us to identify the two entropies in all cases. Thus, we only need to investigate the μNEQT for M ieq to also cover M ; see Section 5.9.

5.6. Principle of Additivity

5.6.1. Additivity

We consider a system Σ consisting of two nonpenetrating sub-bodies Σ 1 and Σ 2 at present, each specified by W 1 and W 2 . Later, we will generalize to any number of sub-bodies Σ j . The principle of additivity states that Σ is specified by W given by
W j W j .
This principle is self-evident for nonpenetrating systems. For example, the  number of particles
N j N j
remains an identity. (This remains true even if the bodies are interpenetrating, for which the volumes may not be additive). For  nonpenetrating bodies, however, their volumes become additive:
V = j V j ,
which we will assume in this review. We will call the case of nonpenetrating bodies the discrete approach. It is evident that in this approach, the principle of additivity is valid for any number of sub-bodies Σ j , j = 1 , 2 , . In this case, the sum in the above equations is over all sub-bodies.
We now show that the above sample average in Equation (110) also follows immediately from the principle of additivity of quantities that are additive; see Claim 2. One considers a very large macroscopic system Σ 0 of N 0 N N particles and imagines dividing it into a large number N of macroscopically large and nonoverlapping parts of equal size N, each representing a microstate of the system Σ . As the parts are macroscopically large, they will act almost independently; see Section 7.3 for details. How well this condition is satisfied depends on how large the parts are. In principle, they can be made arbitrary large to ensure their complete independence. At the same time t, these parts will be in microstates m k of Σ with probabilities p k ( t ) . The additivity principle states that any extensive thermodynamic quantity X ( t ) of the system Σ 0 is the sum of this quantity over its various macroscopically large parts. This principle is consistent with the definition of the average in (110). One can also think of the N parts as representing the same measurement that has been repeated N times on samples prepared under identical macroscopic conditions at the same instant t.

5.6.2. Quasi-Additivity

We have enunciated the principle of additivity for W above. The  energy E plays a very different role because of mutual interactions between various sub-bodies. We again restrict to only two sub-bodies for simplicity, which can be later generalized to any number of sub-bodies. We assume that they are weakly interacting so that their energies are quasi-additive, which we express in a form using j = 1 , 2 :
E = j E j + U int j E j ,
where U int is the weak interaction energy between Σ 1 and Σ 2 , and can be neglected to a good approximation provided
U int < < E sm min E j .
We can extend the discussion to many sub-bodies Σ j , j = 1 , 2 , , by defining U int as the net interaction energies between all of them:
U int j > l E j l ,
where E j l is the interaction energy between Σ j and Σ l . The inequality in Equation (120) can be made as precise as we wish by making N extremely large compared to various sub-bodies.
Remark 38. 
With quasi-additivity for energies, we can extend the principle of additivity from W to
Z j Z j ,
by including quasi-additivity for the energies; see Claim 2.
However, the quasi-additivity of the entropy is altogether a different issue. The entropy additivity is strictly valid if Σ and Σ ˜ are (statistically) independent [3], i.e., noninteracting. However, this independence is not of any physical interest as Σ and Σ ˜ must be interacting with each other for any interesting thermodynamics; otherwise, there is no need to consider Σ ˜ , and the issue of additivity does not arise. Thus, we are inclined to consider them to be quasi-independent. To the best of our knowledge, the discussion of quasi-independence and its distinction from interactions between Σ and Σ ˜ that are weak has been carefully presented elsewhere ([148], S corr was called S int there; however, S corr seems to be more appropriate) for the first time, which we summarize below. The  presence of interparticle interactions that determine E and E ˜ for Σ and Σ ˜ , respectively, results in the thermodynamic concept of correlation lengths in them. The correlation length λ corr > a is a property of macrostates, and can be much larger than the interparticle interaction length a between particles depending on the macrostate. In general, λ corr > > a . A simple well-known example is of the correlation length λ corr of a nearest neighbor Ising model, which increases very rapidly as the critical point is reached, and where it can be much larger than the nearest neighbor distance a between the spins. Two interacting Ising systems at the same temperature cannot be “independent”, so the additivity of entropy for Σ 0 is replaced by the following:
S 0 ( X 0 , t ) = S ( X ( t ) , t ) + S ˜ ( X ˜ ( t ) ) + S corr ( t ) ,
where S corr ( t ) is a correction term to the entropy due to correlation that is present between Σ and Σ ˜ due to their mutual interaction. If the linear sizes l and l ˜ of the two bodies are much larger compared to λ corr , then this correlation becomes almost nonexistent. In this case, S corr ( t ) can be neglected to a good approximation so that
S 0 ( X 0 , t ) S ( X ( t ) , t ) + S ˜ ( X ˜ ( t ) ) ,
provided l ˜ , l > > λ corr . Under this condition, Σ and Σ ˜ are said to be quasi-independent [148], which ensures that their entropies become quasi-additive. This distinction is usually not made explicit in the literature. Usually, l ˜ > > l , but this condition was not used above so the above additivity is valid for any two bodies for which l ˜ , l > > λ corr . For  Σ ˜ representing a medium, S ˜ has no explicit time dependence as it is assumed to be in equilibrium, and  X 0 remains constant for the isolated system Σ 0 .
The above quasi-additivity principle is applicable to microstates of Σ as well. We now focus on classical microstates represented by the sub-bodies, and apply the discussion to only two sub-bodies representing Σ = Σ 1 and Σ ˜ = Σ 2 forming the isolated system Σ 0   as they are central to our statistical mechanics. We consider the energies of the microstates m 0 k 0 , m k , and  m ˜ k ˜ . They are related as follows:
E 0 k 0 = E k + E ˜ k ˜ + E k , k ˜
where we have also included the interaction energy E k , k ˜ due to U int , which is usually negligible relative to E k ,   E ˜ k ˜ . These energies are independent of the macrostates and, therefore, independent of quantities such as the temperatures and probabilities that specify macrostates of various bodies forming the system. The energies corresponding to their macrostates are related by
E 0 = E + E ˜ + U int ;
see Equation (119). Again, the smallness of E k , k ˜ results in its average U int obtained by using p 0 k 0 and E k , k ˜ in Equation (112), being negligible relative to E and E ˜ .
Remark 39. 
The assumption to neglect E k , k ˜ or U int merely makes Σ and Σ ˜ satisfy the principle of additivity. We will make this assumption in this review extensively.
Remark 40. 
From now on, we will usually replace the sign “≈” by “=” unless clarity is needed.
Remark 41. 
Throughout this review, we will think of the above approximate equalities as equalities to make the energies additive by neglecting the interaction energy between Σ and Σ ˜ , which is a standard practice in the field, but also assuming quasi-independence between them to make the entropies to be additive, which is not usually mentioned as a requirement in the literature.

5.7. Σ in Internal EQ (IEQ)

The central concept of the μ NEQT is that of the internal equilibrium (IEQ) according to which the entropy S of an NEQ macrostate is a state function of the state variables in the enlarged state space S Z [134,148,149]. The enlargement of the space relative to the EQ state space S X is due to independent internal variables [13,18,51,108], which is sufficient to uniquely specify M in S Z . We denote such a state by M ieq . The same state cannot be uniquely specified in S X or any other extended state space S Z that does not have the same set of internal variables as in Z .
We give a simple example to clarify why and how internal variables are useful for describing an NEQ state. Consider the case of two identical bodies Σ 1 and Σ 2 in thermal contact at different temperatures T 1 ( t ) and T 2 ( t ) and energies E 1 ( t ) and E 2 ( t ) , respectively; we ignore other observables N , V , etc. Thus, X = ( E ) ˙ for each system. We assume that each one is in an EQ state of its own at each instant. Together, they form an isolated composite system Σ , whose entropy S ( E 1 , E 2 ) = S 1 ( E 1 ) + S 2 ( E 2 ) is a function of two variables at each instant t, and can be written as a state function in the enlarged state space formed by
E = E 1 + E 2 = c o n s t , ξ ( t ) = E 1 E 2 .
(We have neglected the interaction energy E 12 between Σ 1 and Σ 2 here per Remark 39.) This situation should be compared with its mechanical analog in Section 4, and in particular with Equation (104a) for ξ E k ; here, n 1 = n 2 = 1 / 2 . The discussion there was purely mechanical so there was no dissipation.
We are in a position now to understand how dissipation emerges in thermodynamics. As the system approaches EQ, E 1 E 2 so that ξ 0 . This also means that T 1 ( t ) T 2 ( t ) = T eq , the EQ temperature. The first thing we learn from this simple example is that it clearly shows how the t-dependence in S ( E , t ) S ( E 1 , E 2 ) can be replaced by invoking an extensive internal variable ξ ( t ) so that the entropy can be treated as a state function S ( E , ξ ) in the enlarged state space S Z spanned by E and  ξ . In other words, the system is in an IEQ state. In general, we will need to enlarge S X by introducing an appropriate number of internal variables to form S Z in which the system is in IEQ. Thus, we can always express S in an IEQ state as a state function
S = S ieq = S ( Z )
in the appropriately enlarged state space S Z . This is carefully discussed in Section 12, where we take a different approach. As  S 1 ( E 1 ) and S 2 ( E 2 ) , being in EQ, have their maximum value for given E 1 ( t ) and E 2 ( t ) , S ( E , ξ ) also has its maximum value for given E ( t ) and ξ ( t ) , but this value increases as ξ 0 , and EQ is achieved. In general, M ieq has the maximum possible entropy for the given Z , and continues to increase as Z changes and EQ is reached. For this IEQ state, it is trivial to show that the temperature ( 1 / T = S / E ; see Equation (129)) of Σ is
T ( t ) = 2 T 1 T 2 / ( T 1 + T 2 )
and its affinity T S / ξ (see Equation (133)) is given by
A ( t ) = ( T 2 T 1 ) / ( T 1 + T 2 ) .
At equilibrium, T 1 = T 2 = T eq and ξ = 0 , A = 0 . Thus, T 1 and T 2 may be very different, yet the system as a whole can be treated as being in IEQ with a unique temperature T ( t ) , any temperature difference T 2 ( t ) T 1 ( t ) between its parts not withstanding. The discussion can be extended easily to the case when the two bodies are in IEQs and also when they are of different sizes. In all cases, a unique temperature in accordance with Equation (129) can be defined for the composite system [77,78]. Once it is determined, we do not have to worry about the internal temperature difference between Σ 1 and Σ 2 . Any internal heat transfer between them is captured by
β A ( t ) d ξ = d i S = d E 1 ( β 1 β 2 ) ,
as can be easily verified; here d i S is the irreversible entropy generation due to macroheat exchange [51]. We thus see the affinity for ξ is given by
A ( t ) = T d i S d ξ = d E 1 d ξ ( β 1 β 2 ) β ,
which vanishes as EQ is reached, a well-known feature [51] of classical thermodynamics. The analysis clearly shows how thermodynamics brings in dissipation in a mechanical system, showing the consistency of our approach using internal variables.

5.8. Gibbs Fundamental Relations for M ieq ( Z ) in S Z and S ζ

We first consider the state space S Z in which M ieq ( Z ) is uniquely specified. In this space, the state function S ( Z ) results in the general form of the Gibbs fundamental relation
d S ( Z ) = S E d E + S W · d W
for the entropy, from which follows the Gibbs fundamental relation for E ( ζ ) in S ζ spanned by ζ ( S , W ) ,
d E ( ζ ) = E S d S + E W · d W .
Introducing the SI-temperature T = 1 / β as
T E / S , β = S / E ,
and re-expressing the generalized macroforce in Equation (18) as
F w = E / W = T S / W ,
we rewrite Equations (128a) and (128b) as
d S = β d E + β d W
d E = T d S d W
in terms of SI macroquantities; here, we have introduced SI-macrowork d W as the generalized macrowork
d W F w · d W T S W · d W
done by the system. The derivative with respect to ξ determines the affinity
A T S / ξ = E / ξ ,
which vanishes in equilibrium so that A eq = A 0 = 0 . Thus, in general, F w = ( f w , A ) , where
f w = E / w = T S / w
is the generalized macrowork due to w .
Remark 42. 
Comparing Equation (131) with Equation (93a) allows us to verify Conclusion 1 for the Clausius equality.
This equality must be distinguished from d e Q in Equation (46). Thus, Equation (93a) allows us to uniquely identify the generalized macroheat d Q = T d S determined by d S and the generalized macrowork determined by d W to be independent of each other as they belong to orthogonal subspaces in the subspace S ζ ; see also Section 10.2. Both are SI-macroquantities. The  resulting thermodynamics has been identified as the MNEQT. In terms of various components of F w , the generalized macrowork is
d W = P d V μ d N + + A · d ξ .
We can identify various components of the macrowork as d W V = P d V , d W N = μ d N , , d W ξ 1 = A 1 d ξ 1 , , using an obvious notation. The  missing terms denote the contribution from the rest of the variables not shown, and 
P E / V , μ E / N , , A E / ξ ,
are the SI-fields associated with W , with changes d W =   d V , d N , , d ξ being the changes in it.
In the M ˚ NEQT , the first law in Equation (94) refers to exchange macroheat d e Q = T 0 d e S (see Equation (46)) and macrowork
d e W = P 0 d e V μ 0 d e N + ;
in terms of the fields (the temperature T 0 , pressure P 0 , chemical potential μ 0 ,⋯) of the medium and the corresponding macroscopic exchange quantities in all cases, regardless of the irreversibility. As the medium is in EQ, there is no contribution due to ξ in d e W as the corresponding contribution A 0 · d ξ vanishes due to the fact that the affinity A 0 0 for the medium. Our sign convention is that d e Q is positive when it is added to Σ , and  d e W is positive when it is transferred to Σ ˜ .
It follows from Equations (135a) and (135c) that the irreversible macrowork, also known as dissipative work, is
d i W = ( P P 0 ) d V ( μ μ 0 ) d N + + A · d ξ 0 .
The coefficients P P 0 , μ μ 0 , , A are commonly known as thermodynamic forces or macroforce imbalances [51], which vanish in EQ; see Section 6.4.
Remark 43. 
We have included the term associated with N for completeness in Equations (135a), (135c) and (136). We will no longer consider this term anymore.
We should compare the above equations with Equation (79). Once d e W or d W has been identified, the use of the first law allows us to uniquely determine d e Q or d Q , respectively.
It is clear that the root cause of dissipation is the macroforce imbalance. It drives the system towards equilibrium [41,42,75,76,134,148,149,150,152,153]. It arises due to the imbalance between the external and the average internal forces performing work; the microforce imbalance is introduced in the following section. The average force imbalances give rise to an internal work d i W due to all kinds of force imbalances. The irreversible or dissipated work is given in Equation (136), which is generated within Σ .
If we include the relative velocity between a Brownian particle Σ BP and the medium to account for the Brownian motion [148,157], we must account for [148] an additional term V · d P BP in d i W due to the relative velocity V :
d i W = ( P P 0 ) d V V · d P BP + A d ξ ;
here, d P BP = F wBP d t is the change in the linear momentum of the Brownian particle experiencing a macroforce F wBP . To see it, we recognize that V · d P BP must be nonpositive to comply with the second law. Thus, F wBP must be antiparallel to V and describes the frictional drag. This is discussed in detail in Ref. [157]. Thus, the force is reviewed in Section 14 as the role of friction in the Langevin equation turns out to be different in the two NEQ thermodynamics. We will come back to this term later when we consider the motion of a particle attached to a spring; see Figure 3b, a system also studied by Jarzynski, so that a comparison can be made.
The irreversible macroheat d i Q in all cases is given by Equation (47), and shows that it does not vanish when T = T 0 , provided d i S > 0 . This means that the irreversible macrowork is present even if there is no temperature difference, such as in an isothermal process, as long as there exists some nonzero thermodynamic force or irreversibility. The resulting irreversible entropy generation is then given by d i S . We summarize this [51] as
Conclusion 3. 
To have dissipation, it is necessary and sufficient to have a nonzero thermodynamic force. In its absence, there can be no dissipation regardless of the time dependence of the work process; see also Remark 32. This understanding of dissipation becomes clear from the microscopic source of dissipation in Proposition 2.

5.9. Time-Dependent Gibbs Fundamental Relations for M nieq ( Z ) in S Z

We now consider the generalization of the Gibbs fundamental relation for M nieq , which is not uniquely specified in S Z or S ζ , by starting from Equation (295a) having an explicit time dependence that comes from “hidden” internal variables ξ in S Z . From the state function entropy S ( Z ( t ) ) for M ieq ( t ) in S Z , we have
d S ( Z ( t ) ) = S E d E + S W · d W + S ξ · d ξ ,
where W is the work variable in S Z . Expressing the last term as
S ξ · d ξ d t d t ,
we obtain the following generalization of the Gibbs fundamental relation for M nieq ( t ) in S Z :
d S ( Z ( t ) , t ) = S E d E + S W · d W + S t d t ,
where
S t S ξ · d ξ d t 0 .
Definition 26. 
As the presence of S / t above in S Z is due to “hidden” internal variables in ξ , we will call it the hidden entropy generation rate, and 
d i S hid ( t ) = S t d t = S ξ · d ξ 0 ,
the hidden entropy generation. It results in a hidden irreversible macrowork
d i W hid T d i S hid = A · d ξ ,
in S Z due to the hidden internal variable with affinity A .
In S Z , we can identify the temperature T as the thermodynamic temperature in S Z by the standard definition. It is clear from the above discussion that S ( Z ( t ) ) / E in S Z has the same value as S ( Z ( t ) , t ) / E in S Z . However, there is an alternative definition of a temperature for M in S Z as
d Q ( Z ( t ) , t ) / d S ( Z ( t ) , t ) = T arb alt ( Z ( t ) , t ) ,
while T ( Z ( t ) ) = d Q ( Z ( t ) ) / d S ( Z ( t ) ) for M ieq in S Z . It is easy to see that they are not the same as macroheats d Q ( Z ( t ) ) = d E ( t ) + d W ( Z ( t ) )   and d Q ( Z ( t ) , t ) = d E ( t ) + d W ( Z ( t ) , t ) are not the same as macroworks. Thus, this definition is not a thermodynamic temperature for M in S Z . Therefore, we are now set to identify T arb (see also Equation (257)) as a thermodynamic temperature of M arb by this T.
Remark 44. 
1 / T arb S ( Z ( t ) , t ) / E in S Z is identified by the same derivative in the Gibbs fundamental relation in S Z as follows:
1 T arb = S ( Z ( t ) ) E 1 T ( Z ( t ) ) ,
while the alternative nonthermodynamic temperature satisfies
T arb alt ( Z ( t ) , t ) = T ( Z ( t ) ) [ 1 + d i S hid / d S ( Z ( t ) , t ) ] ,
as is easily verified.
Remark 45. 
As discussed above and as will be discussed in detail in Section 12.1, a macrostate M nieq ( t ) with S ( Z ( t ) , t ) can be converted to M ieq ( t ) with a state function S ( Z ( t ) ) in an appropriately chosen state space S Z S Z by finding the appropriate window in which τ obs lies as well. The needed additional internal variable ξ determines the hidden entropy generation rate S / t in Equation (138b) due to the non-IEQ nature of M nieq ( t ) in S Z , and ensures validity of the Gibbs relation in Equation (138a) for it, thereby not only providing a new interpretation of the temporal variation of the entropy due to hidden variables but also extending the MNEQT to M nieq ( t ) in S Z .
The above discussion strongly points towards the following possible proposition.
Proposition 1. 
The MNEQT provides a very general framework to study any M nieq ( t ) in S Z , since it can be converted into a M ieq ( t ) in an appropriately chosen state space S Z , with  d i S hid ( t ) originating from hidden internal variable ξ .
We now consider a process P to be studied in S Z . It is natural to think of at least the initial macrostate M in of P as being uniquely identified as M ieq in in S Z . During the process, M ( t ) along P may turn into M nieq ( t ) or remain M ieq ( t ) . The former has been studied above. The latter can happen under the following two cases:
(i) all internal variables in ξ remain out of equilibrium;
(ii) internal variables in a subset ξ ξ have equilibrated so that the affinity A = T S / ξ vanishes.
In both cases, M ( t ) remains M ieq ( t ) in S Z , except that in (ii), M ( t ) can also be treated as M ieq ( t ) in the proper subspaces between S Z S Z and S Z , with  Z defined by Z ξ = Z . Even though A = 0 in these subspaces so that d i S hid ( t ) = 0 and d i W hid ( t ) = 0 , the Fl microaffinity A k 0 in these subspaces, and will still play an important role in the μ NEQT. Therefore,
Remark 46. 
We will use the state space S Z to construct the NEQ statistical mechanics in (i) and (ii) without affecting the hidden entropy generation and hidden irreversible macrowork. This allows us to use S Z over the entire process.
Remark 47. 
In a process P resulting in M nieq ( t ) in S Z , it is natural to assume that the terminal macrostates in P are M ieq so the affinity corresponding to ξ must vanish in them.
The above discussion can be easily applied to consider the case S Z S Z , in which internal variables in a subset ξ of ξ have equilibrated. The result is summarized in the following:
Remark 48. 
By replacing Z by X , and  Z by Z , we can also express the Gibbs fundamental relation for any NEQ macrostate in S X as
d S ( X ( t ) , t ) = S E d E + S w · d w + S t d t ,
by treating M neq as M ieq in S Z . In an NEQ process P ¯ between two EQ macrostates but resulting in M ieq ( t ) between them in S Z , the affinity corresponding to ξ must vanish in the terminal EQ macrostates of P ¯ .
Equation (141) proves extremely useful to describe M neq in S X as it may not be easy to identify ξ in all cases.
Remark 49. 
The explicit time dependence in the entropy for M neq in S X or M nieq ( t ) in S Z is solely due to the internal variables, which do not affect the validity of the Clausius equality d Q = T d S (Equation (45)), with T defined as the inverse of S / E at fixed w , t or W , t in the two state spaces, respectively; see Equation (129). As a consequence, Equation (47) remains valid for any M .

5.10. Consequences of the Second Law

Theorem 4. 
As a consequence of the second law, the irreversible macrowork d i W (see Equation (136)) which is equal in magnitude to the macroheat d i Q (see Equation (95)) for any M is nonnegative in any real process.
Proof. 
Using Equation (47), we find
T 0 d i S = ( T 0 T ) d S + d i W T d i S = ( T 0 T ) d e S + d i W 0 ,
where the inequality follows from the second law d i S 0 in Equation (67c); we assume T and T 0 to be nonnegative. Therefore, each of the two independent contributions in each equation must be nonnegative. This thus proves that
d i W = d i Q 0 .
Corollary 1. 
Different components of d i W and d i Q for any M must be individually nonnegative.
Proof. 
Consider the independent components such as d i W V , d i W ξ , etc., of d i W . As  d i W is nonnegative, each component must be nonnegative.   □
This proves the inequalities in Equations (43) and (80). In addition, it shows that each term on the right in Equation (75) is nonnegative. We thus have a proof of a part of Remark 32 that deals with the consequences of the second law.
Corollary 2. 
In any real process,
( T 0 T ) d e S 0 ; ( T 0 T ) d S 0 .
Proof. 
The corollary follows from the preceding theorem.    □
The first inequality merely states the well-known fact of thermodynamics that macroheat d e Q = T 0 d e S flows from “hot” to “cold”. The second inequality also states a well-known fact about the stability in thermodynamics, which requires the entropy to increase with temperature. As EQ is reached, T T 0 either from above ( T > T 0 ) or from below ( T < T 0 ). In the former case, S decreases, while it increases in the latter case.
Corollary 3. 
For an isolated system ( d S d i S ) or for T = T 0 ,
T d i S = d i W 0 .
Proof. 
Setting d e S = 0 for an isolated system or T = T 0 in Equation (142) proves the theorem immediately.    □
The inequalities in Equation (142) follow from the second law d i S 0 in Equation (67c). Each term on the right side, being independent of each other, must be nonnegative separately, which yields
( T 0 T ) d S 0 , ( 1 T / T 0 ) d e Q 0 , d i W 0
as consequences of the second law. In view of Equation (95), the last inequality above proves the last two inequalities in Equation (69).

5.11. Assumptions

We list the two important assumptions of our approach. They can be relaxed but we will not do that in this review.

5.11.1. N Fixed for Σ

In order to fix the size of Σ , we need to specify one of its extensive state variables. Usually, N is kept fixed to ensure a fixed size. Therefore, N is not considered part of X = ( E , V , ) and Z from now on [177]. This also means that (i) there is no chemical reaction, and (ii) there is EQ with respect to the chemical potential. Most of the time, we will simplify the discussion by using a single internal variable; the extension to many internal variables is trivial.
Our primary interest is in studying an irreversible process P , which in MNEQT requires the existence of thermodynamic forces [51]. Their absence signifies that P represents a reversible process. It should be stressed that our notation is designed in such a way that the investigation can also apply directly to the (isolated) NEQ system Σ 0 , if need be, for which no exchange with the outside is possible. In that case, the external driving must be replaced by spontaneous processes going on within Σ 0 that drive it towards equilibrium. During this drive, there is dissipation within Σ 0 that is found to contribute to work fluctuations in the μ NEQT. As is well-known, such spontaneous fluctuations are not directly captured in the μ ˚ NEQT, the microstate extension of the M ˚ NEQT . This makes our approach superior.

5.11.2. Σ ˜ Always in EQ

We will assume Σ ˜ to be always in equilibrium (which requires it to be extremely large compared to Σ , as noted above). Any irreversibility going on within Σ 0 due to internal dissipation, internal motion, internal nonuniformities, etc., is ascribed to Σ alone. Moreover, we assume additivity of volume, a weak interaction between, and  quasi-independence of, Σ and Σ ˜ ; the last two conditions, respectively, ensure that the energies and entropies are additive [75,76,134,148,149] but also impose some restriction on the size of Σ in that it cannot be too small. In particular, the size should be at least as big as the correlation length for quasi-independence as discussed there. In this study, we will assume that all required conditions necessary for the above-mentioned additivity are met.

6. Mechanical Aspects

We will consider a system in this section, but the arguments are valid for any system Σ .

6.1. Microstate Evolution in S Z

The traditional formulation of statistical thermodynamics [33,48,79] is built on a mechanical approach in which m k follows its classical or quantum mechanical Hamiltonian evolution dictated by its SI-Hamiltonian H k = H ( x k ( t ) W ( t ) ) , which suffices to provide the deterministic mechanical description with NFl- W . We will see below that k does not change as W changes in a process P . We will only consider a classical case system Σ , for which the change in H k in P is
d H k = H k x k ( t ) · d x k ( t ) + H k W ( t ) · d W ( t ) .
The first term on the right, due to the dynamical variations of x k in the system, vanishes identically due to Hamilton’s equations of motion for any m k . Thus, for fixed W , the energy E k ( W ) = H ( x k W ) of m k remains constant in time due to deterministic Hamiltonian dynamics. Only the variation d W in S Z generates any change in E k . Consequently, we can write
d H k = d H k ( w ) = d H k W ( t ) · d W ( t )
for all m k , which clearly shows that only the variation d H k ( w ) due to d W is relevant. This is indicated by the superscript w on d H k ( w ) . We do not worry about how x k changes dynamically in H ( x k W ) from now on, and focus, instead, on the state space S Z , in which we can simply express the Hamiltonian as H k ( W ) for any microstate, remembering that its value E k ( W ) is a point in S Z .

6.2. SI-Microwork in S Z

The point E k ( W ) in S Z undergoes a change due to d W given by
d E k = E k W · d W = d W k ,
where
d W k = F w k · d W , F w k E k / W .
denotes the Fl-generalized microwork produced by the Fl-generalized microforce  F w k ; see Definition 17. These are SI-microquantities. As  E k is uniquely determined by W , the microforce is a deterministic and continuous function of W ; see below. The  SI-microwork d W k is mechanically defined work as W is varied, which explains why W is identified as the work parameter in H . The variation d Z ( t ) ( d E ( t ) , d W ( t ) ) in time defines a thermodynamic process P . The trajectory γ k in S Z followed by m k during P as a function of time will be called the Hamiltonian trajectory. Being purely mechanical in nature, the trajectory is completely deterministic and cannot describe the evolution of the thermodynamic macrostate M during P unless supplemented by thermodynamic stochasticity over P ; see Claim 1. This is accounted for by the variation in p k ( M ) as M changes, and is determined by some stochastic perturbation such as the random interaction with Σ ˜ [33,59]; see Definition 25. We discuss the origin of this stochasticity in Section 7, which will allow us to introduce heat and temperature.
Since m k and p k ( M ) are independent of each other, we can treat them separately. This provides a major simplification, as described below, for studying the process P in terms of a Hamiltonian trajectory γ k . We study the mechanical evolution of m k along γ k without being concerned about the probabilities. The effect of the probability can then be supplemented by an appropriate probability. This will lead to the introduction of the concept of SI-microheat; see Section 10, where we investigate this concept in detail for the first time.

6.3. SI-Legendre Transform

We can alternatively consider the case with W k as the Fl-parameter. In that case, we will be dealing with W as a random variable with outcome W k ; see Claim 3. Let us clarify the significance of Equation (18) by considering F w k = ( P k , A k ) defined above, and show how we ensure a fixed P and A by considering Fl- W k = ( V k , ξ k ) . We consider a m k with microenergy E k ( V , ξ ) , from which we obtain P k ( V , ξ ) and A k ( V , ξ ) . They are functions of two variables, and we look for their crossing W k = ( V k , ξ k ) with a plane Π defined by F w = ( P , A ) to determine W k . We now do this for every k using the same plane Π . Using these crossings, we have
k , P = E k / V k , A = E k / ξ k .
As the two derivatives have fixed values for every k, their averages are also the same fixed values in F w = ( P , A ) as required in Equation (18). The crossings W k give the fluctuating ( V k , ξ k ) .
Alternatively, we can easily determine ( V k , ξ k ) by considering an NEQ SI-Legendre transform E k L of E k , defined as
E k L ( P , A ) E k ( V k , ξ k ) + P V k + A ξ k ,
which is a function of P and A, but not of V k and ξ k , since E k L / V k = 0 , E k L / ξ k = 0 , as is easily seen using Equation (150a). We now have
V k = E k L / P , ξ k = E k L / A .
After averaging over microstates in M det , we obtain
E L ( P , A ) E ( V , ξ ) + P V + A ξ .
Remark 50. 
E L ( P , A ) must not be confused with the NEQ enthalpy H = E ( V , ξ ) + P 0 V .
We can generalize the above discussion for the general case of NFl F w or Fl W k . We first define the SI-Legendre-transformed Hamiltonian
H L ( F w ) H ( W ) + Φ ( F w , W ) ,
in terms of Φ ( F w , W ) introduced in Equation (23b). Its microenergy E k L ( F w ) is the SI-Legendre transform of E k ( W k ) , and is given by
E k L ( F w ) E k ( W k ) + Φ ( F w , W k ) ;
compare with E k L , Fl ( F w ) in Equation (22a). We are suppressing the suffix NFl, as it is clear from the dependence on F w that we are dealing with Fl W k ; see Claim 5. For E k L ( F w ) , F w plays the role of the (Legendre-transformed) NFl “work” parameter W L ( = F w ) so that the generalized (Legendre-transformed) Fl “microforce” F w k L is given by
F w k L E k L / F w = W k ,
which should be compared with the second equation in Equation (149); note the presence of the negative sign above on the right side. The extension of the generalized microwork given in the first equation in Equation (149) to this case is the Fl Legendre-transformed microwork
d W k L ( F w ) = W k · d F w ,
so that
d E k L ( F w ) d W k L ( F w ) ,
which is identical in form with Equation (148).
For the medium Σ ˜ , we have
d W ˜ k ˜ L ( f ˜ w ) = w ˜ k ˜ · d f ˜ w ,
which, after reduction, yields
d W ˜ k L ( f ˜ 0 w ) = d W ˜ L ( f ˜ 0 w ) = w ˜ · d f ˜ 0 w = d e W L ( f ˜ 0 w ) ,
where we have replaced f ˜ w by f ˜ 0 w of Σ 0 , and used Equation (64a).
The average of E k L ( F w ) is given by
E L ( S , F w ) E ( S , W ) + F w · W
(compare with Equation (152a)), while other microquantities have their averages given by
F w L E L ( S , F w ) / F w = W , d W L ( S , F w ) = W · d F w , d E L ( S , F w ) d W L ( S , F w ) ,
as is expected from the above discussion.
As considering Fl- W creates no additional complication, we will mostly deal with NFl- W in this review.
For completeness and later usage in Section 12.2, we also introduce another Legendre transform in the case that W