Temporal Asymmetry, Entropic Irreversibility, and Finite-Time Thermodynamics: From Parmenides–Einstein Time-Reversal Symmetry to the Heraclitan Entropic Arrow of Time

Wassim M. Haddad

doi:10.3390/e14030407

The School of Aerospace Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA

Entropy2012, 14(3), 407-455;https://doi.org/10.3390/e14030407

This article belongs to the Special Issue Arrow of Time

Version Notes

Order Reprints

Abstract

In this paper, we combine the two universalisms of thermodynamics and dynamical systems theory to develop a dynamical system formalism for classical thermodynamics. Specifically, using a compartmental dynamical system energy flow model we develop a state-space dynamical system model that captures the key aspects of thermodynamics, including its fundamental laws. In addition, we establish the existence of a unique, continuously differentiable global entropy function for our dynamical system model, and using Lyapunov stability theory we show that the proposed thermodynamic model has finite-time convergent trajectories to Lyapunov stable equilibria determined by the system initial energies. Finally, using the system entropy, we establish the absence of Poincaré recurrence for our thermodynamic model and develop clear and rigorous connections between irreversibility, the second law of thermodynamics, and the entropic arrow of time.

Keywords:

energy; entropy; irreversibility; arrow of time; poincaré recurrence; finite-time semistability; interconnected systems; state space formalism; relativistic thermodynamics

1. Introduction

The arrow of time and the second law of thermodynamics is one of the most famous and controversial problems in physics. The controversy between the course of time (i.e., a timeless universe) and the arrow of time (i.e., a constantly changing universe) can be traced back to the famous dialogues between the ancient Greek philosophers Parmenides and Herakleitos on being and becoming. Parmenides, like Einstein, insisted that time is an illusion, that there is nothing new, and that everything is (being) and will forever be. This statement is, of course, paradoxical since the status quo changed after Parmenides wrote his famous poem. Herakleitos’ aphorism on the other hand is predicated on change (becoming); namely, the universe is in a constant state of flux and nothing is stationary—T

α π ά ν τ α ρ ε ί ϰ α ί ο ύ δ έ ν μ έ ν ε ι

. Furthermore, Herakleitos goes on to state that the universe evolves in accordance with its own laws which are the only unchangeable things in the universe (i.e., universal conservation and nonconservation laws). His statements that everything is in a state of flux—T

α π ά ν τ α ρ ε ί

—and that man cannot step into the same river twice, because neither the man nor the river is the same—

Π ο τ α μ ε ί ς τ ο ί ς α υ τ ο ί ς ε μ β α ί ν ο μ ε ν τ ε ϰ α ί ο υ ϰ ε μ β α ί ν ο μ ε ν, ε ί μ ε ν τ ε ϰ α ί ο υ ϰ ε ί μ ε ν

—give the earliest perception of irreversibility of nature and the universe along with time’s arrow. The idea that the universe is in constant change and there is an underlying order to this change—the Logos (

Λ ό ϒ ο ς

)—postulates the existence of entropy as a physical property of matter permeating the whole of nature and the universe.

Herakleitos’ statements are completely consistent with the laws of thermodynamics which are intimately connected to the irreversibility of dynamical processes in nature. In addition, his aphorisms go beyond the worldview of thermodynamics and have deep relativistic ramifications to the space-time fabric of the cosmos. Specifically, Herakleitos’ profound statement—All matter is exchanged for energy, and energy for all matter (

Π υ ρ ό ς τ ε ἀ ν τ α μ ο ι β ὴ τ ὰ π ά ν τ α ϰ α ὶ π \overset{͂}{υ} ρ ἁ π ά ΰ τ ω ν

)—is a statement of the law of conservation of mass-energy and is a precursor to the principle of relativity. In describing the nature of the universe Herakleitos postulates that nothing can be created out of nothing, and nothing that disappears ceases to exist. This totality of forms, or mass-energy equivalence, is eternal and unchangeable in a constantly changing universe.

Energy is a concept that underlies our understanding of all physical phenomena and is a measure of the ability of a dynamical system to produce changes (motion) in its own system state as well as changes in the system states of its surroundings. Thermodynamics is a physical branch of science that deals with laws governing energy flow from one body to another and energy transformations from one form to another. These energy flow laws are captured by the fundamental principles known as the first and second laws of thermodynamics. The first law of thermodynamics gives a precise formulation of the equivalence between heat and work and states that among all system transformations, the net system energy is conserved. Hence, energy cannot be created out of nothing and cannot be destroyed; it can merely be transformed from one form to another.

The law of conservation of energy is not a mathematical truth, but rather the consequence of an immeasurable culmination of observations over the chronicle of our civilization, and is a fundamental axiom of the science of heat. The first law does not tell us whether any particular process can actually occur, that is, it does not restrict the ability to convert work into heat or heat into work, except that energy must be conserved in the process. The second law of thermodynamics asserts that, while the system energy is always conserved, it will be degraded to a point where it cannot produce any useful work. Hence, it is impossible to extract work from heat without at the same time discarding some heat, giving rise to an increasing quantity known as entropy.

As discussed in the recent monograph [1], there have been many different presentations of classical thermodynamics with varying hypotheses and conclusions. To exacerbate matters, the careless and considerable differences in the definitions of two of the key notions of thermodynamics—namely, the notions of reversibility and irreversibility—have contributed to the widespread confusion and lack of clarity in the exposition of classical thermodynamics over the past one and a half centuries. For example, the concept of a reversible process as defined by Clausius, Kelvin, Planck, and Carathéodory has very different meanings. In particular, Clausius defines a reversible (umkehrbar) process as a slowly varying process wherein successive states of this process differ by infinitesimals from the equilibrium system states. Such system transformations are commonly referred to as quasistatic transformations in the thermodynamic literature.

Alternatively, Kelvin’s notions of reversibility involve the ability of a system to completely recover its initial state from the final system state. Planck introduced several notions of reversibility. His main notion of reversibility is one of complete reversibility and involves recoverability of the original state of the dynamical system while at the same time restoring the environment to its original condition. Unlike Clausius’ notion of reversibility, Kelvin’s and Planck’s notions of reversibility do not require the system to exactly retrace its original trajectory in reverse order. Carathéodory’s notion of reversibility involves recoverability of the system state in an adiabatic process [2,3,4], resulting in yet another definition of thermodynamic reversibility. These subtle distinctions of (ir)reversibility are often unrecognized in the thermodynamic literature. Notable exceptions to this fact include [1,5,6], with [1,6] providing an excellent exposition of the relation between irreversibility, the second law of thermodynamics, and the arrow of time.

The arrow of time [7] remains one of physics’ most perplexing enigmas [8,9,10,11,12,13]. Even though time is one of the most familiar concepts humankind has ever encountered, it is the least understood. Puzzling questions of time’s mysteries have remained unanswered throughout the centuries—questions such as, Where does time come from? What would our universe look like without time? Can there be more than one dimension to time? Is time truly a fundamental appurtenance woven into the fabric of the universe, or is it just a useful edifice for organizing our perception of events? Why is the concept of time hardly ever found in the most fundamental physical laws of nature and the universe? Can we go back in time? And if so, can we change past events?

Human experience perceives time flow as unidirectional; the present is forever flowing toward the future and away from a forever fixed past. Many scientists have attributed this emergence of the direction of time flow to the second law of thermodynamics due to its intimate connection to the irreversibility of dynamical processes [14]. In this regard, thermodynamics is disjoint from Newtonian and Hamiltonian mechanics (including Einstein’s relativistic and Schrödinger’s quantum extensions), since these theories are invariant under time reversal, that is, they make no distinction between one direction of time and the other. Such theories possess a time-reversal symmetry, wherein, from any given moment of time, the governing laws treat past and future in exactly the same way [15]. For example, a film run backward of a harmonic oscillator over a full period or a planet orbiting the Sun would represent possible events. In contrast, a film run backward of water in a glass coalescing into a solid ice cube or ashes self-assembling into a log of wood would immediately be identified as an impossible event. Over the centuries, many philosophers and scientists shared the views of a Parmenidean frozen river time theory. However, since the advent of the science of thermodynamics in the 19^th century, philosophy and science took a different point of view with the writings of Hegel, Bergson, Heidegger, Clausius, Kelvin, and Boltzmann; one involving time as our existential dimension. The idea that the second law of thermodynamics provides a physical foundation for the arrow of time has been postulated by many authors [9,16,17]. However, a convincing argument of this claim has never been given [1,6,10,12,18].

In this paper, we use energy flow compartmental dynamical system theory to place thermodynamics on a system-theoretic foundation so as to harmonize it with classical mechanics. In particular, we develop a novel formulation of thermodynamics that can be viewed as a moderate-sized system theory as compared to statistical thermodynamics. This middle-ground theory involves deterministic large-scale dynamical system models that bridge the gap between classical and statistical thermodynamics. Specifically, since thermodynamic models are concerned with energy flow among subsystems, we use a state space formulation to develop a nonlinear compartmental dynamical system model that is characterized by energy conservation laws capturing the exchange of energy between coupled macroscopic subsystems. Furthermore, using graph-theoretic notions, we state two thermodynamic axioms consistent with the zeroth and second laws of thermodynamics, which ensure that our large-scale dynamical system model gives rise to a thermodynamically consistent energy flow model. Specifically, using a large-scale dynamical systems theory perspective for thermodynamics, we show that our compartmental dynamical system model leads to a precise formulation of the equivalence between work energy and heat in a large-scale dynamical system.

Next, we give a deterministic definition of entropy for a large-scale dynamical system that is consistent with the classical thermodynamic definition of entropy, and we show that it satisfies a Clausius-type inequality leading to the law of entropy nonconservation. However, unlike classical thermodynamics, wherein entropy is not defined for arbitrary states out of equilibrium, our definition of entropy holds for nonequilibrium dynamical systems. Then, using Lyapunov stability theory, we show that in the absence of energy exchange with the environment our thermodynamically consistent large-scale nonlinear dynamical system model possesses a continuum of equilibria and is semistable, that is, it has subsystem energies convergent to Lyapunov stable energy equilibria determined by the large-scale system’s initial subsystem energies.

For our thermodynamically consistent dynamical system model, we further establish the existence of a unique continuously differentiable global entropy function for all equilibrium and nonequilibrium states. Using this global entropy function, we go on to establish a clear connection between thermodynamics and the arrow of time. Specifically, we rigorously show the state irrecoverability and hence the state irreversibility [6,19] nature of thermodynamics. In particular, we show that for every nonequilibrium system state and corresponding system trajectory of our thermodynamically consistent large-scale nonlinear dynamical system, there does not exist a state such that the corresponding system trajectory completely recovers the initial system state of the dynamical system and at the same time restores the energy supplied by the environment back to its original condition. This, along with the existence of a global strictly increasing entropy function on every nontrivial system trajectory, gives a clear time-reversal asymmetry characterization of thermodynamics, establishing the emergence of the direction of time flow. Finally, since for every physical system energy and temperature equipartition is achieved in finite time rather than merely asymptotically, we merge the theories of semistability and finite-time stability developed in [20,21,22] to develop a mathematically rigorous framework for finite-time thermodynamics.

2. Dynamical System Model

In this section, we establish notation and provide a general axiomatic definition of a dynamical system. The notation used in this paper is fairly standard. Specifically,

R

denotes the set of real numbers,

{\bar{Z}}_{+}

(respectively,

Z_{+}

) denotes the set of nonnegative (respectively, positive) integers,

R^{q}

denotes the set of

q \times 1

column vectors,

{(\cdot)}^{T}

denotes transpose, and

I_{q}

or I denotes the

q \times q

identity matrix. For

z \in R^{q}

we write

z \geq \geq 0

(respectively,

z > > 0

) to indicate that every component of z is nonnegative (respectively, positive). In this case we say that z is nonnegative or positive, respectively. Furthermore, let

{\bar{R}}_{+}^{q}

and

R_{+}^{q}

denote the nonnegative and positive orthants of

R^{q}

, that is, if

z \in R^{q}

, then

z \in {\bar{R}}_{+}^{q}

and

z \in R_{+}^{q}

are equivalent, respectively, to

z \geq \geq 0

and

z > > 0

. Finally, let

\partial S

,

\overset{̊}{S}

, and

\bar{S}

denote the boundary, the interior, and the closure of the set

S

, respectively.

We write ‖ · ‖ for the Euclidean vector norm,

V^{'} (z)

for the Fréchet derivative of V at z,

B_{ε} (α)

,

α \in R^{q}

,

ε > 0

, for the open ball centered at α with radius ε, and

z (t) \to M

as

t \to \infty

to denote that

z (t)

approaches the set

M

(that is, for each

ε > 0

there exists

T > 0

such that dist

(z (t), M) < ε

for all

t > T

, where dist

(p, M) ≜ {inf}_{z \in M} ∥ p - z ∥

). Finally, the notions of openness, convergence, continuity, and compactness that we use throughout the paper refer to the topology generated on

D \subseteq R^{q}

by the norm

∥ \cdot ∥

.

Next, we define a dynamical system as a precise mathematical object satisfying a set of axioms. For this definition, let

U

denote an input space that consists of bounded continuous U-valued functions on

[0, \infty)

. The set

U \subseteq R^{m}

contains the set of input values, that is, at any time

t \geq t_{0}

,

u (t) \in U

. The space

U

is assumed to be closed under the shift operator, that is, if

u \in U

, then the function

u_{T}

defined by

u_{T} (t) ≜ u (t + T)

is contained in

U

for all

T \geq 0

. Furthermore, we let

Y

denote an output space that consists of continuous Y-valued functions on

[0, \infty)

. The set

Y \subseteq R^{l}

contains the set of output values, that is, each value of

y (t) \in Y

,

t \geq t_{0}

. The space

Y

is assumed to be closed under the shift operator, that is, if

y \in Y

, then the function

y_{T}

defined by

y_{T} (t) ≜ y (t + T)

is contained in

Y

for all

T \geq 0

.

Definition 2.1

Let

D

be a Euclidean space with norm ∥ · ∥. A dynamical system on

D

is the octuple

(D, U, U, Y, Y,

[0, \infty), s, h)

, where

s : [0, \infty) \times D \times U \to D

and

h : D \times U \to Y

are such that the following axioms hold:

(i): (Continuity): $s (\cdot, \cdot, u)$ is jointly continuous for all $u \in U$ .
(ii): (Consistency): $s (t_{0}, x_{0}, u) = x_{0}$ for all $t_{0} \in R$ , $x_{0} \in D$ , and $u \in U$ .
(iii): (Determinism): $s (t, x_{0}, u_{1}) = s (t, x_{0}, u_{2})$ for all $t \in [t_{0}, \infty)$ , $x_{0} \in D$ , and $u_{1}$ , $u_{2} \in U$ satisfying $u_{1} (τ) = u_{2} (τ)$ , $τ \leq t$ .
(iv): (Semigroup property): $s (τ, s (t, x_{0}, u), u) = s (t + τ, x_{0}, u)$ for all $x_{0} \in D$ , $u \in U$ , and τ, $t \in [t_{0}, \infty)$ .
(v): (Read-out map): For every $x_{0} \in D$ , $u \in U$ , and $t_{0} \in R$ , there exists $y \in Y$ such that $y (t) = h (s (t, x_{0},$ $u), u (t))$ for all $t \geq t_{0}$ .

We denote the dynamical system

(D, U, U, Y, Y,

[0, \infty), s, h)

by

G

. Furthermore, we refer to the map

s (\cdot, \cdot, \cdot)

as the flow or trajectory of

G

corresponding to

x_{0} \in D

, and for a given

s (t, x_{0}, u)

,

t \geq t_{0}

,

u \in U

, we refer to

x_{0} \in D

as an initial condition of

G

. Given

t \in R

, we denote the map

s (t, \cdot, \cdot) : D \times U \to D

by

s_{t} (x_{0}, u)

. Hence, for a fixed

t \in R

the set of mappings defined by

s_{t} (x_{0}, u) = s (t, x_{0}, u)

for every

x_{0} \in D

and

u \in U

gives the flow of

G

. In particular, if

D_{0}

is a collection of initial conditions such that

D_{0} \subset D

, then the flow

s_{t} : D_{0} \times U \to D

is the motion of all points

x_{0} \in D_{0}

or, equivalently, the image of

D_{0} \subset D

under the flow

s_{t}

, that is,

s_{t} (D_{0}, U) \subset D

, where

s_{t} (D_{0}, U) ≜ {y : y = s_{t} (x_{0}, u) for all x_{0} \in D

and

u \in U}

. Alternatively, if the initial condition

x_{0} \in D

is fixed and we let

[t_{0}, t_{1}] \subset R

and

u \in U

, then the mapping

s (\cdot, x_{0}, u) : [t_{0}, t_{1}] \to D

defines the solution curve or trajectory of the dynamical system

G

. Hence, the mapping

s (\cdot, x_{0}, u)

generates a graph in

[t_{0}, t_{1}] \times D

identifying the trajectory corresponding to the motion along a curve through the point

x_{0}

with input

u \in U

in a subset

D

of the state space. Given

x \in D

and

u \in U

, we denote the map

s (\cdot, x, u) : R \to D

by

s^{x} (t, u)

.

In general, the output of

G

depends on both the present input of

G

and the past history of

G

. Hence, the output at some time

t_{1}

depends on the state

s (t_{1}, x_{0}, u)

of

G

, which effectively serves as an information storage (memory) of past history. Furthermore, the determinism axiom ensures that the state and thus the output before some time

t_{1}

are not influenced by the values of the output after time

t_{1}

. Hence, future inputs to

G

do not affect past and present outputs of

G

. This is simply a statement of causality that holds for all physical systems. Finally, we note that the read-out map is memoryless in the sense that outputs only depend on the instantaneous (present) values of the state and input.

The dynamical system

G

is isolated if

u (t) \equiv 0

. Furthermore, an equilibrium point of the isolated dynamical system

G

is a point

x_{e} \in D

satisfying

s (t, x_{e}, 0) = x_{e}, t \geq t_{0}

. An equilibrium point

x_{e} \in D_{c} \subseteq D

of the isolated dynamical system

G

is Lyapunov stable with respect to the positively invariant set

D_{c}

if, for every relatively open subset

N_{ε}

of

D_{c}

containing

x_{e}

, there exists a relatively open subset

N_{δ}

of

D_{c}

containing

x_{e}

such that

s_{t} (N_{δ}, U) \subset N_{ε}

for all

t \geq t_{0}

, where

U = {u : R \to R^{m} : u (t) \equiv 0}

. An equilibrium point

x_{e} \in D_{c}

of the isolated dynamical system

G

is called semistable if it is Lyapunov stable and there exists a relatively open subset

N

of

D_{c}

containing

x_{e}

such that for all initial conditions in

N

, the trajectory of

G

converges to a Lyapunov stable equilibrium point, that is,

∥ s (t, x, 0) - y ∥ \to 0

as

t \to \infty

, where

y \in D_{c}

is a Lyapunov stable equilibrium point of

G

and

x \in N

. The isolated dynamical system

G

is said to be semistable if every equilibrium point of

G

is semistable.

Finally, for a given interval

[t_{0}, t_{1}]

, where

0 \leq t_{0} < t_{1} < \infty

, let

W_{[t_{0}, t_{1}]}

denote the set of all possible trajectories of given by

\begin{matrix} \begin{matrix} W_{[t_{0}, t_{1}]} & {s^{x} : [t_{0}, t_{1}] \times U \to D : s^{x} (\cdot, u (\cdot)) satisfies Axioms (i) - (iv) \\ of Definition 2.1, x \in D, and u (\cdot) \in U} \end{matrix} \end{matrix}

(1)

where

s^{x} (\cdot, u (\cdot))

denotes the solution curve or trajectory of

G

for a given fixed initial condition

x \in D

and input

u (\cdot) \in U

.

3. Reversibility, Irreversibility, Recoverability and Irrecoverability

The notions of reversibility, irreversibility, recoverability, and irrecoverability all play a central role in thermodynamic processes. In this section, we define the notions of R-state reversibility, state reversibility, and state recoverability of a dynamical system

G

. R-state reversibility concerns the existence of a system state with the property that a transformed system trajectory through an involution operator R is an image of a given system trajectory of

G

on a specified finite time interval. State reversibility concerns the existence of a system state with the property that the resulting system trajectory is the time-reversed image of a given system trajectory of

G

on a specified finite time interval. Finally, state recoverability concerns the existence of a system state with the property that the resulting system trajectory completely recovers the initial state of the dynamical system over a finite time interval.

For the results of this section we use the definition of a dynamical system given in Definition 2.1. We start by establishing the notions of (ir)reversibility and (ir)recoverability of a dynamical system

G

defined on a Euclidean space

D

.

Definition 3.1

Consider the dynamical system

G

defined on

D

. Let

R : D \to D

be an involutive operator (that is,

R^{2} = I_{D}

, where

I_{D}

denotes the identity operator on

D

) and let

s^{x} (\cdot, u (\cdot)) \in W_{[t_{0}, t_{1}]}

, where

u (\cdot) \in U

. The function

s^{- x} : [t_{0}, t_{1}] \times U \to D

is an R-reversed trajectory of

s^{x} (\cdot, u (\cdot))

if there exist an input

u^{-} (\cdot) \in U

and a continuous, strictly increasing function

τ : [t_{0}, t_{1}] \to [t_{0}, t_{1}]

such that

τ (t_{0}) = t_{0}

,

τ (t_{1}) = t_{1}

, and

\begin{matrix} s^{- x} (t, u^{-} (t)) = R s^{x} (t_{0} + t_{1} - τ (t), u (t_{0} + t_{1} - τ (t))), t \in [t_{0}, t_{1}] \end{matrix}

(2)

Definition 3.2

Consider the dynamical system

G

defined on

D

. Let

R : D \to D

be an involutive operator, let

r : U \times Y \to R

, and let

s^{x} (\cdot, u (\cdot)) \in W_{[t_{0}, t_{1}]}

, where

u (\cdot) \in U

.

s^{x} (\cdot, u (\cdot))

is an R-reversible trajectory of

G

if there exists an input

u^{-} (\cdot) \in U

such that

s^{- x} (\cdot, u^{-} (\cdot)) \in W_{[t_{0}, t_{1}]}

and

\begin{matrix} \int_{t_{0}}^{t_{1}} r (u (t), y (t)) d t + \int_{t_{0}}^{t_{1}} r (u^{-} (t), y^{-} (t)) d t = 0 \end{matrix}

(3)

where

y^{-} (\cdot)

denotes the read-out map for the R-reversed trajectory of

s^{x} (\cdot, u (\cdot))

. Furthermore,

G

is an R-state reversible dynamical system if for every

x \in D

,

s^{x} (\cdot, u (\cdot))

, where

u (\cdot) \in U

is an R-reversible trajectory of

G

.

In classical mechanics, R is a transformation that reverses the sign of all system momenta and magnetic fields, whereas in classical reversible thermodynamics R can be taken to be the identity operator. Note that if

R = I_{D}

, then

s^{x} (\cdot, u (\cdot))

, where

u (\cdot) \in U

is an

I_{D}

-reversible trajectory or, simply,

s^{x} (\cdot, u (\cdot))

is a reversible trajectory. Furthermore, we say that

G

is a state reversible dynamical system if and only if for every

x \in D

,

s^{x} (\cdot, u (\cdot))

, where

u (\cdot) \in U

is a reversible trajectory of

G

. Note that unlike state reversible systems, R-state reversible dynamical systems need not retrace every stage of the original system trajectory in reverse order, nor is it necessary for the dynamical system to recover the initial system state.

The function

r (u, y)

in Definition 3.2 is a generalized power supply from the environment to the dynamical system through the system’s input-output ports

(u, y)

. Hence, Equation (3) ensures that the total generalized energy supplied to the dynamical system

G

by the environment is returned to the environment over a given R-reversible trajectory starting and ending at any given (not necessarily the same) state

x \in D

. Furthermore, Equation (3) ensures that a reversible process completely restores the original dynamic state of a system and at the same time restores the energy supplied by the environment back to its original condition. The following result provides sufficient conditions for the existence of an R-reversible trajectory of a nonlinear dynamical system

G

, and hence, establishes sufficient conditions for R-state reversibility of the dynamical system

G

.

Theorem 3.1

Consider the dynamical system

G

defined on

D

. Let

R : D \to D

be an involutive operator, and let

s^{x} (\cdot, u (\cdot)) \in W_{[t_{0}, t_{1}]}

, where

u (\cdot) \in U

. Assume there exist a continuous function

V : D \to R

and a function

r : U \times Y \to R

such that

V (x) = V (R x)

,

x \in D

, and for every

x \in D

and all

{\hat{t}}_{0}

,

{\hat{t}}_{1}

,

t_{0} \leq {\hat{t}}_{0} < {\hat{t}}_{1} \leq t_{1}

,

\begin{matrix} V (s^{x} ({\hat{t}}_{1}, u ({\hat{t}}_{1}))) \geq V (s^{x} ({\hat{t}}_{0}, u ({\hat{t}}_{0}))) + \int_{{\hat{t}}_{0}}^{{\hat{t}}_{1}} r (u (t), y (t)) d t \end{matrix}

(4)

Furthermore, assume there exists

M \subset D

such that for all

{\hat{t}}_{0}

,

{\hat{t}}_{1}

,

t_{0} \leq {\hat{t}}_{0} < {\hat{t}}_{1} \leq t_{1}

, and

s^{x} (t, u (t)) \notin M

,

t \in [{\hat{t}}_{0}, {\hat{t}}_{1}]

, Equation (4) holds as a strict inequality. If

s^{x} (\cdot, u (\cdot))

is an R-reversible trajectory of

G

, then

s^{x} (t, u (t)) \in M

,

t \in [t_{0}, t_{1}]

.

Proof.

Let

s^{x} (\cdot, u (\cdot)) \in M_{[t_{0}, t_{1}]}

, where

u (\cdot) \in U

, be an R-reversible trajectory of

G

so that there exists

u^{-} (\cdot) \in U

such that

s^{- x} (\cdot, u^{-} (\cdot)) \in M_{[t_{0}, t_{1}]}

. Suppose, ad absurdum, there exists

t \in [t_{0}, t_{1}]

such that

s^{x} (t, u (t)) \notin M

. Now, it follows that there exists an interval

[{\hat{t}}_{0}, {\hat{t}}_{1}] \subset [t_{0}, t_{1}]

such that for

t_{0} \leq {\hat{t}}_{0} < {\hat{t}}_{1} \leq t_{1}

,

\begin{matrix} V (s^{x} ({\hat{t}}_{1}, u ({\hat{t}}_{1}))) > V (s^{x} ({\hat{t}}_{0}, u ({\hat{t}}_{0}))) + \int_{{\hat{t}}_{0}}^{{\hat{t}}_{1}} r (u (t), y (t)) d t \end{matrix}

(5)

which further implies that

\begin{matrix} V (s^{x} (t_{1}, u (t_{1}))) > V (s^{x} (t_{0}, u (t_{0}))) + \int_{t_{0}}^{t_{1}} r (u (t), y (t)) d t \end{matrix}

(6)

Next, since

s^{- x} (\cdot, u^{-} (\cdot)) \in M_{[t_{0}, t_{1}]}

, where

u^{-} (\cdot) \in U

, it follows that

\begin{matrix} V (s^{- x} (t_{1}, u^{-} (t_{1}))) \geq V (s^{- x} (t_{0}, u^{-} (t_{0}))) + \int_{t_{0}}^{t_{1}} r (u^{-} (t), y^{-} (t)) d t \end{matrix}

(7)

Now, adding Equations (6) and (7), using the definition of

s^{- x} (\cdot, u^{-} (\cdot))

, using the fact that

V (x) = V (R x)

,

x \in D

, and using Equation (3) yields

\begin{matrix} V (s^{x} (t_{0}, u (t_{0}))) + V (s^{x} (t_{1}, u (t_{1}))) > V (s^{x} (t_{0}, u (t_{0}))) + V (s^{x} (t_{1}, u (t_{1}))) \end{matrix}

which is a contradiction. Hence,

s^{x} (t, u (t)) \in M

,

t \in [t_{0}, t_{1}]

. ☐

It is important to note that since

V : D \to R

in Theorem 3.1 is not sign definite, Theorem 3.1 also holds for the case where the inequality in Equation (4) is reversed. The following corollary to Theorem 3.1 is immediate.

Corollary 3.1

Consider the dynamical system

G

defined on

D

. Let

R : D \to D

be an involutive operator, let

M \subset D

, and let

s^{x} (\cdot, u (\cdot)) \in W_{[t_{0}, t_{1}]}

, where

u (\cdot) \in U

. Assume there exists a continuous function

V : D \to R

such that

V (x) = V (R x)

,

x \in D

, and for

s^{x} (t, u (t)) \notin M

,

t \in [t_{1}, t_{2}]

,

V (s (t, x_{0}, u (\cdot)))

is a strictly increasing (respectively, decreasing) function of time. If

s^{x} (\cdot, u (\cdot))

is an R-reversible trajectory of

G

, then

s^{x} (t, u (t)) \in M

,

t \in [t_{0}, t_{1}]

.

Proof.

The proof is a direct consequence of Theorem 3.1 with

r (u, y) \equiv 0

and the fact that Theorem 3.1 also holds for the case when the inequality in Equation (4) is reversed. ☐

It follows from Corollary 3.1 that if, for a given dynamical system

G

, there exists an R-reversible trajectory of

G

, then there does not exist a function of the state of the system that strictly decreases or strictly increases in time on any trajectory of

G

lying in

M

. In this case, the existence of a completely ordered time set having a topological structure involving a closed set homeomorphic to the real line cannot be established. Such systems, which include lossless Newtonian and Hamiltonian systems, are time-reversal symmetric and hence lack an inherent time direction. However, that is not the case with thermodynamic systems.

Next, we present a notion of state recoverability of a dynamical system

G

.

Definition 3.3

Consider the dynamical system

G

defined on

D

. Let

r : U \times Y \to R

, and let

s^{x} (\cdot, u (\cdot)) \in W_{[t_{0}, t_{1}]}

, where

u (\cdot) \in U

.

s^{x} (\cdot, u (\cdot))

is a recoverable trajectory of

G

if there exist

u^{-} (\cdot) \in U

and

t_{2} > t_{1}

such that

u^{-} : [t_{1}, t_{2}] \to U

,

\begin{matrix} s (t_{2}, s^{x} (t_{1}, u (t_{1})), u^{-} (t_{2})) = s^{x} (t_{0}, u (t_{0})) \end{matrix}

(8)

and

\begin{matrix} \int_{t_{0}}^{t_{1}} r (u (t), y (t)) d t + \int_{t_{1}}^{t_{2}} r (u^{-} (t), y^{-} (t)) d t = 0 \end{matrix}

(9)

where

y^{-} (\cdot)

denotes the read-out map for the trajectory

s (\cdot, s^{x} (t_{1}, u (t_{1})),

u^{-} (\cdot))

. Furthermore,

G

is a state recoverable dynamical system if for every

x \in D

,

s^{x} (\cdot, u (\cdot))

is a recoverable trajectory of

G

.

It follows from the definition of state recoverability that the way in which the initial dynamical system state is restored may be chosen freely so long as Equation (9) is satisfied. Hence, unlike R-state reversibility, it is not necessary for the dynamical system to recover the initial state of the system through an involutive transformation of the system trajectory. Furthermore, unlike state reversibility, it is not necessary for the dynamical system to retrace every stage of the original trajectory in the reverse order. However, Equation (9) ensures that the recoverable process completely restores the original dynamic state and at the same time restores the energy supplied by the environment back to its original condition. This notion of recoverability is closely related to Planck’s notion of complete reversibility, wherein the initial system state is restored in the totality of nature (“die gesamte Natur"). The following result provides a sufficient condition for the existence of a recoverable trajectory of a nonlinear dynamical system

G

, and hence, establishes sufficient conditions for state recoverability of

G

.

Theorem 3.2

Consider the dynamical system

G

defined on

D

. Let

s^{x} (\cdot, u (\cdot)) \in W_{[t_{0}, t_{1}]}

, where

u (\cdot) \in U

. Assume there exist a continuous function

V : D \to R

and a function

r : U \times Y \to R

such that for every

x \in D

and all

{\hat{t}}_{0}

,

{\hat{t}}_{1}

,

t_{0} \leq {\hat{t}}_{0} < {\hat{t}}_{1} \leq t_{1}

,

\begin{matrix} V (s^{x} ({\hat{t}}_{1}, u ({\hat{t}}_{1}))) & \geq & V (s^{x} ({\hat{t}}_{0}, u ({\hat{t}}_{0}))) + \int_{{\hat{t}}_{0}}^{{\hat{t}}_{1}} r (u (t), y (t)) d t \end{matrix}

(10)

Furthermore, assume there exists

M \subset D

such that for all

{\hat{t}}_{0}

,

{\hat{t}}_{1}

,

t_{0} \leq {\hat{t}}_{0} < {\hat{t}}_{1} \leq t_{1}

, and

s^{x} (t, u (t)) \notin M

,

t \in [{\hat{t}}_{0}, {\hat{t}}_{1}]

, Equation (10) holds as a strict inequality. If

s^{x} (\cdot, u (\cdot))

is a recoverable trajectory of

G

, then

s^{x} (t, u (t)) \in M

,

t \in [t_{0}, t_{1}]

.

Proof.

Let

s^{x} (\cdot, u (\cdot)) \in W_{[t_{0}, t_{1}]}

, where

u (\cdot) \in U

, be a recoverable trajectory of

G

so that there exist

u^{-} (\cdot) \in U

and

t_{2} > t_{1}

such that

s (t_{2}, s^{x} (t_{1}, u (t_{1})), u^{-} (t_{2})) = s^{x} (t_{0}, u (t_{0}))

. Suppose, ad absurdum, there exists

t \in [t_{0}, t_{1}]

such that

s^{x} (t, u (t)) \notin M

. Now, it follows that there exists an interval

[{\hat{t}}_{0}, {\hat{t}}_{1}] \subset [t_{0}, t_{1}]

such that for

t_{0} \leq {\hat{t}}_{0} < {\hat{t}}_{1} \leq t_{1}

,

\begin{matrix} V (s^{x} ({\hat{t}}_{1}, u ({\hat{t}}_{1}))) > V (s^{x} ({\hat{t}}_{0}, u ({\hat{t}}_{0}))) + \int_{{\hat{t}}_{0}}^{{\hat{t}}_{1}} r (u (t), y (t)) d t \end{matrix}

(11)

which further implies that

\begin{matrix} V (s^{x} (t_{1}, u (t_{1}))) > V (s^{x} (t_{0}, u (t_{0}))) + \int_{t_{0}}^{t_{1}} r (u (t), y (t)) d t \end{matrix}

(12)

Next, it follows from Equation (10) with

t_{2} > t_{1}

that

\begin{matrix} V (s (t_{2}, s^{x} (t_{1}, u (t_{1})), u^{-} (t_{2}))) \geq V (s (t_{1}, s^{x} (t_{1}, u (t_{1})), u^{-} (t_{1}))) + \int_{t_{1}}^{t_{2}} r (u^{-} (t), y^{-} (t)) d t \end{matrix}

(13)

Now, adding Equations (12) and (13), using the definition of

s (t_{2}, s^{x} (t_{1}, u (t_{1}),

u^{-} (t_{2})))

, and using Equation (9) yields

\begin{matrix} V (s^{x} (t_{0}, u (t_{0}))) + V (s^{x} (t_{1}, u (t_{1}))) > V (s^{x} (t_{0}, u (t_{0}))) + V (s^{x} (t_{1}, u (t_{1}))) \end{matrix}

which is a contradiction. Hence,

s^{x} (t, u (t)) \in M

,

t \in [t_{0}, t_{1}]

. ☐

The following corollary to Theorem 3.2 is immediate.

Corollary 3.2

Consider the dynamical system

G

defined on

D

. Let

M \subset D

, and let

s^{x} (\cdot, u (\cdot)) \in W_{[t_{0}, t_{1}]}

, where

u (\cdot) \in U

. Assume there exists a continuous function

V : D \to R

such that for

s^{x} (t, u (t)) \notin M

,

t \in [t_{0}, t_{1}]

,

V (s (t, x_{0}, u (\cdot))

is a strictly increasing (respectively, decreasing) function of time. If

s^{x} (\cdot, u (\cdot))

is a recoverable trajectory of

G

, then

s^{x} (t, u (t)) \in M

,

t \in [t_{0}, t_{1}]

.

Proof.

The proof is a direct consequence of Theorem 3.2 with

r (u, y) \equiv 0

and the fact that Theorem 3.2 also holds for the case when the inequality in Equation (10) is reversed. ☐

As in the case of R-state reversibility and state reversibility, state recoverability can be used to establish a connection between a dynamical system evolving on a manifold

M \subset D

and the arrow of time. However, in the case of state recoverability, the recoverable dynamical system trajectory need not involve an involutive transformation of the system trajectory, nor is it required to retrace the original system trajectory in recovering the original dynamic state. It should be noted here that state recoverability is not implied by the concepts of reachability and controllability, which play a central role in control theory [1]. For example, one might envision, albeit with a considerable stretch of the imagination, perfectly controlled inputs that could reassemble a broken egg or even fuse water into solid cubes of ice. However, in all such cases, an external source of energy from the environment would be required to operate such an immaculate state recoverable mechanism and would violate Equation (9). Clearly, state recoverability is a weaker notion than that of state reversibility since state reversibility implies state recoverability; the converse, however, is not true. Conversely, state irrecoverability is a logically stronger notion than state irreversibility since state irrecoverability implies state irreversibility. However, as we see in Section 8, these notions are equivalent for thermodynamic systems.

4. Reversible Dynamical Systems, Volume-Preserving Flows and Poincaré Recurrence

The notion of R-state reversibility introduced in Section 3 is one of the fundamental symmetries that arises in natural science. This notion can also be characterized by the flow of a dynamical system. In particular, consider the dynamical system given by

\begin{matrix} \dot{x} (t) & = & f (x (t)), x (t_{0}) = x_{0}, t \in I_{x_{0}} \end{matrix}

(14)

where

x (t) \in D \subseteq R^{q}

,

t \in I_{x_{0}}

, is the system state vector,

D

is an open subset of

R^{q}

,

f : D \to R^{q}

is locally Lipschitz continuous on

D

, and

I_{x_{0}} = [t_{0}, τ_{x_{0}})

,

t_{0} < τ_{x_{0}} \leq \infty

, is the maximal interval of existence for the solution

x (\cdot)

of Equation (14). Note that since

f (\cdot)

is locally Lipschitz continuous on

D

, it follows from Theorem 3.1 of ([23], p. 18) that the solution to Equation (14) is unique for every initial condition in

D

and jointly continuous in t and

x_{0}

. In this case, the semigroup property

s (t + τ, x_{0}) = s (t, s (τ, x_{0}))

,

t, τ \in I_{x_{0}}

, and the continuity of

s (t, \cdot)

on

D

,

t \in I_{x_{0}}

, hold. Given

t \in R

, we denote the flow

s (t, \cdot) : D \to D

of Equation (14) by

s_{t} (x_{0})

for

x_{0} \in D

, and given

x \in D

, we denote the trajectory

s (\cdot, x) : R \to D

of Equation (14) by

s^{x} (t)

. Now, in terms of the flow

s_{t} : D \to D

of Equation (14), the consistency and semigroup properties of Equation (14) can be equivalently written as

s_{0} (x_{0}) = x_{0}

and

(s_{τ} \circ s_{t}) (x_{0}) = s_{τ} (s_{t} (x_{0})) = s_{t + τ} (x_{0})

, where “∘" denotes the composition operator. Next, it follows from continuity of solutions and the semigroup property that the map

s_{t} : D \to D

is a continuous function with a continuous inverse

s_{- t}

. Thus,

s_{t}

,

t \in I_{x_{0}}

, generates a one-parameter family of homeomorphisms on

D

forming a commutative group under composition.

To show that R-state reversibility can be characterized by the flow of Equation (14), let

R : D \to D

be a continuous map of Equation (14) such that

\begin{matrix} \dot{R} (x (t)) = - f (R (x (t))), R (x (t_{0})) = R (x_{0}), t \in I_{R (x_{0})} \end{matrix}

(15)

Now, it follows from Equation (15) that

\begin{matrix} R \circ s_{t} = s_{- t} \circ R, t \in I_{x_{0}} \end{matrix}

(16)

Equation (16), with

R (\cdot)

satisfying Equation (15), defines an R-reversed trajectory of Equation (14) in the sense of Definition 3.1 with

τ (t) = t

.

In the context of classical mechanics involving the configuration manifold (space of generalized positions)

Q = R^{n}

, with governing equations given by

\begin{matrix} \dot{q} (t) & = & {(\frac{\partial H (q (t), p (t))}{\partial p (t)})}^{T}, & q (t_{0}) & = q_{0}, t \geq t_{0} \end{matrix}

(17)

\begin{matrix} \dot{p} (t) & = - {(\frac{\partial H (q (t), p (t))}{\partial q (t)})}^{T}, & p (t_{0}) = p_{0} \end{matrix}

(18)

where

q \in R^{n}

denotes generalized system positions,

p \in R^{n}

denotes generalized system momenta,

H : R^{n} \times R^{n} \to R

is the system Hamiltonian given by

H (q, p) ≜ {\dot{q}}^{T} p - L (q, \dot{q})

,

L (q, \dot{q})

is the system Lagrangian, [24,25] and

p (q, \dot{q}) ≜ {(\frac{\partial L (q, \dot{q})}{\partial \dot{q}})}^{T}

, the reversing symmetry

R : R^{n} \times R^{n} \to R^{n} \times R^{n}

is such that

R (q, p) = (q, - p)

and satisfies Equation (15). In this case,

R

is an involution. This implies that if

(q (t), p (t))

,

t \geq t_{0}

, is a solution to Equations (17) and (18), then

(q (- t), - p (- t))

,

t \geq t_{0}

, is also a solution to Equations (17) and (18) with initial condition

(q_{0}, - p_{0})

. In the configuration space this clearly shows the time-reversal nature of lossless mechanical systems.

Reversible dynamical systems tend to exhibit a phenomenon known as Poincaré recurrence [26]. Poincaré recurrence states that if a dynamical system has a fixed total energy that restricts its dynamics to bounded subsets of its state space, then the dynamical system will eventually return arbitrarily close to its initial system state infinitely often. More precisely, Poincaré [27] established the fact that if the flow of a dynamical system preserves volume and has only bounded orbits, then for each open set there exist orbits that intersect the set infinitely often. In order to state the Poincaré recurrence theorem, the following definitions are needed.

Definition 4.1

Let

V \subset R^{q}

be a bounded set. The volume

V_{vol}

of

V

is defined as

\begin{matrix} V_{vol} ≜ \int_{V} d V \end{matrix}

(19)

where the integration in Equation (19) is the Lebesgue integral over

V

.

Definition 4.2

Let

V \subset R^{q}

be a bounded set. A map

g : V \to Q

, where

Q \subset R^{q}

, is volume preserving if for every

V_{0} \subset V

, the volume of

g (V_{0})

is equal to the volume of

V_{0}

.

The following theorem, known as Liouville’s theorem [26], establishes sufficient conditions for volume-preserving flows. For the statement of this theorem, consider the nonlinear dynamical system given by Equation (14) and define the divergence of

f = {[f_{1}, \dots, f_{q}]}^{T} : D \to R^{q}

by

\begin{matrix} \nabla \cdot f (x) ≜ \sum_{i = 1}^{q} \frac{\partial f_{i} (x)}{\partial x_{i}} \end{matrix}

(20)

where ∇ denotes the nabla operator,

“ \cdot ”

denotes the dot product in

R^{q}

, and

x_{i}

denotes the ith component of x.

Theorem 4.1

Consider the nonlinear dynamical system given by Equation (14). If

\nabla \cdot f (x) \equiv 0

, then the flow

s_{t} : D \to D

of Equation (14) is volume preserving.

Proof.

Let

V \subset R^{q}

be a compact set such that its image at time t under the mapping

s_{t} (\cdot)

is given by

s_{t} (V)

. In addition, let

d S_{V}

denote an infinitesimal surface element of the boundary of the set

V

and let

\hat{n} (z), z \in \partial V

, denote an outward normal vector to the boundary of

V

. Then the change in volume of

s_{t} (V)

at

t = t_{0}

is given by

\begin{matrix} d s_{t} {(V)}_{vol} = \int_{\partial V} (f (x) \cdot \hat{n} (x)) d t d S_{V} \end{matrix}

(21)

which, using the divergence theorem, implies that

\begin{matrix} {\frac{d s_{t} {(V)}_{vol}}{d t}|}_{t = t_{0}} = \int_{\partial V} (f (x) \cdot \hat{n} (x)) d S_{V} = \int_{V} \nabla \cdot f (x) d V \end{matrix}

(22)

Hence, if

\nabla \cdot f (x) \equiv 0

, then

s_{t} (\cdot)

is a volume-preserving map. ☐

Volume preservation is the key conservation law underlying statistical mechanics. The flows of volume-preserving dynamical systems belong to one of the Lie pseudogroups [28] of diffeomorphisms. These systems arise in incompressible fluid dynamics, classical mechanics, and acoustics. Next, we state the well-known Poincaré recurrence theorem. For this result, let

g^{(n)} (x), n \in {\bar{Z}}_{+}

, denote the n-time composition operator of

g (x)

with itself and define

g^{(0)} (x) ≜ x

.

Theorem 4.2

Let

D \subset R^{q}

be an open bounded set, and let

g : D \to D

be a continuous, volume-preserving bijective (one-to-one and onto) map. Then for every open set

N \subset D

, there exists

n \in Z_{+}

such that

g^{(n)} (N) \cap N \neq Ø

. Furthermore, there exists a point

x \in N

which returns to

N

, that is,

g^{(n)} (x) \in N

for some

n \in Z_{+}

.

Proof.

The proof of this result is standard; see for example ([26], p. 72). For completeness of exposition, however, we provide a proof here. First, note that the images

g^{(p)} (N)

,

p \in {\bar{Z}}_{+}

, under the mapping

g (\cdot)

of the neighborhood

N \subset D

have the same volume and are all contained in

D

. Next, define the union of all the images of

N

by

\begin{matrix} V & ≜ & ⋃_{p = 0}^{\infty} g^{(p)} (N) \subset D \end{matrix}

(23)

Since the volume of a union of disjoint sets is the sum of the individual set volumes, it follows that if

g^{(p)} (N), p \in {\bar{Z}}_{+}

are disjoint, then

V_{vol} = \infty

. However,

V \subset D

and

D

is a bounded set by assumption. Hence, there exist

k, l \in {\bar{Z}}_{+}

, with

k > l

, such that

g^{(k)} (N) \cap g^{(l)} (N) \neq Ø

. Now, applying the inverse

g^{(- 1)}

to this relation l times and using the fact that

g (\cdot)

is a bijective map, it follows that

g^{(k - l)} (N) \cap N \neq Ø

. Thus,

g^{(n)} (N) \cap N \neq Ø

, where

n = k - l

. Hence, there exists a point

x \in N

such that

g^{(n)} (x) \in g^{(n)} (N) \cap N \subseteq N

. ☐

The next result establishes the existence of a point x in

D \subset R^{q}

such that

{lim}_{i \to \infty} g^{(n_{i})} (x) = x

for some sequence

{n_{i}}_{i = 1}^{\infty}

, with

n_{i} \to \infty

as

i \to \infty

, under a continuous, volume-preserving bijective mapping

g (\cdot)

which maps a bounded region

D

of a Euclidean space onto itself. Hence, x returns infinitely often to any open neighborhood of itself under the mapping

g (\cdot)

.

Theorem 4.3

Let

D \subset R^{q}

be an open bounded set, and let

g : D \to D

be a continuous, volume-preserving bijective map. Then for every open neighborhood

N \subset D

, there exists a point

x \in N

such that

{lim}_{i \to \infty} g^{(n_{i})} (x) = x

for some sequence

{n_{i}}_{i = 1}^{\infty}

, with

n_{i} \to \infty

as

i \to \infty

. Hence,

x \in N

returns to

N

infinitely often, that is, there exists a sequence

{n_{i}}_{i = 1}^{\infty}

, with

n_{i} \to \infty

as

i \to \infty

, such that

g^{(n_{i})} (x) \in N

for all

i \in Z_{+}

.

Proof.

Let

N \subset D

be an open set, and let

N_{1} ≜ B_{δ_{1}} (x_{1})

be such that

{\bar{N}}_{1} \subset N

for some

δ_{1} > 0

and

x_{1} \in N

. Applying Theorem 4.2, with

g (\cdot)

replaced by

g^{(- 1)} (\cdot)

, it follows that there exists

n_{1} \in Z_{+}

such that

g^{(- n_{1})} (N_{1}) \cap N_{1} \neq Ø

, which implies that

g^{(- n_{1})} ({\bar{N}}_{1}) \cap {\bar{N}}_{1} \neq Ø

. Now, let

N_{2} = B_{δ_{2}} (x_{2})

be such that

{\bar{N}}_{2} \subset g^{(- n_{1})} (N_{1}) \cap N_{1}

for some

δ_{2} > 0

and

x_{2} \in g^{(- n_{1})} (N_{1}) \cap N_{1}

. Repeating the above arguments it follows that there exists

n_{2} \in Z_{+}

,

n_{2} > n_{1}

, such that

g^{(- n_{2})} (N_{2}) \cap N_{2} \neq Ø

and

g^{(- n_{2})} ({\bar{N}}_{2}) \cap {\bar{N}}_{2} \neq Ø

. Repeating this process recursively, it follows that there exist sequences

{n_{i}}_{i = 1}^{\infty}

and

{δ_{i}}_{i = 1}^{\infty}

, with

n_{i} \to \infty

as

i \to \infty

,

δ_{i} \to 0

as

i \to \infty

, and

δ_{i} > δ_{i + 1}

,

i = 1, 2, \dots

, such that

N_{i} \supset N_{i + 1}

,

i = 1, 2, \dots

, and

g^{(- n_{i})} (N_{i}) \cap N_{i} \neq Ø

, where

N_{i} = B_{δ_{i}} (x_{i})

for some

x_{i} \in g^{(- n_{i - 1})} (N_{i - 1}) \cap N_{i - 1}

and where

n_{0} ≜ 0

and

N_{0} ≜ N

. Now, since

N_{i} \neq Ø

,

i \in Z_{+}

, it follows from the Cantor intersection theorem ([29], p. 56) that

Z ≜ ⋂_{i = 1}^{\infty} {\bar{N}}_{i} \neq Ø

. Furthermore, since

δ_{i} \to 0

as

i \to \infty

, it follows that

Z

is a singleton. Next, let

x \in Z = {x}

, and since for every

i \in Z_{+}

,

{\bar{N}}_{i + 1} \subset N_{i}

, it follows that

x \in N_{i}

,

i \in Z_{+}

. Now, note that

x \in N_{i + 1} \subset g^{(- n_{i})} (N_{i}) \cap N_{i}

for all

i \in Z_{+}

, which implies that

g^{(n_{i})} (z) \in N_{i}

,

i \in Z_{+}

. Hence, since

δ_{i} \to 0

as

i \to \infty

, it follows that

{lim}_{i \to \infty} g^{(n_{i})} (x) = x

. ☐

The next theorem strengthens Poincaré’s theorem by showing that for every open neighborhood

N

of

D \subset R^{q}

, there exists a subset of

N

that is dense [30] in

N

so that almost every moving point in

N

returns repeatedly to the vicinity of its initial position under a continuous, volume-preserving bijective mapping which maps the bounded region

D

onto itself.

Theorem 4.4

Let

D \subset R^{q}

be an open bounded set, and let

g : D \to D

be a continuous, volume-preserving bijective map. Then for every open neighborhood

N \subset D

, there exists a dense subset

V \subset N

such that for every point

z \in V

,

{lim}_{i \to \infty} g^{(n_{i})} (x) = x

for some sequence

{n_{i}}_{i = 1}^{\infty}

, with

n_{i} \to \infty

as

i \to \infty

.

Proof.

Let

N \subset D

be an open neighborhood and define

V \subset N

by

\begin{matrix} \begin{matrix} V ≜ {x \in N : & there exists a sequence {n_{i}}_{i = 1}^{\infty}, with n_{i} \to \infty \\ as i \to \infty, such that lim_{i \to \infty} g^{(n_{i})} (x) = x} \end{matrix} \end{matrix}

(24)

Now, let

x \in N

and let

{δ_{i}}_{i = 1}^{\infty}

be a strictly decreasing positive sequence with

δ_{i} \to 0

as

i \to \infty

and

B_{δ_{1}} (x) \subset N

. It follows from Theorem 4.3 that for every

i \in Z_{+}

, there exists

x_{i} \in B_{δ_{i}} (x)

such that

{lim}_{k \to \infty} g^{(n_{k})} (x_{i}) = x_{i}

for some sequence

{n_{k}}_{k = 1}^{\infty}

, with

n_{k} \to \infty

as

k \to \infty

, which implies that

x_{i} \in V

,

i \in Z_{+}

. Next, since

{lim}_{i \to \infty} x_{i} = x

, it follows that

x \in \bar{V}

, which implies that

V \subseteq N \subset \bar{V}

, and hence,

V

is a dense subset of

N

. ☐

It follows from Theorem 4.4 that almost every point in

D \subset R^{q}

will return infinitely many times to any open neighborhood of itself under a continuous, volume-preserving bijective mapping which maps a bounded region

D

of a Euclidean space onto itself. The following theorem provides several equivalent statements for establishing Poincaré recurrence.

Theorem 4.5

Let

D \subset R^{q}

be an open bounded set, and let

g : D \to D

be a continuous, bijective map. Then the following statements are equivalent:

(i): For every open set $N \subset D$ , there exists a dense subset $V \subset N$ such that, for every point $z \in V$ , ${lim}_{i \to \infty} g^{(n_{i})} (x) = x$ for some sequence ${n_{i}}_{i = 1}^{\infty}$ , with $n_{i} \to \infty$ as $i \to \infty$ .
(ii): For every open set $N \subset D$ , there exists a point $x \in N$ such that ${lim}_{i \to \infty} g^{(n_{i})} (x) = x$ for some sequence ${n_{i}}_{i = 1}^{\infty}$ , with $n_{i} \to \infty$ as $i \to \infty$ .
(iii): For every open set $N \subset D$ , there exists a point $x \in N$ which returns to $N$ infinitely often, that is, $g^{(n_{i})} (x) \in N$ , $i \in Z_{+}$ , for some sequence ${n_{i}}_{i = 1}^{\infty}$ , with $n_{i} \to \infty$ as $i \to \infty$ .
(iv): For every open set $N \subset D$ , there exists a point $x \in N$ which returns to $N$ , that is, $g^{(n)} (x) \in N$ for some $n \in Z_{+}$ .
(v): For every open set $N \subset D$ , there exists $n \in Z_{+}$ such that $g^{(n)} (N) \cap N \neq Ø$ .

Proof.

The implication (i) implies (ii) follows trivially and the proof of (ii) implies (i) is identical to that of Theorem 4.4. The implications (ii) implies (iii), (iii) implies (iv), and (iv) implies (v) follow trivially. The proof of (v) implies (ii) is identical to that of Theorem 4.3. ☐

Note that it follows from Theorems 4.2, 4.3, and 4.4 that a continuous, bijective map

g : D \to D

exhibits Poincaré recurrence (that is, the statements in Theorem 4.5 hold) if

g (\cdot)

is volume preserving. For the remainder of this section we consider the nonlinear dynamical system given by Equation (14) and assume that the solutions to Equation (14) are defined for all

t \in R

. Recall that if all solutions to Equation (14) are bounded, then it follows from the Peano–Cauchy theorem ([23], pp. 16–17) that

I_{x_{0}} = R

. The following theorem shows that if a dynamical system preserves volume, then almost all trajectories return arbitrarily close to their initial position infinitely often.

Theorem 4.6

Consider the nonlinear dynamical system given by Equation (14). Assume that the flow

s_{t} : D \to D

of Equation (14) is volume preserving and maps an open bounded set

D_{c} \subset R^{q}

onto itself, that is,

D_{c}

is an invariant set with respect to Equation (14). Then the nonlinear dynamical system given by Equation (14) exhibits Poincaré recurrence, that is, almost every point

x \in D_{c}

returns to every open neighborhood

N \subset D_{c}

of x infinitely many times.

Proof.

Since

f : D \to R^{q}

is locally Lipschitz continuous on

D

and

s_{t} (\cdot)

maps an open bounded set

D_{c} \subset R^{n}

onto itself, it follows that the solutions to Equation (14) are bounded and unique for all

t \in R

and

x_{0} \in D_{c}

. Thus, the mapping

s_{t} (\cdot)

is bijective. Furthermore, since the solutions of Equation (14) are continuously dependent on the system’s initial conditions, it follows that

s_{t} (\cdot)

is continuous. Now, the result follows as a direct consequence of Theorem 4.4 with

g (\cdot) = s_{t} (\cdot)

for every

t \geq t_{0}

. ☐

It follows from Theorem 4.6 that a nonlinear dynamical system exhibits Poincaré recurrence if one of the statements in Theorem 4.5 holds with

g (\cdot) = s_{t} (\cdot)

for every

t \geq t_{0}

. Note that in this case it follows from (ii) of Theorem 4.5 that Poincaré recurrence is equivalent to the existence of a point

x \in N \subset D_{c}

such that x belongs to its positive limit set

ω (x)

, that is,

x \in ω (x)

.

All Hamiltonian dynamical systems of the form given by Equations (17) and (18) exhibit Poincaré recurrence since they possess volume-preserving flows and are conservative in the sense that the Hamiltonian function

H (q, p)

remains constant along system trajectories. To see this, note that with

x ≜ {[q^{T}, p^{T}]}^{T}

, Equations (17) and (18) can be rewritten as

\begin{matrix} \dot{x} (t) = J {(\frac{\partial H}{\partial x} (x (t)))}^{T}, x (t_{0}) = x_{0}, t \geq t_{0} \end{matrix}

(25)

where

x_{0} ≜ {[q_{0}^{T}, p_{0}^{T}]}^{T} \in R^{2 n}

and

\begin{matrix} J ≜ [\begin{matrix} 0_{n} & I_{n} \\ - I_{n} & 0_{n} \end{matrix}] \end{matrix}

(26)

Now, since

\begin{matrix} \dot{H} (x) = (\frac{\partial H}{\partial x} (x)) J {(\frac{\partial H}{\partial x} (x))}^{T} = 0, x \in R^{2 n} \end{matrix}

(27)

the Hamiltonian function

H (\cdot)

is conserved along the flow of Equation (25). If

H (\cdot)

is bounded from below and is radially unbounded, then every trajectory of the Hamiltonian system given by Equation (25) is bounded. Hence, by choosing the bounded region

D ≜ {x \in R^{2 n} : H (x) \leq η}

, where

η \in R

and

η > 0

, it follows that the flow

s_{t} (\cdot)

of Equation (25) maps the bounded region

D

onto itself. Since

η > 0

is arbitrary, the region

D

can be chosen arbitrarily large. Furthermore, since Equation (25) possesses unique solutions over

R

, it follows that the mapping

s_{t} (\cdot)

is one-to-one and onto. Moreover,

\begin{matrix} \nabla \cdot J {(\frac{\partial H}{\partial x} (x))}^{T} = \sum_{i = 1}^{n} \frac{\partial^{2} H (q, p)}{\partial q_{i} \partial p_{i}} - \sum_{i = 1}^{n} \frac{\partial^{2} H (q, p)}{\partial p_{i} \partial q_{i}} = 0, x \in R^{2 n} \end{matrix}

(28)

which, by Theorem 4.1, shows that the flow

s_{t} (\cdot)

of Equation (25) is volume preserving. Finally, since the flow

s_{t} (\cdot)

of Equation (25) is volume preserving, continuous, and bijective, and

s_{t} (\cdot)

maps a bounded region of a Euclidean space onto itself, it follows from Theorem 4.6 that the Hamiltonian dynamical system given by Equation (25) exhibits Poincaré recurrence. That is, in every open neighborhood

N

of every point

x_{0} \in R^{2 n}

there exists a point

y \in N

such that the trajectory

s (t, y), t \geq t_{0}

, of Equation (25) will return to

N

infinitely many times.

Poincaré recurrence has been the main source for the long and fierce debate between the microscopic and macroscopic points of view of thermodynamics [1]. In thermodynamic models predicated on statistical mechanics, an isolated dynamical system will return arbitrarily close to its initial state of molecular positions and velocities infinitely often. If the system entropy is determined by the state variables, then it must also return arbitrarily close to its original value, and hence, undergo cyclical changes. This apparent contradiction between the behavior of a mechanical system of particles and the second law of thermodynamics remains one of the hardest and most controversial problems in statistical physics. The resolution of this paradox lies in the controversial statement that as system dimensionality increases, the recurrence time increases at an extremely fast rate. Nevertheless, the shortcoming of the mechanistic world view of thermodynamics is the absence of the emergence of damping in lossless mechanical systems. The emergence of damping is, however, ubiquitous in isolated [31] thermodynamic systems. Hence, the development of a viable dynamical system model for thermodynamics must guarantee the absence of Poincaré recurrence. The next set of results presents sufficient conditions for the absence of Poincaré recurrence for the nonlinear dynamical system given by Equation (14). First, however, define the set of equilibria for the nonlinear dynamical system given by Equation (14) in

D

by

M_{e} ≜ {x \in D : f (x) = 0}

.

Theorem 4.7

Consider the nonlinear dynamical system given by Equation (14) and assume that

D ∖ M_{e} \neq Ø

. Assume that there exists a continuous function

V : D \to R

such that for every

x_{0} \in D ∖ M_{e}

,

V (s (t, x_{0}))

,

t \geq t_{0}

, is a strictly increasing (respectively, decreasing) function of time. Then the nonlinear dynamical system given by Equation (14) does not exhibit Poincaré recurrence on

D ∖ M_{e}

. That is, for some

x \in D ∖ M_{e}

, there exists a neighborhood

N \subset D ∖ M_{e}

such that for every

y \in N

,

y \notin ω (y)

.

Proof.

Suppose, ad absurdum, there exists

z \in D ∖ M_{e}

such that for every open neighborhood

N

containing x, there exists a point

y \in N

such that

y \in ω (y)

. Now, let

{t_{i}}_{i = 1}^{\infty}

be such that

t_{i} \to \infty

as

i \to \infty

and

s (t_{i}, y) \to y

as

i \to \infty

. Since

V (\cdot)

is continuous, it follows that

{lim}_{i \to \infty} V (s (t_{i}, y)) = V (y)

. However, since

V (s (\cdot, y))

is strictly increasing, it follows that

V (s (t_{i}, y)) > V (y)

,

i \in Z_{+}

, which is a contradiction. The proof for the case where

V (s (t, x_{0})), t \geq t_{0}

is strictly decreasing is identical. ☐

For the remainder of this section let

D_{c} \subseteq D

be a closed invariant set with respect to the nonlinear dynamical system given by Equation (14). The following definition for convergence is needed.

Definition 4.3

The nonlinear dynamical system given by Equation (14) is convergent with respect to

D_{c}

if

{lim}_{t \to \infty} s (t, x)

exists for every

x \in D_{c}

.

If the system given by Equation (14) is convergent with respect to

D_{c}

, then the ω-limit set

ω (x)

of Equation (14) for the trajectory

s^{x} (t)

starting at

x \in D_{c}

is a singleton. Furthermore, it follows from continuity of solutions that for every

h \geq 0

,

s_{h} (ω (x)) ≜ {lim}_{t \to \infty} s (t + h, x) = ω (x)

. Thus,

{\frac{d s_{h} (ω (x))}{d h}|}_{h = 0} = 0

and hence

ω (x)

is an equilibrium point of Equation (14) for all

x \in D_{c}

. The next result relates the continuity of the function

ω (\cdot)

at a point x to the stability of the equilibrium point

ω (x)

.

Proposition 4.1

Suppose the nonlinear dynamical system given by Equation (14) is convergent with respect to

D_{c}

. If

ω (x)

is a Lyapunov stable equilibrium point for some

x \in D_{c}

, then

ω : D_{c} \to D_{c}

is continuous at x.

Proof.

A proof of this result appears in [32]. For completeness of exposition, we provide an alternative proof here. Suppose

ω (x)

is Lyapunov stable for some

x \in D_{c}

, and let

N_{ε}

be an open neighborhood of

ω (x)

. Moreover, choose open neighborhoods

N

and

N_{δ}

of

ω (x)

such that

\bar{N} \subset N_{ε}

and

s_{t} (N_{δ}) \subseteq N

for all

t \geq t_{0}

, and let

{x_{i}}_{n = 1}^{\infty}

be a sequence in

D_{c}

converging to x. The existence of such neighborhoods follows from the Lyapunov stability of

ω (x)

. Next, there exists

h > 0

such that

s (h, x) \in N_{δ}

and, since the solutions to Equation (14) are continuously dependent on the system initial conditions, it follows that there exists an open neighborhood

N_{\hat{δ}} ≜ B_{\hat{δ}} (x)

,

\hat{δ} > 0

of x such that

s (h, y) \in N_{δ}

for all

y \in N_{\hat{δ}}

. Furthermore, it follows from the Lyapunov stability of

ω (x)

that

s (t + h, y) \in N

,

y \in N_{\hat{δ}}

,

t \geq 0

, and hence,

ω (y) \in \bar{N} \subset N_{ε}

,

y \in N_{\hat{δ}}

, which proves that

ω : D_{c} \to D_{c}

is continuous at x. ☐

The next result gives an alternative sufficient condition for the absence of Poincaré recurrence in a dynamical system.

Theorem 4.8

Consider the nonlinear dynamical system given by Equation (14). Assume that

D_{c} ∖ M_{e} \neq Ø

and assume Equation (14) is convergent and semistable in

D_{c}

. Then the nonlinear dynamical system given by Equation (14) does not exhibit Poincaré recurrence in

D_{c} ∖ M_{e}

. That is, for some

x \in D_{c} ∖ M_{e}

, there exists an open neighborhood

N \subset D_{c} ∖ M_{e}

such that for every

y \in N

the trajectory

s (t, y)

,

t \geq t_{0}

, does not return to

N

infinitely many times.

Proof.

Let

x \in D_{c} ∖ M_{e}

and let

ω (x) \in M_{e}

be a limiting point for the trajectory

s (t, x), t \geq t_{0}

, so that

{lim}_{t \to \infty} s (t, x) = ω (x)

. Since Equation (14) is convergent and semistable, it follows from Proposition 4.1 that

ω (x), x \in D_{c} ∖ M_{e}

, is continuous. Hence, for every

ε > 0

there exists

δ = δ (ε) > 0

such that

ω (y) \in B_{ε} (ω (x))

for all

y \in B_{δ} (x)

. Choose

ε > 0

and

δ > 0

such that

{\bar{B}}_{δ} (x) \cap {\bar{B}}_{ε} (ω (x)) = Ø

. Furthermore, choose

\hat{ε} > 0

to be sufficiently small such that

\begin{matrix} \bar{⋃_{y \in B_{δ} (x)} B_{\hat{ε}}} (ω (y)) \cap {\bar{B}}_{δ} (x) = Ø \end{matrix}

(29)

Since the dynamical system given by Equation (14) is convergent in

D_{c}

, it follows that for all

y \in B_{δ} (x)

and

\hat{ε} > 0

, there exists

T (\hat{ε}, y) > t_{0}

such that

s (t, y) \in B_{\hat{ε}} (ω (y))

for all

t > T (\hat{ε}, y)

. Moreover, it follows from Equation (29) that, for all

y \in B_{δ} (x)

,

s (t, y)

,

t \geq t_{0}

, does not return to

B_{δ} (x)

infinitely many times, which proves the result with

N = B_{δ} (x)

. ☐

5. Finite-Time Semistability of Nonlinear Dynamical Systems

The notion of semistability addressed in Section 2 implies convergence of the system trajectories to an equilibrium state over the infinite horizon. In physical thermodynamic systems, however, the dynamical system possesses the property that trajectories converge to a Lyapunov stable equilibrium in finite time rather than merely asymptotically. The key in achieving finite-time convergence versus asymptotic convergence of the system trajectories can be traced back to the structure of the thermodynamic system vector field characterizing energy flow between subsystem interconnections.

In particular, if the system vector field is Lipschitz continuous, which implies uniqueness of system solutions in forward and backward times, then convergence to an equilibrium state is achieved over an infinite time interval. Alternatively, in order to achieve convergence in finite time, the system dynamics need to be non-Lipschitzian giving rise to non-uniqueness of solutions in backward time. Uniqueness of solutions in forward time, however, can be preserved in the case of finite-time convergence. Sufficient conditions that ensure uniqueness of solutions in forward time in the absence of Lipschitz continuity are given in [33,34]. In addition, it is shown in ([35], Theorem 4.3, p. 59) that uniqueness of solutions in forward time along with continuity of the system dynamics ensure that the system solutions are continuous functions of the system initial conditions even when the dynamics are not Lipschitz continuous.

In this section, we merge the theories of semistability and finite-time stability developed in [20,21,22] to allow us to develop a rigorous framework for finite-time thermodynamics. First, we present the notions of finite-time convergence and finite-time semistability for nonlinear dynamical systems, and develop several sufficient Lyapunov stability theorems for finite-time semistability. Following [36], we exploit homogeneity as a means for verifying finite-time convergence. Our main result in this direction asserts that a homogeneous system is finite-time semistable if and only if it is semistable and has a negative degree of homogeneity. This main result depends on a converse Lyapunov result for homogeneous semistable systems, which we develop. While our converse result resembles a related result for asymptotically stable systems given in [36,37], the proof of our result is rendered more difficult by the fact that it does not hold under the notions of homogeneity considered in [36,37].

More specifically, while previous treatments of homogeneity involved Euler vector fields representing asymptotically stable dynamics, our results involve homogeneity with respect to a semi-Euler vector field representing a semistable system having the same equilibria as the dynamics of interest. Consequently, our theory precludes the use of dilations commonly used in the literature on homogeneous systems (such as [37]), and requires us to adopt a more geometric description of homogeneity (see [36] and references therein).

In this section, we consider nonlinear dynamical systems of the form

\begin{matrix} \dot{x} (t) = f (x (t)), x (0) = x_{0}, t \in I_{x_{0}} \end{matrix}

(30)

where

x (t) \in D \subseteq {\bar{R}}_{+}^{n}

,

t \in I_{x_{0}}

, is the system state vector,

D

is a relatively open set with respect to

{\bar{R}}_{+}^{n}

,

f : D \to R^{n}

is continuous and essentially nonnegative on

D

, that is,

f_{i} (x) \geq 0

for all

i = 1, \dots, n

and

x \in {\bar{R}}_{+}^{n}

, such that

x_{i} = 0

,

f^{- 1} (0) ≜ {x \in D : f (x) = 0}

is nonempty, and

I_{x_{0}} = [0, τ_{x_{0}})

,

0 \leq τ_{x_{0}} \leq \infty

, is the maximal interval of existence for the solution

x (\cdot)

of Equation (30). The continuity of f implies that, for every

x_{0} \in D

, there exist

τ_{0} < 0 < τ_{1}

and a solution

x (\cdot)

of Equation (30) defined on

(τ_{0}, τ_{1})

such that

x (0) = x_{0}

. A solution x is said to be right maximally defined if x cannot be extended on the right (either uniquely or non-uniquely) to a solution of Equation (30). Here, we assume that for every initial condition

x_{0} \in D

, Equation (30) has a unique right maximally defined solution, and this unique solution is defined on

[0, \infty)

.

Under these assumptions, the solutions of Equation (30) define a continuous global semiflow on

D

, that is,

s : [0, \infty) \times D \to D

is a jointly continuous function satisfying the consistency property

s (0, x) = x

and the semigroup property

s (t, s (τ, x)) = s (t + τ, x)

for every

x \in D

and

t, τ \in [0, \infty)

. Furthermore, we assume that for every initial condition

x_{0} \in D ∖ f^{- 1} (0)

, Equation (30) has a local unique solution for negative time. The image of

U \subset D

under the flow

s_{t}

is defined as

s_{t} (U) ≜ {y : y = s_{t} (x_{0}) for

all

x_{0} \in U}

. Finally, a set

E \subseteq {\bar{R}}_{+}^{n}

is connected if and only if every pair of open sets

U_{i} \subseteq {\bar{R}}_{+}^{n}

,

i = 1, 2

, satisfying

E \subseteq U_{1} \cup U_{2}

and

U_{i} \cap E \neq Ø

,

i = 1, 2

, has a nonempty intersection. A connected component of the set

E \subseteq {\bar{R}}_{+}^{n}

is a connected subset of

E

that is not properly contained in any connected subset of

E

.

Next, we establish the notion of finite-time semistability and develop sufficient Lyapunov stability theorems for finite-time semistability.

Definition 5.1

An equilibrium point

x_{e} \in f^{- 1} (0)

of Equation (30) is said to be finite-time semistable if there exist a relatively open neighborhood

Q \subseteq D

of

x_{e}

and a function

T : Q ∖ f^{- 1} (0) \to (0, \infty)

, called the settling-time function, such that the following statements hold:

(i): For every $x \in Q ∖ f^{- 1} (0)$ , $s (t, x) \in Q ∖ f^{- 1} (0)$ for all $t \in [0, T (x))$ , and ${lim}_{t \to T (x)} s (t, x)$ exists and is contained in $Q \cap f^{- 1} (0)$ .
(ii): $x_{e}$ is semistable.

An equilibrium point

x_{e} \in f^{- 1} (0)

of Equation (30) is said to be globally finite-time semistable if it is finite-time semistable with

D = Q = {\bar{R}}_{+}^{n}

. The system given by Equation (30) is said to be finite-time semistable if every equilibrium point in

f^{- 1} (0)

is finite-time semistable. Finally, Equation (30) is said to be globally finite-time semistable if every equilibrium point in

f^{- 1} (0)

is globally finite-time semistable.

It is easy to see from Definition 5.1 that, for all

x \in Q

,

\begin{matrix} T (x) = inf {t \in {\bar{R}}_{+} : f (s (t, x)) = 0} \end{matrix}

(31)

where

T (Q \cap f^{- 1} (0)) = {0}

.

Lemma 5.1

Suppose Equation (30) is finite-time semistable. Let

x_{e} \in f^{- 1} (0)

be an equilibrium point of Equation (30) and let

Q \subseteq D

be as in Definition 5.1. Furthermore, let

T : Q \to {\bar{R}}_{+}

be the settling-time function. Then T is continuous on

Q

if and only if T is continuous at each

z_{e} \in Q \cap f^{- 1} (0)

.

Proof.

Necessity is immediate. To prove sufficiency, suppose that T is continuous at each

z_{e} \in Q \cap f^{- 1} (0)

. Let

z \in Q ∖ f^{- 1} (0)

and consider a sequence

{z_{m}}_{m = 1}^{\infty}

in

Q

that converges to z. Let

τ^{-} = {lim inf}_{m \to \infty} T (z_{m})

and

τ^{+} = {lim sup}_{m \to \infty} T (z_{m})

. Note that both

τ^{-}

and

τ^{+}

are in

{\bar{R}}_{+}

and

\begin{matrix} τ^{-} \leq τ^{+} \end{matrix}

(32)

Next, let

{z_{l}^{+}}_{l = 1}^{\infty}

be a subsequence of

{z_{m}}_{m = 1}^{\infty}

such that

T (z_{l}^{+}) \to τ^{+}

as

l \to \infty

. The sequence

{(T (z), z_{l}^{+})}_{l = 1}^{\infty}

converges in

R_{+} \times Q

to

(T (z), z)

. By continuity and

\begin{matrix} s (T (x) + t, x) = s (T (x), x) \end{matrix}

(33)

for all

x \in Q

and

t \in R_{+}

,

s (T (z), z_{l}^{+}) \to s (T (z), z) = z_{e}

as

l \to \infty

, where

z_{e} \in Q \cap f^{- 1} (0)

. Since T is assumed to be continuous at each

z_{e} \in Q \cap f^{- 1} (0)

,

T (s (T (z), z_{l}^{+})) \to T (z_{e}) = 0

as

l \to \infty

. Note that

\begin{matrix} T (s (t, x)) = max {T (x) - t, 0} \end{matrix}

(34)

for all

x \in Q

and

t \in R_{+}

. Using Equation (34) with

t = T (z)

and

x = z_{l}^{+}

, we obtain

max {T (z_{l}^{+}) - T (z), 0} \to 0

as

l \to \infty

. Hence,

max {τ^{+} - T (z), 0} = 0

, that is,

\begin{matrix} τ^{+} \leq T (z) \end{matrix}

(35)

Now, let

{z_{l}^{-}}_{l = 1}^{\infty}

be a subsequence of

{z_{m}}_{m = 1}^{\infty}

such that

T (z_{l}^{-}) \to τ^{-}

as

l \to \infty

. It follows from Equations (32) and (35) that

τ^{-} \in R_{+}

. Therefore, the sequence

{(T (z_{l}^{-}), z_{l}^{-})}_{l = 1}^{\infty}

converges in

R_{+} \times Q

to

(τ^{-}, z)

. Since s is continuous, it follows that

s (T (z_{l}^{-}), z_{l}^{-}) \to s (τ^{-}, z)

as

l \to \infty

. Equation (33) implies that

s (T (z_{l}^{-}), z_{l}^{-}) \in Q \cap f^{- 1} (0)

for each l. Hence,

s (τ^{-}, z) = z_{e}

,

z_{e} \in Q \cap f^{- 1} (0)

and, by Equation (31),

\begin{matrix} T (z) \leq τ^{-} \end{matrix}

(36)

It follows from Equations (32), (35), and (36) that

τ^{-} = τ^{+} = T (z)

, and hence,

T (z_{m}) \to T (z)

as

m \to \infty

. ☐

Next, we introduce a new definition which is weaker than finite-time semistability and is needed for the next result.

Definition 5.2

The system given by Equation (30) is said to be finite-time convergent to

M \subseteq f^{- 1} (0)

for

D_{0} \subseteq D

if, for every

x_{0} \in D_{0}

, there exists a finite-time

T = T (x_{0}) > 0

such that

x (t) \in M

for all

t \geq T

.

The next result gives a sufficient condition for characterizing finite-time convergence. For the statement of this result, define

\begin{matrix} \dot{V} (x) ≜ lim_{h \to 0^{+}} \frac{1}{h} [V (s (h, x)) - V (x)], x \in D \end{matrix}

(37)

for a given continuous function

V : D \to

and for every

x \in D

such that the limit in Equation (37) exists.

Proposition 5.1

Let

D_{0} \subseteq D

be positively invariant and

M \subseteq f^{- 1} (0)

. Assume that there exists a continuous function

V : D_{0} \to R

such that

\dot{V} (\cdot)

is defined everywhere on

D_{0}

,

V (x) = 0

if and only if

x \in M \subset D_{0}

, and

\begin{matrix} - c_{1} {| V (x) |}^{α} \leq \dot{V} (x) \leq - c_{2} {| V (x) |}^{α}, x \in D_{0} ∖ M \end{matrix}

(38)

where

c_{1} \geq c_{2} > 0

and

0 < α < 1

. Then Equation (30) is finite-time convergent to

M

for

{x \in D_{0} : V (x) \geq 0}

. Alternatively, if V is nonnegative and

\begin{matrix} \dot{V} (x) \leq - c_{3} {(V (x))}^{α}, x \in D_{0} ∖ M \end{matrix}

(39)

where

c_{3} > 0

, then Equation (30) is finite-time convergent to

M

for

D_{0}

.

Proof.

Note that Equation (38) is also true for

x \in M

. Application of the comparison lemma (Theorems 4.1 and 4.2 of [34]) to Equation (38) yields

μ (t, V (x), c_{1}) \leq V (s (t, x)) \leq μ (t, V (x), c_{2})

,

x \in {z \in D_{0} : V (z) \geq 0}

, where μ is given by

\begin{matrix} μ (t, z, c) ≜ \{\begin{matrix} {(| z |}^{1 - α} {- c (1 - α) t)}^{\frac{1}{1 - α}}, & 0 \leq t < \frac{{| z |}^{1 - α}}{c (1 - α)}, α < 1 \\ 0, & t \geq \frac{{| z |}^{1 - α}}{c (1 - α)}, α < 1 \end{matrix} \end{matrix}

(40)

Hence,

V (s (t, x)) = 0

for

t \geq \frac{{| V (x) |}^{1 - α}}{c_{2} (1 - α)}

, which implies that

s (t, x) \in M

for

t \geq \frac{{| V (x) |}^{1 - α}}{c_{2} (1 - α)}

. The assertion follows. The second part of the assertion can be proved similarly. ☐

The next result establishes a relationship between finite-time convergence and finite-time semistability.

Theorem 5.1

Assume that there exists a continuous nonnegative function

V : D \to {\bar{R}}_{+}

such that

\dot{V} (\cdot)

is defined everywhere on

D

,

V^{- 1} (0) = f^{- 1} (0)

, and there exists a relatively open neighborhood

Q \subseteq D

such that

Q \cap f^{- 1} (0)

is nonempty and

\begin{matrix} \dot{V} (x) \leq w (V (x)), x \in Q ∖ f^{- 1} (0) \end{matrix}

(41)

where

w : {\bar{R}}_{+} \to R

is continuous,

w (0) = 0

, and

\begin{matrix} \dot{z} (t) = w (z (t)), z (0) = z_{0} \in {\bar{R}}_{+}, t \geq 0 \end{matrix}

(42)

has a unique solution in forward time. If Equation (42) is finite-time convergent to the origin for

{\bar{R}}_{+}

and every point in

Q \cap f^{- 1} (0)

is a Lyapunov stable equilibrium point of Equation (30), then every point in

Q \cap f^{- 1} (0)

is finite-time semistable. Moreover, the settling-time function of Equation (30) is continuous on a relatively open neighborhood of

Q \cap f^{- 1} (0)

. Finally, if

Q = D

, then Equation (30) is finite-time semistable.

Proof.

Consider

x_{e} \in Q \cap f^{- 1} (0)

. Since

x (t) \equiv x_{e}

is Lyapunov stable, it follows that there exists a relatively open positively invariant set

S \subseteq Q

containing

x_{e}

. Next, it follows from Equation (41) that

\begin{matrix} \dot{V} (s (t, x)) \leq w (V (s (t, x))), x \in S, t \geq 0 \end{matrix}

(43)

Now, application of the comparison lemma (Theorem 4.1 of [34]) to the inequality Equation (43) with the comparison system given by Equation (42) yields

\begin{matrix} V (s (t, x)) \leq ψ (t, V (x)), t \geq 0, x \in S \end{matrix}

(44)

where

ψ : [0, \infty) \times R \to R

is the global semiflow of Equation (42). Since Equation (42) is finite-time convergent to the origin for

{\bar{R}}_{+}

, it follows from Equation (44) and the nonnegativity of

V (\cdot)

that

\begin{matrix} V (s (t, x)) = 0, t \geq \hat{T} (V (x)), x \in S \end{matrix}

(45)

where

\hat{T} (\cdot)

denotes the settling-time function of Equation (42).

Next, since

s (0, x) = x

,

s (\cdot, \cdot)

is jointly continuous, and

V (s (t, x)) = 0

is equivalent to

f (s (t, x)) = 0

on

S

, it follows that

inf {t \in {\bar{R}}_{+} : f (s (t, x)) = 0} > 0

for

x \in S ∖ f^{- 1} (0)

. Furthermore, it follows from Equation (45) that

inf {t \in {\bar{R}}_{+} : f (s (t, x)) = 0} < \infty

for

x \in S

. Define

T : S ∖ f^{- 1} (0) \to {\bar{R}}_{+}

by

T (x) = inf {t \in {\bar{R}}_{+} : f (s (t, x)) = 0}

. Then it follows that every point in

S \cap f^{- 1} (0)

is finite-time semistable and T is the settling-time function on

S

. Furthermore, it follows from Equation (45) that

T (x) \leq \hat{T} (V (x))

,

x \in S

. Since the settling-time function of a one-dimensional finite-time stable system is continuous at the equilibrium, it follows that T is continuous at each point in

S \cap f^{- 1} (0)

. Since

x_{e} \in Q \cap f^{- 1} (0)

was chosen arbitrarily, it follows that every point in

Q \cap f^{- 1} (0)

is finite-time semistable, while Lemma 5.1 implies that T is continuous on a relatively open neighborhood of

Q \cap f^{- 1} (0)

.

The last statement follows by noting that, if

Q = D

, then

Q

is positively invariant by our assumptions on Equation (30), and hence, the preceding arguments hold with

S = Q

. ☐

Theorem 5.2

Assume that there exists a continuous nonnegative function

V : D \to {\bar{R}}_{+}

such that

\dot{V} (\cdot)

is defined everywhere on

D

,

V^{- 1} (0) = f^{- 1} (0)

, and there exists a relatively open neighborhood

Q \subseteq D

such that

Q \cap f^{- 1} (0)

is nonempty and Equation (39) holds for all

x \in Q ∖ f^{- 1} (0)

. Furthermore, assume that there exists a continuous nonnegative function

W : Q \to {\bar{R}}_{+}

such that

\dot{W} (\cdot)

is defined everywhere on

Q

,

W^{- 1} (0) = Q \cap f^{- 1} (0)

, and

\begin{matrix} ∥ f (x) ∥ \leq - c_{0} \dot{W} (x), x \in Q ∖ f^{- 1} (0) \end{matrix}

(46)

where

c_{0} > 0

. Then every point in

Q \cap f^{- 1} (0)

is finite-time semistable.

Proof.

For any

x_{e} \in Q \cap f^{- 1} (0)

, since

W (x) \geq 0 = W (x_{e})

for all

x \in Q

, it follows from (i) of Theorem 5.2 of [20] that

x_{e}

is a Lyapunov stable equilibrium and, hence, every point in

Q \cap f^{- 1} (0)

is Lyapunov stable. Now, it follows from the second assertion of Proposition 5.1 and Theorem 5.1, with

w (x) = - c_{3} sgn (x) {| x |}^{α}

, that every point in

Q \cap f^{- 1} (0)

is finite-time semistable. ☐

6. Homogeneity and Finite-Time Semistability

In this section, we develop necessary and sufficient conditions for finite-time semistability of homogeneous dynamical systems. In the sequel, we will need to consider a complete vector field ν on

{\bar{R}}_{+}^{n}

such that the solutions of the differential equation

\dot{y} (t) = ν (y (t))

define a continuous global flow

ψ : R \times {\bar{R}}_{+}^{n} \to {\bar{R}}_{+}^{n}

on

{\bar{R}}_{+}^{n}

, where

ν^{- 1} (0) = f^{- 1} (0)

. For each

τ \in R

, the map

ψ_{τ} (\cdot) = ψ (τ, \cdot)

is a homeomorphism and

ψ_{τ}^{- 1} = ψ_{- τ}

. We define a function

V : {\bar{R}}_{+}^{n} \to R

to be homogeneous of degree

l \in R

with respect to ν if and only if

(V \circ ψ_{τ}) (x) = e^{l τ} V (x)

,

τ \in R

,

x \in {\bar{R}}_{+}^{n}

. Our assumptions imply that every connected component of

{\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

is invariant under ν. The Lie derivative of a continuous function

V : {\bar{R}}_{+}^{n} \to R

with respect to ν is given by

L_{ν} V (x) ≜ {lim}_{t \to 0^{+}} \frac{1}{t} [V (ψ (t, x)) - V (x)]

, whenever the limit on the right-hand side exists. If V is a continuous homogeneous function of degree

l > 0

, then

L_{ν} V

is defined everywhere and satisfies

L_{ν} V = l V

. We assume that the vector field ν is a semi-Euler vector field, that is, the dynamical system

\begin{matrix} \dot{y} (t) = - ν (y (t)), y (0) = y_{0}, t \geq 0 \end{matrix}

(47)

is globally semistable with respect to

{\bar{R}}_{+}^{n}

. Thus, for each

x \in {\bar{R}}_{+}^{n}

,

{lim}_{τ \to \infty} ψ (- τ, x) = x^{*} \in ν^{- 1} (0)

, and for each

x_{e} \in ν^{- 1} (0)

, there exists

z \in {\bar{R}}_{+}^{n}

such that

x_{e} = {lim}_{τ \to \infty} ψ (- τ, z)

. Finally, we say that the vector field f is homogeneous of degree

k \in R

with respect to ν if and only if

ν^{- 1} (0) = f^{- 1} (0)

and, for every

t \in {\bar{R}}_{+}

and

τ \in R

,

\begin{matrix} s_{t} \circ ψ_{τ} = ψ_{τ} \circ s_{e^{k τ} t} \end{matrix}

(48)

Note that if

V : {\bar{R}}_{+}^{n} \to R

is a homogeneous function of degree l such that

L_{f} V (x)

is defined everywhere, then

L_{f} V (x)

is a homogeneous function of degree

l + k

[37,38]. Finally, note that if ν and f are continuously differentiable in a neighborhood of

x \in {\bar{R}}_{+}^{n}

, then Equation (48) holds at x for sufficiently small t and τ if and only if

[ν, f] (x) = k f (x)

in a neighborhood of

x \in {\bar{R}}_{+}^{n}

, where the Lie bracket

[ν, f]

of ν and f can be computed by using

[ν, f] = \frac{\partial f}{\partial x} ν - \frac{\partial ν}{\partial x} f

.

The following lemmas are needed for the main results of this section.

Lemma 6.1

Consider the dynamical system given by Equation (47). Let

D_{c} \subset {\bar{R}}_{+}^{n}

be a relatively compact set satisfying

D_{c} \cap ν^{- 1} (0) = Ø

. Then for every relatively open set

Q

satisfying

ν^{- 1} (0) \subset Q

, there exist

τ_{1}, τ_{2} > 0

such that

ψ_{- t} (D_{c}) \subset Q

for all

t > τ_{1}

and

ψ_{τ} (D_{c}) \cap Q = Ø

for all

τ > τ_{2}

.

Proof.

Let

Q

be a relatively open neighborhood of

ν^{- 1} (0)

with respect to

{\bar{R}}_{+}^{n}

. Since every

z \in ν^{- 1} (0)

is Lyapunov stable under ν, it follows that there exists a relatively open neighborhood

V_{z}

containing z such that

ψ_{- t} (V_{z}) \subseteq Q

for all

t \geq 0

. Hence,

V ≜ ⋃_{z \in ν^{- 1} (0)} V_{z}

is relatively open and

ψ_{- t} (V) \subseteq Q

for all

t \geq 0

. Next, consider the collection of nested sets

{D_{t}}_{t > 0}

, where

D_{t} = {x \in D_{c} : ψ_{h} (x) \notin V, h \in [- t, 0]} = D_{c} \cap ({\bar{R}}_{+}^{n} ∖ (⋃_{h \in [- t, 0]} ψ_{h}^{- 1} (V)))

,

t > 0

. For each

t > 0

,

D_{t}

is a relatively compact set. Therefore, if

D_{t}

is nonempty for each

t > 0

, then there exists

x \in ⋂_{t > 0} D_{t}

, that is, there exists

x \in D_{c}

such that

ψ_{- t} (x) \notin V

for all

t > 0

, which contradicts the fact that the domain of semistability [39] of Equation (47) is

{\bar{R}}_{+}^{n}

. Hence, there exists

τ > 0

such that

D_{τ} = Ø

, that is,

D_{c} \subset ⋃_{h \in [- τ, 0]} ψ_{h}^{- 1} (V)

. Therefore, for every

t > τ

,

ψ_{- t} (D_{c}) \subset ⋃_{h \in [- τ, 0]} ψ_{- t} (ψ_{h}^{- 1} (V)) = ⋃_{h \in [- τ, 0]} ψ_{- t - h} (V) \subseteq Q

. The second conclusion follows using similar arguments as above. ☐

Lemma 6.2

Suppose

f : {\bar{R}}_{+}^{n} \to R^{n}

is homogeneous of degree

k \in R

with respect to ν and Equation (30) is (locally) semistable. Then the domain of semistability of Equation (30) is

{\bar{R}}_{+}^{n}

.

Proof.

Let

A \subseteq {\bar{R}}_{+}^{n}

be the domain of semistability [39] and

x \in {\bar{R}}_{+}^{n}

. Note that

A

is a relatively open neighborhood of

ν^{- 1} (0)

with respect to

{\bar{R}}_{+}^{n}

. Since every point in

ν^{- 1} (0)

is a globally semistable equilibrium under

- ν

with respect to

{\bar{R}}_{+}^{n}

, there exists

τ > 0

such that

z = ψ_{- τ} (x) \in A

. Then it follows from Equation (48) that

s (t, x) = s (t, ψ_{τ} (z)) = ψ_{τ} (s (e^{k τ} t, z))

. Since

{lim}_{t \to \infty} s (t, z) = x^{*} \in f^{- 1} (0)

, it follows that

{lim}_{t \to \infty} s (t, x) = {lim}_{t \to \infty} ψ_{τ} (s (e^{k τ} t, z)) = ψ_{τ} ({lim}_{t \to \infty} s (e^{k τ} t, z)) = ψ_{τ} (x^{*}) = x^{*}

, which implies that

x \in A

. Since

x \in {\bar{R}}_{+}^{n}

is arbitrary,

A = {\bar{R}}_{+}^{n}

. ☐

The following theorem presents a converse Lyapunov result for homogenous semistable systems.

Theorem 6.1

Suppose

f : {\bar{R}}_{+}^{n} \to R^{n}

is homogeneous of degree

k \in R

with respect to ν and Equation (30) is semistable. Then for every

l > max {- k, 0}

, there exists a continuous nonnegative function

V : {\bar{R}}_{+}^{n} \to {\bar{R}}_{+}

that is homogeneous of degree l with respect to ν, continuously differentiable on

{\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

, and satisfies

V^{- 1} (0) = f^{- 1} (0)

,

V^{'} (x) f (x) < 0

,

x \in {\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

, and for each

x_{e} \in f^{- 1} (0)

and each bounded, relatively open neighborhood

D_{0}

containing

x_{e}

with respect to

{\bar{R}}_{+}^{n}

, there exist

c_{1} = c_{1} (D_{0}) \geq c_{2} = c_{2} (D_{0}) > 0

such that

\begin{matrix} - c_{1} {[V (x)]}^{\frac{l + k}{l}} \leq V^{'} (x) f (x) \leq - c_{2} {[V (x)]}^{\frac{l + k}{l}}, x \in D_{0} \end{matrix}

(49)

Proof.

Choose

l > max {- k, 0}

. First, we prove that there exists a continuous Lyapunov function V on

{\bar{R}}_{+}^{n}

that is homogeneous of degree l with respect to ν, continuously differentiable on

{\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

, and

V^{'} (x) f (x) < 0

for

x \in {\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

. Choose any nondecreasing smooth function

g : {\bar{R}}_{+} \to [0, 1]

such that

g (s) = 0

for

s \leq a

,

g (s) = 1

for

s \geq b

, and

g^{'} (s) > 0

on

(a, b)

, where

0 < a < b

are constants. It follows from Theorem 4.21 of [40] and Lemma 6.2 that there exists a continuously differentiable Lyapunov function

U (\cdot)

on

{\bar{R}}_{+}^{n}

satisfying all of the properties in Theorem 4.21 of [40].

Next, define

\begin{matrix} V (x) ≜ \int_{- \infty}^{+ \infty} e^{- l τ} g (U (ψ (τ, x))) d τ, x \in {\bar{R}}_{+}^{n} \end{matrix}

(50)

Let

Q

be a bounded, relatively open set satisfying

\bar{Q} \cap f^{- 1} (0) = Ø

. Since every point in

ν^{- 1} (0)

is a globally semistable equilibrium point under

- ν

with respect to

{\bar{R}}_{+}^{n}

, it follows that for each

x \in \bar{Q}

,

{lim}_{τ \to + \infty} U (ψ (τ, x)) = + \infty

and

{lim}_{τ \to + \infty} U (ψ (- τ, x)) = 0

. Now, it follows from Lemma 6.1 that there exist time instants

τ_{1} < τ_{2}

such that for each

x \in \bar{Q}

,

U (ψ (τ, x)) \leq a

for all

τ \leq τ_{1}

and

U (ψ (τ, x)) \geq b

for all

τ \geq τ_{2}

. Hence,

\begin{matrix} V (x) = \int_{τ_{1}}^{τ_{2}} e^{- l τ} g (U (ψ (τ, x))) d τ + \frac{e^{- l τ_{2}}}{l}, x \in Q \end{matrix}

(51)

which implies that V is well defined, positive, and continuously differentiable on

Q

.

Next, since

U (\cdot)

satisfies (i) and (ii) of Theorem 4.21 of [40] it follows from Equations (50) and (51) that

V^{- 1} (0) = f^{- 1} (0)

. Since for any

σ \in R

and

x \in {\bar{R}}_{+}^{n}

,

\begin{matrix} V (ψ (σ, x)) = \int_{- \infty}^{+ \infty} e^{- l τ} g (U (ψ (τ + σ, x))) d τ = e^{l σ} V (x) \end{matrix}

(52)

by definition, V is homogeneous of degree l. In addition, it follows from Equations (48) and (51) that

\begin{matrix} \begin{matrix} V^{'} (x) f (x) & = \int_{τ_{1}}^{τ_{2}} e^{- l τ} g^{'} (U (ψ (τ, x))) \frac{d}{d t} U (s (e^{- k τ} t, ψ (τ, x))) |_{t = 0} d τ \\ = \int_{τ_{1}}^{τ_{2}} e^{- (l + k) τ} g^{'} (U (ψ (τ, x))) U^{'} (ψ (τ, x)) f (ψ (τ, x)) d τ \\ < 0, x \in Q \end{matrix} \end{matrix}

(53)

which implies that

V^{'} f

is negative and continuous on

Q

. Now, since

Q

is arbitrary, it follows that V is well defined and continuously differentiable, and

V^{'} f

is negative and continuous on

{\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

.

Next, to show continuity at points in

f^{- 1} (0)

, we define

T : {\bar{R}}_{+}^{n} ∖ f^{- 1} (0) \to R

by

T (x) = sup {t \in R : U (ψ (τ, x)) \leq a for all τ \leq t}

, and note that the continuity of U implies that

U (ψ (T (x), x)) = a

for all

x \in {\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

. Let

x_{e} \in f^{- 1} (0)

, and consider a sequence

{x_{k}}_{k = 1}^{\infty}

in

{\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

converging to

x_{e}

. We claim that the sequence

{T (x_{k})}_{k = 1}^{\infty}

has no bounded subsequence so that

{lim}_{k \to \infty} T (x_{k}) = \infty

. To prove our claim by contradiction, suppose, ad absurdum, that

{T (x_{k_{i}})}_{i = 1}^{\infty}

is a bounded subsequence. Without loss of generality, we may assume that the sequence

{T (x_{k_{i}})}_{i = 1}^{\infty}

converges to

h \in R

. Then, by joint continuity of ψ,

{lim}_{i \to \infty} ψ (T (x_{k_{i}}), x_{k_{i}}) = ψ (h, x_{e}) = x_{e}

, so that

{lim}_{i \to \infty} U (ψ (T (x_{k_{i}}), x_{k_{i}})) = U (x_{e}) = 0

. However, this contradicts our observation above that

U (ψ (T (x), x)) = a

for all

x \in {\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

. The contradiction leads us to conclude that

{lim}_{k \to \infty} T (x_{k}) = \infty

. Now, for each

k = 1, 2, \dots,

it follows that

V (x_{k}) = \int_{T (x_{k})}^{\infty} e^{- l τ} g (U (ψ (τ, x_{k}))) d τ \leq \int_{T (x_{k})}^{\infty} e^{- l τ} d τ = l^{- 1} e^{- l T (x_{k})}

so that

{lim}_{k \to \infty} V (x_{k}) = 0 = V (x_{e})

. Since

x_{e}

was chosen arbitrarily, it follows that V is continuous at every

x_{e} \in f^{- 1} (0)

.

To show that V possesses the last property, let

x_{e} \in f^{- 1} (0)

, and choose a bounded, relatively open neighborhood

D_{0}

of

x_{e}

with respect to

{\bar{R}}_{+}^{n}

. Let

W = ψ (R_{+} \times D_{0})

. For every

ε > 0

, denote

W_{ε} = W \cap V^{- 1} (ε)

. For every

ε > 0

, define the continuous map

τ_{ε} : {\bar{R}}_{+}^{n} ∖ f^{- 1} (0) \to R

by

τ_{ε} (x) ≜ l^{- 1} ln (ε / V (x))

, and note that, for every

x \in {\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

,

ψ (t, x) \in V^{- 1} (ε)

if and only if

t = τ_{ε} (x)

. Next, define

β_{ε} : {\bar{R}}_{+}^{n} ∖ f^{- 1} (0) \to {\bar{R}}_{+}^{n}

by

β_{ε} ≜ ψ (τ_{ε} (x), x)

. Note that, for every

ε > 0

,

β_{ε}

is continuous, and

β_{ε} (x) \in V^{- 1} (ε)

for every

x \in {\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

.

Consider

ε > 0

.

W_{ε}

is the union of the images of connected components of

D_{0} ∖ f^{- 1} (0)

under the continuous map

β_{ε}

. Since every connected component of

{\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

is invariant under ν, it follows that the image of each connected component

Q

of

{\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

under

β_{ε}

is contained in

Q

itself. In particular, the images of connected components of

D_{0} ∖ f^{- 1} (0)

under

β_{ε}

are all disjoint. Thus, each connected component of

W_{ε}

is the image of exactly one connected component of

D_{0} ∖ f^{- 1} (0)

under

β_{ε}

. Finally, if ε is small enough so that

V^{- 1} (ε) \cap D_{0}

is nonempty, then

V^{- 1} (ε) \cap D_{0} \subseteq W_{ε}

, and hence, every connected component of

W_{ε}

has a nonempty intersection with

D_{0} ∖ f^{- 1} (0)

.

We claim that

W_{ε}

is bounded for every

ε > 0

. It is easy to verify that, for every

ε_{1}, ε_{2} \in (0, \infty)

,

W_{ε_{2}} = ψ_{h} (W_{ε_{1}})

with

h = l^{- 1} ln (ε_{2} / ε_{1})

. Hence, it suffices to prove that there exists

ε > 0

such that

W_{ε}

is bounded. To arrive at a contradiction, suppose, ad absurdum, that

W_{ε}

is unbounded for every

ε > 0

. Choose a bounded relatively open neighborhood

V

of

{\bar{D}}_{0}

and a sequence

{ε_{i}}_{i = 1}^{\infty}

in

(0, \infty)

converging to 0. By our assumption, for every

i = 1, 2, \dots

, at least one connected component of

W_{ε_{i}}

must contain a point in

{\bar{R}}_{+}^{n} ∖ V

. On the other hand, for i sufficiently large, every connected component of

W_{ε_{i}}

has a nonempty intersection with

D_{0} \subset V

. It follows that

W_{ε_{i}}

has a nonempty intersection with the boundary of

V

for every i sufficiently large. Hence, there exists a sequence

{x_{i}}_{i = 1}^{\infty}

in

D_{0}

, and a sequence

{t_{i}}_{i = 1}^{\infty}

in

(0, \infty)

such that

y_{i} ≜ ψ_{t_{i}} (x_{i}) \in V^{- 1} (ε_{i}) \cap \partial V

for every

i = 1, 2, \dots

. Since

V

is bounded, we can assume that the sequence

{y_{i}}_{i = 1}^{\infty}

converges to

y \in \partial V

. Continuity implies that

V (y) = {lim}_{i \to \infty} V (y_{i}) = {lim}_{i \to \infty} ε_{i} = 0

. Since

V^{- 1} (0) = f^{- 1} (0) = ν^{- 1} (0)

, it follows that y is Lyapunov stable under

- ν

. Since

y \notin {\bar{D}}_{0}

, there exists a relatively open neighborhood

Q

of y such that

Q \cap D_{0} = Ø

. The sequence

{y_{i}}_{i = 1}^{\infty}

converges to y while

ψ_{- t_{i}} (y_{i}) = x_{i} \in D_{0} \subset {\bar{R}}_{+}^{n} ∖ Q

, which contradicts Lyapunov stability. This contradiction implies that there exists

ε > 0

such that

W_{ε}

is bounded. It now follows that

W_{ε}

is bounded for every

ε > 0

.

Finally, consider

x \in D_{0} ∖ f^{- 1} (0)

. Choose

ε > 0

and note that

ψ_{τ_{ε} (x)} (x) \in W_{ε}

. Furthermore, note that

V^{'} (x) f (x) < 0

for all

x \in {\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

,

V^{'} (x) f (x)

is continuous on

{\bar{R}}_{+}^{n} ∖ f^{- 1} (0)

, and

{\bar{W}}_{ε} \cap f^{- 1} (0) = Ø

. Then, by homogeneity,

V (ψ_{τ_{ε} (x)} (x)) = ε

, and hence,

\begin{matrix} min_{z \in {\bar{W}}_{ε}} V^{'} (z) f (z) \leq V^{'} (ψ_{τ_{ε} (x)} (x)) f (ψ_{τ_{ε} (x)} (x)) \leq max_{z \in {\bar{W}}_{ε}} V^{'} (z) f (z) \end{matrix}

(54)

Since

V^{'} (ψ_{τ_{ε} (x)} (x)) f (ψ_{τ_{ε} (x)} (x))

is homogeneous of degree

l + k

, it follows that

\begin{matrix} V^{'} (ψ_{τ_{ε} (x)} (x)) f (ψ_{τ_{ε} (x)} (x)) = e^{(l + k) τ_{ε} (x)} V^{'} (x) f (x) = ε^{\frac{l + k}{l}} V {(x)}^{- \frac{l + k}{l}} V^{'} (x) f (x) \end{matrix}

Let

c_{1} ≜ - ε^{- \frac{l + k}{l}} {min}_{z \in {\bar{W}}_{ε}} V^{'} (z) f (z)

and

c_{2} ≜ - ε^{- \frac{l + k}{l}} {max}_{z \in {\bar{W}}_{ε}} V^{'} (z) f (z)

. Note that

c_{1}

and

c_{2}

are positive and well defined since

{\bar{W}}_{ε}

is compact. Hence, the theorem is proved. ☐

The following result represents the main application of homogeneity [36] to finite-time semistability.

Theorem 6.2

Suppose f is homogeneous of degree

k \in R

with respect to ν. Then Equation (30) is finite-time semistable if and only if Equation (30) is semistable and

k < 0

. In addition, if Equation (30) is finite-time semistable, then the settling-time function

T (\cdot)

is homogeneous of degree

- k

with respect to ν and

T (\cdot)

is continuous on

{\bar{R}}_{+}^{n}

.

Proof.

Since finite-time semistability implies semistability, it suffices to prove that if Equation (30) is semistable, then Equation (30) is finite-time semistable if and only if

k < 0

. Suppose Equation (30) is finite-time semistable and let

l > max {- k, 0}

. Then for each

x_{e} \in f^{- 1} (0)

, it follows from Theorem 6.1 that there exist a bounded, relatively open, and positively invariant set

S

containing

x_{e}

, and a continuous nonnegative function

V : S \to {\bar{R}}_{+}

that is homogeneous of degree

l + k

and is such that

V^{'} (x) f (x)

is continuous, negative on

S ∖ f^{- 1} (0)

, homogeneous of degree

l + k

, and Equation (49) holds. Now, ad absurdum, if

k \geq 0

and

x \in S ∖ f^{- 1} (0)

, then application of the comparison lemma (Theorem 4.2 in [34]) to the first inequality in Equation (49) yields

V (s (t, x)) \geq π (t, V (x))

, where π is given by

\begin{matrix} π (t, x) = \{\begin{matrix} sgn (x) {(\frac{1}{{| x |}^{α - 1}} + c_{1} (α - 1) t)}^{- \frac{1}{α - 1}}, & α > 1 \\ e^{- c_{1} t} x, & α = 1 \end{matrix} \end{matrix}

(55)

and where

sgn (x) ≜ x / | x |

,

x \neq 0

, and

sgn (0) ≜ 0

, with

α = 1 + k / l \geq 1

. Since, in this case,

π (t, V (x)) > 0

for all

t \geq 0

, we have

s (t, x) \notin S \cap f^{- 1} (0)

for every

t \geq 0

; that is,

x_{e}

is not a finite-time semistable equilibrium under f, which is a contradiction. Hence,

k < 0

.

Conversely, if

k < 0

, choose

x_{e} \in f^{- 1} (0)

and choose a relatively open neighborhood

D_{0}

of

x_{e}

such that Equation (50) holds. Next,

S_{x_{e}}

is chosen to be a bounded, positively invariant neighborhood of

x_{e}

contained in

D_{0}

. Then it follows from Theorem 6.1 that there exists a continuous nonnegative function

V (\cdot)

such that Equation (49) holds on

S_{x_{e}}

. Now, with

c = c_{2} > 0

,

0 < α = 1 + k / l < 1

,

D_{0} = S_{x_{e}}

, and

w (x) = - {c sgn (x) | x |}^{α}

, it follows from Proposition 5.1 and Theorem 5.1 that

x_{e}

is finite-time semistable on

S_{x_{e}}

. Define

S ≜ ⋃_{x_{e} \in f^{- 1} (0)} S_{x_{e}}

. Then

S

is a relatively open neighborhood of

f^{- 1} (0)

such that every solution in

S

converges in finite time to a Lyapunov stable equilibrium. Hence, Equation (30) is finite-time semistable. Lemma 6.2 then implies that Equation (30) is globally finite-time semistable, and

T (\cdot)

is defined on

{\bar{R}}_{+}^{n}

. By Proposition 5.1 with

D_{0} = S_{x_{e}}

, and Theorem 5.1, it follows that

T (\cdot)

is continuous on

S_{x_{e}}

. Next, since

x_{e} \in f^{- 1} (0)

was chosen arbitrarily, it follows from Lemma 5.1 that

T (\cdot)

is continuous on

{\bar{R}}_{+}^{n}

.

Finally, let

x \in {\bar{R}}_{+}^{n}

and note that, since every point in

ν^{- 1} (0) = f^{- 1} (0)

is a globally semistable equilibrium under

- ν

with respect to

{\bar{R}}_{+}^{n}

, there exists

τ > 0

such that

z ≜ ψ_{- τ} (x) \in S

. Then it follows from Equation (48) that

s (t, x) = s (t, ψ_{τ} (z)) = ψ_{τ} (s (e^{k τ} t, z))

, and hence,

f (s (t, x)) = 0

if and only if

f (s (e^{k τ} t, z)) = 0

. Now, it follows that for

x \in S

,

T (ψ_{- τ} (x)) = T (z) = e^{k τ} T (x)

. By definition, it follows that

T (\cdot)

is homogeneous of degree

- k

with respect to ν. ☐

In order to use Theorem 6.2 to prove finite-time semistability of a homogeneous system, a priori information of semistability for the system is needed, which is not easy to obtain. To overcome this, we need to develop some sufficient conditions to establish finite-time semistability. Recall that a function

V : {\bar{R}}_{+}^{n} \to R

is said to be weakly proper if and only if for every

c \in R

, every connected component of the set

{x \in {\bar{R}}_{+}^{n} : V (x) \leq c} = V^{- 1} ((- \infty, c])

is compact [21]. Furthermore, the following lemma giving a sufficient condition for a trajectory of Equation (30) to converge to a limit is needed.

Lemma 6.3

Consider the nonlinear dynamical system given by Equation (30) where f is essentially nonnegative and let

x \in_{+}^{n}

. If the positive limit set

ω (x)

of Equation (30) contains a Lyapunov stable (with respect to

R_{+}^{n}

) equilibrium point y, then

y = {lim}_{t \to \infty} s (t, x)

, that is,

ω (x) = {y}

.

Proof.

Suppose

y \in ω (x)

is Lyapunov stable with respect to

R_{+}^{n}

and let

N_{ε} \subseteq_{+}^{n}

be a relatively open neighborhood of y. Since y is Lyapunov stable with respect to

R_{+}^{n}

, there exists a relatively open neighborhood

N_{δ} \subset_{+}^{n}

of y such that

s_{t} (N_{δ}) \subseteq N_{ε}

for every

t \geq 0

. Now, since

y \in ω (x)

, it follows that there exists

τ \geq 0

such that

s (τ, x) \in N_{δ}

. Hence,

s (t + τ, x) = s_{t} (s (τ, x)) \in s_{t} (N_{δ}) \subseteq N_{ε}

for every

t > 0

. Since

N_{ε} \subseteq_{+}^{n}

is arbitrary, it follows that

y = {lim}_{t \to \infty} s (t, x)

. Thus,

{lim}_{n \to \infty} s (t_{n}, x) = y

for every sequence

{t_{n}}_{n = 1}^{\infty}

, and hence,

ω (x) = {y}

. ☐

Proposition 6.1

Assume f is homogeneous of degree

k < 0

with respect to ν. Furthermore, assume that there exists a weakly proper, continuous function

V : {\bar{R}}_{+}^{n} \to R

such that

\dot{V}

is defined on

{\bar{R}}_{+}^{n}

and satisfies

\dot{V} (x) \leq 0

for all

x \in {\bar{R}}_{+}^{n}

. If every point in the largest invariant subset

N

of

{\dot{V}}^{- 1} (0)

is a Lyapunov stable equilibrium point of Equation (30), then Equation (30) is finite-time semistable.

Proof.

Since

V (\cdot)

is weakly proper, it follows from Proposition 3.1 of [21] that the positive orbit

s^{x} ([0, \infty))

of

x \in {\bar{R}}_{+}^{n}

is bounded in

{\bar{R}}_{+}^{n}

. Since every solution is bounded, it follows from the hypotheses on

V (\cdot)

that for every

x \in {\bar{R}}_{+}^{n}

, the omega limit set

ω (x)

is nonempty and contained in the largest invariant subset

N

of

{\dot{V}}^{- 1} (0)

. Since every point in

N

is a Lyapunov stable equilibrium point, it follows from Lemma 6.3 that the omega limit set

ω (x)

contains a single point for every

x \in {\bar{R}}_{+}^{n}

. And since

{lim}_{t \to \infty} s (t, x) \in N

is Lyapunov stable for every

x \in {\bar{R}}_{+}^{n}

, by definition, the system given by Equation (30) is semistable. Hence, it follows from Theorem 6.2 that Equation (30) is finite-time semistable. ☐

7. A State Space Formalism for Thermodynamics

The fundamental and unifying concept in the analysis of thermodynamic systems is the concept of energy. The energy of a state of a dynamical system is the measure of its ability to produce changes (motion) in its own system state as well as changes in the system states of its surroundings. These changes occur as a direct consequence of the energy flow between different subsystems within the dynamical system. Heat (energy) is a fundamental concept of thermodynamics involving the capacity of hot bodies (more energetic subsystems) to produce work. As in thermodynamic systems, dynamical systems can exhibit energy (due to friction) that becomes unavailable to do useful work. This in turn contributes to an increase in system entropy, a measure of the tendency of a system to lose the ability to do useful work. In this section, we use the state space formalism to construct a mathematical model of a thermodynamic system that is consistent with basic thermodynamic principles.

Specifically, we consider a large-scale system model with a combination of subsystems (compartments or parts) that is perceived as a single entity. For each subsystem (compartment) making up the system, we postulate the existence of an energy state variable such that the knowledge of these subsystem state variables at any given time

t = t_{0}

, together with the knowledge of any inputs (heat fluxes) to each of the subsystems for time

t \geq t_{0}

, completely determines the behavior of the system for any given time

t \geq t_{0}

. Hence, the (energy) state of our dynamical system at time t is uniquely determined by the state at time

t_{0}

and any external inputs for time

t \geq t_{0}

and is independent of the state and inputs before time

t_{0}

.

More precisely, we consider a large-scale dynamical system composed of a large number of units with aggregated (or lumped) energy variables representing homogenous groups of these units. If all the units comprising the system are identical (that is, the system is perfectly homogeneous), then the behavior of the dynamical system can be captured by that of a single plenipotentiary unit. Alternatively, if every interacting system unit is distinct, then the resulting model constitutes a microscopic system. To develop a middle-ground thermodynamic model placed between complete aggregation (classical thermodynamics) and complete disaggregation (statistical thermodynamics), we subdivide the large-scale dynamical system into a finite number of compartments, each formed by a large number of homogeneous units. Each compartment represents the energy content of the different parts of the dynamical system, and different compartments interact by exchanging heat. Thus, our compartmental thermodynamic model utilizes subsystems or compartments to describe the energy distribution among distinct regions in space with intercompartmental flows representing the heat transfer between these regions. Decreasing the number of compartments results in a more aggregated or homogeneous model, whereas increasing the number of compartments leads to a higher degree of disaggregation resulting in a heterogeneous model.

To formulate our state space thermodynamic model, consider the large-scale dynamical system

G

shown in Figure 1 involving energy exchange between q interconnected subsystems. Let

x_{i} : [0, \infty) \to {\bar{R}}_{+}

denote the energy (and hence a nonnegative quantity) of the ith subsystem, let

u_{i} : [0, \infty) \to R

denote the external power (heat flux) supplied to (or extracted from) the ith subsystem, let

σ_{i j} : {\bar{R}}_{+}^{q} \to {\bar{R}}_{+}

,

i \neq j, i, j = 1, \dots, q

, denote the instantaneous rate of energy (heat) flow from the jth subsystem to the ith subsystem, and let

σ_{i i} : {\bar{R}}_{+}^{q} \to {\bar{R}}_{+}, i = 1, \dots, q

, denote the instantaneous rate of energy (heat) dissipation from the ith subsystem to the environment. In this and the next two sections, we assume that

σ_{i j} : {\bar{R}}_{+}^{q} \to {\bar{R}}_{+}, i, j = 1, \dots, q

, are locally Lipschitz continuous on

{\bar{R}}_{+}^{q}

and

u_{i} : [0, \infty) \to R, i = 1, \dots, q

are bounded piecewise continuous functions of time.

Figure 1. Large-scale dynamical system

G

.

An energy balance for the ith subsystem yields

\begin{matrix} x_{i} (T) = x_{i} (t_{0}) + \sum_{j = 1, j \neq i}^{q} \int_{t_{0}}^{T} [σ_{i j} (x (t)) - σ_{j i} (x (t))] d t - \int_{t_{0}}^{T} σ_{i i} (x (t)) d t + \int_{t_{0}}^{T} u_{i} (t) d t, T \geq t_{0} \end{matrix}

(56)

or, equivalently, in vector form,

\begin{matrix} x (T) = x (t_{0}) + \int_{t_{0}}^{T} f (x (t)) d t - \int_{t_{0}}^{T} d (x (t)) d t + \int_{t_{0}}^{T} u (t) d t, T \geq t_{0} \end{matrix}

(57)

where

x (t) ≜ {[x_{1} (t), \dots, x_{q} (t)]}^{T}

,

d (x (t)) ≜ {[σ_{11} (x (t)), \dots, σ_{q q} (x (t))]}^{T}

,

u (t) ≜ {[u_{1} (t), \dots, u_{q} (t)]}^{T}

,

t \geq t_{0}

, and

f = {[f_{1}, \dots, f_{q}]}^{T} : {\bar{R}}_{+}^{q} \to R^{q}

is such that

\begin{matrix} f_{i} (x) = \sum_{j = 1, j \neq i}^{q} [σ_{i j} (x) - σ_{j i} (x)], x \in {\bar{R}}_{+}^{q} \end{matrix}

(58)

It is important to note that the exchange of energy between subsystems in Equation (56) is assumed to be a nonlinear function of all the subsystems, that is,

σ_{i j} = σ_{i j} (x), x \in {\bar{R}}_{+}^{q}, i \neq j, i, j = 1, \dots, q

. This assumption is made for generality and would depend on the complexity of the diffusion process. For example, thermal processes may include evaporative and radiative heat transfer as well as thermal conduction giving rise to complex heat transport mechanisms. However, for simple diffusion processes it suffices to assume that

σ_{i j} (x) = σ_{i j} (x_{j})

, wherein the energy flow from the jth subsystem to the ith subsystem is only dependent (possibly nonlinearly) on the energy in the jth subsystem, resulting in a donor-controlled compartmental model. Similar comments apply to system dissipation.

Note that Equation (56) yields a conservation of energy equation and implies that the energy stored in the ith subsystem is equal to the external energy supplied to (or extracted from) the ith subsystem plus the energy gained by the ith subsystem from all other subsystems due to subsystem coupling minus the energy dissipated from the ith subsystem to the environment. Equivalently, Equation (56) can be rewritten as

\begin{matrix} {\dot{x}}_{i} (t) = \sum_{j = 1, j \neq i}^{q} [σ_{i j} (x (t)) - σ_{j i} (x (t))] - σ_{i i} (x (t)) + u_{i} (t), x_{i} (t_{0}) = x_{i 0}, t \geq t_{0} \end{matrix}

(59)

or, in vector form,

\begin{matrix} \dot{x} (t) & = & f (x (t)) - d (x (t)) + u (t), x (t_{0}) = x_{0}, t \geq t_{0} \end{matrix}

(60)

where

x_{0} ≜ {[x_{10}, \dots, x_{q 0}]}^{T}

, yielding a power balance equation that characterizes energy flow between subsystems of the large-scale dynamical system

G

. Equation (59) shows that the rate of change of energy, or power, in the ith subsystem is equal to the power input (heat flux) to the ith subsystem plus the energy (heat) flow to the ith subsystem from all other subsystems minus the power dissipated from the ith subsystem to the environment. Furthermore, since

f (\cdot) - d (\cdot)

is locally Lipschitz continuous on

{\bar{R}}_{+}^{q}

and

u (\cdot)

is a bounded piecewise continuous function of time, it follows that Equation (60) has a unique solution over the finite time interval

[t_{0}, τ_{x_{0}})

. If, in addition, the power balance Equation (60) is input-to-state stable [40], then

τ_{x_{0}} = \infty

.

Equation (57) or, equivalently, Equation (60) is a statement of the first law of thermodynamics as applied to isochoric transformations (i.e., constant subsystem volume transformations) for each of the subsystems

G_{i}, i = 1, \dots, q

, with

x_{i} (\cdot)

,

u_{i} (\cdot)

,

σ_{i j} (\cdot), i \neq j

, and

σ_{i i} (\cdot), i, j = 1, \dots, q

, playing the role of the ith subsystem internal energy, rate of heat supplied to (or extracted from) the ith subsystem, heat flow between subsystems due to coupling, and the rate of energy (heat) dissipated to the environment, respectively. To further elucidate that Equation (57) is essentially the statement of the principle of the conservation of energy, let the total energy in the large-scale dynamical system

G

be given by

U ≜ e^{T} x

, where

e^{T} ≜ [1, \dots, 1]

and

x \in {\bar{R}}_{+}^{q}

, and let the net energy received by the large-scale dynamical system

G

over the time interval

[t_{1}, t_{2}]

be given by

\begin{matrix} Q ≜ \int_{t_{1}}^{t_{2}} e^{T} [u (t) - d (x (t))] d t \end{matrix}

(61)

where

x (t), t \geq t_{0}

, is the solution to Equation (60). Then, premultiplying Equation (57) by

e^{T}

and using the fact that

e^{T} f (x) \equiv 0

, it follows that

\begin{matrix} Δ U = Q \end{matrix}

(62)

where

Δ U ≜ U (t_{2}) - U (t_{1})

denotes the variation in the total energy of the large-scale dynamical system

G

over the time interval

[t_{1}, t_{2}]

. This is a statement of the first law of thermodynamics for isochoric transformations of the large-scale dynamical system

G

and gives a precise formulation of the equivalence between the variation in system internal energy and heat.

It is important to note that the large-scale dynamical system model given by Equation (60) does not consider work done by the system on the environment nor work done by the environment on the system. Hence, Q can be physically interpreted as the net amount of energy that is received by the system in forms other than work. The extension of addressing work performed by and on the system can be easily addressed by including an additional state equation, coupled to the power balance Equation (60), involving volume (deformation) states for each subsystem. Since this extension does not alter any of the conceptual results of this paper, it is not considered here for simplicity of exposition. Work performed by the system on the environment and work done by the environment on the system is addressed in [1,41].

For our large-scale dynamical system model

G

, we assume that

σ_{i j} (x) = 0, x \in {\bar{R}}_{+}^{q}

, whenever

x_{j} = 0, i, j = 1, \dots, q

. In this case,

f (x) - d (x), x \in {\bar{R}}_{+}^{q}

, is essentially nonnegative, that is,

f_{i} (x) - d_{i} (x) \geq 0

for all

i = 1, \dots, q

and

x \in {\bar{R}}_{+}^{q}

such that

x_{i} = 0

. The above constraint implies that if the energy of the jth subsystem of

G

is zero, then this subsystem cannot supply any energy to its surroundings nor dissipate energy to the environment. Moreover, we assume that

u_{i} (t) \geq 0

whenever

x_{i} (t) = 0

,

t \geq t_{0}

,

i = 1, \dots, q

, which implies that when the energy of the ith subsystem is zero, then no energy can be extracted from this subsystem. Under these assumptions, it can be shown (see [1] for details) that the solution

x (t)

,

t \geq t_{0}

, to Equation (60) is nonnegative for all nonnegative initial conditions

x_{0} \in {\bar{R}}_{+}^{q}

.

8. Entropy and Irreversibility

The nonlinear power balance Equation (60) can exhibit a full range of nonlinear behavior, including bifurcations, limit cycles, and even chaos. However, a thermodynamically consistent energy flow model should ensure that the evolution of the system energy is diffusive (parabolic) in character with convergent subsystem energies. As established in Section 4, such a system model would guarantee the absence of Poincaré recurrence. Otherwise, the thermodynamic model would violate the second law of thermodynamics, since subsystem energies (temperatures) would be allowed to return to their starting state and thereby subverting the diffusive character of the dynamical system. Hence, to ensure a thermodynamically consistent energy flow model, we require the following axioms. For the statement of these axioms [42], we first recall the following graph-theoretic notions.

Definition 8.1

([43]) A directed graph

G (C)

associated with the connectivity matrix

C \in R^{q \times q}

has vertices

{1, 2, \dots, q}

and an arc from vertex i to vertex j,

i \neq j

, if and only if

C_{(j, i)} \neq 0

. A graph

G (C)

associated with the connectivity matrix

C \in R^{q \times q}

is a directed graph for which the arc set is symmetric, that is,

C = C^{T}

. We say that

G (C)

is strongly connected if for any ordered pair of vertices

(i, j)

,

i \neq j

, there exists a path (i.e., a sequence of arcs) leading from i to j.

Recall that the connectivity matrix

C \in R^{q \times q}

is irreducible, that is, there does not exist a permutation matrix such that

C

is cogradient to a lower-block triangular matrix, if and only if

G (C)

is strongly connected (see Theorem 2.7 of [43]). Let

ϕ_{i j} (x) ≜ σ_{i j} (x) - σ_{j i} (x), x \in {\bar{R}}_{+}^{q}

, denote the net energy flow from the jth subsystem

G_{j}

to the ith subsystem

G_{i}

of the large-scale dynamical system

G

.

Axiom (i) The connectivity matrix

C \in R^{q \times q}

associated with the large-scale dynamical system

G

is defined by

\begin{matrix} C_{(i, j)} ≜ \{\begin{matrix} 0, & if ϕ_{i j} (x) \equiv 0, \\ 1, & otherwise, \end{matrix} i \neq j, i, j = 1, \dots, q \end{matrix}

(63)

and

\begin{matrix} C_{(i, i)} ≜ - \sum_{k = 1, k \neq i}^{q} C_{(k, i)}, i = 1, \dots, q \end{matrix}

(64)

and satisfies rank

C = q - 1

. Moreover, for every

i \neq j

such that

C_{(i, j)} = 1

,

ϕ_{i j} (x) = 0

if and only if

x_{i} = x_{j}

.

Axiom (ii) For

i, j = 1, \dots, q

,

(x_{i} - x_{j}) ϕ_{i j} (x) \leq 0

,

x \in {\bar{R}}_{+}^{q}

.

The fact that

ϕ_{i j} (x) = 0

if and only if

x_{i} = x_{j}, i \neq j

, implies that subsystems

G_{i}

and

G_{j}

of

G

are connected; alternatively,

ϕ_{i j} (E) \equiv 0

implies that

G_{i}

and

G_{j}

are disconnected. Axiom (i) implies that if the energies in the connected subsystems

G_{i}

and

G_{j}

are equal, then energy exchange between these subsystems is not possible. This statement is consistent with the zeroth law of thermodynamics, which postulates that temperature equality is a necessary and sufficient condition for thermal equilibrium. Furthermore, it follows from the fact that

C = C^{T}

and rank

C = q - 1

that the connectivity matrix

C

is irreducible, which implies that for every pair of subsystems

G_{i}

and

G_{j}

,

i \neq j

, of

G

there exists a sequence of connectors (arcs) of

G

that connect

G_{i}

and

G_{j}

. Axiom (ii) implies that energy flows from more energetic subsystems to less energetic subsystems and is consistent with the second law of thermodynamics, which states that heat (energy) must flow in the direction of lower temperatures [44]. Furthermore, note that

ϕ_{i j} (x) = - ϕ_{j i} (x)

,

x \in {\bar{R}}_{+}^{q}, i \neq j, i, j = 1, \dots, q

, which implies conservation of energy between lossless subsystems. With

u (t) \equiv 0

, Axioms (i) and (ii) along with the fact that

ϕ_{i j} (x) = - ϕ_{j i} (x)

,

x \in {\bar{R}}_{+}^{q}, i \neq j, i, j = 1, \dots, q

, imply that at a given instant of time, energy can only be transported, stored, or dissipated but not created, and the maximum amount of energy that can be transported and/or dissipated from a subsystem cannot exceed the energy in the subsystem.

Next, we show that the classical Clausius equality and inequality for reversible and irreversible thermodynamics over cyclic motions are satisfied for our thermodynamically consistent energy flow model. For this result ∮ denotes a cyclic integral evaluated along an arbitrary closed path of Equation (60) in

{\bar{R}}_{+}^{q}

; that is,

\oint ≜ \int_{t_{0}}^{t_{f}}

with

t_{f} \geq t_{0}

and

u (\cdot) \in U

such that

x (t_{f}) = x (t_{0}) = x_{0} \in {\bar{R}}_{+}^{q}

.

Proposition 8.1

Consider the large-scale dynamical system

G

with power balance Equation (60), and assume that Axioms (i) and (ii) hold. Then, for all

x_{0} \in {\bar{R}}_{+}^{q}

,

t_{f} \geq t_{0}

, and

u (\cdot) \in U

such that

x (t_{f}) = x (t_{0}) = x_{0}

,

\begin{matrix} \int_{t_{0}}^{t_{f}} \sum_{i = 1}^{q} \frac{u_{i} (t) - σ_{i i} (x (t))}{c + x_{i} (t)} d t = \oint \sum_{i = 1}^{q} \frac{d Q_{i} (t)}{c + x_{i} (t)} \leq 0 \end{matrix}

(65)

where

c > 0

,

d Q_{i} (t) ≜ [u_{i} (t) - σ_{i i} (x (t))] d t

,

i = 1, \dots, q

, is the amount of net energy (heat) received or dissipated by the ith subsystem over the infinitesimal time interval

d t

, and

x (t)

,

t \geq t_{0}

, is the solution to Equation (60) with initial condition

x (t_{0}) = x_{0}

. Furthermore,

\begin{matrix} \oint \sum_{i = 1}^{q} \frac{d Q_{i} (t)}{c + x_{i} (t)} = 0 \end{matrix}

(66)

if and only if there exists a continuous function

α : [t_{0}, t_{f}] \to {\bar{R}}_{+}

such that

x (t) = α (t) e

,

t \in [t_{0}, t_{f}]

.

Proof.

Since

x (t) \geq \geq 0, t \geq t_{0}

, and

ϕ_{i j} (x) = - ϕ_{j i} (x), x \in {\bar{R}}_{+}^{q}

,

i \neq j, i, j = 1, \dots, q

, it follows from Equation (60) and Axiom (ii) that

\begin{matrix} \begin{matrix} \oint \sum_{i = 1}^{q} \frac{d Q_{i} (t)}{c + x_{i} (t)} & = \int_{t_{0}}^{t_{f}} \sum_{i = 1}^{q} \frac{{\dot{x}}_{i} (t) - \sum_{j = 1, j \neq i}^{q} ϕ_{i j} (x (t))}{c + x_{i} (t)} d t \\ = \sum_{i = 1}^{q} {log}_{e} (\frac{c + x_{i} (t_{f})}{c + x_{i} (t_{0})}) - \int_{t_{0}}^{t_{f}} \sum_{i = 1}^{q} \sum_{j = 1, j \neq i}^{q} \frac{ϕ_{i j} (x (t))}{c + x_{i} (t)} d t \\ = - \int_{t_{0}}^{t_{f}} \sum_{i = 1}^{q - 1} \sum_{j = i + 1}^{q} (\frac{ϕ_{i j} (x (t))}{c + x_{i} (t)} - \frac{ϕ_{i j} (x (t))}{c + x_{j} (t)}) d t \\ = - \int_{t_{0}}^{t_{f}} \sum_{i = 1}^{q - 1} \sum_{j = i + 1}^{q} \frac{ϕ_{i j} (x (t)) (x_{j} (t) - x_{i} (t))}{(c + x_{i} (t)) (c + x_{j} (t))} d t \\ \leq 0 \end{matrix} \end{matrix}

(67)

which proves Equation (65).

To show Equation (66), note that it follows from Equation (67), Axiom (i), and Axiom (ii) that Equation (66) holds if and only if

x_{i} (t) = x_{j} (t), t \in [t_{0}, t_{f}]

,

i \neq j, i, j = 1, \dots, q

, or, equivalently, there exists a continuous function

α : [t_{0}, t_{f}] \to {\bar{R}}_{+}

such that

x (t) = α (t) e, t \in [t_{0}, t_{f}]

. ☐

The inequality given by Equation (65) is a generalization of Clausius’ inequality for reversible and irreversible thermodynamics as applied to large-scale dynamical systems and restricts the manner in which the system dissipates (scaled) heat over cyclic motions. It follows from Axiom (i) and Equation (60) that for the adiabatically isolated large-scale dynamical system

G

(that is,

u (t) \equiv 0

and

d (x (t)) \equiv 0

), the energy states given by

x_{e} = α e, α \geq 0

, correspond to the equilibrium energy states of

G

. Thus, as in classical thermodynamics, we can define an equilibrium process as a process in which the trajectory of the large-scale dynamical system

G

moves along the equilibrium manifold

M_{e} ≜ {x \in {\bar{R}}_{+}^{q} : E = α e, α \geq 0}

corresponding to the set of equilibria of the isolated [45] system

G

. The power input that can generate such a trajectory can be given by

u (t) = d (x (t)) + \hat{u} (t), t \geq t_{0}

, where

\hat{u} (\cdot) \in U

is such that

{\hat{u}}_{i} (t) \equiv {\hat{u}}_{j} (t), i \neq j, i, j = 1, \dots, q

. Our definition of an equilibrium transformation involves a continuous succession of intermediate states that differ by infinitesimals from equilibrium system states and thus can only connect initial and final states, which are states of equilibrium. This process need not be slowly varying, and hence, equilibrium and quasistatic processes are not synonymous in this paper. Alternatively, a nonequilibrium process is a process that does not lie on the equilibrium manifold

M_{e}

. Hence, it follows from Axiom (i) that for an equilibrium process

ϕ_{i j} (x (t)) = 0, t \geq t_{0}

,

i \neq j, i, j = 1, \dots, q

, and thus, by Proposition 8.1, the inequality given by Equation (65) is satisfied as an equality. Alternatively, for a nonequilibrium process it follows from Axioms (i) and (ii) that Equation (65) is satisfied as a strict inequality.

Next, we give a deterministic definition of entropy for the large-scale dynamical system

G

that is consistent with the classical thermodynamic definition of entropy.

Definition 8.2

For the large-scale dynamical system

G

with power balance Equation (60), a function

S : {\bar{R}}_{+}^{q} \to R

satisfying

\begin{matrix} S (x (t_{2})) \geq S (x (t_{1})) + \int_{t_{1}}^{t_{2}} \sum_{i = 1}^{q} \frac{u_{i} (t) - σ_{i i} (x (t))}{c + x_{i} (t)} d t \end{matrix}

(68)

for every

t_{2} \geq t_{1} \geq t_{0}

and

u (\cdot) \in U

is called the entropy function of

G

.

Next, we establish the existence of a unique, continuously differentiable entropy function for

G

for equilibrium and nonequilibrium processes. This result answers the long-standing question of how the entropy of a nonequilibrium state of a dynamical process should be defined [46,47], and establishes its global existence and uniqueness.

Theorem 8.1

Consider the large-scale dynamical system

G

with power balance Equation (60), and assume that Axioms (i) and (ii) hold. Then the function

S : {\bar{R}}_{+}^{q} \to {\bar{R}}_{+}^{q}

given by

\begin{matrix} S (x) = e^{T} \log_{e} (c e + x) - q {log}_{e} c, x \in {\bar{R}}_{+}^{q} \end{matrix}

(69)

where

\log_{e} (c e + x) ≜ {[{log}_{e} (c + x_{1}), \dots, {log}_{e} (c + x_{q})]}^{T}

and

c > 0

, is a unique (modulo a constant of integration), continuously differentiable entropy function of

G

. Furthermore, for

x (t) \notin

M_{e}

,

t \geq t_{0}

, where

x (t)

,

t \geq t_{0}

, denotes the solution to Equation (60) and

M_{e} = {x \in {\bar{R}}_{+}^{q} : x = α e, α \geq 0}

, Equation (69) satisfies

\begin{matrix} S (x (t_{2})) > S (x (t_{1})) + \int_{t_{1}}^{t_{2}} \sum_{i = 1}^{q} \frac{u_{i} (t) - σ_{i i} (x (t))}{c + x_{i} (t)} d t \end{matrix}

(70)

for every

t_{2} \geq t_{1} \geq t_{0}

and

u (\cdot) \in U

.

Proof.

Since

x (t) \geq \geq 0, t \geq t_{0}

, and

ϕ_{i j} (x) = - ϕ_{j i} (x), x \in {\bar{R}}_{+}^{q}

,

i \neq j, i, j = 1, \dots, q

, it follows that

\begin{matrix} \begin{matrix} \dot{S} (x (t)) & = \sum_{i = 1}^{q} \frac{{\dot{x}}_{i} (t)}{c + x_{i} (t)} \\ = \sum_{i = 1}^{q} [\frac{u_{i} (t) - σ_{i i} (x (t))}{c + x_{i} (t)} + \sum_{j = 1, j \neq i}^{q} \frac{ϕ_{i j} (x (t))}{c + x_{i} (t)}] \\ = \sum_{i = 1}^{q} \frac{u_{i} (t) - σ_{i i} (x (t))}{c + x_{i} (t)} + \sum_{i = 1}^{q - 1} \sum_{j = i + 1}^{q} (\frac{ϕ_{i j} (x (t))}{c + x_{i} (t)} - \frac{ϕ_{i j} (x (t))}{c + x_{j} (t)}) \\ = \sum_{i = 1}^{q} \frac{u_{i} (t) - σ_{i i} (x (t))}{c + x_{i} (t)} + \sum_{i = 1}^{q - 1} \sum_{j = i + 1}^{q} \frac{ϕ_{i j} (x (t)) (x_{j} (t) - x_{i} (t))}{(c + x_{i} (t)) (c + x_{j} (t))} \\ \geq \sum_{i = 1}^{q} \frac{u_{i} (t) - σ_{i i} (x (t))}{c + x_{i} (t)}, t \geq t_{0} \end{matrix} \end{matrix}

(71)

Now, integrating Equation (71) over

[t_{1}, t_{2}]

yields Equation (68). Furthermore, in the case where

x (t) \notin M_{e}, t \geq t_{0}

, it follows from Axiom (i), Axiom (ii), and Equation (71) that Equation (70) holds.

To show that Equation (69) is a unique, continuously differentiable entropy function of

G

, let

S (x)

be a continuously differentiable entropy function of

G

so that

S (x)

satisfies Equation (68) or, equivalently,

\begin{matrix} \dot{S} (x (t)) \geq μ^{T} (x (t)) [u (t) - d (x (t))], t \geq t_{0} \end{matrix}

(72)

where

μ^{T} (x) = [\frac{1}{c + x_{1}}, \dots, \frac{1}{c + x_{q}}], x \in {\bar{R}}_{+}^{q}

,

x (t), t \geq t_{0}

, denotes the solution to the power balance Equation (60), and

\dot{S} (x (t))

denotes the time derivative of

S (x)

along the solution

x (t), t \geq t_{0}

. Hence, it follows from Equation (72) that

\begin{matrix} S^{'} (x) [f (x) - d (x) + u] & \geq & μ^{T} (x) [u - d (x)], x \in {\bar{R}}_{+}^{q}, u \in R^{q} \end{matrix}

(73)

which implies that there exist continuous functions

ℓ : {\bar{R}}_{+}^{q} \to R^{p}

and

W : {\bar{R}}_{+}^{q} \to R^{p \times q}

such that

\begin{matrix} \begin{matrix} 0 = S^{'} (x) & [f (x) - d (x) + u] - μ^{T} (x) [u - d (x)] \\ - & {[ℓ (x) + W (x) u]}^{T} [ℓ (x) + W (x) u], x \in {\bar{R}}_{+}^{q}, u \in R^{q} \end{matrix} \end{matrix}

(74)

Now, equating coefficients of equal powers (of u), it follows that

W (x)

\equiv 0

,

S^{'} (x) = μ^{T} (x), x \in {\bar{R}}_{+}^{q}

, and

\begin{matrix} 0 = S^{'} (x) f (x) - ℓ^{T} (x) ℓ (x), x \in {\bar{R}}_{+}^{q} \end{matrix}

(75)

Hence,

S (x) = e^{T} \log_{e} (c e + x) - q {log}_{e} c, x \in {\bar{R}}_{+}^{q}

, and

\begin{matrix} 0 = μ^{T} (x) f (x) - ℓ^{T} (x) ℓ (x), x \in {\bar{R}}_{+}^{q} \end{matrix}

(76)

Thus, Equation (69) is a unique, continuously differentiable entropy function for

G

. ☐

Note that it follows from Axiom (i), Axiom (ii), and the last equality in Equation (71) that the entropy function given by Equation (69) satisfies Equation (68) as an equality for an equilibrium process and as a strict inequality for a nonequilibrium process. Hence, it follows from Theorem 4.7 that the isolated (i.e.,

u (t) \equiv 0

and

d (x) \equiv 0

) large-scale dynamical system

G

does not exhibit Poincaré recurrence in

{\bar{R}}_{+}^{q} ∖ M_{e}

. Furthermore, for any entropy function of

G

, it follows from Proposition 8.1 that if Equation (68) holds as an equality for some transformation starting and ending at equilibrium points of the isolated system

G

, then this transformation must lie on the equilibrium manifold

M_{e}

. However, Equation (68) may hold as an equality for nonequilibrium processes starting and ending at nonequilibrium states. The entropy expression given by Equation (69) is identical in form to the Boltzmann entropy for statistical thermodynamics. Due to the fact that the entropy given by Equation (69) is indeterminate to the extent of an additive constant, we can set the constant of integration

q {log}_{e} c

to zero by taking

c = 1

. Since

S (x)

given by Equation (69) achieves a maximum when all the subsystem energies

x_{i}, i = 1, \dots, q

, are equal [1], the entropy of

G

can be thought of as a measure of the tendency of a system to lose the ability to do useful work, lose order, and settle to a more homogenous state.

Recalling that

d Q_{i} (t) = [u_{i} (t) - σ_{i i} (x (t))] d t, i = 1, \dots, q

, is the infinitesimal amount of the net heat received or dissipated by the ith subsystem of

G

over the infinitesimal time interval

d t

, it follows from Equation (68) that

\begin{matrix} d S (x (t)) \geq \sum_{i = 1}^{q} \frac{d Q_{i} (t)}{c + x_{i} (t)}, t \geq t_{0} \end{matrix}

(77)

The inequality given by Equation (77) is analogous to the classical thermodynamic inequality for the variation of entropy during an infinitesimal irreversible transformation with the shifted subsystem energies

c + x_{i}

playing the role of the ith subsystem thermodynamic (absolute) temperatures. Specifically, note that since

\frac{d S_{i}}{d x_{i}} = \frac{1}{c + x_{i}}

, where

S_{i} = {log}_{e} (c + x_{i}) - {log}_{e} c

denotes the unique continuously differentiable ith subsystem entropy, it follows that

\frac{d S_{i}}{d x_{i}}, i = 1, \dots, q

, defines the reciprocal of the subsystem thermodynamic temperatures. That is,

\begin{matrix} \frac{1}{T_{i}} ≜ \frac{d S_{i}}{d x_{i}} \end{matrix}

(78)

and

T_{i} > 0

,

i = 1, \dots, q

. Hence, in our formulation, temperature is a function derived from entropy and does not involve the primitive subjective notions of hotness and coldness.

Finally, using the system entropy function given by Equation (69) we show that our large-scale dynamical system

G

with power balance Equation (60) is state irreversible for every nontrivial (nonequilibrium) trajectory of

G

. For this result, let

W_{[t_{0}, t_{1}]}

denote the set of all possible energy trajectories of

G

over the time interval

[t_{0}, t_{1}]

given by

\begin{matrix} W_{[t_{0}, t_{1}]} ≜ {s^{x} : [t_{0}, t_{1}] \times U \to {\bar{R}}_{+}^{q} : s^{x} (\cdot, u (\cdot)) satisfies Equation (60)} \end{matrix}

(79)

and let

M_{e} \subset {\bar{R}}_{+}^{q}

denote the set of equilibria of the isolated system

G

given by

M_{e} = {x \in {\bar{R}}_{+}^{q} : α e, α \geq 0}

.

Theorem 8.2

Consider the large-scale dynamical system

G

with power balance Equation (60), and assume Axioms (i) and (ii) hold. Furthermore, let

s^{x} (\cdot, u (\cdot)) \in W_{[t_{0}, t_{1}]}

, where

u (\cdot) \in U

. Then

s^{x} (\cdot, u (\cdot))

is an

I_{q}

-reversible trajectory of

G

if and only if

s^{x} (t, u (t)) \in M_{e}

,

t \in [t_{0}, t_{1}]

.

Proof.

First, note that it follows from Theorem 8.1 that if

x (t)

\notin M_{e}, t \geq t_{0}

, then there exists an entropy function

S (x), x \in {\bar{R}}_{+}^{q}

, for

G

such that Equation (70) holds. Now, sufficiency follows as a direct consequence of Theorem 3.1 with

R = I_{q}

,

V (x) = S (x)

, and

r (u, y) = r (u, d (x)) = \sum_{i = 1}^{q} \frac{u_{i} - σ_{i i} (x)}{c + x_{i}}

. To show necessity, assume that

s^{x} (t, u (t)) \in M_{e}, t \in [t_{0}, t_{1}]

. In this case, it can be shown that

u (t) = d (x (t)) + \hat{u} (t), t \geq t_{0}

, where

\hat{u} (\cdot) \in U

is such that

{\hat{u}}_{i} (t) \equiv {\hat{u}}_{j} (t), i \neq j, i, j = 1, \dots, q

. Now, with

u^{-} (t) = d (x (t)) + {\hat{u}}^{-} (t), t \geq t_{0}

, where

{\hat{u}}^{-} (t) = - \hat{u} (t_{1} + t_{0} - t), t \in [t_{0}, t_{1}]

, it follows that

s^{x} (t, u (t))

is an

I_{q}

-reversible trajectory of

G

. ☐

Theorem 8.2 establishes an equivalence between (non)equilibrium and state (ir)reversible thermodynamic systems. Furthermore, Theorem 8.2 shows that for every

x_{0} \notin M_{e}

, the large-scale dynamical system

G

is state irreversible. In addition, since state irrecoverability implies state irreversibility and, by Theorem 8.2, state irreversibility is equivalent to

x (t) \notin M_{e}, t \geq t_{0}

, it follows from Theorem 3.2 that state (ir)reversibility and state (ir)recoverability are equivalent for our thermodynamically consistent large-scale dynamical system

G

. Hence, in the remainder of the paper we use the notions of (non)equilibrium, state (ir)reversible, and state (ir)recoverable dynamical processes interchangeably.

9. Semistability and the Entropic Arrow of Time

For the isolated large-scale dynamical system

G

, Equation (71) yields the fundamental inequality

\begin{matrix} S (x (t_{2})) \geq S (x (t_{1})), t_{2} \geq t_{1} \end{matrix}

(80)

The inequality given by Equation (80) implies that, for any dynamical change in an isolated large-scale dynamical system

G

, the entropy of the final state can never be less than the entropy of the initial state. Equation (80) is often identified with the second law of thermodynamics as a statement about entropy increase. Furthermore, it follows from Equation (70) that for an isolated large-scale dynamical system

G

the entropy function Equation (69) is a strictly increasing function of time along the trajectories of Equation (60) with initial conditions in

{\bar{R}}_{+}^{q} ∖ M_{e}

. Hence, it follows from Theorem 4.7 that the isolated large-scale dynamical system

G

does not exhibit Poincaré recurrence in

{\bar{R}}_{+}^{q} ∖ M_{e}

. This result can also be arrived at using the fact that our thermodynamically consistent large-scale dynamical system

G

is semistable.

Since our thermodynamic compartmental model involves intercompartmental flows representing energy transfer between compartments, we can use graph-theoretic notions with undirected graph topologies (i.e., bidirectional energy flows) to capture the compartmental system interconnections. Graph theory [48,49] can be useful in the analysis of the connectivity properties of compartmental systems. In particular, a directed graph can be constructed to capture a compartmental model in which the compartments are represented by nodes and the flows are represented by edges or arcs. In this case, the environment must also be considered as an additional node. Specifically, let

G = (V, E, A)

be a directed graph (or digraph) denoting the compartmental network with the set of nodes (or compartments)

V = {1, \dots, q}

involving a finite nonempty set denoting the compartments, the set of edges

E \subseteq V \times V

involving a set of ordered pairs denoting the direction of energy flow, and an adjacency matrix

A \in R^{q \times q}

such that

A_{(i, j)} = 1

,

i, j = 1, \dots, q

, if

(j, i) \in E

, while

A_{(i, j)} = 0

if

(j, i) \notin E

. The edge

(j, i) \in E

denotes that compartment j can obtain energy from compartment i, but not necessarily vice versa. Moreover, we assume

A_{(i, i)} = 0

for all

i \in V

. A graph or undirected graph

G

associated with the adjacency matrix

A \in R^{q \times q}

is a directed graph for which the arc set is symmetric, that is,

A = A^{T}

. Weighted graphs can also be considered here; however, since this extension does not alter any of the conceptual results in this paper we do not consider this extension for simplicity of exposition. Finally, we denote the energy of the compartment

i \in {1, \dots, q}

at time t by

x_{i} (t) \in {\bar{R}}_{+}

.

Proposition 9.1

Consider the large-scale dynamical system

G

with power balance Equation (60) with

d (x) \equiv 0

and

u (t) \equiv 0

, and assume Axioms (i) and (ii) hold. Then

f_{i} (x) = 0

for all

i = 1, \dots, q

if and only if

x_{1} = \dots = x_{q}

. Furthermore,

α e

,

α \geq 0

, is an equilibrium state of Equation (60).

Proof.

If

x_{i} = x_{j}

for all

(i, j) \in E

, then

f_{i} (x) = 0

for all

i = 1, \dots, q

is immediate from Axiom (i). Next, we show that

f_{i} (x) = 0

for all

i = 1, \dots, q

implies that

x_{1} = \dots = x_{q}

. If

f_{i} (x) = 0

for all

i = 1, \dots, q

, then it follows from Axiom (ii) that

\begin{matrix} 0 & = \sum_{i = 1}^{q} x_{i} f_{i} (x) \\ = \sum_{i = 1}^{q} \sum_{j = 1}^{q} x_{i} ϕ_{i j} (x) \\ = \sum_{i = 1}^{q - 1} \sum_{j = i + 1}^{q} (x_{i} - x_{j}) ϕ_{i j} (x) \\ \leq 0 \end{matrix}

where we have used the fact that

ϕ_{i j} (x) = - ϕ_{j i} (x)

for all

i, j = 1, \dots, q

. Hence,

(x_{i} - x_{j}) ϕ_{i j} (x) = 0

for all

i, j = 1, \dots, q

. Now, the result follows from Axiom (i).

Alternatively, the proof can also be shown using graph-theoretic concepts. Specifically, if

x_{i} = x_{j}

for all

(i, j) \in E

, then

f_{i} (x) = 0

for all

i = 1, \dots, q

is immediate from Axiom (i). Next, we show that

f_{i} (x) = 0

for all

i = 1, \dots, q

implies that

x_{1} = \dots = x_{q}

. If the values of all nodes are equal, then the result is immediate. Hence, assume there exists a node

i^{*}

such that

x_{i^{*}} \geq x_{j}

for all

j \neq i^{*}

,

j \in {1, \dots, q}

. If

(i, j) \in E

, then we define a neighbor of node i to be node j, and vice versa.

Define the initial node set

J^{(0)} ≜ {i^{*}}

and denote the indices of all the first neighbors of node

i^{*}

by

J^{(1)} = N_{i^{*}}

. Then,

f_{i^{*}} (x) = 0

implies that

\sum_{j \in N_{i^{*}}} ϕ_{i^{*} j} (x_{i^{*}}, x_{j}) = 0

. Since

x_{j} \leq x_{i^{*}}

for all

j \in N_{i^{*}}

and, by Axiom (ii),

ϕ_{i j} (z_{i}, z_{j}) \leq 0

for all

z_{i} \geq z_{j}

, it follows that

x_{i^{*}} = x_{j}

for all the first neighbors

j \in J^{(1)}

. Next, we define the kth neighbor of node

i^{*}

and show that the value of node

i^{*}

is equal to the values of all kth neighbors of node

i^{*}

for

k = 1, \dots, q - 1

. The set of kth neighbors of node

i^{*}

is defined by

\begin{matrix} J^{(k)} ≜ J^{(k - 1)} \cup N_{J^{(k - 1)}}, k \geq 1, J^{(0)} = {i^{*}} \end{matrix}

(81)

where

N_{J}

denotes the set of neighbors of the node set

J \subseteq V

. By definition,

{i^{*}} \subset J^{(k)} \subseteq V

for all

k \geq 1

and

J^{(k)}

is a monotonically increasing sequence of node sets in the sense of set inclusions.

Next, we show that

J^{(q - 1)} = V

. Suppose, ad absurdum,

V ∖ J^{(q - 1)} \neq Ø

. Then, by definition, there exists one node

m \in {1, \dots, q}

, disconnected from all the other nodes. Hence,

C_{(m, i)} = C_{(i, m)} = 0

,

i = 1, \dots, q

, which implies that the connectivity matrix

C

has a row and a column of zeros. Without loss of generality, assume that

C

has the form

\begin{matrix} C = [\begin{matrix} C_{s} & 0_{(q - 1) \times 1} \\ 0_{1 \times (q - 1)} & 0 \end{matrix}] \end{matrix}

where

C_{s} \in R^{(q - 1) \times (q - 1)}

denotes the connectivity matrix for the new undirected graph

G

which excludes node m from the undirected graph

G

. In this case, since

rank C_{s} \leq q - 2

, it follows that

rank C < q - 1

, which contradicts Axiom (i).

Using mathematical induction, we show that the values of all the nodes in

J^{(k)}

are equal for

k \geq 1

. This statement holds for

k = 1

. Assuming that the values of all the nodes in

J^{(k)}

are equal to the value of node

i^{*}

, we show that the values of all the nodes in

J^{(k + 1)}

are equal to the value of node

i^{*}

as well. Note that since

G

is strongly connected,

N_{i} \neq Ø

for all

i \in V

. If

N_{i} \cap (J^{(k + 1)} ∖ J^{(k)}) = Ø

for all i, then it follows that

J^{(k + 1)} = J^{(k)}

, and hence, the statement holds. Thus, it suffices to show that

x_{i} = x_{i^{*}}

for an arbitrary node

i \in J^{(k)}

with

N_{i} \cap (J^{(k + 1)} ∖ J^{(k)}) \neq Ø

. For node i, note that

\sum_{j \in N_{i}} ϕ_{i j} (x_{i}, x_{j}) = 0

. Furthermore, note that

N_{i} = (N_{i} \cap J^{(k)}) \cup (N_{i} \cap (V ∖ J^{(k)}))

,

V ∖ J^{(k)} = V ∖ J^{(k + 1)} \cup (J^{(k + 1)} ∖ J^{(k)})

,

J^{(k)} \subseteq V

for all k, and

J^{(k + 1)}

contains the set of first neighbors of node i, or

N_{i} \subseteq J^{(k + 1)}

. Then it follows that

N_{i} \cap (V ∖ J^{(k)}) = N_{i} \cap (J^{(k + 1)} ∖ J^{(k)})

and

\begin{matrix} \sum_{j \in N_{i} \cap J^{(k)}} ϕ_{i j} (x_{i}, x_{j}) + \sum_{j \in N_{i} \cap (J^{(k + 1)} ∖ J^{(k)})} ϕ_{i j} (x_{i}, x_{j}) = 0 \end{matrix}

(82)

Since

x_{j} = x_{i}

for all nodes

j \in N_{i} \cap J^{(k)} \subseteq J^{(k)}

, it follows that

\sum_{j \in N_{i} \cap J^{(k)}} ϕ_{i j} (x_{i}, x_{j}) = 0

, and hence,

\sum_{j \in N_{i} \cap (J^{(k + 1)} ∖ J^{(k)})} ϕ_{i j} (x_{i}, x_{j}) = 0

. However, since

x_{i^{*}} = x_{i} \geq x_{j}

for all

i \in J^{(k)}

and

j \in V ∖ J^{(k)}

, it follows that the values of all nodes in

N_{i} \cap (J^{(k + 1)} ∖ J^{(k)})

are equal to

x_{i^{*}}

. Hence, the values of all nodes i in the node set

⋃_{i \in J^{(k)}} N_{i} \cap (J^{(k + 1)} ∖ J^{(k)}) = J^{(k + 1)} \cap (J^{(k + 1)} ∖ J^{(k)}) = J^{(k + 1)} ∖ J^{(k)}

are equal to

x_{i^{*}}

, that is, the values of all the nodes in

J^{(k + 1)}

are equal. Combining this result with the fact that

J^{(q - 1)} = V

, it follows that the values of all the nodes in

V

are equal.

The second assertion is a direct consequence of the first assertion.☐

Theorem 9.1

Consider the large-scale dynamical system

G

with power balance Equation (60) with

u (t) \equiv 0

and

d (x) \equiv 0

, and assume that Axioms (i) and (ii) hold. Then for every

α \geq 0

,

α e

is a semistable equilibrium state of Equation (60). Furthermore,

x (t) \to \frac{1}{q} e e^{T} x (t_{0})

as

t \to \infty

and

\frac{1}{q} e e^{T} x (t_{0})

is a semistable equilibrium state.

Proof.

It follows from Proposition 9.1 that

α e \in {\bar{R}}_{+}^{q}, α \geq 0

, is an equilibrium state of Equation (60). To show Lyapunov stability of the equilibrium state

α e

, consider the function

V (x) = \frac{1}{2} {(x - α e)}^{T} (x - α e)

as a Lyapunov function candidate. Now, since

ϕ_{i j} (x) = - ϕ_{j i} (x), x \in {\bar{R}}_{+}^{q}

,

i \neq j, i, j = 1, \dots, q

, and

e^{T} f (x) = 0, x \in {\bar{R}}_{+}^{q}

, it follows from Axiom (ii) that

\begin{matrix} \dot{V} (x) & = & {(x - α e)}^{T} \dot{x} \\ = & {(x - α e)}^{T} f (x) \\ = & x^{T} f (x) \\ = & \sum_{i = 1}^{q} x_{i} [\sum_{j = 1, j \neq i}^{q} ϕ_{i j} (x)] \\ = & \sum_{i = 1}^{q - 1} \sum_{j = i + 1}^{q} (x_{i} - x_{j}) ϕ_{i j} (x) \\ = & \sum_{i = 1}^{q} \sum_{j \in K_{i}} (x_{i} - x_{j}) ϕ_{i j} (x) \end{matrix}

\begin{matrix} \leq & 0, x \in {\bar{R}}_{+}^{q} \end{matrix}

where

K_{i} ≜ N_{i} ∖ \cup_{l = 1}^{i - 1} {l}

and

N_{i} ≜ {j \in {1, \dots, q} : ϕ_{i j} (x) = 0 if and

only if x_{i} = x_{j}}

,

i = 1, \dots, q

, which establishes Lyapunov stability of the equilibrium state

α e

.

To show that

α e

is semistable, let

R ≜ {x \in {\bar{R}}_{+}^{q} : \dot{V} (x) = 0} = {x \in {\bar{R}}_{+}^{q} : (x_{i} - x_{j}) ϕ_{i j} (x) = 0, i = 1, \dots, q, j \in K_{i}}

. Now, by Axiom (i) the directed graph associated with the connectivity matrix

C

for the large-scale dynamical system

G

is strongly connected, which implies that

R = {x \in {\bar{R}}_{+}^{q} : x_{1} = \cdot \cdot \cdot = x_{q}}

. Since the set

R

consists of the equilibrium states of Equation (60), it follows that the largest invariant set

M

contained in

R

is given by

M = R

. Hence, it follows from the Krasovskii–LaSalle theorem [40] that for every initial condition

x (t_{0}) \in {\bar{R}}_{+}^{q}

,

x (t) \to M

as

t \to \infty

, and hence,

α e

is a semistable equilibrium state of Equation (60). Next, note that since

e^{T} x (t) = e^{T} x (t_{0})

and

x (t) \to M

as

t \to \infty

, it follows that

x (t) \to \frac{1}{q} e e^{T} x (t_{0})

as

t \to \infty

. Hence, with

α = \frac{1}{q} e^{T} x (t_{0})

,

α e = \frac{1}{q} e e^{T} x (t_{0})

is a semistable equilibrium state of Equation (60). ☐

Theorem 9.1 shows that the isolated (i.e.,

u (t) \equiv 0

and

d (x) \equiv 0

) large-scale dynamical system

G

is semistable. Hence, it follows from Theorem 4.8 that the isolated large-scale dynamical system

G

does not exhibit Poincaré recurrence in

{\bar{R}}_{+}^{q} ∖ M_{e}

. Next, using the system entropy function given by Equation (69), we show that our large-scale isolated dynamical system

G

with power balance Equation (60) is state irreversible for all nonequilibrium trajectories of

G

establishing a clear connection between our thermodynamic model and the arrow of time.

Theorem 9.2

Consider the large-scale dynamical system

G

with power balance Equation (60) with

u (t) \equiv 0

and

d (x) \equiv 0

, and assume Axioms (i) and (ii) hold. Furthermore, let

s^{x} (\cdot, 0) \in W_{[t_{0}, t_{1}]}

. Then for every

x_{0} \notin M_{e}

, there exists a continuously differentiable function

S : {\bar{R}}_{+}^{q} \to R

such that

S (s^{x} (t, 0))

is a strictly increasing function of time. Furthermore,

s^{x} (\cdot, 0)

is an

I_{q}

-reversible trajectory of

G

if and only if

s^{x} (t, 0) \in M_{e}

,

t \in [t_{0}, t_{1}]

.

Proof.

The existence of a continuously differentiable function

S : {\bar{R}}_{+}^{q} \to R

, which strictly increases on all nonequilibrium trajectories of

G

, is a restatement of Theorem 8.1 with

u (t) \equiv 0

and

d (x) \equiv 0

. Now, necessity is immediate, while sufficiency is a direct consequence of Corollary 3.1 with

R = I_{q}

and

V (x) = S (x)

. ☐

Theorem 9.2 shows that for every

x_{0} \notin M_{e}

, the isolated dynamical system

G

is state irreversible. This gives a clear connection between our thermodynamic model and the arrow of time. In particular, it follows from Corollary 3.1 and Theorem 9.2 that there exists a function of the system state that strictly increases in time on every nonequilibrium trajectory of

G

if and only if there does not exist a nonequilibrium reversible trajectory of

G

. Thus, the existence of the continuously differentiable entropy function given by Equation (69) for

G

establishes the existence of a completely ordered time set having a topological structure involving a closed set homeomorphic to the real line. This fact follows from the inverse function theorem of mathematical analysis and the fact that a continuous strictly monotonic function is a topological mapping (i.e., a homeomorphism), and conversely every topological mapping of a strictly monotonic function’s domain onto its codomain must be strictly monotonic. This topological property gives a clear time-reversal asymmetry characterization of our thermodynamic model establishing an emergence of the direction of time flow.

10. Monotonicity of System Energies in Thermodynamic Processes

Even though Theorem 9.1 gives sufficient conditions under which the subsystem energies in the large-scale dynamical system

G

converge, these subsystem energies may exhibit an oscillatory (hyperbolic) or nonmonotonic behavior prior to convergence. For certain thermodynamical processes, it is desirable to identify system models that guarantee monotonicity of the system energy flows. It is important to note that monotonicity of solutions does not necessarily imply Axiom (ii), nor does Axiom (ii) imply monotonicity of solutions. These are two disjoint notions. In this section, we give necessary and sufficient conditions under which the solutions to Equation (60) are monotonic.

To develop necessary and sufficient conditions for monotonicity of solutions, note that the power balance Equation (60) for the large-scale dynamical system

G

can be written as

\begin{matrix} \dot{x} (t) = [J (x (t)) - D (x (t))] {(\frac{\partial H}{\partial x} (x (t)))}^{T} + G u (t), x (t_{0}) = x_{0}, t \geq t_{0} \end{matrix}

(83)

where

x (t) \in {\bar{R}}_{+}^{q}

,

H (x) = e^{T} x

,

u (t) = {[u_{1} (t), \dots, u_{q} (t)]}^{T}, t \geq t_{0}

,

J (x)

is a skew-symmetric matrix function with

J_{(i, i)} (x) = 0

and

J_{(i, j)} (x) = σ_{i j} (x) - σ_{j i} (x), i \neq j, i, j = 1, \dots, q

,

D (x) = [σ_{11} (x),

\dots, σ_{q q} (x)] \geq 0

, and

G \in R^{q \times q}

is a diagonal input matrix that has been included for generality and contains zeros and ones as its entries. Hence, the power balance equation of the large-scale dynamical system

G

has a port-controlled Hamiltonian structure [50] with a Hamiltonian function

H (x) = e^{T} x = \sum_{i = 1}^{q} x_{i}

representing the sum of all subsystem energies,

D (x)

representing power dissipation in the subsystems,

J (x) = - J^{T} (x)

representing energy-conserving subsystem coupling, and

u (t), t \geq t_{0}

, representing supplied system power. As noted in Section 8, the nonlinear power balance Equation (83) can exhibit a full range of nonlinear behavior, including bifurcations, limit cycles, and even chaos. However, a thermodynamically consistent energy flow model ensures that the evolution of the system energy is diffusive in character with convergent subsystem energies. As shown in Section 8, Axioms (i) and (ii) guarantee a thermodynamically consistent energy flow model.

In order to guarantee a thermodynamically consistent energy flow model, we assume Axiom (ii) holds and seek solutions to Equation (83) that exhibit a monotonic behavior of the subsystem energies. This would physically imply that the energy of a subsystem whose initial energy is greater than the average system energy will decrease, while the energy of a subsystem whose initial energy is less than the average system energy will increase. This of course is consistent with the second law of thermodynamics with the additional constraint of monotonic heat flows. The following definition is needed.

Theorem 10.1

Consider the large-scale dynamical system

G

with power balance Equation (83). The subsystem energies

x (t)

,

t \geq t_{0}

, of

G

are monotonic for all

x_{0} \in D_{c} \subseteq {\bar{R}}_{+}^{q}

, where

D_{c}

is a positively invariant set with respect to Equation (83), if there exists a weighting matrix

R \in R^{q \times q}

such that

R = [r_{1}, \dots, r_{q}]

,

r_{i} = \pm 1

,

i = 1, \dots, q

, and, for every

x_{0} \in D_{c} \subseteq {\bar{R}}_{+}^{q}

,

R x (t_{2}) \leq \leq R x (t_{1})

,

t_{0} \leq t_{1} \leq t_{2}

.

The following result presents necessary and sufficient conditions that guarantee that the subsystem energies of the large-scale dynamical system

G

are monotonic. It is important to note that this result holds whether or not Axiom (ii) holds.

Theorem 10.1

Consider the large-scale dynamical system

G

with power balance Equation (83). Then the following statements hold:

(i): If $u (t) \geq \geq 0$ , $t \geq t_{0}$ , and there exists a matrix $R \in R^{q \times q}$ such that $R = [r_{1}, \dots, r_{q}]$ , $r_{i} = \pm 1$ , $i = 1, \dots, q$ , $R [J (x) - D (x)] {(\frac{\partial H}{\partial x} (x))}^{T} \leq \leq 0$ , $x \in {\bar{R}}_{+}^{q}$ , and $R G \leq \leq 0$ , then the subsystem energies $x (t)$ , $t \geq t_{0}$ , of $G$ are monotonic for all $x_{0} \in {\bar{R}}_{+}^{q}$ .
(ii): Let $u (t) \equiv 0$ and let $D_{c} \subseteq {\bar{R}}_{+}^{q}$ be a positively invariant set with respect to Equation (83). Then the subsystem energies $x (t)$ , $t \geq t_{0}$ , of $G$ are monotonic for all $x_{0} \in D_{c} \subseteq {\bar{R}}_{+}^{q}$ if and only if there exists a matrix $R \in R^{q \times q}$ such that $R = [r_{1}, \dots, r_{q}]$ , $r_{i} = \pm 1$ , $i = 1, \dots, q$ , and $R [J (x) - D (x)] {(\frac{\partial H}{\partial x} (x))}^{T} \leq \leq 0$ , $x \in D_{c} \subseteq {\bar{R}}_{+}^{q}$ .

Proof.

(i) Let

u (t) \geq \geq 0, t \geq t_{0}

, and assume there exists

R = [r_{1}, \dots,

r_{q}]

,

r_{i} = \pm 1

,

i = 1, \dots, q

, such that

R [J (x) - D (x)]

\cdot {(\frac{\partial H}{\partial x} (x))}^{T} \leq \leq 0

,

x \in {\bar{R}}_{+}^{q}

. Now, it follows from Equation (83) that

\begin{matrix} R \dot{x} (t) = R [J (x (t)) - D (x (t))] {(\frac{\partial H}{\partial x} (x (t)))}^{T} + R G u (t), x (t_{0}) = x_{0}, t \geq t_{0} \end{matrix}

(84)

which further implies that

\begin{matrix} R x (t_{2}) = R x (t_{1}) + \int_{t_{1}}^{t_{2}} R [J (x (t)) - D (x (t))] {(\frac{\partial H}{\partial x} (x (t)))}^{T} d t + \int_{t_{1}}^{t_{2}} R G u (t) d t \end{matrix}

(85)

Next, since

[J (x) - D (x)] {(\frac{\partial H}{\partial x} (x))}^{T}

is essentially nonnegative and

u (t) \geq \geq 0, t \geq t_{0}

, it follows from Proposition 4.3 of [51] that

x (t) \geq \geq 0

,

t \geq t_{0}

, for all

x_{0} \in {\bar{R}}_{+}^{q}

. Hence, since

R [J (x) - D (x)] {(\frac{\partial H}{\partial x} (x))}^{T} \leq \leq 0

,

x \in {\bar{R}}_{+}^{q}

, and

R G \leq \leq 0

, it follows that

\begin{matrix} R [J (x (t)) - D (x (t))] {(\frac{\partial H}{\partial x} (x (t)))}^{T} + R G u (t) \leq \leq 0, t \geq t_{0} \end{matrix}

(86)

which implies that, for every

x_{0} \in {\bar{R}}_{+}^{q}

,

R x (t_{2})

\leq \leq R x (t_{1})

,

t_{0} \leq t_{1} \leq t_{2}

.

(ii) To show sufficiency, note that since by assumption

D_{c}

is positively invariant, then

R [J (x (t)) - D (x (t))] {(\frac{\partial H}{\partial x} (x (t)))}^{T} \leq \leq 0, t \geq t_{0}

, for all

x_{0} \in D_{c} \subseteq {\bar{R}}_{+}^{q}

. Now, the result follows by using identical arguments as in (i) with

u (t) \equiv 0

and

x_{0} \in D_{c} \subseteq {\bar{R}}_{+}^{q}

. To show necessity, assume that Equation (83) with

u (t) \equiv 0

is monotonic for all

x_{0} \in D_{c} \subseteq {\bar{R}}_{+}^{q}

. In this case, Equation (84) implies that for every

τ > t_{0}

,

R x (τ) = R x_{0} + \int_{t_{0}}^{τ} R [J (x (t)) - D (x (t))] {(\frac{\partial H}{\partial x} (x (t)))}^{T} d t

(87)

Now, suppose, ad absurdum, there exist

J \in {1, \dots, q}

and

x_{0} \in D_{c} \subseteq {\bar{R}}_{+}^{q}

such that

{[R [J (x_{0}) - D (x_{0})] {(\frac{\partial H}{\partial x} (x_{0}))}^{T}]}_{J} > 0

. Since the mapping

R [J (\cdot) - D (\cdot)] {(\frac{\partial H}{\partial x} (\cdot))}^{T}

and the solution

x (t)

,

t \geq t_{0}

, to Equation (83) are continuous, it follows that there exists

τ > t_{0}

such that

\begin{matrix} {[R [J (x (t)) - D (x (t))] {(\frac{\partial H}{\partial x} (x (t)))}^{T}]}_{J} > 0, t_{0} \leq t \leq τ \end{matrix}

(88)

which implies that

{[R x (τ)]}_{J} > {[R x_{0}]}_{J}

, leading to a contradiction. Hence,

R [J (x) - D (x)] {(\frac{\partial H}{\partial x} (x))}^{T} \leq \leq 0

,

x \in D_{c} \subseteq {\bar{R}}_{+}^{q}

. ☐

It follows from (i) of Theorem 10.1 that if

G = I_{q}

(that is, external power (heat flux) can be injected to all subsystems), then

R = - I_{q}

, and hence,

[J (x) - D (x)] {(\frac{\partial H}{\partial x} (x))}^{T} \geq \geq 0, x \in {\bar{R}}_{+}^{q}

. This case would correspond to a power balance equation whose states are all increasing and can only be achieved if

D (x) = 0, x \in {\bar{R}}_{+}^{q}

. This, of course, implies that the dynamical system

G

cannot dissipate energy, and hence, the transfer of energy (heat) from a lower energy (temperature) level (source) to a higher energy (temperature) level (sink) requires the input of additional heat or energy. This is consistent with Clausius’ statement of the second law of thermodynamics.

The following result is a direct consequence of Theorem 10.1 and provides sufficient conditions for convergence of the subsystem energies of the isolated large-scale dynamical system

G

. Once again, this result holds whether or not Axiom (ii) holds.

Theorem 10.2

Consider the large-scale dynamical system

G

with power balance Equation (83) and

u (t) \equiv 0

. Let

D_{c} \subseteq {\bar{R}}_{+}^{q}

be a positively invariant set. If there exists a matrix

R \in R^{q \times q}

such that

R = [r_{1}, \dots, r_{q}]

,

r_{i} = \pm 1

,

i = 1, \dots, q

, and

R [J (x) - D (x)] {(\frac{\partial H}{\partial x} (x))}^{T}

\leq \leq 0

,

x \in D_{c} \subseteq {\bar{R}}_{+}^{q}

, then, for every

x_{0} \in D_{c} \subseteq {\bar{R}}_{+}^{q}

,

{lim}_{t \to \infty} x (t)

exists.

Proof.

Since

H (x) = e^{T} x

,

x \in {\bar{R}}_{+}^{q}

, it follows that

\begin{matrix} \dot{H} (x) = \frac{\partial H}{\partial x} \dot{x} = \frac{\partial H}{\partial x} [J (x) - D (x)] {(\frac{\partial H}{\partial x})}^{T} = - \frac{\partial H}{\partial x} D (x) {(\frac{\partial H}{\partial x})}^{T} \leq 0, x \in_{+}^{q} \end{matrix}

(89)

and hence,

\dot{H} (x (t)) \leq 0

,

t \geq t_{0}

, where

x (t)

,

t \geq t_{0}

, denotes the solution of Equation (83). This implies that

H (x (t)) \leq H (x_{0}) = e^{T} x_{0}

,

t \geq t_{0}

, and hence, for every

x_{0} \in {\bar{R}}_{+}^{q}

, the solution

x (t)

,

t \geq t_{0}

, of Equation (83) is bounded. Hence, for every

i \in {1, \dots, q}

,

x_{i} (t)

,

t \geq t_{0}

, is bounded. Furthermore, it follows from Theorem 10.1 that

x_{i} (t)

,

t \geq t_{0}

, is monotonic for all

x_{0} \in D_{c} \subseteq {\bar{R}}_{+}^{q}

. Now, since

x_{i} (\cdot)

,

i \in {1, \dots, q}

, is continuous and every bounded nonincreasing or nondecreasing scalar sequence converges to a finite real number, it follows from the monotone convergence theorem ([40], p. 37) that

{lim}_{t \to \infty} x_{i} (t)

,

i \in {1, \dots, q}

, exists. Hence,

{lim}_{t \to \infty} x (t)

exists for all

x_{0} \in D_{c} \subseteq {\bar{R}}_{+}^{q}

. ☐

11. Finite-Time Thermodynamics

As discussed in the Introduction, thermodynamic systems achieve energy and temperature equipartition in finite time rather than merely asymptotically. In this section, we use the results of Section 5 and Section 6 to develop continuous non-Lipschitzian intercompartmental flow laws that guarantee finite-time semistability and energy equipartition for the thermodynamically consistent dynamical system model developed in Section 7. Specifically, consider the dynamical system

G

given by

\begin{matrix} {\dot{x}}_{i} (t) & = & \sum_{j = 1, j \neq i}^{q} ϕ_{i j} (x_{i} (t), x_{j} (t)), x_{i} (t_{0}) = x_{i 0}, t \geq t_{0}, i = 1, \dots, q \end{matrix}

(90)

where

ϕ_{i j} (x)

,

x \in {\bar{R}}_{+}^{q}

, denotes the net energy flow from the jth compartment to the ith compartment defined in Section 7. In vector form, Equation (90) becomes

\begin{matrix} \dot{x} (t) & = & f (x (t)), x (t_{0}) = x_{0}, t \geq t_{0} \end{matrix}

(91)

where

x (t) ≜ {[x_{1} (t), \dots, x_{q} (t)]}^{T} \in {\bar{R}}_{+}^{q}

,

t \geq t_{0}

, and

f = {[f_{1}, \dots, f_{q}]}^{T} : {\bar{R}}_{+}^{q} \to R^{q}

is such that

f_{i} (x) = \sum_{j = 1, j \neq i}^{q} ϕ_{i j} (x_{i}, x_{j})

.

Theorem 11.1

Consider the dynamical system given by Equation (91) and assume that Axioms (i) and (ii) hold. Furthermore, assume that

ϕ_{i j} (x_{i}, x_{j}) = - ϕ_{j i} (x_{j}, x_{i})

for all

i, j = 1, \dots, q

,

i \neq j

. Then for every

α \in {\bar{R}}_{+}

,

α e

is a semistable equilibrium state of Equation (91). Furthermore,

x (t) \to \frac{1}{q} {ee}^{T} x (t_{0})

as

t \to \infty

and

\frac{1}{q} {ee}^{T} x (t_{0})

is a semistable equilibrium state.

Proof.

The result is a direct consequence of Proposition 9.1 and Theorem 9.1. ☐

Theorem 11.1 implies that the steady-state values of the state in each compartment

G_{i}

of the dynamical system

G

are equal, that is, the steady-state value of the dynamical system

G

given by

\begin{matrix} x_{\infty} = \frac{1}{q} e e^{T} x (t_{0}) = [\frac{1}{q} \sum_{i = 1}^{q} x_{i} (t_{0})] e \end{matrix}

is uniformly distributed over all compartments of

G

.

Next, we use the results of Section 6 to develop a compartmental model for finite-time thermodynamics. Specifically, consider the dynamical system given by

\begin{matrix} {\dot{x}}_{i} (t) = \sum_{j = 1, j \neq i}^{q} ϕ_{i j} (x_{i} (t), x_{j} (t)), x_{i} (0) = x_{i 0}, t \geq 0 \end{matrix}

(92)

where for each

i \in {1, \dots, q}

,

x_{i} (t) \in {\bar{R}}_{+}

denotes an energy state for all

t \geq 0

,

ϕ_{i j} (\cdot, \cdot)

satisfies Axioms (i) and (ii), and

ϕ_{i j} (x_{i}, x_{j}) = - ϕ_{j i} (x_{j}, x_{i})

for all

i, j = 1, \dots, q

,

i \neq j

. Furthermore, we assume

ϕ_{i j} (\cdot, \cdot)

for all

i, j = 1, \dots, q, i \neq j

, are continuous and not Lipschitz continuous.

Theorem 11.2

Consider the dynamical system

G

given by Equation (92). Assume that Axioms (i) and (ii) hold, and

ϕ_{i j} (x_{i}, x_{j}) = - ϕ_{j i} (x_{j}, x_{i})

for all

i, j = 1, \dots, q

,

i \neq j

. Furthermore, assume that the vector field f of the dynamical system given by Equation (92) is homogeneous of degree

k \in R

with respect to [52]

ν (x) = - \sum_{i = 1}^{q} [\sum_{j = 1, j \neq i}^{q} μ_{i j} (x_{i}, x_{j})] \frac{\partial}{\partial x_{i}}

, where

x ≜ {[x_{1}, \dots, x_{q}]}^{T} \in {\bar{R}}_{+}^{q}

and

μ_{i j} (\cdot, \cdot)

satisfies Axiom (ii),

μ_{i j} (x_{i}, x_{j}) = - μ_{j i} (x_{j}, x_{i})

, and

μ_{i j} (x_{i}, x_{j}) = 0

if and only if

x_{i} = x_{j}

for all

i, j = 1, \dots, q

,

i \neq j

. Then, for every

x_{e} \in {\bar{R}}_{+}

,

x_{e} e

is a finite-time semistable equilibrium state of

G

if and only if

k < 0

. Furthermore, if

k < 0

, then

x (t) = \frac{1}{q} {ee}^{T} x (0)

for all

t \geq T (x (0))

and

\frac{1}{q} {ee}^{T} x (0)

is a finite-time semistable equilibrium state, where

T (x (0)) \geq 0

.

Proof.

Suppose

k < 0

. It follows from Theorem 11.1 that

x_{e} e \in {\bar{R}}_{+}^{q}

,

x_{e} \in {\bar{R}}_{+}

, is a semistable equilibrium state of the homogeneous system given by Equation (92). Furthermore,

x (t) \to \frac{1}{q} {ee}^{T} x (0)

as

t \to \infty

and

\frac{1}{q} {ee}^{T} x (0)

is a semistable equilibrium state. Next, it can be shown using similar arguments as in the proof of Theorem 11.1 that Equation (47) is globally semistable with

ν (x) = - \sum_{i = 1}^{q} [\sum_{j = 1, j \neq i}^{q} μ_{i j} (x_{i}, x_{j})] \frac{\partial}{\partial x_{i}}

. Now, it follows from Theorem 6.2 that

x_{e} e

is a finite-time semistable equilibrium state by noting that the vector field

\sum_{j = 1, j \neq i}^{q} ϕ_{i j} (x_{i}, x_{j})

is homogeneous of degree

k < 0

with respect to the semi-Euler vector field

ν (x) = - \sum_{i = 1}^{q} [\sum_{j = 1, j \neq i}^{q} μ_{i j} (x_{i}, x_{j})] \frac{\partial}{\partial x_{i}}

. Hence, with

x_{e} = \frac{1}{q} e^{T} x (0)

,

x_{e} e = \frac{1}{q} {ee}^{T} x (0)

is a finite-time semistable equilibrium state. The converse follows as a direct consequence of Theorem 6.2. ☐

The following corollary to Theorem 11.2 gives a concrete form for the energy flow function

ϕ_{i j} (x_{i}, x_{j})

,

i, j = 1, \dots, q

,

i \neq j

.

Corollary 11.1

Consider the dynamical system

G

given by Equation (92) with energy flow function

\begin{matrix} ϕ_{i j} (x_{i}, x_{j}) = C_{(i, j)} sgn (x_{j} - x_{i}) {| x_{j} - x_{i} |}^{α} \end{matrix}

(93)

where

α > 0

and

C_{(i, j)}

is as in Equation (63) with

C = C^{T}

. Assume that Axioms (i) and (ii) hold. Then for every

x_{e} \in {\bar{R}}_{+}

,

x_{e} e

is a finite-time semistable equilibrium state of

G

if and only if

α < 1

. Furthermore, if

α < 1

, then

x (t) = \frac{1}{q} {ee}^{T} x (0)

for all

t \geq T (x (0))

and

\frac{1}{q} {ee}^{T} x (0)

is a finite-time semistable equilibrium state, where

T (x (0)) \geq 0

.

Proof.

First, note that the vector field f of

G

is essentially nonnegative. Next, the Lie bracket of

ν (x) = - \sum_{i = 1}^{q} [\sum_{j = 1, j \neq i}^{q} (x_{j} - x_{i})] \frac{\partial}{\partial x_{i}}

and the vector field f of the dynamical system given by Equation (92) with

ϕ_{i j} (x_{i}, x_{j})

given by Equation (93) is given by

[ν, f] = {[\sum_{i = 1}^{q} \frac{\partial f_{1}}{\partial x_{i}} ν_{i} - \frac{\partial ν_{1}}{\partial x_{i}} f_{i}, \dots, \sum_{i = 1}^{q} \frac{\partial f_{q}}{\partial x_{i}} ν_{i} - \frac{\partial ν_{q}}{\partial x_{i}} f_{i}]}^{T}

. Since for each

i, j = 1, \dots, q

,

\begin{matrix} \frac{\partial f_{j}}{\partial x_{i}} ν_{i} - \frac{\partial ν_{j}}{\partial x_{i}} f_{i} = \{\begin{matrix} \begin{matrix} C_{(j, i)} α | x_{i} & - x_{j} |^{α - 1} [\sum_{s = 1, s \neq i}^{q} (x_{i} - x_{s})] \\ + \sum_{k = 1, k \neq i}^{q} C_{(i, k)} sgn (x_{k} - x_{i}) {| x_{k} - x_{i} |}^{α}, \end{matrix} & i \neq j \\ \begin{matrix} [\sum_{k = 1, k \neq j}^{q} C_{(j, k)} α {| x_{k} - x_{j} |}^{α - 1}] [\sum_{s = 1, s \neq j}^{q} (x_{s} - x_{j})] \\ - (q - 1) \sum_{k = 1, k \neq j}^{q} C_{(j, k)} sgn (x_{k} - x_{j}) {| x_{k} - x_{j} |}^{α}, \end{matrix} & i = j \end{matrix} \end{matrix}

(94)

and noting that

C_{(i, j)} = C_{(j, i)}

,

i, j = 1, \dots, q

,

i \neq j

, it follows that for each

j = 1, \dots, q

,

\begin{matrix} \sum_{i = 1}^{q} \frac{\partial f_{j}}{\partial x_{i}} ν_{i} - \frac{\partial ν_{j}}{\partial x_{i}} f_{i} \\ = & \frac{\partial f_{j}}{\partial x_{j}} ν_{j} - \frac{\partial ν_{j}}{\partial x_{j}} f_{j} + \sum_{i = 1, i \neq j}^{q} \frac{\partial f_{j}}{\partial x_{i}} ν_{i} - \frac{\partial ν_{j}}{\partial x_{i}} f_{i} \\ = & [\sum_{k = 1, k \neq j}^{q} C_{(j, k)} α {| x_{k} - x_{j} |}^{α - 1}] [\sum_{s = 1, s \neq j}^{q} (x_{s} - x_{j})] - (q - 1) \sum_{k = 1, k \neq j}^{q} C_{(j, k)} sgn (x_{k} - x_{j}) {| x_{k} - x_{j} |}^{α} \\ + \sum_{i = 1, i \neq j}^{q} C_{(j, i)} α | x_{i} - x_{j} |^{α - 1} [\sum_{s = 1, s \neq i}^{q} (x_{i} - x_{s})] + \sum_{i = 1, i \neq j}^{q} \sum_{k = 1, k \neq i}^{q} C_{(i, k)} sgn (x_{k} - x_{i}) {| x_{k} - x_{i} |}^{α} \\ = & α \sum_{k = 1, k \neq j}^{q} C_{(j, k)} sgn (x_{k} - x_{j}) | x_{k} - x_{j} |^{α} + \sum_{k = 1, k \neq j}^{q} \sum_{s = 1, s \neq j, k}^{q} C_{(j, k)} α {| x_{k} - x_{j} |}^{α - 1} (x_{s} - x_{j}) \\ - (q - 1) \sum_{k = 1, k \neq j}^{q} C_{(j, k)} sgn (x_{k} - x_{j}) | x_{k} - x_{j} |^{α} + α \sum_{i = 1, i \neq j}^{q} C_{(j, i)} sgn (x_{i} - x_{j}) {| x_{i} - x_{j} |}^{α} \end{matrix}

\begin{matrix} + \sum_{i = 1, i \neq j}^{q} \sum_{s = 1, s \neq i, j}^{q} C_{(j, i)} α | x_{i} - x_{j} |^{α - 1} (x_{i} - x_{s}) + \sum_{i = 1}^{q} \sum_{k = 1, k \neq i}^{q} C_{(i, k)} sgn (x_{k} - x_{i}) {| x_{k} - x_{i} |}^{α} \\ - \sum_{k = 1, k \neq j}^{q} C_{(j, k)} sgn (x_{k} - x_{j}) {| x_{k} - x_{j} |}^{α} \\ = & 2 α \sum_{i = 1, i \neq j}^{q} C_{(j, i)} sgn (x_{i} - x_{j}) | x_{i} - x_{j} |^{α} + α \sum_{i = 1, i \neq j}^{q} \sum_{s = 1, s \neq i, j}^{q} C_{(j, i)} sgn (x_{i} - x_{j}) {| x_{i} - x_{j} |}^{α} \\ - q \sum_{k = 1, k \neq j}^{q} C_{(j, k)} sgn (x_{k} - x_{j}) {| x_{k} - x_{j} |}^{α} \\ = & q (α - 1) \sum_{i = 1, i \neq j}^{q} C_{(j, i)} sgn (x_{i} - x_{j}) {| x_{i} - x_{j} |}^{α} \\ = & q (α - 1) f_{j} \end{matrix}

which implies that the vector field f is homogeneous of degree

k = q (α - 1)

with respect to the semi-Euler vector field

\begin{matrix} ν (x) = - \sum_{i = 1}^{q} [\sum_{j = 1, j \neq i}^{q} (x_{j} - x_{i})] \frac{\partial}{\partial x_{i}} \end{matrix}

Now, the result is a direct consequence of Theorem 11.2. ☐

12. Conclusions

In contrast to mechanics, which is based on a dynamical system theory, (classical) thermodynamics is a physical theory concerned with systems in equilibrium and does not possess equations of motion, leaving these two classical disciplines of physics to stand in sharp contrast to one another in the one and the half centuries of their coexistence. This has made any connections between the thermodynamic arrow of time and the mechanistic course of time over the centuries translucent at best. Over the past several decades, numerous subjective papers plagued with philosophical arguments and void of any rigorous mathematics have unsuccessfully attempted to establish such connections. In order to make clear and rigorous connections between the arrow of time, the course of time, irreversibility, and the second law of thermodynamics, a dynamical systems framework for thermodynamics is needed rather than the classical (thermostatic) theory of thermodynamics.

In this paper, we combined the two universalisms of thermodynamics and dynamical systems theory under a single umbrella, with the second providing the ideal language for the first, to establish rigorous connections between causality, the arrow of time, the course of time, irreversibility, and the second law of thermodynamics. Specifically, we show a state irrecoverability, and hence, a state irreversibility nature of thermodynamics. State irreversibility reflects time-reversal non-invariance, wherein time-reversal is not meant literally; that is, we develop a dynamical system thermodynamic model whose trajectory reversal is or is not allowed and not a reversal of time itself. Next, we show that for every nonequilibrium system state and corresponding system trajectory of our thermodynamically consistent dynamical system, there does not exists a state such that the corresponding system trajectory completely recovers the initial system state of the dynamical system and at the same time restores the energy supplied by the environment back to its original condition. This, along with the existence of a global strictly increasing entropy function on every nontrivial system trajectory, establishes the existence of a completely ordered time set that has a topological structure involving a closed set homeomorphic to the real line, which gives a clear time-reversal asymmetry characterization of thermodynamics and establishes an emergence of the direction of time flow.

Classical thermodynamics as well as the dynamical system approach to thermodynamics presented in this paper are developed for systems that are assumed to be at rest with respect to a local observer and in the absence of strong gravitational fields. To effectively address the universality of thermodynamics and the arrow of time to cosmology, the dynamical system framework of thermodynamics presented in this paper needs to be extended to thermodynamic systems which are moving relative to a local observer moving with the system and a fixed observer with respect to which the system is in motion. In addition, the thermodynamic effects of gravity need to also be considered. In this case, Einstein’s theory of relativity shows that time and space are intricately coupled, and hence, one cannot curve space without involving time as well. This is essentially the time dilation equivalence principle of general relativity, which states that the combined speed of any object’s motion through the space-time continuum is always equal to the speed of light. Given the topological isomorphism between entropy and time established in this paper and Einstein’s time dilation assertion that increasing an object’s speed through space results in decreasing the object’s speed through time, we conjecture that a generalization of the present framework of thermodynamics that includes relativistic effects would lead to an entropy contraction principle wherein the change in entropy of a system would decrease as the system’s speed increases through space. This is the subject of current research.

References and Notes

Haddad, W.M.; Chellaboina, V.; Nersesov, S.G. Thermodynamics: A Dynamical Systems Approach; Princeton University Press: Princeton, NJ, USA, 2005. [Google Scholar]
Carathéodory, C. Untersuchungen über die grundlagen der thermodynamik. Math. Ann. 1909, 67, 355–386. [Google Scholar] [CrossRef]
Carathéodory, C. Über die Bestimmung der Energie und der absoluten Temperatur mit Hilfe von reversiblen Prozessen. In Proceedings of the 1925 Sitzungsberichte der Preuβischen Akademie der Wissenschaften, Math. Phys. Klasse, Berlin, Germany, 1925; pp. 39–47.
Carathéodory’s definition of an adiabatic process is nonstandard and involves transformations that take place while the system remains in an adiabatic container. For details see [2,3].
Bridgman, P. The Nature of Thermodynamics; Harvard University Press: Cambridge, MA, USA, 1941; Reprinted by Peter Smith: Gloucester, MA, USA, 1969. [Google Scholar]
Uffink, J. Bluff your way in the second law of thermodynamics. Stud. Hist. Philos. Mod. Phys. B 2001, 32, 305–394. [Google Scholar] [CrossRef]
Perhaps a better expression here is the geodesic arrow of time, since, as Einstein’s theory of relativity shows, time and space are intricately coupled, and hence one cannot curve space without involving time as well. Thus, time has a shape that goes along with its directionality.
Planck, M. Über die Begrundung des zweiten Hauptsatzes der Thermodynamik. In Proceedings of the 1925 Sitzungsberichte der Preuβischen Akademie der Wissenschaften, Math. Phys. Klasse, Berlin, Germany, 1925; pp. 453–463.
Reichenbach, H. The Direction of Time; University of California Press: Berkeley, CA, USA, 1956. [Google Scholar]
Grünbaum, A. The Anisotropy of Time. In The Nature of Time; Gold, T., Ed.; Cornell University Press: Ithaca, NY, USA, 1967. [Google Scholar]
Earman, J. Irreversibility and temporal asymmetry. J. Philos. 1967, 64, 543–549. [Google Scholar] [CrossRef]
Kroes, P. Time: Its Structure and Role in Physical Theories; Reidel: Dordrecht, The Netherlands, 1985. [Google Scholar]
Horwich, P. Asymmetries in Time; MIT Press: Cambridge, MA, USA, 1987. [Google Scholar]
In statistical thermodynamics the arrow of time is viewed as a consequence of high system dimensionality and randomness. However, since in statistical thermodynamics it is not absolutely certain that entropy increases in every dynamical process, the direction of time, as determined by entropy increase, has only statistical certainty and not an absolute certainty. Hence, it cannot be concluded from statistical thermodynamics that time has a unique direction of flow.
Lamb, J.S.W.; Roberts, J.A.G. Time reversal symmetry in dynamical systems: A survey. Physica D 1998, 112. [Google Scholar] [CrossRef]
Eddington, A. The Nature of the Physical World; Dent & Sons: London, UK, 1935. [Google Scholar]
Prigogine, I. From Being to Becoming; Freeman: San Francisco, CA, USA, 1980. [Google Scholar]
Haddad, W.M.; Chellaboina, V.; Nersesov, S.G. Time-reversal symmetry, Poincaré recurrence, irreversibility, and the entropic arrow of time: From mechanics to system thermodynamics. Nonlinear Anal. R. World Appl. 2008, 9, 250–271. [Google Scholar] [CrossRef]
In the terminology of [6], state irreversibility is referred to as time-reversal noninvariance. However, since the term time reversal is not meant literally (that is, we consider dynamical systems whose trajectory reversal is or is not allowed and not a reversal of time itself), state reversibility is a more appropriate expression.
Bhat, S.P.; Bernstein, D.S. Arc-length-based Lyapunov tests for convergence and stability in systems having a continuum of equilibria. In Proceedings of the 2003 American Control Conference, Denver, CO, USA, 4–6 June 2003; pp. 2961–2966.
Bhat, S.P.; Bernstein, D.S. Nontangency-based Lyapunov tests for convergence and stability in systems having a continuum of equilibra. SIAM J. Control Optim. 2003, 42, 1745–1775. [Google Scholar] [CrossRef]
Bhat, S.P.; Bernstein, D.S. Finite-time stability of continuous autonomous systems. SIAM J. Control Optim. 2000, 38, 751–766. [Google Scholar] [CrossRef]
Hale, J.K. Ordinary Differential Equations, 2nd ed.; Wiley: New York, NY, USA, 1980; Reprinted by Krieger: Malabar, FL, USA, 1991. [Google Scholar]
Liberman, P.; Marle, C.M. Symplectic Geometry and Analytical Mechanics; Reidel: Dordrecht, The Netherlands, 1987. [Google Scholar]
Here we assume that the system Lagrangian is hyperregular [24] so that the map from the generalized velocities $\dot{q}$ to the generalized momenta p is bijective (i.e., one-to-one and onto).
Arnold, V.I. Mathematical Models of Classical Mechanics; Springer-Verlag: New York, NY, USA, 1989. [Google Scholar]
Poincaré, H. Sur le probléme des trois corps et les équations de la dynamique. Acta Math. 1890, 13, 1–270. [Google Scholar]
A Lie group is a topological group that can be given an analytic structure such that the group operation and inversion are analytic. A Lie pseudogroup is an infinite-dimensional counterpart of a Lie group.
Apostol, T.M. Mathematical Analysis; Addison-Wesley: Reading, MA, USA, 1974. [Google Scholar]
We say that $V$ is dense in $N$ if and only if $N$ is contained in the closure of $V$ ; that is, $V$ ⊆ $N$ is dense in $N$ if and only if $N$ ⊆ $\bar{V}$ .
A key distinction between thermodynamics and mechanics is that thermodynamics is a theory of open systems, whereas mechanics is a theory of closed systems. The notions, however, of open and closed systems are different in thermodynamics and dynamical system theory. In particular, thermodynamic systems exchange matter and energy with the environment, and hence, interact with the environment. Such systems are called open systems in the thermodynamic literature. Systems that exchange heat (energy) but not matter with the environment are called closed, whereas systems that do not exchange energy and matter with the environment are called isolated. Alternatively, in mechanics it is always possible to include interactions with the environment (via feedback interconnecting components) within the system description to obtain an augmented closed system in the sense of dynamical system theory. That is, the system can be described by an evolution law with, possibly, an output equation wherein past trajectories define the future trajectory uniquely and the system output depends on the instantaneous (present) value of the system state.
Bhat, S.P.; Bernstein, D.S. Lyapunov analysis of semistability. In Proceedings of the 1999 American Control Conference, San Diego, CA, USA, 2–4 June 1999; pp. 1608–1612.
Agarwal, R.P.; Lakshmikantham, V. Uniqueness and Nonuniqueness Criteria for Ordinary Differential Equations; World Scientific: Singapore, 1993. [Google Scholar]
Yoshizawa, T. Stability Theory by Liapunov’s Second Method; Math. Soc. Japan: Tokyo, Japan, 1966. [Google Scholar]
Coddington, E.A.; Levinson, N. Theory of Ordinary Differential Equations; McGraw-Hill: New York, NY, USA, 1955. [Google Scholar]
Bhat, S.P.; Bernstein, D.S. Geometric homogeneity with applications to finite-time stability. Math. Control Signals Syst. 2005, 17, 101–127. [Google Scholar] [CrossRef]
Rosier, L. Homogeneous Lyapunov function for homogeneous continuous vector field. Syst. Control Lett. 1992, 19, 467–473. [Google Scholar] [CrossRef]
In a geometric, coordinate-free setting, the only link between homogeneity of functions and vector fields is that the Lie derivative of a homogeneous function along a homogeneous vector field is also a homogeneous function. In the special case where the coordinate functions are homogeneous functions, the fact mentioned above can be used to relate the homogeneity of a vector field with that of the components (considered as functions) of its coordinate representation. Such a relation is very familiar in the case of conventional dilations seen in the homogeneity literature [37].
The domain of semistability (with respect to ${\bar{R}}_{+}^{n}$ ) is the set of points x₀ ∈ ${\bar{R}}_{+}^{n}$ such that if x(t), t ≥ 0, is a solution to Equation (30) with x(0) = x₀, then x(t) converges to a Lyapunov stable (with respect to ${\bar{R}}_{+}^{n}$ ) equilibrium point in ${\bar{R}}_{+}^{n}$ .
Haddad, W.M.; Chellaboina, V. Nonlinear Dynamical Systems and Control: A Lyapunov-Based Approach; Princeton University Press: Princeton, NJ, USA, 2008. [Google Scholar]
Haddad, W.M.; Nersesov, S.G.; Chellaboina, V. Heat flow, work energy, chemical reactions, and thermodynamics: A dynamical systems perspective. In Thermodynamics; Mizutani, T., Ed.; InTech: Lexington, KY, USA, 2011; pp. 255–322. [Google Scholar]
It can be argued here that a more appropriate terminology is assumptions rather than axioms since, as will be seen, these are statements taken to be true and used as premises in order to infer certain results, but may not otherwise be accepted. However, as we will see, these statements are equivalent (within our formulation) to the stipulated postulates of the zeroth and second laws of thermodynamics involving transitivity of a thermal equilibrium and heat flowing from hotter to colder bodies, and as such we refer to them as axioms.
Berman, A.; Plemmons, R.J. Nonnegative Matrices in the Mathematical Sciences; Academic Press: New York, NY, USA, 1979. [Google Scholar]
It is important to note that our formulation of the second law of thermodynamics as given by Axiom ii) does not require the mentioning of temperature nor the more primitive subjective notions of hotness or coldness. As we will see later, temperature is defined in terms of the system entropy after we establish the existence of a unique, continuously differentiable entropy function for $G$ .
Since in our formulation we are not considering work performed by and on the system, the notions of an isolated system and an adiabatically isolated system are equivalent.
Meixner, J. On the foundation of thermodynamics of processes. In A Critical Review of Thermodynamics; Stuart, E.B., Gal-Or, B., Brainard, A.J., Eds.; Mono Book Corp.: Baltimore, MD, USA, 1970; pp. 37–47. [Google Scholar]
Lavenda, B. Thermodynamics of Irreversible Processes; Macmillan: London, UK, 1978; Reprinted by Dover: New York, NY, USA, 1993. [Google Scholar]
Diestel, R. Graph Theory; Springer-Verlag: New York, NY, USA, 1997. [Google Scholar]
Godsil, C.; Royle, G. Algebraic Graph Theory; Springer-Verlag: New York, NY, USA, 2001. [Google Scholar]
Lozano, R.; Brogliato, B.; Egeland, O.; Maschke, B. Dissipative Systems Analysis and Control; Springer-Verlag: London, UK, 2000. [Google Scholar]
Haddad, W.M.; Chellaboina, V.; Hui, Q. Nonnegative and Compartmental Dynamical Systems; Princeton University Press: Princeton, NJ, USA, 2010. [Google Scholar]
The differential operator notation in ν(x) is standard differential geometric notation used to write coordinate expressions for vector fields. This notation is based on the fact that there is a one-to-one correspondence between first-order linear differential operators on real-valued functions and vector fields.

© 2012 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/.)

Temporal Asymmetry, Entropic Irreversibility, and Finite-Time Thermodynamics: From Parmenides–Einstein Time-Reversal Symmetry to the Heraclitan Entropic Arrow of Time

Abstract

1. Introduction

2. Dynamical System Model

3. Reversibility, Irreversibility, Recoverability and Irrecoverability

4. Reversible Dynamical Systems, Volume-Preserving Flows and Poincaré Recurrence

5. Finite-Time Semistability of Nonlinear Dynamical Systems

6. Homogeneity and Finite-Time Semistability

7. A State Space Formalism for Thermodynamics

8. Entropy and Irreversibility

9. Semistability and the Entropic Arrow of Time

10. Monotonicity of System Energies in Thermodynamic Processes

11. Finite-Time Thermodynamics

12. Conclusions

References and Notes

Article Metrics

Citations

Article Access Statistics