Article

Stochastic Thermodynamics: A Dynamical Systems Approach

School of Aerospace Engineering, Georgia Institute of Technology, Atlanta, GA 30332-0150, USA
* Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Entropy 2017, 19(12), 693; https://doi.org/10.3390/e19120693
Submission received: 16 October 2017 / Revised: 13 December 2017 / Accepted: 13 December 2017 / Published: 17 December 2017
(This article belongs to the Special Issue Entropy and Its Applications across Disciplines)

Abstract:
In this paper, we develop an energy-based, large-scale dynamical system model driven by Markov diffusion processes to present a unified framework for statistical thermodynamics predicated on a stochastic dynamical systems formalism. Specifically, using a stochastic state space formulation, we develop a nonlinear stochastic compartmental dynamical system model characterized by energy conservation laws that is consistent with statistical thermodynamic principles. In particular, we show that the difference between the average supplied system energy and the average stored system energy for our stochastic thermodynamic model is a martingale with respect to the system filtration. In addition, we show that the average stored system energy is equal to the mean energy that can be extracted from the system and the mean energy that can be delivered to the system in order to transfer it from a zero energy level to an arbitrary nonempty subset in the state space over a finite stopping time.

1. Introduction

In an attempt to generalize classical thermodynamics to irreversible nonequilibrium thermodynamics, a relatively new framework has been developed that combines stochasticity and nonequilibrium dynamics. This framework is known as stochastic thermodynamics [1,2,3,4,5] and goes beyond linear irreversible thermodynamics addressing transport properties and entropy production in terms of forces and fluxes via linear system response theory [6,7,8,9]. Stochastic thermodynamics is applicable to nonequilibrium systems extending the validity of the laws of thermodynamics beyond the linear response regime by providing a system thermodynamic paradigm formulated on the level of individual system state realizations that are arbitrarily far from equilibrium. The thermodynamic variables of heat, work, and entropy, along with the concomitant first and second laws of thermodynamics, are formulated on the level of individual dynamical system trajectories using stochastic differential equations.
The nonequilibrium conditions in stochastic thermodynamics are imposed by an exogenous stochastic disturbance or an initial system state that is far from the system equilibrium resulting in an open (i.e., driven) or relaxation dynamical process. More specifically, the exogenous disturbance is modeled as an independent standard Wiener process (i.e., Brownian motion) defined on a complete filtered probability space wherein the current state is only dependent on the most recent event. The stochastic system dynamics are described by an overdamped Langevin equation [2,3] in which fluctuation and dissipation forces obey the Einstein relation expressing that diffusion is a result of both thermal fluctuations and frictional dissipation [10].
Brownian motion refers to the irregular movement of microscopic particles suspended in a liquid and was discovered [11,12] by the botanist Robert Brown [13]. This random motion is explained as the result of collisions between the suspended particles (i.e., Brownian particles) and the molecules of the liquid. Einstein was the first to formulate the theory of Brownian motion by assuming that the particles suspended in the liquid partake in the thermal fluctuations of the medium and, in accordance with the principle of equipartition of energy [14], acquire the same average translational kinetic energy as the molecules of the medium [10]. Thus, Brownian motion results from collisions with the molecules of the fluid, wherein the suspended particles acquire the same average kinetic energy as the molecules of the fluid. This theory suggested that all matter consists of atoms (or molecules) and that heat is the energy of motion (i.e., kinetic energy) of the atoms.
The use of statistical methods in developing a general molecular theory of heat predicated on random motions of Newtonian atoms led to the connection between the dynamics of heat flow and the behavior of electromagnetic radiation. A year after Einstein published his theory of Brownian motion, Smoluchowski [15] confirmed the relation between friction and diffusion. In an attempt to simplify Einstein’s theory of Brownian motion, Langevin [16] was the first to model the effect of Brownian motion using a stochastic differential equation (now known as a Langevin equation) wherein spherical particles are suspended in a medium and acted upon by external forces.
In stochastic thermodynamics, the Langevin equation captures the coupling between the system particle damping and the energy input to the particles via thermal effects. Namely, the frictional forces extract the particle kinetic energy, which in turn is injected back to the particles in the form of thermal fluctuations. This captures the phenomenological behavior of a Brownian particle suspended in a fluid medium which can be modeled as a continuous Markov process [17]. Specifically, since collisions between the fluid molecules and a Brownian particle are more inelastic at higher viscosities, and temperature decreases with increasing viscosity in a fluid, additional heat is transferred to the fluid to maintain its temperature in accordance with the equipartition theorem. This heat is transferred to the Brownian particle through an increased disturbance intensity by the fluid molecules. These collisions between the Brownian particle and fluid molecules result in the observed persistent irregular and random motion of the particles.
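To make the preceding discussion concrete, the following minimal sketch (not part of the original development; all parameter values are illustrative assumptions) integrates an overdamped Langevin equation dx = −(k/γ) x dt + √(2k_BT/γ) dw, with diffusion coefficient given by the Einstein relation D = k_BT/γ, using the Euler–Maruyama scheme, and checks that the stationary variance approaches the equipartition value k_BT/k.

```python
import numpy as np

# Overdamped Langevin dynamics in a quadratic potential U(x) = 0.5*k*x^2:
#   dx = -(k/gamma) x dt + sqrt(2*kB*T/gamma) dw   (Einstein relation D = kB*T/gamma)
# All parameter values below are illustrative assumptions.
rng = np.random.default_rng(0)
k, gamma, kB_T = 1.0, 1.0, 0.5          # stiffness, friction, thermal energy
D = kB_T / gamma                        # diffusion coefficient via Einstein relation
dt, n_steps, n_paths = 1e-3, 20_000, 2_000

x = np.full(n_paths, 2.0)               # all paths start far from equilibrium
for _ in range(n_steps):
    dw = np.sqrt(dt) * rng.standard_normal(n_paths)
    x += -(k / gamma) * x * dt + np.sqrt(2.0 * D) * dw

# At stationarity the variance should approach kB*T/k (equipartition).
print("sample variance:", x.var(), " equipartition value:", kB_T / k)
```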
The balance between damping (i.e., deceleration) of the particles due to frictional effects resulting in local heating of the fluid, and consequently entropy production, and the energy injection of the particles due to thermal fluctuations resulting in local cooling of the fluid, and consequently entropy consumption, is quantified by fluctuation theorems [18,19,20,21,22,23,24,25,26,27,28]. Thus, even though the entropy production is positive on average, there exist sample paths along which the entropy decreases, albeit with an exponentially smaller probability than that of entropy production. In other words, a stochastic thermodynamic system exhibits a symmetry in the probability distribution of the entropy production in the asymptotic nonequilibrium process.
Fluctuation theorems give a precise prediction for the cases in which entropy decreases in stochastic thermodynamic systems and provide a key relation between entropy production and irreversibility. Specifically, the entropy production of individual sample path trajectories of a stochastic thermodynamic system described by a Markov process is not restricted by the second law, but rather the average entropy production is determined to be positive. Furthermore, the notions of heat and work in stochastic thermodynamic systems allow for a formulation of the first law of thermodynamics on the level of individual sample path trajectories with microscopic states (i.e., positions and velocities) governed by a stochastic Langevin equation and macroscopic states governed by a Fokker–Planck equation [29] (or a Kolmogorov forward equation, depending on context) describing the evolution of the probability density function of the microscopic (stochastic) states.
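The microscopic/macroscopic correspondence described above can be illustrated numerically: the empirical distribution of many independent Langevin sample paths should approach the stationary solution of the associated Fokker–Planck equation. The sketch below is a hedged illustration with an assumed linear drift, diffusion level, and discretization, not the model developed in this paper.

```python
import numpy as np

# Microscopic (Langevin) sampling vs. macroscopic (Fokker-Planck) stationary density
# for dx = -x dt + sqrt(2*D) dw; the stationary Fokker-Planck solution is
# p(x) = exp(-x^2/(2*D)) / sqrt(2*pi*D).  Parameters are illustrative.
rng = np.random.default_rng(1)
D, dt, n_steps, n_paths = 0.5, 1e-3, 20_000, 5_000

x = rng.standard_normal(n_paths)
for _ in range(n_steps):
    x += -x * dt + np.sqrt(2.0 * D * dt) * rng.standard_normal(n_paths)

hist, edges = np.histogram(x, bins=40, density=True)
centers = 0.5 * (edges[:-1] + edges[1:])
p_stat = np.exp(-centers**2 / (2.0 * D)) / np.sqrt(2.0 * np.pi * D)
print("max |histogram - Fokker-Planck density|:", np.max(np.abs(hist - p_stat)))
```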
In this paper, we combine our large-scale thermodynamic system model developed in [30] with stochastic thermodynamics to develop a stochastic dynamical systems framework of thermodynamics. Specifically, we develop a large-scale dynamical system model driven by Markov diffusion processes to present a unified framework for statistical thermodynamics predicated on a stochastic dynamical systems formalism. In particular, using a stochastic state space formulation, we develop a nonlinear stochastic compartmental dynamical system model characterized by energy conservation laws that is consistent with statistical thermodynamic principles. Moreover, we show that the difference between the average supplied system energy and the average stored system energy for our stochastic thermodynamic model is a martingale with respect to the system filtration. In addition, we show that the average stored system energy is equal to the mean energy that can be extracted from the system and the mean energy that can be delivered to the system in order to transfer it from a zero energy level to an arbitrary nonempty subset in the state space over a finite stopping time.
Finally, using the system ectropy [30] as a Lyapunov function candidate, we show that in the absence of energy exchange with the environment the proposed stochastic thermodynamic model is stochastically semistable in the sense that all sample path trajectories converge almost surely to a set of equilibrium solutions, wherein every equilibrium solution in the set is almost surely Lyapunov stable. In addition, we show that the steady-state distribution of the large-scale sample path system energies is uniform, leading to system energy equipartitioning corresponding to a maximum entropy equilibrium state.

2. Stochastic Dynamical Systems

To extend the dynamical thermodynamic formulation of [30] to stochastic thermodynamics we need to establish some notation, definitions, and mathematical preliminaries. A review of some basic results on nonlinear stochastic dynamical systems is given in [31,32,33,34,35]. Recall that given a sample space Ω, a σ-algebra F on Ω is a collection of subsets of Ω such that ∅ ∈ F; if F ∈ F, then Ω \ F ∈ F; and if F₁, F₂, … ∈ F, then ∪_{i=1}^∞ Fᵢ ∈ F and ∩_{i=1}^∞ Fᵢ ∈ F. The pair (Ω, F) is called a measurable space, and a probability measure P defined on (Ω, F) is a function P : F → [0, 1] such that P(∅) = 0, P(Ω) = 1, and if F₁, F₂, … ∈ F with Fᵢ ∩ Fⱼ = ∅, i ≠ j, then P(∪_{i=1}^∞ Fᵢ) = Σ_{i=1}^∞ P(Fᵢ). The triple (Ω, F, P) is called a probability space if F contains all subsets of Ω with P-outer measure [36] zero [33].
The subsets F of Ω belonging to F are called F-measurable sets. If Ω = Rⁿ, then the σ-algebra Bⁿ generated by the family of all open sets in Rⁿ is called the Borel σ-algebra, and the elements B of Bⁿ are called Borel sets. If (Ω, F, P) is a given probability space, then the random variable x : Ω → Rⁿ is F-measurable if {ω ∈ Ω : x(ω) ∈ B} ∈ F for all Borel sets B ⊆ Rⁿ. Given the probability space (Ω, F, P), a filtration is a family {F_t}_{t≥0} of σ-algebras F_t ⊆ F such that F_t ⊆ F_s for all 0 ≤ t < s < ∞.
In this paper, we use the notation and terminology as established in [37]. Specifically, define a complete probability space as ( Ω , F , P ) , where Ω denotes the sample space, F denotes a σ -algebra, and P defines a probability measure on the σ -algebra F ; that is, P is a nonnegative countably additive set function on F such that P ( Ω ) = 1 [32]. Furthermore, we assume that w ( · ) is a standard d-dimensional Wiener process defined by ( w ( · ) , Ω , F , P w 0 ) , where P w 0 is the classical Wiener measure ([33], p. 10), with a continuous-time filtration { F t } t 0 generated by the Wiener process w ( t ) up to time t.
We denote a stochastic dynamical system by G generating a filtration { F t } t 0 adapted to the stochastic process x : R ¯ + × Ω D on ( Ω , F , P x 0 ) satisfying F τ F t , 0 τ < t , such that { ω Ω : x ( t , ω ) B } F t , t 0 , for all Borel sets B R n contained in the Borel σ -algebra B n . We say that the stochastic process x : R ¯ + × Ω D is F t -adapted if x ( t ) is F t -measurable for every t 0 . Furthermore, we say that G satisfies the Markov property if the conditional probability distribution of the future states of the stochastic process generated by G only depends on the present state. In this case, G generates a Markov process which results in a decoupling of the past from the future in the sense that the present state of G contains sufficient information so as to encapsulate the effects of the past system inputs. Here we use the notation x ( t ) to represent the stochastic process x ( t , ω ) omitting its dependence on ω . Furthermore, B n denotes the σ -algebra of Borel sets in D R n and S denotes a σ -algebra generated on a set S R n .
We denote the set of equivalence classes of measurable, integrable, and square-integrable R n or R n × m (depending on context) valued random processes on ( Ω , F , P ) over the semi-infinite parameter space [ 0 , ) by L 0 ( Ω , F , P ) , L 1 ( Ω , F , P ) , and L 2 ( Ω , F , P ) , respectively, where the equivalence relation is the one induced by P -almost-sure equality. In particular, elements of L 0 ( Ω , F , P ) take finite values P -almost surely (a.s.) or with probability one. Hence, depending on the context, R n will denote either the set of n × 1 real variables or the subspace of L 0 ( Ω , F , P ) comprising of R n random processes that are constant almost surely. All inequalities and equalities involving random processes on ( Ω , F , P ) are to be understood to hold P -almost surely. Furthermore, E [ · ] and E x 0 [ · ] denote, respectively, the expectation with respect to the probability measure P and with respect to the classical Wiener measure P x 0 .
Given x L 0 ( Ω , F , P ) , { x = 0 } denotes the set { ω Ω : x ( t , ω ) = 0 } , and so on. Given x L 0 ( Ω , F , P ) and E F , we say x is nonzero on E if P ( { x = 0 } E ) = 0 . Furthermore, given x L 1 ( Ω , F , P ) and a σ -algebra E F , E P [ x ] and E P [ x | E ] denote, respectively, the expectation of the random variable x and the conditional expectation of x given E , with all moments taken under the measure P . In formulations wherein it is clear from context which measure is used, we omit the symbol P in denoting expectation, and similarly for conditional expectation. Specifically, in such cases we denote the expectation with respect to the probability space ( Ω , F , P ) by E [ · ] , and similarly for conditional expectation.
A stochastic process x : R ¯ + × Ω D on ( Ω , F , P x 0 ) is called a martingale with respect to the filtration { F t } t 0 if and only if x ( t ) is a F t -measurable random vector for all t 0 , E [ x ( t ) ] < , and x ( τ ) = E [ x ( t ) | F τ ] for all t τ 0 . Thus, a martingale has the property that the expectation of the next value of the martingale is equal to its current value given all previous values of the dynamical process. If we replace the equality in x ( τ ) = E x ( t ) | F τ with “≤” (respectively, “≥”), then x ( · ) is a supermartingale (respectively, submartingale). Note that every martingale is both a submartingale and supermartingale.
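As a simple numerical illustration of the martingale property (with assumed times and sample sizes, purely for illustration), the conditional expectation E[w(t) | F_τ] of a standard Wiener process can be estimated by fixing a realized value w(τ) and averaging conditionally sampled values of w(t); the average should reproduce w(τ).

```python
import numpy as np

# Check the martingale property E[w(t) | F_tau] = w(tau) for a standard Wiener
# process by Monte Carlo: fix a realized value w(tau) and average many
# conditionally sampled values of w(t).  Times and sample sizes are illustrative.
rng = np.random.default_rng(2)
tau, t = 1.0, 3.0
w_tau = rng.standard_normal() * np.sqrt(tau)                      # one realized value of w(tau)
w_t = w_tau + np.sqrt(t - tau) * rng.standard_normal(1_000_000)   # samples of w(t) given F_tau
print("w(tau)          :", w_tau)
print("E[w(t) | F_tau] :", w_t.mean())                            # should be close to w(tau)
```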
A random variable τ : Ω [ 0 , ] is called a stopping time with respect to F t if and only if { ω Ω : τ ( ω ) t } F t , t 0 . Thus, the set of all ω Ω such that τ ( ω ) t is a F t -measurable set. Note that τ ( ω ) can take on finite as well as infinite values and characterizes whether at each time t an event at time τ ( ω ) < t has occurred using only the information in F t .
Finally, we write ‖·‖ for the Euclidean vector norm, rowᵢ(A) and colⱼ(A) for the i-th row and j-th column of a matrix A ∈ R^{p×q}, tr(·) for the trace operator, (·)⁻¹ for the inverse operator, V′(x) ≜ ∂V(x)/∂x for the Fréchet derivative of V at x, V″(x) ≜ ∂²V(x)/∂x² for the Hessian of V at x, and Hₙ for the Hilbert space of random vectors x ∈ Rⁿ with finite average power, that is, Hₙ ≜ {x : Ω → Rⁿ : E[xᵀx] < ∞}. For an open set D ⊆ Rⁿ, Hₙ^D ≜ {x ∈ Hₙ : x : Ω → D} denotes the set of all random vectors in Hₙ induced by D. Similarly, for every x₀ ∈ Rⁿ, Hₙ^{x₀} ≜ {x ∈ Hₙ : x = x₀ a.s.}. Moreover, R̄₊ⁿ and R₊ⁿ denote the nonnegative and positive orthants of Rⁿ, that is, if x ∈ Rⁿ, then x ∈ R̄₊ⁿ and x ∈ R₊ⁿ are equivalent, respectively, to x ≥≥ 0 and x >> 0, where x ≥≥ 0 (respectively, x >> 0) indicates that every component of x is nonnegative (respectively, positive). Furthermore, C² denotes the space of real-valued functions V : D → R that are two-times continuously differentiable with respect to x ∈ D ⊆ Rⁿ. Finally, we write x(t) → M as t → ∞ to denote that x(t) approaches the set M, that is, for every ε > 0 there exists T > 0 such that dist(x(t), M) < ε for all t > T, where dist(p, M) ≜ inf_{x∈M} ‖p − x‖.
Definition 1.
Let ( S , S ) and ( T , T ) be measurable spaces, and let μ : S × T R ¯ + . If the function μ ( s , B ) is S -measurable in s S for a fixed B T and μ ( s , B ) is a probability measure in B T for a fixed s S , then μ is called a (probability) kernel from S to T. Furthermore, for s t , the function μ s , t : S × S R is called a regular conditional probability measure if μ s , t ( · , S ) is measurable, μ s , t ( S , · ) is a probability measure, and μ s , t ( · , · ) satisfies
μ_{s,t}(x(s), B) = P(x(t) ∈ B | x(s)) = P(x(t) ∈ B | F_s),   x(·) ∈ Hₙ,     (1)
where P ( x ( t ) B | x ( s ) ) = P ( 0 , x , t , B ) , x R n , and P ( s , x , t , B ) , t s , is the transition probability of the point x R n at time instant s into all Borel subsets B R n at time instant t.
Any family of regular conditional probability measures {μ_{s,t}}_{s≤t} satisfying the Chapman–Kolmogorov equation ([32])
P(s, x, t, B) = ∫_{Rⁿ} P(s, x, σ, dz) P(σ, z, t, B),     (2)
or, equivalently,
μ_{s,t}(x, B) = ∫_{Rⁿ} μ_{s,σ}(x, dz) μ_{σ,t}(z, B),     (3)
where 0 ≤ s ≤ σ ≤ t < ∞, x, z ∈ Rⁿ, and B ∈ Bⁿ, is called a semigroup of Markov kernels. The Markov kernels are called time homogeneous if and only if μ_{s,t} = μ_{0,t−s} holds for all s ≤ t.
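For the standard Wiener process the Markov kernels are time homogeneous with Gaussian transition densities, and the Chapman–Kolmogorov equation (2) and (3) can be checked by direct numerical integration. The following sketch (illustrative times, points, and integration grid) composes two Wiener transition densities over an intermediate time and compares the result with the direct transition density.

```python
import numpy as np

def wiener_density(y, x, h):
    """Transition density of the standard Wiener process over an elapsed time h."""
    return np.exp(-(y - x) ** 2 / (2.0 * h)) / np.sqrt(2.0 * np.pi * h)

# Verify mu_{s,t}(x,B) = int mu_{s,sigma}(x,dz) mu_{sigma,t}(z,B) at the level of the
# densities; s, sigma, t, x, y, and the integration grid are illustrative choices.
s, sigma, t, x, y = 0.0, 0.7, 2.0, 0.3, -1.1
z = np.linspace(-15.0, 15.0, 20_001)
composed = np.trapz(wiener_density(z, x, sigma - s) * wiener_density(y, z, t - sigma), z)
direct = wiener_density(y, x, t - s)
print("composed kernel:", composed, " direct kernel:", direct)
```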
Consider the nonlinear stochastic dynamical system G given by
dx(t) = f(x(t)) dt + D(x(t)) dw(t),   x(0) = x₀ a.s.,   t ∈ I_{x(0)},     (4)
where, for every t ∈ I_{x₀}, x(t) ∈ Hₙ^D is an F_t-measurable random state vector, x(0) ∈ Hₙ^{x₀}, D ⊆ Rⁿ is a relatively open set with 0 ∈ D, w(·) is a d-dimensional independent standard Wiener process (i.e., Brownian motion) defined on a complete filtered probability space (Ω, F, {F_t}_{t≥0}, P), x(0) is independent of (w(t) − w(0)), t ≥ 0, f : D → Rⁿ and D : D → R^{n×d} are continuous, E ≜ f⁻¹(0) ∩ D⁻¹(0) ≜ {x ∈ D : f(x) = 0 and D(x) = 0} is nonempty, and I_{x(0)} = [0, τ_{x(0)}), 0 ≤ τ_{x(0)} ≤ ∞, is the maximal interval of existence for the solution x(·) of (4).
An equilibrium point of (4) is a point x e R n such that f ( x e ) = 0 and D ( x e ) = 0 . It is easy to see that x e is an equilibrium point of (4) if and only if the constant stochastic process x ( · ) = a . s . x e is a solution of (4). We denote the set of equilibrium points of (4) by E { ω Ω : x ( t , ω ) = x e } = { x e D : f ( x e ) = 0 and D ( x e ) = 0 } .
The filtered probability space ( Ω , F , { F t } t 0 , P ) is clearly a real vector space with addition and scalar multiplication defined componentwise and pointwise. A R n -valued stochastic process x : [ 0 , τ ] × Ω D is said to be a solution of (4) on the time interval [ 0 , τ ] with initial condition x ( 0 ) = a . s . x 0 if x ( · ) is progressively measurable (i.e., x ( · ) is nonanticipating and measurable in t and ω ) with respect to the filtration { F t } t 0 , f L 1 ( Ω , F , P ) , D L 2 ( Ω , F , P ) , and
x(t) = x₀ + ∫₀ᵗ f(x(σ)) dσ + ∫₀ᵗ D(x(σ)) dw(σ)   a.s.,   t ∈ [0, τ],     (5)
where the integrals in (5) are Itô integrals [38]. If the map t w ( t , ω ) , ω Ω , had a bounded variation, then the natural definition for the integrals in (5) would be the Lebesgue-Stieltjes integral where ω is viewed as a parameter. However, since sample Wiener paths are nowhere differentiable and not of bounded variation for almost all ω Ω the integrals in (5) need to be defined as Itô integrals [39,40].
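The role of the left-endpoint (Itô) evaluation can be seen numerically: for the integrand w itself, the Itô integral satisfies ∫₀ᵀ w(t) dw(t) = (w(T)² − T)/2, which differs by the correction term T/2 from what a calculus of bounded-variation paths would suggest. The sketch below (illustrative discretization) approximates the integral by a left-endpoint Riemann sum along a single sample path.

```python
import numpy as np

# Approximate the Ito integral  I = int_0^T w(t) dw(t)  by a left-endpoint sum and
# compare with the closed form (w(T)^2 - T)/2.  The discretization is illustrative.
rng = np.random.default_rng(3)
T, n = 1.0, 100_000
dt = T / n
dw = np.sqrt(dt) * rng.standard_normal(n)
w = np.concatenate(([0.0], np.cumsum(dw)))     # w(0), w(dt), ..., w(T)
ito_sum = np.sum(w[:-1] * dw)                  # left-endpoint (Ito) evaluation
print("Ito sum        :", ito_sum)
print("(w(T)^2 - T)/2 :", 0.5 * (w[-1] ** 2 - T))
```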
Note that for each fixed t 0 , the random variable ω x ( t , ω ) assigns a vector x ( ω ) to every outcome ω Ω of an experiment, and for each fixed ω Ω , the mapping t x ( t , ω ) is the sample path of the stochastic process x ( t ) , t 0 . A path-wise solution t x ( t ) of (4) in ( Ω , { F t } t 0 , P x 0 ) is said to be right maximally defined if x cannot be extended (either uniquely or non-uniquely) forward in time. We assume that all right maximal path-wise solutions to (4) in ( Ω , { F t } t 0 , P x 0 ) exist on [ 0 , ) , and hence, we assume that (4) is forward complete. Sufficient conditions for forward completeness or global solutions of (4) are given in [34,38].
Furthermore, we assume that f : D R n and D : D R n × d satisfy the uniform Lipschitz continuity condition
‖f(x) − f(y)‖ + ‖D(x) − D(y)‖_F ≤ L ‖x − y‖,   x, y ∈ D \ {0},     (6)
and the growth restriction condition
‖f(x)‖² + ‖D(x)‖_F² ≤ L² (1 + ‖x‖²),   x ∈ D \ {0},     (7)
for some Lipschitz constant L > 0 , and hence, since x ( 0 ) H n D and x ( 0 ) is independent of ( w ( t ) w ( 0 ) ) , t 0 , it follows that there exists a unique solution x L 2 ( Ω , F , P ) of (4) forward in time for all initial conditions in the following sense. For every x H n D \ { 0 } there exists τ x > 0 such that if x 1 : [ 0 , τ 1 ] × Ω D and x 2 : [ 0 , τ 2 ] × Ω D are two solutions of (4); that is, if x 1 , x 2 L 2 ( Ω , F , P ) with continuous sample paths almost surely solve (4), then τ x min { τ 1 , τ 2 } and P x 1 ( t ) = x 2 ( t ) , 0 t τ x = 1 .
The uniform Lipschitz continuity condition (6) guarantees uniqueness of solutions, whereas the linear growth condition (7) rules out finite escape times. A weaker sufficient condition for the existence of a unique solution to (4) using a notion of (finite or infinite) escape time under the local Lipschitz continuity condition (6) without the growth condition (7) is given in [41]. Alternatively, existence and uniqueness of solutions even when the uniform Lipschitz continuity condition (6) does not hold are given in ([38], p. 152).
The unique solution to (4) determines a R n -valued, time homogeneous Feller continuous Markov process x ( · ) , and hence, its stationary Feller transition probability function is given by (([31], Theorem 3.4), ([32], Theorem 9.2.8))
P(x(t) ∈ B | x(t₀) = x₀ a.s.) = P(0, x₀, t − t₀, B),   x₀ ∈ Rⁿ,     (8)
for all t t 0 and all Borel subsets B of R n , where P ( σ , x , t , B ) , t σ , denotes the probability of transition of the point x R n at time instant s into the set B R n at time instant t. Recall that every continuous process with Feller transition probability function is also a strong Markov process ([31] p. 101). Finally, we say that the dynamical system (4) is convergent in probability with respect to the closed set H n D c H n D if and only if the pointwise lim t s ( t , x , ω ) exists for every x D c R n and ω Ω .
Here, the measurable map s : [ 0 , τ x ) × D × Ω D is the dynamic or flow of the stochastic dynamical system (2) and, for all t , τ [ 0 , τ x ) , satisfies the cocycle property s ( τ , s ( t , x ) , ω ) = s ( t + τ , x , ω ) and the identity (on D ) property s ( 0 , x , ω ) = x for all x D and ω Ω . The measurable map s t s ( t , · , ω ) : D D is continuously differentiable for all t [ 0 , τ x ) outside a P -null set and the sample path trajectory s x s ( · , x , ω ) : [ 0 , τ x ) D is continuous in D for all t [ 0 , τ x ) . Thus, for every x D , there exists a trajectory of measures defined for all t [ 0 , τ x ) satisfying the dynamical processes (4) with initial condition x ( 0 ) = a . s . x 0 . For simplicity of exposition we write s ( t , x ) for s ( t , x , ω ) omitting its dependence on ω .
Definition 2.
A point p D is a limit point of the trajectory s ( · , x ) of (4) if there exists a monotonic sequence { t n } n = 0 of positive numbers, with t n as n , such that s ( t n , x ) a . s . p as n . The set of all limit points of s ( t , x ) , t 0 , is the limit set ω ( x ) of s ( · , x ) of (4).
It is important to note that the ω -limit set of a stochastic dynamical system is a ω -limit set of a trajectory of measures, that is, p ω ( x ) is a weak limit of a sequence of measures taken along every sample continuous bounded trajectory of (4). It can be shown that the ω -limit set of a stationary stochastic dynamical system attracts bounded sets and is measurable with respect to the σ -algebra of invariant sets. Thus, the measures of the stochastic process x ( · ) tend to an invariant set of measures and x ( t ) asymptotically tends to the closure of the support set (i.e., kernel) of this set of measures almost surely.
However, unlike deterministic dynamical systems, wherein ω -limit sets serve as global attractors, in stochastic dynamical systems stochastic invariance (see Definition 4) leads to ω -limit sets being defined for each fixed sample ω Ω of the underlying probability space ( Ω , F , P ) , and hence, are path-wise attractors. This is due to the fact that a cocycle property rather than a semigroup property holds for stochastic dynamical systems. For details see [42,43,44].
Definition 3
([33], Def. 7.7). Let x ( · ) be a time-homogeneous Markov process in H n D and let V : D R . Then the infinitesimal generator L of x ( t ) , t 0 , with x ( 0 ) = a . s . x 0 , is defined by
LV(x₀) ≜ lim_{t→0⁺} [ E^{x₀}[V(x(t))] − V(x₀) ] / t,   x₀ ∈ D,     (9)
where E x 0 denotes the expectation with respect to the transition probability measure P x 0 ( x ( t ) B ) P ( 0 , x 0 , t , B ) .
If V C 2 and has a compact support [45], and x ( t ) , t 0 , satisfies (4), then the limit in (9) exists for all x D and the infinitesimal generator L of x ( t ) , t 0 , can be characterized by the system drift and diffusion functions f ( x ) and D ( x ) defining the stochastic dynamical system (4) and is given by ([33], Theorem 7.9)
LV(x) ≜ (∂V(x)/∂x) f(x) + ½ tr( Dᵀ(x) (∂²V(x)/∂x²) D(x) ),   x ∈ D.     (10)
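As a concrete instance of (10), the following symbolic sketch evaluates LV for an arbitrarily chosen drift, diffusion, and test function (none of which are taken from this paper); it is only meant to show how the drift and diffusion enter the generator.

```python
import sympy as sp

# Symbolic evaluation of the infinitesimal generator
#   LV(x) = (dV/dx) f(x) + (1/2) tr( D(x)^T (d^2V/dx^2) D(x) )
# for an illustrative two-state system; f, D, and V below are assumed examples.
x1, x2 = sp.symbols("x1 x2", real=True)
x = sp.Matrix([x1, x2])

f = sp.Matrix([x2 - x1, x1 - x2])                       # illustrative drift
D = sp.Rational(1, 2) * sp.Matrix([[x2 - x1],
                                   [x1 - x2]])          # illustrative diffusion (d = 1)
V = sp.Rational(1, 2) * (x1**2 + x2**2)                 # quadratic test function

grad_V = sp.Matrix([V]).jacobian(x)                     # row vector dV/dx
hess_V = sp.hessian(V, (x1, x2))                        # Hessian d^2V/dx^2
LV = (grad_V * f)[0, 0] + sp.Rational(1, 2) * (D.T * hess_V * D).trace()
print(sp.simplify(LV))                                  # -> -3*(x1 - x2)**2/4
```

For this choice, LV is negative semidefinite and vanishes on the line x₁ = x₂, which is the type of Lyapunov structure exploited in the semistability results of Section 4.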
Next, we extend Proposition 2.1 of [30] to stochastic dynamical systems. First, however, the following definitions on stochastic invariance and essentially nonnegative vector fields are needed.
Definition 4.
A relatively open set D ⊆ Rⁿ is invariant with respect to (4) if D is Borel and, for all x₀ ∈ D, P^{x₀}(x(t) ∈ D) = 1, t ≥ 0.
Definition 5.
Let f = [f₁, …, fₙ]ᵀ : D ⊆ R̄₊ⁿ → Rⁿ. Then f is essentially nonnegative if fᵢ(x) ≥ 0 for all i = 1, …, n and x ∈ R̄₊ⁿ such that xᵢ = 0, where xᵢ denotes the i-th component of x.
Proposition 1.
Suppose R̄₊ⁿ ⊆ D. Then R̄₊ⁿ is an invariant set with respect to (4) if and only if f : D → Rⁿ is essentially nonnegative and D_{(i,j)}(x) = 0, j = 1, …, d, whenever xᵢ = 0, i = 1, …, n.
Proof. 
Define dist ( x , R ¯ + n ) inf y R ¯ + n x y , x R n . Now, suppose f : D R n is essentially nonnegative and let x R ¯ + n . For every i { 1 , , q } , if x i = 0 , then x i + h f i ( x ) + row i ( D ( x ) ) [ w ( h , ω ) w ( 0 , ω ) ] = h f i ( x ) 0 for all h 0 and all ω Ω , whereas, if x i > 0 , then it follows from the continuity of D ( · ) and the sample continuity of w ( · ) that x i + h f i ( x ) + row i ( D ( x ) ) [ w ( h , ω ) w ( 0 , ω ) ] 0 for all | h | sufficiently small and all ω Ω . Thus, x + h f ( x ) + row i ( D ( x ) ) [ w ( h , ω ) w ( 0 , ω ) ] R ¯ + n for all sufficiently small h > 0 and all ω Ω , and hence, lim h 0 + dist ( x + h f ( x ) + row i ( D ( x ) ) [ w ( h , ω ) w ( 0 , ω ) ] , R ¯ + n ) / h = 0 . It now follows from Lemma 2.1 of [46], with x ( 0 ) = a . s . x 0 , that P x 0 x ( t ) R ¯ + n = 1 for all t [ 0 , τ x 0 ) .
Conversely, suppose that R ¯ + n is invariant with respect to (4), let P x 0 ( x ( 0 ) R ¯ + n ) = 1 , and suppose, ad absurdum, x is such that there exists i { 1 , , q } such that x i ( 0 ) = a . s . 0 and f i ( x ( 0 ) ) h + row i ( D ( x ) ) [ w ( h , ω ) w ( 0 , ω ) ] < 0 for all ω Ω . Then, since f and D are continuous and a Wiener process w ( · ) can be positive or negative with equal probability, there exists sufficiently small h > 0 such that P x 0 ( f i ( x ( t ) ) d t + row i ( D ( x ( t ) ) ) d w ( t ) < 0 ) 0 for all t [ 0 , h ) , where x ( t ) is the solution to (4). Hence, x i ( t ) is strictly decreasing on [ 0 , h ) with nonzero probability, and thus, P x 0 x ( t ) R ¯ + n 1 for all t ( 0 , h ) , which leads to a contradiction. ☐
It follows from Proposition 1 that if x 0 0 , then x ( t ) a . s . 0 , t 0 , if and only if f is essentially nonnegative and D ( i , j ) ( x ) = 0 , j = 1 , , d , whenever x i = 0 , i = 1 , , n . In this case, we say that (4) is a stochastic nonnegative dynamical system. Henceforth, we assume that f and D are such that the nonlinear stochastic dynamical system (4) is a stochastic nonnegative dynamical system.
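The boundary conditions in Proposition 1 can be spot-checked numerically for a candidate drift–diffusion pair. The sketch below (an illustrative two-compartment system with state-proportional diffusion, not the model of this paper) samples points on the faces {xᵢ = 0} of the nonnegative orthant and verifies that fᵢ(x) ≥ 0 and that the i-th row of D(x) vanishes there.

```python
import numpy as np

# Spot-check the conditions of Proposition 1 for an illustrative pair (f, D):
# on every face {x_i = 0} of the nonnegative orthant we need f_i(x) >= 0 and
# D_(i,j)(x) = 0 for all j.  The drift and diffusion below are assumed examples.
def f(x):
    return np.array([x[1] - x[0], x[0] - x[1]])   # essentially nonnegative drift

def D(x):
    return 0.1 * np.array([[x[0]], [x[1]]])       # i-th row vanishes when x_i = 0

rng = np.random.default_rng(4)
ok = True
for _ in range(10_000):
    x = rng.uniform(0.0, 10.0, size=2)
    i = rng.integers(2)
    x[i] = 0.0                                    # sample a point on {x_i = 0}
    ok = ok and f(x)[i] >= 0.0 and np.allclose(D(x)[i, :], 0.0)
print("Proposition 1 boundary conditions hold at all sampled points:", bool(ok))
```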

3. Stability Theory for Stochastic Nonnegative Dynamical Systems

In this section, we establish key stability results in probability for stochastic nonnegative dynamical systems. The mathematical machinery used is supermartingale theory and ergodic theory of Markov processes [31]. Specifically, deterministic stability theory is extended to stochastic dynamical systems by establishing supermartingale properties of Lyapunov functions. The following definition introduces several notions of stability in probability for the equilibrium solution x ( t ) a . s . x e R ¯ + n of the stochastic nonnegative dynamical system (4) for I x ( 0 ) = [ 0 , ) .
Definition 6.
(i) The equilibrium solution x ( t ) a . s . x e R ¯ + n to (4) is Lyapunov stable in probability with respect to R ¯ + n if, for every ε > 0 ,
lim_{x₀→x_e} P^{x₀}( sup_{t≥0} ‖x(t) − x_e‖ > ε ) = 0.     (11)
Equivalently, the equilibrium solution x ( t ) a . s . x e R ¯ + n to (4) is Lyapunov stable in probability with respect to R ¯ + n if, for every ε > 0 and ρ ( 0 , 1 ) , there exists δ = δ ( ρ , ε ) > 0 such that, for all x 0 B δ ( x e ) R ¯ + n ,
P^{x₀}( sup_{t≥0} ‖x(t) − x_e‖ > ε ) ≤ ρ.     (12)
(ii) The equilibrium solution x ( t ) a . s . x e R ¯ + n to (4) is asymptotically stable in probability with respect to R ¯ + n if it is Lyapunov stable in probability with respect to R ¯ + n and
lim_{x₀→x_e} P^{x₀}( lim_{t→∞} ‖x(t) − x_e‖ = 0 ) = 1.     (13)
Equivalently, the equilibrium solution x ( t ) a . s . x e R ¯ + n to (4) is asymptotically stable in probability with respect to R ¯ + n if it is Lyapunov stable in probability with respect to R ¯ + n and, for every ρ ( 0 , 1 ) , there exists δ = δ ( ρ ) > 0 such that if x 0 B δ ( x e ) R ¯ + n , then
P^{x₀}( lim_{t→∞} ‖x(t) − x_e‖ = 0 ) ≥ 1 − ρ.     (14)
(iii) The equilibrium solution x ( t ) a . s . x e R ¯ + n to (4) is globally asymptotically stable in probability with respect to R ¯ + n if it is Lyapunov stable in probability with respect to R ¯ + n and, for all x 0 R ¯ + n ,
P^{x₀}( lim_{t→∞} ‖x(t) − x_e‖ = 0 ) = 1.     (15)
As in deterministic stability theory, for a given ε > 0 the subset B_ε(x_e) ∩ R̄₊ⁿ defines a cylindrical region in the (t, x)-space in which the trajectory x(t), t ≥ 0, remains. However, in stochastic stability theory, for every x₀ ∈ B_δ(x_e) ∩ R̄₊ⁿ, there exists a probability of less than or equal to ρ that the system solution s(t, x₀) leaves the subset B_ε(x_e) ∩ R̄₊ⁿ; and for x₀ = x_e this probability is zero. In other words, the probability of escape is continuous at x₀ = x_e, with small deviations from the equilibrium implying a small probability of escape.
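Definition 6(i) can be probed by Monte Carlo estimation of the escape probability P^{x₀}(sup_t ‖x(t) − x_e‖ > ε) over a finite horizon. The sketch below (an illustrative scalar diffusion with equilibrium x_e = 0; the horizon, step size, and sample counts are assumptions) shows the estimated escape probability shrinking as x₀ → x_e, which is the continuity property noted above.

```python
import numpy as np

# Monte Carlo estimate of the escape probability P_x0( sup_t |x(t)| > eps ) for the
# scalar SDE dx = -x dt + x dw with equilibrium x_e = 0.  The finite horizon,
# step size, and sample counts below are illustrative assumptions.
rng = np.random.default_rng(5)
eps, dt, n_steps, n_paths = 0.5, 1e-3, 10_000, 2_000

for x0 in (0.4, 0.2, 0.1, 0.05):
    x = np.full(n_paths, x0)
    escaped = np.zeros(n_paths, dtype=bool)
    for _ in range(n_steps):
        x += -x * dt + x * np.sqrt(dt) * rng.standard_normal(n_paths)
        escaped |= np.abs(x) > eps
    print(f"x0 = {x0:5.2f}  estimated escape probability = {escaped.mean():.4f}")
```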
The following lemma gives an equivalent characterization of Lyapunov and asymptotic stability in probability with respect to R ¯ + n in terms of class K , K , and K L functions ([47], p. 162).
Lemma 1.
(i) The equilibrium solution x ( t ) a . s . x e to (4) is Lyapunov stable in probability with respect to R ¯ + n if and only if for every ρ > 0 there exist a class K function α ρ ( · ) and a constant c = c ( ρ ) > 0 such that, for all x 0 B c ( x e ) R ¯ + n ,
P^{x₀}( ‖x(t) − x_e‖ > α_ρ(‖x₀ − x_e‖) ) ≤ ρ,   t ≥ 0.     (16)
(ii) The equilibrium solution x ( t ) a . s . x e to (4) is asymptotically stable in probability with respect to R ¯ + n if and only if for every ρ > 0 there exist a class K L function β ρ ( · , · ) and a constant c = c ( ρ ) > 0 such that, for all x 0 B c ( x e ) R ¯ + n ,
P^{x₀}( ‖x(t) − x_e‖ > β_ρ(‖x₀ − x_e‖, t) ) ≤ ρ,   t ≥ 0.     (17)
Proof. 
(i) Suppose that there exist a class K function α ρ ( · ) and a constant c = c ( ρ ) > 0 such that, for every ρ > 0 and x 0 B c ( x e ) R ¯ + n ,
P x 0 x ( t ) x e > α ρ ( x 0 x e ) ρ , t 0 .
Now, given ε > 0 , let δ ( ρ , ε ) = min { c ( ρ ) , α ρ 1 ( ε ) } . Then, for x 0 B δ ( x e ) R ¯ + n and t 0 ,
P x 0 x ( t ) x e > α ρ ( x 0 x e ) P x 0 x ( t ) x e > α ρ ( δ ) P x 0 x ( t ) x e > α ρ ( α ρ 1 ( ε ) ) P x 0 x ( t ) x e > ε .
Therefore, for every given ε > 0 and ρ > 0 , there exists δ > 0 such that, for all x 0 B δ ( x e ) R ¯ + n ,
P x 0 sup t 0 x ( t ) x e > ε ρ ,
which proves that the equilibrium solution x ( t ) a . s . x e is Lyapunov stable in probability with respect to R ¯ + n .
Conversely, for every given ε and ρ , let δ ¯ ( ε , ρ ) be the supremum of all admissible δ ( ε , ρ ) . Note that the function δ ( · , · ) is positive and nondecreasing in its first argument, but not necessarily continuous. For every ρ > 0 chose a class K function γ ρ ( r ) such that γ ρ ( r ) k δ ¯ ( r , ρ ) , 0 < k < 1 . Let c ( ρ ) = lim r γ ρ ( r ) and α ρ ( r ) = γ ρ 1 ( r ) , and note that α ρ ( · ) is class K ([48], Lemma 4.2). Next, for every ρ > 0 and x 0 B c ( ρ ) ( x e ) R ¯ + n , let ε = α ρ ( x 0 x e ) . Then, x 0 x e < δ ¯ ( ε , ρ ) and
P x 0 sup t 0 x ( t ) x e > ε ρ
imply
P x 0 x ( t ) x e > α ρ ( x 0 x e ) ρ , t 0 .
(ii) Suppose that there exists a class K L function β ( r , s ) such that (17) is satisfied. Then,
P x 0 x ( t ) x e > β ρ ( x 0 x e , 0 ) ρ , t 0 ,
which implies that equilibrium solution x ( t ) a . s . x e is Lyapunov stable in probability with respect to R ¯ + n . Moreover, for x 0 B c ( ρ ) ( x e ) R ¯ + n , the solution to (4) satisfies
P x 0 x ( t ) x e > β ρ ( c ( ρ ) , t ) ρ , t 0 .
Now, letting t yields P x 0 lim t x ( t ) x e > 0 ρ for every ρ > 0 , and hence, P x 0 lim t x ( t ) x e = 0 1 ρ , which implies that the equilibrium solution x ( t ) a . s . x e is asymptotically stable in probability with respect to R ¯ + n .
Conversely, suppose that the equilibrium solution x ( t ) a . s . x e is asymptotically stable in probability with respect to R ¯ + n . In this case, for every ρ > 0 there exist a constant c ( ρ ) > 0 and a class K function α ρ ( · ) such that, for every r ( 0 , c ( ρ ) ] , the solution x ( t ) , t 0 , to (4) satisfies
P x 0 sup t 0 x ( t ) x e > α ρ ( r ) P x 0 sup t 0 x ( t ) x e > α ρ ( x 0 x e ) ρ
for all x 0 x e < r . Moreover, given η > 0 there exists T = T ρ ( η , r ) 0 such that
P x 0 sup t T ρ ( η , r ) x ( t ) x e > η ρ .
Let T ¯ ρ ( η , r ) be the infimum of all admissible T ρ ( η , r ) and note that T ¯ ρ ( η , r ) is nonnegative and nonincreasing in η , nondecreasing in r, and T ¯ ρ ( η , r ) = 0 for all η α ( r ) . Now, let
W r , ρ ( η ) = 2 η η 2 η T ¯ ρ ( s , r ) d s + r η T ¯ ρ ( η , r ) + r η
and note that W r , ρ ( η ) is positive and has the following properties: (i) For every fixed r and ρ , W r , ρ ( η ) is continuous, strictly decreasing, and lim η W r , ρ ( η ) = 0 ; and (ii) for every fixed η and ρ , W r , ρ ( η ) is strictly increasing in r.
Next, let U r , ρ = W r , ρ 1 and note that U r , ρ satisfies properties (i) and (ii) of W r , ρ , and T ¯ ρ ( U r , ρ ( σ ) , r ) < W r , ρ ( U r , ρ ( σ ) ) = σ . Therefore,
P x 0 x ( t ) x e > U r , ρ ( t ) ρ , t 0 ,
for all x 0 x e < r . Now, using (21) and (22) it follows that
P x 0 x ( t ) x e > α ρ ( x 0 x e ) U c ( ρ ) , ρ ( t ) ρ , x 0 x e < c ( ρ ) , t 0 .
Thus, inequality (17) is satisfied with β ρ ( x 0 x e , t ) = α ρ ( x 0 x e ) U c ( ρ ) , ρ ( t ) . ☐
Next, we present sufficient conditions for Lyapunov and asymptotic stability in probability for nonlinear stochastic nonnegative dynamical systems. First, however, the following definition of a recurrent process relative to a domain D r is needed.
Definition 7.
A Markov process x ( · ) in H n D is recurrent relative to the domain D r or, equivalently, D r D is recurrent in D , if there exists a finite-time t > 0 such that
P^x( x(t) ∈ D_r ) = 1.
In addition, D r is positive recurrent if
sup_{x ∈ D_c} E^x[ inf{ t ≥ 0 : x(t) ∈ D_r } ] < ∞
for every compact set D c D .
Theorem 1.
Let D be an open subset relative to R ¯ + n that contains x e . Consider the nonlinear stochastic dynamical system (4) where f is essentially nonnegative and f ( x e ) = 0 , D ( i , j ) ( x ) = 0 , j = 1 , , d , whenever x i = 0 , i = 1 , , n , and D ( x e ) = 0 . Assume that there exists a two-times continuously differentiable function V : D R such that
V(x_e) = 0,     (25)
V(x) > 0,   x ∈ D,   x ≠ x_e,     (26)
(∂V(x)/∂x) f(x) + ½ tr( Dᵀ(x) (∂²V(x)/∂x²) D(x) ) ≤ 0,   x ∈ D.     (27)
Then the equilibrium solution x ( t ) x e to (4) is Lyapunov stable in probability with respect to R ¯ + n . If, in addition,
(∂V(x)/∂x) f(x) + ½ tr( Dᵀ(x) (∂²V(x)/∂x²) D(x) ) < 0,   x ∈ D,   x ≠ x_e,     (28)
then the equilibrium solution x ( t ) x e to (4) is asymptotically stable in probability with respect to R ¯ + n . Finally, if D = R ¯ + n and V ( · ) is radially unbounded, then the equilibrium solution x ( t ) x e to (4) is globally asymptotically stable in probability with respect to R ¯ + n .
Proof. 
Let δ > 0 be such that B δ ( x e ) R ¯ + n D , define V δ inf x D \ B δ ( x e ) R ¯ + n V ( x ) > 0 , and let τ δ be the stopping time wherein the trajectory x ( t ) , t 0 , of (4) exits the bounded domain B δ ( x e ) R ¯ + n D with τ δ ( t ) min { t , τ δ } . Since V ( · ) is two-times continuously differentiable and (27) holds it follows from Lemma 5.4 of [31] that
E x V ( x ( τ δ ( t ) ) ) V ( x )
for all x B δ ( x e ) R ¯ + n and t 0 . Now, using Chebyshev’s inequality ([31], p. 29) yields
P x sup 0 s t x ( s ) x e > δ E x V ( x ( τ δ ( t ) ) ) V δ V ( x ) V δ .
Next, taking the limit as t , (30) yields
P x sup s 0 x ( s ) x e > δ V ( x ) V δ ,
and hence, Lyapunov stability in probability with respect to R ¯ + n follows from the continuity of V ( · ) and (25).
To prove asymptotic stability in probability with respect to R ¯ + n , note that the stochastic process V ( x ( τ δ ( t ) ) ) is a supermartingale ([31], Lemma 5.4), and hence, it follows from Theorem 5.1 of [31] that
lim t V ( x ( τ δ ( t ) ) ) = a . s . ν .
Let B x denote the set of all sample trajectories of (4) starting from x R ¯ + q for which τ δ = . Since the equilibrium solution x ( t ) x e to (4) is Lyapunov stable in probability with respect to R ¯ + n , it follows that
lim x x e P x ( B x ) = 1 .
Next, it follows from Theorem 3.9 of [31] and (28) that all sample trajectories contained in B x , except for a set of trajectories with measure zero, satisfy inf t > 0 x ( t ) x e = 0 . Moreover, it follows from Lemma 5.3 of [31] that
lim   inf t x ( t ) x e = 0 ,
and hence, using (25), lim   inf t V ( x ( t ) ) = 0 . Now, (32) implies
lim t V ( x ( τ δ ( t ) ) ) = lim t V ( x ( t ) )
for almost all sample trajectories in B x , and hence,
lim t V ( x ( t ) ) = lim   inf t V ( x ( t ) ) = 0 ,
which, using (25) and (26), further implies, that
lim t x ( t ) x e = 0 .
Now, asymptotic stability in probability with respect to R ¯ + n is direct consequence of (33) and (37).
Finally, to prove global asymptotic stability in probability with respect to R ¯ + n note that it follows from Lyapunov stability in probability with respect to R ¯ + n that, for every ε > 0 and ρ = ε , there exists δ > 0 such that, for all x B δ ( x e ) R ¯ + n ,
P x sup t > 0 x ( t ) x e > ε < ε .
Moreover, it follows from Lemma 3.9, Theorem 3.9 of [31], and the radial unboundedness of V ( · ) that the solution x ( t ) , t 0 , of (4) is recurrent relative to the domain B ε ( x e ) R ¯ + n for every ε > 0 . Thus, τ ˜ δ < a . s . , where τ ˜ δ is the first hitting time of the trajectories starting from the set R ¯ + n \ B δ ( x e ) and transitioning into the set B δ ( x e ) R ¯ + n .
Now, using the strong Markov property of solutions and choosing δ > 0 such that x R ¯ + n \ B δ ( x e ) yields
P x lim   sup t x ( t ) x e > ε = σ = 0 y B δ ( x e ) R ¯ + n P τ ˜ δ d σ , x ( τ ˜ δ ) d y P y lim   sup t x ( t ) x e > ε = σ = 0 y B δ ( x e ) R ¯ + n P τ ˜ δ d σ , x ( τ ˜ δ ) d y P y sup t > 0 x ( t ) x e > ε ε ,
which proves global asymptotic stability in probability with respect to R ¯ + n . ☐
As noted in [37], a more general stochastic stability notion can also be introduced here involving stochastic stability and convergence to an invariant (stationary) distribution. In this case, state convergence is not to an equilibrium point but rather to a stationary distribution. This framework can relax the vanishing perturbation assumption D ( x e ) = 0 at the equilibrium point x e and requires a more involved analysis framework showing stability of the underlying Markov semigroup [49].
As in nonlinear stochastic dynamical system theory [31], converse Lyapunov theorems for Lyapunov and asymptotic stability in probability for stochastic nonnegative dynamical systems can also be established. However, in this case, a non-degeneracy condition on D ( x ) , x D , is required [31].
Finally, we establish a stochastic version of the Krasovskii-LaSalle stability theorem for nonnegative dynamical systems. For nonlinear stochastic dynamical systems this result is due to Mao [50].
Theorem 2.
Consider the nonlinear stochastic nonnegative dynamical system (4). Let D R ¯ + n be an invariant set with respect to (4) and assume that there exists a two-times continuously differentiable function V : D R ¯ + and a continuous function η : R ¯ + R ¯ + such that
(∂V(x)/∂x) f(x) + ½ tr( Dᵀ(x) (∂²V(x)/∂x²) D(x) ) ≤ −η(V(x)),   x ∈ D.     (40)
Then, for every x 0 D , lim t V ( x ( t ) ) exists and is finite almost surely, and
lim_{t→∞} η(V(x(t))) = 0   a.s.     (41)
Proof. 
Since D R ¯ + n is invariant with respect to (4), it follows that, for all x 0 D , P x 0 x ( t ) D = 1 , t 0 . Furthermore, using Itô’s chain rule formula and (40) we have
V ( x ( t ) ) = V ( x 0 ) + 0 t L V ( x ( σ ) ) d σ + 0 t V ( x ( σ ) ) x D ( x ( σ ) ) d w ( σ ) V ( x 0 ) 0 t η ( V ( x ( σ ) ) ) d σ + 0 t V ( x ( σ ) ) x D ( x ( σ ) ) d w ( σ ) .
Now, it follows from Theorem 7 of ([51], p. 139) that lim t V ( x ( t ) ) exists and is finite almost surely, and
lim t 0 t η ( V ( x ( σ ) ) ) d σ < a . s . .
To show that lim t η ( V ( x ( t ) ) ) = a . s . 0 suppose, ad absurdum, that there exists a sample space Ω ¯ Ω such that P ( Ω ¯ ) > 0 and
lim   sup t η ( V ( x ( t , ω ) ) ) > 0 , ω Ω ¯ .
Let { t n } n = 0 , n Z + , be a monotonic sequence with t n + 1 < t n + 1 and note that there exist ε > 0 and N Z + such that
η ( V ( x ( t n , ω ) ) ) > ε , n N .
Now, it follows from the continuity of η ( · ) and V ( · ) , and the sample continuity of x ( · ) that there exist δ > 0 , δ 1 > 0 , and δ 2 > 0 , such that if | V ( x ( t n , ω ) ) V ( x ( t , ω ) ) | δ 2 , then
| η ( V ( x ( t n , ω ) ) ) η ( V ( x ( t , ω ) ) ) | ε 2 ,
if x ( t n , ω ) x ( t , ω ) δ 1 , then
| V ( x ( t n , ω ) ) V ( x ( t , ω ) ) | δ 2 ,
and if | t n t | δ , then
x ( t n , ω ) x ( t , ω ) δ 1 .
Thus, using (45) and (46) it follows that, for all | t n t | δ and n N ,
η ( V ( x ( t , ω ) ) ) η ( V ( x ( t n , ω ) ) ) | η ( V ( x ( t n , ω ) ) ) η ( V ( x ( t , ω ) ) ) | > ε 2 ,
and hence,
lim t 0 t η ( V ( x ( σ ) ) ) d σ n = N t n t n + δ η ( V ( x ( σ ) ) ) d σ n = N ε δ 2 = ,
which contradicts (43). Thus, lim t η ( V ( x ( t ) ) ) = a . s . 0 . ☐
Note that if we define D_η ≜ {v ≥ 0 : η(v) = 0}, then it can be shown that D_η ≠ ∅ and (41) implies
lim_{t→∞} dist( V(x(t)), D_η ) = 0   a.s.,     (51)
that is, V(x(t)) asymptotically approaches the set D_η with probability one. Thus, if η(V(x)) = 0 if and only if x = x_e and V(·) is positive definite with respect to x_e, then it follows from Theorems 1 and 2 that the equilibrium solution x(t) ≡ x_e a.s. to (4) is asymptotically stable in probability with respect to R̄₊ⁿ.
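The supermartingale mechanism behind Theorems 1 and 2 can also be checked numerically: under condition (27) (respectively, (40)), the expected value of V along solutions is nonincreasing. The sketch below (illustrative scalar dynamics and quadratic V, with assumed discretization and path counts) estimates E[V(x(t))] by Monte Carlo and confirms the monotone decay.

```python
import numpy as np

# Verify numerically that E[V(x(t))] is nonincreasing for  dx = -x dt + 0.5 x dw
# with V(x) = x^2, for which LV(x) = -2 x^2 + 0.25 x^2 = -1.75 x^2 <= 0, so that
# V along the solution is a supermartingale.  All numerical values are assumptions.
rng = np.random.default_rng(6)
dt, n_steps, n_paths = 1e-3, 5_000, 20_000
x = np.full(n_paths, 1.5)

mean_V = [np.mean(x ** 2)]
for _ in range(n_steps):
    x += -x * dt + 0.5 * x * np.sqrt(dt) * rng.standard_normal(n_paths)
    mean_V.append(np.mean(x ** 2))

mean_V = np.array(mean_V)
print("E[V] nonincreasing (up to Monte Carlo noise):",
      bool(np.all(np.diff(mean_V) <= 1e-4)))
print("E[V(x(0))] =", mean_V[0], " E[V(x(T))] =", mean_V[-1])
```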

4. Semistability of Stochastic Nonnegative Dynamical Systems

As shown in [30], thermodynamic systems give rise to systems that possess a continuum of equilibria. In this section, we develop a stability analysis framework for stochastic systems having a continuum of equilibria. Since, as noted in [37,52], every neighborhood of a non-isolated equilibrium contains another equilibrium, a non-isolated equilibrium cannot be asymptotically stable. Hence, asymptotic stability is not the appropriate notion of stability for systems having a continuum of equilibria. Two notions that are of particular relevance to such systems are convergence and semistability. Convergence is the property whereby every system solution converges to a limit point that may depend on the system initial condition. Semistability is the additional requirement that all solutions converge to limit points that are Lyapunov stable. Semistability for an equilibrium thus implies Lyapunov stability, and is implied by asymptotic stability.
In this section, we present necessary and sufficient conditions for stochastic semistability. It is important to note that stochastic semistability theory was also developed in [37] for a stronger set of stability in probability definitions. The results in this section, though parallel the results in [37], are predicated on a weaker set of stability in probability definitions, and hence, provide a stronger set of semistability results. First, we present several key propositions. The following proposition gives a sufficient condition for a trajectory of (4) to converge to a limit point. For this result, D c D R ¯ + n denotes a positively invariant set with respect to (4) and s t ( H n D c ) denotes the image of H n D c H n D under the flow s t : H n D c H n D , that is, s t ( H n D c ) { y : y = s t ( x 0 ) for   some x ( 0 ) = a . s . x 0 H n D c } .
Proposition 2.
Consider the nonlinear stochastic nonnegative dynamical system (4) and let x D c . If the limit set ω ( x ) of (4) contains a Lyapunov stable in probability (with respect to R ¯ + n ) equilibrium point y, then lim x y P x lim t s ( t , x ) y = 0 = 1 , that is, ω ( x ) = a . s . { y } as x y .
Proof. 
Suppose y ω ( x ) is Lyapunov stable in probability with respect to R ¯ + n and let N ε D c be a relatively open neighborhood of y. Since y is Lyapunov stable in probability with respect to R ¯ + n , there exists a relatively open neighborhood N δ D c of y such that s t ( H n N δ ) H n N ε as x y for every t 0 . Now, since y ω ( x ) , it follows that there exists τ 0 such that s ( τ , x ) H n N δ . Hence, s ( t + τ , x ) = s t ( s ( τ , x ) ) s t ( H n N δ ) H n N ε for every t > 0 . Since N ε D c is arbitrary, it follows that y = a . s . lim t s ( t , x ) . Thus, lim n s ( t n , x ) = a . s . y as x y for every sequence { t n } n = 1 , and hence, ω ( x ) = a . s . { y } as x y . ☐
The following definition introduces the notion of stochastic semistability.
Definition 8.
An equilibrium solution x ( t ) a . s . x e E of (4) is stochastically semistable with respect to R ¯ + n if the following statements hold.
(i) 
For every ε > 0 , lim x 0 x e P x 0 sup 0 t < x ( t ) x e > ε = 0 . Equivalently, for every ε > 0 and ρ ( 0 , 1 ) , there exist δ = δ ( ε , ρ ) > 0 such that, for all x 0 B δ ( x e ) R ¯ + n ,
P x 0 sup 0 t < x ( t ) x e > ε ρ .
(ii) 
lim dist ( x 0 , E ) 0 P x 0 lim t dist ( x ( t ) , E ) = 0 = 1 . Equivalently, for every ρ ( 0 , 1 ) , there exist δ = δ ( ρ ) > 0 such that if dist ( x 0 , E ) δ , then P x 0 lim t dist ( x ( t ) , E ) = 0 1 ρ .
The dynamical system (4) is stochastically semistable with respect to R ¯ + n if every equilibrium solution of (4) is stochastically semistable with respect to R ¯ + n . Finally, the dynamical system (4) is globally stochastically semistable with respect to R ¯ + n if (i) holds and P x 0 lim t dist ( x ( t ) , E ) = 0 = 1 for all x 0 R ¯ n .
Note that if x ( t ) a . s . x e E only satisfies (i) in Definition 8, then the equilibrium solution x ( t ) a . s . x e E of (4) is Lyapunov stable in probability with respect to R ¯ + n .
Definition 9.
For a given ρ ( 0 , 1 ) , the ρ -domain of semistability with respect to R ¯ + n is the set of points x 0 D R ¯ n such that if x ( t ) , t 0 , is a solution to (4) with x ( 0 ) = a . s . x 0 , then x ( t ) converges to a Lyapunov stable (with respect to R ¯ + n ) in probability equilibrium point in D with probability greater than or equal to 1 ρ .
Note that if (4) is stochastically semistable, then its ρ -domain of semistability contains the set of equilibria in its interior.
Next, we present alternative equivalent characterizations for stochastic semistability of (4). This result is an extension of Proposition 2.2 of [37] to the more general semistability definition presented in this paper.
Proposition 3.
Consider the nonlinear stochastic nonnegative dynamical system G given by (4). Then the following statements are equivalent:
(i) G is stochastically semistable with respect to R ¯ + n .
( i i ) For every x e E and ρ > 0 , there exist class K and L functions α ρ ( · ) and β ρ ( · ) , respectively, and δ = δ ( x e , ρ ) > 0 such that, if x 0 B δ ( x e ) R ¯ + n , then
P x 0 x ( t ) x e > α ρ ( x 0 x e ) ρ , t 0 ,
and P x 0 dist ( x ( t ) , E ) > β ρ ( t ) ρ , t 0 .
(iii) For every x e E and ρ > 0 , there exist class K functions α 1 ρ ( · ) and α 2 ρ ( · ) , a class L function β ρ ( · ) , and δ = δ ( x e , ρ ) > 0 such that, if x 0 B δ ( x e ) R ¯ + n , then
P x 0 ( dist ( x ( t ) , E ) > α 2 ρ ( x 0 x e ) β ρ ( t ) ) P x 0 α 1 ρ ( x ( t ) x e ) > α 2 ρ ( x 0 x e ) ρ , t 0 .
Proof. 
To show that (i) implies (ii), suppose (4) is stochastically semistable with respect to R ¯ + n and let x e E . It follows from Lemma 1 that for every ρ > 0 there exists δ = δ ( x e , ρ ) > 0 and a class K function α ρ ( · ) such that if x 0 x e δ , then P x 0 x ( t ) x e > α ρ ( x 0 x e ) ρ , t 0 . Without loss of generality, we can assume that δ is such that B δ ( x e ) ¯ R ¯ + n is contained in the ρ -domain of semistability of (4). Hence, for every x 0 B δ ( x e ) ¯ R ¯ + n , lim t x ( t ) = a . s . x * E and, consequently, P x 0 lim t dist ( x ( t ) , E ) = 0 = 1 .
For every ε > 0 , ρ > 0 , and x 0 B δ ( x e ) ¯ R ¯ + n , define T x 0 ( ε , ρ ) to be the infimum of T with the property that P x 0 sup t T dist ( x ( t ) , E ) > ε ρ , that is,
T x 0 ( ε , ρ ) inf T : P x 0 sup t T dist ( x ( t ) , E ) > ε ρ .
For each x 0 B δ ( x e ) ¯ R ¯ + n and ρ , the function T x 0 ( ε , ρ ) is nonnegative and nonincreasing in ε , and T x 0 ( ε , ρ ) = 0 for sufficiently large ε .
Next, let T ( ε , ρ ) sup { T x 0 ( ε , ρ ) : x 0 B δ ( x e ) ¯ R ¯ + n } . We claim that T is well defined. To show this, consider ε > 0 , ρ > 0 , and x 0 B δ ( x e ) ¯ R ¯ + n . Since P x 0 sup t T x 0 ( ε , ρ ) dist ( x ( t ) , E ) > ε ρ , it follows from the sample continuity of s that, for every ε > 0 and ρ > 0 , there exists an open neighborhood U of x 0 such that P x 0 sup t T z ( ε , ρ ) dist ( s ( t , z ) , E ) > ε ρ for every z U . Hence, lim   sup z x 0 T z ( ε , ρ ) T x 0 ( ε , ρ ) implying that the function x 0 T x 0 ( ε , ρ ) is upper semicontinuous at the arbitrarily chosen point x 0 , and hence on B δ ( x e ) ¯ R ¯ + n . Since an upper semicontinuous function defined on a compact set achieves its supremum, it follows that T ( ε , ρ ) is well defined. The function T ( · ) is the pointwise supremum of a collection of nonnegative and nonincreasing functions, and hence is nonnegative and nonincreasing. Moreover, T ( ε , ρ ) = 0 for every ε > max { α ρ ( x 0 x e ) : x 0 B δ ( x e ) ¯ R ¯ + n } .
Let ψ ρ ( ε ) 2 ε ε / 2 ε T ( σ , ρ ) d σ + 1 ε T ( ε , ρ ) + 1 ε . The function ψ ρ ( ε ) is positive, continuous, strictly decreasing, and ψ ρ ( ε ) 0 as ε . Choose β ρ ( · ) = ψ 1 ( · ) . Then β ρ ( · ) is positive, continuous, strictly decreasing, and lim σ β ρ ( σ ) = 0 . Furthermore, T ( β ρ ( σ ) , ρ ) < ψ ρ ( β ρ ( σ ) ) = σ . Hence, P x 0 dist ( x ( t ) , E ) > β ρ ( t ) ρ , t 0 .
Next, to show that (ii) implies (iii), suppose (ii) holds and let x e E . Then it follows from (i) of Lemma 1 that x e is Lyapunov stable in probability with respect to R ¯ + n . For every ρ > 0 , choosing x 0 sufficiently close to x e , it follows from the inequality P x 0 x ( t ) x e > α ρ ( x 0 x e ) ρ , t 0 , that trajectories of (4) starting sufficiently close to x e are bounded, and hence, the positive limit set of (4) is nonempty. Since P x 0 lim t dist ( x ( t ) , E ) = 0 = 1 as dist ( x 0 , E ) 0 , it follows that the positive limit set is contained in E as dist ( x 0 , E ) 0 .
Now, since every point in E is Lyapunov stable in probability with respect to R ¯ + n , it follows from Proposition 2 that lim t x ( t ) = a . s . x * as x 0 x * , where x * E is Lyapunov stable in probability with respect to R ¯ + n . If x * = x e , then it follows using similar arguments as above that there exists a class L function β ^ ρ ( · ) such that
P x 0 dist ( x ( t ) , E ) > β ^ ρ ( t ) P x 0 x ( t ) x e > β ^ ρ ( t ) ρ
for every x 0 satisfying x 0 x e < δ and t 0 . Hence,
P x 0 dist ( x ( t ) , E ) > x ( t ) x e β ^ ρ ( t ) ρ , t 0 .
Next, consider the case where x * x e and let α 1 ρ ( · ) be a class K function. In this case, note that
P x 0 lim t dist ( x ( t ) , E ) / α 1 ρ ( x ( t ) x e ) = 0 1 ρ ,
and hence, it follows using similar arguments as above that there exists a class L function β ρ ( · ) such that
P x 0 dist ( x ( t ) , E ) > α 1 ρ ( x ( t ) x e ) β ρ ( t ) ρ , t 0 .
Now, note that α₁ρ ∘ αρ is of class K (by [48], Lemma 4.2), and hence, (iii) follows immediately.
Finally, to show that (iii) implies (i), suppose (iii) holds and let x e E . Then it follows that for every ρ > 0 ,
P x 0 α 1 ρ ( x ( t ) x e ) > α 2 ρ ( x ( 0 ) x e ) ρ , t 0 ,
that is, P x 0 [ x ( t ) x e > α ρ ( x ( 0 ) x e ) ] ρ , where t 0 and α ρ = α 1 ρ 1 α 2 ρ is of class K (by [48], Lemma 4.2). It now follows from (i) of Lemma 1 that x e is Lyapunov stable in probability with respect to R ¯ + n . Since x e was chosen arbitrarily, it follows that every equilibrium point is Lyapunov stable in probability with respect to R ¯ + n . Furthermore, P x 0 lim t dist ( x ( t ) , E ) = 0 1 ρ .
Choosing x 0 sufficiently close to x e , it follows from the inequality
P x 0 x ( t ) x e > α ρ ( x 0 x e ) ρ , t 0 ,
that trajectories of (4) are almost surely bounded as x 0 x e , and hence, the positive limit set of (4) is nonempty as x 0 x e . Since every point in E is Lyapunov stable in probability with respect to R ¯ + n , it follows from Proposition 2 that lim t x ( t ) = a . s . x * as x 0 x * , where x * E is Lyapunov stable in probability with respect to R ¯ + n . Hence, by Definition 8, (4) is stochastically semistable with respect to R ¯ + n . ☐
Next, we develop necessary and sufficient conditions for stochastic semistability. First, we present sufficient conditions for stochastic semistability. The following theorems generalize Theorems 3.1 and 3.2 of [37].
Theorem 3.
Consider the nonlinear stochastic nonnegative dynamical system (4). Let Q R ¯ + n be a relatively open neighborhood of E and assume that there exists a two-times continuously differentiable function V : Q R ¯ + such that
V′(x) f(x) + ½ tr( Dᵀ(x) V″(x) D(x) ) < 0,   x ∈ Q \ E.     (52)
If every equilibrium point of (4) is Lyapunov stable in probability with respect to R ¯ + n , then (4) is stochastically semistable with respect to R ¯ + n . Moreover, if Q = R ¯ + n and V ( x ) as x , then (4) is globally stochastically semistable with respect to R ¯ + n .
Proof. 
Since every equilibrium point of (4) is Lyapunov stable in probability with respect to R ¯ + n by assumption, for every z E , there exists a relatively open neighborhood V z of z such that s ( [ 0 , ) × V z B ε ( z ) ) , ε > 0 , is bounded and contained in Q as ε 0 . The set V ε z E V z B ε ( z ) , ε > 0 , is a relatively open neighborhood of E contained in Q . Consider x V ε so that there exists z E such that x V z B ε ( z ) and s ( t , x ) H n V z B ε ( z ) , t 0 , as ε 0 . Since V z B ε ( z ) is bounded and invariant with respect to the solution of (4) as ε 0 , it follows that V ε is invariant with respect to the solution of (4) as ε 0 . Furthermore, it follows from (52) that L V ( s ( t , x ) ) < 0 , t 0 , and hence, since V ε is bounded it follows from Theorem 2 that lim t L V ( s ( t , x ) ) = a . s . 0 as ε 0 .
It is easy to see that L V ( x ) 0 by assumption and L V ( x e ) = 0 , x e E . Therefore, s ( t , x ) a . s . E as t and ε 0 , which implies that lim dist ( x , E ) 0 P x ( lim t dist ( s ( t , x ) , E ) = 0 ) = 1 . Finally, since every point in E is Lyapunov stable in probability with respect to R ¯ + n , it follows from Proposition 2 that lim t s ( t , x ) = a . s . x * as x x * , where x * E is Lyapunov stable in probability with respect to R ¯ + n . Hence, by Definition 8, (4) is semistable. For Q = R ¯ + n global stochastic semistability with respect to R ¯ + n follows from identical arguments using the radially unbounded condition on V ( · ) . ☐
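To make the drift condition (52) concrete, the following minimal numerical sketch evaluates the generator expression V ′ ( x ) f ( x ) + ½ tr ( D T ( x ) V ″ ( x ) D ( x ) ) for a simple two-compartment system of the type considered in Example 1 below. The function names, the value γ = 0.5, and the quadratic Lyapunov candidate are illustrative assumptions used only to show how the condition can be checked pointwise; they are not part of the development above.

import numpy as np

gamma = 0.5  # noise intensity; the Example 1 assumption below needs gamma**2 <= 1

def f(x):
    # drift: energy flows from the more energetic to the less energetic compartment
    return np.array([x[1] - x[0], x[0] - x[1]])

def D(x):
    # single diffusion column: probabilistic variation of the energy exchange
    return np.array([[gamma * (x[1] - x[0])], [gamma * (x[0] - x[1])]])

def V_grad(x, alpha):
    # gradient of V(x) = 0.5*(x1 - alpha)**2 + 0.5*(x2 - alpha)**2
    return np.array([x[0] - alpha, x[1] - alpha])

def V_hess(x, alpha):
    return np.eye(2)

def generator_LV(x, alpha):
    # LV(x) = V'(x) f(x) + 0.5 * tr( D(x)^T V''(x) D(x) )
    Dx = D(x)
    return V_grad(x, alpha) @ f(x) + 0.5 * np.trace(Dx.T @ V_hess(x, alpha) @ Dx)

# spot check: LV < 0 away from the equilibrium set {x1 = x2} and LV = 0 on it
rng = np.random.default_rng(0)
for _ in range(5):
    x = rng.uniform(0.0, 5.0, size=2)
    print(x, generator_LV(x, alpha=x.mean()))
print(np.array([2.0, 2.0]), generator_LV(np.array([2.0, 2.0]), alpha=2.0))

With this choice one finds LV(x) = (γ² − 1)(x1 − x2)², so the printed values are strictly negative off the set {x1 = x2} and zero on it, which is exactly the sign pattern required by (52).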
Next, we present a slightly more general theorem for stochastic semistability wherein we do not assume that all points in L V 1 ( 0 ) are Lyapunov stable in probability with respect to R ¯ + n but rather we assume that all points in ( η V ) 1 ( 0 ) are Lyapunov stable in probability with respect to R ¯ + n for some continuous function η : R ¯ + R ¯ + .
Theorem 4.
Consider the nonlinear stochastic nonnegative dynamical system (4) and let Q R ¯ + n be a relatively open neighborhood of E . Assume that there exist a two-times continuously differentiable function V : Q R ¯ + and a continuous function η : R ¯ + R ¯ + such that
V ( x ) f ( x ) + 1 2 tr D T ( x ) V ( x ) D ( x ) η ( V ( x ) ) , x Q .
If every point in the set M { x Q : η ( V ( x ) ) = 0 } is Lyapunov stable in probability with respect to R ¯ + n , then (4) is stochastically semistable with respect to R ¯ + n . Moreover, if Q = R ¯ + n and V ( x ) as x , then (4) is globally stochastically semistable with respect to R ¯ + n .
Proof. 
Since, by assumption, (4) is Lyapunov stable in probability with respect to R ¯ + n for all z M , there exists a relatively open neighborhood V z of z such that s ( [ 0 , ) × V z B ε ( z ) ) , ε > 0 , is bounded and contained in Q as ε 0 . The set V ε z M V z B ε ( z ) is a relatively open neighborhood of M contained in Q . Consider x V ε so that there exists z M such that x V z B ε ( z ) and s ( t , x ) H n V z B ε ( z ) , t 0 , as ε 0 . Since V z is bounded it follows that V ε is invariant with respect to the solution of (4) as ε 0 . Furthermore, it follows from (53) that L V ( s ( t , x ) ) η ( V ( s ( t , x ) ) ) , t 0 , and hence, since V ε is bounded and invariant with respect to the solution of (4) as ε 0 , it follows from Theorem 2 that lim t η ( V ( s ( t , x ) ) ) = a . s . 0 as ε 0 . Therefore, s ( t , x ) a . s . M as t and ε 0 , which implies that lim dist ( x , M ) 0 P x lim t dist ( s ( t , x ) , M ) = 0 = 1 .
Finally, since every point in M is Lyapunov stable in probability with respect to R ¯ + n , it follows from Proposition 2 that lim t s ( t , x ) = a . s . x * as x x * , where x * M is Lyapunov stable in probability with respect to R ¯ + n . Hence, by definition, (4) is semistable. For Q = R ¯ + n global stochastic semistability with respect to R ¯ + n follows from identical arguments using the radially unbounded condition on V ( · ) . ☐
Example 1.
Consider the nonlinear stochastic nonnegative dynamical system on H 2 given by ([37])
d x 1 ( t ) = [ σ 12 ( x 2 ( t ) ) σ 21 ( x 1 ( t ) ) ] d t + γ ( x 2 ( t ) x 1 ( t ) ) d w ( t ) , x 1 ( 0 ) = a . s . x 10 , t 0 ,
d x 2 ( t ) = [ σ 21 ( x 1 ( t ) ) σ 12 ( x 2 ( t ) ) ] d t + γ ( x 1 ( t ) x 2 ( t ) ) d w ( t ) , x 2 ( 0 ) = a . s . x 20 ,
where σ i j ( · ) , i , j = 1 , 2 , i j , are Lipschitz continuous and γ > 0 . Equations (54) and (55) represent the collective dynamics of two subsystems that interact by exchanging energy. The energy states of the subsystems are described by the scalar random variables x 1 and x 2 . The unity coefficients scaling σ i j ( · ) , i , j { 1 , 2 } , i j , appearing in (54) and (55) represent the topology of the energy exchange between the subsystems. More specifically, given i , j { 1 , 2 } , i j , a coefficient of 1 denotes that subsystem j receives energy from subsystem i, and a coefficient of zero denotes that subsystems i and j are disconnected, and hence, cannot exchange energy.
The connectivity between the subsystems can be represented by a graph G having two nodes such that G has a directed edge from node i to node j if and only if subsystem j can receive energy from subsystem i. Since the coefficients scaling σ i j ( · ) , i , j { 1 , 2 } , i j , are constants, the graph topology is fixed. Furthermore, note that the directed graph G is weakly connected since the underlying undirected graph is connected; that is, every subsystem receives energy from, or delivers energy to, at least one other subsystem.
Note that (54) and (55) can be cast in the form of (4) with
f ( x ) = σ 12 ( x 2 ) σ 21 ( x 1 ) σ 21 ( x 1 ) σ 12 ( x 2 ) , D ( x ) = γ ( x 2 x 1 ) γ ( x 1 x 2 ) ,
where the stochastic term D ( x ) d w represents probabilistic variations in the energy transfer between the two subsystems. Furthermore, note that since
e 2 T d x ( t ) = e 2 T f ( x ( t ) ) d t + e 2 T D ( x ( t ) ) d w ( t ) = 0 , x ( 0 ) = a . s . x 0 , t 0 ,
where e 2 [ 1 1 ] T , it follows that d x 1 ( t ) + d x 2 ( t ) = 0 , which implies that the total system energy is conserved.
In this example, we use Theorem 3 to analyze the collective behavior of (54) and (55). Specifically, we are interested in the energy equipartitioning behavior of the subsystems. For this purpose, we make the assumptions σ i j ( x j ) − σ j i ( x i ) = 0 if and only if x i = x j , i ≠ j , and ( x i − x j ) [ σ i j ( x j ) − σ j i ( x i ) ] ≤ − γ 2 ( x 1 − x 2 ) 2 for i , j ∈ { 1 , 2 } .
The first assumption implies that if the energies in the connected subsystems i and j are equal, then energy exchange between the subsystems is not possible. This statement is reminiscent of the zeroth law of thermodynamics, which postulates that temperature equality is a necessary and sufficient condition for thermal equilibrium. The second assumption implies that energy flows from more energetic subsystems to less energetic subsystems and is reminiscent of the second law of thermodynamics, which states that heat (energy) must flow in the direction of lower temperatures. It is important to note here that due to the stochastic term D ( x ) d w capturing probabilistic variations in the energy transfer between the subsystems, the second assumption requires that the scaled net energy flow ( x i x j ) [ σ i j ( x j ) σ j i ( x i ) ] is bounded by the negative intensity of the diffusion coefficient given by 1 2 tr D ( x ) D T ( x ) .
To show that (54) and (55) is stochastically semistable with respect to R ¯ + 2 , note that E = f 1 ( 0 ) D 1 ( 0 ) = { ( x 1 , x 2 ) R ¯ + 2 : x 1 = x 2 = α , α R ¯ + } and consider the Lyapunov function candidate V ( x 1 , x 2 ) = 1 2 ( x 1 α ) 2 + 1 2 ( x 2 α ) 2 , where α R ¯ + . Now, it follows that
L V ( x 1 , x 2 ) = ( x 1 α ) [ σ 12 ( x 2 ) σ 21 ( x 1 ) ] + ( x 2 α ) [ σ 21 ( x 1 ) σ 12 ( x 2 ) ] + 1 2 [ ( γ ( x 2 x 1 ) ) 2 + ( γ ( x 1 x 2 ) ) 2 ] = x 1 [ σ 12 ( x 2 ) σ 21 ( x 1 ) ] + x 2 [ σ 21 ( x 1 ) σ 12 ( x 2 ) ] + ( γ ( x 1 x 2 ) ) 2 = ( x 1 x 2 ) [ σ 12 ( x 2 ) σ 21 ( x 1 ) + γ 2 ( x 1 x 2 ) ] 0 , ( x 1 , x 2 ) R ¯ + × R ¯ + ,
which implies that x 1 = x 2 = α is Lyapunov stable in probability with respect to R ¯ + 2 .
Next, it is easy to see that L V ( x 1 , x 2 ) 0 when x 1 x 2 , and hence, L V ( x 1 , x 2 ) < 0 , ( x 1 , x 2 ) R ¯ + 2 \ E . Therefore, it follows from Theorem 3 that x 1 = x 2 = α is stochastically semistable with respect to R ¯ + 2 for all α ∈ R ¯ + . Furthermore, note that e 2 T d x ( t ) = a . s . 0 , t 0 , implies
x ( t ) a . s . 1 2 e 2 e 2 T x ( 0 ) = a . s . 1 2 [ x 1 ( 0 ) + x 2 ( 0 ) ] e 2 as t .
Note that an identical assertion holds for the collective dynamics of n subsystems with a connected undirected energy graph topology.     △
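As an illustration of the equipartitioning behavior established above, the following Euler–Maruyama sketch simulates (54) and (55) for the hypothetical choice σ 12 ( x 2 ) = x 2 and σ 21 ( x 1 ) = x 1 , which satisfies both assumptions of this example whenever γ 2 ≤ 1 ; the step size, horizon, and initial condition are illustrative.

import numpy as np

gamma, dt, n_steps = 0.5, 1e-3, 20_000
rng = np.random.default_rng(1)

x = np.array([4.0, 1.0])                  # x1(0), x2(0)
for _ in range(n_steps):
    dw = rng.normal(0.0, np.sqrt(dt))     # common Wiener increment for both states
    drift = np.array([x[1] - x[0], x[0] - x[1]])
    diff = gamma * np.array([x[1] - x[0], x[0] - x[1]])
    x = x + drift * dt + diff * dw

print("final state:", x)                  # both entries should be close to 2.5
print("total energy:", x.sum())           # stays at x1(0) + x2(0) = 5 up to round-off

Because the drift and diffusion columns each sum to zero, the total energy x 1 ( t ) + x 2 ( t ) is conserved along every sample path, and a run with these parameters should end with both states near ½ [ x 1 ( 0 ) + x 2 ( 0 ) ] , consistent with the limit displayed above.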
Finally, we extend Theorem 3.3 of [37] to provide a converse Lyapunov theorem for stochastic semistability. For this result, recall that L V ( x e ) = 0 for every x e E . Also note that it follows from (9) that L V ( x ) = L V ( s ( 0 , x ) ) .
Theorem 5.
Consider the nonlinear stochastic nonnegative dynamical system (4). Suppose (4) is stochastically semistable with a ρ-domain of semistability D 0 . Then there exist a continuous nonnegative function V : D 0 R ¯ + and a class K function α ( · ) such that (i) V ( x ) = 0 , x E , ( i i ) V ( x ) α ( dist ( x , E ) ) , x D 0 , and ( i i i ) L V ( x ) < 0 , x D 0 \ E .
Proof. 
Let B x 0 denote the set of all sample trajectories of (4) for which lim t dist ( x ( t , ω ) , E ) = 0 and x ( { t 0 } , ω ) B x 0 , ω Ω , and let 𝟙 B x 0 ( ω ) , ω Ω , denote the indicator function defined on the set B x 0 , that is,
𝟙 B x 0 ( ω ) 1 , if x ( { t 0 } , ω ) B x 0 , 0 , otherwise .
Note that by definition P x 0 B x 0 1 ρ for all x 0 D 0 . Define the function V : D 0 R ¯ + by
V ( x ) sup t 0 1 + 2 t 1 + t E dist ( s ( t , x ) , E ) 𝟙 B x ( ω ) , x D 0 ,
and note that V ( · ) is well defined since (4) is stochastically semistable with respect to R ¯ + n . Clearly, (i) holds. Furthermore, since V ( x ) dist ( x , E ) , x D 0 , it follows that (ii) holds with α ( r ) = r .
To show that V ( · ) is continuous on D 0 \ E , define T : D 0 \ E [ 0 , ) by T ( z ) inf { h : E dist ( s ( h , z ) , E ) 𝟙 B z ( ω ) < dist ( z , E ) / 2 for all t h > 0 } , and denote
W ε x D 0 : P x sup t 0 dist ( s ( t , x ) , E ) ε 1 ρ .
Note that W ε E is open and contains an open neighborhood of E . Consider z D 0 \ E and define λ dist ( z , E ) > 0 . Then it follows from stochastic semistability of (4) that there exists h > 0 such that P z s ( h , z ) W λ / 2 1 ρ . Consequently, P z s ( h + t , z ) W λ / 2 1 ρ for all t 0 , and hence, it follows that T ( z ) is well defined. Since W λ / 2 is open, there exists a neighborhood B σ ( s ( T ( z ) , z ) such that P z B σ ( s ( T ( z ) , z ) ) W λ / 2 1 ρ . Hence, N D 0 is a neighborhood of z such that s T ( z ) ( H n N ) B σ ( s ( T ( z ) , z ) ) .
Next, choose η > 0 such that η < λ / 2 and B η ( z ) N . Then, for every t > T ( z ) and y B η ( z ) ,
[ ( 1 + 2 t ) / ( 1 + t ) ] E dist ( s ( t , y ) , E ) 𝟙 B y ( ω ) 2 E dist ( s ( t , y ) , E ) 𝟙 B y ( ω ) λ .
Therefore, for every y B η ( z ) ,
V ( z ) V ( y ) = sup t 0 1 + 2 t 1 + t E dist ( s ( t , z ) , E ) 𝟙 B z ( ω ) sup t 0 1 + 2 t 1 + t E dist ( s ( t , y ) , E ) 𝟙 B y ( ω ) = sup 0 t T ( z ) 1 + 2 t 1 + t E dist ( s ( t , z ) , E ) 𝟙 B z ( ω ) sup 0 t T ( z ) 1 + 2 t 1 + t E dist ( s ( t , y ) , E ) 𝟙 B y ( ω ) .
Hence,
| V ( z ) V ( y ) | sup 0 t T ( z ) | 1 + 2 t 1 + t ( E dist ( s ( t , z ) , E ) 𝟙 B z ( ω ) E dist ( s ( t , y ) , E ) 𝟙 B y ( ω ) ) | 2 sup 0 t T ( z ) E dist ( s ( t , z ) , E ) 𝟙 B z ( ω ) E dist ( s ( t , y ) , E ) 𝟙 B y ( ω ) , (60)
2 sup 0 t T ( z ) E dist ( s ( t , z ) , s ( t , y ) ) , z D 0 \ E , y B η ( z ) . (61)
Now, since f ( · ) and D ( · ) satisfy (6) and (7), it follows from continuous dependence of solutions s ( · , · ) on system initial conditions ([32], Theorem 7.3.1) and (60) that V ( · ) is continuous on D 0 \ E .
To show that V ( · ) is continuous on E , consider x e E . Let { x n } n = 1 be a sequence in D 0 \ E that converges to x e . Since x e is Lyapunov stable in probability with respect to R ¯ + n , it follows that x ( t ) a . s . x e is the unique solution to (4) with x ( 0 ) = a . s . x e . By continuous dependence of solutions s ( · , · ) on system initial conditions ([32], Theorem 7.3.1), s ( t , x n ) a . s . s ( t , x e ) = a . s . x e as n , t 0 .
Let ε > 0 and note that it follows from (ii) of Proposition 3 that there exists δ = δ ( x e ) > 0 such that for every solution of (4) in B δ ( x e ) there exists T ^ = T ^ ( x e , ε ) > 0 such that P s t ( H n B δ ( x e ) ) W ε 1 ρ for all t T ^ . Next, note that there exists a positive integer N 1 such that x n B δ ( x e ) for all n N 1 . Now, it follows from (57) that
V ( x n ) 2 sup 0 t T ^ E [ dist ( s ( t , x n ) , E ) 𝟙 B x n ( ω ) ] + 2 ε , n N 1 .
Next, it follows from ([32], Theorem 7.3.1) that E [ | s ( · , x n ) | ] converges to E [ | s ( · , x e ) | ] uniformly on [ 0 , T ^ ] . Hence,
lim n sup 0 t T ^ E dist ( s ( t , x n ) , E ) 𝟙 B x n ( ω ) = sup 0 t T ^ E lim n dist ( s ( t , x n ) , E ) 𝟙 B x n ( ω ) sup 0 t T ^ dist ( x e , E ) = 0 ,
which implies that there exists a positive integer N 2 = N 2 ( x e , ε ) N 1 such that
sup 0 t T ^ E dist ( s ( t , x n ) , E ) 𝟙 B x n ( ω ) < ε
for all n N 2 . Combining (62) with the above result yields V ( x n ) < 4 ε for all n N 2 , which implies that lim n V ( x n ) = 0 = V ( x e ) .
Finally, we show that L V ( x ( t ) ) is negative along the solution of (4) on D 0 \ E . Note that for every x D 0 \ E and 0 < h 1 / 2 such that P s ( h , x ) D 0 \ E 1 ρ , it follows from the definition of T ( · ) that E V ( s ( h , x ) ) is reached at some time t ^ such that 0 t ^ T ( x ) . Hence, it follows from the law of iterated expectation that
E V ( s ( h , x ) ) = E E dist ( s ( t ^ + h , x ) , E ) 𝟙 B s ( h , x ) ( ω ) 1 + 2 t ^ 1 + t ^ = E dist ( s ( t ^ + h , x ) , E ) 𝟙 B x ( ω ) 1 + 2 t ^ + 2 h 1 + t ^ + h 1 h ( 1 + 2 t ^ + 2 h ) ( 1 + t ^ ) V ( x ) 1 h 2 ( 1 + T ( x ) ) 2 ,
which implies that
L V ( x ) = lim h 0 + E V ( s ( h , x ) ) V ( x ) h 1 2 V ( x ) ( 1 + T ( x ) ) 2 < 0 , x D 0 \ E ,
and hence, (iii) holds. ☐

5. Conservation of Energy and the First Law of Thermodynamics: A Stochastic Perspective

In this section, we extend the thermodynamic model proposed in [30] to include probabilistic variations in the instantaneous rate of energy dissipation as well as probabilistic variations in the energy transfer between the subsystems. Even though the treatment in this and the next two sections closely parallels that of [30] for deterministic thermodynamics, the thermodynamic models and proofs of our results are rendered more difficult due to the inclusion of stochastic disturbances. To formulate our state space stochastic thermodynamic model, we consider the large-scale stochastic dynamical system G shown in Figure 1 involving energy exchange between q interconnected subsystems and use the notation developed in [30].
Specifically, E i : [ 0 , ) R ¯ + denotes the energy (and hence a nonnegative quantity) of the i-th subsystem, S i : [ 0 , ) R denotes the external power (heat flux) supplied to (or extracted from) the i-th subsystem, σ i j : R ¯ + q R ¯ + , i j , i , j = 1 , , q , denotes the instantaneous rate of energy (heat) flow from the j-th subsystem to the i-th subsystem, J ( i , k ) : R ¯ + q R ¯ + , i = 1 , , q , k = 1 , , d 1 , denotes the instantaneous rate of energy (heat) received or delivered to the i-th subsystem from all other subsystems due to the stochastic disturbance w 1 k ( · ) , σ i i : R ¯ + q R ¯ + , i = 1 , , q , denotes the instantaneous rate of energy (heat) dissipation from the i-th subsystem to the environment, and D ( i , l ) : R ¯ + q R ¯ + , i = 1 , , q , l = 1 , , d 2 , denotes the instantaneous rate of energy (heat) dissipation from the i-th subsystem to the environment due to the stochastic disturbance w 2 l ( · ) . Here we assume that σ i j : R ¯ + q R ¯ + , i , j = 1 , , q , J ( i , k ) : R ¯ + q R ¯ + , i = 1 , , q , k = 1 , , d 1 , and D ( i , l ) : R ¯ + q R ¯ + , i = 1 , , q , l = 1 , , d 2 , are locally Lipschitz continuous on R ¯ + q and satisfy a linear growth condition, and S i : [ 0 , ) R , i = 1 , , q , are bounded piecewise continuous functions of time.
An energy balance for the i-th subsystem yields
E i ( T ) = E i ( t 0 ) + j = 1 , j i q t 0 T [ σ i j ( E ( t ) ) σ j i ( E ( t ) ) ] d t + t 0 T row i ( J ( E ( t ) ) ) d w 1 ( t ) t 0 T σ i i ( E ( t ) ) d t t 0 T row i ( D ( E ( t ) ) ) d w 2 ( t ) + t 0 T S i ( t ) d t , T t 0 ,
or, equivalently, in vector form,
E ( T ) = E ( t 0 ) + t 0 T f ( E ( t ) ) d t + t 0 T J ( E ( t ) ) d w 1 ( t ) t 0 T d ( E ( t ) ) d t t 0 T D ( E ( t ) ) d w 2 ( t ) + t 0 T S ( t ) d t , T t 0 ,
where E ( t ) [ E 1 ( t ) , , E q ( t ) ] T , w 1 ( · ) and w 2 ( · ) are, respectively, a d 1 -dimensional and d 2 -dimensional independent standard Wiener process (i.e., Brownian motion) defined on a complete filtered probability space ( Ω , F , { F t } t t 0 , P ) , E ( t 0 ) is independent of ( w 1 ( t ) w 1 ( t 0 ) ) , t t 0 , and ( w 2 ( t ) w 2 ( t 0 ) ) , t t 0 ,
d ( E ( t ) ) [ σ 11 ( E ( t ) ) , , σ q q ( E ( t ) ) ] T , S ( t ) [ S 1 ( t ) , , S q ( t ) ] T , f ( E ) = [ f 1 ( E ) , , f q ( E ) ] T : R ¯ + q R q , J ( E ) = [ row 1 ( J ( E ) ) , , row q ( J ( E ) ) ] T : R ¯ + q R q × R d 1 , D ( E ) = [ row 1 ( D ( E ) ) , , row q ( D ( E ) ) ] T : R ¯ + q R q × R d 2 .
Here, the stochastic disturbance J ( E ) d w 1 in (66) captures probabilistic variations in the energy transfer rates between compartments and the stochastic disturbance D ( E ) d w 2 captures probabilistic variations in the instantaneous rate of energy dissipation.
Equivalently, (65) can be rewritten as
d E i ( t ) = j = 1 , j i q [ σ i j ( E ( t ) ) σ j i ( E ( t ) ) ] d t + row i ( J ( E ( t ) ) ) d w 1 ( t ) σ i i ( E ( t ) ) d t row i ( D ( E ( t ) ) ) d w 2 ( t ) + S i ( t ) d t , E i ( t 0 ) = a . s . E i 0 , t t 0 ,
or, in vector form,
d E ( t ) = f ( E ( t ) ) d t + J ( E ( t ) ) d w 1 ( t ) d ( E ( t ) ) d t D ( E ( t ) ) d w 2 ( t ) + S ( t ) d t , E ( t 0 ) = a . s . E 0 , t t 0 ,
where E 0 [ E 10 , , E q 0 ] T , yielding a differential energy balance equation that characterizes energy flow between subsystems of the large-scale stochastic dynamical system G . Here we assume that S ( · ) satisfies sufficient regularity conditions such that (68) has a unique solution forward in time. Specifically, we assume that the external power (heat flux) S ( · ) supplied to the large-scale stochastic dynamical system G consists of measurable functions S ( · ) adapted to the filtration { F t } t t 0 such that S ( t ) H q , t t 0 , for all t s , w ( t ) w ( s ) is independent of S ( τ ) , w ( τ ) , τ s , and E ( t 0 ) , where w ( t ) [ w 1 T ( t ) , w 2 T ( t ) ] T , and hence, S ( · ) is non-anticipative. Furthermore, we assume that S ( · ) takes values in a compact metrizable set. In this case, it follows from Theorem 2.2.4 of [53] that there exists a path-wise unique solution to (68) in ( Ω , { F t } t t 0 , P E 0 ) .
Equation (66) or, equivalently, (68) is a statement of the first law for stochastic thermodynamics as applied to isochoric transformations (i.e., constant subsystem volume transformations) for each of the subsystems G i , i = 1 , , q . To see this, let the total energy in the large-scale stochastic dynamical system G be given by U e T E , where e T [ 1 , , 1 ] and E R ¯ + q , and let the net energy received by the large-scale dynamical system G over the time interval [ t 1 , t 2 ] be given by
Q t 1 t 2 e T [ S ( t ) d ( E ( t ) ) ] d t t 1 t 2 e T D ( E ( t ) ) d w 2 ( t ) ,
where E ( t ) , t t 0 , is the solution to (68). Then, premultiplying (66) by e T and using the fact that e T f ( E ) 0 and e T J ( E ) 0 , it follows that
Δ U = Q ,
where Δ U U ( t 2 ) U ( t 1 ) denotes the variation in the total energy of the large-scale stochastic dynamical system G over the time interval [ t 1 , t 2 ] .
For our large-scale stochastic dynamical system model G , we assume that σ i j ( E ) = 0 , E R ¯ + q , σ j j ( E ) = 0 , E R ¯ + q , J ( j , k ) ( E ) = 0 , E R ¯ + q , k = 1 , , d 1 , and D ( j , l ) ( E ) = 0 , E R ¯ + q , l = 1 , , d 2 , whenever E j = 0 , j = 1 , , q . In this case, f ( E ) d ( E ) , E R ¯ + q , is essentially nonnegative. The above constraint implies that if the energy of the j-th subsystem of G is zero, then this subsystem cannot supply any energy to its surroundings nor dissipate energy to the environment. Moreover, we assume that S i ( t ) 0 whenever E i ( t ) = 0 , t t 0 , i = 1 , , q , which implies that when the energy of the i-th subsystem is zero, then no energy can be extracted from this subsystem.
The following proposition is needed for the main results of this paper.
Proposition 4.
Consider the large-scale stochastic dynamical system G with differential energy balance equation given by (68). Suppose σ i j ( E ) = 0 , E R ¯ + q , J ( j , k ) ( E ) = 0 , E R ¯ + q , k = 1 , , d 1 , and D ( j , l ) ( E ) = 0 , E R ¯ + q , l = 1 , , d 2 , whenever E j = 0 , j = 1 , , q , and S i ( t ) 0 whenever E i ( t ) = 0 , t t 0 , i = 1 , , q . Then the solution E ( t ) , t t 0 , to (68) is nonnegative for all nonnegative initial conditions E 0 R ¯ + q .
Proof. 
First note that f ( E ) d ( E ) , E R ¯ + q , is essentially nonnegative, J ( j , k ) ( E ) = 0 , E R ¯ + q , k = 1 , , d 1 , and D ( j , l ) ( E ) = 0 , E R ¯ + q , l = 1 , , d 2 , whenever E j = 0 , j = 1 , , q . Next, since S i ( t ) 0 whenever E i ( t ) = 0 , t t 0 , i = 1 , , q , it follows that d E i ( t ) 0 for all t t 0 and i = 1 , , q whenever E i ( t ) = 0 and E j ( t ) 0 for all j i and t t 0 . This implies that for all nonnegative initial conditions E 0 R ¯ + q , every sample trajectory of G is directed towards the interior of the nonnegative orthant R ¯ + q whenever E i ( t ) = 0 , i = 1 , , q , and hence, remains nonnegative almost surely for all t t 0 . ☐
Next, premultiplying (66) by e T , using Proposition 4, and using the fact that e T f ( E ) 0 and e T J ( E ) 0 , it follows that
e T E ( T ) = e T E ( t 0 ) + t 0 T e T S ( t ) d t t 0 T e T d ( E ( t ) ) d t t 0 T e T D ( E ( t ) ) d w 2 ( t ) , T t 0 .
Now, for the large-scale stochastic dynamical system G , define the input u ( t ) S ( t ) and the output y ( t ) d ( E ( t ) ) . Hence, it follows from (71) that for any two F t -stopping times τ 1 and τ 2 such that τ 1 τ 2 almost surely,
E e T E ( τ 2 ) | F τ 1 = e T E ( τ 1 ) + E τ 1 τ 2 e T S ( t ) d t | F τ 1 E τ 1 τ 2 e T d ( E ( t ) ) d t | F τ 1 E τ 1 τ 2 e T D ( E ( t ) ) d w 2 ( t ) | F τ 1 = e T E ( τ 1 ) + E τ 1 τ 2 e T S ( t ) e T d ( E ( t ) ) d t | F τ 1 .
Thus, the large-scale stochastic dynamical system G is stochastically lossless [54] with respect to the energy supply rate r ( u , y ) e T u e T y and with the energy storage function U ( E ) e T E , E R ¯ + q . In other words, the difference between the supplied system energy and the stored system energy is a martingale with respect to the differential energy balance system filtration.
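This martingale characterization can be checked by direct Monte Carlo simulation of the energy balance (68): along sample paths, e T E ( t ) − e T E ( t 0 ) − ∫ e T [ S ( s ) − d ( E ( s ) ) ] d s reduces to a stochastic integral against w 2 and therefore has zero mean. The three-compartment model, rate functions, and numerical coefficients in the following sketch are illustrative assumptions chosen so that e T f ( E ) ≡ 0 and e T J ( E ) ≡ 0 ; they are not prescribed by the development above.

import numpy as np

q, dt, n_steps, n_paths = 3, 1e-3, 2_000, 2_000
rng = np.random.default_rng(2)
S = np.array([1.0, 0.5, 0.2])            # constant external power (heat flux)

def f(E):                                 # pairwise exchange with e^T f(E) = 0
    return E.sum(axis=1, keepdims=True) - q * E

def d(E):                                 # dissipation sigma_ii(E) = 0.1 E_i
    return 0.1 * E

def J(E):                                 # exchange noise with e^T J(E) = 0
    out = np.zeros_like(E)
    out[:, 0] = 0.1 * E[:, 0] * E[:, 1]
    out[:, 1] = -0.1 * E[:, 0] * E[:, 1]
    return out

def D(E):                                 # dissipation noise, zero when E_i = 0
    return 0.05 * E

E = np.tile(np.array([3.0, 1.0, 0.5]), (n_paths, 1))
M = np.zeros(n_paths)                     # candidate martingale, started at zero
for _ in range(n_steps):
    dw1 = rng.normal(0.0, np.sqrt(dt), n_paths)
    dw2 = rng.normal(0.0, np.sqrt(dt), n_paths)
    Dmat = D(E)
    M += -Dmat.sum(axis=1) * dw2          # increment of e^T E - integral of e^T [S - d(E)]
    E = E + (f(E) - d(E) + S) * dt + J(E) * dw1[:, None] - Dmat * dw2[:, None]

print("sample mean of the martingale term (should be near 0):", M.mean())

Since the internal exchange terms cancel under the summation by e T , the accumulated quantity M is, path by path, a discrete stochastic integral against the dissipation noise alone, and its sample mean over many paths should be close to zero.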
The following lemma is required for our next result.
Lemma 2.
Consider the large-scale stochastic dynamical system G with differential energy balance Equation (68). Then, for every equilibrium state E e H q + and every ε > 0 and τ > a . s . 0 , there exist S e H q , α > 0 , and τ > a . s . τ ^ > a . s . 0 such that, for every E ^ H q + with E ^ E e a . s . α τ , there exists S : R ¯ + H q such that S ( t ) S e a . s . ε , t [ 0 , τ ^ ] , and E ( t ) = E e + ( E ^ E e ) τ ^ t , t [ 0 , τ ^ ] .
Proof. 
Note that with S e d ( E e ) f ( E e ) , the state E e R ¯ + q is an equilibrium state of (68). Let θ > 0 and τ > a . s . 0 , and define
M ( θ , τ ) sup E H q B ¯ 1 ( 0 ) , t [ 0 , τ ] f ( E e + θ t E ) d ( E e + θ t E ) + S e ,
M J ( θ , τ ) sup E H q B ¯ 1 ( 0 ) , t [ 0 , τ ] J E e + ( E ^ E e ) E ^ E e α t ,
M D ( θ , τ ) sup E H q B ¯ 1 ( 0 ) , t [ 0 , τ ] D E e + ( E ^ E e ) E ^ E e α t .
Note that for every τ > a . s . 0 , lim θ 0 + M ( θ , τ ) = a . s . 0 , lim θ 0 + M J ( θ , τ ) = a . s . 0 , and lim θ 0 + M D ( θ , τ ) = a . s . 0 , and for every θ > 0 , lim τ a . s . 0 + M ( θ , τ ) = a . s . 0 , lim τ a . s . 0 + M J ( θ , τ ) = a . s . 0 , and lim τ a . s . 0 + M D ( θ , τ ) = a . s . 0 . Moreover, it follows from Lévy’s modulus of continuity theorem [55] that for sufficiently small d t > 0 , d w 1 ( t ) a . s . M W ( d t ) d t and d w 2 ( t ) a . s . M W ( d t ) d t , where M W ( d t ) 2 d t log e 1 d t .
Next, let ε > 0 and τ > a . s . 0 be given and, for sufficiently small d t > 0 , let α > 0 be such that
M ( α , τ ) + α + M J ( α , τ ) M W ( d t ) + M D ( α , τ ) M W ( d t ) a . s . ε .
(The existence of such an α is guaranteed since M ( α , τ ) a . s . 0 , M J ( α , τ ) a . s . 0 , and M D ( α , τ ) a . s . 0 as α 0 + .) Now, let E ^ H q + be such that E ^ E e a . s . α τ . With τ ^ E ^ E e α a . s . τ and
S ( t ) d t = f ( E ( t ) ) + d ( E ( t ) ) + α ( E ^ E e ) E ^ E e d t J ( E ( t ) ) d w 1 ( t ) + D ( E ( t ) ) d w 2 ( t ) , t [ 0 , τ ^ ] ,
it follows that
E ( t ) = E e + ( E ^ E e ) E ^ E e α t , t [ 0 , τ ^ ] ,
is a solution to (68).
The result is now immediate by noting that E ( τ ^ ) = a . s . E ^ and
S ( t ) S e d t a . s . f E e + ( E ^ E e ) E ^ E e α t d E e + ( E ^ E e ) E ^ E e α t + S e d t + α d t + J E e + ( E ^ E e ) E ^ E e α t | d w 1 ( t ) | + D E e + ( E ^ E e ) E ^ E e α t | d w 2 ( t ) | a . s . [ M ( α , τ ) + α + M J ( α , τ ) M W ( d t ) + M D ( α , τ ) M W ( d t ) ] d t , t [ 0 , τ ^ ] ,
and hence,
S ( t ) S e a . s . M ( α , τ ) + α + M J ( α , τ ) M W ( d t ) + M D ( α , τ ) M W ( d t ) a . s . ε ,
which proves the result. ☐
It follows from Lemma 2 that the large-scale stochastic dynamical system G with the differential energy balance Equation (68) is stochastically reachable from and stochastically controllable to the origin in R ¯ + q . Recall from [54] that the large-scale stochastic dynamical system G with the differential energy balance Equation (68) is stochastically reachable from the origin in R ¯ + q if, for all E 0 R ¯ + q and ε > 0 , there exist a finite random variable τ B ε ( E 0 ) a . s . t 0 , called the first hitting time, defined by
τ B ε ( E 0 ) ( ω ) inf { t t 0 : E ( t , ω ) B ε ( E 0 ) } ,
and a F t -adapted square integrable input S ( · ) defined on [ t 0 , τ B ε ( E 0 ) ] such that the state E ( t ) , t t 0 , can be driven from E ( t 0 ) = a . s . 0 to E ( τ B ε ( E 0 ) ) and E τ E 0 < , where τ E 0 sup ε > 0 τ B ε ( E 0 ) and the supremum is taken pointwise. Alternatively, G is stochastically controllable to the origin in R ¯ + q if, for all E ( t 0 ) = a . s . E 0 , E 0 R ¯ + q , there exists a finite random variable τ ˜ B ε ( E 0 ) a . s . t 0 defined by
τ ˜ B ε ( E 0 ) ( ω ) inf { t t 0 : E ( t , ω ) B ε ( 0 ) } ,
and a F t -adapted square integrable input S ( · ) defined on [ t 0 , τ ˜ B ε ( E 0 ) ] such that the state E ( t ) , t t 0 , can be driven from E ( t 0 ) = a . s . E 0 to E ( τ ˜ B ε ( E 0 ) ) B ε ( 0 ) and τ ˜ E 0 sup ε > 0 τ ˜ B ε ( E 0 ) with a pointwise supremum.
We let U r denote the set of measurable bounded H q + -valued stochastic processes on the semi-infinite interval [ t 0 , ) consisting of power inputs (heat fluxes) to the large-scale stochastic dynamical system G such that for every τ E 0 a . s . t 0 the system energy state can be driven from E ( t 0 ) = a . s . 0 to E ( τ E 0 ) by S ( · ) U r . Furthermore, we let U c denote the set of measurable bounded H q + -valued stochastic processes on the semi-infinite interval [ t 0 , ) consisting of power inputs (heat fluxes) to the large-scale stochastic dynamical system G such that the system energy state can be driven from E ( t 0 ) = a . s . E 0 , E 0 R ¯ + q to E ( τ ˜ E 0 ) by S ( · ) U c . Finally, let U be an input space that is a subset of measurable bounded H q + -valued stochastic processes on R . The spaces U r , U c , and U are assumed to be closed under the shift operator, that is, if S ( · ) U (respectively, U c or U r ), then the function S T defined by S T ( t ) S ( t + T ) is contained in U (respectively, U c or U r ) for all T 0 .
The next result establishes the uniqueness of the internal energy function U ( E ) , E R ¯ + q , for our large-scale stochastic dynamical system G . For this result define the available energy of the large-scale stochastic dynamical system G by
U a ( E 0 ) inf u ( · ) U , τ a . s . t 0 E E t 0 τ [ e T u ( t ) e T y ( t ) ] d t | E ( t 0 ) = a . s . E 0 , E 0 R ¯ + q ,
where E ( t ) , t t 0 , is the solution to (68) with E ( t ) = a . s . E 0 and admissible inputs S ( · ) U . The infimum in (79) is taken over all F t -measurable inputs S ( · ) , all finite F t -stopping times τ a . s . 0 , and all system sample paths with initial value E ( t 0 ) = a . s . E 0 and terminal value left free. Furthermore, define the required energy supply of the large-scale stochastic dynamical system G by
U r ( E 0 ) inf u ( · ) U r , τ E 0 a . s . 0 E E 0 τ E 0 [ e T u ( t ) e T y ( t ) ] d t | E ( 0 ) = a . s . 0 , E 0 R ¯ + q .
The infimum in (80) is taken over all system sample paths starting from E ( t 0 ) = a . s . 0 and ending at E ( τ E 0 ) = a . s . E 0 at time t = τ E 0 , and all times t t 0 .
Note that the available energy U a ( E ) is the maximum amount of stored energy (net heat) that can be extracted from the large-scale stochastic dynamical system G at any finite stopping time τ , and the required energy supply U r ( E ) is the minimum amount of energy (net heat) that can be delivered to the large-scale stochastic dynamical system G such that, for all ε > 0 , P 0 lim t τ E 0 E ( t ) B ε ( E 0 ) = 1 .
Theorem 6.
Consider the large-scale stochastic dynamical system G with differential energy balance equation given by (68). Then G is stochastically lossless with respect to the energy supply rate r ( u , y ) = e T u e T y , where u ( t ) S ( t ) and y ( t ) d ( E ( t ) ) , and with the unique energy storage function corresponding to the total energy of the system G given by
U ( E 0 ) = e T E 0 = E E 0 τ 0 [ e T u ( t ) e T y ( t ) ] d t | E ( 0 ) = a . s . E 0 = E E 0 τ E 0 [ e T u ( t ) e T y ( t ) ] d t | E ( 0 ) = a . s . 0 , E 0 R ¯ + q ,
where E ( t ) , t t 0 , is the solution to (68) with admissible input u ( · ) U , E ( τ 0 ) = a . s . 0 , and E ( τ E 0 ) = a . s . E 0 R ¯ + q . Furthermore,
0 U a ( E 0 ) = U ( E 0 ) = U r ( E 0 ) < , E 0 R ¯ + q .
Proof. 
Note that it follows from (71) that G is stochastically lossless with respect to the energy supply rate r ( u , y ) = e T u e T y and with the energy storage function U ( E ) = e T E , E R ¯ + q . Since, by Lemma 2, G is reachable from and controllable to the origin in R ¯ + q , it follows from (71), with E ( t 0 ) = a . s . E 0 R ¯ + q and E ( τ + ) = a . s . 0 for some τ + a . s . t 0 and u ( · ) U , that
e T E 0 = E E t 0 τ + [ e T u ( t ) e T y ( t ) ] d t | E ( t 0 ) = a . s . E 0 sup u ( · ) U , τ + a . s . t 0 E E t 0 τ + [ e T u ( t ) e T y ( t ) ] d t | E ( t 0 ) = a . s . E 0 = inf u ( · ) U , τ + a . s . t 0 E E t 0 τ + [ e T u ( t ) e T y ( t ) ] d t | E ( t 0 ) = a . s . E 0 = U a ( E 0 ) , E 0 R ¯ + q .
Alternatively, it follows from (71), with E ( 0 ) = a . s . 0 for some τ a . s . 0 and u ( · ) U r , that
e T E 0 = E E 0 τ [ e T u ( t ) e T y ( t ) ] d t | E ( 0 ) = a . s . 0 inf u ( · ) U r , τ a . s . 0 E E 0 τ [ e T u ( t ) e T y ( t ) ] d t | E ( 0 ) = a . s . 0 = U r ( E 0 ) , E 0 R ¯ + q .
Thus, (83) and (84) imply that (81) is satisfied and
U r ( E 0 ) e T E 0 U a ( E 0 ) , E 0 R ¯ + q .
Conversely, it follows from (71) and the fact that U ( E ) = e T E 0 , E R ¯ + q , that, for all τ a . s . t 0 and u ( · ) U ,
e T E 0 E E t 0 τ [ e T u ( t ) e T y ( t ) ] d t | E ( t 0 ) = a . s . E 0 , E 0 R ¯ + q ,
which implies that
e T E ( t 0 ) sup u ( · ) U , τ a . s . t 0 E E t 0 τ [ e T u ( t ) e T y ( t ) ] d t | E ( t 0 ) = a . s . E 0 = inf u ( · ) U , τ a . s . t 0 E E t 0 τ [ e T u ( t ) e T y ( t ) ] d t | E ( t 0 ) = a . s . E 0 = U a ( E 0 ) , E 0 R ¯ + q .
Furthermore, it follows from the definition of U a ( · ) that U a ( E ) 0 , E R ¯ + q , since the infimum in (79) is taken over the set of values containing the zero value ( τ = a . s . t 0 ).
Next, note that it follows from (71), with E ( 0 ) = a . s . 0 and E ( τ ) = a . s . E 0 , E 0 R ¯ + q , for all τ a . s . 0 and u ( · ) U r , that
e T E 0 = E E 0 τ [ e T u ( t ) e T y ( t ) ] d t | E ( 0 ) = a . s . 0 = inf u ( · ) U r , τ a . s . 0 E E 0 τ [ e T u ( t ) e T y ( t ) ] d t | E ( 0 ) = a . s . 0 = U r ( E 0 ) , E 0 R ¯ + q .
Moreover, since the system G is reachable from the origin, it follows that for every E 0 R ¯ + q , there exists τ a . s . 0 and u ( · ) U r such that
E E 0 τ [ e T u ( t ) e T y ( t ) ] d t | E ( 0 ) = a . s . 0
is finite, and hence, U r ( E 0 ) < , E 0 R ¯ + q . Finally, combining (85), (87), and (88), it follows that (82) holds. ☐
It follows from (82) and the definitions of available energy U a ( E 0 ) and the required energy supply U r ( E 0 ) , E 0 R ¯ + q , that the large-scale stochastic dynamical system G can deliver to its surroundings all of its stored subsystem energies and can store all of the work done to all of its subsystems. This is in essence a statement of the first law of stochastic thermodynamics and places no limitation on the possibility of transforming heat into work or work into heat. In the case where S ( t ) 0 , it follows from (71) and the fact that σ i i ( E ) 0 , E R ¯ + q , i = 1 , , q , that the zero solution E ( t ) 0 of the large-scale stochastic dynamical system G with the differential energy balance Equation (68) is Lyapunov stable in probability with respect to R ¯ + q with Lyapunov function U ( E ) corresponding to the total energy in the system.

6. Entropy and the Second Law of Thermodynamics

As for the deterministic dynamical thermodynamic model presented in [30], the nonlinear differential energy balance Equation (68) can exhibit a full range of nonlinear behavior, including bifurcations, limit cycles, and even chaos. However, a thermodynamically consistent energy flow model should ensure that the evolution of the system energy is diffusive in character with convergent subsystem energies. As established in [30], such a system model would guarantee the absence of the Poincaré recurrence phenomenon [56]. To ensure a thermodynamically consistent energy flow model, we require the following axioms [57]. For the statement of these axioms, we first recall the following graph-theoretic notions.
Definition 10
([58]). A directed graph G ( C ) associated with the connectivity matrix C R q × q has vertices { 1 , 2 , , q } and an arc from vertex i to vertex j, i j , if and only if C ( j , i ) 0 . A graph G ( C ) associated with the connectivity matrix C R q × q is a directed graph for which the arc set is symmetric, that is, C = C T . We say that G ( C ) is strongly connected if for any ordered pair of vertices ( i , j ) , i j , there exists a path (i.e., a sequence of arcs) leading from i to j.
Recall that the connectivity matrix C R q × q is irreducible, that is, there does not exist a permutation matrix such that C is cogredient to a lower-block triangular matrix, if and only if G ( C ) is strongly connected (see Theorem 2.7 of [58]). Let ϕ i j ( E ) σ i j ( E ) σ j i ( E ) , E R ¯ + q , denote the net energy flow from the j-th subsystem G j to the i-th subsystem G i of the large-scale stochastic dynamical system G .
Axiom (i):
For the connectivity matrix C R q × q associated with the large-scale stochastic dynamical system G defined by
C ( i , j ) 0 , if ϕ i j ( E ) 0 , 1 , otherwise , i j , i , j = 1 , , q ,
and
C ( i , i ) k = 1 , k i q C ( k , i ) , i = j , i = 1 , , q ,
rank C = q 1 , and for C ( i , j ) = 1 , i j , ϕ i j ( E ) = 0 if and only if E i = E j .
Axiom (ii):
For i , j = 1 , , q , ( E i E j ) ϕ i j ( E ) a . s . 0 , E R ¯ + q , and, for all c > 0 ,
j = 1 , j i q ( E i E j ) ϕ i j ( E ) c + E i c + E j row i ( J ( E ) ) row i T ( J ( E ) ) , i = 1 , , q .
As discussed in [30] for the deterministic thermodynamic problem, the fact that ϕ i j ( E ) = 0 if and only if E i = E j , i j , implies that subsystems G i and G j of G are connected; alternatively, ϕ i j ( E ) 0 implies that G i and G j are disconnected. Axiom (i) implies that if the energies in the connected subsystems G i and G j are equal, then energy exchange between these subsystems is not possible. This statement is consistent with the zeroth law of thermodynamics, which postulates that temperature equality is a necessary and sufficient condition for thermal equilibrium. Furthermore, it follows from the fact that C = C T and rank C = q 1 that the connectivity matrix C is irreducible, which implies that for any pair of subsystems G i and G j , i j , of G there exists a sequence of connectors (arcs) of G that connect G i and G j .
Axiom (ii) implies that energy flows from more energetic subsystems to less energetic subsystems and is consistent with the second law of thermodynamics, which states that heat (energy) must flow in the direction of lower temperatures [59]. Furthermore, note that ϕ i j ( E ) = ϕ j i ( E ) , E R ¯ + q , i j , i , j = 1 , , q , which implies conservation of energy between lossless subsystems. With S ( t ) 0 and J ( E ) = 0 , Axioms (i) and (ii) along with the fact that ϕ i j ( E ) = ϕ j i ( E ) , E R ¯ + q , i j , i , j = 1 , , q , imply that at a given instant of time, energy can only be transported, stored, or dissipated but not created, and the maximum amount of energy that can be transported and/or dissipated from a subsystem cannot exceed the energy in the subsystem. Finally, it is important to note here that due to the stochastic disturbance term J ( E ) d w 1 capturing probabilistic variations in heat transfer between the subsystems, Axiom (ii) requires that the scaled net energy flow between the subsystems is bounded by the negative intensity of the system diffusion.
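As a concrete illustration, the following sketch checks Axioms (i) and (ii) numerically for a hypothetical three-compartment chain with ϕ 12 ( E ) = E 2 − E 1 , ϕ 23 ( E ) = E 3 − E 2 , and ϕ 13 ( E ) ≡ 0 , and with exchange-noise rows scaled by a constant k. The model and the value k = 0.1 are assumptions for illustration only; with c = 1 and energies sampled from [ 0 , 5 ] , this choice makes the Axiom (ii) bound hold on the sampled box.

import numpy as np

q, c, k = 3, 1.0, 0.1
C = np.array([[-1.0, 1.0, 0.0],
              [1.0, -2.0, 1.0],
              [0.0, 1.0, -1.0]])          # connectivity matrix of the chain
print("rank C =", np.linalg.matrix_rank(C))   # q - 1 = 2, as Axiom (i) requires

def phi(E):
    # phi[i, j] = net energy flow from compartment j to compartment i
    P = np.zeros((q, q))
    P[0, 1], P[1, 0] = E[1] - E[0], E[0] - E[1]
    P[1, 2], P[2, 1] = E[2] - E[1], E[1] - E[2]
    return P

def J_rows(E):
    # row_i(J): exchange noise intensities; columns sum to zero (energy conserving)
    return np.array([[k * (E[1] - E[0]), 0.0],
                     [k * (E[0] - E[1]), k * (E[2] - E[1])],
                     [0.0, k * (E[1] - E[2])]])

rng = np.random.default_rng(3)
ok = True
for _ in range(10_000):
    E = rng.uniform(0.0, 5.0, size=q)
    P, Jr = phi(E), J_rows(E)
    for i in range(q):
        lhs = sum((E[i] - E[j]) * P[i, j] / ((c + E[i]) * (c + E[j]))
                  for j in range(q) if j != i)
        rhs = -Jr[i] @ Jr[i]
        ok &= lhs <= rhs + 1e-12
print("Axiom (ii) bound holds on all samples:", ok)

For this chain each term ( E i − E j ) ϕ i j ( E ) equals − ( E i − E j ) 2 , so the scaled net energy flow is indeed dominated by the negative noise intensity whenever k is small enough relative to c and the energy range, which is what the sampled check confirms.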
Next, we show that the classical Clausius equality and inequality for reversible and irreversible thermodynamics over cyclic motions are satisfied for our stochastic thermodynamically consistent energy flow model. For this result ∮ denotes a cyclic integral evaluated along an arbitrary closed path of (68) in R ¯ + q ; that is, t 0 τ f with τ f a . s . t 0 and S ( · ) U such that E ( τ f ) = a . s . E ( t 0 ) = a . s . E 0 R ¯ + q .
Proposition 5.
Consider the large-scale stochastic dynamical system G with differential energy balance Equation (68), and assume that Axioms (i) and (ii) hold. Then, for all E 0 R ¯ + q , τ f t 0 , and S ( · ) U such that E ( τ f ) = a . s . E ( t 0 ) = a . s . E 0 ,
E E 0 t 0 τ f i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) d t = E E 0 i = 1 q d Q i ( t ) c + E i ( t ) E E 0 t 0 τ f i = 1 q 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t ,
where c > 0 , d Q i ( t ) [ S i ( t ) σ i i ( E ( t ) ) ] d t , i = 1 , , q , is the amount of net energy (heat) received by the i-th subsystem over the infinitesimal time interval d t , and E ( t ) , t t 0 , is the solution to (68) with initial condition E ( t 0 ) = a . s . E 0 . Furthermore,
E E 0 i = 1 q d Q i ( t ) c + E i ( t ) = E E 0 t 0 τ f i = 1 q 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t
if and only if there exists a continuous function α : [ t 0 , τ f ] R ¯ + such that E ( t ) = α ( t ) e , t [ t 0 , τ f ] .
Proof. 
Since, by Proposition 4, E ( t ) 0 , t t 0 , and ϕ i j ( E ) = ϕ j i ( E ) , E R ¯ + q , i j , i , j = 1 , , q , it follows from (68), Ito’s lemma, and Axiom (ii) that, for all τ f a . s . t 0 ,
E E 0 i = 1 q d Q i ( t ) c + E i ( t ) = E E 0 t 0 τ f i = 1 q d E i ( t ) j = 1 , j i q ϕ i j ( E ( t ) ) d t c + E i ( t ) = E E 0 t 0 τ f i = 1 q d E i ( t ) c + E i ( t ) E E 0 t 0 τ f i = 1 q j = 1 , j i q ϕ i j ( E ( t ) ) d t c + E i ( t ) = E E 0 [ t 0 τ f i = 1 q [ d log e c + E i ( t ) + 1 2 row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t + 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t ] ] E E 0 t 0 τ f i = 1 q j = 1 , j i q 1 2 ϕ i j ( E ( t ) ) c + E i ( t ) ϕ i j ( E ( t ) ) c + E j ( t ) d t = E E 0 i = 1 q log e c + E i ( τ f ) c + E i ( t 0 ) E E 0 t 0 τ f i = 1 q j = 1 , j i q 1 2 ϕ i j ( E ( t ) ) [ E j ( t ) E i ( t ) ] ( c + E i ( t ) ) ( c + E j ( t ) ) d t + E E 0 t 0 τ f i = 1 q 1 2 row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t + E E 0 t 0 τ f i = 1 q 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t = E E 0 [ t 0 τ f i = 1 q 1 2 1 ( c + E i ( t ) ) 2 [ j = 1 , j i q ϕ i j ( E ( t ) ) [ E i ( t ) E j ( t ) ] c + E i ( t ) c + E j ( t ) + row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ] d t ] + E E 0 t 0 τ f i = 1 q 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t E E 0 t 0 τ f i = 1 q 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t ,
which proves (92).
To show (93), note that it follows from (94), Axiom (i), and Axiom (ii) that (93) holds if and only if E i ( t ) = a . s . E j ( t ) , t [ t 0 , τ f ] , i j , i , j = 1 , , q , or, equivalently, there exists a continuous function α : [ t 0 , τ f ] R ¯ + such that E ( t ) = a . s . α ( t ) e , t [ t 0 , τ f ] . ☐
Inequality (92) is a generalization of Clausius’ inequality for reversible and irreversible thermodynamics as applied to large-scale stochastic dynamical systems and restricts the manner in which the system dissipates (scaled) heat over cyclic motions. Note that the Clausius inequality (92) for the stochastic thermodynamic model is stronger than the Clausius inequality for the deterministic model presented in [30].
It follows from Axiom (i) and (68) that for the adiabatically isolated large-scale stochastic dynamical system G (that is, S ( t ) 0 and D ( E ( t ) ) 0 ), the energy states given by E e = α e , α 0 , correspond to the equilibrium energy states of G . Thus, as in classical thermodynamics, we can define an equilibrium process as a process in which the trajectory of the large-scale stochastic dynamical system G moves along the equilibrium manifold M e { E R ¯ + q : E = α e , α 0 } corresponding to the set of equilibria of the isolated [60] system G . The power input that can generate such a trajectory can be given by S ( t ) = d ( E ( t ) ) + u ( t ) , t t 0 , where u ( · ) U is such that u i ( t ) u j ( t ) , i j , i , j = 1 , , q . Our definition of an equilibrium transformation involves a continuous succession of intermediate states that differ by infinitesimals from equilibrium system states and thus can only connect initial and final states, which are states of equilibrium. This process need not be slowly varying, and hence, equilibrium and quasistatic processes are not synonymous in this paper. Alternatively, a nonequilibrium process is a process that does not lie on the equilibrium manifold M e . Hence, it follows from Axiom (i) that for an equilibrium process ϕ i j ( E ( t ) ) = 0 , t t 0 , i j , i , j = 1 , , q , and thus, by Proposition 5, inequality (92) is satisfied as an equality. Alternatively, for a nonequilibrium process it follows from Axioms (i) and (ii) that (92) is satisfied as a strict inequality.
Next, we give a stochastic definition of entropy for the large-scale stochastic dynamical system G that is consistent with the classical thermodynamic definition of entropy.
Definition 11.
For the large-scale stochastic dynamical system G with differential energy balance Equation (68), a function S : R ¯ + q R satisfying
E [ S ( E ( τ 2 ) ) | F τ 1 ] ≥ S ( E ( τ 1 ) ) + E [ τ 1 τ 2 i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | F τ 1 ]
for every F t -stopping times τ 2 a . s . τ 1 a . s . t 0 and S ( · ) U is called the entropy function of G .
Note that it follows from Definition 11 that the difference between the system entropy production and the stored system entropy is a submartingale with respect to the differential energy balance filtration.
Next, we show that (92) guarantees the existence of an entropy function for G . For this result define the available entropy of the large-scale stochastic dynamical system G by
S a ( E 0 ) sup S ( · ) U c , τ 0 a . s . t 0 E [ E [ t 0 τ 0 i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | E ( t 0 ) = a . s . E 0 ] ] ,
where E 0 R ¯ + q and E ( τ 0 ) = a . s . 0 , and define the required entropy supply of the large-scale stochastic dynamical system G by
S r ( E 0 ) sup S ( · ) U r , τ E 0 a . s . t 0 E [ E [ t 0 τ E 0 i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | E ( t 0 ) = a . s . 0 ] ] ,
where E ( τ E 0 ) = a . s . E 0 R ¯ + q . Note that the available entropy S a ( E 0 ) is the minimum amount of scaled heat (entropy) that can be extracted from the large-scale stochastic dynamical system G in order to transfer it from an initial state E ( t 0 ) = E 0 to E ( T ) = 0 . Alternatively, the required entropy supply S r ( E 0 ) is the maximum amount of scaled heat (entropy) that can be delivered to G to transfer it from the origin to a given subset in the state space containing the initial state E ( t 0 ) = E 0 over a finite stopping time. For further details, see [54].
Theorem 7.
Consider the large-scale stochastic dynamical system G with differential energy balance Equation (68), and assume that Axiom ( i i ) holds. Then there exists an entropy function for G . Moreover, S a ( E ) , E R ¯ + q , and S r ( E ) , E R ¯ + q , are possible entropy functions for G with S a ( 0 ) = S r ( 0 ) = 0 . Finally, all entropy functions S ( E ) , E R ¯ + q , for G satisfy
S r ( E ) S ( E ) S ( 0 ) S a ( E ) , E R ¯ + q .
Proof. 
Since, by Lemma 2, G is stochastically controllable to and stochastically reachable from the origin in R ¯ + q , it follows from (96) and (97) that S a ( E 0 ) < , E 0 R ¯ + q , and S r ( E 0 ) > , E 0 R ¯ + q , respectively. Next, let E 0 R ¯ + q , and let S ( · ) U be such that E ( τ i ) = a . s . E ( τ f ) = a . s . 0 and E ( τ 0 ) = a . s . E 0 , where τ i < a . s . τ 0 < a . s . τ f . In this case, it follows from (92) that, for all τ i < a . s . τ 0 < a . s . τ f ,
E E τ i τ f i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( τ i ) = a . s . 0 0 .
Next, using the strong Markov property we have
E E τ i τ f i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( τ i ) = E E τ i τ 0 i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t + τ 0 τ f i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | E ( τ i ) = E E τ i τ 0 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( τ i ) + E E τ 0 τ f i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F τ 0 = E E τ i τ 0 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( τ i ) + E E τ 0 τ f i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( τ 0 ) ,
and hence, (99) implies
E E τ i τ 0 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( τ i ) E E τ 0 τ f i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( τ 0 ) .
Now, taking the supremum on both sides of (101) over all S ( · ) U r and τ i a . s . τ 0 yields
S r ( E 0 ) = sup S ( · ) U r , τ i a . s . τ 0 E [ E [ τ i τ 0 i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( τ i ) ] ] E E τ 0 τ f i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( τ 0 ) .
Next, taking the infimum on both sides of (102) over all S ( · ) U c and τ f a . s . τ 0 , we obtain S r ( E 0 ) S a ( E 0 ) , E 0 R ¯ + q , which implies that < S r ( E 0 ) S a ( E 0 ) < , E 0 R ¯ + q . Hence, the functions S a ( · ) and S r ( · ) are well defined.
Next, it follows from the definition of S a ( · ) , the law of iterated expectation, and the strong Markov property that for every stopping time T a . s . τ 1 and S ( · ) U c such that E ( τ 1 ) H ¯ q + and E ( T ) = a . s . 0 ,
S a ( E ( τ 1 ) ) T = sup S ( · ) U c , T a . s . τ 1 E τ 1 T i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( τ 1 ) , E τ 1 τ 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F τ 1 + sup S ( · ) U c , T a . s . τ 2 E τ 2 T i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F τ 1 , = E τ 1 τ 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F τ 1 + E [ sup S ( · ) U c , T a . s . τ 2 [ τ 2 T i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | F τ 2 ] | F τ 1 ] , = E τ 1 τ 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F τ 1 E S a ( E ( τ 2 ) ) | F τ 1 , τ 1 a . s . τ 2 a . s . T ,
which implies that S a ( E ) , E R ¯ + q , satisfies (95). Thus, S a ( E ) , E R ¯ + q , is a possible entropy function for G . Note that with E ( τ 0 ) = a . s . E ( T ) = a . s . 0 it follows from (92) that the supremum in (96) is taken over the set of negative semidefinite values with one of the values being zero for S ( t ) a . s . 0 . Thus, S a ( 0 ) = 0 .
Similarly, it follows from the definition of S r ( · ) that for every stopping time T τ 2 and S ( · ) U r such that E ( τ 2 ) H ¯ q + and E ( T ) = a . s . 0 ,
S r ( E ( τ 2 ) ) = sup S ( · ) U r , T a . s . τ 2 T τ 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t , sup S ( · ) U r , T a . s . τ 1 T τ 1 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t + τ 1 τ 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t , = S r ( E ( τ 1 ) ) + τ 1 τ 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t , T a . s . τ 1 a . s . τ 2 ,
which implies that S r ( E ) , E R ¯ + q , satisfies (95). Thus, S r ( E ) , E R ¯ + q , is a possible entropy function for G . Note that with E ( t 0 ) = a . s . E ( T ) = a . s . 0 it follows from (92) that the supremum in (97) is taken over the set of negative semidefinite values with one of the values being zero for S ( t ) a . s . 0 . Thus, S r ( 0 ) = 0 .
Next, suppose there exists an entropy function S : R ¯ + q R for G , and let E ( τ 2 ) = a . s . 0 in (95). Then it follows from (95) that
S ( E ( τ 1 ) ) S ( 0 ) E τ 1 τ 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F τ 1 ]
for all τ 2 a . s . τ 1 and S ( · ) U c , which implies that
S ( E ( τ 1 ) ) S ( 0 ) inf S ( · ) U c , τ 2 a . s . τ 1 E τ 1 τ 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F τ 1 = sup S ( · ) U c , τ 2 a . s . τ 1 E τ 1 τ 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F τ 1 = S a ( E ( τ 1 ) ) .
Since E ( τ 1 ) is arbitrary, it follows that S ( E ) S ( 0 ) S a ( E ) , E R ¯ + q .
Alternatively, let E ( τ 1 ) = a . s . 0 in (95). Then it follows from (95) that
S ( E ( τ 2 ) ) S ( 0 ) E [ τ 1 τ 2 i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | F τ 1 ]
for all τ 1 a . s . τ 2 and S ( · ) U r . Hence,
S ( E ( τ 2 ) ) S ( 0 ) sup S ( · ) U r , τ 1 a . s . τ 2 E [ τ 1 τ 2 i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | F τ 1 ] = S r ( E ( τ 2 ) ) ,
which, since E ( τ 2 ) is arbitrary, implies that S r ( E ) S ( E ) S ( 0 ) , E R ¯ + q . Thus, all entropy functions for G satisfy (98). ☐
It is important to note that inequality (92) is equivalent to the existence of an entropy function for G . Sufficiency is simply a statement of Theorem 7, while necessity follows from (95) with E ( τ 2 ) = a . s . E ( τ 1 ) . This definition of entropy leads to the second law of stochastic thermodynamics being viewed as an axiom in the context of stochastic (anti)cyclo-dissipative dynamical systems [54].
The next result shows that all entropy functions for G are continuous on R ¯ + q .
Theorem 8.
Consider the large-scale stochastic dynamical system G with differential energy balance Equation (68), and let S : R ¯ + q R be an entropy function of G . Then S ( · ) is continuous on R ¯ + q .
Proof. 
Let E e R ¯ + q and S e R q be such that S e = d ( E e ) f ( E e ) . Note that with S ( t ) a . s . S e , E e is an equilibrium point of the differential energy balance Equation (68). Next, it follows from Lemma 2 that G is locally stochastically controllable, that is, for every τ > a . s . 0 and ε > 0 , the set of points that can be reached from and to E e in time T using admissible inputs S : [ 0 , τ ] H q , satisfying S ( t ) S e < a . s . ε , contains a neighborhood of E e .
Next, let δ > 0 and note that it follows from the continuity of f ( · ) , d ( · ) , J ( · ) , and D ( · ) that there exist τ > 0 and ε > 0 such that for every S : [ 0 , τ ) R q and S ( t ) S e < a . s . ε , E ( t ) E e < a . s . δ , t [ 0 , τ ) , where S ( · ) U and E ( t ) , t [ 0 , τ ) , denotes the solution to (68) with the initial condition E e . Furthermore, it follows from the local controllability of G that for every τ ^ ( 0 , τ ] , there exists a strictly increasing, continuous function γ : R ¯ + q R ¯ + q such that γ ( 0 ) = 0 , and for every E 0 H q + such that E 0 E e a . s . γ ( τ ^ ) , there exists 0 a . s . τ ˜ a . s . τ ^ and an input S : [ 0 , τ ^ ] H q such that S ( t ) S e < ε , t [ 0 , τ ˜ ) , and E ( t ^ ) = a . s . E 0 . Hence, there exists β > 0 such that for every E 0 H q + such that E 0 E e a . s . β , there exists 0 a . s . τ ^ a . s . γ 1 ( E 0 E e ) ] and an input S : [ t 0 , τ ^ ] H q such that S ( t ) S e < a . s . ε , t [ 0 , t ^ ] , and E ( t ^ ) = a . s . E 0 . In addition, it follows from Lemma 2 that S : [ 0 , τ ^ ] H q is such that E ( t ) a . s . 0 , t [ 0 , τ ^ ] .
Next, since σ i i ( · ) , i = 1 , , q , is continuous, it follows that there exists M H 1 + such that
sup E E e < a . s . δ , S S e < a . s . ε i = 1 q S i σ i i ( E ) c + E i 1 2 row i ( D ( E ) ) row i T ( D ( E ) ) ( c + E i ) 2 = M .
Hence, it follows that
0 τ ^ i = 1 q S i ( σ ) σ i i ( E ( σ ) ) c + E i ( σ ) 1 2 row i ( D ( E ( σ ) ) ) row i T ( D ( E ( σ ) ) ) ( c + E i ( σ ) ) 2 d σ a . s . 0 τ ^ i = 1 q S i ( σ ) σ i i ( E ( σ ) ) c + E i ( σ ) 1 2 row i ( D ( E ( σ ) ) ) row i T ( D ( E ( σ ) ) ) ( c + E i ( σ ) ) 2 d σ a . s . M τ ^ a . s . M γ 1 ( E 0 E e ) .
Now, if S ( · ) is an entropy function of G , then
E [ S ( E ( τ ^ ) ) | F 0 ] a . s . S ( E e ) + E [ 0 τ ^ i = 1 q [ S i ( σ ) σ i i ( E ( σ ) ) c + E i ( σ ) 1 2 row i ( D ( E ( σ ) ) ) row i T ( D ( E ( σ ) ) ) ( c + E i ( σ ) ) 2 ] d σ | F 0 ]
or, equivalently,
E 0 τ ^ i = 1 q S i ( σ ) σ i i ( E ( σ ) ) c + E i ( σ ) 1 2 row i ( D ( E ( σ ) ) ) row i T ( D ( E ( σ ) ) ) ( c + E i ( σ ) ) 2 d σ | F 0 a . s . S ( E e ) E S ( E ( τ ^ ) ) | F 0 .
If S ( E e ) a . s . S ( E ( τ ^ ) ) , then combining (110) and (112) yields
| S ( E e ) E S ( E ( τ ^ ) ) | F 0 | a . s . E M γ 1 ( E 0 E e ) | F 0 .
Alternatively, if S ( E ( τ ^ ) ) a . s . S ( E e ) , then (113) can be derived by reversing the roles of E e and E ( τ ^ ) . Specifically, for E 0 R ¯ + q and E ( τ ^ ) = a . s . E 0 , (113) becomes
| S ( E 0 ) S ( E e ) | E [ M ] γ 1 ( E 0 E e ) .
Hence, since γ ( · ) is continuous and E ( τ ^ ) is arbitrary, it follows that S ( · ) is continuous on R ¯ + q . ☐
Next, as a direct consequence of Theorem 7, we show that all possible entropy functions of G form a convex set, and hence, there exists a continuum of possible entropy functions for G ranging from the required entropy supply S r ( E ) to the available entropy S a ( E ) .
Proposition 6.
Consider the large-scale stochastic dynamical system G with differential energy balance Equation (68), and assume that Axioms ( i ) and ( i i ) hold. Then
S ( E ) α S r ( E ) + ( 1 α ) S a ( E ) , α [ 0 , 1 ] ,
is an entropy function for G .
Proof. 
The result is a direct consequence of the reachability of G along with inequality (95) by noting that if S r ( E ) and S a ( E ) satisfy (95), then S ( E ) satisfies (95). ☐
It follows from Proposition 6 that Definition 11 does not provide enough information to define the entropy uniquely for nonequilibrium thermodynamic systems with the differential energy balance Equation (68). This difficulty was pointed out long ago in [61]. Two particular entropy functions for G can, however, be computed a priori via the variational problems given by (96) and (97). For equilibrium thermodynamics, uniqueness is not an issue, as shown in the next proposition.
Proposition 7.
Consider the large-scale stochastic dynamical system G with differential energy balance Equation (68), and assume that Axioms ( i ) and ( i i ) hold. Then at every equilibrium state E = E e of the isolated system G , the entropy S ( E ) , E R ¯ + q , of G is unique (modulo a constant of integration) and is given by
S ( E ) − S ( 0 ) = S a ( E ) = S r ( E ) = e T log e ( c e + E ) − q log e c ,
where E = E e and log e ( c e + E ) denotes the vector natural logarithm given by [ log e ( c + E 1 ) , , log e ( c + E q ) ] T .
Proof. 
It follows from Axiom (i) and Axiom (ii) that for an equilibrium process ϕ i j ( E ( t ) ) ≡ a.s. 0 , i ≠ j , i , j = 1 , … , q , D ( E ( t ) ) ≡ a.s. 0 , and J ( E ( t ) ) ≡ a.s. 0 . Consider the entropy function S a ( · ) given by (96), and let E 0 = E e for some equilibrium state E e . Then it follows from (68) that
S a ( E 0 ) = sup S ( · ) U c , T a . s . t 0 E [ E [ t 0 T i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( σ ) ) 2 ] d t | E ( t 0 ) = a . s . E 0 ] ] = sup S ( · ) U c , T a . s . t 0 E E t 0 T i = 1 q d E i ( t ) j = 1 , j i q ϕ i j ( E ( t ) ) d t c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( σ ) ) 2 d t | E ( t 0 ) = a . s . E 0 = sup S ( · ) U c , T a . s . t 0 E E 0 p t 21 p t i = 1 q log e c c + E i 0 + t 0 T i = 1 q 1 2 row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t t 0 T i = 1 q j = 1 , j i q ϕ i j ( E ( t ) ) c + E i ( t ) d t | E ( t 0 ) = a . s . E 0 = sup S ( · ) U c , T a . s . t 0 E E 0 p t 21 p t i = 1 q log e c c + E i 0 + t 0 T i = 1 q 1 2 row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t t 0 T i = 1 q j = 1 , j i q 1 2 ϕ i j ( E ( t ) ) c + E i ( t ) ϕ i j ( E ( t ) ) c + E j ( t ) d t | E ( t 0 ) = a . s . E 0 = i = 1 q log e c + E i 0 c + inf S ( · ) U c , T a . s . t 0 E E t 0 T i = 1 q 1 2 1 ( c + E i ( t ) ) 2 j = 1 , j i q ϕ i j ( E ( t ) ) · [ E i ( t ) E j ( t ) ] c + E i ( t ) c + E j ( t ) + row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ] d t | E ( t 0 ) = a . s . E 0 ] ] .
Since the solution E ( t ) , t ≥ t 0 , to (68) is nonnegative for all nonnegative initial conditions, it follows from Axiom (ii) that the infimum in (116) is taken over the set of nonnegative values. Moreover, the zero value of the infimum is achieved on an equilibrium process for which ϕ i j ( E ( t ) ) ≡ a.s. 0 , i ≠ j , i , j = 1 , … , q . Thus,
S a ( E 0 ) = e T log e ( c e + E 0 ) − q log e c , E 0 = E e .
Similarly, consider the entropy function S r ( · ) given by (97). Then, it follows from (68) that, for E 0 = E e ,
S r ( E 0 ) = sup S ( · ) U r , T a . s . t 0 E [ E [ t 0 T i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | E ( t 0 ) = a . s . 0 ] ] = sup S ( · ) U r , T a . s . t 0 E E t 0 T i = 1 q d E i ( t ) j = 1 , j i q ϕ i j ( E ( t ) ) d t c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | E ( t 0 ) = a . s . 0 = sup S ( · ) U r , T a . s . t 0 E E 0 p t 21 p t i = 1 q log e c + E i 0 c + t 0 T i = 1 q 1 2 row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t t 0 T i = 1 q j = 1 , j i q ϕ i j ( E ( t ) ) c + E i ( t ) d t | E ( t 0 ) = a . s . 0 = i = 1 q log e c + E i 0 c + sup S ( · ) U r , T a . s . t 0 E E t 0 T i = 1 q 1 2 1 ( c + E i ( t ) ) 2 [ j = 1 , j i q ϕ i j ( E ( t ) ) · [ E i ( t ) E j ( t ) ] c + E i ( t ) c + E j ( t ) + row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ] d t | E ( t 0 ) = a . s . 0 ] ] .
Now, it follows from Axioms (i) and (ii) that the zero value of the supremum in (118) is achieved on an equilibrium process and thus
S r ( E 0 ) = e T log e ( c e + E 0 ) − q log e c , E 0 = E e .
Finally, it follows from (98) that (115) holds. ☐
The next proposition shows that if (95) holds as an equality for some transformation starting and ending at an equilibrium point of the isolated dynamical system G , then this transformation must lie on the equilibrium manifold M e .
Proposition 8.
Consider the large-scale stochastic dynamical system G with differential energy balance Equation (68), and assume that Axioms ( i ) and ( i i ) hold. Let S ( · ) denote an entropy of G , and let E : [ t 0 , t 1 ] → R ¯ + q denote the solution to (68) with E ( t 0 ) = a.s. α 0 e and E ( t 1 ) = a.s. α 1 e , where α 0 , α 1 ≥ 0 . Then
E [ S ( E ( t 1 ) ) | F t 0 ] = S ( E ( t 0 ) ) + E [ ∫ t 0 t 1 ∑ i = 1 q { [ S i ( t ) − σ i i ( E ( t ) ) ] / ( c + E i ( t ) ) − ( 1 / 2 ) row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) / ( c + E i ( t ) ) 2 } d t | F t 0 ]
if and only if there exists a continuous function α : [ t 0 , t 1 ] R ¯ + such that α ( t 0 ) = α 0 , α ( t 1 ) = α 1 , and E ( t ) = a . s . α ( t ) e , t [ t 0 , t 1 ] .
Proof. 
Since E ( t 0 ) and E ( t 1 ) are equilibrium states of the isolated dynamical system G , it follows from Proposition 7 that
E [ S ( E ( t 1 ) ) | F t 0 ] − S ( E ( t 0 ) ) = a.s. q log e ( c + α 1 ) − q log e ( c + α 0 ) .
Furthermore, it follows from (68) that
E t 0 t 1 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F t 0 = E t 0 t 1 i = 1 q d E i ( t ) j = 1 , j i q ϕ i j ( E ( t ) ) d t c + E i ( t ) | F t 0 E t 0 t 1 i = 1 q 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F t 0 = q log e c + α 1 c + α 0 + t 0 t 1 i = 1 q 1 2 1 ( c + E i ( t ) ) 2 j = 1 , j i q ϕ i j ( E ( t ) ) [ E i ( t ) E j ( t ) ] · c + E i ( t ) c + E j ( t ) + row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ] d t | F t 0 ] .
Now, it follows from Axioms (i) and (ii) that (120) holds if and only if E i ( t ) = E j ( t ) , t ∈ [ t 0 , t 1 ] , i ≠ j , i , j = 1 , … , q , or, equivalently, there exists a continuous function α : [ t 0 , t 1 ] → R ¯ + such that E ( t ) = a.s. α ( t ) e , t ∈ [ t 0 , t 1 ] , α ( t 0 ) = α 0 , and α ( t 1 ) = α 1 . ☐
Even though it follows from Proposition 6 that Definition 11 does not provide a unique continuous entropy function for nonequilibrium systems, the next theorem gives a unique, two-times continuously differentiable entropy function for G for equilibrium and nonequilibrium processes. This result answers the long-standing question of how the entropy of a nonequilibrium state of a dynamical process should be defined [61,62], and establishes its global existence and uniqueness.
Theorem 9.
Consider the large-scale stochastic dynamical system G with differential energy balance Equation (68), and assume that Axioms ( i ) and ( i i ) hold. Then the function S : R ¯ + q → R ¯ + given by
S ( E ) = e T log e ( c e + E ) − q log e c , E ∈ R ¯ + q ,
where c > 0 , is a unique (modulo a constant of integration), two-times continuously differentiable entropy function of G . Furthermore, for E ( t ) ∈ H q ∖ M e , t ≥ t 0 , where E ( t ) , t ≥ t 0 , denotes the solution to (68) and M e = { E ∈ R ¯ + q : E = α e , α ≥ 0 } , (123) satisfies
E S ( E ( t 2 ) ) | F t 1 > S ( E ( t 1 ) ) + E t 1 t 2 i = 1 q [ S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | F t 1
for every t 2 ≥ t 1 ≥ t 0 and S ( · ) ∈ U .
Proof. 
Since, by Proposition 1, E ( t ) ≥ 0 , t ≥ t 0 , and ϕ i j ( E ) = − ϕ j i ( E ) , E ∈ R ¯ + q , i ≠ j , i , j = 1 , … , q , it follows that
E S ( E ( t 2 ) ) | F t 1 S ( E ( t 1 ) ) = E t 1 t 2 d S ( E ( t ) ) | F t 1 = E t 1 t 2 i = 1 q d E i ( t ) c + E i ( t ) 1 2 [ row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ( c + E i ( t ) ) 2 + row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | F t 1 = E t 1 t 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F t 1 + E t 1 t 2 i = 1 q j = 1 , j i q ϕ i j ( E ( t ) ) c + E i ( t ) 1 2 row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ( c + E i ( t ) ) 2 ] d t | F t 1 ] = E t 1 t 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F t 1 E t 1 t 2 i = 1 q 1 2 1 ( c + E i ( t ) ) 2 j = 1 , j i q ϕ i j ( E ( t ) ) ( E i ( t ) E j ( t ) ) c + E i ( t ) c + E j ( t ) + row i ( J ( E ( t ) ) ) row i T ( J ( E ( t ) ) ) ] E t 1 t 2 i = 1 q S i ( t ) σ i i ( E ( t ) ) c + E i ( t ) 1 2 row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) ( c + E i ( t ) ) 2 d t | F t 1 , t t 0 .
Furthermore, in the case where E ( t ) ∈ H q ∖ M e , t ≥ t 0 , it follows from Axiom (i), Axiom (ii), and (125) that (124) holds.
To show that (123) is a unique, two-times continuously differentiable entropy function of G , let S ( E ) be a two-times continuously differentiable entropy function of G so that S ( E ) satisfies (95) or, equivalently,
L S ( E ) ≥ μ 1 T ( E ) [ S − d ( E ) ] − ( 1 / 2 ) tr μ 2 ( E ) D ( E ) D T ( E ) , E ∈ R ¯ + q , S ∈ R q ,
where μ 1 T ( E ) = [ 1 / ( c + E 1 ) , … , 1 / ( c + E q ) ] and μ 2 ( E ) = diag [ 1 / ( c + E 1 ) 2 , … , 1 / ( c + E q ) 2 ] , E ∈ R ¯ + q , E ( t ) , t ≥ t 0 , denotes the solution to the differential energy balance Equation (68), and L S ( E ( t ) ) denotes the infinitesimal generator of S ( E ) along the solution E ( t ) , t ≥ t 0 . Hence, it follows from (126) that
S ( E ) [ f ( E ) d ( E ) + S ] + 1 2 tr S ( E ) [ J ( E ) J T ( E ) + D ( E ) D T ( E ) ] μ 1 T ( E ) [ S d ( E ) ] 1 2 tr μ 2 ( E ) D ( E ) D T ( E ) , E R ¯ + q , S R q ,
which implies that there exist continuous functions ℓ : R ¯ + q → R p and W : R ¯ + q → R p × q such that
0 = S ( E ) [ f ( E ) d ( E ) + S ] + 1 2 tr S ( E ) [ J ( E ) J T ( E ) + D ( E ) D T ( E ) ] μ 1 T ( E ) [ S d ( E ) ] + 1 2 tr μ 2 ( E ) D ( E ) D T ( E ) [ ( E ) + W ( E ) S ] T [ ( E ) + W ( E ) S ] , E R ¯ + q , S R q .
Now, equating coefficients of equal powers (of S and D), it follows that W ( E ) ≡ 0 , S ′ ( E ) = μ 1 T ( E ) , S ″ ( E ) = − μ 2 ( E ) , E ∈ R ¯ + q , and
0 = S ( E ) f ( E ) + 1 2 tr S ( E ) J ( E ) J T ( E ) T ( E ) ( E ) , E R ¯ + q .
Hence, S ( E ) = e T log e ( c e + E ) − q log e c , E ∈ R ¯ + q , and
0 = μ 1 T ( E ) f ( E ) − ( 1 / 2 ) tr μ 2 ( E ) J ( E ) J T ( E ) − ℓ T ( E ) ℓ ( E ) , E ∈ R ¯ + q .
Thus, (123) is a unique, two-times continuously differentiable entropy function for G . ☐
Note that it follows from Axiom (i), Axiom (ii), and the last equality in (125) that the entropy function given by (123) satisfies (95) as an equality for an equilibrium process and as a strict inequality for a nonequilibrium process. For any entropy function of G , it follows from Proposition 8 that if (95) holds as an equality for some transformation starting and ending at equilibrium points of the isolated system G , then this transformation must lie on the equilibrium manifold M e . However, (95) may hold as an equality for nonequilibrium processes starting and ending at nonequilibrium states.
The entropy expression given by (123) is identical in form to the Boltzmann entropy for statistical thermodynamics. Since the entropy given by (123) is indeterminate to the extent of an additive constant, we can set the constant of integration q log e c to zero by taking c = 1 . Since S ( E ) given by (123) achieves a maximum when all the subsystem energies E i , i = 1 , … , q , are equal, the entropy of G can be thought of as a measure of the tendency of a system to lose the ability to do useful work, lose order, and settle to a more homogeneous state. For further details see [30].
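As a quick numerical illustration of this maximum-entropy property (a sketch only; the helper function and the random search below are our own illustrative constructions and not part of the formal development), one can evaluate (123) with c = 1 over random nonnegative energy vectors sharing the same total energy and compare the result with the equipartitioned state:

```python
import numpy as np

def entropy(E, c=1.0):
    # Entropy (123): S(E) = e^T log_e(c e + E) - q log_e(c).
    E = np.asarray(E, dtype=float)
    return float(np.sum(np.log(c + E)) - E.size * np.log(c))

rng = np.random.default_rng(0)
q, total_energy = 5, 100.0

# Random nonnegative energy vectors with the same total energy e^T E.
best = max(entropy(rng.dirichlet(np.ones(q)) * total_energy) for _ in range(10_000))

E_equal = np.full(q, total_energy / q)      # equipartitioned state
print(best, "<=", entropy(E_equal))         # the equipartitioned state maximizes S
```

The inequality printed at the end reflects the concavity of the logarithm: among all states with the same total energy, (123) is largest when the energy is spread evenly over the subsystems.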
Recalling that E [ d Q i ( t ) | F t ] = [ S i ( t ) − σ i i ( E ( t ) ) ] d t , i = 1 , … , q , is the infinitesimal amount of net heat received or dissipated by the i-th subsystem of G over the infinitesimal time interval d t , it follows from (95) that
E [ d S ( E ( t ) ) | F t ] ≥ ∑ i = 1 q [ d Q i ( t ) / ( c + E i ( t ) ) − ( 1 / 2 ) row i ( D ( E ( t ) ) ) row i T ( D ( E ( t ) ) ) / ( c + E i ( t ) ) 2 d t ] , t ≥ t 0 .
Inequality (131) is analogous to the classical thermodynamic inequality for the variation of entropy during an infinitesimal irreversible transformation, with the shifted subsystem energies c + E i playing the role of the i-th subsystem thermodynamic (absolute) temperatures. Specifically, note that since d S i / d E i = 1 / ( c + E i ) , where S i = log e ( c + E i ) − log e c denotes the unique continuously differentiable i-th subsystem entropy, it follows that d S i / d E i , i = 1 , … , q , defines the reciprocal of the subsystem thermodynamic temperatures. That is,
1 / T i ≜ d S i / d E i
and T i > 0 , i = 1 , , q . Hence, in our formulation, temperature is a function derived from entropy and does not involve the primitive subjective notions of hotness and coldness.
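Equivalently, inverting the relation above gives the subsystem temperatures directly in terms of the stored energies:

```latex
T_i \;=\; \left(\frac{\mathrm{d}\mathcal{S}_i}{\mathrm{d}E_i}\right)^{-1} \;=\; c + E_i , \qquad i = 1,\ldots,q .
```

For instance, with c = 1 a subsystem storing energy E i = 3 has temperature T i = 4 in these (dimensionless) units.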
It is important to note that in this paper we regard subsystem temperatures as synonymous with subsystem energies. Even though this does not limit the generality of our theory from a mathematical perspective, it can be physically limiting since it does not allow for two subsystems of G having the same stored energy with one of the subsystems being at a higher temperature (i.e., hotter) than the other. This, however, can easily be addressed by assigning a different specific heat (i.e., thermal capacity) to each of the compartments of the large-scale system G as shown in [30].

7. Stochastic Semistability and Energy Equipartition

For the (adiabatically) isolated large-scale stochastic dynamical system G , (95) yields the fundamental inequality
E [ S ( E ( τ 2 ) ) | F τ 1 ] ≥ S ( E ( τ 1 ) ) , τ 2 ≥ a.s. τ 1 .
Inequality (133) implies that, for any dynamical change in an adiabatically isolated large-scale stochastic dynamical system G , the entropy of the final state can never be less than the entropy of the initial state; it is a generalization of Clausius’ version of the entropy principle, which states that for every irreversible (nicht umkehrbar) process in an adiabatically isolated system beginning and ending at an equilibrium state, the entropy of the final state is greater than or equal to the entropy of the initial state. Inequality (133) is often identified with the second law of thermodynamics for stochastic systems as a statement about entropy increase. It is important to stress that this result holds for an adiabatically isolated dynamical system. It is, however, possible to reduce the entropy of the dynamical system G with power (heat flux) supplied from an external system. The entropy of both systems taken together, however, cannot decrease.
As in the deterministic thermodynamic problem [30], this observation implies that when the isolated large-scale dynamical system G with thermodynamically consistent energy flow characteristics (i.e., Axioms (i) and (ii) hold) is at a state of maximum entropy consistent with its energy, it cannot be subject to any further dynamical change since any such change would result in a decrease of entropy. This of course implies that the state of maximum entropy is the stable state of an isolated system, and that this equilibrium state has to be stochastically semistable. The following theorem generalizes Theorem 3.9 of [30] to the stochastic setting.
Theorem 10.
Consider the large-scale stochastic dynamical system G with differential energy balance Equation (68) with S ( t ) ≡ a.s. 0 , D ( E ( t ) ) ≡ a.s. 0 , and d ( E ( t ) ) ≡ a.s. 0 , and assume that Axioms ( i ) and ( i i ) hold. Then, for every α ≥ 0 , α e is a stochastic semistable equilibrium state of (68). Furthermore, E ( t ) → a.s. ( 1 / q ) e e T E ( t 0 ) as t → ∞ and ( 1 / q ) e e T E ( t 0 ) is a semistable equilibrium state. Finally, if for some k ∈ { 1 , … , q } , σ k k ( E ) ≥ 0 , E ∈ R ¯ + q , and σ k k ( E ) = 0 if and only if E k = 0 [63], then the zero solution E ( t ) ≡ a.s. 0 to (68) is a globally asymptotically stable in probability equilibrium state of (68).
Proof. 
It follows from Axioms (i) and (ii) that α e ∈ R ¯ + q , α ≥ 0 , is an equilibrium state of (68). To show Lyapunov stability of the equilibrium state α e , consider V ( E ) = ( 1 / 2 ) ( E − α e ) T ( E − α e ) as a Lyapunov function candidate. Note that for c ≫ max { E i , E j } , i ≠ j , i , j = 1 , … , q ,
( c + E i ) / ( c + E j ) = ( 1 + E i / c ) / ( 1 + E j / c ) ≈ 1 .
Since Axiom (ii) holds for all c > 0 , we have
∑ j = 1 , j ≠ i q ( E i − E j ) ϕ i j ( E ) ≤ − row i ( J ( E ) ) row i T ( J ( E ) ) , i = 1 , … , q .
Now, since ϕ i j ( E ) = ϕ j i ( E ) , E R ¯ + q , i j , i , j = 1 , , q , e T f ( E ) = 0 , E R ¯ + q , and e T J ( E ) = 0 , E R ¯ + q , it follows from (135) that
L V ( E ) = ( E − α e ) T f ( E ) + ( 1 / 2 ) tr J ( E ) J T ( E ) = E T f ( E ) + ( 1 / 2 ) tr J ( E ) J T ( E ) = ∑ i = 1 q E i ∑ j = 1 , j ≠ i q ϕ i j ( E ) + ( 1 / 2 ) ∑ i = 1 q row i ( J ( E ) ) row i T ( J ( E ) ) = ( 1 / 2 ) ∑ i = 1 q [ ∑ j = 1 , j ≠ i q ( E i − E j ) ϕ i j ( E ) + row i ( J ( E ) ) row i T ( J ( E ) ) ] ≤ 0 , E ∈ R ¯ + q ,
which establishes Lyapunov stability in probability of the equilibrium state α e .
To show that α e is stochastically semistable, let R ≜ { E ∈ R ¯ + q : L V ( E ) = 0 } . Now, by Axioms (i) and (ii) the directed graph associated with the connectivity matrix C for the large-scale dynamical system G is strongly connected, which implies that R = { E ∈ R ¯ + q : E 1 = · · · = E q } . Since R ¯ + q is an invariant set and V ( E ) is radially unbounded, it follows from Theorem 2 that for every initial condition E ( t 0 ) ∈ R ¯ + q , E ( t ) → a.s. R as t → ∞ , and hence, α e is a stochastic semistable equilibrium state of (68). Next, note that since e T E ( t ) = e T E ( t 0 ) and E ( t ) → a.s. R as t → ∞ , it follows that E ( t ) → a.s. ( 1 / q ) e e T E ( t 0 ) as t → ∞ . Hence, with α = ( 1 / q ) e T E ( t 0 ) , α e = ( 1 / q ) e e T E ( t 0 ) is a semistable equilibrium state of (68).
To show that in the case where for some k ∈ { 1 , … , q } , σ k k ( E ) ≥ 0 , E ∈ R ¯ + q , and σ k k ( E ) = 0 if and only if E k = 0 , the zero solution E ( t ) ≡ a.s. 0 to (68) is globally asymptotically stable in probability, consider V ( E ) = ( 1 / 2 ) E T E , E ∈ R ¯ + q , as a candidate Lyapunov function. Note that V ( 0 ) = 0 , V ( E ) > 0 , E ∈ R ¯ + q , E ≠ 0 , and V ( E ) is radially unbounded. Now, the infinitesimal generator of the Lyapunov function along the system energy trajectories of (68) is given by
L V ( E ) = E T [ f ( E ) − d ( E ) ] + ( 1 / 2 ) tr J ( E ) J T ( E ) = E T f ( E ) + ( 1 / 2 ) tr J ( E ) J T ( E ) − E k σ k k ( E ) = ∑ i = 1 q E i ∑ j = 1 , j ≠ i q ϕ i j ( E ) + ( 1 / 2 ) ∑ i = 1 q row i ( J ( E ) ) row i T ( J ( E ) ) − E k σ k k ( E ) = ( 1 / 2 ) ∑ i = 1 q [ ∑ j = 1 , j ≠ i q ( E i − E j ) ϕ i j ( E ) + row i ( J ( E ) ) row i T ( J ( E ) ) ] − E k σ k k ( E ) ≤ 0 , E ∈ R ¯ + q ,
which shows that the zero solution E ( t ) ≡ a.s. 0 to (68) is Lyapunov stable in probability.
Finally, to show global asymptotic stability in probability of the zero equilibrium state, let R ≜ { E ∈ R ¯ + q : L V ( E ) = 0 } . Now, since Axiom ( i ) holds and σ k k ( E ) = 0 if and only if E k = 0 , it follows that R = { E ∈ R ¯ + q : E k = 0 , k ∈ { 1 , … , q } } ∩ { E ∈ R ¯ + q : E 1 = E 2 = · · · = E q } = { 0 } . Hence, it follows from Theorem 2 that for every initial condition E ( t 0 ) ∈ R ¯ + q , E ( t ) → a.s. R = { 0 } as t → ∞ , which proves global asymptotic stability in probability of the zero equilibrium state of (68). ☐
Theorem 10 shows that the isolated (i.e., S ( t ) ≡ a.s. 0 , d ( E ) ≡ a.s. 0 , and D ( E ) ≡ a.s. 0 ) large-scale stochastic dynamical system G is stochastically semistable. In Theorem 10 we used the energy Lyapunov function to show that for the isolated (i.e., S ( t ) ≡ a.s. 0 , d ( E ) ≡ a.s. 0 , and D ( E ) ≡ a.s. 0 ) large-scale stochastic dynamical system G , E ( t ) → a.s. ( 1 / q ) e e T E ( t 0 ) as t → ∞ and ( 1 / q ) e e T E ( t 0 ) is a stochastic semistable equilibrium state. This result can also be arrived at using the system entropy.
Specifically, using the system entropy given by (123), we can show attraction of the system trajectories to stochastic Lyapunov stable equilibrium points α e , α ≥ 0 , and hence show stochastic semistability of these equilibrium states. To see this, note that since e T f ( E ) = 0 , E ∈ R ¯ + q , and e T J ( E ) = 0 , E ∈ R ¯ + q , it follows that e T d E ( t ) = 0 , t ≥ t 0 . Hence, e T E ( t ) = a.s. e T E ( t 0 ) , t ≥ t 0 . Furthermore, since E ( t ) ≥ 0 , t ≥ t 0 , it follows that 0 ≤ E ( t ) ≤ e e T E ( t 0 ) a.s., t ≥ t 0 , which implies that all solutions to (68) are almost surely bounded.
Next, since by (125) the function − S ( E ( t ) ) , t ≥ t 0 , is a supermartingale and E ( t ) , t ≥ t 0 , is bounded, it follows from Theorem 2 that for every initial condition E ( t 0 ) ∈ R ¯ + q , E ( t ) → a.s. R as t → ∞ , where R ≜ { E ∈ R ¯ + q : L S ( E ) = 0 } . It now follows from the last inequality of (125) that R = { E ∈ R ¯ + q : ( E i − E j ) ϕ i j ( E ) = 0 , i = 1 , … , q , j ∈ K i } , which, since the directed graph associated with the connectivity matrix C for the large-scale dynamical system G is strongly connected, implies that R = { E ∈ R ¯ + q : E 1 = · · · = E q } . Since the set R consists of the equilibrium states of (68), it follows that M = R , which, along with (136), establishes stochastic semistability of the equilibrium states α e , α ≥ 0 .
Theorem 10 implies that the steady-state values of the energy in all subsystems G i of the isolated stochastic large-scale dynamical system G are equal; that is, the steady-state energy of the isolated large-scale stochastic dynamical system G given by
E ∞ = a.s. ( 1 / q ) e e T E ( t 0 ) = a.s. [ ( 1 / q ) ∑ i = 1 q E i ( t 0 ) ] e
is uniformly distributed over all subsystems of G . This phenomenon is known as equipartition of energy [64,65,66,67,68] and is an emergent behavior in thermodynamic systems [30].
Example 2.
In this example, we apply Theorem 10 to the five-compartment thermodynamic system shown in Figure 2. Specifically, consider
d E 1 ( t ) = [ E 2 ( t ) − E 1 ( t ) ] d t + γ [ E 2 ( t ) − E 1 ( t ) ] d w ( t ) , E 1 ( 0 ) = a.s. E 10 , t ≥ 0 ,
d E 2 ( t ) = [ E 1 ( t ) − E 2 ( t ) + E 3 ( t ) − E 2 ( t ) + E 5 ( t ) − E 2 ( t ) ] d t + γ [ E 1 ( t ) − E 2 ( t ) + E 3 ( t ) − E 2 ( t ) + E 5 ( t ) − E 2 ( t ) ] d w ( t ) , E 2 ( 0 ) = a.s. E 20 ,
d E 3 ( t ) = [ E 2 ( t ) − E 3 ( t ) + E 4 ( t ) − E 3 ( t ) ] d t + γ [ E 2 ( t ) − E 3 ( t ) + E 4 ( t ) − E 3 ( t ) ] d w ( t ) , E 3 ( 0 ) = a.s. E 30 ,
d E 4 ( t ) = [ E 3 ( t ) − E 4 ( t ) ] d t + γ [ E 3 ( t ) − E 4 ( t ) ] d w ( t ) , E 4 ( 0 ) = a.s. E 40 ,
d E 5 ( t ) = [ E 2 ( t ) − E 5 ( t ) ] d t + γ [ E 2 ( t ) − E 5 ( t ) ] d w ( t ) , E 5 ( 0 ) = a.s. E 50 .
Note that (139)–(143) can be cast in the form of (68) with E ≜ [ E 1 , E 2 , E 3 , E 4 , E 5 ] T , d ( E ) = 0 , D ( E ) = 0 , S ( t ) ≡ 0 , w 1 = w ,
f ( E ) = [ E 2 − E 1 , E 1 − E 2 + E 3 − E 2 + E 5 − E 2 , E 2 − E 3 + E 4 − E 3 , E 3 − E 4 , E 2 − E 5 ] T , J ( E ) = γ [ E 2 − E 1 , E 1 − E 2 + E 3 − E 2 + E 5 − E 2 , E 2 − E 3 + E 4 − E 3 , E 3 − E 4 , E 2 − E 5 ] T .
It follows from Theorem 10 that the thermodynamic heat flow model (139)–(143) is stochastically semistable with respect to R ¯ + 5 and achieves energy equipartition. To illustrate this, let E 10 = 0 , E 20 = 10 , E 30 = 20 , E 40 = 30 , E 50 = 40 , and γ = 0.2 , in which case (138) gives E ( t ) → a.s. 20 e as t → ∞ . Figure 3 shows the sample energy trajectories along with the standard deviation of the states of each thermodynamic compartment versus time for 10 sample paths.
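To reproduce a figure of this type, the following Euler–Maruyama sketch integrates (139)–(143) for a single sample path; the step size, time horizon, and function names are our own illustrative choices and not part of the model, and a naive Euler scheme need not preserve nonnegativity exactly for a finite step size. Along the computed path the state approaches the equipartitioned value 20 e , the total energy e T E ( t ) is conserved, and the entropy (123) with c = 1 increases, consistent with (133):

```python
import numpy as np

# Undirected heat-flow edges of Figure 2 (0-based indices): 1-2, 2-3, 3-4, 2-5.
EDGES = [(0, 1), (1, 2), (2, 3), (1, 4)]

def f(E):
    """Drift f(E) of (139)-(143): phi_ij(E) = E_j - E_i for each connected pair."""
    out = np.zeros_like(E)
    for i, j in EDGES:
        flow = E[j] - E[i]
        out[i] += flow
        out[j] -= flow
    return out

def simulate(E0, gamma=0.2, dt=1e-3, T=10.0, seed=0):
    """Euler-Maruyama integration of dE = f(E) dt + gamma f(E) dw (single Wiener process)."""
    rng = np.random.default_rng(seed)
    E = np.array(E0, dtype=float)
    path = [E.copy()]
    for _ in range(int(T / dt)):
        dw = rng.normal(0.0, np.sqrt(dt))
        E = E + f(E) * (dt + gamma * dw)
        path.append(E.copy())
    return np.array(path)

traj = simulate([0.0, 10.0, 20.0, 30.0, 40.0])
S = np.sum(np.log1p(np.clip(traj, 0.0, None)), axis=1)   # entropy (123) with c = 1

print("final state      :", np.round(traj[-1], 2))        # close to 20*e
print("total energy     :", traj[0].sum(), "->", round(traj[-1].sum(), 6))
print("entropy increase :", round(S[-1] - S[0], 4))        # nonnegative here
```

Averaging such paths over many independent seeds recovers the sample mean and standard deviation plotted in Figure 3.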

8. Conclusions

In this paper, we combined thermodynamics and stochastic dynamical systems theory to provide a system-theoretic foundation of thermodynamics. The proposed dynamical systems framework of thermodynamics can potentially provide deeper insights into the constitutive mechanisms that explain fundamental thermodynamic processes and describe acute microcosms and macrocosms in the ever-elusive pursuit of unifying the subatomic and astronomical domains. In future research, we will use the realizations of each sample path of the stochastic energy variables characterized by the stochastic differential energy balance dynamical model to describe the probability density function of our large-scale stochastic thermodynamic model by a continuous-time and continuous-space Fokker–Planck evolution equation, and to relate the stationary solution of the Fokker–Planck equation to the canonical thermodynamic equilibrium distribution. Furthermore, since our stochastic thermodynamic model does not enforce the second law along each individual sample path trajectory, we will explore the second law as a fluctuation theorem in order to predict precisely the cases in which the system entropy decreases over a given time interval for our model.

Acknowledgments

This research was supported in part by the Air Force Office of Scientific Research under Grant FA9550-16-1-0100.

Author Contributions

Both authors contributed equally to this work.

Conflicts of Interest

The authors declare no conflict of interest.

References and Notes

  1. Sekimoto, K. Kinetic characterization of heat bath and the energetics of thermal ratchet models. J. Phys. Soc. Jpn. 1997, 66, 1234–1237. [Google Scholar] [CrossRef]
  2. Sekimoto, K. Langevin equation and thermodynamics. Prog. Theor. Phys. Supp. 1998, 130, 17–27. [Google Scholar] [CrossRef]
  3. Sekimoto, K. Stochastic Energetics; Springer: Berlin, Germany, 2010. [Google Scholar]
  4. Seifert, U. Stochastic thermodynamics: Principles and perspectives. Eur. Phys. J. B 2008, 64, 423–431. [Google Scholar] [CrossRef]
  5. Seifert, U. Stochastic thermodynamics, fluctuation theorems and molecular machines. Rep. Prog. Phys. 2012, 75, 1–58. [Google Scholar] [CrossRef] [PubMed]
  6. Onsager, L. Reciprocal relations in irreversible processes, I. Phys. Rev. 1931, 37, 405–426. [Google Scholar] [CrossRef]
  7. Onsager, L. Reciprocal relations in irreversible processes, II. Phys. Rev. 1932, 38, 2265–2279. [Google Scholar] [CrossRef]
  8. De Groot, S.R. Thermodynamics of Irreversible Processes; North-Holland: Amsterdam, The Netherlands, 1951. [Google Scholar]
  9. Prigogine, I. Thermodynamics of Irreversible Processes; Interscience: New York, NY, USA, 1955. [Google Scholar]
  10. Einstein, A. Über die von der molekularkinetischen Theorie der Wärme geforderte Bewegung von in ruhenden Flüssigkeiten suspendierten Teilchen. Ann. Phys. 1905, 322, 549–560. [Google Scholar] [CrossRef]
  11. Even though Robert Brown [13] is credited with the discovery of Brownian motion, the chaotic motion of atoms was first conjectured by Leukippos—the ancient Greek philosopher to first develop the theory of atomism. In Book II of his poem De Rerum Natura (On the Nature of the Universe) Lucretius (99–55 b.c.) attributes the observed disordered motion of dust lit by a sunray to Leukippos. He goes on to state that Leukippos asserted that the irregular and extremely fast motion of atoms is the cause for the slower motion of the larger dust particles ([12], p. 23).
  12. Russo, L. The Forgotten Revolution: How Science was Born in 300 B.C. and Why it Had to be Reborn; Springer: Berlin, Germany, 2004. [Google Scholar]
  13. Brown, R. A brief account of microscopical observations made in the months of June, July, and August, 1827, on the particles contained in the pollen of plants; and on the general existence of active molecules in organic and inorganic bodies. Philos. Mag. 1827, 4, 161–173. [Google Scholar] [CrossRef]
  14. Jüttner, F. Das Maxwellsche Gesetz der Geschwindigkeitsverteilung in der Relativtheorie. Ann. Phys. 1911, 34, 856–882. [Google Scholar] [CrossRef]
  15. Smoluchowski, M. Zur kinetischen Theorie der Brownschen Molekularbewegung und der Suspensionen. Ann. Phys. 1906, 21, 756–780. [Google Scholar] [CrossRef]
  16. Langevin, P. Sur la théorie du mouvement Brownien. C. R. Acad. Sci. Paris 1908, 146, 530–533. [Google Scholar]
  17. It is important to note here that Brownian motion is not a Markov process on arbitrary time scales since the Markov assumption (i.e., the conditional probability distribution of the future states only depend on the current state) does not hold for the detailed dynamics of the Brownian particle collisions. However, the outcome of each Brownian particle collision only depends on the initial condition of the collision, and hence, on the most recent collision, which is precisely the Markov assumption. Thus, on a time scale of the same order of magnitude as the mean time between particle collisions (i.e., the Markov-Einstein time scale), Brownian motion can be assumed to be Markovian.
  18. Bochkov, G.N.; Kuzovlev, Y.E. General theory of thermal fluctuations in nonlinear systems. J. Exp. Theor. Phys. 1977, 45, 125–130. [Google Scholar]
  19. Bochkov, G.N.; Kuzovlev, Y.E. Fluctuation-dissipation relations for nonequilibrium processes in open systems. J. Exp. Theor. Phys. 1979, 49, 543–551. [Google Scholar]
  20. Gallavotti, G.; Cohen, E.G.D. Dynamical ensembles in nonequilibrium statistical mechanics. Phys. Rev. Lett. 1995, 74, 2694–2697. [Google Scholar] [CrossRef] [PubMed]
  21. Kurchan, J. Fluctuation theorem for stochastic dynamics. J. Phys. A 1998, 31, 3719–3729. [Google Scholar] [CrossRef]
  22. Lebowitz, J.L.; Spohn, H. A Gallavotti-Cohen-type symmetry in the large deviation functional for stochastic dynamics. J. Stat. Phys. 1999, 95, 333–365. [Google Scholar] [CrossRef]
  23. Evans, D.J.; Searles, D.J. Equilibrium microstates which generate second law violating steady states. Phys. Rev. E 1994, 50, 1645–1648. [Google Scholar] [CrossRef]
  24. Jarzynski, C. Nonequilibrium equality for free energy differences. Phys. Rev. Lett. 1997, 78, 2690–2693. [Google Scholar] [CrossRef]
  25. Jarzynski, C. Equilibrium free-energy differences from nonequilibrium measurements: A master-equation approach. Phys. Rev. E 1997, 56, 5018–5035. [Google Scholar] [CrossRef]
  26. Crooks, G.E. Entropy production fluctuation theorem and the nonequilibrium work relation for free energy differences. Phys. Rev. E 1999, 60, 2721–2726. [Google Scholar] [CrossRef]
  27. Crooks, G.E. Path-ensemble averages in systems driven far from equilibrium. Phys. Rev. E 2000, 61, 2361–2366. [Google Scholar] [CrossRef]
  28. Hummer, G.; Szabo, A. Free energy reconstruction from nonequilibrium single-molecule pulling experiments. Proc. Natl. Acad. Sci. USA 2001, 98, 3658–3661. [Google Scholar] [CrossRef] [PubMed]
  29. Van Kampen, N.G. Stochastic Processes in Physics and Chemistry; Elsevier: Amsterdam, The Netherlands, 1992. [Google Scholar]
  30. Haddad, W.M.; Chellaboina, V.; Nersesov, S.G. Thermodynamics: A Dynamical Systems Approach; Princeton University Press: Princeton, NJ, USA, 2005. [Google Scholar]
  31. Khasminskii, R.Z. Stochastic Stability of Differential Equations; Springer: Berlin, Germany, 2012. [Google Scholar]
  32. Arnold, L. Stochastic Differential Equations: Theory and Applications; Wiley-Interscience: New York, NY, USA, 1974. [Google Scholar]
  33. Oksendal, B. Stochastic Differential Equations: An Introduction with Applications; Springer: Berlin, Germany, 1995. [Google Scholar]
  34. Mao, X.; Yuan, C. Stochastic Differential Equations with Markovian Switching; Imperial College Press: London, UK, 2006. [Google Scholar]
  35. Mao, X. Stochastic Differential Equations and Applications; Harwood: New York, NY, USA, 1997. [Google Scholar]
  36. The ℙ-outer measure on a set Ω is an isotone (i.e., order-preserving), countably additive, extended real-valued set function defined for all subsets of Ω with ℙ(⌀)=0.
  37. Rajpurohit, T.; Haddad, W.M. Lyapunov and converse Lyapunov theorems for stochastic semistability. Syst. Control Lett. 2016, 97, 83–90. [Google Scholar] [CrossRef]
  38. Gard, T.C. Introduction to Stochastic Differential Equations; Marcel Dekker: New York, NY, USA, 1988. [Google Scholar]
  39. Itô, K. Differential equations determining Markov processes. Zenkoku Shijo Sugaku Danwaki 1942, 1077, 1352–1400. [Google Scholar]
  40. Itô, K. Stochastic integral. Proc. Imp. Acad. Tokyo 1944, 20, 519–524. [Google Scholar] [CrossRef]
  41. Wu, Z.J.; Xie, X.J.; Shi, P.; Xia, Y.Q. Backstepping controller design for a class of stochastic nonlinear systems with Markovian switching. Automatica 2009, 45, 997–1004. [Google Scholar] [CrossRef]
  42. Brzezniak, Z.; Capinski, M.; Flandoli, F. Pathwise global attractors for stationary random dynamical systems. Prob. Theory Relat. Fields 1993, 95, 87–102. [Google Scholar] [CrossRef]
  43. Crauel, H.; Flandoli, F. Attractors of random dynamical systems. Prob. Theory Relat. Fields 1994, 100, 365–393. [Google Scholar] [CrossRef]
  44. Crauel, H.; Debussche, A.; Flandoli, F. Random attractors. J. Dyn. Differ. Equ. 1997, 9, 307–341. [Google Scholar] [CrossRef]
  45. The support of a function is the unique smallest closed set for which the complement of the set has measure zero. A function has a compact support if its support is a compact set.
  46. Haddad, W.M.; Chellaboina, V.; Hui, Q. Nonnegative and Compartmental Dynamical Systems; Princeton University Press: Princeton, NJ, USA, 2010. [Google Scholar]
  47. Haddad, W.M.; Chellaboina, V. Nonlinear Dynamical Systems and Control: A Lyapunov-Based Approach; Princeton University Press: Princeton, NJ, USA, 2008. [Google Scholar]
  48. Khalil, H.K. Nonlinear Systems, 3rd ed.; Prentice-Hall: Upper Saddle River, NJ, USA, 2002. [Google Scholar]
  49. Meyn, S.P.; Tweedie, R.L. Markov Chains and Stochastic Stability; Springer: London, UK, 1993. [Google Scholar]
  50. Mao, X. Stochastic versions of the LaSalle theorem. J. Differ. Equ. 1999, 153, 175–195. [Google Scholar] [CrossRef]
  51. Liptser, R.S.; Shiryayev, A.N. Theory of Martingales; Kluwer Academic Publishers: Norwell, MA, USA, 1989. [Google Scholar]
  52. Rajpurohit, T.; Haddad, W.M. Lyapunov and converse Lyapunov theorems for stochastic semistability. In Proceedings of the 55th IEEE Conference on Decision and Control, Las Vegas, NV, USA, 10–13 December 2016; pp. 83–90. [Google Scholar]
  53. Arapostathis, A.; Borkar, V.S.; Ghosh, M.K. Ergodic Control of Diffusion Processes; Cambridge University Press: Cambridge, UK, 2012. [Google Scholar]
  54. Rajpurohit, T.; Haddad, W.M. Dissipativity theory for nonlinear stochastic dynamical systems. IEEE Trans. Autom. Control 2017, 62, 1684–1699. [Google Scholar] [CrossRef]
  55. Lévy, P.P. Théorie de L’addition des Variables Aléatoires; Gauthier-Villars: Paris, France, 1937. [Google Scholar]
  56. Arnold, V.I. Mathematical Models of Classical Mechanics; Springer: New York, NY, USA, 1989. [Google Scholar]
  57. It can be argued here that a more appropriate terminology is assumptions rather than axioms since, as will be seen, these are statements taken to be true and used as premises in order to infer certain results, but may not otherwise be accepted. However, as we will see, these statements are equivalent (within our formulation) to the stipulated postulates of the zeroth and second laws of thermodynamics involving transitivity of a thermal equilibrium and heat flowing from hotter to colder bodies, and as such we refer to them as axioms.
  58. Berman, A.; Plemmons, R.J. Nonnegative Matrices in the Mathematical Sciences; Academic: New York, NY, USA, 1979. [Google Scholar]
  59. It is important to note that our formulation of the second law of thermodynamics as given by Axiom (ii) does not require the mentioning of temperature nor the more primitive subjective notions of hotness or coldness. As we will see later, temperature is defined in terms of the system entropy after we establish the existence of a unique, two-times continuously differentiable entropy function for G .
  60. Since in this section we are not considering work performed by and on the system, the notions of an isolated system and an adiabatically isolated system are equivalent.
  61. Meixner, J. On the foundation of thermodynamics of processes. In A Critical Review of Thermodynamics; Stuart, E.B., Gal-Or, B., Brainard, A.J., Eds.; Mono Book Corp.: Baltimore, MD, USA, 1970; pp. 37–47. [Google Scholar]
  62. Lavenda, B. Thermodynamics of Irreversible Processes; Macmillan: London, UK, 1978; Dover: New York, NY, USA, 1993. [Google Scholar]
  63. The assumption σkk(E) ≥ 0, E R ¯ + q , and σkk(E) = 0 if and only if Ek = 0 for some k∈{1,…,q} implies that if the k-th subsystem possesses no energy, then this subsystem cannot dissipate energy to the environment. Conversely, if the k-th subsystem does not dissipate energy to the environment, then this subsystem has no energy.
  64. Lyon, R.H. Statistical Energy Analysis of Dynamical Systems: Theory and Applications; MIT Press: Cambridge, MA, USA, 1975. [Google Scholar]
  65. Bernstein, D.S.; Hyland, D.C. Compartmental modeling and second-moment analysis of state space systems. SIAM J. Matrix Anal. Appl. 1993, 14, 880–901. [Google Scholar] [CrossRef]
  66. Bernstein, D.S.; Bhat, S.P. Energy equipartition and the emergence of damping in lossless systems. In Proceedings of the 41st IEEE Conference on Decision and Control, Las Vegas, NV, USA, 10–13 December 2002; pp. 2913–2918. [Google Scholar]
  67. Pearson, R.K.; Johnson, T.L. Energy equipartition and fluctuation-dissipation theorems for damped flexible structures. Q. Appl. Math. 1987, 45, 223–238. [Google Scholar] [CrossRef]
  68. Hall, S.R.; MacMartin, D.G.; Bernstein, D.S. Covariance averaging in the analysis of uncertain systems. IEEE Trans. Autom. Control 1992, 38, 1842–1859. [Google Scholar]
Figure 1. Large-scale dynamical system G with D ( E ) = 0 and J ( E ) = 0 .
Figure 2. Thermodynamic model with undirected heat flow.
Figure 3. Sample average along with the sample standard deviation of the system energies versus time; E 1 ( t ) in blue, E 2 ( t ) in red, E 3 ( t ) in green, E 4 ( t ) in magenta, and E 5 ( t ) in black.
