On the Relation between Topological Entropy and Restoration Entropy

Kawan, Christoph

doi:10.3390/e21010007

Open AccessFeature PaperArticle

On the Relation between Topological Entropy and Restoration Entropy

by

Christoph Kawan

Fakultät für Informatik und Mathematik, Universität Passau, Innstraße 33, 94032 Passau, Germany

Entropy 2019, 21(1), 7; https://doi.org/10.3390/e21010007

Submission received: 28 November 2018 / Revised: 17 December 2018 / Accepted: 20 December 2018 / Published: 23 December 2018

(This article belongs to the Special Issue Entropy in Networked Control)

Download Versions Notes

Abstract

:

In the context of state estimation under communication constraints, several notions of dynamical entropy play a fundamental role, among them: topological entropy and restoration entropy. In this paper, we present a theorem that demonstrates that for most dynamical systems, restoration entropy strictly exceeds topological entropy. This implies that robust estimation policies in general require a higher rate of data transmission than non-robust ones. The proof of our theorem is quite short, but uses sophisticated tools from the theory of smooth dynamical systems.

Keywords:

topological entropy; restoration entropy; state estimation under communication constraints; SRB measures; Anosov diffeomorphisms

1. Introduction

This paper compares two notions of entropy that are relevant in the context of state estimation under communication constraints. Since the work of Savkin [1], it has been well known that the topological entropy of a dynamical system characterizes the smallest rate of information above which an estimator, receiving its state information at the corresponding rate, is able to generate a state estimate of arbitrary precision. Topological entropy is a quantity that has been studied in the mathematical field of dynamical systems since the 1960s and has turned out to be a useful tool for solving many theoretical and practical problems, cf. the survey [2] and the monograph [3]. A big drawback of this notion in the context of state estimation is that topological entropy is highly discontinuous with respect to the dynamical system under consideration in any reasonable topology, cf. [4]. As a consequence, estimation policies based on topological entropy are likely to suffer from a lack of robustness. Additionally, topological entropy is very hard to compute or estimate. There are only few numerical approaches that potentially work for multi-dimensional systems, cf. [5,6,7,8], and each of them has its drawbacks and restrictions.

A possible remedy for these problems is provided in the works [9,10] of Matveev and Pogromsky. One of the main ideas in these papers is to replace the topological entropy as a figure-of-merit for the necessary rate of data transmission with a possibly larger quantity, named restoration entropy, which describes the smallest data rate above which a more robust form of state estimation can be achieved (called regular observability in [9,10]).

Looking at one of the simplest types of nonlinear dynamical systems, namely Anosov diffeomorphisms, the main result of the paper at hand demonstrates that for most dynamical systems, we have to expect that the restoration entropy strictly exceeds the topological entropy. That is, to achieve a state estimation objective that is more robust with respect to perturbations, one has to pay the price of using a channel that allows for a larger rate of data transmission. More specifically, our result shows that the equality of topological and restoration entropy implies a great amount of uniformity in the dynamical system under consideration, which can be expressed in terms of the unstable Lyapunov exponents at each point, whose sum essentially has to be a constant. Such a property can easily be destroyed by a small perturbation, showing that arbitrarily close to the given system, we find systems whose restoration entropy strictly exceeds their topological entropy. Since Anosov diffeomorphisms are considered as a paradigmatic class of chaotic dynamical systems, this property can be expected for a much larger class of systems.

To prove our result, we need a number of high-level concepts and results from the theory of topological, measurable, and smooth dynamical systems. This includes the concepts of topological and metric pressure, Lyapunov exponents, SRB measures, and uniform hyperbolicity.

For further reading on the topic of state estimation under communication constraints, we refer the reader to [1,9,10,11,12,13,14] and the references given therein.

The structure of this paper is as follows: In Section 2, we collect all necessary definitions and results from the theory of dynamical systems. Section 3 introduces the concept of restoration entropy and explains its operational meaning in the context of estimation under communication constraints. In Section 4, we prove our main result and provide some interpretation and an example. Finally, Section 5 contains some concluding remarks.

2. Tools from Dynamical Systems

Notation: By

Z

, we denote the set of all integers, by

N

the set of positive integers, and

N_{0} : = {0} \cup N

. All logarithms are taken to the base two. If M is a Riemannian manifold, we write

| \cdot |

for the induced norm on any tangent space

T_{x} M

,

x \in M

. The notation

∥ \cdot ∥

is reserved for operator norms. We write

cl A

and

int A

for the closure and the interior of a set A in a metric space, respectively. Finally, the notation

A \subset B

(A subset of B) does not exclude the case

A = B

.

In this paper, we use several sophisticated results from the theory of dynamical systems, in particular from smooth ergodic theory. In the following, we try to explain these results without going too much into technical details.

Let

T : X \to X

be a continuous map on a compact metric space

(X, d)

. Via its iterates:

T^{0} : = {id}_{X}, T^{n + 1} : = T \circ T^{n}, n = 0, 1, 2, \dots

the map T generates a discrete-time dynamical system on X with associated orbits

{T^{n} (x)}_{n \in N_{0}}

,

x \in X

. We call the pair

(X, T)

a topological dynamical system (TDS).

2.1. Entropy and Pressure

Let

(X, T)

be a TDS. The topological entropy

h_{top} (T)

measures the total exponential complexity of the orbit structure of

(X, T)

in terms of the maximal numbers of finite-time orbits that are distinguishable w.r.t. to a finite resolution. One amongst different possible formal definitions is as follows. For

n \in N

and

ε > 0

, a set

E \subset X

is called

(n, ε, T)

-separated if for any

x, y \in E

with

x \neq y

, we have:

d (T^{i} (x), T^{i} (y)) \geq ε for at least one 0 \leq i < n .

That is, we can distinguish any two points in E at a resolution of

ε

by looking at their length-n finite-time orbits. By the compactness of X, there is a uniform upper bound on the cardinality of any (n, ε, T)-separated set. Writing r(n, ε, T) for the maximal possible cardinality,

\begin{matrix} h_{top} (T) : = lim_{ε ↓ 0} \underset{n \to \infty}{lim sup} \frac{1}{n} log r (n, ε, T) . \end{matrix}

This definition is due to Bowen [15] and (independently) Dinaburg [16]. However, it should be noted that the first definition of topological entropy, given by Adler, Konheim, and McAndrew [17], was in terms of open covers of X and was modeled in strict analogy to the metric (= measure−theoretic) entropy defined earlier by Kolmogorov and Sinai [18,19].

To define metric entropy, one additionally needs a Borel probability measure

μ

on X that is preserved by T in the sense that

μ (A) = μ (T^{- 1} (A))

for every Borel set A. By the theorem of Krylov–Bogolyubov, every continuous map on a compact space admits at least one such measure, cf. [20], Theorem 4.1.1. We write

M_{T}

for the set of all T-invariant Borel probability measures. For any finite measurable partition

P

of X, we define the entropy of T on

P

by:

h_{μ} (T; P) : = lim_{n \to \infty} \frac{1}{n} H_{μ} (⋁_{i = 0}^{n - 1} T^{- i} P) .

Here, ⋁ denotes the join operation. That is,

⋁_{i = 0}^{n - 1} T^{- i} P

is the partition of X whose elements are all intersections of the form

P_{0} \cap T^{- 1} (P_{1}) \cap \dots \cap T^{- n + 1} (P_{n - 1})

with

P_{i} \in P

. Moreover,

H_{μ} (\cdot)

denotes the Shannon entropy of a partition, i.e.,

H_{μ} (Q) = - \sum_{Q \in Q} μ (Q) log μ (Q)

for any finite partition

Q

. The metric entropy of T w.r.t.

μ

is then defined by:

h_{μ} (T) : = sup_{P} h_{μ} (T; P),

the supremum taken over all finite measurable partitions

P

of X (replacing measurable partitions with open covers and Shannon entropy with the logarithm of the cardinality of a minimal finite subcover, the same construction yields the topological entropy as defined in [17]).

To understand the meaning of

h_{μ}

, note that

H_{μ} (Q)

is the average amount of uncertainty as one attempts to predict the partition element to which a randomly-chosen point belongs. Hence,

h_{μ} (T)

measures the average uncertainty per iteration in guessing the partition element of a typical length-n orbit.

The variational principle for entropy states that:

h_{top} (T) = sup_{μ \in M_{T}} h_{μ} (T),

(1)

where the supremum is not necessarily a maximum. This variational principle can be regarded as a quantitative version of the theorem of Krylov–Bogolyubov.

Another concept (of which entropy is a special case) used in dynamical systems and inspired by ideas in thermodynamics is pressure. In this context, any continuous function

ϕ : X \to R

, also called a potential or an observable, gives rises to the metric pressure of T w.r.t.

ϕ

for a given

μ \in M_{T}

, defined as:

P_{μ} (T, ϕ) : = h_{μ} (T) + \int ϕ d μ .

To define an associated notion of topological pressure, put

S_{n} ϕ (x) : = \sum_{i = 0}^{n - 1} ϕ (T^{i} (x))

and:

R (n, ε, ϕ; T) : = sup \{\sum_{x \in E} 2^{S_{n} ϕ (x)} : E \subset X is (n, ε, T) - separated\} .

Then, the topological pressure of T w.r.t.

ϕ

is given by:

P_{top} (T, ϕ) : = lim_{ε ↓ 0} \underset{n \to \infty}{lim sup} \frac{1}{n} log R (n, ε, ϕ; T) .

The associated variational principle, first proven in [21], reads:

P_{top} (T, ϕ) = sup_{μ \in M_{T}} P_{μ} (T, ϕ),

(2)

which includes (1) as a special case (simply put

ϕ = 0

).

2.2. Subadditive Cocycles

Let

T : X \to X

be a map. A subadditive cocycle over

(X, T)

is a sequence

{(f_{n})}_{n \in N_{0}}

of functions

f_{n} : X \to R

satisfying:

f_{n + m} (x) \leq f_{n} (x) + f_{m} (T^{n} (x)), \forall n, m \in N_{0}, x \in X .

If equality holds in this relation, we call

{(f_{n})}_{n \in N_{0}}

an additive cocycle over

(X, T)

.

If X has the structure of a probability space with a

σ

-algebra

F

and a probability measure

μ

on

F

, T is measurable, and

μ

is T-invariant, we speak of a measurable subadditive cocycle provided that all

f_{n}

are measurable. In the context of a TDS

(X, T)

, we speak of a continuous subadditive cocycle if all

f_{n}

are continuous.

The most fundamental result about subadditive cocycles is Kingman’s subadditive ergodic theorem, cf. [3], Theorem 2.1.4:

Theorem 1.

Let

T : X \to X

be a measure-preserving map on a probability space

(X, F, μ)

and

{(f_{n})}_{n \in N_{0}}

a measurable subadditive cocycle over

(X, T)

such that each

f_{n}

is integrable. Then, the limit:

lim_{n \to \infty} \frac{1}{n} f_{n} (x)

exists for μ-almost every

x \in X

. If, additionally, μ is ergodic, then the limit is constant with:

lim_{n \to \infty} \frac{1}{n} f_{n} (x) = lim_{n \to \infty} \frac{1}{n} \int f_{n} d μ .

(3)

Observe that the limit on the right-hand side of (3) always exists by Fekete’s subadditivity lemma (see [3], Fact 2.1.1), because the sequence

a_{n} : = \int f_{n} d μ

is subadditive, i.e.,

a_{n + m} \leq a_{n} + a_{m}

. Kingman’s theorem can, in particular, be applied if

(X, T)

is a TDS,

μ \in M_{T}

, and

{(f_{n})}_{n \in N_{0}}

is a continuous subadditive cocycle.

Now, we consider again a TDS

(X, T)

and a continuous subadditive cocycle

{(f_{n})}_{n \in N_{0}}

over

(X, T)

. We define the extremal growth rate of

(f_{n})

by:

β [(f_{n})] : = sup_{x \in X} \underset{n \to \infty}{lim sup} \frac{1}{n} f_{n} (x) .

The following result is well known and can be found in [22], Theorem A.3, for instance:

Lemma 1.

Let

{(f_{n})}_{n \in N_{0}}

be a continuous subadditive cocycle over a TDS

(X, T)

. Then:

β [(f_{n})] = sup_{μ \in M_{T}} inf_{n > 0} \frac{1}{n} \int f_{n} d μ = inf_{n > 0} sup_{x \in X} \frac{1}{n} f_{n} (x) = inf_{n > 0} sup_{μ \in M_{T}} \frac{1}{n} \int f_{n} d μ .

Here, all infima can be replaced with limits. Moreover, every supremum is attained.

2.3. Lyapunov Exponents, SRB Measures, and Pesin’s Formula

To describe the long-term dynamical behavior of smooth systems, the notion of Lyapunov exponents is crucial. Given a

C^{1}

-diffeomorphism

T : M \to M

on a compact Riemannian manifold M, the Lyapunov exponent at

x \in M

in direction

0 \neq v \in T_{x} M

is the number:

λ (x, v) : = lim_{n \to \infty} \frac{1}{n} log | D T^{n} (x) v |,

provided that the limit exists. Lyapunov exponents measure how fast nearby solutions diverge from each other. The most general result on their existence and their properties is the multiplicative ergodic theorem (MET), also known as Oseledets theorem, cf. [23,24]. We need the following version of the theorem (which is not the most general):

Theorem 2.

Let

T : M \to M

be a

C^{1}

-diffeomorphism of a compact Riemannian manifold M and

μ \in M_{T}

. Then, there exists a Borel set

Ω \subset M

with

μ (Ω) = 1

and

T (Ω) = Ω

such that the following holds: for every

x \in Ω

, there exist numbers

λ_{1} (x) > \dots > λ_{r (x)} (x)

, and the tangent space at x splits into linear subspaces as:

T_{x} M = E_{1} (x) \oplus \dots \oplus E_{r (x)} (x)

such that the following properties hold:

(i): For every $0 \neq v \in E_{i} (x)$ , we have:

$lim_{n \to \pm \infty} \frac{1}{n} log | D T^{n} (x) v | = λ_{i} (x) .$
(ii): The functions $r (\cdot)$ , $dim E_{i} (\cdot)$ , and $λ_{i} (\cdot)$ are measurable and constant along orbits. Moreover,

$D T (x) E_{i} (x) = E_{i} (T (x)), i = 1, \dots, r (x) .$
(iii): For every $x \in Ω$ , the limit:

$Λ_{x} : = lim_{n \to \infty} {(D T^{n} {(x)}^{*} D T^{n} (x))}^{1 / 2 n}$

exists, and the different eigenvalues of $Λ_{x}$ are $2^{λ_{1} (x)}, \dots, 2^{λ_{r (x)} (x)}$ (here, $D T^{n} {(x)}^{*}$ denotes the adjoint of $D T^{n} (x)$ ).

Typically, a given map has a huge number of associated invariant measures. To obtain a good description of the global dynamical behavior, one has to select specific invariant measures that determine the behavior of the system on a large set of initial states. In this context, the notion of an SRB measure (Sinai–Ruelle–Bowen measure) comes into play. An SRB measure is a measure with at least one positive Lyapunov exponent almost everywhere, having absolutely continuous conditional measures on unstable manifolds. We are not going to give a technical definition of the latter property. Instead, we state the following celebrated theorem due to Ledrappier and Young [25], which characterizes this property in terms of metric entropy. Here, we use the short-cut:

λ^{+} (x) : = \sum_{i = 1}^{r (x)} max {0, λ_{i} (x) dim E_{i} (x)}

for the sum of all positive Lyapunov exponents at a point

x \in Ω

, counted with multiplicities.

Theorem 3.

Let

T : M \to M

be a

C^{2}

-diffeomorphism of a compact manifold M and

μ \in M_{T}

. Then, the formula:

h_{μ} (T) = \int λ^{+} d μ

(4)

holds if and only if μ has absolutely continuous conditional measures on unstable manifolds.

Additionally, note that for any

C^{1}

-diffeomorphism T and any

μ \in M_{T}

, the inequality:

h_{μ} (T) \leq \int λ^{+} d μ

(5)

holds, which is known as Ruelle’s inequality or Ruelle–Margulis inequality [26] (Formula (4) was first proven by Pesin for smooth invariant measures).

2.4. Anosov Diffeomorphisms

One of the simplest classes of smooth dynamical systems with complicated dynamical behavior is the class of Anosov diffeomorphisms. In this paper, we use these systems for two reasons. First, they have positive topological entropy, and second, they are very well understood and there are many tools available to describe their properties.

Let M be a compact Riemannian manifold. A

C^{1}

-diffeomorphism

T : M \to M

is called an Anosov diffeomorphism if there exists a splitting:

T_{x} M = E_{x}^{u} \oplus E_{x}^{s}, \forall x \in M

into linear subspaces such that the following conditions are satisfied:

(A1): $D T (x) E_{x}^{u} = E_{T (x)}^{u}$ and $D T (x) E_{x}^{s} = E_{T (x)}^{s}$ for all $x \in M$ .
(A2): There are constants $c \geq 1$ and $λ \in (0, 1)$ , so that, for all $x \in M$ and $n \in N_{0}$ ,

$\begin{matrix} | D T^{n} (x) v | & \leq c λ^{n} | v | for all v \in E_{x}^{s}, \\ | D T^{- n} (x) v | & \leq c λ^{n} | v | for all v \in E_{x}^{u} . \end{matrix}$

From (A1) and (A2), it automatically follows that

E_{x}^{s}

and

E_{x}^{u}

vary continuously with x, cf. [20], Proposition 6.4.4. The existence of a splitting as above is also known as uniform hyperbolicity.

The simplest examples of Anosov diffeomorphisms are hyperbolic linear torus automorphisms, i.e., maps on the n-dimensional torus

T^{n} = R^{n} / Z^{n}

of the form:

T_{A} (x) = A x (\mod Z^{n}), T_{A} : T^{n} \to T^{n},

where

A \in Z^{n \times n}

is an integer matrix satisfying

| det A | = 1

and

| λ | \neq 1

for all eigenvalues

λ

of A. Observe that the assumption

| det A | = 1

guarantees that

T_{A}

is invertible with inverse

T_{A}^{- 1} = T_{A^{- 1}}

(because

A^{- 1}

also has integer entries) and at the same time implies that

T_{A}

is area-preserving. That is, the normalized Lebesgue measure on

T^{n}

is an element of

M_{T_{A}}

. The assumption on the eigenvalues of A together with the fact that the derivative

D T_{A} (x)

at any point

x \in T^{n}

can be identified with A itself implies the Anosov Properties (A1) and (A2).

It is well known that Anosov diffeomorphisms are structurally stable, i.e., any sufficiently small

C^{1}

-perturbation

T_{ε}

of an Anosov diffeomorphism

T : M \to M

is also an Anosov diffeomorphism, which is topologically conjugate to T, see [20], Proposition 6.4.6 and Corollary 18.2.2. That is, there exists a homeomorphism

h : M \to M

, so that:

h^{- 1} \circ T_{ε} \circ h = T .

If we assume that T is an arbitrary Anosov diffeomorphism of the torus, the existence of a unique entropy-maximizing measure

μ

follows. That is,

μ

is the unique element of

M_{T}

satisfying:

h_{top} (T) = h_{μ} (T) .

This follows from a combination of results that can be found in Katok and Hasselblatt [20], namely Theorem 20.3.7, Proposition 18.6.5, Theorem 18.3.9, and Corollary 6.4.10. The entropy-maximizing measure

μ

is also known as the Bowen-measure.

In this context, also the notion of topological mixing is important. An Anosov diffeomorphism (or simply a continuous map)

T : M \to M

is called topologically mixing if for any two nonempty open sets

A, B \subset M

, there exists an integer N such that

T^{n} (A) \cap B \neq \emptyset

for all

n \geq N

. In particular, all Anosov diffeomorphisms on

T^{n}

are topologically mixing ([20], Proposition 18.6.5).

3. State Estimation and Restoration Entropy

The notion of restoration entropy was introduced in [10] for systems given by ODEs on

R^{n}

. However, it is immediately clear from the definition that restoration entropy can be defined for any continuous map on a compact metric space as follows. Let

T : X \to X

be a continuous map on a metric space

(X, d)

and

K \subset X

a compact set with

T (K) \subset K

. For every

x \in X

,

n \in N

and

ε > 0

, let

p (n, x, ε)

denote the smallest number of

ε

-balls needed to cover the image

T^{n} (B_{ε} (x) \cap K)

. If the map is not clear from the context, we also write

p (n, x, ε; T)

. Then:

h_{res} (T_{| K}) : = lim_{n \to \infty} \frac{1}{n} \underset{ε ↓ 0}{lim sup} sup_{x \in X} log p (n, x, ε) .

The existence of the limit in n follows from the subadditivity of the sequence

a_{n} : = {lim sup}_{ε ↓ 0}

{sup}_{x \in X} log p (n, x, ε)

(using Fekete’s lemma). If we assume that T is a

C^{1}

-diffeomorphism of a compact Riemannian manifold, the numbers

p (n, x, ε)

can be estimated in terms of the unstable singular values of

D T^{n} (x)

. This is related to the simple fact that the image of a ball under a linear map (in our case, the local linear approximation

D T^{n} (x)

to

T^{n}

) is an ellipsoid with semi-axes of lengths proportional to the singular values. This leads to the following result, proven in [10], Theorem 11, for continuous-time systems. The proof carries over to discrete-time systems on Riemannian manifolds without any problem.

Theorem 4.

Let

T : M \to M

be a

C^{1}

-diffeomorphism of a d-dimensional Riemannian manifold M and

K \subset M

a forward-invariant compact set of T with

cl K = cl (int K)

. Then:

h_{res} (T_{| K}) = lim_{n \to \infty} \frac{1}{n} max_{x \in K} \sum_{i = 1}^{d} max {0, log α_{i} (n, x)},

where

α_{1} (n, x) \geq \dots \geq α_{d} (n, x)

denote the singular values of

D T^{n} (x)

.

For the analysis of

h_{res}

, based on the above formula, the following observations are crucial:

We have

$\sum_{i = 1}^{d} max {0, log α_{i} (n, x)} = log \prod_{i = 1}^{d} max {1, log α_{i} (n, x)} = log ∥ D T^{n} {(x)}^{\land} ∥,$

where $D T^{n} {(x)}^{\land}$ denotes the linear map induced by $D T^{n} (x)$ between the full exterior algebras of the tangent spaces $T_{x} M$ and $T_{T^{n} (x)} M$ , respectively; see [27], Chapter I, Proposition 7.4.2.
The sequence $f_{n} (x) : = log ∥ D T^{n} {(x)}^{\land} ∥$ , $f_{n} : M \to R$ , is a continuous subadditive cocycle over $(K, T_{| K})$ , since:

$\begin{matrix} f_{n + m} (x) & = log ∥ D T^{n + m} {(x)}^{\land} ∥ = log ∥ D T^{m} {(T^{n} (x))}^{\land} D T^{n} {(x)}^{\land} ∥ \\ \leq log (∥ D T^{m} {(T^{n} (x))}^{\land} ∥ \cdot ∥ D T^{n} {(x)}^{\land} ∥) \\ = log ∥ D T^{n} {(x)}^{\land} ∥ + log ∥ D T^{m} {(T^{n} (x))}^{\land} ∥ = f_{n} (x) + f_{m} (T^{n} (x)) . \end{matrix}$

Alternatively, this follows from Horn’s inequality for singular values; see [27], Chapter I, Proposition 2.3.1.

In the following, we explain the operational meaning of the quantity

h_{res} (T_{| K})

.

Consider the dynamical system given by:

x_{t + 1} = T (x_{t}), x_{0} \in K, t = 0, 1, 2, \dots

(6)

Suppose that a sensor, fully observing the state

x_{t}

, sends its data to an encoder. At the sampling times

t = 0, 1, 2, \dots

, the encoder sends a signal

e_{t}

through a noise-free discrete channel to a decoder (without transmission delay). The decoder acts as an observer of the system, trying to reconstruct the state from the received data. We write

{\hat{x}}_{t}

for the estimate generated by the observer at time t. Moreover, we assume that we start with an initial estimate

{\hat{x}}_{0} \in K

of a specified accuracy.

With

M

denoting the coding alphabet, the encoder and the observer are described by mappings:

e_{t} = C_{t} (x_{0}, x_{1}, \dots, x_{t}; {\hat{x}}_{0}, δ), C_{t} : K^{t + 1} \times K \times R_{> 0} \to M,

and:

{\hat{x}}_{t} = E_{t} (e_{0}, e_{1}, \dots, e_{t}; {\hat{x}}_{0}, δ), E_{t} : M^{t + 1} \times K \times R_{> 0} \to X .

The argument

δ

corresponds to the initial error at time zero, i.e.,

d (x_{0}, {\hat{x}}_{0}) \leq δ

. In particular, we assume that both the encoder and the observer are given the data

{\hat{x}}_{0}

and

δ

.

We assume that the channel can transmit at least

b_{-} (r)

and at most

b_{+} (r)

bits in any time interval of length r. The capacity of the channel is then defined by:

C : = lim_{r \to \infty} \frac{b_{-} (r)}{r} = lim_{r \to \infty} \frac{b_{+} (r)}{r},

assuming that these limits exist and coincide.

We consider the following two observation objectives:

(O1): The observer observes the system with exactness $ε > 0$ if there exists $δ = δ (ε, K)$ , so that $x_{0}, {\hat{x}}_{0} \in K$ with $d (x_{0}, {\hat{x}}_{0}) \leq δ$ implies:

$sup_{t \geq 0} d (x_{t}, {\hat{x}}_{t}) \leq ε .$
(O2): The observer regularly observes the system if there exist $G, δ_{*} > 0$ , so that for all $δ \in (0, δ_{*})$ and $x_{0}, {\hat{x}}_{0} \in K$ with $d (x_{0}, {\hat{x}}_{0}) \leq δ$ ,

$sup_{t \geq 0} d (x_{t}, {\hat{x}}_{t}) \leq G δ .$

We say that the system is:

observable on K over a channel of capacity C if for every $ε > 0$ , an observer exists that observes the system with exactness $ε$ over this channel;
regularly observable on K over a channel of capacity C if there exists an observer that regularly observes the system over this channel.

Then, we have the following data-rate theorem, cf. [9], Theorem 8, and [10], Theorem 9.

Theorem 5.

The smallest channel capacity

C_{0}

, so that System (6) is:

observable on K over every channel of capacity $C > C_{0}$ is given by:

$C_{0} = h_{top} (T_{| K}) .$
regularly observable on K over every channel of capacity $C > C_{0}$ is given by:

$C_{0} = h_{res} (T_{| K}) .$

Since regular observability implies observability, it is clear that:

h_{top} (T_{| K}) \leq h_{res} (T_{| K}) .

As already pointed out in the Introduction, the quantity

h_{top} (\cdot)

is highly discontinuous w.r.t. the dynamical system. Moreover, the corresponding data-rate theorem has the disadvantage that the final error

ε

may be much larger than the initial error

δ

, which cannot happen in the case of regular observability. From Theorem 4 in combination with Lemma 1, one sees that in the smooth case,

h_{res}

is an infimum over functions that are continuous w.r.t. T in the

C^{1}

-topology. This implies at least upper semicontinuity. Hence, we can expect that coding and estimation strategies based on restoration entropy enjoy better properties than those based on topological entropy.

4. Results

Before we present our main result, we prove two lemmas, which are of independent interest.

Lemma 2.

Let

T : M \to M

be a

C^{2}

-diffeomorphism on a compact Riemannian manifold M. Then, for any

μ \in M_{T}

, we have:

\int λ^{+} d μ = lim_{n \to \infty} \frac{1}{n} \int log ∥ D T^{n} {(x)}^{\land} ∥ d μ (x) .

Proof.

Let

d = dim M

. First observe that we have the identity:

∥ D T^{n} {(x)}^{\land} ∥ = max \{1, max_{1 \leq k \leq d} \prod_{i = 1}^{k} α_{i} (n, x)\},

where

α_{1} (n, x) \geq \dots \geq α_{d} (n, x)

are the singular values of

D T^{n} (x)

, see [27], Chapter I, Proposition 7.4.2. Hence,

log ∥ D T^{n} {(x)}^{\land} ∥ = max \{0, max_{1 \leq k \leq d} \sum_{i = 1}^{k} log α_{i} (n, x)\} .

The maximum over k is clearly attained when k is the maximal number such that

α_{i} (n, x) > 1

for all

1 \leq i \leq k

. Hence,

log ∥ D T^{n} {(x)}^{\land} ∥ = max \{0, \sum_{α_{i} (n, x) > 1} log α_{i} (n, x)\} .

The numbers

α_{i} (n, x)

are the eigenvalues of

A_{n} (x) : = {(D T^{n} {(x)}^{*} D T^{n} (x))}^{1 / 2}

. Theorem 2 states that

A_{n} {(x)}^{1 / n} \to Λ_{x}

for

μ

-almost every

x \in M

and the logarithms of the eigenvalues of

Λ_{x}

are the Lyapunov exponents at x. Since eigenvalues depend continuously on the matrix, it follows that:

lim_{n \to \infty} \frac{1}{n} log ∥ D T^{n} {(x)}^{\land} ∥ = λ^{+} (x) μ - a . e .

and consequently

\int λ^{+} d μ = \int lim_{n \to \infty} \frac{1}{n} log ∥ D T^{n} {(x)}^{\land} ∥ d μ (x) .

Applying the theorem of dominated convergence then yields the result. □

Lemma 3.

Let

T : M \to M

be a

C^{2}

-diffeomorphism on a compact Riemannian manifold M such that

h_{top} (T) = h_{res} (T)

. Then, if T has an entropy-maximizing measure

μ_{*}

, it follows that:

h_{μ_{*}} (T) = \int λ^{+} d μ_{*} .

Proof.

Assume to the contrary that

h_{μ_{*}} (T) < \int λ^{+} d μ_{*}

(using Ruelle’s inequality (5)). Then, Lemma 2 implies:

h_{top} (T) = h_{μ_{*}} (T) < \int λ^{+} d μ_{*} = lim_{n \to \infty} \frac{1}{n} \int log ∥ D T^{n} {(x)}^{\land} ∥ d μ_{*} (x) .

According to Theorem 4 and the subsequent observation, an application of Lemma 1 yields:

h_{res} (T) = sup_{μ \in M_{T}} lim_{n \to \infty} \frac{1}{n} \int log ∥ D T^{n} {(x)}^{\land} ∥ d μ (x) .

Combining these observations gives

h_{top} (T) < h_{res} (T)

, in contradiction to our assumption. □

Now, we are in a position to state our main result.

Theorem 6.

Let

T : M \to M

be a topologically mixing

C^{2}

-Anosov diffeomorphism on a compact Riemannian manifold M such that

h_{top} (T) = h_{res} (T)

. Then, the unique entropy-maximizing measure

μ_{*} \in M_{T}

is an SRB measure. Moreover, the function:

μ \mapsto \int λ^{+} d μ, M_{T} \to R_{\geq 0}

is constant.

Proof.

First note that the existence and uniqueness of an entropy-maximizing measure

μ_{*}

follows from [20], Theorem 20.3.7, Theorem 18.3.9, and Corollary 6.4.10. Here, the assumption that T is topologically mixing is crucial. By the preceding lemma combined with Theorem 3, we already know that

μ_{*}

has absolutely continuous conditional measures on unstable manifolds. Since an Anosov diffeomorphism has positive Lyapunov exponents everywhere (where they exist), attained in all directions of the unstable subspace

E_{x}^{u}

, it follows that

μ_{*}

is an SRB measure.

Now, let

μ \in M_{T}

be chosen arbitrarily. Due to the invariance of

μ

, we have:

\begin{matrix} {\int log | det D T (x)}_{| E_{x}^{u}} | d μ (x) & = \int \frac{1}{n} \sum_{i = 0}^{n - 1} log | det D T {(T^{i} (x))}_{| E_{T^{i} (x)}^{u}} | d μ (x) \\ = \int \frac{1}{n} log | det D T^{n} {(x)}_{| E_{x}^{u}} | d μ (x) \end{matrix}

for every

n \in N

, implying:

\begin{matrix} \int λ^{+} d μ & = \int lim_{n \to \infty} \frac{1}{n} log | det D T^{n} {(x)}_{| E_{x}^{u}} | d μ (x) \\ = lim_{n \to \infty} \int \frac{1}{n} log | det D T^{n} {(x)}_{| E_{x}^{u}} | d μ (x) = \int log | det D T {(x)}_{| E_{x}^{u}} | d μ (x), \end{matrix}

where we use Kingman’s subadditive ergodic theorem, applied to the continuous additive cocycle

f_{n} (x) : = log | det D T^{n} {(x)}_{| E_{x}^{u}} |

(

n \in N_{0}

), and the theorem of dominated convergence. Observe that the function

J^{u} T (x) : = log | det D T {(x)}_{| E_{x}^{u}} |

is continuous (using the fact that

x \mapsto E_{x}^{u}

is continuous). Hence, we can consider the affine function:

α_{μ} : R \to R, α_{μ} (t) : = P_{μ} (T, - t J^{u} T) = h_{μ} (T) - t \int λ^{+} d μ .

The variational principle (2) for pressure tells us that:

P_{top} (- t J^{u} T) = sup_{μ \in M_{T}} α_{μ} (t), \forall t \in R .

(7)

Hence,

t \mapsto P_{top} (- t J^{u} T)

, as the supremum over affine functions, is a convex function.

Using that

μ_{*}

is the entropy-maximizing measure and Theorem 3, respectively, we obtain:

α_{μ_{*}} (0) = h_{top} (T) and α_{μ_{*}} (1) = 0 .

On the other hand, also:

P_{top} (- 0 \cdot λ^{+}) = h_{top} (T) and P_{top} (- 1 \cdot J^{u} T) = 0 .

The second identity here follows from the fact that

P_{top} (- 1 \cdot J^{u} T) = {sup}_{μ \in M_{T}} (h_{μ} (T) - \int λ^{+} d μ)

and

h_{μ} (T) \leq \int λ^{+} d μ

by Ruelle’s inequality (5). Hence,

P_{top} (- 1 \cdot J^{u} T) = h_{μ_{*}} (T) - \int λ^{+} d μ_{*} = 0

.

By convexity of

t \mapsto P_{top} (- t J^{u} T)

and (7), this implies:

P_{top} (- t J^{u} T) = α_{μ_{*}} (t), \forall t \in R .

From (7), it now follows that all of the maps

α_{μ}

have the same slope, i.e.,

\int λ^{+} d μ

is independent of

μ

. □

The above theorem shows that the equality

h_{top} (T) = h_{res} (T)

is a very restrictive condition. Indeed, this can be seen as follows. Any topologically mixing Anosov diffeomorphism has an abundance of periodic points. Indeed, the set of periodic points is dense in M; see [20], Corollary 6.4.19. If we consider a periodic point

p \in M

of period

n_{p} \in N

, we can consider the invariant measure

μ_{p}

given by:

μ_{p} : = \frac{1}{n_{p}} \sum_{i = 0}^{n_{p} - 1} δ_{T^{i} (p)}

with

δ_{(\cdot)}

being the Dirac measure at a point. The above theorem implies that, under

h_{top} (T) = h_{res} (T)

, the number:

γ (p) : = \int λ^{+} d μ_{p} = \frac{1}{n_{p}} log |det (D T^{n_{p}} {(p)}_{| E_{p}^{u}} : E_{p}^{u} \to E_{p}^{u})|

is independent of the periodic point p chosen. On the other hand, we know that every sufficiently small

C^{2}

-perturbation of T yields another

C^{2}

-Anosov diffeomorphism, topologically conjugate to T, hence also topologically mixing. If this perturbation is only performed in a small vicinity of a fixed periodic orbit, it can easily change the number

γ (p)

, while not changing it for most of the other periodic orbits. As a consequence, the perturbed diffeomorphism

T_{ε}

cannot satisfy

h_{top} (T_{ε}) = h_{res} (T_{ε})

.

The following corollary gives another characterization of Anosov diffeomorphisms with

h_{top} = h_{res}

in a two-dimensional case.

Corollary 1.

Consider a

C^{2}

area-preserving Anosov diffeomorphism

T : T^{2} \to T^{2}

of the two-torus. Then, the equality

h_{top} (T) = h_{res} (T)

is equivalent to the existence of a hyperbolic linear automorphism

T_{A} : T^{2} \to T^{2}

and a

C^{1}

-diffeomorphism

h : T^{2} \to T^{2}

such that

h^{- 1} \circ T \circ h = T_{A}

.

Proof.

It follows immediately from Theorem 6 in combination with [20], Corollary 20.4.4, that the identity

h_{top} (T) = h_{res} (T)

implies the existence of a

C^{1}

-conjugacy, as asserted. The other direction is easy to see, using the definition of restoration entropy. If

h^{- 1} \circ T \circ h = T_{A}

, then also

h^{- 1} \circ T^{n} \circ h = T_{A}^{n}

for all

n \in N

. We use that a

C^{1}

-map on a compact manifold has a global Lipschitz constant. Let

L : = Lip (h)

and

L^{'} : = Lip (h^{- 1})

be Lipschitz constants of h and

h^{- 1}

, respectively. Then:

T^{n} (B_{ε} (x)) = h \circ T_{A}^{n} \circ h^{- 1} (B_{ε} (x)) .

Observe that

h^{- 1} (B_{ε} (x)) \subset B_{L^{'} ε} (h^{- 1} (x))

. Let

N (l)

denote the minimal number of

ε

-balls needed to cover an

l ε

-ball in

T^{2}

for any

l > 0

. Then, the minimal number of

ε

-balls needed to cover

T_{A}^{n} h^{- 1} (B_{ε} (x))

is bounded from above by

N (L^{'}) {sup}_{z \in T^{2}} p (n, z, ε; T_{A})

. This implies:

p (n, x, ε; T) \leq N (L) N (L^{'}) sup_{z \in T^{2}} p (n, z, ε; T_{A}) .

Hence,

sup_{x \in T^{2}} \frac{1}{n} log p (n, x, ε; T) \leq \frac{1}{n} log N (L) N (L^{'}) + sup_{x \in T^{2}} \frac{1}{n} log p (n, x, ε; T_{A}) .

Taking the lim sup for

ε ↓ 0

and subsequently the limit for

n \to \infty

, we obtain that

h_{res} (T) \leq h_{res} (T_{A})

. The other inequality can be proven analogously, so:

h_{res} (T) = h_{res} (T_{A}) .

Since T and

T_{A}

are topologically conjugate (the

C^{1}

-diffeomorphism h is a homeomorphism, in particular), they also have the same topological entropy:

h_{top} (T) = h_{top} (T_{A}) .

To complete the proof, it now suffices to show that

h_{res} (T_{A}) = h_{top} (T_{A})

. We can compute

h_{res} (T_{A})

using Theorem 4. To this end, observe that A is a hyperbolic matrix. If

| λ_{1} | > 1 > | λ_{2} |

are its eigenvalues, we obtain:

lim_{n \to \infty} \frac{1}{n} \sum_{i = 1}^{2} max {0, log α_{i} (n, x)} = log | λ_{1} | \forall x \in T^{2},

implying

h_{res} (T_{A}) = log | λ_{1} |

. It is well known that this is also the value of the topological entropy

h_{top} (T_{A})

; see [20], Section 4. This also follows from the combination of the variational principle with Theorem 3. □

The following example demonstrates how restrictive the condition

h_{res} (T) = h_{top} (T)

is by looking at small perturbations of Arnold’s Cat Map.

Example 1.

Arnold’s Cat Map is the hyperbolic linear two-torus automorphism

T_{A} : T^{2} \to T^{2}

induced by the integer matrix:

A : = (\begin{matrix} 2 & 1 \\ 1 & 1 \end{matrix})

with determinant

det A = 1

. Observe that the derivative

D T_{A} (x)

can be identified with A for each

x \in T^{2}

. Since A is a hyperbolic matrix with eigenvalues:

γ_{1} = \frac{3}{2} - \frac{1}{2} \sqrt{5} a n d γ_{2} = \frac{3}{2} + \frac{1}{2} \sqrt{5}

satisfying

| γ_{2} | > 1 > | γ_{1} |

, it follows that

T_{A}

is a

C^{\infty}

area-preserving Anosov diffeomorphism. Hence, Corollary 1 yields:

h_{top} (T_{A}) = h_{res} (T_{A}) = log | γ_{2} | .

Now, we consider a perturbation of the form:

T_{A}^{ε} (x, y) : = (2 x + y + ε sin (2 π x), x + y) (\mod Z^{2}), ε > 0

which is well defined as a torus map, since the sine function is

2 π

-periodic. By the structural stability of Anosov diffeomorphisms, for a sufficiently small ε, this map is topologically conjugate to

T_{A}

, hence has the same topological entropy

log | γ_{2} |

. However, its restoration entropy is strictly greater. This can be seen by looking at the fixed point

(0, 0)

with the associated derivative:

D T_{A}^{ε} (0, 0) = (\begin{matrix} 2 + 2 π ε & 1 \\ 1 & 1 \end{matrix}) .

The eigenvalues of this matrix can be computed as:

λ_{\pm} = \frac{3}{2} + π ε \pm \frac{1}{2} \sqrt{5 + 4 π ε (1 + π ε)} .

Since

λ_{+} > γ_{2}

, Theorem 4 yields

h_{res} (T_{A}^{ε}) \geq log | λ_{+} | > h_{top} (T_{A}^{ε})

for

ε > 0

sufficiently small.

5. Conclusions

In this paper, we compared two notions of entropy for dynamical systems that have an operational meaning in the context of state estimation over digital channels: topological entropy and restoration entropy. Looking at Anosov diffeomorphisms (a paradigmatic class of chaotic dynamical systems), our main result demonstrates that the equality of these two quantities implies a great amount of uniformity in the given system. For area-preserving Anosov diffeomorphisms on the two-torus, this uniformity can be expressed in terms of the existence of a

C^{1}

-conjugacy to a linear system. Hence, we can conclude that for most dynamical systems, the strict inequality

h_{top} < h_{res}

holds. The operational meaning of this inequality is that for regular observability, as defined in Section 3, a strictly larger channel capacity is necessary than for observability.

Funding

This research received no external funding.

Acknowledgments

The author owes particular thanks to Katrin Gelfert, who provided one of the main ideas in the proof of Theorem 6 during the Mini-Workshop Entropy, Information and Control held at the Mathematisches Forschungsinstitut Oberwolfach from 4–10 March 2018. The author also thanks Alexander Pogromsky for fruitful discussions on restoration entropy.

Conflicts of Interest

The author declares no conflict of interest.

References

Savkin, A.V. Analysis and synthesis of networked control systems: Topological entropy, observability, robustness and optimal control. Autom. J. IFAC 2006, 42, 51–62. [Google Scholar] [CrossRef]
Katok, A. Fifty years of entropy in dynamics: 1958–2007. J. Mod. Dyn. 2007, 1, 545–596. [Google Scholar] [CrossRef]
Downarowicz, T. Entropy in Dynamical Systems; New Mathematical Monographs Volume 18; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
Misiurewicz, M. On non-continuity of topological entropy. Bull. Acad. Polon. Sci. Str. Sci Math. Astronom. Phys. 1971, 19, 319–320. [Google Scholar]
Chen, Q.; Ott, E.; Hurd, L. Calculating topological entropies of chaotic dynamical systems. Phys. Lett. A 1991, 156, 48–52. [Google Scholar] [CrossRef]
D’Alessandro, G.; Grassberger, P.; Isola, S.; Politi, A. On the topology of the Hénon map. J. Phys. A 1990, 23, 5285–5294. [Google Scholar]
Froyland, G.; Junge, O.; Ochs, G. Rigorous computation of topological entropy with respect to a finite partition. Phys. D 2001, 154, 68–84. [Google Scholar] [CrossRef] [Green Version]
Newhouse, S.; Pignataro, T. On the estimation of topological entropy. J. Stat. Phys. 1993, 72, 1331–1351. [Google Scholar] [CrossRef] [Green Version]
Matveev, A.; Pogromsky, A. Observation of nonlinear systems via finite capacity channels: Constructive data rate limits. Autom. J. IFAC 2016, 70, 217–229. [Google Scholar] [CrossRef]
Matveev, A.; Pogromsky, A. Observation of nonlinear systems via finite capacity channels. Part II: Restoration entropy and its estimates. Automatica 2016, 70, 217–229. [Google Scholar] [CrossRef]
Matveev, A.S.; Savkin, A.V. Estimation and Control over Communication Networks; Birkhäuser Boston: Boston, MA, USA, 2009. [Google Scholar]
Liberzon, D.; Mitra, S. Entropy and minimal bit rates for state estimation and model detection. IEEE Trans. Autom. Control 2018, 63, 3330–3344. [Google Scholar] [CrossRef]
Matveev, A.S. State estimation via limited capacity noisy communication channels. Math. Control Signals Syst. 2008, 20, 1–35. [Google Scholar] [CrossRef]
Kawan, C.; Yüksel, S. On optimal coding of non-linear dynamical systems. IEEE Trans. Inf. Theory 2018, 64, 6816–6829. [Google Scholar] [CrossRef]
Bowen, R. Entropy for group endomorphisms and homogeneous spaces. Trans. Am. Math. Soc. 1971, 153, 401–414. [Google Scholar] [CrossRef]
Dinaburg, E.I. A connection between various entropy characterizations of dynamical systems. Izv. Akad. Nauk SSSR Ser. Mat. 1971, 35, 324–366. [Google Scholar]
Adler, R.L.; Konheim, A.G.; McAndrew, M.H. Topological entropy. Trans. Am. Math. Soc. 1965, 114, 309–319. [Google Scholar] [CrossRef]
Kolmogorov, A.N. A new metric invariant of transient dynamical systems and automorphisms in Lebesgue spaces. Dokl. Akad. Nauk SSSR 1958, 119, 861–864. [Google Scholar]
Sinai, J. On the concept of entropy for a dynamic system. Dokl. Akad. Nauk SSSR 1959, 124, 768–771. [Google Scholar]
Katok, A.; Hasselblatt, B. Introduction to the Modern Theory of Dynamical Systems; Encyclopedia of Mathematics and its Applications Series 54; Cambridge University Press: Cambridge, UK, 1995. [Google Scholar]
Walters, P. A variational principle for the pressure of continuous transformations. Am. J. Math. 1975, 97, 937–971. [Google Scholar] [CrossRef]
Morris, I.D. Mather sets for sequences of matrices and applications to the study of joint spectral radii. Proc. Lond. Math. Soc. 2013, 107, 121–150. [Google Scholar] [CrossRef] [Green Version]
Arnold, L. Random Dynamical Systems; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 1998. [Google Scholar]
Colonius, F.; Kliemann, W. Dynamical Systems and Linear Algebra; American Mathematical Society: Providence, RI, USA, 2014; Volume 158. [Google Scholar]
Ledrappier, F.; Young, L.-S. The metric entropy of diffeomorphisms. I. Characterization of measures satisfying Pesin’s entropy formula. Ann. Math. 1985, 122, 509–539. [Google Scholar] [CrossRef]
Ruelle, D. An inequality for the entropy of differentiable maps. Bol. Soc. Brasil. Mat. 1978, 9, 83–87. [Google Scholar] [CrossRef] [Green Version]
Boichenko, V.A.; Leonov, G.A.; Reitmann, V. Dimension Theory for Ordinary Differential Equations; Teubner: Stuttgart, Germany, 2005. [Google Scholar]

© 2018 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kawan, C. On the Relation between Topological Entropy and Restoration Entropy. Entropy 2019, 21, 7. https://doi.org/10.3390/e21010007

AMA Style

Kawan C. On the Relation between Topological Entropy and Restoration Entropy. Entropy. 2019; 21(1):7. https://doi.org/10.3390/e21010007

Chicago/Turabian Style

Kawan, Christoph. 2019. "On the Relation between Topological Entropy and Restoration Entropy" Entropy 21, no. 1: 7. https://doi.org/10.3390/e21010007

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

On the Relation between Topological Entropy and Restoration Entropy

Abstract

1. Introduction

2. Tools from Dynamical Systems

2.1. Entropy and Pressure

2.2. Subadditive Cocycles

2.3. Lyapunov Exponents, SRB Measures, and Pesin’s Formula

2.4. Anosov Diffeomorphisms

3. State Estimation and Restoration Entropy

4. Results

5. Conclusions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI