1. Introduction
In topological dynamics, the topological entropy of a dynamical system measures the information in orbits, by counting the exponential growth rate of different partial orbits with a certain of accuracy. One example of a dynamical system is a subshift, a dynamical system whose points are infinite words over a finite alphabet A, and the dynamics is the left shift. One-way entropy can arise in a subshift when, somewhere in partial orbits, we literally see all distinct words up to n. One may ask to what extent entropy always arises from such “free choices”.
Two formal notions that most directly capture this idea are the
independence entropy of a subshift and
independence sets. The former measures how much entropy comes from “set-theoretic hyperrectangles”, namely sets of the form
which are contained in the language of the subshift as
, where
are any subsets of the alphabet and we consider consecutive letters only, see [
1], (p. 303) for the precise definition. Positive entropy does not imply positive independence entropy (see Proposition 1).
Independence sets, on the other hand, are defined similarly in [
2,
3], but we allow gaps between the positions where the prescribed letters appear, and there is no constraint on the words seen in the gaps between the positions. More precisely, in the case of a subshift
X with a binary alphabet
, an
independence set is a set
such that
. It turns out that the existence of an independence set of positive lower asymptotic density does capture positive entropy, see [
2,
3] for a proof in this binary subshift case, and [
4,
5,
6] for analogous results in more general settings.
The following theorem gives a new proof of this result (see Corollary 1), but it is stronger, as there is an additional constraint on how the independent choices must be realized. It also applies in the case of a general alphabet.
Theorem 1. A subshift has positive entropy if and only if it contains a steadily branching binary tree.
We can interpret
as the boundary of an
-ary tree and
X a closed subset of this boundary. A closed subset of the boundary of a tree is determined by finite subpaths in the tree, and it is in this sense that
X contains the binary tree. Steadily branching means that the embedded binary tree branches at a uniform sequence of times which form a set of positive lower asymptotic density. A formal definition is given in
Section 3.
Positive entropy refers to topological entropy, and in the context of subshifts, means the exponential growth of the number of words. Countable subshifts can have a growth rate arbitrarily close to exponential (see Proposition 2), but clearly contain trees with only finitely many branchings. Thus, if we weaken the assumption of positive entropy, the existence of a steadily branching binary tree can fail in a very strong sense.
It is also easy to find examples with positive entropy where it is not possible to find a tree where branchings happen a syndetic set of times (see Proposition 3). There are also examples where no individual tree which branches at a uniform sequence of times “captures” all of the entropy, as can be seen from [
1], (Example 2). These remarks show that we cannot expect to essentially improve the conclusion about the tree under the assumption of positive entropy.
The proof of Theorem 1 is straightforward using the standard results about the winning shift, which is the set of winning turn orders in a certain word-building game associated to X. The winning shift is known to have the same entropy as X. This implies it has a point with positive density, and this point is the branching structure of a steadily branching tree in X. This note is simply a self-contained elaboration of this deduction. I give a combinatorial proof directly from the definitions, and also include an ergodic theoretic proof.
It is easy to deduce a two-sided version of the result as well. For and , write for the configuration with x and y back to back.
Theorem 2. A subshift has positive entropy if and only if for some , the set contains a steadily branching binary tree.
The winning shift was introduced in [
7], and has also been studied in [
8]. We found out while working on [
8] that a concept equivalent to the winning shift was discovered in the set systems setting already in 2002 [
9] (even if ostensibly only for binary words), and I found out while working on the present paper that [
9] has been applied by dynamicists [
2,
3] (this paper is independent from [
7], but almost contemporary), and their proof that this gives large independence sets essentially boils down to the same proof as I give here. Nevertheless, I feel that though Theorem 1 is an interesting statement about subshifts, to our knowledge the statement does not appear in the literature, and its proof through winning shifts is worth making explicit. Furthermore, unlike Theorem 1, the statement about independence sets does not trivially generalize to non-binary alphabets.
As discussed, I obtain a new proof of the fact entropy implies the existence of certain types of independence sets in the case of a binary alphabet.
Corollary 1. Let be a subshift. Then, X has positive entropy if and only if it has an independence set of positive lower asymptotic density.
Again, I mention that this result is proved in [
2,
3], and that generalizations appear in [
4,
5,
6]. To our knowledge, the first reference where the Sauer-Shelah lemma appears in a dynamics context is [
4]. The most naive generalization of Corollary 1, the existence of a high density set
N with
, fails for a non-binary alphabet
A (even for trivial reasons, by artificially increasing the alphabet of the subshift). However, one can state an equivalent condition for having higher entropy in this way [
10], (Theorem 8.3), [
3], (Theorem 2). (The non-trivial direction of this equivalence also appears in [
7], (Proposition 5.9)).
As I show in this note, winning shifts say something stronger than Corollary 1 in the case of one-dimensional subshifts, but it is not clear how to generalize them to other settings, at least in a way that allows seeing them naturally as a dynamical system.
Question 1. Does every expansive system of positive entropy contain a steadily branching binary tree (and what does that even mean)? More generally, can the notion of winning shift be extended to general (expansive) systems? Group/monoid actions?
The webpage
www.theproofistrivial.com (accessed on 28 November 2016) is an online resource that suggests high-level proof strategies in a variety of mathematical domains. The present note was inspired by the following quote from this webpage: “The proof is trivial! Just view it as a rational metric space whose elements are countable combinatorial games.” The page was accessed during the Combinatorics, Automata and Number Theory conference that took place between 28 November 2016 and 2 December 2016.
2. Definitions
Let be the natural numbers. An alphabet is a finite set A. Words are elements of the free monoid generated by A, denoted by . Elements of A are seen as belonging to when convenient, denotes the multiplication in the free monoid (formal concatenation). The length of a word is defined in the obvious way. Write .
The set , for a finite alphabet A (or later ), always carries the product topology, and it is homeomorphic to Cantor space if . A subshift is a closed subset of for a finite alphabet A (or later ) satisfying where is the shift. The words of a subshift are the words such that for some . The words of a subshift form its language. Writing for the intersection of the language of X with , the (topological) entropy of a subshift X is the exponential growth rate of the number of words, (which exists by the subadditivity of ). The lower asymptotic density of is .
Two subshifts are topologically conjugate if there is a shift-commuting homeomorphism . A subshift is countable if it countably has many points. A set is syndetic if where + denotes the Minkowski sum.
3. The Winning Shift
For , denote .
Definition 1. Let A be a finite alphabet and let . Atree in
X with branching structure
is a set of sequences where each is an element of X, and: A steadily branching binary tree is a tree with branching structure satisfying .
In other words, for distinct sequences , the first position where and differ is the same as the first position where z and differ. For binary b, this means that the nonzero positions in b are the positions where our tree must branch in two, and more generally, means the tree must branch n times in position i, explaining why we call this sequence a branching structure. Note also that if and X contain a tree with branching structure , then necessarily for all .
Definition 2. Let be a subshift. Let , the winning shift of X, be the set of all branching structures of trees in X.
This is defined in a game-theoretic framework in [
7]: a tree in
X with branching structure
z can be interpreted as a winning strategy for the first player in a word-building game where on the
ith turn, the first player picks a set of
symbols, then the second picks one of them, and the first player wins if the word obtained in the limit is in
X.
It is shown in [
7] that the winning shift is indeed a subshift. A subshift
is
hereditary [
6] if for all
we have:
The following fact is Proposition 5.7 of [
7] (the hereditarity claim is trivial). The notation differs slightly, in the reference
is used for what we call
, and their
is the same as ours in the binary case, but differs in general.
Lemma 1. The subshift is hereditary and has the same number of words of each length as X.
Proof sketch. The hereditarity claim is trivial. For the claim about words, one can define
for
analogously to the subshift case (using finite trees). Letting
be the language of
X intersected with
,
is the language of
intersected with
. Now, we write
and exchange a suitable sum:
The first equality is true by definition, and the penultimate one by induction. □
For a finite word , write for the sum of the symbols in w, and for the length of w as a word. The key to finding steadily branching trees is to study the density of a word w. If is a subshift, by we mean the Cartesian power with the diagonal action, interpreted in an obvious way as an alphabet over the alphabet .
We observe a simple combinatorial lemma.
Lemma 2. If a subshift Y has positive entropy and is hereditary, then for some there exist arbitrarily long words w of with .
Proof. Let
be the alphabet of
Y. Let:
Since Y is hereditary, it is easy to see that the number of words in Y of length n is at most the number of words in of length n. Thus, if Y has positive entropy, so does , and thus so does . Suppose thus that the number of words in of every length n is at least for some , as is clearly implied by positive entropy.
The number of words of length
n with at most
k many 1s is at most:
Setting , this becomes and we observe that as , we have and , so for small enough , we have , and thus there must be words of length n in with at least many 1s, for arbitrarily large n. These are words of since Y is hereditary. □
This argument appears also in [
11] (Theorem 4.8) (this theorem already includes the statement about lower asymptotic density, which we deduce in the next section).
4. The Proofs
Proof of Theorem 1. Obviously, a steadily branching tree implies positive entropy.
For the other direction, let be a finite alphabet and a subshift. Write for the maximal sum of a word w of length n in Y. This sequence is clearly subadditive, so exists, say .
We outline the usual addendum that there must be a configuration
such that every prefix
w of
y satisfies
. Supposing that this is not the case, then every point
X has a prefix
w such that
, and this must happen after a bounded number of steps by compactness, thus there exists
such that for some
m, we always find a prefix
w of length at most
m in any point
, such that
. Now, given any long word
w of
Y, we can split it as
(where
i in
denotes a superscript, not a power) with
and
for all
. If
w is long enough, then since
for all words
, we have:
for all long enough words, a contradiction to
.
Now, we set and apply Lemma 2. The lemma implies that in the above. The point whose density stays above gives the branching structure of a steadily branching tree in X. □
The “usual addendum” is well known. The author learned it from [
12,
13] and P. Guillon; Jeandel attributes it to [
14,
15], and it also appears in [
3,
11] where it is attributed to [
16,
17].
The point of this note was to provide a combinatorial proof of Theorem 1 from the first principles, as this is easy to do. However, I have included also the proof using ergodic theory (see [
18] for a basic reference).
Ergodic theoretic proof of Theorem 1. Obviously a steadily branching tree implies positive entropy. For the other direction, since the winning shift has positive entropy, it admits an invariant measure with positive entropy, thus for some . By the ergodic decomposition there exists such an ergodic measure, and by the pointwise ergodic theorem, there is a point where the lower asymptotic density of as is positive, giving the result. □
We conclude with the proofs of Theorem 2 and Corollary 1.
Proof of Theorem 2. Let be the subshift of the right tails of points in X and apply Theorem 1. Let be the branching structure of some steadily branching tree and the lower asymptotic density. It is easy to see that for any n, we can find a finite prefix w of of a length of at least n such that for all prefixes u of x of length at most n, . Otherwise, by cutting long words into ones of length at most n, as in the previous argument, we see that the lower asymptotic density is less than .
The fact we can find such as a branching structure implies that the tree corresponding to x can follow at least one word of length . Letting n tend to infinity, by compactness, we obtain a steadily branching tree that can follow some left tail. □
Proof of Corollary 1. Supposing
X has positive entropy, let
have positive lower asymptotic density. The set
is an independence set (which by assumption has positive lower asymptotic density): Suppose
form a tree with branching structure
y, i.e., we have:
Supposing that
enumerates
N in order, let
, and for
let
be the characteristic sequence of
. Clearly
for all
. The map:
defined by
is injective, because
. Thus, this map is surjective, giving
. By compactness, we have
. □