Abstract
We study Markov operators T, A, and of general Markov chains on an arbitrary measurable space. The operator, T, is defined on the Banach space of all bounded measurable functions. The operator A is defined on the Banach space of all bounded countably additive measures. We construct an operator , topologically conjugate to the operator T, acting in the space of all bounded finitely additive measures. We prove the main result of the paper that, in general, a Markov operator is quasi-compact if and only if T is quasi-compact. It is proved that the conjugate operator is quasi-compact if and only if the Doeblin condition is satisfied. It is shown that the quasi-compactness conditions for all three Markov operators T, A, and are equivalent to each other. In addition, we obtain that, for an operator to be quasi-compact, it is necessary and sufficient that it does not have invariant purely finitely additive measures. A strong uniform reversible ergodic theorem is proved for the quasi-compact Markov operator in the space of finitely additive measures. We give all the proofs for the most general case. A detailed analysis of Lin’s counterexample is provided.
Keywords:
general Markov chains; Markov operators; quasi-compactness conditions; finitely additive measures; invariant measures; ergodic theorem; probability theory MSC:
60J05; 37A30; 28A33; 46E27
1. Introduction
In this paper, we investigate general Markov chains (MC) on arbitrary measurable spaces. The main methodological feature of our work is that we actively use finitely additive measures. We present a continuation of our research in [1,2,3]. We apply the operator-theoretical treatment that was introduced by Kryloff and Bogoliouboff in [4]. In this article, the authors introduced a new notion of quasi-compactness and gave a condition for the quasi-compactness of Markov operators on spaces of bounded measurable functions. Under this condition, the Markov operators have good ergodic properties, as was shown in [4]. Yosida and Kakutani in [5] applied this quasi-compactness condition to Markov operators defined on spaces of bounded countably additive measures. This allowed the authors in [5] to prove several new ergodic theorems for general Markov operators with certain additional conditions. Later, the operator approach to the study of general and special MCs developed rapidly. In particular, various authors studied the question of the relationship between the conditions of quasi-compactness of MC and the Doeblin condition of ergodicity of MC [6]. The method developed in [6] was called the direct point-set-theoretical (or probabilistic) method in [5]. The operator approach was also used to study MCs, with operators defined on spaces of integrable functions (see, for example, Horowitz [7]). This methodology was also successfully used to study the ergodic properties of topological MCs (see, for example, Foguel [8,9,10]). We actively use the operator results for MCs presented in several papers by Lin [11,12,13]. The operator approach to studying Markov chains is well and thoroughly presented in the monographs Neveu [14] and Revuz [15]. Hernandez-Lerma and Lasserre [16] also studied Markov chains using both probabilistic and operator approaches.
In the recently published paper, Mebarki, Messirdi, and Benharrat [17] construct a general theory of quasi-compact operators in arbitrary and special Banach spaces. Corollaries for Markov chains are given. Probabilistic and operator approaches are also used to describe Markov chains in [17].
In those cases where our constructions and theorems in this article have points of contact with the results of other works listed above (and many others), we provide the corresponding references, remarks, and comments in the main text of the article. In many works using the classical operator approach to studying general Markov operators, two Markov operators are considered. These are the operator T in the space of bounded measurable functions and the operator A in the space of bounded countably additive measures. These operators are dual in a sense, but they are not topologically conjugate. In this paper (as well as in [1,2,3]), we construct an extension of the operator A to the space of bounded finitely additive measures, and we obtain the corresponding operator conjugate to the operator T. In this way, we obtain a functionally closed construction of spaces and operators. We emphasize that in this construction, the transition function of the Markov chain remains countably additive in the second argument.
The main results of the paper are presented in Section 4 and Section 5. In Section 4, in Theorem 4, we prove, in the general case, that a Markov operator T in the space of measurable functions is quasi-compact if and only if its conjugate operator in the space of finitely additive measures is quasi-compact. Furthermore, in Theorem 5, it is proved that if the operator T in an arbitrary Banach space is quasi-compact, then its restriction to an invariant Banach subspace is also quasi-compact. In Corollary 1, it is obtained that if the conjugate Markov operator on the space of finitely additive measures is quasi-compact, then the Markov operator A on the space of countably additive measures is also quasi-compact. In Theorem 6, we prove, in the general case, that the conjugate Markov operator in the space of finitely additive measures is quasi-compact if and only if the Doeblin condition (D) is satisfied. Proposition 1 presents a summary logical diagram on the pairwise equivalence of the quasi-compactness conditions for all three Markov operators T, A, and and on their equivalence to the Doeblin condition (D).
In Theorems 8 and 9, it is proved that for an arbitrary MC, its Markov operator is quasi-compact if and only if the condition is satisfied: all its invariant finitely additive measures are countably additive, i.e., the operator does not have invariant purely finitely additive measures. In Theorem 10, it is proved that if the operator T is quasi-compact, then the set of all its invariant finitely additive measures is finite-dimensional (and all such invariant measures are countably additive). Theorem 11 gives a conversion of Theorem 10 to the case when the operator T has a unique invariant finitely additive measure.
In Section 5, in Theorem 12, we prove, in the general case, a strong uniform reversible ergodic theorem for the quasi-compact Markov operator in the space of finitely additive measures. In Section 6, we study in detail Lin’s counterexample [11] of a Markov chain on a countable phase space with Markov operators P and defined on the spaces and , respectively. In [11], the author showed that in this construction, the operator P is quasi-compact, but its conjugate operator is not quasi-compact. In our construction, the Markov operators T and for the same Markov chain are defined on other spaces: on the spaces and , respectively. In Section 6, we prove by direct calculation that in this scheme, the conjugate operator for a given MC is also quasi-compact. This does not refute but confirms our Theorem 4.
2. Definitions, Notations and Some Information
Here are some of the basic definitions and concepts we use and their symbolism, focusing on [18,19]. We also use the notation system from our article [1]. Everywhere below, is the set of real numbers (number line), and N is the set of natural numbers. Let X be an arbitrary infinite set, and be some sigma-algebra of its subsets. We denote by the Banach space of bounded -measurable functions with the -norm .
Definition 1
(see [19]). A finitely additive non-negative measure is called purely finitely additive (pure charge, pure mean) if any countably additive measure λ satisfying the condition , is identically zero. The alternating measure μ is called purely finitely additive if in its Jordan decomposition both non-negative measures and are purely finitely additive.
If the measure is purely finitely additive, then it is equal to zero on every one-point set: (see [20] [Lemma 1]). The converse, generally speaking, is not true (for example, for the Lebesgue measure on the segment ).
Theorem 1
(Yosida-Hewitt decomposition, see [19]). Any finitely additive measure μ can be uniquely decomposed into the sum , where is countably additive, and is a purely finitely additive measure.
In this article, we consider Banach spaces of bounded measures , with a norm equal to the total variation of the measure (but an equivalent sup-norm can also be used): is the space of finitely additive measures, is the space of countably additive measures, is the space of purely finitely additive measures. If , then the norm is . Purely finitely additive measures also form a Banach space with the same norm and . We denote the sets of measures: , , , , , . All measures from these sets will be called probabilistic. We also denote , . A detailed exposition of the foundations of the general theory of finitely additive measures is contained in the monograph K.P.S.B. Rao, and M.B. Rao [21], in which such measures are called charges.
3. Markov Operators
Markov chains (MCs) on a (phase) measurable space are given by their transition function (probability) , under ordinary conditions:
- (1)
- (2)
- (3)
We emphasize that the transition function is a countably additive measure with respect to the second argument, i.e., we consider classical Markov chains. The transition function generates two Markov linear bounded positive integral operators:
Let be the initial measure. Then the iterative sequence of countably additive probability measures , is usually identified with the Markov chain. We will call a Markov sequence of measures. Topologically conjugate to the space is the (isomorphic) space of finitely additive measures: (see, for example, [18]). In this case, the operator serves as topologically conjugate to the operator T, which is uniquely determined by the well-known rule:
The operator is the only bounded continuation of the operator A to the entire space while preserving its analytic form. The operator has its own invariant subspace , i.e., , on which it matches the original A operator. The construction of the Markov operators T and is now functionally closed. We will use the notation , , to restrict the operator to the subspace . The norms of Markov operators on the corresponding spaces are , , . In such a setting, it is natural to admit to consideration the Markov sequences of probabilistic finitely additive measures: keeping the countable additivity of the transition function with respect to the second argument. Such a Markov chain can have cycles consisting of finitely additive measures. The properties of such cycles Markov chain are considered in detail in our paper [22]. You can change the problem statement and allow the transition function itself to be just a finitely additive measure with respect to the second argument. Such Markov chains are also studied (see [20,23]) and are called “finitely additive Markov chains”. In this article, we do not consider such MC. Thus, in our case the following terminology is appropriate: we study countably additive Markov chains with operators defined on the space of finitely additive measures. Let us denote the sets of invariant probability measures of the Markov chain:
Let be the linear subspace of invariant measures of the Markov chain in the space . Obviously, is generated by the set : . We will also use the notation and with similar meaning. The linear dimension of the set will mean the algebraic dimension of the linear space generated by it and denote it by . Similarly, we talk about the dimension of the sets and . The classical countably additive Markov chain may or may not have invariant probability countably additive measures, i.e., possibly (for example, for a symmetric walk on Z). Šidak was one of the first to extend the Markov operator A to the space of finitely additive measures in the framework of the operator approach and proved the following two important theorems in [24] (1962).
Theorem 2
(Šidak [24] [Theorem 2.2]). Any countably additive Markov chain on an arbitrary measurable space has at least one invariant finitely additive probability measure, i.e., always .
This result was then briefly proved in the author’s paper [3] as a simple consequence of the Krein–Rutman theorem ([25] [Theorem 3.1]).
Theorem 3
(Šidak [24] [Theorem 2.5]). If a finitely additive measure μ for an arbitrary Markov chain is invariant , and is its decomposition into countably additive and purely finitely additive components, then each of them is also invariant: , .
4. Quasi-Compactness Conditions and Finitely Additive Measures
Definition 2
(see, for example, [18] [Definition VI.5.1]). A linear operator F, which transfers the Banach space Y to itself, is called compact (completely continuous) if it transfers each bounded set of space Y into a pre-compact (relatively compact) set, i.e., such that its closure is compact in Y.
It is well known (see, for example, [18] [Theorem VI.5.2, Schauder’s Theorem]) that the operator is compact if and only if its conjugate operator is compact.
Definition 3.
A linear operator , where Y is an arbitrary Banach space, is called quasi-compact (quasi-completely continuous) if the following condition is satisfied .
The transition function of a general Markov chain generates three different Markov operators:
, and . The quasi-compactness condition can be applied to each of them with the appropriate replacement of the symbol F by the symbols T, A, and . Denote such conditions , and , respectively. Next, we use one construction from [18]), which we present here in our symbolism and edition. Let be an arbitrary Banach space, be the space of all linear continuous bounded operators .
We denote:
is a Banach space topologically conjugate to , is the operator , topologically conjugate to the operator F and is the space of all conjugate operators on (see [18] [Definition VI.2.1]). We introduce the mapping , , (note that in [18] no symbol is introduced for such a mapping). In [18] [Lemma VI.2.2], it is proved that the mapping is an isometric isomorphism of the space into the space . According to [18] [Definition II.3.17], the isometric isomorphism is assumed to be a linear mapping.
The mapping , generally speaking, does not map the space onto the entire space , which is implied in the above Lemma VI.2.2 ([18]). Let be the subspace of of all operators for which there exists a pre-conjugate operator such that . Then, obviously, and on there exists an inverse isometric and linear isomorphism . Moreover, . We will also call the set the scope of definition of the isomorphism . Therefore, we prove the following general statement for arbitrary Markov operators.
Theorem 4.
Let be an arbitrary measurable space. An arbitrary Markov operator is quasi-compact if and only if its conjugate Markov operator is quasi-compact, i.e., the conditions and are equivalent: .
Proof.
First, we show that . Let the operator T be quasi-compact. Then there are and a compact operator such that . Let us rewrite this inequality in the following form: , where . We pass to conjugate operators and obtain . By Schauder’s Theorem ([18]), the operator conjugate to a compact operator is also compact.
According to [18] (Lemma VI.2.2) and our analysis presented above the mapping that takes the original operators (here Y is an arbitrary Banach space) to the conjugate operators is a linear isometric isomorphism, i.e., preserves the norms of the operators. Therefore, in our case, from the inequality it follows that .
Next, we will show that the operator can have its superscripts swapped. To do this, we will use Lemma from [18] [Lemma VI.2.4]. It proves that for two linear operators and acting on the corresponding Banach spaces, the equality holds. In our case, we take the operators and and get the equality . From here, by induction, we obtain the formula for arbitrary : . Let us substitute the right side of this equality into the general formula and obtain: . Thus, we have obtained that the operator conjugate to T is also quasi-compact. This means that .
Let us prove the converse statement. Let for some Markov operator T its conjugate operator be quasi-compact. This means that there are and a compact operator such that . We write this inequality in the following form: , where . The pre-conjugate operator to the operator is, by assumption, the operator T, i.e., . Since , then the pre-conjugate to the operator is the operator , i.e., .
Now we use the inverse isometric isomorphism . Since the isomorphism is a linear mapping and the operator is included in the scope of definition of mapping , then we can do the following transformations:
Thus, the operators and U also fall into the scope of definition of the inverse isomorphism . We denote the pre-conjugate operators and , where and .
We have obtained the decomposition .
Since, by assumption, the operator is compact, then, by the already mentioned Schauder’s Theorem ([18]), its pre-conjugate operator is also compact. The mapping takes the original operator T to the conjugate operator and is an isometric isomorphism. Therefore, if , then for the pre-conjugate operator , , holds. Thus, we have obtained that the operator T pre-conjugate to the quasi-compact operator is also quasi-compact, i.e., . This completes the proof. □
Remark 1.
In our work [2], at the beginning of §12, it is noted that the quasi-compactness condition for the operator is equivalent to the quasi-compactness condition for the operator T. However, proof of this fact is not given there.
Remark 2.
Herkenrach ([26] [Lemma 3.2]) showed (under certain conditions) that if some linear operator special type on a Banach space Y is quasi-compact, then the conjugate to Therefore, the operator is also quasi-compact. However, the proof of this particular statement is not given in the work [26]. There is no converse statement there, either.
Remark 3.
Lin [11] considered Markov chains with operators P and defined on the left and right sides of and , respectively. In Remark [11] [Section 2], it is argued that if P is quasi-compact, then its conjugate may not be quasi-compact. A corresponding counterexample is given there to support this. In this paper, in Section 6, we study this counterexample in detail. We show that if a given concrete Markov chain is described using the construction of our Markov operators T, A, and , then the Markov operator is quasi-compact. Thus, this example of Lin does not refute our proved Theorem 4 but only confirms it.
Let us prove one more general statement for any operators, not just Markov ones.
Theorem 5.
Let a linear bounded operator in a Banach space Y have a non-degenerate invariant Banach subspace , . Then, if the operator F in the space Y is quasi-compact, its restriction to the subspace M is also quasi-compact.
Proof.
Let the operator be quasi-compact. Then, by Definition 3, for some and a compact operator , holds, where . We restrict these operators to the invariant subspace M and obtain . From this equality, it follows that for any set , we have
In addition, for any the following holds: . It is obvious that an arbitrary bounded set will remain bounded even if it is embedded in Y. A norm in the subspace M is, by general definition, induced from the space Y (i.e., coinciding with the norm from Y on the subspace M). Consequently, the closures of the sets and , taken in their coinciding norms, coincide, i.e.,
Note that, by the assumptions of the theorem, the subspace M is Banach and therefore closed in Y. By construction, the operator is compact. Therefore, by Definition 2, for a bounded set , the set is compact in the norm of the space Y. This means that the set is compact in the norm of the subspace M induced from Y. Thus, the operator is compact in the subspace M. Let us now turn to the operator V in the expansion of the operator and to its restriction to the space M. The norms of operators in normed spaces are calculated using the formulas:
Since the subspace M is part of the space Y, then . By construction (and by condition), . Consequently, . Thus, the restriction of the operator to the subspace M is quasi-compact. The proof is complete. □
Corollary 1.
Consider a general Markov chain on an arbitrary measurable space . If the Markov operator , conjugate to the operator is quasi-compact, then the Markov operator (here ) is also quasi-compact.
Now we write Doeblin condition of ergodicity of the MC (see, for example, [27]):
Remark 4.
The superscript k in denotes the order of the integral convolution (iteration) of the transition function, not its degree.
Theorem 6.
Let be an arbitrary measurable space on which a general Markov chain is defined. Its Markov operator is quasi-compact, if and only if the Doeblin condition is satisfied, i.e., the conditions and are equivalent: .
Proof.
Neveu in the book [14] [V.3.2] showed a scheme of proof that the condition of quasi-compactness of the operator is equivalent to the Doeblin condition . In ([26] [Theorem 2.1]), Herkenrath gives his full proof (with reference to Yosida and Kakutani [5] and Neveu [14]) that the Doeblin condition is equivalent to the quasi-compactness condition for the operator , i.e., . In our Theorem 4 above, it is proved that the quasi-compactness condition is equivalent to the quasi-compactness condition . Therefore, . The theorem is proved. □
The following statement is obtained from the Herkenrath Theorem 2.2 [26] and Theorem 2.1 [26].
Theorem 7
([26]). The operator is quasi-compact if and only if the operator is quasi-compact, i.e., .
Now we can formulate a summary statement obtained through the efforts of many authors (including ours).
Proposition 1.
For a general Markov chain on an arbitrary measurable space , the conditions for the quasi-compactness of its three operators are equivalent to the Doeblin condition :
We emphasize that there are no restrictions or additional conditions for Markov chains for the resulting diagram. If at least one of the three Markov operators T, A, or of MC is quasi-compact, then we will call the Markov chain itself quasi-compact. We will now use some of the results obtained in our paper [1].
In our articles [1,2] the following condition was introduced:
it means that all invariant finitely additive measures of the operator (and such measures exist for any Markov chain) are countably additive.
Theorem 8.
For an arbitrary MC, its Markov operator is quasi-compact if and only if the condition is satisfied.
Proof.
According to [1] [Theorem 7] the condition is equivalent to the Doeblin condition (D). In Theorem 6 of this paper, it is proved that the Doeblin condition (D) is equivalent to the quasi-compactness condition of the operator . Therefore, . The theorem is proved. □
We introduce one more condition:
The fulfillment of the condition means that the Markov chain does not have invariant purely finitely additive measures. Obviously, these conditions and () coincide.
Let us repeat the previous Theorem 8 in a new formulation.
Theorem 9.
For an arbitrary MC, its Markov operator is quasi-compact if and only if it does not have invariant purely finitely additive measures.
Corollary 2.
Conditions and () are equivalent to the condition and the condition .
Now, we can present a new version of theorem [1] [Theorem 9]—Theorem 10.
Theorem 10.
Let a general MC be given on an arbitrary . Then, if the Markov operator is quasi-compact, then the set of its finitely additive invariant measures is finite-dimensional, i.e., , where .
Proof.
In our article [3] in Theorems 8.1 and 8.2, it is proved that if the condition is satisfied, then , where . According to the conditions of the theorem, the operator is quasi-compact. Therefore, according to Theorem 8, the condition is satisfied. The theorem is proved. □
Now we will prove the reversal of Theorem 10 in the one-dimensional case for (see, also, [1] [Theorem 10]).
Theorem 11.
Let on an arbitrary a MC be given for which the dimension , i.e., its Markov operator has in a unique invariant finitely additive measure , . Then this invariant measure μ is countably additive, the Markov operator is quasi-compact and the conditions , and , as well as , and are satisfied for the considered MC.
Proof.
In our theorem [3] [Theorem 8.3] we proved that if for some general MC, we have and , , then this invariant measure is countably additive, i.e., . This means that for this MC the condition is satisfied. Then, by Theorem 8, the Markov operator of the considered MC is quasi-compact, i.e., the condition is satisfied for it. From our Theorems 4 and 6–9 (see above), it follows that the remaining conditions listed in this theorem are also satisfied for this MC. The theorem is proved. □
Please note that the proof of the theorem used above [3] [Theorem 8.3] is not simple; it is carried out using the technique of weak topologies in the space .
Lemma 1.
Let an arbitrary Markov chain be given on a measurable space . If for any finitely additive measure that is invariant for a Markov operator there exists a point such that , then the operator is quasi-compact and the condition is satisfied.
Proof.
Let the conditions of the lemma be satisfied. Suppose that there exists a purely finitely additive measure that is invariant for the Markov operator . Then by Lemma 1 from [20] on any one-point set . This contradicts the condition of the Lemma about the existence of a point such that . Therefore, this Markov chain does not have invariant purely finitely additive measures, i.e., . It follows from our Theorems 8 and 9 that such a Markov chain satisfies the condition and it is quasi-compact. The Lemma is proven. □
Corollary 3.
If the condition of the Lemma 1 is satisfied, then all invariant finitely additive measures of the operator are countably additive, i.e., and .
Remark 5.
Lemma 1, generally speaking, cannot be reversed. If the operator is quasi-compact and if is its invariant countably additive measure, then it is possible (for an infinite X) that for all .
5. Ergodic Theorem
In the book by Dunford and Schwartz [18], in Chapter VIII, paragraphs 4–6 and 8, the history of ergodic theory and ergodic theorems is described in detail. Specific problems in mechanics, physics and chemistry are given that can be solved using ergodic theorems of various types. There, in Chapter VIII, Markov processes and Markov chains are singled out in several places, and their classical ergodic properties and theorems are briefly analyzed. We add that today, ergodic theorems for Markov chains are widely used in economics, statistics and in the planning of experiments. In [18] and in many other sources, the following non-rigorous interpretation of the ergodicity of a particular system is given: the average over space is equal to the average over time. Please note that from a mathematical point of view, ergodic theorems are more convenient (and useful) to formulate and prove in the form of limit theorems, using current averages (Cesaro averages) over time. This is what we do below in our Theorem 12.
The literature that studies various aspects of ergodic theory and its applications is very extensive. However, to solve the problems in this section, we only need to use the monograph [18] and several articles cited below.
Let us consider some ergodic properties of Markov operators following from the statements obtained above for T, A, and (see, also, [17]). Lin [13] [Theorem 1] proved a general strong uniform ergodic theorem for arbitrary linear positive quasi-compact operators on a Banach lattice. Theorem [13] [Theorem 1] can also be applied to Markov operators with different domains of their definition, which is noted by Lin [13]. An important feature of this theorem is that it also contains a conversion (under certain conditions) of classical uniform ergodic theorems: from the ergodic asymptotics of the operator under consideration, it follows that it is quasi-compact.
In 1937, for the Markov operator , the quasi-compactness conditions were introduced in the first paper on this topic by Kryloff and Bogoliouboff [4]. The first uniform ergodic theorem was also proved for this operator there. For this reason, further study of quasi-compact Markov operators was most often carried out in function spaces. After the paper by Yosida and Kakutani ([5], 1941), quite a lot of papers appeared in which quasi-compactness conditions were considered, and the corresponding ergodic theorems were proved for Markov operators defined on spaces of countably additive measures (see, for example, [18] [VIII.8]). The Markov operators , defined on the space of finitely additive measures, are still mentioned only in a few works. Taking this into account, in the next theorem, we will consider the results of applying the theorem Lin [13] [Theorem 1] only for the Markov operators and A and compare their ergodic properties.
Theorem 12.
Let the general MC be given on an arbitrary measurable space . Then the following three statements are true:
- 1.
- The Markov operator is quasi-compact (i.e., the condition is satisfied) if and only if the following ergodic condition is satisfied:
- 2.
- The Markov operator is quasi-compact (i.e., the condition is satisfied) if and only if the following ergodic condition is satisfied:
- 3.
- Ergodicity conditions and are equivalent, i.e.,
Proof.
Theorem [13] [Theorem 1] requires that the linear operator under consideration be positive on some Banach lattice. Our measure spaces and are linear and are vector Banach lattices (see, for example, [19]). The Markov operators and A are positive by construction. These conditions are satisfied for and A. In addition, the conditions of [13] [Theorem 1] require that the operators and A satisfy the following properties:
Since for Markov operators it is always true that and , then the above conditions are also satisfied for them. Therefore, the asymptotic formulas and statements [13] [Theorem 1] is also true for both operators and A (in our notation). According to our Theorem 4 the conditions and are equivalent, i.e., . By Theorem 7 the equivalence of also holds. Therefore, . It follows that the conditions and are also equivalent, and statement 3 of this theorem is true. The theorem is proved. □
Remark 6.
Theorem 12, the operator projects the space onto the subspace of all its invariant probability finitely additive measures , i.e., and . According to Theorem 8 and Corollary 2 we have . Therefore, the restriction of the projector to the space is also a projector of the space to the subspace , i.e., and .
Remark 7.
The substantive difference between conditions and is that in the first condition, the supremum is taken over all initial finitely additive probability measures , including purely finitely additive probability measures . And in the second condition , the supremum is taken only over initial countably additive probability measures . However, the limit projections and coincide in this case.
Below, in Section 6, when analyzing Lin’s Example, we will show how this looks in a simple case.
6. Example Michael Lin
6.1. Description of Special Markov Chains
The work by Michael Lin [11] considers the linear positive contraction operator P in the space and its adjoint operator in the space , where . The operator P is called Markovian. A general classical definition of a quasi-compact operator is given. The equivalent Doeblin condition is also mentioned there with references to Neveu [14] and to Doob [27]. The said work contains a Remark at the end of paragraph 2 ([11], p. 467), which states that "If P is quasi-compact, then may fail to be quasi-compact". A corresponding counterexample is also given there. Theorem 4, which we proved above, states that if the Markov operator T defined on the space is quasi-compact, then so is the operator conjugate to it on the space , where , is also quasi-compact. The opposite is also true. Now, we will show, using the same example, that there is no contradiction here. Studying this example will also allow us to illustrate the possible applications of our theorems and methods. Let us give this example from Lin [11]. Given a Markov chain on a countable measurable space (in fact, the example considers ) with an explicit the form of its transition function:
The transition function of this Markov chain can be represented in other forms. For any and , we write:
where is the Dirac measure at point i, and is the characteristic function of the set E. This definition of the transition function makes it possible to use our Markov operators T, A, and on the spaces , and , respectively. As we have already noted, in the work of Lin [11], other spaces and are used, and, accordingly, other Markov operators P and . Next, we will consider step by step all three Markov operators of this Markov chain T, A, and , show their quasi-compactness, and reveal some of their other interesting properties.
6.2. Operator T
Recall that the operator in the general case is given by the following integral formula:
Let us substitute the formula (3) into equality (5), and after simple transformations, we obtain for our MC:
It is known that the unit function is invariant for any MC and for the corresponding operator (this can be verified by simple substitution). Invariant functions of the operator, T, are also called harmonic. Therefore, . We use (6) to find all invariant functions of the operator T and solve the equation , . Simple transformations give us the answer: , for all . Let us rewrite this equality in the equivalent form:
So, we have obtained that all invariant functions of a given operator T differ from the function h only by a numerical coefficient. Consequently, the operator T has only a one-dimensional space of all its invariant functions
Conclusion 1. We can consider function h to be the unique invariant function of a given MC up to a numerical coefficient . It is also obvious that this operator T does not have cycles. Let and . Using Equality (6), we obtain for all :
Let us introduce the projector onto the one-dimensional space of invariant functions of operator T.
According to [18] [VI.9.18], there are infinitely many such projectors. For all such operators the equality holds, where and , i.e., . Every finite-dimensional projector is a compact operator ([18] [VI.9.20]).
Now we make the following estimates for all , :
The right side of this inequality does not depend on the initial function and the current ordinate i. Hence, in the operator norm we obtain
Conclusion 2. The Markov operator T for a given MC is quasi-compact.
6.3. Operator A
In [11], an invariant countably additive measure for this Markov chain is given. In [11], it is not specified which measure is invariant. However, it is easy to check that for our operator A it is invariant. However, within the framework of our approach, we need all invariant (probabilistic) finitely additive measures , including purely finitely additive measures, if they exist. To do this, we need to solve the equation in the class of finitely additive measures . Let us write this equation in more detail:
In this notation, we cannot go from the integral to an infinite sum (to a series) since the measure can be purely finitely additive. We take a concrete set . Then, from the general equation, we obtain:
Suppose that the invariant measure is purely finitely additive. Then, on the one-point set, we have . However, above we obtained . From this contradiction, we obtained a conclusion.
Conclusion 3.
This MC does not have invariant purely finitely additive measures and all its invariant measures are countably additive. Now, we need to make sure that the MC has a unique invariant countably additive measure. We will do this by constructing it directly. Given the above Conclusion 3, we can construct this measure in the class using the operator A instead of the operator . We continue solving the equation:
For we have already received .
Now we will find :
For an arbitrary , by induction, we obtain
We need to make sure that the sum of these quantities is equal to one. Since the invariant measures are countably additive, then
Conclusion 4. From the construction it follows that the found invariant measure for the operators A and is unique, i.e., .
Lin [11] also gives the constructed invariant measure for a given Markov chain (in our case, for operators A and ). We repeat this measure here to indicate the operators to which it applies and to show its uniqueness in the class of finitely additive measures and in the class of countably additive measures, which is what we have done. Now, we prove the quasi-compactness of the operator A by direct methods without resorting to our theorems and corresponding theorems from other sources. Above, we have already found out that the operator A has a unique invariant countably additive measure . This measure is the only invariant measure for the operator in the class of finitely additive measures.
We introduce the following notation:
Obviously, Let an arbitrary initial countably additive measure be given. It generates a Markov sequence of measures . We will only need the first two measures and from this sequence. Let us study these measures. Let , and . Then for any the following formula holds true:
We consider step by step the values of on some of the sets we need.
- (1).
- Let . Then
- (2).
- Let . Then
- (3).
- Let . Then
- (4).
- Let . Then, by induction,
- (5).
- .
Remark 8.
For any initial measure we have . Consequently, the equality holds for any initial measure, i.e., they do not depend on . This is also true for the equality .
Now we will consider the second iteration in the Markov sequence of measures , where and
Let us take some concrete sets :
- (1).
- Let . Then
- (2).
- Let . Then
- (3).
- Furthermore, by induction, for we obtain:
- (4).
- .
- (5).
- and .
Remark 9.
Here the values of the measures and are also not dependent on the initial measure . Similarly, the values of and do not depend on .
Please note that , , and , where is the invariant measure of the operator A. This process of substitution of values by values (at ) continues to occur at . However, now it is enough for us to track such substitutions only for and , which we have already done. Thus, and , i.e., when . We estimate the norms of measures:
From the previous remarks, it follows that
i.e.,
Let us introduce the projector onto the one-dimensional space of invariant measures for operator A.
According to [18] [VI.9.18], there are infinitely many such projectors. For all such operators the equality holds, where and . Every finite-dimensional projector is a compact operator ([18] [VI.9.20]).
Now, from the obtained inequalities, it follows that in the operator norm, we obtained
Conclusion 5. The quasi-compactness condition from Definition 3 is satisfied. This means that the Markov operator of a given Markov chain is quasi-compact.
We proved this without using our theorems from this article. For the operator A, all classical ergodic theorems are true.
6.4. Purely Finitely Additive Initial Measure
In the present subsection, we investigate the behavior of the Markov sequence of measures , for the case when the initial measure is purely finitely additive, i.e., . First, we note that such a measure , like any other purely finitely additive measure on a given measurable space , has the following properties:
for any , and , for any finite set . Let us take the measure and find its value for some important sets . For an arbitrary set the following holds:
Let . Then
Let . Then
Let . Then
Let . Then
Let us now consider the second iteration of the Markov sequence of measures , where . We find the value of the measure for some sets using the same scheme as for (omitting simple transformations):
Let , where . Then . Now, let us consider the general case of the n-th iteration of a Markov sequence of measures for and : . From the formulas for and we obtain by induction the following values for :
Since the measures are finitely additive, then
From this, we obtain that . Obviously, if , then and . We decompose finitely additive measures into countably additive components and purely finitely additive components , for which . In this case, and , by condition. From the values of the measures obtained above on some sets , it is easy to see that
where is a countably additive Dirac measure at the point 0, and is some purely finitely additive measure (generated by the measure ), . Next,
where is also some purely finitely additive measure, . Then,
where . Let us find the norms of the resulting components of the measures . Since all the measures used in these components are positive, the Dirac measures are pairwise singular and , then
Conclusion 6. In the considered MC, for any initial purely finitely additive measure , the Markov sequence of measures generated by the measure has its countably additive components tending to one in the norm, while its purely finitely additive components tend to zero in the norm as . Obviously, in both cases, this convergence is uniform in the initial measure :
We also note that this convergence is exponentially fast.
6.5. Conjugate Operator
Now, we describe the operator of our Markov chain, in general, defined on the entire space of finitely additive measures . We want to prove by direct methods that the Markov operator of this Markov chain is quasi-compact. Let the initial finitely additive measure be arbitrary. This measure generates a Markov sequence of measures . Each such measure can be decomposed into a countably additive component and a purely finitely additive component : . Above, we have already considered Markov sequences of measures for the cases where and . The same decomposition can be used to consider the general case. However, it is more convenient to go the other way. To prove the quasi-compactness of the operator , we will use almost verbatim the proof of the quasi-compactness of the operator in Section 6.3. We replace the condition with the condition in the corresponding formulas in Section 6.3 for and . We also replace the notation of the operator A with the notation of the operator . It is easy to check that in all the corresponding transformations in Section 6.3, we never used the countable additivity of the measures and , but only the finite additivity of these measures. As a result, we obtain the following statements: For , , , we have:
Note that the values of the measure do not depend on the initial measure on the indicated sets . Recall that the measure , for any , is invariant for the operator A and for the operator . The measure is countably additive, but we did not use this in our transformations above. Obviously, , and , . Now we can obtain estimates for the norm for any initial measure , repeating the corresponding estimates for the same norm for any (we omit the details):
Since this estimate does not depend on the initial measure , then
i.e.,
Exactly as before for the operator , we introduce the projector onto the one-dimensional space of invariant measures for the operator . Obviously, a given projector is a projector onto the same one-dimensional space when it is restricted from the space to the space . From the estimates obtained above, we arrive at the operator norm
Conclusion 7. According to Definition 3, the operator is quasi-compact on the space for the considered MC. From classical general ergodic theorems, we obtain that for the considered MC, the general strong uniform ergodic theorem is true:
in the class of all bounded finitely additive measures (and in the class of all bounded countably additive measures). General Conclusion. An example of a Markov chain from Lin [11], if its operators T and are considered on the spaces and , respectively, does not contradict our Theorem 4 proved above, and confirms it.
7. Discussion
1. The work of Foguel [8] (1966) is one of the first in which the following construction of Markov operators is constructed for general Markov chains: and , where is the space of finitely additive measures (Foguel refers on [18]). We also use this construction in this paper and in our other works. In our present paper, we have not found any direct intersections with the results of [8]. In many of Fogel’s subsequent works (see, for example, [9,10]), the author focuses on topological spaces X and Markov operators in the spaces and . We do not consider such constructions in the present paper.
2. Horowitz [7] considers the construction of Markov operators (left) and (right). Theorem 4.1 in [7] proves the equivalence of 10 different conditions for ergodic and conservative Markov chains. In particular, it is proved that the condition “(a) there exists no invariant pure charge” is equivalent to the condition “(h) P is a quasi-compact operator on ”. In the present paper, we use a different construction of Markov operators. It is difficult to compare theorems proved within these two different approaches. Therefore, we can say that our Theorems 8 and 9 are distant analogs of Horowitz Theorem 4.1 [7].
3. We want to compare the results of the present paper with those of Lin [12]. We have already performed such an analysis in our paper [1] since the results of [1] are also close to the results of [12] [Theorem 5]. We consider it appropriate to give here a large corrected quotation from [1] [Section 7].
In the second part of [12] (1975), Lin considered an arbitrary measurable space and investigated the properties of the Markov operator P defined on the space of functions and the properties of the conjugate operator defined on the conjugate space. It should be noted that the symbols , , and also the isomorphism were not used explicitly in the text of article [12]. We are also using this formulation of the problem with the corresponding addition.
In Theorem 5 of [12], a number of assertions under some hard a priori condition of “ergodicity” on the Markov chain are considered. In particular, it is proved that the Doeblin condition (viii) is equivalent to the condition (vi): “The space of invariant functionals (i.e., finitely additive measures) is one-dimensional”. But this is true only due to the a priori “ergodicity” condition in [12] [Theorem 5]. In the general case, only the finite dimensionality of the space of invariant finitely additive measures, and hence also of invariant countably additive measures, follows from the Doeblin condition (see, for example, [27], and also our theorems in Sections 4 and 5 from [1]).
In the same Theorem 5 from [12], it was proved (under the same a priori ergodicity condition) that the Doeblin condition (viii) and the condition (vi) are equivalent to the condition : “Every -invariant functionals is a measure”, here it is countably additive measures.
If we translate these statements into the language we use, then we obtain a special case of our Theorem 7 [1], but for a one-dimensional space of invariant finitely additive measures.
Theorems 10 and 11 from [1] also generalize the corresponding assertions Theorem 5 from [12] since they do not assume that the Markov chain is ergodic (end of quote).
We can now add one more new remark to those given above. In our Theorem 4, it was proved that the condition “the operator is quasi-compact” is equivalent to the condition “the operator T is quasi-compact” (in [12] these operators are denoted by the symbols and P). There is no such statement in [12] [Theorem 5]. In [12], in point , it is written only that the operator P is quasi-compact.
Thus, our Theorem 4 significantly strengthens Theorem 5 from [12].
4. Hernández-Lerma and Lasserre proved in [16] [Theorem 6.3.1] (2003) that for a Markov chain defined on a separable metrizable phase space , under certain assumptions, there exists an invariant finite additive measure. It is also shown that if a finitely additive measure is invariant, then both its countably additive and purely finitely additive components are invariant. In this article, we consider general Markov chains and do not separately single out the particular case of the topological phase space.
Funding
The research was supported by Ministry of Science and Higher Education of the Russian Federation (project FFNU-2022-0036).
Data Availability Statement
Data are contained within the article.
Acknowledgments
The author thanks anonymous referees for their comments that helped improve the paper.
Conflicts of Interest
The author declares no conflicts of interest.
References
- Zhdanok, A. Invariant Finitely Additive Measures for General Markov Chains and the Doeblin Condition. Mathematics 2023, 11, 3388. [Google Scholar] [CrossRef]
- Zhdanok, A.I. Finitely additive measures in the ergodic theory of Markov chains II. Sib. Adv. Math. 2003, 13, 108–125. [Google Scholar]
- Zhdanok, A.I. Finitely additive measures in the ergodic theory of Markov chains I. Sib. Adv. Math. 2003, 13, 87–125. [Google Scholar]
- Kryloff, N.; Bogoliouboff, N. Sur les probabilités en chaine. C. R. Acad. Sci. Paris 1937, 204, 1386–1388. [Google Scholar]
- Yosida, K.; Kakutani, S. Operator-theoretical treatment of Markoff’s processes and mean ergodic theorem. Ann. Math. Second Ser. 1941, 42, 188–228. [Google Scholar] [CrossRef]
- Doeblin, W. Sur les propriétés asymptotiques de mouvements régis par certains types de chaines simples. Bull. Math. Soc. Roum. Sci. 1937, 39, 57–115. [Google Scholar]
- Horowitz, S. Transition probabilities and contractions of L∞. Z. Wahrscheinlichkeitstheorie Verw Geb. 1972, 24, 263–274. [Google Scholar] [CrossRef]
- Foguel, S.R. Limit theorems for Markov processes. Trans. Am. Math. Soc. 1966, 121, 200–209. [Google Scholar] [CrossRef]
- Foguel, S.R. The Ergodic Theory of Markov Processes; Van Nostrand Reinhold Co.: New York, NY, USA, 1969. [Google Scholar]
- Foguel, S.R. The ergodic theory of positive operators of continuous functions. Ann. Scuola Norm. Sup. Pisa—Cl. Sci. 1973, 27, 19–51. [Google Scholar]
- Lin, M. On quasi-compact Markov operators. Ann. Probab. 1974, 2, 464–475. [Google Scholar] [CrossRef]
- Lin, M. Quasi-compactness and uniform ergodicity of Markov operators. Ann. Inst. Henri Poincare Probab. Stat. 1975, 11, 345–354. [Google Scholar]
- Lin, M. Quasi-compactness and uniform ergodicity of positive operators. Isr. J. Math. 1978, 29, 309–311. [Google Scholar] [CrossRef]
- Neveu, J. Bases Mathematiques du Calcul des Probabilites; Masson et Cie: Paris, France, 1964. [Google Scholar]
- Revuz, D. Markov Chains, 2nd ed.; North-Holland Mathematical Library: Amsterdam, The Netherlands, 1984. [Google Scholar]
- Hernández-Lerma, O.; Lasserre, J.B. Markov Chains and Invariant Probabilities; Birkhäuser: Basel, Switzerland, 2003. [Google Scholar]
- Mebarki, L.; Messirdi, B.; Benharrat, M. Quasi-compactness of linear operators on Banach spaces: New properties and application to Markov chains. Asian-Eur. J. Math. 2021, 14, 2150060. [Google Scholar] [CrossRef]
- Dunford, N.; Schwartz, J.T. Linear Operatiors, Part I: General Theory; Interscience Publishers, Inc.: New York, NY, USA; London, UK, 1958. [Google Scholar]
- Yosida, K.; Hewitt, E. Finitely Additive Measures. Trans. Am. Math. Soc. 1952, 72, 46–66. [Google Scholar] [CrossRef]
- Zhdanok, A.; Khuruma, A. Decomposition of Finitely Additive Markov Chains in Discrete Space. Mathematics 2022, 10, 2083. [Google Scholar] [CrossRef]
- Rao, K.P.S.B.; Rao, M.B. Theory of Charges: A Study of Finitely Additive Measures; Academic Press: London, UK, 1983. [Google Scholar]
- Zhdanok, A.I. Cycles in Spaces of Finitely Additive Measures of General Markov Chains. In Recent Developments in Stochastic Methods and Applications. Springer Proceedings in Mathematics Statistics; Shiryaev, A.N., Samouylov, K.E., Kozyrev, D.V., Eds.; Springer: New York, NY, USA, 2021; Volume 371, pp. 131–143. [Google Scholar]
- Ramakrishnan, S. Finitely Additive Markov Chains. Trans. Am. Math. Soc. 1981, 265, 247–272. [Google Scholar] [CrossRef]
- Šidak, Z. Integral representations for transition probabilities of Markov chains with a general state space. Czechoslov. Math. J. 1962, 12, 492–522. [Google Scholar] [CrossRef]
- Krein, M.G.; Rutman, M.A. Linear operators leaving invariant a cone in a Banach space. Uspekhi Mat. Nauk 1948, 3, 3–95. [Google Scholar]
- Herkenrath, U. Markov processes under continuity assumptions. Rev. Roum. Math. Pures Appl. 1977, 22, 1419–1431. [Google Scholar]
- Doob, J.L. Stochastic Processes; Wiley: New York, NY, USA, 1953; p. MR0058896. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).