Abstract
We establish a connection between random set theory and Boolean valued analysis by showing that random Borel sets, random Borel functions, and Markov kernels are respectively represented by Borel sets, Borel functions, and Borel probability measures in a Boolean valued model. This enables a Boolean valued transfer principle to obtain random set analogues of available theorems. As an application, we establish a Boolean valued transfer principle for large deviations theory, which allows for the systematic interpretation of results in large deviations theory as versions for Markov kernels. By means of this method, we prove versions of Varadhan and Bryc theorems, and a conditional version of Cramér theorem.
1. Introduction
A situation which often arises in probability theory is the necessity to generalize a known unconditional theorem to a setting which is not unconditional any more. Instead, there is an infinity of models depending on some parameter , for a probability space , and the theorem has to be applied simultaneously to ‘almost all’ . The formalization of this approach has motivated new developments in probability theory such as [1,2]. In [3], the connection between the algebra of conditional sets [1,4] and Boolean valued analysis [5] was provided. In the present paper, we provide a similar connection for the framework of random set theory [2]. We aim to show that the well-known set-theoretic techniques of Boolean valued analysis are perfectly suited to the type of applications of random set theory. The advantage is that the present approach allows for applying the full power of the set-theoretical methods. In particular, the so-called transfer principle of Boolean valued models provides a tool for expanding the content of already available theorems to non-obvious analogues in random set theory.
To reach this aim, we study the Boolean valued representation of different objects in random set theory. First, we study the Boolean valued representations of random sets. The notion of a random set gives meaning to random objects X whose realizations , , take values as subsets of some space ; see [2]. These objects have an important role in mathematical finance and stochastic optimization; see e.g., [6,7,8,9,10]. By considering the Boolean valued model associated with the underlying probability space, we show that a random set corresponds to a Borel set in this model. Moreover, we characterize when a random set corresponds to an open set, a closed set, or a compact set in the model. Similar connections are provided for random Borel functions. Second, we study the Boolean valued representation of Markov kernels in the model. Namely, we prove that a Markov kernel corresponds to a Borel probability measure in the model. Moreover, the Lebesgue integral of a random Borel function with respect to a Markov kernel corresponds to the Lebesgue integral of a real-valued function with respect to the corresponding Borel probability measure in the model. Third, we study the Boolean valued representation of Markov kernels that stem from regular conditional distributions of real-valued random variables. Namely, we show that a random variable corresponds to a random variable in a Boolean valued model so that its regular conditional probability corresponds to its probability distribution in the model. Moreover, this correspondence preserves arithmetic operations, almost sure convergence, and sends conditionally independent random variables to (unconditional) independent random variables in the model.
In probability theory, large deviations theory concerns the asymptotic behavior of sequences of probability distributions; see, e.g., [11]. By means of the previous Boolean valued representations and the Boolean valued transfer principle, we prove versions for sequences of Markov kernels of basics results in large deviations theory. Namely, we obtain versions of Varadhan’s and Bryc’s theorems and a conditional version of Cramér’s theorem for the sequence of sample means of a conditionally independent identically distributed sequence of real-valued random variables. We emphasize that these results are just instances of application of the transfer principle, whose number can be easily increased. We point out that large deviations results for Markov kernels are particularly important in the theory of random walks in random media and are called ‘quenched’ large deviations principles; see, e.g., [12].
The paper is organized as follows: in Section 2, we recall some basics of Boolean valued analysis. Section 3 is devoted to some preliminaries on random sets. In Section 4, we study the Boolean valued representation of random sets and random functions. In Section 5, we study the Boolean valued representation of Markov kernels. In Section 6, we study the Boolean valued representation of regular conditional probability distributions. Finally, in Section 7, a Boolean valued transfer principle for large deviations of sequences of Markov kernels is provided.
2. Basics of Boolean Valued Analysis
The main tool of Boolean valued analysis is Boolean valued models of set theory. The precise formulation of Boolean valued models requires some familiarity with the basics of set theory and logic, and, in particular, with first-order logic, ordinals and transfinite induction. For the convenience of the reader, we will give some background. All the principles and results in this section are well-known, and details can be found in [5].
Let us consider a universe of sets satisfying the axioms of the Zermelo–Fraenkel set theory with the axiom of choice (ZFC), and a first-order language , which allows for the formulation of statements about the elements of . In the universe , we have all possible mathematical objects (real numbers, topological spaces, and so on). The language consists of names for the elements of together with a finite list of symbols for logic symbols (∀, ∧, ¬, and parentheses), variables, and the predicates = and ∈. Though we usually use a much richer language by introducing more and more intricate definitions, in the end, any usual mathematical statement can be written using only those mentioned. The elements of the universe are classified into a transfinite hierarchy: , where , is the family of all sets whose elements come from , and for limit ordinal .
In the following, we consider an underlying probability space , which is a member of the universe . Then, the associated measure algebra is a complete Boolean algebra with ‘unity’ and ‘zero’ , where and , and Boolean operations
for . As usual, we write whenever . Furthermore, has the countable chain condition, that is, any partition of the unity by elements of is at most countable. Although we focus on the case of the measure algebra associated with a probability space, the following constructions and principles work for any complete Boolean algebra, by replacing countable partitions by arbitrary partitions.
The Boolean valued universe is constructed by transfinite induction over the class of ordinals of the universe . We start by defining . If is the successor of the ordinal , we define to be the set of all functions with . If is a limit ordinal, . Finally, let Given u in , we define its rank as the least ordinal such that u is in .
We consider a first-order language which allows us to produce statements about . Namely, let be the first-order language which is the extension of by adding names for each element in . Throughout, we will not distinguish between an element in and its name in . Thus, hereafter, the members of will be referred to as names.
Suppose that is a formula in set theory, that is, is constructed by applying logical symbols to atomic formulas and . If does not have any free variable and all the constants in are names in , then we define its Boolean truth value, say , which is a member of and is constructed by induction in the length of by naturally giving Boolean meaning to the predicates = and ∈, the logical connectives, and the quantifiers. Namely, the Boolean truth value of the atomic formulas and for u and v in is defined by transfinite recursion as follows:
where, for , we denote . For nonatomic formulas, we have
We say that a formula is satisfied in the model , whenever it is true with the Boolean truth value, i.e., . Say that two names are equivalent when . It is not difficult to verify that the Boolean truth value of a formula is not affected when we change a name by an equivalent one. However, the relation does not mean that the functions u and v coincide. In order to overcome these difficulties, we will consider the separated universe. Namely, let be the subclass of defined by choosing a representative of the least rank in each class of the equivalence relation .
In the model , we have all possible mathematical objects (real numbers, topological spaces, and so on), and a full mathematical discussion is possible. For instance, if a name u satisfies that , we will say that u is a vector space in the model . If f, u, and v are names such that , we will say that f is a function from u to v in the model . Throughout, we will use this terminology for different mathematical objects without further explanations.
2.1. Principles in the Universe
Next, we recall some important principles.
Theorem 1.
(Transfer Principle) If φ is a theorem of ZFC, then φ holds in the model .
The transfer principle tells us that any available theorem holds true in the model .
Theorem 2.
(Maximum Principle) Let be a formula with free variables and suppose that . Then, there exists such that
Theorem 3.
(Mixing Principle) Let be a countable partition of and a sequence of names. Then, there exists a unique member u of such that for all .
Given a countable partition of and a sequence of elements of , we denote by , the unique name u in satisfying for all .
2.2. Descent Operation
Given a member u of with we define its descent by
2.3. Ascent Operation
Consider a nonempty set X of members of . We define the ascent of X to be the unique representative in of the name given by the function
Then, it is satisfied that .
2.4. Boolean Valued Representation
Definition 1.
A nonempty set X is called a stable -set if a function exists such that:
- 1.
- ;
- 2.
- iff ;
- 3.
- ;
- 4.
- for every countable partition of and sequence , there exists a unique such that for all .
Given a sequence and a countable partition of , we denote by the unique element given by (4) above.
Remark 1.
- 1.
- The requirement in (4) above that x is unique is superfluous as the uniqueness can be proven from (1) above; for more details, see, e.g., ([5], Section 3.4.2).
- 2.
- A stable -set can be reformulated as a Boolean metric space by considering the Boolean metric .
- 3.
- If satisfies that , then, due to the mixing principle, is a stable -set for .
Definition 2.
Suppose that X, Y are stable -sets. A function is said to be stable if for every sequence and countable partition of .
A name is said to be a Boolean valued representation of the stable -set X (or that X is a Boolean valued interpretation of u) if there exists a stable bijection
Proposition 1.
Every stable -set X admits a Boolean valued representation , which is unique up to bijections in the model .
Let X and Y be stable -sets with Boolean valued representations and , respectively.
- Suppose that is a name for the Cartesian product of and in the model . Then, is a Boolean representation of the stable -set . (Notice that is a stable -set by setting .) More precisely, there exists a bijection such thatfor all .
- A nonempty subset is said to be stable if for every countable partition of and sequence it holds that is again an element of S. Given a stable set , we define . Then, is a nonempty subset of in the model . Due to the mixing principle, one has that is a stable bijection between S and . In addition, the correspondence is a bijection between the class of stable subsets of and the class of names for nonempty subsets of X in the model .
- Suppose that is stable. Then, there exists a unique member of such that is a function from to in the model with for all .
Remark 2.
The notion of stability plays a key role in related frameworks. In Boolean valued analysis, the terminology cyclic or universally complete-sets is used (here is a complete Boolean algebra, for instance, we can take ), see [5]. In conditional set theory, the terminology stable set, stable function, and stable collection are used, see [4]. In fact, the notion of conditional set is a reformulation of that of cyclic -set; see ([3], Theorem 3.1) and ([3], Remark 3.1). In the theory of -modules, the notion of stability is called countable concatenation property; see [13], and, in random set theory, is called countable decomposability; see [2].
2.5. Manipulation of Boolean Truth Values
We recall some useful rules to manipulate Boolean truth values. Suppose that X is a stable -set. Given a nonempty subset Y of X, we define the stable hull of Y to be
Then, is the smallest stable subset of X that contains Y. In particular, we can consider the bijection of into its Boolean valued representation .
Proposition 2.
Let be a formula with free variables , and X a stable -set. Then, given a nonempty subset Y of X and ,
is satisfied. Moreover, it holds that
- 1. if and only if for all ;
- 2. if and only if for some .
2.6. Boolean Valued Numbers
Denote by , , and the spaces of classes of equivalence of —almost surely equal -measurable random variables with values in the natural numbers, rational numbers, real numbers, and extended real numbers, respectively. Given , the inequalities and are understood in the almost sure sense. It is well known that any nonempty subset S of has a (unique) supremum in for the particular other ≤, which we denote by ; see e.g., ([14], Section A.5). In particular, is a Dedekind complete ring lattice for the partial order ≤. Similarly, we denote by the infimum of S. For , we set
- ,
- ,
where are representatives arbitrarily chosen of , respectively.
By applying the maximum and transfer principles, there exist names , , , and for the sets of natural numbers, rational numbers, real numbers, and extended real numbers, respectively, in the model . Takeuti [15] proved that is a Boolean valued representation of . This fact amounts to the following; see [3,16]. There exists a bijection
such that
- (R1)
- for every sequence and countable partition of . (Here, by convention we set .)
- (R2)
- , for every .
- (R3)
- bijects into .
- (R4)
- bijects into .
- (R5)
- bijects into .
- (R6)
- , and for every .
- (R7)
- , and for every .
- (R8)
- If is stable; then,
Remark 3.
- 1.
- Since and are countable, we have that and . Then, in view of Proposition 2, we can reduce all essentially countable quantifiers in the model , like , ..., to check constant names for , ,... This type of manipulation of Boolean truth values will be done throughout without further explanations.
- 1.
- Consider a member u of with . Suppose that is a sequence of elements of . For any , define . The functionis well-defined due to (R3) and stable due to (R1). Then, we can consider , which is a name for a sequence in the model such that for all . Conversely, suppose that w is a sequence of elements of v in the model , i.e., . Then, we can consider a sequence with for each .
Remark 4.
A fundamental result in Boolean valued analysis is the so-called Gordon’s theorem, which states that the field of real numbers of a Boolean valued universe (associated with an arbitrary complete Boolean algebra) is the Boolean valued representation of a universally complete vector lattice [17]. This result was complemented by Kusraev and Kutateladze by establishing that any universally complete vector lattice is the interpretation of the field of real numbers in a suitable Boolean valued universe; see e.g., ([5], Section 5.2). In the particular case of the model , the universally complete vector lattice is the interpretation of the field of real numbers in the model , which was fruitfully exploited by Takeuti [15]. For more details about the Gordon’s theorem, we refer to [18].
3. Preliminaries on Random Sets
We next recall some basics of random sets, for a detailed account, we refer to [2]. Hereafter, is a complete probability space. (If is not complete, we can always consider the completion and the corresponding extension of on . Notice that and produce the same measure algebra and, consequently, the same model .) Throughout, is an infinite Polish space (i.e., a separable completely metrizable topological space). We denote by the space of classes of equivalence of -measurable random variables with values in , and by the Borel -algebra of . For a sequence , we write whenever for a.e. .
We consider the product -algebra . Under the present assumptions, projections onto are measurable and measurable selectors exist. Namely, the following proposition holds true; see, e.g., ([19], Theorem 5.4.1).
Proposition 3.
for every the following is satisfied:
- (A)
- the projection onto Ω is an element of ;
- (B)
- there exists such that for a.e. , where denotes the ω-section of M.
A set-valued mapping is said to be an -measurable random Borel set (shortly, random Borel set) if its graph
is an element of . Throughout, we identify two random Borel sets whenever for a.e. . We denote by the set of all (equivalence classes of) random Borel sets. Given , we say that is an a.s. -measurable selector of X if for a.e. . We denote by the set of all a.s. -measurable selectors of X. Due to (B) in Proposition 3, is nonempty whenever X is a.s. nonempty. Regarding a set as a constant set-valued mapping, we denote by the set of classes of equivalence of -valued variables with values in E. Suppose that :
- Say that X is a random closed set if is closed for a.e. ;
- say that X is a random open set if is open for a.e. ;
- say that X is a random compact set if is compact for a.e. .
Suppose that and and let be representatives arbitrarily chosen of , respectively. We write
- ;
- ;
- .
We say that a function is:
- A random Borel function if F is measurable, where and are endowed with the -algebras and , respectively;
- essentially bounded if there exists such that, for every ,holds.
Two random Borel functions are identified whenever for a.e. . We denote by the set of (equivalence classes of) random Borel functions, and by the set of (equivalence classes of) functions which are essentially bounded.
Given , we write whenever for a.e. . Then, the binary relation ≤ is a partial order on .
Given a sequence in , we write for if for a.e. it is satisfied that for all .
We say that is random continuous if, for every sequence , we have
whenever
Denote by (resp. ) the set of all random continuous functions F in (resp. ).
4. Boolean Valued Representation of Random Sets and Random Functions
For every sequence and countable partition of , denote by the unique member such that for a.e. , for all . Then, the space is a stable -set by defining . Therefore, due to Proposition 1, admits a Boolean valued representation, say . More specifically, there exists a bijective mapping
such that for all countable partition of and sequence .
4.1. Boolean Valued Representation of Random Borel Sets
Takeuti [15] showed that the elements of the product -algebra correspond to real Borel sets in the model . Next, we extend this result to an arbitrary Polish space by showing that each random Borel set corresponds to a Borel subset of in the model . Furthermore, we characterize open, closed, and compact sets of in the model .
Given , we denote by the unique element of equivalent to the name given by the function
If X is a.s. nonempty, then is a stable set. In that case, it is not difficult to show that . A manipulation of Boolean truth values shows the following.
Proposition 4.
Suppose that , then:
- 1. for all ;
- 2.;
- 3.;
- 4.;
- 5..
Due to (2) in Remark 3, every sequence corresponds to a sequence of subsets of in the model . A manipulation of Boolean truth values bearing in mind (1) in Remark 3 yields the following.
Proposition 5.
Let be a sequence and . Then,
- 1.;
- 2..
Let be any metric compatible with the topology of . Consider the random metric
given by for a.e. . Then, is a stable function and therefore we can consider a name with . Furthermore, a manipulation of Boolean truth values shows that, in the model , is a metric on .
Suppose that is a sequence in a stable subset S. Due to (2) in Remark 3, is a name for a sequence in in the model . As a consequence of Takeuti ([15], Proposition 2.2.1), we have the following.
Lemma 1.
Let be a sequence in . Then, iff .
Lemma 2.
Let be a.s. nonempty. If , then for a.e. .
Proof.
Let , be representatives of X, Y, respectively. Define
Due to (A) in Proposition 3, the projection is an element of . Then, . Otherwise, due to (B) in Proposition 3, we can find such that , which contradicts our assumption. □
Definition 3.
We say that a subset is sequentially closed if for every sequence such that it holds that .
In the case (), Kabanov and M. Safarian ([19], Proposition 5.4.3) proved that a stable set is sequentially closed if and only if it is the set of measurable selectors of a random closed set. This was generalized in [20], Theorem 5.1 to an arbitrary Polish space ; see also ([16], Theorem 5.4.1). We complement this result by providing the corresponding Boolean valued representation.
Proposition 6.
Suppose that is stable. The following conditions are equivalent:
- 1.
- There exists a random closed set X such that ;
- 2.
- S is sequentially closed;
- 3.
- 3..
In that case, if , then for a.e. .
Proof .
is precisely ([20], Theorem 5.1). follows by a manipulation of Boolean truth values taking into account Lemma 1 and Remark 3. □
Definition 4.
We say that is open if for every there exists such that .
Proposition 7.
Suppose that is stable. The following conditions are equivalent:
- 1.
- There exists a random open set X such that ;
- 2.
- S is open;
- 3.
- .
In that case, if , then for a.e. .
Proof .
is verified by a manipulation of Boolean truth values.
: Let . Due to (A) in Proposition 3, is an element of . Consider the random Borel set Y where for a.e. and for a.e. . Then, Y is a random closed set and, consequently, is a closed set in the model due to Proposition 6. On the one hand, we have
On the other hand, . Thus, , hence is an open set in the model .
: If , then . Define
where . Then, C is a stable set such that . Therefore, there exists a random closed set Y such that due to Proposition 6. Let be such that for a.e. and for a.e. . Then, X is a random open set such that . □
Definition 5.
We say that is stably compact if S is stable and, for every sequence , there exist and a sequence with such that . Here, for .
The notion of stable compactness is standard in Boolean valued analysis. Usually, the terminology cyclical compactness is employed. It is well known that stable compact sets are represented by compact sets in the Boolean valued model; see, e.g., [21,22], which amount to the equivalence below in the present context. In conditional set theory, it is used the terminology conditional compactness; see [3,4,20]. In particular, it was proven in ([23], Theorem 5.12) and ([16], Theorem 5.4.2) that, in the case that (), a set is stably compact if and only it is the set of measurable selectors of a random compact set. All these known results amount to the following.
Proposition 8.
Suppose that is stable. The following conditions are equivalent:
- S is stably compact;
- .
If with , the conditions (1) and (2) are equivalent to
- 3.
- There exists a random compact set X such that .
In that case, if , then for a.e. .
The following result was obtained by Takeuti [15] in the case . For the general case, we need to rely on Proposition 7.
Proposition 9.
is a Borel subset of in the model if and only if there exists such that .
Proof.
Consider the collection
Notice that is stable. Moreover, is a collection of subsets of in the model . Then, due to Propositions 4 and 5, is a -algebra in the model . Furthermore, due to Proposition 7, contains all the open subsets of in the model . We conclude that, for every u which is a Borel set in the model , there exists such that .
For the converse, consider the collection
where denotes the random Borel set . If with and open, we have that , which is an open set in the model due to Proposition 7. Due to Propositions 4 and 5, is a -algebra. It follows that . The proof is complete. □
By noting that, if is a.s. nonempty, then , we can rewrite Proposition 9 in terms of sets of a.s. measurable selectors.
Corollary 1.
Suppose that is stable. The following conditions are equivalent:
- 1.There exists a random Borel set X such that ;
- 2..
In that case, if X, Y are random Borel sets such that , then for a.e. .
4.2. Boolean Valued Representation of Random Borel Functions
In the following, we connect random Borel functions and random continuous functions with Borel functions and continuous functions, respectively, in the model . Takeuti [15] established a similar connection between the so-called pseudo-Baire functions and ( to ) Baire functions in the model . Since the class of Baire functions equals the class of Borel functionals in the to case, the connections provided below extends the results in [15] to the case of a general Polish space instead of .
For , we define , where for each we set for a.e. . Since is stable, we can define , which is a function from to in the model .
Lemma 3.
Let . If for all , then . In particular, if , then .
Proof.
Fix representatives of , respectively. Define
Due to (A) in Proposition 3, the projection is an element of . Then, . Otherwise, due to (B) in Proposition 3, we can find such that for a.e. , which contradicts our assumption. □
Proposition 10.
Suppose that , then the following conditions are equivalent:
- 1.
- ;
- 2.
- for every ;
- 3.
- .
Proof .
follows by manipulation of Boolean truth values.
is clear, and is Lemma 3. □
Due to Remark 3, any sequence corresponds to a sequence of functions in the model .
Proposition 11.
Suppose that for every . The following conditions are equivalent:
- 1.
- ;
- 2.
- for every ;
- 3.
- .
Proof .
follows by manipulation of Boolean truth values.
is clear.
: Fix representatives of , respectively. Define
Due to (A) in Proposition 3, the projection is an element of . Then, . Otherwise, due to (B) in Proposition 3, we can find such that for a.e. , which contradicts our assumption. The argumentation for the limit inferior is similar. □
The following lemma was proven in ([15], Proposition 2.4.1) in the case . The general case follows by the same argument.
Lemma 4.
Suppose that . If , then there exists such that and .
Proposition 12.
Let be a stable function. The following conditions are equivalent:
- 1.There exists such that ;
- 2..
In that case, if for , then .
Proof.
: Define
Notice that is stable. Then, is a collection of functions from to in the model . We prove that contains all the Borel measurable functions from to in the model . Supposing that is a characteristic function on a Borel subset of in the model , then for some , due to Proposition 9. In that case, with . In the model , is closed under linear combinations and, due to Lemma 4, is also closed under limits. Therefore, contains all the Borel measurable functions in the model .
: Define
We prove that . If for some , then , where . The collection is closed under linear combinations. In addition, if with , then for any
in the model . Then, is limit of Borel function, hence it is Borel measurable in the model . □
The following result is proven word by word as in the case ; see ([15], Theorem 2.3.2).
Proposition 13.
Let be a stable function. The following conditions are equivalent:
- 1.
- There exists such that ;
- 2.
- .
In that cases, if for , then .
5. Boolean Valued Representation of Markov Kernels
Next, we recall the notion of Markov kernel, which is a fundamental object in probability theory.
Definition 6.
Let be a function.
- κ is called an essential Markov kernel if:
- is -measurable for all ;
- , for a.e. ;
- If and for , then for a.e. .
- An essential Markov kernel κ is called a Markov kernel if is a probability measure for all .
Lemma 5.
([24], Theorem 1) If μ is an essential Makov kernel, there exists a Markov kernel κ such that for a.e. , for all .
Lemma 6.
Let be a Markov kernel and . If is a representative of X, then is measurable.
Proof.
Define
Above, denotes . First, if with and , we have that , which is measurable. Hence, . In particular, we have . We prove that is a Dynkin system, i.e., is closed under proper differences and under the unions of increasing sequences of sets. If with , we have that is measurable. If is an increasing sequence of elements of , then it follows that is measurable. Finally, since is a -system (i.e., closed under finite intersections), we conclude that by Dynkin’s - theorem; see, e.g., ([25], Theorem 1.6.2). □
Given a Markov kernel , we define
where for a.e. . Note that is well-defined due to Lemma 6. Then, the function
is stable. Notice that, due to Proposition 9, is the descent of the Borel -algebra on in the model . We can consider a name that satisfies and
Then, a manipulation of Boolean truth values bearing in mind Propositions 4 and 5 proves the following.
Proposition 14.
Let be a Markov kernel. Then, is a Borel probability measure in the model .
In a converse direction, we have the following.
Proposition 15.
Suppose that Q is a Borel probability measure in the model . Then, there exists a Markov kernel such that . Moreover, if are Markov kernels such that , then for a.e .
Proof.
Given , we denote again by E the (class of the) constant random set . Take such that . By choosing a representative of for each , we can define such that is -measurable and for a.e. . Then, is an essential Markov kernel. Due to Lemma 5, there exists a Markov kernel such that for a.e. , for all . Define
For with and , we have that for a.e. . In the model , it holds that
By Propositions 4 and 5, is a -algebra, hence .
Finally, suppose that are Markov kernels with . If , then, in the model , it holds
Hence, for a.e. , for every . Define . Since is second countable, then there exists a countable -system with (it suffices to take the collection of finite intersections of a countable topological base). For each , define : = Then, , by the Dynkin’s - theorem. Hence, , and since . Then, the assertion follows. □
Remark 5.
A related connection of the above reciprocal relations in Propositions 14 and 15 is given in ([1], Theorem 4.1) using the language of conditional sets.
Suppose that is a Markov kernel and . If is -integrable for a.e. , we denote by the unique element such that for a.e. .
Proposition 16.
Let be a Markov kernel and . Then, is -integrable for a.e. if and only if . In that case,
Proof.
We prove (1) for . It is not difficult to show the equality for with . By linearity, (1) holds for F simple. Finally, for arbitrary, take a sequence of simple functions with . Then, in the model , by monotone convergence one has
The proof is complete. □
Denote by the set of all Borel probability measures on the Polish space . We endow with the Prokhorov metric, which is defined by
where . Recall that the metric induces the topology of weak convergence of probability measures. Namely, a sequence converges to Q if and only if for every , where denotes the set of all bounded continuous functional . In addition, the set , endowed with the Prokhorov metric, is a Polish space and the metric is compatible with the weak topology . For further details, see [26,27]. Since is Polish, it is possible to apply the results studied in the previous section to any Boolean valued representation of .
Markov kernels can be regarded as -valued random variables. Namely, we have the following.
Lemma 7.
([28], Theorem A.5.2) If is a Markov kernel, then there exists such that for a.e. . If , then there exists a Markov kernel such that for a.e. .
In virtue of Lemma 7, in the following, an element is regarded as an equivalence class of Markov kernels.
Denote by a name for the Borel probability measures on , in the model . Proposition 14 and 15 tell us that
is a stable bijection. In other words, is a Boolean valued representation of . In particular, we can apply the relations provided in the previous section to the Prokhorov metric , and its randomized version can be defined as
for every .
6. Boolean Valued Representation of Regular Conditional Probability Distributions
In the following, we assume that is a probability space and is a sub--algebra of that contains all the -null sets.( If does not contain all the null sets, we can complete and by considering the -algebras and .) Conditional expectations are commonly defined for random variables with finite expectation, but we can naturally extend this to a more general setting. Namely, for with we define the extended conditional expectation of by
Next, we study a Boolean valued representation of in the model . Gordon [29] established that the conditional expectation is the Boolean valued interpretation of the usual expectation. Further relations can be found in ([16], Chapter 4). We start by briefly recalling how a probability space can be made into a probability space in the model ; the details can be found in ([16], Chapter 4). This construction is valid for a general probability space even if it is not complete. Consider the measure algebra . Then, is a stable -set, where for (see e.g., ([30], Section 1.10)). Therefore, we can consider the bijection
of into its Boolean valued representation . In addition, it is shown that is a complete Boolean algebra in the model . Furthermore, the conditional probability
is a stable function and it can be shown that is a probability measure on , in the model . By applying first the transfer principle to the Stone representation theorem for measure algebras ([31], 321J) and then the maximum principle, we can find members , , and of such that:
- , and
- .
Furthermore, let be a name with . Then, as shown in ([16], Proposition 4.1.6), it is possible to find a bijection
such that:
- (S1)
- and coincide on ;
- (S2)
- For every sequence and countable partition of , holds;
- (S3)
- ;
- (S4)
- ;
- (S5)
- ;
- (S6)
- ;
- (S7)
- ;
- (S8)
- for all ;
- (S9)
- for all ;
- (S10)
- for all with .
Given and , we define to be the class in of the set , where and are arbitrary representatives of and X, respectively. In addition, in the model , we can similarly consider for the random variable and the Borel set .
Proposition 17.
Suppose that . Then, it holds
for all . In particular, if X is a.s. nonempty
Proof.
The collection
is well-defined and stable. Then, is a collection of real Borel sets in the model , due to Proposition 9. Moreover, in the model , is a -algebra that contains the real intervals due to (S7). Therefore, , and the assertion follows. □
If , it is well known that there exists a Markov kernel such that
for all . (Actually, it is a consequence of Lemma 5.) The Markov kernel above is called a regular conditional distribution of given ; see, e.g., [32]. The following result tells us that the conditional distribution of a real-valued random variable can be interpreted as the distribution of a real-valued random variable in the model .
Proposition 18.
Suppose that . Then,
holds for every . In particular, if X is a.s. nonempty
Proof.
Suppose that is a regular conditional distribution of given . We have that for all . In the model , the probability measure
agrees with on real intervals due to (S9). Therefore, in the model . □
A sequence in is said to be:
- Conditionally independent if it is satisfied thatfor all and .
- Conditionally identically distributed iffor all and .
Given a sequence , by stability, we can find a sequence of elements of in the model .
Proposition 19.
([33], Proposition 4.6) Suppose that is a sequence in such that . Then,
In particular, exists iff . In that case,
A manipulation of Boolean truth values bearing in mind (S9) above shows the following result, which holds true even if the underlying probability space is not complete.
Proposition 20.
Suppose that is a sequence in . Then, the following properties hold:
- 1.
- is conditionally independent iff ;
- 2.
- is conditionally identically distributed iff .
7. A Transfer Principle for Large Deviations of Markov Kernels
As an application of the connections provided in the previous sections, we next develop a transfer principle that allows for the interpretation of results in large deviations theory as versions for sequences of Markov kernels. Let us first recall some basics of large deviations theory. For a thorough account, we refer to [11]. Suppose that is a rate function (i.e., a not identically lower semicontinuous function). Let be a sequence in .
- Say that satisfies the large deviation principle (LDP) with rate function I if
- Say that satisfies the Laplace principle (LP) with rate function I iffor every bounded continuous function .
- Say that is exponentially tight if, for every , there exists a compact set such that
Next, we introduce analogues for Markov kernels of the notions above.
Definition 7.
A function is called a conditional rate function, if the following holds:
- 1.I is stable;
- 2.there exists such that ;
- 3. whenever .
Definition 8.
Suppose that is a sequence of Markov kernels and is a conditional rate function.
- 1.
- Say that satisfies the conditional large deviation principle (cLDP) with conditional rate function I if:for all a.s. nonempty random open set ,for all a.s. nonempty random closed set .
- 2.
- Say that satisfies the conditional Laplace principle (cLP) with conditional rate function I iffor all .
- 3.
- Say that is conditionally exponentially tight if for every there exists a.s. nonempty such that is stably compact and
As a consequence of (R8) above, we have the following.
Lemma 8.
Suppose that is a stable function, then:
- 1.
- For every a.s. nonempty , it holds that
- 2.
- for every , it holds that
Given a conditional rate function , we can find a name such that . Moreover, is a rate function in the model . Then, the Boolean valued representations provided in Section 4 and Section 5 give all the elements to show the following by means of a simple manipulation of Boolean truth values.
Theorem 4.
Suppose that is a sequence of Markov kernels and a conditional rate function. Then,
- 1.
- satisfies the cLDP with conditional rate function I iff
- 2.
- satisfies the cLP with conditional rate function I iff
- 3.
- is conditionally exponentially tight iff
The Interpretation of Basic Theorems
By the transfer principle, if is a known theorem, then the assertion is also a theorem. This provides a technology for expanding the content of the already available theorems. In the following, we use this method to expand large deviations results on sequences of probability distributions to new large deviations results on sequences of Markov kernels.
We derive the following version of Varadhan’s large deviations theorem.
Theorem 5.
Let be a sequence of Markov kernels and a conditional rate function. If satisfies the cLDP with conditional rate function I, then satisfies the cLP with conditional rate function I.
Consider the classical Varadhan’s theorem; see, e.g., ([34], Theorem III.13, p. 32). Then, in view of Theorem 4, the statement above is a reformulation of , which is also a theorem due to the transfer principle. Similarly, we have the following version of the Bryc’s large deviations theorem ([11], Theorem 4.4.2).
Theorem 6.
Let be a sequence of Markov kernels such that the limit
exists for all . If is conditionally exponentially tight, then satisfies the cLDP and the cLP with conditional rate function ,
Suppose now that, as in Section 6, is a complete probability space and is a sub--algebra of that contains all the null sets. If , is the sequence of sample means for some conditionally i.i.d. sequence , then, due to (S5) and Proposition 20, is the sequence of a sample means of an i.i.d. sequence in the model . Then, the following conditional version of Cramér’s large deviation theorem follows by a Boolean valued interpretation of its unconditional version ([11], Theorem 2.2.3) bearing in mind Proposition 18.
Theorem 7.
Let be conditionally i.i.d.. Suppose that is a regular conditional distribution of given , for each . Then, satisfies the cLDP with a conditional rate function ,
where .
Obviously, all these theorems are just some examples: we can state a version of any theorem on large deviations theory, and it immediately renders a version for Markov kernels of the form .
We finish this section by pointing out that several results in the literature involving sample means of conditional independent random variables can be easily proven by means of the transfer principle. For limitation of space, we focus on a few of them. Recall the strong law of large numbers, which asserts that the sequence of sample means of a i.i.d. sequence of random variables converges a.s. to the mean of their common distribution. As explained above, the sequence of sample means of a c.i.i.d. sequence of random variables are represented by the sequence of sample means of a i.i.d. sequence of random variables in the model . Then, bearing in mind Proposition 19, the law of large numbers in the model is interpreted as the following known conditional law of large numbers (see ([35], Proposition 2.3)), which holds true due to the transfer principle.
Theorem 8.
Suppose that is a conditionally i.i.d. sequence such that . Then, .
In addition, the conditional versions of the law of large numbers obtained in [36] (see Theorems 3.5 and 4.2) are also interpretations of their classical versions. The same applies to the main result in [37] (see Theorem 2.1), which is a conditional version of the Kolmogorov–Feller weak law of large numbers and follows by the Boolean valued interpretation of its unconditional version.
8. Conclusions
As shown in Section 4, Section 5 and Section 6, all the basic objects in random set theory have a natural Boolean valued representation in the model . Namely, random sets, random functions, Markov kernels, and regular conditional distributions are respectively represented by Borel sets, Borel functions, Borel probability measures, and probability distributions in the model . On the other hand, Boolean valued analysis provides a technology for expanding the content of the already available theorems. Namely, each known theorem involving Borel sets, Borel functions, Borel probability measures, and/or probability distributions automatically has a non-obvious random set analogue involving respectively random sets, random functions, Markov kernels, and/or regular conditional distributions. This is a powerful tool to formalize and prove results in random set theory, and potentially applicable to large deviations, stochastic optimization, and mathematical finance. This is illustrated in Section 7 with the new limits results for Markov kernels easily obtained by means of this method, namely, Theorems 5–7. Nevertheless, these results are just examples of applications, and the number of instances of applications can be easily increased.
Author Contributions
These authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.
Funding
The first author was supported by project MTM2017-86182-P (Government of Spain, AEI/ERDF-FEDER, EU) and by project 20797/PI/18 by Fundación Séneca, ACyT Región de Murcia. The second author was partially supported by the grant Fundación Séneca 20903/PD/18.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Jamneshan, A.; Kupper, M.; Streckfuß, M. Measures and integrals in conditional set theory. Set-Valued Var. Anal. 2018, 26, 947–973. [Google Scholar] [CrossRef]
- Molchanov, I. Theory of Random Sets. Probability and Its Applications, 2nd ed.; Springer: London, UK, 2017. [Google Scholar]
- Avilés, A.; Zapata, J.M. Boolean-valued models as a foundation for locally L0-convex analysis and Conditional set theory. J. Appl. Logics 2018, 5, 389–420. [Google Scholar]
- Drapeau, S.; Jamneshan, A.; Karliczek, M.; Kupper, M. The algebra of conditional sets, and the concepts of conditional topology and compactness. J. Math. Anal. Appl. 2016, 437, 561–589. [Google Scholar] [CrossRef]
- Kusraev, A.G.; Kutateladze, S.S. Boolean Valued Analysis. Mathematics and Its Applications; Springer: Dordrecht, The Netherlands, 2012. [Google Scholar]
- Haier, A.; Molchanov, I. Multivariate risk measures in the non-convex setting. Stat. Risk Model. 2019, 36, 25–35. [Google Scholar] [CrossRef]
- Lepinette, E. Random optimization on random sets. Math. Methods Oper. Res. 2020, 91, 159–173. [Google Scholar] [CrossRef]
- Lepinette, E.; Molchanov, I. Conditional cores and conditional convex hulls of random sets. J. Math. Anal. Appl. 2010, 478, 368–393. [Google Scholar] [CrossRef]
- Molchanov, I.; Cascos, I. Multivariate risk measures: A constructive approach based on selections. Math. Financ. 2016, 26, 867–900. [Google Scholar] [CrossRef]
- Molchanov, I.; Molinari, F. Random Sets in Econometrics; Cambridge University Press: Cambridge, UK, 2018; Volume 60. [Google Scholar]
- Dembo, A.; Zeitouni, O. Large Deviations Techniques and Applications; Springer: Berlin/Heidelberg, Germany, 1998. [Google Scholar]
- Varadhan, S.R.S. Large deviations for random walks in a random environment. Commun. Pure Appl. Math. A J. Issued Courant Inst. Math. Sci. 2003, 56, 1222–1245. [Google Scholar] [CrossRef]
- Guo, T. Relations between some basic results derived from two kinds of topologies for a random locally convex module. J. Funct. Anal. 2010, 258, 3024–3047. [Google Scholar] [CrossRef]
- Föllmer, H.; Schied, A. Stochastic Finance. An Introduction in Discrete Time. In De Gruyter Studies in Mathematics, 3rd ed.; Walter de Gruyter: Berlin, Germany; New York, NY, USA, 2011. [Google Scholar]
- Takeuti, G. Two Applications of Logic to Mathematics; Princeton University Press: Princeton, NJ, USA, 1979. [Google Scholar]
- Zapata, J.M. A Boolean-Valued Models Approach to L0-Convex Analysis, Conditional Risk and Stochastic Control. Ph.D. Thesis, Universidad de Murcia, Murcia, Spain, 2018. Available online: http://hdl.handle.net/10201/59422 (accessed on 1 August 2020).
- Gordon, E.I. Real numbers in Boolean-valued models of set theory, and K-spaces. In Doklady Akademii Nauk; Russian Academy of Sciences: Moscow, Russia, 1977; Volume 237, pp. 773–775. [Google Scholar]
- Kusraev, A.G.; Kutateladze, S.S. The Gordon theorem: Origins and meaning. Vladikavkaz Math. J. 2019, 21, 63–70. [Google Scholar]
- Kabanov, Y.; Safarian, M. Markets with Transaction Costs: Mathematical Theory; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
- Jamneshan, A.; Kupper, M.; Zapata, J.M. Parameter-dependent stochastic optimal control in finite discrete time. J. Optim. Theory Appl. 2020, 186, 644–666. [Google Scholar] [CrossRef]
- Kusraev, A.G. Dominated Operators; Kluwer Academic Publishers: Dordrecht, Switzerland, 2000. [Google Scholar]
- Kusraev, A.G. Vector Duality and Its Applications; Nauka: Novosibirsk, Russia, 1985. [Google Scholar]
- Jamneshan, A.; Zapata, J.M. On compactness in topological L0-modules. Preprint 2017.
- Radchenko, V.N. The Radon-Nikodym theorem for random measures. Ukr. Math. J. 1989, 41, 57–61. [Google Scholar] [CrossRef]
- Cohn, D. Measure Theory, 2nd ed.; Birkhäuser: Basel, Switzerland, 2013; Volume 5. [Google Scholar]
- Billingsley, P. Convergence of Probability Measures; John Wiley & Sons Inc.: New York, NY, USA, 1968. [Google Scholar]
- Parthasarathy, K.R. Probability Measures on Metric Spaces; American Mathematical Soc.: Providence, RI, USA, 2005; Volume 352. [Google Scholar]
- Dupuis, P.; Ellis, R.S. A Weak Convergence Approach to the Theory of Large Deviations; John Wiley & Sons: Hoboken, NJ, USA, 2011; Volume 902. [Google Scholar]
- Gordon, E.I. K-spaces in Boolean-valued models of set theory. Dokl. Akad. Nauk SSSR 1981, 258, 777–780. [Google Scholar]
- Kusraev, A.G.; Kutateladze, S.S. Boolean valued analysis: Selected topics. Vladikavkaz SMI VSC RAS 2014, 1000. [Google Scholar] [CrossRef]
- Fremlin, D.H. Measure Theory Volume 3: Measure Algebras; Torres Fremlin: Colchester, UK, 2002; Volume 25. [Google Scholar]
- Durrett, R. Probability: Theory and Examples, 2nd ed.; Duxbury Press: Belmont, CA, USA, 1996. [Google Scholar]
- Zapata, J.M. A Boolean valued analysis aproach to conditional risk. Vladikavkaz Math. J. 2019, 21, 71–89. [Google Scholar]
- Den Hollander, F. Large Deviations; American Mathematical Soc.: Providence, RI, USA, 2008; Volume 14. [Google Scholar]
- Hess, K.T. Conditional zero-one laws. Theory Probab. Its Appl. 2004, 48, 711–718. [Google Scholar] [CrossRef]
- Majerek, D.; Nowak, W.; Zieba, W. Conditional strong law of large number. Int. J. Pure Appl. Math 2005, 20, 143–156. [Google Scholar]
- Yuan, D.; Hu, X. A conditional version of the extended Kolmogorov–Feller weak law of large numbers. Stat. Probab. Lett. 2015, 97, 99–107. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).