Abstract
This paper investigates the capability of undirected graphs (UGs) to represent a set of Conditional Independence (CI) statements derived from a given probability distribution of a random vector. While it is established that certain axioms can govern this set, providing sufficient conditions for UGs to capture specific CI statements, our focus is on covariance and concentration graphs. These remain the only known families of UGs capable of describing CI statements. We explore the issue of complete representation of CI statements through their corresponding covariance and concentration graphs. Two parameters are defined, one each from the covariance and concentration graphs, to determine the limitations concerning the cardinality of the conditioning subset that the graph can represent. We establish a relationship between these parameters and the cardinality of the separators in each graph, providing a straightforward computational method to evaluate them. In conclusion, we enhance the aforementioned procedure and introduce criteria to ascertain, without additional computations, whether the graphs can fully represent a given set of CI statements. We demonstrate that either the concentration or the covariance graph forms a cycle, and when considered in conjunction, they can represent the entire relation. These criteria also enable us, in specific cases, to deduce the covariance graph from the concentration graph and vice versa.
MSC:
62H05; 05C90; 60E05; 62H20; 05C75
1. Introduction
In the realm of statistics and graphical estimation, conditional independence is fundamental in simplifying complex multivariate relationships and plays a crucial role in the construction and interpretation of graphical models, including Bayesian networks and Markov random fields. These models, which are pivotal in fields such as bioinformatics, epidemiology, and machine learning, leverage conditional independence to efficiently represent variable dependencies, thus enabling a more intuitive understanding of the underlying data structure and facilitating model simplification by reducing the number of parameters required (see [1,2,3]). This approach is particularly valuable in high-dimensional data analysis for identifying relevant variables and constructing sparse models, essential for effective prediction and inference. In the estimation of these models, learning conditional independence relationships directly from data is a key aspect, especially in high-dimensional settings with complex mean structures and random effects, where constraint-based methods are employed to estimate these relationships and form Bayesian networks [4]. The use of Bayesian networks in analyzing gene expression data underscores the connection between these networks and the concept of direct causal influence, characterized by probabilities and conditional independence statements [5]. Furthermore, in epidemiology, Bayesian network modeling is recognized as a well-suited approach for studying messy and highly correlated datasets typical in systems epidemiology, while the application of probabilistic graphical models, such as conditional random fields, hypergraph convolution networks, and large margin models, highlights the prevalence of these methods in learning with graphical structure [6,7,8].
This paper uses undirected graphs as a mathematical framework for representing sets of Conditional Independence (CI) statements. These graphs, commonly called graphical models, have been extensively studied in [9,10,11]. A central question in this field is whether a simple mathematical surrogate, like a graph, can effectively represent most of the CI statements associated with a random vector without requiring additional computations. Given the simplicity and mathematical rigor of graphs (see [12]), they appear to be suitable candidates for this role.
Graphical models are constructed according to specific rules, associating them with a probability distribution. These models are derived from a subset of the CI statements and can subsequently be used to infer additional CI statements within the distribution. To formalize this, consider a finite set V and denote as the pair of two elements in .
Given a random vector indexed by V and associated with a probability distribution P, we can define an undirected graph based on P and a fixed family of subsets of V. Here, V serves as the set of vertices, and the set of edges E is a subset of as defined in Equation (1).
The graph G satisfies the following condition:
Here, is shorthand for the statement “ is independent of given the random vector indexed by S, i.e., ”. When this property (2) is met, we say that P is pairwise Markov to G according to .
This paper focuses on two well-known types of undirected graphical models. The first type is the concentration graph, denoted by , which is constructed from P when the family . The second type is the covariance graph, denoted by , and is constructed from P when the family .
Under certain conditions satisfied by the probability distribution P [10,13,14,15], both types of graphs can be used to infer a wide range of CI statements, although not exhaustively. Specifically, for a triplet of pairwise disjoint subsets of V, if A is separated from B by C in G, then . In this context, P is said to be globally Markov to G.
To formalize, consider the following sets of triplets of pairwise disjoint subsets of V:
and
P is globally Markov to G if and only if .
Similarly, for the covariance graph H, if A and B are separated by in H, then in P (see [15,16,17]). In this case, P is also considered globally Markov to H. Therefore, consider the following set:
P is globally Markov to H if and only if .
In this paper, we aim to identify the families of probability distributions that the concentration and covariance graphs can fully represent. The central question is to ascertain the conditions on P such that
where G and H are the concentration and covariance graphs associated with P, respectively.
It has been previously established in [18] that when the CI statements are fully captured by the concentration graph, i.e., , the covariance graph must be a union of complete connected graphs. In such cases, P is said to be faithful to its concentration graph. Conversely, using a dual argument, it was shown that if the covariance graph can fully represent the CI statements, i.e., , then the concentration graph must also be a union of complete connected graphs. Under these conditions, P is faithful to its covariance graph (see [19]).
In [19], the authors provided a sufficient condition for P to be faithful to its concentration graph. They demonstrated that if all connected components of a concentration graph are trees and P satisfies certain conditions, then the graph can fully represent the CI statements. This result holds when all connected components of the covariance graph are also trees. In the Gaussian case, it was further established in [20] and generalized in [17] that these graphs represent all sets of CI statements.
In this paper, we investigate whether the concentration and covariance graphs, when considered together, can fully describe a set of Conditional Independence (CI) statements. We introduce the term CC-faithful to describe a probability distribution P that can be completely represented by these graphs when considered in tandem. While it is unlikely that this holds universally, we aim to identify necessary conditions for P to be CC-faithful.
Our first main result reveals that the conditions depend on the size of the separators in both the covariance and concentration graphs. We then proceed to identify families of probability distributions that are CC-faithful, working within a general framework that utilizes the set of relations previously defined in [14,21].
We show that the study of CI statements can be restricted to those of the form , where and . These CI statements, denoted by , are termed couples. We then consider the set of such couples as a relation on V.
We demonstrate that when a given relation satisfies an axiom known as the pseudographoid axiom, a global Markov property holds. This implies that the set of relations constructed from the covariance and concentration graphs are subsets of . We also establish a one-to-one mapping between the covariance and concentration graphs, simplifying many proofs. For instance, we show that less restrictive conditions are needed for the global Markov property on the covariance graph, contrary to the assumptions made in [17].
In the final section, we introduce a graphical criterion for identifying new dependencies from the covariance and/or concentration graphs, extending the work in [17]. We provide examples of CC-faithful probability distributions, showing that when either the covariance or concentration graph forms a cycle, P is CC-faithful.
The paper is organized as follows: Section 2 introduces the notations and is divided into two subsections focusing on basic graph theory and properties of relations, respectively. Section 3 discusses how relations derived from the concentration and covariance graphs are subsets of the originating relation, provided they satisfy the pseudographoid axiom. Section 4 introduces separatoids, a generalization of separators. Section 5 examines the role of separator size in the ability of the graphs to describe CI statements. Finally, Section 6 presents a graphical criterion for reading dependencies and provides examples of CC-faithful distributions.
2. Preliminaries
In this section, we delineate the essential notations utilized throughout this paper. It is systematically segmented into two subsections: the first focuses on the characteristics and nuances of undirected graphs, while the second meticulously details the assembly of relations, representing a comprehensive method to articulate the set of Conditional Independence (CI) statements within a specified probability distribution.
2.1. Undirected Graphs
An undirected graph consists of sets V representing the set of vertices and where is the set defined in (1).
For , we write when and we say that u and v are adjacent in G.
Let us first recall some basic definitions of undirected graphs.
- A path connecting two distinct vertices u and v in a graph G is a sequence of distinct vertices , where and and, for each , . Such a path is denoted by , establishing that connects and . Alternatively, we can state that and are connected by . The length of this path, denoted by , is defined as the number of edges connecting the vertices within p, . Moreover, we denote by the set of all possible paths between u and v. Furthermore, we define two paths and in to be disjoint if their intersection is , clearly indicating that u and v are the sole common vertices between and .
- A subgraph of G induced graph by a subset is denoted by , and
- A connected component of G is the largest subgraph of G such that each pair of vertices can be connected by at least one path in . Equivalently we can define to be a connected component of G if and only if satisfies the following two conditions
- i.
- ∀, .
- ii.
- ∀, .
It is straightforward to see that the two definitions above are equivalent. - For a connected graph, a separator is a subset S of V such that there exists a pair of non-adjacent vertices u and v such that andIn that case, we write . It is also easily verified that every such that is also a separator. We are thus led to the notion of minimal separator. The separator S is defined to be a minimal separator between two non-adjacent vertices u and v if for any , the subsets is not a separator of u and v.
- For any triplet of pairwise disjoint subsets of V, we say that S separates A and B in G and we denote by if for any pair of vertices we have .
2.2. The Set of Relations
Let V be a finite set. We don’t distinguish in the next of this paper between singletons and elements and we denote by a union of two subsets A and B of V. We call a couple an object where u and v are two distincts elements of V and . We denote by the set of all the couples
where .
Any subset of will be called a relation, and it will be generally denoted by . We associate a given relation the set the of subsets of V defined as follows
Let us now give three examples of well-known relations. We will see later that these relations satisfy the pseudographoid axiom.
Example 1.
Let be a random vector with a probability distribution P. Let us define the relation as follows
where denotes that the variables and are independent given the sub-random vector indexed by S.
Example 2.
Let be an undirected graph and where we denote by when the set separates u and v in G.
Let us notice the following remarks
- i.
- G is complete if and only if .
- ii.
- G is connected if and only if , where is the set associated with according to (6).
- iii.
- G has n connected components then where for the graph is the subgraph induced from on G and is the relation defined as follows
Example 3.
Let us consider Σ a positive definite matrix, i.e., . We denote by the sub-matrix of Σ with rows in and columns in . We denote then the following relation
We also know that from any probability distribution P we can define two undirected graphs called covariance and concentration graphs. We will show in the next section that the set of relations constructed from these two graphs is included in the set of relations composed from the probability distribution. We will prove this result remains true for any relation satisfying an axiom called the pseudographoid relation. This fact is known for concentration graphs (see [14]). However, by using a dual relationship between the covariance and the concentration graph, we show that this inclusion can also be satisfied by covariance graphs, unlike what is proved in [17] where the author assumes a further axiom called the weak transitivity axiom.
3. Markov Relations
Let us denote by the set of relations defined on V and we define on the application such that the relation is defined as follows
The relation will be called the dual of . It is easily seen that for any we have , i.e., . In this section and in the next of this paper, we say that satisfies the pseudographoid axiom if the following axiom is satisfied: for any and the following axiom
It was proved in [21] that all the relations , and defined in the Examples 1–3 satisfy the pseudographoid axiom (11).
Lemma 1.
Let and be two relations in . The following assertions are then satisfied
- i.
- If is satisfying (11) then is a pseudographoid too.
- ii.
- If then .
Proof.
The proof of (i) can be found in [21] (see Lemma 2).
Let us now prove (ii). We have
□
Let us now show how to associate to any relation and a fixed set of subsets of V, i.e.,
We will see later two kinds of such graphs constructed using pairwise relationships but their role will be larger. Indeed, they will help us determine all the elements of from reading separation statements on those graphs. Let us consider the following application such that if then
We denote here by the power set of the power set of V, i.e.,
Let us fix and consider the following family of subsets of V, , i.e.,
and we consider then the following pair of undirected graphs defined as follows
We are now interested in the special case where
- Case 1
- where we say that G is the concentration graph associated with .
- Case 2
- where We say that H is the covariance graph associated with .
Instead of considering the application we will rather consider the following two applications and and defined as follows
The applications and will be called respectively that concentration and the covariance map. It is easily seen that and . It is easily seen then that G can be viewed as the covariance graph of the dual of , i.e., and vis versa for the graph H.
This section aims to show that a part of the element of a given relation can be described using the concentration and the covariance graphs. The theorem that will be proved below is a generalization of a result well known as the global Markov property to a distribution probability to its covariance or concentration graph.
Theorem 1.
Let a pseudographoid. Let and be respectively the concentration and the covariance graphs associated with . Then if is a pseudographoid then
and
Before we begin the proof of Theorem 1, it is important to note that the equalities and may occur. These are related to the concept of faithfulness of the graphical structure to the pseudographoid structure. The definition of faithfulness will be discussed later in Definition 3.
Proof.
Let then , i.e., and let us show that . Let us prove this by backward induction on . If there is nothing to prove.
Assume now that and that relations in are contained in when .
Since , let . Hence and by induction we can say that . Further more since then either or . Otherwise, a path exists between u and w that does not intersect , and another one between v and w that does not intersect . Hence we can construct a path between u and v that contains w and does not intersect S. It is a contradiction with .
Let us then assume that . Hence . Since is a pseudographoid we can deduce from that .
To prove (17) it will be enough to apply (16) to the relation . According to Lemma 1 the relation satisfyies the pseudographoid axiom (see (11)). Then
□
We can deduce then from Theorem 1 that if is a pseudographoid the following inclusion holds
Since it is well known that the relation defined from a given probability distribution P as in Example 1 is also satisfying the pseudographoid axiom (11). Let us then define the CC-faithful probability distributions.
Definition 1.
Let P be a probability distribution of a random vector and let
- i.
- be the relation associated to P according to (7)
- ii.
- be the concentration associated with P.
- iii.
- be the covariance associated with P.
We say that P is a CC-faithful if .
Our main question now for the next of this paper is to determine which conditions should be satisfied by the undirected graph G and/or H to obtain the equality in (18)?
4. Separatoids on
Let be a non empty relation in and let us denote by the subset of pairs belonging in such that and there exists at least one such that . Let us consider the following definition
Definition 2.
Let u and v be two distincts elements of V. Let S and T be two subsets of such that and in . We say that
- i.
- S is a minimum separatoid between u and v if either or , .
- ii.
- T is a maximum separatoid between u and v if either or , .
Let us then denote by and by be respectively the set of all the minimal and maximal separatoids defined between u and v. We define the following two parameters
and
Lemma 2.
Let and let and and be the set of minimal separatoids associated respectively to and . Then
Furthermore, we have
Proof.
Let and let us prove that It is equivalent to show the following statements
- i.
- ii.
- either or ,
Since , then . Hence . Then (i) is proved.
Let us show (ii). If , then . Let we know that . Hence since . Hence, (ii) is completely proved.
We can use the same argument to show the other way in the equivalence (21).
Let us now prove the equality (22). Let and . let us show that .
Let , then . Since , then . Hence . Then and .
Let now , then . Since , then . Hence . Then and .
We deduce then that . □
Let us consider the case of a relation a type where G is an undirected graph. Since belongs to means that S separates u and v in G then for any we have also . Let us recall that if S separates u and v in G, then any subset containing S is also a separator of u and v in G. Then it is easily seen that the set and consequently by applying Lemma 2 we deduce that , and .
We have then proved the lemma below.
Lemma 3.
Let G be an undirected graph and let be the relation associated with G according to (2). Then , , and .
Let us define then the separability of an undirected graph.
Definition 3.
Let be an undirected graph. If G is not complete, we define for any non adjacent pair of vertices in G the separability order of as
we define the separability of G by
if G is not-complete and equal to is G is complete.
In the following section, we will explore how understanding the separability order of the covariance graph aids in determining the capacity of the concentration graph to represent the elements of and vice versa. We will revisit a finding from a previous paper (see [18]), which elucidates the effects on the covariance graph when the concentration graph fully represents This was initially demonstrated for concentration and covariance graphs under specific conditions in [18]. Here, we expand upon this result under much broader conditions.
However, let’s make some remarks about the parameters introduced in Definition 3. Indeed, this parameter was defined and used in [22]. But determining the value of for a given non-adjacent vertices are known to be called Menger’s problem (see [23]), and it was proved in many references (see [24,25,26]), and many others) this number equals the maximal number of disjoint paths between u and v.
Hence if u and v are note belonging to the same connected component of G, then . Furthermore, if and only if G comprises complete connected components and if if and only if G is complete.
5. Ability of the Concentration and the Covariance Graphs to Represent
In this section, we are interested in showing how much-undirected graphs can describe a pseudographoid relation . We have already seen in Theorem 1 that the concentration graph and the covariance graph are both of them able to determine elements of by reading separation statements on G and H. Our main question in this section is which of the elements of can be missed by G and H. We will show that the couples that G can miss having necessary an S with a cardinality smaller than a number depending on the separability order of H. By using a “dual technique”, we show that the couples that H can miss having necessary an S with a cardinality smaller than a number depending on the separability order of G.
Before giving the proof of the main result of this section, we would like to introduce some notations. Let us recall that by the construction of G, the following equivalence is satisfied for any pair of non-adjacent vertices in G
Hence if we denote for any an application such that for any relation
From the equivalence in (25) it can be easily deduced that . Let us thus define the integer as the smallest integer such that
is satisfied. Hence we deduce that when is known, the undirected graph G allows us to determine all the couples of where .
Similarly and by construction of H, the following equivalence is satisfied for any pair of non-adjacent vertices in G
Hence the equivalence in (28) can be written . We can also define the integer as the largest integer such that
Hence we deduce that when is known, the undirected graph H allows us to determine the couples of where .
The two lemma below help us to give a relationship between and .
Lemma 4.
Let be a relation in and an application defined as in (26) where . Then
Proof.
Let ,
□
Lemma 5.
Let a pseudographoid and let G and H be, respectively the concentration and the covariance graph associated to . Then
Proof.
Let , then
But according to Lemma 4 and to the fact that G is the covariance graph associated with . Then
Hence if we denote and, we have
Similarly if we consider and let us write which is the covariance graph associated to . Then
But and
Hence (32) becomes
This implies that ⟺ . Since is the largest integer. Then
Let us then prove the main result of this section.
Theorem 2.
Let be a pseudographoid. Let and be respectively the concentration and the covariance graph associated with . Let , , and . Assume that G and H are connected. Then, the following two inequalities hold
Before starting the proof of Theorem 2, we must first prove the following lemma.
Lemma 6.
Let be a pseudographoid. Let and be the concentration and the covariance graph associated with . Let and . Assume that
then and for any there exists such that and .
Proof.
Let us assume that (36) holds. Since is an integer that belongs to then (36) implies that H is necessarily not complete and the set of pairs on non-adjacent vertices in H is not empty. Let be a pair of non-adjacent vertices in H, i.e., . According to (36) there exists a subset such that and .
We can also write S as follows
where . By applying the Markov property (17) in Theorem 1, we deduce that . Secondly as and , then and by using the definition of we deduce that . Hence . So and then .
We can deduce then that .
We have also proved that for any non-adjacent pair of vertices in H we can find a subset such that and . □
Lemma 7.
Let be a pseudographoid. Let and be respectively the concentration and the covariance graph associated with . Let and . Assume that
then and for any there exists such that and .
Proof.
We will show that the proof of Lemma 7 can be obtained by Lemma 6 to . Let us first assume that (36) holds.
Let us remaind that . By using also Lemma 5 we have . Hence
Hence (36) holds for . By applying Lemma 6 for we deduce that is an undirected graph where the set of edges F is included in E the set of edges of . The remaining conclusion of the lemma and can be obtained obviously. □
Proof of Theorem 2.
Note that if G and/or H are complete, there is nothing to prove since in that case, the separability order is equal to , and the inequalities remain satisfied in that case.
Assume that (34) is true for any relation . Since for any pseudographoid , the dual of which is is also a pseudographoid and Hence it satisfies (34):
Bu using Lemma 5 we deduce that
Let us now assume that (34) does not hold. Then by equivalence (35) does not hold. We can then assume that (36) and (38). Note that once (36) is satisfied by equivalence (38) is also satisfied. These former inequalities are the principal hypothesis in Lemmas 6 and 7. Let us then apply these lemmas. We deduce that and for any non-adjacent pair of vertices in G there exists where and are in and where neither S is empty nor .
Hence we can find four vertices and in G such that where , and . The vertices and are also not adjacent in G. Otherwise, a path exists between and which is that does not intersect S. Similarly, we can say that and are not adjacent in G. Hence, is a pair of non-adjacent vertices in G such that is a path connecting these two vertices. Hence any subset T separating and contains . However, it will not be possible to to separate and in G. We obtain then a contradiction. □
The second part now of this section is devoted to device an algorithm allowing us to compute the parameters and for a giving It will then be an algorithm that may help us to verify if a given is at the same satisfying the pseudographoid axiom.
The following result concerns relations that satisfy the following two axioms:
- the semigraphoid axiom: for any triplet of distinct elements of V and such that
- the weak transitive axiom: for any triplet of distinct elements of V and such that
Lemma 8.
Let be a relation satisfying the axioms (11), (42) and (41). Let . Let . Then the following assertions hold
- i.
- Assume that and . Then
- ii.
- Assume that and . Then
Proof.
Let us start by proving (i). The left-to-right of the equivalence (43) is indeed obvious. Let us now show the right-to-left way. Hence assume that there exists such that . Since and then
Since satisfies the weak transitivity axiom (42) we deduce that
Assume that . Since and by applying (41) we deduce that . Since also that we deduce that .
As and then by applying (11) we have .
Let us now prove (ii). We will indeed apply (i) to the relation , which also satisfies the axioms (11), (42) and (41), and to its concentration graph which is H.
Since the equality and by applying Lemma 4 we obtain
Since . Furthermore the fact that is equivalent to . Hence, (ii) is proved. □
Lemma 9.
Let be a relation satisfying the axioms (11), (42) and (41). Let μ and ν are parameters associated with . Let .
- i.
- Assume that . Assumes that and let where , then
- ii.
- Assume that . Assumes that and let where , then
Proof of Lemma 9.
First, we prove (ii) by induction on .
If , then and where . By applying (44) in Lemma 8 we deduce that
Let us then apply (44) since the hypothesis of Lemma 8 are still satisfied:
Since and the hypothesis of Lemma 8 are still satisfied. Hence, by applying the induction hypothesis, we can deduce that (49) is still satisfied for . Hence, (ii) is proved.
The proof of (ii) is easily deduced from . It is obtained by applying (i) to G, which is the covariance associated with . □
6. Graphical Criteria for Not Belonging to
In this section, we give a graphical criterion from either the concentration and/or the covariance graphs that can be used to determine now which of the couples in could not be in the relation . In other terms, we are giving here the reciprocal way of the global Markov property. If is, for example, a relation as the one associated with a random vector and defined in Example 1 this criteria will allow us to read now dependencies statements between a triplet of sub-random vectors. This criterion was also used in [17] but to read dependencies only from covariance graphs.
Definition 4.
Let be a pair of vertices in an undirected graph G and . We say that u and v are -connected given S if there exists exactly n paths in pairwise disjoints, i.e., such that . We denote then that .
The lemma below is needed for the proof of the main result in this paper.
Lemma 11.
Let be a pseudographoid. Let . Let us denote by the relation deduced from as following
Let us denote by H and be the covariance graph associated respectively with and . Then where is the subgraph of H induced on U.
Proof.
We know that the graph is defined as follows
Hence, the lemma is proved. □
The following statements are all the possible contrapositive of the semigrpahoid axiom. Some of them will be needed in the proof of the main result. If is a semigraphoid then it satisfies the following statements
We now define another family of relations: the graphoids.
Definition 5.
A relation is called a graphoid if it is at the same a pseudographoid and a semigraphoid.
As we see in the prove above, we will rather use the contrepositive of the weak transitive axiom defined (42).
Here is now the last lemma before giving the proof of the main result of this section.
Lemma 12.
Let be a graphoid. Let H be the covariance associated with . Let and . Assume that the following two hypotheses are satisfied
- A.
- i/, any path between w and v contains u and ii/. Then
- B.
- i/, any path between w and v contains u and ii/. Then
Proof.
Let us prove (A).
Let us first deduce from (i) that the vertices v and w can not be adjacent in H then . We can also deduce from the condition (i) that .
Secondly, according to (ii), we can deduce that the subset contains u since and this later subset intersects any path between v and w. Then by using Theorem 1 we deduce that .
Let prove Lemma 12 by induction on .
Let us prove the lemma for . Let us note that and . Hence by applying (50) we deduce that .
Let us assume that the lemma is true for subsets S with cardinality n. Let us prove it for . Hence assume that . Using the induction hypothesis for all , . Assume that . Since . Then by using the induction hypothesis we have . Since we have already proved that , then by using (50) we deduce that . Hence .
The part (B) of the lemma is deduced by applying (A) to and G which is the covariance graph associated with . □
Let us now give the criteria needed to determine if a couple can not be in a relation . Considering the relations that are derived from probability distributions, we can then be able to determine which relations do not correspond to Conditional Independence statements.
Theorem 3.
Let be a relation satisfying the axioms (11), (41) and (42). Let G and H be the concentration and the covariance graph associated with . Then
- i.
- if then
- ii.
- if then
Proof of Theorem 3.
Let us first prove (i).
Since then there exists a unique path between u and v such that . Let us prove (i) by induction on the length of p.
If then . We have to prove that
Let us now prove (55) by induction on .
When , . Hence, two cases are possible:
Let us consider the case where . Note that otherwise it exists another path between u and v different from the path-edge . Hence
Let us then apply (50) in the case where then . Then the statement is proved when .
Let us assume that when and satisfies the hypothesis of Theorem 3. Let us assume now that . Let us denote by the induced graph of H on . Let us then define the following two subsets of S:
and
First we claim that and that .
The second statement in our claim is obvious, i.e., . let us show that .
Let . If and then there exists and , both of them are included in and which is the union of and is a path connecting u and v different from the edge path . We obtain a contradiction with the fact that a single path exists between u and v in H.
Since then either or .
Let us assume then that . Since for any , , then for all we have . However . Hence by using Lemma 12 we deduce that
Similarly if we assume that we deduce that .
Let . It is easily seen that is a set with cardinality n and satisfying the hypothesis of Theorem 3. Hence, by using the induction hypothesis, we deduce that . Assume that . Since there is no path between w and v that does not intersect u then contains u. Hence any path between w and v intersect in u. Then We have shown then that
Hence by using (50) we deduce that since .
Let us assume that the result remains true when p is a path with length n. Assume now that . Let then . The vertex as . As
Hence . Since p is the unique path between u and v then
Since and then by applying the induction hypothesis we deduce that and .
As is a relation satisfying the weak transitivity axiom (42), then by applying (54) we deduce that .
Finally, (ii) is deduced by applying (i) to . □
Let us now consider the following axiom
where and . We will say that a relation is a gaussoid if it is relation satisfying (11), (41), (42) and (57).
Let us now show there is also another criterion as the one shown in Theorem 3 that can help us to determine elements of .
Theorem 4.
Let be a gaussoid. Let in V and . Let us assume that there exists such that and let p be the single path between v and w Then
Proof.
Let us write the proof by induction .
Let then and . We have then to prove (58).
Since any path between w and u should contain v and does not contain v. Since p is the only path between v and w. Then any path between w and v intersects in v. Then, by using Theorem 1 we deduce that .
Let us assume that and since we apply (50) to we deduce that .
If we assume now that and since is a gaussoid then by applying (57) we deduce that . Hence the equivalence (58) is proved.
Let us assume that Theorem 4 is true when p is with length equal to n, i.e., : there are n edges in the path p. Assume now that .
However let us note that . By induction hypothesis, any path between w and u should contain . Then seprates u and v in . Hence by applying Theorem 1 and Lemma 11, we deduce that .
Let us assume that and since , we apply (50) to and we deduce that .
If we assume now that and since is a gaussoid then by applying (57) we deduce that .
Hence we have shown that
Hence by using (59) we deduce also that
We can also note that . Then this former set separates u and w in . Hence .
Let us assume that and since we can apply (50) to we deduce then that .
Corollary 1.
Let be a gaussoid and let G and H be respectively the concentration and the covariance graph associated with . Let , , and . Then
- i.
- if H is a cycle, then , , and
- ii.
- if G is a cycle, then , , and .
Proof.
Let us prove (i). Assume that H is a cycle. Let u and v two vertices in H and such and let us show that
Firstly that note that if then the cardinality of . Then .
Let us assume, to the contrary, that . Then, there exists a path between u and v that does not intersect . Given that H is a cycle, this path p is unique and is contained within . Consequently, . According to Theorem 3, this implies that , leading to a contradiction. Since is defined as the largest integer for which , it follows that .
Let us denote , given that H is connected. It is important to note that , as H is a connected graph. This means that we cannot find edges that an empty set can separate.
Since (as stated in Theorem 2), and we have proven that , it follows that . Consequently, if G is not a complete graph, then all separators in G have a cardinality equal to . Therefore, for all , , and can include pairs where . To establish this, it suffices to select u and v as non-adjacent in H. By applying Theorem 3, we can deduce that must belong to .
Let us now prove that . Consider . If , then . Given that , it follows that for all , . Furthermore, as demonstrated in the first part of this proof, if is valid, then . Consequently, we conclude that . □
7. A Discussion about the Gieger and Pearl Conjecture
Geiger and Pearl conjectured that for any undirected graph GG, it is possible to construct a probability distribution PP such that
(see [10,27]). This conjecture is particularly relevant in the context of concentration graphical models, where the faithfulness assumption plays a pivotal role. The concept of faithfulness in these models, as defined in relation to Equation (61), is critical, especially when considering the PC algorithms (see [18]). In faithful graphical models, the 0-1 graph is expected to include, at a minimum, all the edges present in the full conditional independence graph, also known as the concentration graph (see [28]). This relationship underscores the importance of the faithfulness assumption in accurately representing and analyzing the underlying probabilistic structures in graphical models.
It was also proved in [21] that from any undirected graph and for a fixed non-negative “sufficiently close de zero” we can construct a real positive definite matrix as follows
and they showed (see Corollaries 2 & 3 in [21]) that . Hence if we consider a random vector with Gaussian distribution having as a covariance matrix the undirected graph H is then the covariance graph associated with P the probability distribution of . According to Theorem 2, the separability order of the concentration graph G associated with P is a complete graph, and then
On the other hand let now consider an undirected graph G and let us consider where is constructed as in (62) and let us consider the random vector as a Gaussian vector with covariance matrix . According to the construction of , the undirected graph G is the concentration graph associated to Q, the probability distribution of . Since , then . We have applied here Lemma 1 in [21]). By applying also 2 the covariance graph H associated to Q is complete and hence
We think then that Geiger and Pearl’s conjecture is completely proved.
8. Conclusions
In conclusion, this paper has comprehensively examined the role of undirected graphs in representing sets of Conditional Independence (CI) statements. Through rigorous exploration of properties and implications, we have established that specific axioms govern these sets, providing a mathematical framework for their representation. Our investigation has focused on covariance and concentration graphs, the only known families of undirected graphs capable of comprehensively describing CI statements. We have introduced parameters to assess the limitations of these graphs and provided a computational method for their evaluation. Additionally, we have offered criteria to ascertain the complete representation of CI statements through their corresponding graphs. These findings advance the theoretical understanding of graph theory in statistical contexts and offer practical tools for researchers in related fields. Future work could extend these insights to more complex graph structures or apply them in empirical network data studies. Overall, this research significantly enhances our understanding of the capabilities and limitations of undirected graphs in representing CI statements, making a substantial contribution to the field of graphical models.
9. Notations
- V is a finite set
- is the cardinality of V
- E is a subset of
- is the couple in
- is an undirected graph where V is the set of vertices and E is the set of edges that satisfies ⟺
- is a random vector with values in
- is sub-random-vector of
- means that
- and S disjoint subsets of V, means that
- is a couple
- A set of couples is called a relation
- If is an undirect graph means that is an edge in E.
Funding
This research received no external funding.
Data Availability Statement
No data was used for this research.
Conflicts of Interest
The authors declare no conflict of interest.
References
- De Morais Andrade, P.; Stern, J.M.; De Bragança Pereira, C.A. Bayesian Test of Significance for Conditional Independence: The Multinomial Model. Entropy 2014, 16, 1376–1395. [Google Scholar] [CrossRef]
- Markowetz, F.; Spang, R. Inferring Cellular Networks—A Review. BMC Bioinform. 2007, 8, S5. [Google Scholar] [CrossRef] [PubMed]
- Shen, X.; Zhu, S.; Zhang, J.; Hu, S.; Chen, Z. Reframed GES with a Neural Conditional Dependence Measure. arXiv 2022, arXiv:2206.08531. [Google Scholar]
- Kasza, J.; Glonek, G.; Solomon, P.J. Estimating Bayesian Networks for High-dimensional Data With Complex Mean Structure and Random Effects. Aust. New Zealand J. Stat. 2012, 54, 169–187. [Google Scholar] [CrossRef]
- Friedman, N.; Linial, M.; Nachman, I.; Pe’er, D. Using Bayesian Networks to Analyze Expression Data. J. Comput. Biol. 2000, 7, 601–620. [Google Scholar] [CrossRef]
- Kratzer, G.; Furrer, R. Information-Theoretic Scoring Rules to Learn Additive Bayesian Network Applied to Epidemiology. arXiv 2018, arXiv:1808.01126. [Google Scholar]
- Fathony, R.; Rezaei, A.; Bashiri, M.A.; Zhang, X.; Ziebart, B.D. Distributionally Robust Graphical Models. arXiv 2018, arXiv:1811.02728. [Google Scholar]
- Xie, Y.; Li, S.; Wu, C.; Lai, Z.; Su, M. A novel hypergraph convolution network for wafer defect patterns identification based on an unbalanced dataset. J. Intell. Manuf. 2022, 1–14. [Google Scholar] [CrossRef]
- Whittaker, J. Graphical Models in Applied Multivariate Statistics; Wiley: Hoboken, NJ, USA, 1990. [Google Scholar]
- Lauritzen, S.L. Graphical Models; Oxford University Press: New York, NY, USA, 1996. [Google Scholar]
- Studenỳ, M. Probabilistic Conditional Independence Structures; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
- Diestel, R. Graph Theory, 3rd ed.; Graduate Texts in Mathematics; Springer: Berlin/Heidelberg, Germany, 2005; Volume 173. [Google Scholar]
- Pear, J.; Paz, A. Graphoids: A graph based logic for reasoning about relevancy relations. Adv. Artif. Intell. 1987, 2, 357–363. [Google Scholar]
- Matúš, F. On equivalence of Markov proerties over undirected graphs. J. Appl. Probab. 1992, 29, 745–749. [Google Scholar] [CrossRef]
- Kauermann, G. On a dualization of graphical Gaussian models. Scand. J. Stat. 1996, 23, 105–116. [Google Scholar]
- Banerjee, M.; Richardson, T. On a Dualization of Graphical Gaussian Models: A Correction Note. Scand. J. Stat. 2003, 30, 817–820. [Google Scholar] [CrossRef]
- Peña, J.M.; Nilsson, R.; Björkegren, J.; Tegnér, J. An Algorithm for Reading Dependencies from the Minimal Undirected Independence Map of a Graphoid that Satisfies Weak Transitivity. J. Mach. Learn. 2009, 10, 1071–1094. [Google Scholar]
- Malouche, D. Implication of faithfulness assumption. Sankhyā B Indian J. Stat. 2022, 84, 495–515. [Google Scholar]
- Becker, A.; Geiger, D.; Meek, C. Perfect Tree-like Markovian Distributions. Probab. Math. Stat. 2005, 25, 231–239. [Google Scholar]
- Malouche, D.; Rajaratnam, B. Gaussian covariance faithful markov trees. J. Probab. Stat. 2011, 2011, 152942. [Google Scholar] [CrossRef]
- Lňenička, R.; Matúš, F. On Gaussian Conditional Independence Structures. Kybernetika 2007, 43, 327–342. [Google Scholar]
- Malouche, D. Determining full conditional independence by low order conditioning. Bernoulli J. 2009, 15, 1179–1189. [Google Scholar] [CrossRef]
- Menger, K. Zur allgemeinen kurventheorie. Fund. Math. 1972, 10, 96–115. [Google Scholar] [CrossRef]
- Lovász, L. A remark on Menger’s theorem. Acta Math. Acad. Sci. Hung. 1970, 21, 365–368. [Google Scholar] [CrossRef]
- O’Neil, P.V. A new proof of Menger’s theorem. J. Graph Theory 1978, 2, 257–259. [Google Scholar] [CrossRef]
- McCuaig, W. A simple proof of Menger’s theorem. J. Graph Theory 1984, 8, 427–429. [Google Scholar] [CrossRef]
- Geiger, D.; Pearl, J. Logical and algorithmic properties of conditional independence and graphical models. Ann. Stat. 1993, 21, 2001–2021. [Google Scholar] [CrossRef]
- Wille, A.; Bühlman, P. Low-Order Conditional Independence Graphs for Inferring Genetic Network. Stat. Appl. Genet. Mol. Biol. 2006, 5, 1–32. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).