1. Introduction
This work is concerned mostly with the question of the existence of bivariate copulas with square integrable densities that are idempotent. The requirement is to identify Markov chains that have a stationary joint distribution for any two of its variables. For such Markov chains, any missing observation can be ignored without consequence in the theoretical analysis, as
has the same distribution as
for any
. To answer this question, we wrap it into the larger question involving the study of positive definite kernel operators. This is possible because any copula induces a kernel operator that is a Hilbert–Schmidt operator when the density of the copula is symmetric and square integrable. Several new copula constructions are obtained, which can be identified as perturbations of the independence copula as defined in Longla (2022) [
1], Longla (2022b) [
2] or Komornik et al. (2017) [
3].
The term copula is used in this paper in place of “bivariate copula”. When needed, the term “n-copula” is used to indicate a copula with more than two variables. A copula is a function
C defined on
such that
,
, and for any
,
,
(see Nelsen (2006) [
4] or Durante and Sempi (2016) [
5]).
Many authors have worked on building copulas with various properties over the past decades. Some constructions can be found in Nelsen (2006) [
4], Longla et al. (2022) [
1], Longla et al. (2022b) [
2], and the references therein. Some more recent constructions can be found in Chesneau (2021) [
6] and (2023) [
7], where trigonometric copulas were studied and some extensions of the FGM copula were constructed. These constructions are quite far from the main considerations of this paper, which is about mixing properties, Markov chains, and exchangeability. The copulas constructed here do not in any form represent a copy or an extension of their work.
One of the main points of this paper is that the constructed copula families have a mixing structure that is known; the copula of any two variables along the Markov chain that they generate belongs to the copula family of interest, and it is possible to explicitly find the joint distribution of any two variables of the chain. This makes statistical analysis easier and often avoids computational issues around approximations of the variances of estimators. One extension includes the Farlie–Gumbel–Morgenstern copula family. These families are cases of Type I Longla copulas, introduced in Longla (2024) [
8].
A central role in this investigation is played by the fold product of copulas, defined in Darsow et al. (1992) [
9] by
for
where
is the derivative of the copula with respect to its
variable. Recall that the fold product as defined for
is the joint distribution of
if
is a stationary Markov chain with copula
C and the uniform distribution as the marginal distribution (see Darsow et al. (1992) [
9]). Thus, for any copula
C, the set of
is a closed class of copulas under the fold product (meaning that the fold product of any two elements of this set is an element of the set). For most of the copula families considered in this work, this set has a single accumulation point
defined by
, that is, the independence copula. When this is the limit of powers of a copula, it implies that observations of the Markov chain in the long run tend to be independent from the first observation. It is important to point out some examples of singular idempotent copulas or in general idempotent copulas that are not fully defined by their absolutely continuous parts. We have
and
where
and
. The copula
M is that of perfect positive correlation, while
W is that of perfect negative correlation (see Nelsen (2006) [
4]).
Note that by Sklar’s Theorem (see Sklar 1959 [
10]), a copula can be obtained from any bivariate distribution
H by scaling out the effect of marginal distributions
and
by the inversion formula
. This copula is not unique when the variables are not continuous. When variables are continuous, there is a one-to-one relationship between joint distributions and the set of copulas and marginal distributions. This means that for continuous random variables, any set of copula and marginal distributions is equivalent to a joint distribution function.
In general, the copula
is the joint distribution of
when
is a stationary Markov chain generated by
C and the Uniform(0,1) distribution. These notions are important for the study of mixing properties as well as association and asymptotic distributions of averages of functions of Markov chains (see Longla et al. (2022) [
1] and Longla (2024) [
8]).
Another notion mentioned earlier is that of idempotent Markov chain. An idempotent Markov chain has copula C such that for . Such copulas are called idempotent copulas. For the purposes of this work, we impose symmetry and square integrability. As will be seen later, the independence copula is the limit of for all copulas of the form with . In this paper, we also show that there exists only one class of absolutely continuous idempotent copulas with square integrable density.
The idempotent copulas analyzed here have been topic of research for many scholars, among whom Darsow and Olsen (2010) [
11] provided a characterization of idempotent 2-copulas through the study of invariant sets. They additionally provided general conditions and constructed some examples. Among other things, they showed that the class of idempotent copulas is a lattice under the partial ordering
if
. Darsow and Olsen (2010) [
11] showed that for any copula
C there exists an idempotent copula
which annihilates
C (meaning that
) such that
for any other idempotent annihilator
of
. For the idempotent copulas constructed in this work, it turns out that
, which provides a way of characterizing idempotent copulas. For any other copula defined by Formula (
3) that is not idempotent,
, where
is
Note that when exists and is , it holds that . However, it is sometimes the case that exists when does not exist, as for the copula defined later in this paper. Because is either or depending on parity, .
Zabell (1995) [
12] provided a characterization of probability mass functions for finite or countable state-space Markov chains. We have not found a similar characterization for continuous state-space Markov chains. Among other things, the present work fills this gap. For the special case of square integrable densities of copulas, we use the existence of eigenvalues to provide a different view of idempotent copulas while studying the mixing properties of such symmetric copulas and the reversible Markov chains that they generate.
This work can be combined with the results of Blum et al. (1958) [
13] to establish asymptotic distributions of estimators in the case of exchangeable Markov chains. Blum et al. (1958) [
13] showed that for any mean-zero exchangeable sequence with Gaussian joint distributions for finite subsets, correlations
, and variance 1, the central limit theorem holds in the form
as
(
when
). This was stated as a corollary of another result stating that for a mean zero exchangeable sequence with variance 1, the central limit theorem holds if and only if
. Taylor et al. (1985) [
14] provided more results on central limit theorems for mean zero exchangeable random sequences, including
(only for exchangeable sequences for which the correlation is
and
).
The rest of this paper is organized as follows. In
Section 2, we introduce symmetric copulas with square integrable densities. In
Section 3, we approach idempotent copulas with square integrable densities. A characterization is provided and used to build new sets of copula families that extend existing ones from Longla (2024) [
8]. We show that idempotent copulas with square integrable densities generate exchangeable Markov chain. These copula families weirdly behave conditionally like the independence copula. In
Section 4 we extend copula families to
n-dimensional copulas that include the Farlie–Gumbel–Morgenstern copula family. Here, we also provide the Spearman’s
for each of the copula families along with some useful remarks and graphs.
Section 5 is in general about the central limit theorem for exchangeable Markov chains and consequences.
Section 6 presents some simulation studies and related large sample theory. Finally,
Section 7 provides conclusions and comments.
3. Symmetric Idempotent Copulas
Recall that an idempotent copula is defined by
, where ∗ is the fold product of copulas. Based on our representation of absolutely continuous copulas with square integrable densities, if
C is an absolutely continuous copula with square integrable density, then for some positive integer
s and a set of orthonormal functions
on
, the following proposition holds for
Proposition 1. An absolutely continuous copula with square integrable density is idempotent when its eigenvalue 1 has finite multiplicity and it can be represented by (5) for some functions . Examples of idempotent copulas and extensions are built using various mean zero functions on . We consider such copulas families in this section.
3.1. Examples of Idempotent Copulas
Starting from our earlier comments, we note to begin with that clearly not all idempotent copulas will have densities that allow for their full identification. Earlier, we identified the following copulas that are idempotent: , M, and . Now, we formulate the following result.
Theorem 1. For any copula C, if , then is idempotent and commutes with C.
Proof. Assume that
. We have
These equalities are due to the fact that the limit of the fold product does not change when we fix the first
n terms. This argument is identical to the proof of Theorem 2.3 in Darsow et al. (1992) [
9]. It is also consistent with the formula of the annihilator presented by Darsow and Olsen (2010) [
11]. Applying Theorem 2.3 of Darsow et al. (1992) [
9], we conclude
; moreover,
. □
In search of idempotent copulas, we consider Formula (
5) with a single piecewise constant function. Note that the copula (
5) has a piecewise constant density function if and only if
is piecewise constant. The following copula family is obtained considering
with
(see Longla (2024) [
8]). Simple computations show that the only possible values of
a and
b satisfy the equations
and
. These conditions come from the fact that the function has its integral equal to 0 and that its square integrates to 1. The copula
for
,
is
The copula in (
6) has a maximal coefficient of correlation 1 and
for
. Thus, this is an example of an idempotent copula with less than full support. This copula satisfies
, where these copulas are understood as limits when
and
Remark 1. For Markov chains generated by with marginal distribution, the sets and are absorbing sets. Therefore, the Markov chains are not recurrent but rather aperiodic. This means that when a Markov chain generated by (6) starts in each of these sets, it does not leave for the other set. The only chance to visit the other set is to reach the point , and this is an event with probability 0. Therefore, a single realization of a Markov chain generated by this copula and the uniform distribution is a set of independent values from one of sets A and B. Thus, repeated runs of the Markov chain will have separate random samples from A with probability and from B with probability . For
, we denote the copula
. This copula is also obtained using
in our characterization of idempotent copulas with
. This copula has a density that is uniform on
and has
. For a large sample of Markov chains generated by this copula and the
marginal distribution, there is approximately the same number of chains with values in
and
. A related copula is
, defined by density
. It is obvious that
.
is not idempotent but is rather recurrent and periodic with
. This copula is uniform on
. Any Markov chain generated by this copula has values that bounce back and forth between the two regions
and
. Unlike for
,
generates Markov chains that can take values all over the set
:
Theorem 2. For any , let A be a union of sub-intervals of of measure and let B be its complement on . Then,is the density of an idempotent copula. The proof of this result is based on simple computations using . The condition that makes a copula density is built in the construction . An example of copula density uses and .
Note that based on Longla (2015) [
18], an absolutely continuous copula that is bounded away from zero on a set of Lebesgue measure 1 generates
-mixing Markov chains. Based on Formula (
5), absolutely continuous symmetric idempotent copulas with square integrable densities have maximal coefficient of correlation equal to 1. Therefore, these copulas do not generate
-mixing. It follows that they do not generate
-mixing (as
-mixing implies
-mixing; see Bradley (2007) [
19]) and their densities are not bounded away from zero on any set of Lebesgue measure 1. Thus, the following theorem holds.
Theorem 3. Any absolutely continuous copula that generates non-ψ-mixing Markov chains with uniform marginals has a density that is either equal to zero on a set with a non-zero Lebesgue measure (when piecewise constant) or is equal to zero on a closed set with a Lebesgue measure of 0 (when not piecewise constant).
Based on Theorem 3, we state the following more general result.
Theorem 4. The copula in (6) is the only absolutely continuous symmetric idempotent copula with piecewise constant density. Proof. Assuming that the copula is idempotent with piecewise continuous density implies that the functions
of Formula (
5) are piecewise constant functions. If we assume that
for
, forming a partition of
,
, and
for all
, then
The key point in this proof is the fact that C is a copula and does not generate -mixing stationary Markov chains. This implies that Theorem 3 holds. Thus, there exists a rectangle on which the copula density is 0. Without loss of generality, we can assume that . Otherwise, we can always renumber the sets to make sure that this condition holds. Assuming this ordering, for all implies . This can take some algebra to prove, but does not present any difficulties.
At this point, we have
and
. Suppose that
and that
are such that
. This means that there are at least three different values of the eigenvalue function. Then, an additional relationship allows us to form a system of three equations and with three unknowns
. This additional equation is due to the partition of
,
Regrouping the terms, we obtain
Therefore, using Cramer’s rule, we obtain
Note that this solution is possible and unique only because when . Moreover, given that for all i, we can look carefully at to realize that and and that for all . This leads to , which is a contradiction. Therefore, we cannot have more than three different values of , meaning that there can only be two different values of . To be exact, these arguments assume that we are working on a subset of full Lebesgue measure, on the complement of which might have other values. The rest of the proof for one function is the content of Theorem 2.
Now, if we consider the case with several functions
, each has to take the form specified by Theorem 2. Moreover,
are orthogonal and are defined with
, as in Theorem 2. We can split the segment
into three sets
such that
,
,
, and
The integral of this function easily leads to
implying that
On the other hand,
fails to be a copula density because
for
. Therefore, we cannot have more than one function in the sum. This concludes the proof. □
Theorem 5. If , , and , then there exists , , and : where .
Proof. Suppose that the conditions of Theorem 5 are satisfied but that its conclusion does not hold. Let , , and . The first step is to show that
Given that Theorem 5 does not hold, we have
. Suppose that
. Then, it follows that
. Moreover,
implies that
The right-hand side is
Thus,
. On the left-hand side,
implies
The right-hand side is equal to which implies . Therefore, we have and .
The second step is to show that unless
and
, we have a contradiction. We define
with
,
and
. Note that
for
,
for
, and the integral of each of these functions is equal to 0. Taking the difference of these integrals, it follows that
If
, then the first integral is strictly negative and the second is strictly positive. Moreover,
implies that
Given that
on
, Equation (
9) implies that the set
has Lebesgue measure 0. This leads to
and
. Thus,
except perhaps on a set of Lebesgue measure 0. □
3.2. Exchangeable Markov Chains
Note that exchangeability implies that the copula of is idempotent because the copula of is under the assumption of Markov chain and has the same distribution as due to exchangeability. For this reason, we can only consider copulas of the type . We have seen that for absolutely continuous idempotent copulas with square integrable densities, the only non-zero eigenvalue is with finite multiplicity. To allow such a copula to generate an exchangeable Markov chain, if we denote the eigenfunctions for as and , then we obtain the following theorem for the copula-based Markov chains that they generate.
Theorem 6. All absolutely continuous idempotent copulas with square densities generate exchangeable Markov chains. Their densities are provided by (8) for some such that , , where μ is the Lebesgue measure on and . The copula of the Markov chain is obtained for via the recurrence relationship Proof. The proof is based on the fact that any other copula will need to be characterized by
. Considering
, exchangeability implies that
. This can be written as
Thus, substituting the necessary derivatives, we obtain
Integrating and simplifying leads to
After simplifications, we obtain
Rearranging the terms yields
Because this equality holds for all
, there exists a constant
k such that
Taking the derivatives provides for almost every x. Thus, for any k, has only two possible values where the product is equal to . If we can show that the sets on which those values are taken have Lebesgue measures and for some positive , then this exactly defines the idempotent copula provided earlier and shown in Theorem 5. Because all implications in the proof are equivalencies, the copula provided here defines exchangeable Markov chains.
We now have . Theorem 4 completes the proof. □
The distribution of
is
This copula is obtained by computing
According to Nelsen (2006) [
4], this integral provides the joint density of the said vector. When
, we have
4. Multivariate Extensions
We consider n-copulas with as extensions of our copula families. This emphasizes the importance of our constructions via some examples of new copula families with specific dependence properties. A set of copulas that generate exchangeable random vectors that are not Markov chains is proposed.
Proposition 2. For any bounded mean-zero function φ withis a copula that generates exchangeable vectors that are not Markov chains. Proof. The condition on guarantees that we have a positive density, the mean of zero ensures that does not affect lower-dimensional distributions, and symmetry ensures that any subset of components consists of independent variables. This implies that any subset of or fewer components is made up of independent variables. □
Another example is the extension of the copula
with density
where
Using
, we obtain a subclass of the Farlie–Gumbel–Morgenstern multivariate copula
In the spirit of Proposition 2, the following theorem holds.
Theorem 7. For any integer and for any mean zero function such that , let ; then,where is a copula that is exchangeable iff are equal for . Examples include the following trigonometric copulas for any
by
Using a shifted truncated hyperbolic tangent, we obtain
Following Longla (2024) [
8], note that the maximal coefficient of correlation for Markov chains generated by this copula is
, where
is the only non-zero eigenvalue of the copula operator. This conclusion relies on
Therefore, we can take
in Formula (
14) or
. This range of dependence
is larger than that of the FGM copula.
Consider (for ); via simple computations, we can establish the following proposition.
Proposition 3. For any and for any such thatis a copula. Moreover, and C is the FGM when . Using
, we obtain
and
The positive Spearman correlations for this copula are larger than those of the FGM copula. We have
, while
Using
, we obtain a copula with
provided by
In this case, ; thus, and the range of reduces.
Remark 2. The particularity of the copulas from Theorem 7 is that any subset of variables from the vector with the copula in (13) is made of independent components. For Proposition 2, any subset of variables from the vector with the copula in (11) is made up of independent components. Theorem 8. Under the assumptions of Theorem 7, for any , the functionwith where the sum is taken over all subsets J of size of , is a copula for which any subset of variables is made up of independent components. This copula is exchangeable iff are equal for . Figure 1 presents an example of a copula built using the truncated hyperbolic tangent function. The graph shows the level curves for the specified
.
Figure 2 shows the level curves and density of the extreme values for (
16).
Figure 3 shows the level curves of the extreme values for (
17).
Figure 4 shows the densities of the extreme values for the copulas in (
17).
Examples of 1-Dependent Copulas
In this section we discuss asymmetric copulas. We consider asymmetry in the sense of using different basis functions for
u and
v. The first example we consider is with the cosine functions as
. For any positive integers
and
and for any real number
such that
,
is a copula. For
, this copula is not symmetric; however, per Longla et al. (2022b) [
2], it generates
-mixing Markov chains. Moreover, it can be easily shown that for a Markov chain generated by this copula, any pair of variables separated by one or more variables is made up of independent components.
Figure 5 shows a graph of an example of the copulas in (
19) with
,
, and
. The Spearman correlation for this family of copulas is 0 unless
and
are both even integers. The second example that we consider has the same properties as the previous example but is based on orthogonal polynomials:
for
is a copula. The Spearman correlation is 0 for any value of
.
Figure 6 shows the level curves and density of the copula in (
20) with
. A slight modification of (
20) provides the following copula, which does not more have the property of
m-dependence for any
m:
The Spearman correlation for this copula is
with
. As in the last example, we consider piecewise constant functions for
:
For
, the copula in (
22) and its density are equal to 0 for
and
, while for
the density is equal to 0 for
and
. Moreover, for all
, the Spearman correlation is 0 and
. This implies that any Markov chain generated by this copula and a continuous marginal distribution is
-mixing. This means that the conditions of Longla (2022c) [
17] are not necessary for
-mixing.
5. Central Limit Theorem and Exchangeability
Based on the properties of exchangeable random vectors, the variance of partial sums of functions
of
generated by an exchangeable copula is provided by the formula
where
is the Pearson correlation of any two components
of the vector. This is justified by the fact that the joint distributions of any two components of the vector are equal. Kipnis and Varadhan (1986) [
20] justified this on the basis that if the exchangeable vector is a Markov chain, then it is reversible and the central limit theorem holds if and only if
, as in that case we have
. It is clear that even if there is no independence, this variance implies that when
, the central limit theorem holds in the form
where
This result is implied by Kipnis and Varadhan (1986) [
20], and the equivalence of this convergence and the condition
is a consequence of Theorem 2.1.2 and Remark 2.1.1 in Taylor et al. (1985) [
14]. In this case, we also see that when
it is forcibly the case that
and
. Therefore,
cannot satisfy a central limit theorem when
.
For examples of the exchangeable Markov chains mentioned in this work, we have the following. First, partial sums of Markov chains generated by
do not satisfy the central limit theorem, as the variance of partial sums is exactly
. Markov chains generated by
and the uniform distribution are rows of
, with probability
to move at any time to any of these two states. They are conditionally independent given the first state; thus, the partial sums conditionally satisfy the central limit theorem. Considering the constructed copula families, we can conclude that Let
,
Note that implies that ; the last interval has only one point, and can be ignored due to continuity of the random variables.
Theorem 9. The copula defined by density is idempotent and generates exchangeable Markov chains , for which the following hold:
- 1.
When the marginal distribution is continuous with density , these Markov chains are not α-mixing, but satisfy the conditional central limit theorem (24) for any given fixed initial value, where μ and are the conditional mean and variance of . - 2.
The covariance of any two variables of Markov chains generated by these copulas is always positive.
Proof. Note that this copula is constant on eight subsets that form its support and that there are regions that can never be reached after the first value is known. This justifies the Markov chain being non-mixing. The values seen for every realization of the Markov chain depends on the first observed value. All other observations of the Markov chain after the first observation are independent and uniformly distributed on two possible subintervals. Therefore, the conditional central limit theorem holds given
for partial sums. This is what Peligrad (2023) [
21] called the quenched central limit theorem.
The covariance of any two variables of the Markov chain generated by these copulas with a continuous distribution
F is
The only other point in Theorem 9 that requires a proof is mixing, which we can complete by the following proposition. □
Proposition 4. Exchangeable Markov chains generated by copulas with symmetric absolutely continuous and square integrable density and continuous marginal distributions are non-mixing.
For general notions on mixing, see Bradley (2007) [
19]. Here, we use
-mixing, which is equivalent to
Proof. The proof of the proposition relies on the fact that
on a set
A of Lebesgue measure
and
on a set
B of Lebesgue measure
.
since the copula of
is absolutely continuous and its density is 0 on
. Therefore,
for all positive integer
n.