Homogeneity Test of Multi-Sample Covariance Matrices in High Dimensions

Peng Sun; Yincai Tang; Mingxiang Cao

doi:10.3390/math10224339

,

and

¹

Department of Statistics, East China Normal University, Shanghai 200062, China

²

KLATASDS-MOE, School of Statistics, East China Normal University, Shanghai 200062, China

³

Center for Quantitative Medicine, Duke-NUS Medical School, Singapore 169857, Singapore

⁴

School of Mathematics and Statistics, Anhui Normal University, Anhui 241002, China

Mathematics2022, 10(22), 4339;https://doi.org/10.3390/math10224339

This article belongs to the Special Issue Statistical Theory and Application

Version Notes

Order Reprints

Abstract

In this paper, a new test statistic based on the weighted Frobenius norm of covariance matrices is proposed to test the homogeneity of multi-group population covariance matrices. The asymptotic distributions of the proposed test under the null and the alternative hypotheses are derived, respectively. Simulation results show that the proposed test procedure tends to outperform some existing test procedures.

Keywords:

high-dimensional data; weighted Frobenius norm; homogeneity test; martingale central limit theorem; asymptotic distributions

MSC:

62H15; 62E20

1. Introduction

In the last thirty years, statistical methods have made great developments for high-dimensional data. One common feature of high-dimensional data is that the data dimension is larger or vastly larger than the total sample size. In high-dimensional settings, many classic methods are not well-defined or have poor performances. As a result, a great deal of statistical methods have been proposed to deal with high-dimensional data. As an important issue of statistical inference, hypothesis testing is being followed with interest by scholars. Testing the equality of several covariance matrices is a fundamental problem in multivariate statistical analysis and can be found in many relevant works, such as [1,2]. This problem will arise from the analysis of gene expression data. It was shown in [3] that many genes have different variances in gene expressions between disease states. For example, the dataset of the breast cancer of patient is classified in three groups based on their gene expression signatures: well-differentiated tumors, moderately differentiated tumors, and poorly differentiated and undifferentiated tumors. Refs. [4,5,6] respectively used artificial neural networks, feature selection and a Markov blanket-embedded genetic algorithm to investigate this dataset. This dataset includes 83 samples, which is greatly less than data dimension 2308. For this gene dataset, we are interested in checking whether the covariance matrices of the four groups are the same or not. It is noted that an important assumption for multivariate analysis of variance is that of equal covariance matrices in different groups. Therefore, testing the equality of several covariance matrices is needed. Specifically, assume

X_{i 1}, \dots, X_{i n_{i}}

are independent and identically distributed (iid) with a p-dimensional distribution with mean vector

μ_{i}

and covariance matrix

Σ_{i}

for

i = 1, \dots, k

with

k \geq 2

. We want to test the following hypothesis

\begin{matrix} H_{0} : Σ_{1} = \dots = Σ_{k} vs . H_{1} : H_{0} is not true . \end{matrix}

(1)

A classical method is the likelihood ratio test proposed by [7], where the population distribution is normal. However, as the data dimension is larger or vastly larger than the sample size, the likelihood ratio test is not well-defined because of the singularity of sample covariance matrices in probability one. Thus, it is vital to propose new procedures suitable to high-dimensional data. Owing to the curse of dimensionality, it becomes more challenging in high dimensions. For the hypothesis (1), there exists some test procedures in the literature. Based on the Frobenius norm, Refs. [8,9,10,11] proposed test statistics, respectively. Ref. [12] presented a test for high-dimensional longitudinal data. Ref. [13] proposed a power-enhancement high-dimensional test when the maximum eigenvalue of

Σ_{i}

is bounded or

tr (Σ_{i}^{j}) = O (p^{j})

for

j = 1, 2, 3, 4

. Ref. [14] presented a test based on the well-known Box’s test ([15]) for high-dimensional normal data. It is noted that [9] imposed the conditions

0 < \lim_{p \to \infty} tr (Σ^{i}) / p < \infty

for

i = 1, \dots, 8

to obtain the asymptotical null distribution of his test under normality, where

Σ

is equal to the covariance matrix under the null hypothesis

H_{0}

. Under high-dimensional normal data, [10] proposed a test as

0 < \lim_{p \to \infty} tr (Σ^{i}) / p < \infty

for

i = 1, \dots, 4

. Ref. [8] imposed

0 < tr (Σ^{2}) / p < \infty

and other conditions to build asymptotical properties of his test statistic. The test statistics in [8,9,10,13,14] imposed the condition about the relationship between the data dimension p and the sample size

n_{i}

or about the normal data. These conditions restrict the application of their test procedures. Ref. [11] gave a Frobenius norm-based test, which can be seen as an extension of that in [16], where the data dimension and the sample size can change arbitrarily.

The existing Frobenius norm-based tests for the hypothesis

H_{0}

were almost constructed by

\sum_{i < j}^{k} tr {(Σ_{i} - Σ_{j})}^{2} = k \sum_{i = 1}^{k} tr {(Σ_{i} - \bar{Σ})}^{2}

, where

\bar{Σ} = \sum_{i = 1}^{k} Σ_{i} / k

means average covariance matrix of the population covariance matrices

Σ_{1}, \dots, Σ_{k}

. The deviations of

Σ_{i}

from overall

\bar{Σ}

is weighted by all equal weight

1 / k

, which, however, cannot emphasize the deviations of populations with large sample sizes. Based on this, we here construct a new test statistic by a different Frobenius norm

\sum_{i = 1}^{k} n_{i} tr {(Σ_{i} - Σ^{*})}^{2}

where

Σ^{*} = \sum_{i = 1}^{k} n_{i} Σ_{i} / n

and

n = \sum_{i = 1}^{k} n_{i}

. Here,

Σ^{*}

is a weighted average of k covariance matrices, which can emphasize the deviations of populations with large sample sizes. On the other hand, it is evident that the null hypothesis

H_{0}

holds if and only if

\sum_{i = 1}^{k} n_{i} tr {(Σ_{i} - Σ^{*})}^{2} = 0

.

The main purpose of this paper is to develop a new method to test the homogeneity of k high-dimensional covariance matrices on the basis of the weighted Frobenius norm. For the hypothesis in (1), most existing methods ([8,9,10,13]) imposed normality or the explicit relationship of between the data dimension p and the sample size n, and may behave poorly under non-normal data or ultra-high dimensional data. However, our method does not require normality or an explicit relationship between p and n, which is similar with the method in [11]. Thus, our method can be applied to non-normal and ultra-high dimensional data. On the other hand, the difference of the method in [11] and ours is that we use a sample-sizes-based Frobenius norm to build test statistics so that the tests in this paper and [11] behave differently, as the sample sizes are unequal. When

k = 2

, our proposed test statistic can be seen as an extension of that [16] except a constant. Hence it can also be applied to two-sample data.

Simulation results show that the proposed test behaves differently from existing tests such as the tests in [11,13,14]. We will discuss the differences between the proposed test and the competing tests through numerical comparisons in various scenarios. We observe that the proposed test outperforms the competing tests in both size and power in many cases.

The remainder of the paper is organized as follows. Section 2 first presents the statistical model and the imposed conditions in order to construct our new test. In Section 3, a new test statistic is proposed and its asymptotic properties are also given. Section 4 presents a numerical study of the proposed test to compare three competing tests. Concluding remarks are provided in Section 5. The proofs of main results are arranged in Appendix A.

2. Preliminaries

We consider the following general multivariate model that is often used in literature:

X_{i j} = μ_{i} + Γ_{i} Z_{i j}

for

j = 1, \dots, n_{i}

and

i = 1, \dots, k

, where

Γ_{i}

is a

p \times r

matrix for some

r \geq p

such that

Γ_{i} Γ_{i}^{T} = Σ_{i}

and

Z_{i 1}, \dots, Z_{i n_{i}}

are r-variate iid random vectors with

E (Z_{i j}) = 0

and

Var (Z_{i 1}) = I_{r}

. Furthermore, denote

Z_{i j} = {(z_{i j 1}, \dots, z_{i j r})}^{T}

and

z_{i j 1}, \dots, z_{i j r}

have a finite eighth moment with

E (z_{i j l}^{4}) = Δ_{l}

where

Δ_{l}

is some constant. Moreover, for any positive integers q and

α_{v}

, there has

E (z_{i j v_{1}}^{α_{1}} \dots z_{i j v_{q}}^{α_{q}}) = E (z_{i j v_{1}}^{α_{1}}) \dots E (z_{i j v_{q}}^{α_{q}})

(2)

as

\sum_{v = 1}^{q} α_{v} \leq 8

, where

v_{1}, \dots, v_{q}

are distinct indices. It shows from (2) that

z_{i j v_{1}}, \dots, z_{i j v_{q}}

are pseudo-independent, which is naturally satisfied when samples are generated from normal distribution.

In order to obtain the asymptotic distributions of new test statistic, some conditions are imposed as follows:

(C1): As $n \to \infty$ , $n_{i} / n \to c_{i} \in (0, 1)$ for $i = 1, \dots, k$ .
(C2): As $n \to \infty$ , $p = p (n_{1}, \dots, n_{k}) \to \infty$ and for arbitrary $l, m, s, h \in {1, \dots, k}$ , $tr (Σ_{l} Σ_{m}) \to \infty$ and $tr (Σ_{l} Σ_{m} Σ_{s} Σ_{h}] = o [t r (Σ_{l} Σ_{m}) t r (Σ_{s} Σ_{h})]$ .

(C1) implies that all sample sizes have the same increasing rate, except constant terms. (C2) can be seen as an extension of the condition A2 in [16] to the case of multi-groups. The two conditions are the same as those in [11]. A key aspect of (C2) is that it does not impose any explicit relationship between p and sample size n. It is noted that (C2) naturally holds when all eigenvalues of k covariance matrices are uniformly bounded. Next, we consider another set of covariance matrices satisfying (C2), namely spiked covariance structures. For convenience, we set

k = 4

and let

Σ_{i} = diag (a_{i 1} p^{δ_{i 1}}, \dots, a_{i m_{i}} p^{δ_{i m_{i}}}, a_{i, m_{i} + 1},

\dots, a_{i p})

, where

a_{i j}

s,

δ_{i j}

s and

m_{i}

s are fixed positive constants with

δ_{i 1} \geq \dots \geq δ_{i m_{i}}

for

i = 1, 2, 3, 4

. Then, the main items of

tr (Σ_{1} Σ_{2} Σ_{3} Σ_{4})

,

tr (Σ_{1} Σ_{2})

and

tr (Σ_{3} Σ_{4})

is respectively

a_{11} a_{21} a_{31} a_{41} p^{δ_{11} + δ_{21} + δ_{31} + δ_{41}} + O (p)

,

a_{11} a_{21} p^{δ_{11} + δ_{21}} + O (p)

and

a_{31} a_{41} p^{δ_{31} + δ_{41}} + O (p)

. As a result, (C2) holds if

δ_{11} + δ_{21} < 1

and

δ_{31} + δ_{41} < 1

. Let

λ_{i j}

denote the jth largest eigenvalue of

Σ_{i}

, then if

δ_{11}, \dots, δ_{41}

are less than 0.5,

λ_{i 1}^{2} / tr (Σ_{i}^{2}) \to 0

for

i = 1, \dots, 4

, which is called a non-strongly spiked eigenvalue (NSSE) structure in [17]. Otherwise, if there exists some

δ_{i 1} \geq 0.5

, then

λ_{i 1}^{2} / tr (Σ_{i}^{2}) ↛ 0

, which is called a strongly spiked eigenvalue (SSE) structure in [17]. Therefore, (C2) holds when all of the covariance matrices have NSSE structures, or some of the covariance matrices have SSE structures.

To propose our test statistic, the hypothesis in (1) is rewritten as the following hypothesis

\begin{matrix} H_{0} : \sum_{i = 1}^{k} n_{i} tr {(Σ_{i} - Σ^{*})}^{2} = 0 vs . H_{1} : \sum_{i = 1}^{k} n_{i} tr {(Σ_{i} - Σ^{*})}^{2} > 0 . \end{matrix}

(3)

Then, we can construct a new test statistic based on

\sum_{i = 1}^{k} n_{i} tr {(Σ_{i} - Σ^{*})}^{2}

. Note that

\begin{matrix} \sum_{i = 1}^{k} n_{i} tr {(Σ_{i} - Σ^{*})}^{2} = \sum_{i = 1}^{k} \frac{n_{i} (n - n_{i})}{n} tr (Σ_{i}^{2}) - \sum_{i \neq j}^{k} \frac{n_{i} n_{j}}{n} tr (Σ_{i} Σ_{j}) . \end{matrix}

Thus, a new test statistic can be given if unbiased estimators of

tr (Σ_{i}^{2})

and

tr (Σ_{i} Σ_{j})

are respectively obtained for

i = 1, \dots, k

.

3. Main Results

In this section, we propose a new test statistic for the hypothesis in (1) and give its asymptotic properties under conditions (C1) and (C2). According to the equivalent hypothesis in (3), our new test statistic is given as the following

T : = \sum_{i = 1}^{k} \frac{n_{i} (n - n_{i})}{n} A_{i} - \sum_{i \neq j}^{k} \frac{n_{i} n_{j}}{n} C_{i j},

(4)

where

A_{i}

and

C_{i j}

are respectively unbiased estimators of

tr (Σ_{i}^{2})

and

tr (Σ_{i} Σ_{j})

, which are given as follows

A_{i} = \frac{1}{{(n_{i})}_{2}} \sum_{j, l}^{*} {(X_{i j}^{T} X_{i l})}^{2} - \frac{2}{{(n_{i})}_{3}} \sum_{j, l, f}^{*} (X_{i j}^{T} X_{i l} X_{i j}^{T} X_{i f}) + \frac{1}{{(n_{i})}_{4}} \sum_{j, l, f, g}^{*} (X_{i j}^{T} X_{i l} X_{i f}^{T} X_{i g})

(5)

and

\begin{matrix} C_{i j} = & \frac{1}{n_{i} n_{j}} \sum_{l = 1}^{n_{i}} \sum_{m = 1}^{n_{j}} {(X_{i l}^{T} X_{j m})}^{2} - \frac{1}{n_{i} {(n_{j})}_{2}} \sum_{l = 1}^{n_{i}} \sum_{f, m}^{*} (X_{i l}^{T} X_{j f} X_{i l}^{T} X_{j m}) - \frac{1}{{(n_{i})}_{2} n_{j}} \sum_{l = 1}^{n_{j}} \sum_{f, m}^{*} \\ (X_{j l}^{T} X_{i f} X_{j l}^{T} X_{i m}) + \frac{1}{{(n_{i})}_{2} {(n_{j})}_{2}} \sum_{l, m}^{*} \sum_{f, g}^{*} (X_{i l}^{T} X_{j f} X_{i m}^{T} X_{j g}) . \end{matrix}

(6)

Here

{(n_{i})}_{l} = \frac{(n_{i})!}{(n_{i} - l)!}

and

\sum^{*}

denotes the sum for all different indices. Note that T is an unbiased estimator of

\sum_{i = 1}^{k} n_{i} tr {(Σ_{i} - Σ^{*})}^{2}

. The above two unbiased estimators are used in [11,16,18,19], respectively. It is noted that we can assume, without loss of generality,

μ_{1} = μ_{2} = \dots = μ_{k}

because T is invariant under location transformation. Under this assumption, the leading terms in

A_{i}

and

C_{i j}

are respectively the first term since the last two terms in

A_{i}

and the last three terms in

C_{i j}

are respectively infinitesimals of higher order of the first term. As a result, we only treat the first term in

A_{i}

and

C_{i j}

to save computation time. Please see [11,18] for more details about this computation time.

It shows from conditions (C1) and (C2) that the variance of T is

\begin{matrix} σ^{2} = & \frac{4}{n^{2}} \{\sum_{s = 1}^{k} \frac{n_{s} {(n - n_{s})}^{2}}{(n_{s} - 1)} {tr}^{2} (Σ_{s}^{2}) + \sum_{s \neq h}^{k} 2 n_{s} n_{h} {tr}^{2} (Σ_{s} Σ_{h})\} \\ + \frac{4}{n^{2}} \sum_{s = 1}^{k} \sum_{h \neq f}^{k} n_{s} n_{h} n_{f} tr {Γ_{s}^{T} (Σ_{s} - Σ_{h}) Γ_{s} \circ Γ_{s}^{T} (Σ_{s} - Σ_{f}) Γ_{s}} \\ + \frac{8}{n^{2}} \sum_{s = 1}^{k} \sum_{h \neq f}^{k} n_{s} n_{h} n_{f} tr {Σ_{s} (Σ_{s} - Σ_{h}) Σ_{s} (Σ_{s} - Σ_{f})} + o [{tr}^{2} (Σ_{s}^{2})] . \end{matrix}

Then, we can obtain the following asymptotic distribution of T.

Theorem 1.

Under (C1) and (C2), as

m i n {p, n} \to \infty

, we have

\begin{matrix} \frac{T - E (T)}{σ} \overset{d}{⟶} N (0, 1) . \end{matrix}

Proof.

See Appendix B. □

It is clear that the variance of T under the null hypothesis

H_{0}

is

σ_{0}^{2} = \frac{4}{n^{2}} \{\sum_{i = 1}^{k} \frac{n_{i} {(n - n_{i})}^{2}}{(n_{i} - 1)} {tr}^{2} (Σ_{i}^{2}) + \sum_{i \neq j}^{k} 2 n_{i} n_{j} {tr}^{2} (Σ_{i} Σ_{j})\} .

As a result, to formulate the test procedure, we need to give a ratio-consistent estimator of

σ_{0}^{2}

. In this paper, we use the unbiased estimators

A_{i}

and

C_{i j}

in (5) and (6) to respectively estimate

tr (Σ_{i}^{2})

and

tr (Σ_{i} Σ_{j})

. The following lemma is from [11].

Lemma 1.

Under (C1) and (C2), as

m i n {p, n} \to \infty

,

\frac{A_{i}}{tr (Σ_{i}^{2})} \overset{p}{⟶} 1, \frac{C_{i j}}{tr (Σ_{i} Σ_{j})} \overset{p}{⟶} 1 .

On the basis of Lemma 1, a ratio-consistent estimator of

σ_{0}

is given by

{\hat{σ_{0}}}^{2} : = \frac{4}{n^{2}} \{\sum_{i = 1}^{k} \frac{n_{i} {(n - n_{i})}^{2}}{(n_{i} - 1)} A_{i}^{2} + \sum_{i \neq j}^{k} 2 n_{i} n_{j} C_{i j}^{2}\} .

Then, our new test is proposed by

\hat{T} = T / \hat{σ_{0}}

. It follows from Theorem 1 and Lemma 1 that

\hat{T} \overset{d}{⟶} N (E (T) / σ_{0}, σ^{2} / σ_{0}^{2})

as

m i n {p, n} \to \infty

. We see especially that the proposed test

\hat{T}

is asymptotically distributed with the standard norm distribution under the null hypothesis

H_{0}

. As a result, we reject the null hypothesis when

\hat{T} \geq z_{α}

, where

z_{α}

is the upper

α

quantile of standard normal distribution.

The following corollary is easy to be taken, which gives the asymptotic power function under the hypothesis

H_{1}

.

Corollary 1.

Under (C1) and (C2), as

m i n {p, n} \to \infty

, we have

P (\hat{T} \geq z_{α}) = Φ [- z_{α} σ_{0} / σ + E (T) / σ] + o (1) .

4. Simulation Studies

In this section, we compare our proposed test with four existing tests by simulation. The four competing tests are denoted by

T_{Z B H W}

,

T_{Z L G Y}

and

ρ L_{k} (y)

from [11,13,14], respectively. Note that the authors used eight test statistics according to a different y in [14]. Here, we only make simulations for four test statistics to save space, namely

ρ L_{k} (y_{i})

,

i = 1, 2, 3

and 4. We obtain that

z_{i j r}

s are independently generated from the standard normal distribution

N (0, 1)

and the centralized gamma distribution

G a m m a (4, 2) - 2

, respectively, for

i = 1, \dots, k

,

j = 1, \dots, n_{i}

and

r = 1, \dots, p

. We set

k = 3

and

Γ_{i} = Σ_{i}^{1 / 2}

for i = 1, 2 and 3, where the covariance matrices are considered respectively in the following four cases:

Case 1: $H_{0} : Σ_{1} = Σ_{2} = Σ_{3} = Λ U_{0} Λ$ , $H_{1} : Σ_{i} = Λ U_{i - 1} Λ, i = 1, 2, 3,$ where $Λ = diag (λ_{1}, \dots, λ_{p})$ with $λ_{1}, \dots, λ_{p} \overset{i . i . d .}{\sim} U n i f (1, 5)$ and $U_{i - 1}$ is a $p \times p$ matrix with the $(a, b)$ th element being ${(- 1)}^{a + b} {(\frac{5 - 2^{i - 1}}{6})}^{| a - b |}^{0.1}$ .
Case 2: $H_{0} : Σ_{1} = Σ_{2} = Σ_{3} = Λ U_{0} Λ$ , $H_{1} : Σ_{i} = Λ U_{i - 1} Λ, i = 1, 2, 3,$ where $Λ = diag (λ_{1}, \dots, λ_{p})$ with $λ_{1}, \dots, λ_{p} \overset{i . i . d .}{\sim} U n i f (1, 5)$ and $U_{i - 1}$ is a $p \times p$ matrix with the $(a, b)$ th element being ${(- 1)}^{a + b} {(\frac{4 - i}{5})}^{| a - b |}^{0.1}$ .
Case 3: $H_{0} : Σ_{1} = Σ_{2} = Σ_{3} = Λ_{0}$ , $H_{1} : Σ_{i} = Σ_{i - 1}, i = 1, 2, 3,$ where $Λ_{0} = diag (λ_{1}, \dots, λ_{p})$ with $λ_{1}, \dots, λ_{p} \overset{i . i . d .}{\sim} U n i f (1, 5)$ . $Λ_{1}$ is a $p \times p$ symmetric matrix with the $(a, a)$ th element being 1.01, the $(a, a + 1)$ th elements being 0.1 and the rest being 0; $Λ_{2}$ is a $p \times p$ symmetric matrix with the $(a, a)$ th element being 3, the $(a, a + 1)$ th elements being 2, the $(a, a + 2)$ th elements being 1 and the rest being 0.
Case 4: $H_{0} : Σ_{1} = Σ_{2} = Σ_{3} = diag (ω_{1}, \dots, ω_{p})$ where $ω_{1}, \dots, ω_{p} \overset{i . i . d .}{\sim} U n i f (0.5, 10)$ , $H_{1} : Σ_{1} = U_{i} Λ U_{i}^{T}, i = 1, 2, 3,$ where $Λ = diag (λ_{1}, \dots, λ_{p})$ with $λ_{1}, \dots, λ_{p} \overset{i . i . d .}{\sim} G a m m a (4, 0.5)$ , $U_{i} = {(W_{i}^{T} W_{i})}^{- 1 / 2} W_{i}^{T}$ and $W_{i}$ is a $p \times p$ matrix whose entries are independently generated from the normal distribution $N (0, i)$ .

Without loss of generality, all the population means are chosen to be 0. The sample size and data dimension are respectively

n = 45, 95

and

p = 16, 32, 64, 128, 256

. Empirical sizes and powers are computed under the nominal level

α = 0.05

with

10, 000

replications.

All the simulation results are reported in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8 and Figure 1, Figure 2, Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8. Figures provide intuitive observation, and tables show the simulated values. First, Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8 show that the empirical sizes of the test

T_{Z L G Y}

are seriously inflated, especially when

n = 95

in all cases of simulation. For example, just as what is presented in Table 1, the empirical sizes of

T_{Z L G Y}

are respectively 0.0719, 0.1230, 0.2814, 0.5641 and 0.7692 when

n = 95

and

p =

16, 32, 64, 128, 256. Thus,

T_{Z L G Y}

cannot control the nominal size reasonably. Second, Table 1, Table 2, Table 3 and Table 4 imply that the empirical sizes of tests

ρ L_{k} (y_{i})

,

i = 1, 2, 3, 4

, are closer to a given size than those of

T_{Z B H W}

and

\hat{T}

, while

T_{Z B H W}

and

\hat{T}

respectively obtains about

7 %

and

6 %

of empirical sizes as

n = 95

. Thus,

ρ L_{k} (y_{i})

outperforms

T_{Z B H W}

and

\hat{T}

in size under Cases 1 and 2 when data are generated from standard normal distribution and gamma distribution. At the same time, our new test

\hat{T}

has higher empirical powers than

T_{Z B H W}

and

ρ L_{k} (y_{i})

. For example, as

n = 95

and

p = 256

, the empirical powers of

\hat{T}

,

T_{Z B H W}

and

ρ L_{k} (y_{i})

,

i = 1, 2, 3, 4

, are respectively 0.9966, 0.8974, 0.5509, 0.5560, 0.5359 and 0.3469 under Case 1 and normal distribution. Finally, Table 5, Table 6, Table 7 and Table 8 show that the six tests

T_{Z B H W}

,

ρ L_{k} (y_{i})

,

i = 1, 2, 3, 4

, and

\hat{T}

have similar empirical sizes, which are all close to the nominal size. On the other hand, the six tests still have similar empirical powers under Case 3. However, under Case 4, the empirical powers of

ρ L_{k} (y_{i})

are extremely deflated, which are no more than 0.1600. Moreover, the empirical powers of our new test are respectively 0.04 and 0.16 more than those of

T_{Z B H W}

when

n = 45

and 95.

Table 1. Empirical sizes and powers of seven tests under Case 1 and normal distribution.

Table 2. Empirical sizes and powers of seven tests under Case 1 and Gamma distribution.

Table 3. Empirical sizes and powers of seven tests in Case 2 and normal distribution.

Table 4. Empirical sizes and powers of seven tests in Case 2 and Gamma distribution.

Table 5. Empirical sizes and powers of seven tests in Case 3 and normal distribution.

Table 6. Empirical sizes and powers of seven tests in Case 3 and Gamma distribution.

Table 7. Empirical sizes and powers of seven tests in Case 4 and normal distribution.

Table 8. Empirical sizes and powers of seven tests in Case 4 and Gamma distribution.

Figure 1. The Empirical sizes of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under normal distribution.

Figure 2. The Empirical powers of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under normal distribution.

Figure 3. The Empirical sizes of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under normal distribution.

Figure 4. The Empirical powers of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under normal distribution.

Figure 5. The Empirical sizes of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under Gamma distribution.

Figure 6. The Empirical powers of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under Gamma distribution.

Figure 7. The Empirical sizes of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under Gamma distribution.

Figure 8. The Empirical powers of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under Gamma distribution.

In summary, our proposed test

\hat{T}

can control a given size reasonably and has greater powers than competing tests in all cases of our simulation whenever samples are from the normal model or the non-normal model. However,

T_{Z L G Y}

fails in controlling a given size.

ρ L_{k} (y_{i})

,

i = 1, 2, 3, 4

, seriously deflates the empirical powers in some cases of our simulations.

T_{Z B H W}

can slightly inflate the empirical size in some cases of our simulation.

5. Real Data Analysis

This problem will arise from the analysis of gene expression data. It was shown in [3] that many genes have different variances in gene expressions between disease states. For example, the dataset of the breast cancer of patient is classified into three groups based on their gene expression signatures: well-differentiated tumors (

n_{1}

= 29), moderately differentiated tumors (

n_{2}

= 136), and poorly differentiated and undifferentiated tumors (

n_{3}

= 35). The breast cancer microarray data sets, including patient outcome information, were downloaded from the National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (GEO) data repository and are accessible through GEO Series accession now. GSE11121 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE11121, accessed on 26 June 2022). This dataset includes 200 samples, which is far fewer than data dimension 22,283. For this gene dataset, we are interested in checking whether the covariance matrices of the three groups are the same or not. It is noted that an important assumption for multivariate analysis of variance is that of equal covariance matrices in different groups. Therefore, testing the equality of several covariance matrices is necessary.

Since we used the raw breast cancer dataset (22,283 features), there are many features and no preprocessing. First, we preprocessed and filtered the data, including conventional preprocessing such as background adjustment, normalization, and summarization. Then, we performed feature screening, filtered out features whose coefficient of variation is out of range (0.25, 1.0) and controlled at least 5 samples to exceed 1320 count values. Finally, a data set of 200 samples and 1280 features was screened out, and then a high-dimensional hypothesis test problem was performed on the screened data set.

Compared with other methods and our method, the observed test statistics (p value) of the screened breast cancer dataset are

T_{Z B H W} (1.92, 2.75 \times 10^{- 2})

,

ρ L_{k} (y_{1}) (0.46, 0.79)

,

ρ L_{k} (y_{2}) (0.4, 0.82)

,

ρ L_{k} (y_{3}) (0.91, 0.63)

,

ρ L_{k} (y_{4}) (0.27, 0.69)

,

T_{Z L G Y} (2.47, 6.62 \times 10^{- 3})

and

\hat{T} (1.96, 2.52 \times 10^{- 2})

. The p-values of the comparison method statistics

T_{Z B H W}

,

T_{Z L G Y}

and our method’s statistic are significant.

6. Concluding Remarks

In this paper, we propose a new test on the homogeneity of k-sample covariance matrices in a high-dimensional setting. The asymptotic properties of the proposed test are derived under some regularity conditions. We compare our new test with six competing tests by simulation. Numerical results show that our proposed test can control the nominal size reasonably and that it has the highest empirical powers in our simulation scenario. However, just as what we showed in Section 2, the technique condition (C2) may not hold again when covariance matrices have some spiked eigenvalues. How to obtain the theoretical results of test statistics under spiked covariance matrices structure is an interested problem. We leave this problem as a future research direction.

Author Contributions

Conceptualization, P.S., Y.T. and M.C.; Simulation studies, P.S.; Formal analysis, P.S.; Funding acquisition, Y.T.; Investigation, P.S.; Methodology, P.S. and M.C.; Supervision, Y.T.; Software, P.S.; Visualization, P.S.; Validation, P.S. and M.C.; Writing—original draft, P.S.; Writing—review & editing, P.S. and M.C. All authors have read and agreed to the published version of the manuscript.

Funding

Tang’s research is supported by the Natural Science Foundation of China (No. 12271168) and the 111 Project of China (No. B14019). Cao’s research is supported by the Humanities and Social Sciences Foundation of Ministry of Education (No. 22YJC910001), the Natural Science Foundation of Anhui Province (No. 2108085MA09), the Foundation for Excellent Young Talents in College of Anhui Province (No. gxyqZD2021092) and the Program for Mathematical Statistics Research Team of Anhui Province (No. 2020jxtd102).

Data Availability Statement

Gene expression data generated for this manuscript is available to download and explore at https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE11121, accessed on 26 June 2022. All raw data are available in GEO database under the accession number GEO: GSE11121.

Conflicts of Interest

On behalf of the authors, the corresponding author states that there is no conflict of interest.

Appendix A. Some Lemmas

As mentioned in Section 3, we can assume all the population means are 0. In this case, we only need to consider respectively the first terms

A_{i}^{*} = \frac{1}{{(n_{i})}_{2}} \sum_{j, l}^{*} {(X_{i j}^{T} X_{i l})}^{2}

and

C_{i j}^{*} = \frac{1}{n_{i} n_{j}} \sum_{l = 1}^{n_{i}} \sum_{m = 1}^{n_{j}} {(X_{i l}^{T} X_{j m})}^{2}

of

A_{i}

and

C_{i j}

in (5) and (6). Let

T_{1} : = \sum_{i = 1}^{k} \frac{n_{i} (n - n_{i})}{n} A_{i}^{*} - \sum_{i \neq j}^{k} \frac{n_{i} n_{j}}{n} C_{i j}^{*}

, then we only need to prove the asymptotic normality of

T_{1}

. To do this, we define

Z_{l} = Z_{s i}

, where

l = i + \sum_{j = 1}^{s - 1} n_{j}

and

\sum_{j = 1}^{0} n_{j} = 0

for

i = 1, \dots, n_{s}

and

s = 1, \dots, k

. A sequence of increasing

σ

-fields is defined as

F_{0} = {\emptyset, Ω}

and

F_{j} = σ {Z_{1}, \dots, Z_{j}}

for

j = 1, \dots, n

. Let

E_{j} (\cdot)

denote the conditional expectation with respect to

F_{j}

and

D_{j} : = (E_{j} - E_{j - 1}) T_{1}

. It is easy to obtain

T_{1} - E (T_{1}) = \sum_{j = 1}^{n} D_{j}

.

Lemma A1.

For any n,

{D_{j}, F_{j}}_{j = 1}^{n}

is a square integrable martingale difference sequence.

Proof.

The conclusion is obvious. Hence, the proof is omitted. □

Therefore, to prove Theorem 1, we next apply the martingale central limit theorem.

Lemma A2.

Under conditions (C1) and (C2), as

n, p \to \infty

,

\frac{\sum_{j = 1}^{n} E (D_{j}^{2} | F_{j - 1})}{V a r (T_{1})} \overset{p}{⟶} 1 .

Proof.

By some calculations, we have

\begin{matrix} \sum_{j = 1}^{n} E (D_{j}^{2} | F_{j - 1}) = & \sum_{j = 1}^{n} E_{j - 1} {[E_{j} (T_{1}) - E_{j - 1} (T_{1})]}^{2} \\ = & \sum_{l = 1}^{k} \sum_{j = 1}^{n_{l}} E_{\sum_{i = 1}^{l - 1} n_{i} + j - 1} [E_{\sum_{i = 1}^{l - 1} n_{i} + j} (\frac{n - n_{l}}{n} A_{l}^{*} - \sum_{h \neq l}^{k} \frac{n_{h} n_{l}}{n} C_{l h}^{*}) \\ {- E_{\sum_{i = 1}^{l - 1} n_{i} + j - 1} (\frac{n - n_{l}}{n} A_{l}^{*} - \sum_{h \neq l}^{k} \frac{n_{h} n_{l}}{n} C_{l h}^{*})]}^{2} \\ = & \sum_{l = 1}^{k} \sum_{j = 1}^{n_{l}} E_{\sum_{i = 1}^{l - 1} n_{i} + j - 1} [\frac{n - n_{l}}{n {(n_{l})}_{2}} \sum_{i < j}^{n_{l}} tr [(X_{l i} X_{l i}^{T} - Σ_{l}) (X_{l j} X_{l j}^{T} - Σ_{l})] \\ + \frac{n - n_{l}}{n n_{l}} tr [(X_{l j} X_{l j}^{T} - Σ_{l}) Σ_{l}] - \frac{n_{h}}{n} \sum_{h > l}^{k} tr [(X_{l j} X_{l j}^{T} - Σ_{l}) Σ_{h}] \\ {- \frac{1}{n} \sum_{h < l}^{k} \sum_{i = 1}^{n_{h}} (X_{h i}^{T} (X_{l j} X_{l j}^{T} - Σ_{l}) X_{h i})]}^{2} \\ = & \sum_{l = 1}^{k} \frac{1}{n^{2} n_{l}^{2}} \sum_{j = l}^{n_{l}} E_{\sum_{i = 1}^{l - 1} n_{i} + j - 1} {[tr ((X_{l j} X_{l j}^{T} - Σ_{l}) Q_{l j})]}^{2} \\ = & \sum_{l = l}^{k} \frac{1}{n^{2} n_{l}^{2}} \sum_{j = l}^{n_{l}} \{2 tr {(Q_{l j} Σ_{l})}^{2} + Δ_{l} tr (Γ_{l}^{T} Q_{l j} Γ_{l} \circ Γ_{l}^{T} Q_{l j} Γ_{l})\}, \end{matrix}

where

\begin{matrix} Q_{l j} & = \frac{n - n_{l}}{n_{l} - 1} \sum_{i < j}^{n_{l}} (X_{l i} X_{l i}^{T} - Σ_{l}) + (n - n_{l}) Σ_{l} + n_{l} \sum_{h \neq l}^{k} n_{h} Σ_{h} - n_{l} \sum_{h < l}^{k} \sum_{i}^{n_{h}} (X_{h i} X_{h i}^{T} - Σ_{h}) \\ = : Q_{l j 1} + Q_{l j 2} + Q_{l j 3} + Q_{l j 4} . \end{matrix}

Note that

E (\sum_{j = 1}^{n} E (D_{j}^{2} | F_{j - 1})) = Var (T_{1})

. Next, we prove

\begin{matrix} \frac{Var (\sum_{j = 1}^{n} E (D_{j}^{2} | F_{j - 1}))}{{V a r}^{2} (T_{1})} \to 0 . \end{matrix}

Note that

\begin{matrix} tr {(Q_{l j} Σ_{l})}^{2} = & tr {(Q_{l j 1} Σ_{l})}^{2} + tr {(Q_{l j 2} Σ_{l})}^{2} + tr {(Q_{l j 3} Σ_{l})}^{2} + tr {(Q_{l j 4} Σ_{l})}^{2} \\ + 2 tr [(Q_{l j 1} Σ_{l}) (Q_{l j 2} Σ_{l})] + 2 tr [(Q_{l j 1} Σ_{l}) (Q_{l j 3} Σ_{l})] + 2 tr [(Q_{l j 1} Σ_{l}) (Q_{l j 4} Σ_{l})] \\ + 2 tr [(Q_{l j 2} Σ_{l}) (Q_{l j 3} Σ_{l})] + 2 tr [(Q_{l j 2} Σ_{l}) (Q_{l j 4} Σ_{l})] + 2 tr [(Q_{l j 3} Σ_{l}) (Q_{l j 4} Σ_{l})] . \end{matrix}

It then follows that

\begin{matrix} Var (\sum_{l = 1}^{k} \frac{1}{n^{2} n_{l}^{2}} \sum_{j = 1}^{n_{l}} tr {(Q_{l j} Σ_{l})}^{2}) & \leq 16 \sum_{s, t = 1}^{4} R_{s t}, \end{matrix}

where

R_{s t} = Var (\sum_{l = 1}^{k} \frac{1}{n^{2} n_{l}^{2}} \sum_{j = 1}^{n_{l}} tr [E_{\sum_{i = 1}^{l - 1} n_{i} + j} Q_{l j s} Σ_{l} E_{\sum_{i = 1}^{l - 1} n_{i} + j} Q_{l j t} Σ_{l}])

. In the following, we will prove

R_{s t} = o [{Var}^{2} (T_{1})]

for

s, t = 1, 2, 3

and 4.

\begin{matrix} R_{11} = & Var (\sum_{l = 1}^{k} \frac{{(n - n_{l})}^{2}}{n^{2} {(n_{l} - 1)}^{2} n_{l}^{2}} \sum_{j_{1}, j_{2} = 1}^{n_{l}} (n_{l} - j_{1} \lor j_{2}) tr [(X_{l j_{1}} X_{l j_{1}}^{T} - Σ_{l}) (X_{l j_{2}} X_{l j_{2}}^{T} - Σ_{l})]) \\ = & \sum_{l = 1}^{k} \sum_{j_{1}, j_{2} = 1}^{n_{l}} \frac{{(n - n_{l})}^{4} {(n_{l} - j_{1} \lor j_{2})}^{2}}{n^{4} {(n_{l} - 1)}^{4} n_{l}^{4}} Var (tr [(X_{l j_{1}} X_{l j_{1}}^{T} - Σ_{l}) (X_{l j_{2}} X_{l j_{2}}^{T} - Σ_{l})]) \\ = & \sum_{l = 1}^{k} \sum_{j_{1} = j_{2} = j}^{n_{l}} \frac{{(n - n_{l})}^{4} {(n_{l} - j)}^{2}}{n^{4} {(n_{l})}_{2}^{4}} Var (tr [(X_{l j} X_{l j}^{T} - Σ_{l}) (X_{l j} X_{l j}^{T} - Σ_{l})]) \\ + \sum_{l = 1}^{k} \sum_{j_{1} \neq j_{2}}^{n_{l}} \frac{{(n - n_{l})}^{4} {(n_{l} - j_{1} \lor j_{2})}^{2}}{n^{4} {(n_{l} - 1)}^{4} n_{l}^{4}} Var (tr [(X_{l j_{1}} X_{l j_{1}}^{T} - Σ_{l}) (X_{l j_{2}} X_{l j_{2}}^{T} - Σ_{l})]) \\ \leq & \sum_{l = 1}^{k} \frac{1}{{(n_{l})}_{2}^{2}} {tr}^{4} (Σ_{l}^{2}) [o (1) + O (\frac{1}{n_{l}}))] = o [{Var}^{2} (T_{1})] . \end{matrix}

Similarly,

\begin{matrix} R_{12} \leq & \sum_{l = 1}^{k} \sum_{j = 1}^{n_{l}} \frac{3 {(n - n_{l})}^{2} {(n_{l} - j)}^{2}}{n^{4} n_{l}^{4} {(n_{l} - 1)}^{2}} tr [{(Σ_{l} Q_{l j 2} Σ_{l}^{2})}^{2}] = \sum_{l = 1}^{k} \frac{{(n - n_{l})}^{4} (2 n_{l} - 1)}{2 n^{4} n_{l}^{3} (n_{l} - 1)} tr [{(Σ_{l}^{4})}^{2}] \\ \leq & \sum_{l = 1}^{k} \frac{{(n - n_{l})}^{4} (2 n_{l} - 1)}{2 n^{4} n_{l}^{3} (n_{l} - 1)} {tr}^{2} (Σ_{l}^{4}) = o [{Var}^{2} (T_{1})] . \end{matrix}

\begin{matrix} R_{13} \leq & \sum_{l = 1}^{k} \sum_{j = 1}^{n_{l}} \frac{3 {(n - n_{l})}^{2} {(n_{l} - j)}^{2}}{n^{4} n_{l}^{4} {(n_{l} - 1)}^{2}} tr [{(Σ_{l} Q_{l j 3} Σ_{l}^{2})}^{2}] \\ = & \sum_{l = 1}^{k} \frac{{(n - n_{l})}^{2} (2 n_{l} - 1)}{2 n^{4} n_{l} (n_{l} - 1)} tr {(Σ_{l}^{3} n_{l} \sum_{h \neq l}^{k} n_{h} Σ_{h})}^{2} \\ \leq & \sum_{l = l}^{k} \frac{{(n - n_{l})}^{2} (2 n_{l} - 1)}{2 n^{4} n_{l} (n_{l} - 1)} tr (Σ_{l}^{4}) tr {(Σ_{l} \sum_{h \neq l}^{k} n_{h} Σ_{h})}^{2} = o [{Var}^{2} (T_{1})] . \end{matrix}

We can obtain

R_{i 4} = o [{Var}^{2} (T_{1})]

for

i = 2, 3

and 4 by a similar method. It is noted that

R_{23} = R_{22} = R_{33} = 0

since

Q_{l j 2}

and

Q_{l j 3}

are non random.

As a result, it follows from the above equalities that

\begin{matrix} Var (\sum_{l = 1}^{k} \frac{1}{n^{2} n_{l}^{2}} \sum_{j = 1}^{n_{l}} 2 tr {(Q_{l j} Σ_{l})}^{2}) = o [{Var}^{2} (T_{1})] . \end{matrix}

Finally, using similar calculations, we can obtain

\begin{matrix} Var (\sum_{l = l}^{k} \frac{1}{n^{2} n_{l}^{2}} \sum_{j = l}^{n_{l}} tr (Γ_{l}^{T} Q_{l j} Γ_{l} \circ Γ_{l}^{T} Q_{l j} Γ_{l})) = o [{Var}^{2} (T_{1})] . \end{matrix}

This completes the proof of Lemma A2. □

Lemma A3.

Under the condition (C2), as

n, p \to \infty

,

\begin{matrix} \sum_{j = 1}^{n} E (D_{j}^{4}) = o [{V a r}^{2} (T_{1})] . \end{matrix}

Proof.

By some calculations, for some constants

c_{1}, c_{2}, c_{3}

and

c_{4}

, we can obtain

\begin{matrix} \sum_{j = 1}^{n} E (D_{j}^{4}) \leq c_{1} \sum_{l = 1}^{k} \frac{1}{n_{l}^{4}} {tr}^{4} (Σ_{l}^{2}) + c_{2} \sum_{l \neq h}^{k} \frac{n_{h}}{n_{l}^{2}} {tr}^{4} (Σ_{l} Σ_{h}) \end{matrix}

and

\begin{matrix} {Var}^{2} (T_{1}) \geq c_{3} \sum_{l = 1}^{k} \frac{n_{l}^{2} {(n - n_{l})}^{4}}{n^{4} {(n_{l} - 1)}^{2}} {tr}^{4} (Σ_{l}^{2}) + c_{4} \sum_{l \neq h}^{k} \frac{n_{l}^{2} n_{h}^{2}}{n^{4}} {tr}^{4} (Σ_{l} Σ_{h}) . \end{matrix}

As a result,

\begin{matrix} \frac{\sum_{j = 1}^{n} E (D_{j}^{4})}{{V a r}^{2} (T_{1})} \leq \sum_{l = 1}^{k} \frac{1}{n_{l}} \to 0 . \end{matrix}

This completes the proof of Lemma A3. □

Appendix B. Proof of Theorem 1

Proof.

According Lemmas A1–A3, as

n, p \to \infty

, it is easy to complete the proof of Theorem 1. □

References

Anderson, T.W. An Introduction to Multivariate Statistical Analysis, 3rd ed.; Wiley: Hoboken, NJ, USA, 2003. [Google Scholar]
Muirhead, R.J. Aspects of Multivariate Statistical Theory; Wiley: New York, NY, USA, 2005. [Google Scholar]
Schmidt, M.; Böhm, D.; Törne, C.; Steiner, E.; Puhl, A.; Pilch, H.; Lehr, H.; Hengstler, J.; Kölbl, H.; Gehrmann, M. The humoral immune system has a key prognostic impact in node-negative breast cancer. Cancer Res. 2008, 68, 5405–5413. [Google Scholar] [CrossRef] [PubMed]
Khan, J.; Wei, J.S.; Ringner, M.; Saal, L.H.; Ladanyi, M.; Westermann, F.; Berthold, F.; Schwab, M.; Antonescu, C.R.; Peterson, C.; et al. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nat. Med. 2001, 7, 673–679. [Google Scholar] [CrossRef] [PubMed]
Zhu, Z.; Ong, Y.S.; Dash, M. Markov blanket-embedded genetic algorithm for gene selection. Pattern Recognit. 2007, 49, 3236–3248. [Google Scholar] [CrossRef]
Li, T.; Zhang, C.; Ogihara, M. A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression. Bioinformatics 2004, 20, 2429–2437. [Google Scholar] [CrossRef] [PubMed]
Wilks, S.S. Sample criteria for testing equality of means, equality of variances, and equality of covariances in a normal multivariate distribution. Ann. Math. Stat. 1946, 17, 257–281. [Google Scholar] [CrossRef]
Ahmad, M.R. Testing homogeneity of several covariance matrices and multi-sample sphericity for high-dimensional data under non-normality. Commun. Stat. Theory Methods 2017, 46, 3738–3753. [Google Scholar] [CrossRef]
Schott, J.R.A. Test for the equality of covariance matrices when the dimension is large relative to the sample sizes. Comput. Stat. Data Anal. 2007, 51, 6535–6542. [Google Scholar] [CrossRef]
Srivastava, M.S.; Yanagihara, H. Testing the equality of several covariance matrices with fewer observations than the dimension. J. Multivar. Anal. 2010, 101, 1319–1329. [Google Scholar] [CrossRef]
Zhang, C.; Bai, Z.; Hu, J.; Wang, C. Multi-sample test for high-dimensional covariance matrices. Commun. Stat. Theory Methods 2018, 47, 3161–3177. [Google Scholar] [CrossRef]
Zhong, P.; Li, R.; Shanto, S. Homogeneity tests of covariance matrices with high-dimensional longitudinal data. Biometrika 2019, 106, 619–634. [Google Scholar] [CrossRef] [PubMed]
Zheng, S.; Lin, R.; Guo, J.; Yin, G. Testing homogeneity of high-dimensional covariance matrices. Stat. Sin. 2020, 30, 35–53. [Google Scholar] [CrossRef]
Qayed, A.; Han, D. Homogeneity test of several covariance matrices with high-dimensional data. J. Biopharm. Stat. 2021, 31, 523–540. [Google Scholar] [CrossRef] [PubMed]
Box, G.E.P. A general distribution theory for a class of likelihood criteria. Biometrika 1949, 36, 317–346. [Google Scholar] [CrossRef]
Li, J.; Chen, S. Two sample tests for high-dimensional covariance matrices. Ann. Stat. 2012, 40, 908–940. [Google Scholar] [CrossRef]
Aoshima, M.; Yata, K. Two-sample tests for high-dimension, strongly spiked eigenvalue models. Stat. Sin. 2018, 28, 43–62. [Google Scholar] [CrossRef]
Jiang, Y.; Wen, C.; Jiang, Y.; Wang, X.; Zhang, H. Use of Random Integration to Test Equality of High Dimensional Covariance Matrices. Stat. Sin. 2020. [Google Scholar] [CrossRef]
Chen, S.; Qin, Y. A two-sample test for high-dimensional data with applications to gene-set testing. Ann. Stat. 2010, 38, 808–835. [Google Scholar] [CrossRef]

Figure 1. The Empirical sizes of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under normal distribution.

Figure 2. The Empirical powers of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under normal distribution.

Figure 3. The Empirical sizes of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under normal distribution.

Figure 4. The Empirical powers of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under normal distribution.

Figure 5. The Empirical sizes of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under Gamma distribution.

Figure 6. The Empirical powers of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under Gamma distribution.

Figure 7. The Empirical sizes of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under Gamma distribution.

Figure 8. The Empirical powers of

T_{Z B H W} (r e d)

,

ρ L_{k} (y_{1}) (o r a n g e)

,

ρ L_{k} (y_{2}) (y e l l o w)

,

ρ L_{k} (y_{3}) (g r e e n)

,

ρ L_{k} (y_{4}) (c y a n)

,

T_{Z L G Y} (b l u e)

and

\hat{T} (p u r p l e)

under Gamma distribution.

Table 1. Empirical sizes and powers of seven tests under Case 1 and normal distribution.

	p	$T_{Z B H W}$	$ρ L_{k} (y_{1})$	$ρ L_{k} (y_{2})$	$ρ L_{k} (y_{3})$	$ρ L_{k} (y_{4})$	$T_{Z L G Y}$	$\hat{T}$
Size	16	0.0476	0.0501	0.0484	0.0508	0.0500	0.0225	0.0509
n = 45	32	0.0358	0.0497	0.0507	0.0505	0.0498	0.0227	0.0461
	64	0.0407	0.0492	0.0471	0.0472	0.0534	0.0317	0.0475
	128	0.0422	0.0516	0.0539	0.0532	0.0524	0.0414	0.0448
	256	0.0526	0.0495	0.0489	0.0452	0.0525	0.0710	0.0514
n = 95	16	0.0633	0.0524	0.0535	0.0522	0.0512	0.0719	0.0601
	32	0.0584	0.0488	0.0537	0.0515	0.0479	0.1230	0.0576
	64	0.0595	0.0518	0.0499	0.0472	0.0534	0.2814	0.0582
	128	0.0628	0.0477	0.0556	0.0499	0.0445	0.5641	0.0594
	256	0.0710	0.0478	0.0477	0.0534	0.0474	0.7692	0.0669
Power	16	0.1291	0.2582	0.1991	0.1224	0.1710	0.3301	0.2837
n = 45	32	0.1273	0.2526	0.2160	0.1709	0.1691	0.4441	0.3582
	64	0.1398	0.2528	0.2337	0.2039	0.1710	0.5516	0.4377
	128	0.1552	0.2523	0.2462	0.2270	0.1671	0.6594	0.5222
	256	0.1676	0.2595	0.2490	0.2444	0.1665	0.7380	0.5847
n = 95	16	0.5490	0.5605	0.4198	0.2466	0.3561	0.8783	0.8870
	32	0.6810	0.5684	0.4816	0.3566	0.3653	0.9559	0.9563
	64	0.7771	0.5669	0.5128	0.4494	0.3603	0.9861	0.9863
	128	0.8476	0.5670	0.5465	0.5048	0.3521	0.9946	0.9924
	256	0.8974	0.5509	0.5560	0.5359	0.3469	0.9974	0.9966

Table 2. Empirical sizes and powers of seven tests under Case 1 and Gamma distribution.

	p	$T_{Z B H W}$	$ρ L_{k} (y_{1})$	$ρ L_{k} (y_{2})$	$ρ L_{k} (y_{3})$	$ρ L_{k} (y_{4})$	$T_{Z L G Y}$	$\hat{T}$
Size	16	0.0630	0.0550	0.0610	0.0631	0.0632	0.0329	0.0644
n = 45	32	0.0467	0.0523	0.0526	0.0557	0.0561	0.0278	0.0511
	64	0.0434	0.0573	0.0533	0.0494	0.0552	0.0332	0.0537
	128	0.0449	0.0484	0.0494	0.0545	0.0514	0.0461	0.0504
	256	0.0490	0.0498	0.0499	0.0493	0.0514	0.0707	0.0492
n = 95	16	0.0822	0.0560	0.0657	0.0699	0.0631	0.0851	0.0789
	32	0.0741	0.0531	0.0610	0.0629	0.0594	0.1224	0.0698
	64	0.0619	0.0544	0.0535	0.0545	0.0540	0.2505	0.0617
	128	0.0638	0.0524	0.0540	0.0517	0.0477	0.5037	0.0627
	256	0.0686	0.0508	0.0493	0.0498	0.0517	0.7041	0.0668
Power	16	0.1375	0.2602	0.2085	0.1437	0.1866	0.3225	0.2815
n = 45	32	0.1392	0.2613	0.2221	0.1760	0.1815	0.4421	0.3639
	64	0.1429	0.2587	0.2427	0.2094	0.1746	0.5561	0.4463
	128	0.1652	0.2513	0.2538	0.2334	0.1695	0.6536	0.5155
	256	0.1658	0.2565	0.2514	0.2401	0.1639	0.7279	0.5824
n = 95	16	0.5336	0.5610	0.4219	0.2685	0.3780	0.8629	0.8730
	32	0.6772	0.5664	0.4895	0.3683	0.3681	0.9523	0.9551
	64	0.7749	0.5695	0.5256	0.4493	0.3709	0.9834	0.9849
	128	0.8460	0.5628	0.5391	0.5005	0.3540	0.9929	0.9923
	256	0.8922	0.5608	0.5477	0.5260	0.3470	0.9966	0.9978

Table 3. Empirical sizes and powers of seven tests in Case 2 and normal distribution.

	p	$T_{Z B H W}$	$ρ L_{k} (y_{1})$	$ρ L_{k} (y_{2})$	$ρ L_{k} (y_{3})$	$ρ L_{k} (y_{4})$	$T_{Z L G Y}$	$\hat{T}$
Size	16	0.0559	0.0501	0.0485	0.0502	0.0495	0.0242	0.0539
n = 45	32	0.0562	0.0474	0.0510	0.0499	0.0460	0.0303	0.0545
	64	0.0553	0.0482	0.0483	0.0501	0.0486	0.0312	0.0528
	128	0.0658	0.0471	0.0472	0.0484	0.0502	0.0455	0.0580
	256	0.0783	0.0497	0.0500	0.0518	0.0509	0.0629	0.0576
n = 95	16	0.0724	0.0511	0.0499	0.0484	0.0494	0.0750	0.0656
	32	0.0675	0.0492	0.0497	0.0486	0.0491	0.1132	0.0624
	64	0.0747	0.0532	0.0505	0.0483	0.0498	0.2468	0.0641
	128	0.0821	0.0479	0.0465	0.0473	0.0468	0.5114	0.0680
	256	0.0790	0.0514	0.0488	0.0514	0.0492	0.7645	0.0625
Power	16	0.0838	0.0869	0.0804	0.0662	0.0781	0.1447	0.1170
n = 45	32	0.0862	0.0933	0.0858	0.0743	0.0788	0.1969	0.1471
	64	0.0883	0.0928	0.0923	0.0838	0.0771	0.2706	0.1844
	128	0.0924	0.0915	0.0933	0.0940	0.0765	0.3505	0.2235
	256	0.1035	0.0989	0.0937	0.0907	0.0779	0.4172	0.2631
n = 95	16	0.2053	0.1640	0.1405	0.1048	0.1238	0.4722	0.4314
	32	0.2877	0.1693	0.1482	0.1220	0.1225	0.6532	0.5974
	64	0.3641	0.1659	0.1569	0.1405	0.1203	0.7805	0.7090
	128	0.4380	0.1740	0.1632	0.1559	0.1143	0.8758	0.8046
	256	0.5136	0.1680	0.1651	0.1595	0.1129	0.9364	0.8659

Table 4. Empirical sizes and powers of seven tests in Case 2 and Gamma distribution.

	p	$T_{Z B H W}$	$ρ L_{k} (y_{1})$	$ρ L_{k} (y_{2})$	$ρ L_{k} (y_{3})$	$ρ L_{k} (y_{4})$	$T_{Z L G Y}$	$\hat{T}$
Size	16	0.0724	0.0605	0.0620	0.0632	0.0594	0.0330	0.0667
n = 45	32	0.0570	0.0553	0.0583	0.0562	0.0548	0.0284	0.0556
	64	0.0603	0.0529	0.0508	0.0553	0.0564	0.0349	0.0540
	128	0.0669	0.0467	0.0509	0.0480	0.0547	0.0404	0.0556
	256	0.0806	0.0522	0.0529	0.0489	0.0526	0.0646	0.0581
n = 95	16	0.0886	0.0556	0.0653	0.0660	0.0649	0.0865	0.0844
	32	0.0792	0.0539	0.0554	0.0580	0.0580	0.1146	0.0718
	64	0.0794	0.0557	0.0550	0.0545	0.0555	0.2220	0.0696
	128	0.0774	0.0549	0.0524	0.0570	0.0490	0.4400	0.0658
	256	0.0855	0.0523	0.0504	0.0486	0.0505	0.7047	0.0691
Power	16	0.0976	0.1041	0.0937	0.0834	0.0889	0.1430	0.1253
n = 45	32	0.0882	0.0934	0.0961	0.0870	0.0839	0.1936	0.1486
	64	0.0893	0.0931	0.0942	0.0890	0.0823	0.2746	0.1844
	128	0.0908	0.0972	0.0965	0.0922	0.0748	0.3395	0.2197
	256	0.0982	0.1029	0.0918	0.0913	0.0734	0.4195	0.2553
n = 95	16	0.2248	0.1807	0.1547	0.1205	0.1464	0.4621	0.4391
	32	0.2886	0.1757	0.1641	0.1403	0.1267	0.6325	0.5913
	64	0.3671	0.1699	0.1681	0.1560	0.1253	0.7592	0.7119
	128	0.4392	0.1675	0.1715	0.1605	0.1184	0.8598	0.8064
	256	0.5194	0.1641	0.1682	0.1593	0.1129	0.9217	0.8760

Table 5. Empirical sizes and powers of seven tests in Case 3 and normal distribution.

	p	$T_{Z B H W}$	$ρ L_{k} (y_{1})$	$ρ L_{k} (y_{2})$	$ρ L_{k} (y_{3})$	$ρ L_{k} (y_{4})$	$T_{Z L G Y}$	$\hat{T}$
Size	16	0.0530	0.0517	0.0478	0.0495	0.0484	0.1897	0.0545
n = 45	32	0.0491	0.0539	0.0513	0.0514	0.0503	0.2663	0.0517
	64	0.0463	0.0521	0.0498	0.0487	0.0491	0.3956	0.0513
	128	0.0490	0.0481	0.0503	0.0496	0.0475	0.5715	0.0508
	256	0.0492	0.0512	0.0491	0.0478	0.0501	0.7879	0.0511
n = 95	16	0.0455	0.0489	0.0516	0.0493	0.0496	0.2732	0.0520
	32	0.0407	0.0484	0.0508	0.0493	0.0493	0.3462	0.0535
	64	0.0357	0.0471	0.0501	0.0516	0.0496	0.4335	0.0483
	128	0.0386	0.0483	0.0496	0.0465	0.0483	0.5736	0.0476
	256	0.0381	0.0497	0.0501	0.0532	0.0494	0.7541	0.0506
Power	16	0.9951	0.9363	0.9047	0.8594	0.8190	0.9981	0.9986
n = 45	32	0.9988	0.9446	0.9331	0.9079	0.8428	0.9993	0.9997
	64	0.9996	0.9415	0.9403	0.9284	0.8500	0.9997	0.9999
	128	0.9998	0.9433	0.9404	0.9386	0.8547	0.9994	1.0000
	256	0.9999	0.9504	0.9458	0.9476	0.8608	0.9983	1.0000
n = 95	16	1.0000	0.9999	0.9996	0.9974	0.9952	1.0000	1.0000
	32	1.0000	1.0000	1.0000	0.9999	0.9972	1.0000	1.0000
	64	1.0000	1.0000	0.9998	0.9998	0.9980	1.0000	1.0000
	128	1.0000	1.0000	1.0000	0.9999	0.9983	1.0000	1.0000
	256	1.0000	1.0000	0.9999	1.0000	0.9986	1.0000	1.0000

Table 6. Empirical sizes and powers of seven tests in Case 3 and Gamma distribution.

	p	$T_{Z B H W}$	$ρ L_{k} (y_{1})$	$ρ L_{k} (y_{2})$	$ρ L_{k} (y_{3})$	$ρ L_{k} (y_{4})$	$T_{Z L G Y}$	$\hat{T}$
Size	16	0.0743	0.0540	0.0587	0.0633	0.0637	0.1885	0.0740
n = 45	32	0.0599	0.0553	0.0544	0.0530	0.0551	0.2490	0.0622
	64	0.0532	0.0476	0.0512	0.0536	0.0557	0.3514	0.0544
	128	0.0506	0.0508	0.0494	0.0511	0.0511	0.5232	0.0503
	256	0.0493	0.0471	0.0533	0.0516	0.0487	0.7273	0.0510
n = 95	16	0.0619	0.0526	0.0614	0.0679	0.0635	0.2469	0.0709
	32	0.0441	0.0513	0.0565	0.0562	0.0501	0.3059	0.0553
	64	0.0430	0.0526	0.0537	0.0524	0.0558	0.3869	0.0521
	128	0.0368	0.0534	0.0500	0.0506	0.0511	0.5153	0.0478
	256	0.0365	0.0494	0.0463	0.0489	0.0523	0.7029	0.0445
Power	16	0.9924	0.9316	0.9007	0.8519	0.8251	0.9968	0.9972
n = 45	32	0.9978	0.9404	0.9286	0.9059	0.8451	0.9984	0.9994
	64	0.9996	0.9411	0.9365	0.9273	0.8532	0.9994	1.0000
	128	0.9999	0.9449	0.9444	0.9360	0.8564	0.9993	1.0000
	256	1.0000	0.9421	0.9459	0.9432	0.8559	0.9986	1.0000
n = 95	16	0.9998	0.9999	0.9995	0.9979	0.9937	1.0000	1.0000
	32	1.0000	0.9997	0.9996	0.9995	0.9974	1.0000	1.0000
	64	1.0000	0.9999	0.9999	0.9998	0.9979	1.0000	1.0000
	128	1.0000	0.9999	0.9999	0.9998	0.9979	1.0000	1.0000
	256	1.0000	0.9999	1.0000	1.0000	0.9984	1.0000	1.0000

Table 7. Empirical sizes and powers of seven tests in Case 4 and normal distribution.

	p	$T_{Z B H W}$	$ρ L_{k} (y_{1})$	$ρ L_{k} (y_{2})$	$ρ L_{k} (y_{3})$	$ρ L_{k} (y_{4})$	$T_{Z L G Y}$	$\hat{T}$
Size	16	0.0517	0.0490	0.0514	0.0484	0.0522	0.1209	0.0548
n = 45	32	0.0481	0.0496	0.0506	0.0507	0.0541	0.2649	0.0507
	64	0.0487	0.0480	0.0495	0.0493	0.0487	0.3975	0.0512
	128	0.0459	0.0496	0.0459	0.0515	0.0510	0.5640	0.0511
	256	0.0455	0.0504	0.0483	0.0469	0.0520	0.7811	0.0493
n = 95	16	0.0450	0.0470	0.0522	0.0489	0.0550	0.2487	0.0513
	32	0.0388	0.0504	0.0511	0.0514	0.0502	0.3486	0.0484
	64	0.0381	0.0504	0.0503	0.0488	0.0502	0.4379	0.0480
	128	0.0329	0.0530	0.0511	0.0487	0.0523	0.5704	0.0441
	256	0.0348	0.0498	0.0508	0.0514	0.0462	0.7527	0.0438
Power	16	0.5566	0.0872	0.0922	0.0930	0.0933	0.5331	0.5965
n = 45	32	0.5749	0.0710	0.0698	0.0724	0.0749	0.6018	0.6180
	64	0.5935	0.0596	0.0591	0.0601	0.0633	0.6495	0.6432
	128	0.6095	0.0547	0.0507	0.0522	0.0532	0.7108	0.6546
	256	0.6142	0.0516	0.0527	0.0508	0.0552	0.8269	0.6615
n = 95	16	0.7418	0.1311	0.1361	0.1359	0.1338	0.8735	0.9232
	32	0.7846	0.0923	0.0916	0.0966	0.0953	0.9024	0.9548
	64	0.8089	0.0680	0.0738	0.0735	0.0679	0.9151	0.9753
	128	0.8184	0.0650	0.0607	0.0615	0.0626	0.9166	0.9824
	256	0.8258	0.0560	0.0548	0.0533	0.0549	0.9161	0.9849

Table 8. Empirical sizes and powers of seven tests in Case 4 and Gamma distribution.

	p	$T_{Z B H W}$	$ρ L_{k} (y_{1})$	$ρ L_{k} (y_{2})$	$ρ L_{k} (y_{3})$	$ρ L_{k} (y_{4})$	$T_{Z L G Y}$	$\hat{T}$
Size	16	0.0776	0.0559	0.0589	0.0632	0.0641	0.1338	0.0773
n = 45	32	0.0634	0.0526	0.0534	0.0584	0.0567	0.2403	0.0642
	64	0.0532	0.0488	0.0527	0.0527	0.0540	0.3579	0.0546
	128	0.0520	0.0533	0.0500	0.0517	0.0480	0.5195	0.0534
	256	0.0483	0.0495	0.0506	0.0482	0.0512	0.7316	0.0530
n = 95	16	0.0674	0.0586	0.0605	0.0605	0.0668	0.2348	0.0729
	32	0.0482	0.0528	0.0530	0.0586	0.0514	0.2945	0.0613
	64	0.0425	0.0511	0.0491	0.0551	0.0525	0.3839	0.0533
	128	0.0420	0.0523	0.0522	0.0491	0.0499	0.5164	0.0520
	256	0.0388	0.0456	0.0488	0.0519	0.0511	0.6998	0.0490
Power	16	0.5435	0.0993	0.1039	0.1028	0.1026	0.5191	0.5735
n = 45	32	0.5685	0.0697	0.0741	0.0787	0.0756	0.5717	0.6106
	64	0.5901	0.0606	0.0601	0.0615	0.0602	0.6153	0.6368
	128	0.6124	0.0551	0.0552	0.0592	0.0551	0.6810	0.6606
	256	0.6133	0.0511	0.0512	0.0554	0.0510	0.7685	0.6645
n = 95	16	0.7283	0.1401	0.1468	0.1506	0.1438	0.8526	0.9070
	32	0.7695	0.0973	0.0997	0.1000	0.1019	0.8832	0.9513
	64	0.8081	0.0790	0.0745	0.0738	0.0749	0.9100	0.9746
	128	0.8158	0.0577	0.0627	0.0621	0.0611	0.8980	0.9835
	256	0.8231	0.0570	0.0545	0.0601	0.0550	0.8966	0.9843

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Homogeneity Test of Multi-Sample Covariance Matrices in High Dimensions

Abstract

1. Introduction

2. Preliminaries

3. Main Results

4. Simulation Studies

5. Real Data Analysis

6. Concluding Remarks

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Some Lemmas

Appendix B. Proof of Theorem 1

References

Article Metrics

Citations

Article Access Statistics