Abstract
We derive an extension of the test statistic for testing the equality of means for high-dimensional k-th order array-variate data using the k-self similar compound symmetry (k-SSCS) covariance structure. k-th order data appear in many scientific fields, including agricultural, medical, environmental, and engineering applications. We discuss a key property of this k-SSCS covariance structure, namely, that it forms a Jordan algebra. We formally show that our test statistic for k-th order data generalizes the corresponding test statistics for second-order and third-order data. We also derive the test statistic for third-order data and illustrate its application using a medical dataset from a clinical trial study of the eye disease glaucoma. The new test statistic is very efficient for high-dimensional data, where the estimation of an unstructured variance-covariance matrix is not feasible owing to the small sample size.
1. Introduction
We study the hypothesis testing problem of equality of means for high-dimensional and higher-order (multi-dimensional array) data. Standard multivariate techniques such as Hotelling’s T² with one big unstructured variance-covariance matrix (assuming a large sample size) do not work for these higher-order data, as Hotelling’s T² cannot incorporate any higher-order information into the test statistic and thus draws wrong conclusions []. Higher-order data are formed by representing the additional associations that are inherent in repetition across several dimensions. To obtain a better understanding of higher-order data, we first present a simple example:
- Traditional multivariate (vector-variate) data are first-order data. For example, consider a clinical trial study of glaucoma, in which several measurements, such as intraocular pressure (IOP) and central corneal thickness (CCT), are effective in the diagnosis of glaucoma. This is an illustration of vector-variate first-order data.
- When the first-order data are measured at various locations/sites or time points, the data become two-dimensional matrix-variate data, which we call second-order data. These data are also recognized as multivariate repeated measures data or doubly multivariate data, e.g., multivariate spatial data or multivariate temporal data. In the above example of the clinical trial study, an ophthalmologist or optometrist diagnoses glaucoma by measuring IOP and CCT in both eyes. So, we see how the vector-variate first-order dataset discussed above becomes a matrix-variate second-order dataset by measuring variables repeatedly over another dimension.
- When the second-order data are measured at various sites or over various time points, the data become three-dimensional array-variate data, which we call third-order data. These are also recognized as triply multivariate data, e.g., multivariate spatio-temporal data or multivariate spatio-spatio data. In the previous example, if the IOP and CCT are measured in both eyes as well as over, say, three time points, the dataset becomes third-order data.
- When the third-order data are measured in various directions, the data become four-dimensional array-variate fourth-order data, e.g., multivariate directo-spatio-temporal data or multivariate directo-spatio-spatio data.
- When the fourth-order data are measured at various depths, the data become five-dimensional array-variate fifth-order data, and so on, e.g., multivariate deptho-directo-spatio-temporal data.
In the above glaucoma data example, the dataset has variables that are repeatedly measured over various dimensions, adding higher-order information to the dataset. Now, the question is, what is this higher-order information? Higher-order information is embedded in the higher-order covariance structures that are formed by the additional associations inherent in the repetition of the variables across several dimensions. The other question is, how can we measure and capture the higher-order information? For this, one needs to understand how to read these structured higher-order data and how to use an appropriate variance-covariance structure to incorporate the higher-order information that is integral to the higher-order data.
Higher-order data have been studied by many authors over the last 20 years using various variance-covariance structures to reduce the number of unknown parameters, which is very important for high-dimensional data. Second-order data are studied using the matrix-variate normal distribution [,]. Second-order data can also be analyzed vectorially using a two-separable (Kronecker product) variance-covariance structure [,], or a block compound symmetry (BCS) structure, also called a block exchangeable (BE) or 2-SSCS covariance structure []. The two-separable covariance structure for second-order data has two covariance matrices, one for each order of the data; in other words, one covariance matrix for within-subject information and the other for between-subject information. Combining the covariance structures of within-subject and between-subject information results in a second-order model for second-order data. Ignoring this information often distorts the test statistic, and if it is not properly accounted for, the test statistic will yield wrong conclusions []. To obtain a picture of third-order data, see []. Manceur and Dutilleul [] used the tensor normal distribution with a doubly separable covariance structure. The 2-SSCS and 3-SSCS covariance structures are useful tools for the analyses of second- and third-order datasets, respectively. Manceur and Dutilleul [] also studied fourth-order data with a four-separable covariance structure. In the same way, k-th order data can be analyzed vectorially with a structured variance-covariance matrix that integrates the higher-order information into the model, e.g., a k-separable covariance structure [,] for k-th order data. However, the k-separable covariance structure may not be appropriate for all datasets; thus, in this article we investigate another structure, namely, the k-SSCS covariance structure (defined in Section 3), for k-th order data. See [].
High-dimensionality requires exploiting the structural properties of the data to reduce the number of estimated degrees of freedom and reach more accurate conclusions for k-th order data, and the k-SSCS covariance structure is one such property. For example, for the third-order glaucoma data, the number of unknown parameters in the 12 × 12 unstructured variance-covariance matrix is 78, whereas the number of unknown parameters for the 3-SSCS covariance structure is just 9, which may help in providing the correct information about the true association of the structured third-order data. The data quickly become high-dimensional as the order of the data increases; the estimated variance-covariance matrix then becomes singular for small samples, and testing of the mean is not possible. This necessitates the development of new statistical methods with a suitable structured variance-covariance matrix, which can integrate the existing correlation information of the higher-order data into the test statistic and can also cope with the high-dimensionality of the data.
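To make the parameter savings concrete, the following small sketch (ours, not the authors') counts parameters for the glaucoma dimensions quoted above: two response variables, two eyes, and three time points.

```python
# Parameter counts for the third-order glaucoma data: m = 2 response
# variables (IOP, CCT), u = 2 sites (eyes), v = 3 time points.
m, u, v = 2, 2, 3
p = m * u * v                       # full dimension of a vectorized observation: 12

# An unstructured p x p covariance matrix has p(p+1)/2 free parameters.
n_unstructured = p * (p + 1) // 2   # 78

# A 3-SSCS structure is determined by three m x m symmetric component
# matrices, each contributing m(m+1)/2 parameters.
k = 3
n_sscs = k * m * (m + 1) // 2       # 9

print(n_unstructured, n_sscs)       # prints: 78 9
```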
Rao [,] introduced the 2-SSCS covariance structure while classifying genetically different groups. Olkin and Press [] examined a circular stationary model. The problem of estimation in balanced multilevel models with a block circular symmetric covariance structure was studied by Liang et al. []. Olkin [] studied the hypothesis testing problem of the equality of the mean vectors of multiple populations of second-order data using a 2-SSCS covariance structure, which is reminiscent of a model of Wilks []. Arnold [] studied normal testing problems in which the mean is patterned and the variance-covariance matrix has a 2-SSCS structure. Arnold [] also studied the multivariate analysis of variance problem when the variance-covariance matrix has a 2-SSCS structure. Arnold [] later developed linear models with a 2-SSCS structure for the error matrix of one matrix-variate observation. Roy et al. [] and Žežula et al. [] studied hypothesis testing problems on the mean for second-order data using a 2-SSCS covariance structure. There are a few studies on third-order data using the 3-SSCS covariance structure; see Leiva and Roy [] for classification problems and Roy and Fonseca [] for linear models with a 3-SSCS covariance structure on the error vectors. Recently, Žežula et al. [] studied the mean value test for third-order data using a 3-SSCS covariance structure.
A majority of the above-mentioned authors studied only second-order matrix-variate data and used a 2-SSCS covariance structure, where the exchangeability (invariance) property was present in one factor. However, we now obtain datasets with more than one factor, and the assumption of exchangeability on the levels of those factors is often appropriate. A k-SSCS structured matrix results from the exchangeability property of the factors of a dataset. Employing a 2-SSCS covariance structure would be wrong for datasets with more than one exchangeable factor. One may construct second-order data from k-th order data by summing the observations over dimensions; however, this results in a loss of detailed information about particular characteristics that may be of interest. One may also consider matricization of the k-th order data to second-order data and then use the 2-SSCS covariance structure, but then, once again, the higher-order correlation information will be wiped out. So, new statistical methods are in demand to handle k-th order data using the k-SSCS variance-covariance matrix.
The aim of this paper is to derive a test statistic for the mean of high-dimensional k-th order data using the k-SSCS covariance matrix by generalizing the test statistics developed in Žežula et al. []. In doing so, we exploit the distributions of the eigenblocks of the k-SSCS covariance matrix. We obtain test statistics for the mean in the one-sample case, the paired-samples case, and the two-independent-samples case. We show in Remark 2 that our generalized test statistic for k-th order data reduces to the test statistic for second-order data, and there we also derive the test statistic for third-order data, largely motivated by the work of Žežula et al. [].
This article is organized as follows. In Section 2, we set up some preliminaries about matrix notations and definitions related to block matrices. Section 3 defines the k-SSCS covariance matrix and discusses its properties, such as the Jordan algebra property. Section 4 discusses the estimation of the eigenblocks and their distributions. The test for the mean of one population is proposed in Section 5. Tests for the equality of means of two populations are proposed in Section 6, and an example of a dataset exemplifying our proposed method is presented in Section 7. Finally, Section 8 concludes with some discussion and the scope for future research.
2. Preliminaries
Let for be natural numbers greater than and be given by:
with We denote by the set , for
Definition 1.
We say that a matrix is a k-th order block matrix according to the factorization , to point out that it can be expressed in k different “natural” partitioned matrix forms, that is:
Note that for the case , the matrix is a dimensional matrix with blocks. Clearly, both and for second-order data, and , and for third-order data, and so on. Next, we define matrix operators that will be useful tools in working with these k-th order block matrices, where . Let denote the set of -matrices.
Definition 2.
Let and denote the and block operators from to for , respectively, where will always be evident from the context. These block operators applied to a matrix
give the following -matrices:
The subscript in these block matrix operators represents the -dimensional blocks in a partitioned square matrix , and thus their use results in -dimensional matrices. Many useful properties of these block operators, which we will use later in this article, are examined in Leiva and Roy []. For any natural number , we use the following additional notations:
where , with being the vector of ones, and being the ith column vector of the identity matrix . Observe that and are idempotent matrices and mutually orthogonal to each other.
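In the usual compound symmetry notation, the two matrices in question are the averaging projector built from the vector of ones and its orthogonal complement; here is a minimal numeric sketch, with names of our choosing.

```python
# Assumed notation (ours): Jbar_n = (1/n) * ones matrix (averaging
# projector) and Q_n = I_n - Jbar_n (centering projector) -- the standard
# idempotent, mutually orthogonal pair behind compound symmetry.
import numpy as np

n = 4
Jbar = np.full((n, n), 1.0 / n)   # projects onto the span of the ones vector
Q = np.eye(n) - Jbar              # projects onto its orthogonal complement

assert np.allclose(Jbar @ Jbar, Jbar)            # idempotent
assert np.allclose(Q @ Q, Q)                     # idempotent
assert np.allclose(Jbar @ Q, np.zeros((n, n)))   # mutually orthogonal
```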
For a fixed natural number let be the -matrix
where the symbol ⊗ represents the Kronecker product operator and for each
with
and . Also, let be the -matrix such that
where
3. Properties of the Self Similar Compound Symmetry Covariance Matrix
Let be an -variate vector of measurements on the rth replicate (individual) at the factor combination. Let be the -variate vector of all measurements corresponding to the rth sample unit of the population, that is, . Thus, the unstructured covariance matrix has unknown parameters, which can be large for even moderate values of the ’s. Consequently, if the data are high-dimensional, the k-SSCS covariance matrix (defined below in Definition 3), with unknown parameters, is a good choice if the exchangeability feature is present in the data.
Definition 3.
We say that has a k-SSCS covariance matrix if is of the form:
where for are -matrices called SSCS-component matrices, with the assumption that is equal to the real number 1.
The covariance matrix given in (8) is called the k-self similar compound symmetry covariance matrix because, if we consider the -dimensional vector with a k-SSCS covariance matrix and, for each fixed , we also consider the partition of into -subvectors, then its corresponding covariance matrix is partitioned into -submatrices and is a -SSCS matrix (see Leiva and Roy []) as follows:
where is the g-SSCS matrix given by:
The existence of can be proved, and its expression derived, using the principle of mathematical induction. For the expression of , we need the matrices for , which are defined as follows:
where and Note that
It can be proved that if matrices are non-singular, then exists and is given by:
(see Leiva and Roy []), where the symbol indicates the zero matrix . It is worthwhile to note that the structure of is the same as the structure of that is, it has the k-SSCS structure given in (9) with (10) and (11) and
where, in this formula , is as follows
Using similar inductive arguments, it can be proved that:
where the matrices are given by (12), and it is assumed that and . The matrices are the k eigenblocks of the k-SSCS covariance structure. See Lemma 4 of Leiva and Roy [] for proof. The matrix can be written as the following sum of k orthogonal parts:
and if exists, then it can be written as:
where is given in (5), for each and, for is given in (7).
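The decomposition above is easiest to see concretely for k = 3. The sketch below constructs a 3-SSCS matrix under the familiar doubly exchangeable representation, which is our reading of (8) and (9); the numeric component matrices are illustrative only.

```python
# Sketch: a 3-SSCS (doubly exchangeable) covariance matrix built from
# m x m component matrices U0, U1, U2 via
#   Sigma = I_v (x) I_u (x) (U0 - U1) + I_v (x) J_u (x) (U1 - U2)
#           + J_v (x) J_u (x) U2,
# where (x) is the Kronecker product and J_n is the n x n matrix of ones.
import numpy as np

m, u, v = 2, 2, 3
U0 = np.array([[4.0, 1.0], [1.0, 3.0]])   # same eye, same time point
U1 = np.array([[1.5, 0.5], [0.5, 1.0]])   # between eyes, same time point
U2 = np.array([[0.8, 0.2], [0.2, 0.5]])   # between time points

I_u, I_v = np.eye(u), np.eye(v)
J_u, J_v = np.ones((u, u)), np.ones((v, v))
Sigma = (np.kron(I_v, np.kron(I_u, U0 - U1))
         + np.kron(I_v, np.kron(J_u, U1 - U2))
         + np.kron(J_v, np.kron(J_u, U2)))

# Every m x m block of Sigma equals U0, U1, or U2, depending only on
# whether the eye/time indices coincide -- the exchangeability feature.
print(np.linalg.eigvalsh(Sigma).min() > 0)  # True: positive definite here
```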
The conventional Hotelling’s T² statistic to test the mean is based on the unbiased estimate of the unstructured variance-covariance matrix, which follows a Wishart distribution. Nevertheless, the unbiased estimate of the k-SSCS covariance matrix does not follow a Wishart distribution, and thus the test statistic to test the equality of means does not follow Hotelling’s T² distribution. We thus make a canonical transformation of the data to block diagonalize the k-SSCS covariance matrix, show that scalar multiples of the estimates of the diagonal blocks (eigenblocks) follow independent Wishart distributions, and use this property to our advantage to obtain test statistics to test the mean for k-th order data. We see from Leiva and Roy [] that the k-SSCS matrix given by (8) can be transformed into an -block diagonal matrix (the blocks in the diagonal are -matrices) by pre- and post-multiplying by appropriate orthogonal matrices.
For let denote the identity matrix, that is:
and let
where is a Helmert matrix for each , i.e., each is an orthogonal matrix whose first column is proportional to the vector of ones. Then:
is an orthogonal matrix (note that the are not functions of any of the ’s), and in particular
Lemma 4 of Leiva and Roy [] states and proves the block diagonalization result of the k-SSCS matrix by using the orthogonal matrix as defined in (16), that is:
where, for each the -diagonal matrices are given by:
where is not taken into consideration, that is:
Thus, are the k eigenblocks of the k-SSCS covariance matrix . We will obtain the estimators of the eigenblocks in Section 4. In the following section, we briefly discuss that the k-SSCS covariance structure is of the Jordan algebra type.
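The block diagonalization just described can be checked numerically. The sketch below assumes the standard doubly exchangeable eigenblock formulas for k = 3 (consistent with the multiplicities quoted at the end of Section 4) and uses SciPy's Helmert matrices for the orthogonal factors.

```python
# Sketch: block-diagonalize a 3-SSCS matrix with Helmert-based orthogonal
# matrices and verify the three eigenblocks and their positions.
import numpy as np
from scipy.linalg import helmert

m, u, v = 2, 2, 3
U0 = np.array([[4.0, 1.0], [1.0, 3.0]])
U1 = np.array([[1.5, 0.5], [0.5, 1.0]])
U2 = np.array([[0.8, 0.2], [0.2, 0.5]])
I_u, I_v = np.eye(u), np.eye(v)
J_u, J_v = np.ones((u, u)), np.ones((v, v))
Sigma = (np.kron(I_v, np.kron(I_u, U0 - U1))
         + np.kron(I_v, np.kron(J_u, U1 - U2))
         + np.kron(J_v, np.kron(J_u, U2)))

Hu = helmert(u, full=True).T   # orthogonal, first column = ones/sqrt(u)
Hv = helmert(v, full=True).T
G = np.kron(Hv, np.kron(Hu, np.eye(m)))
D = G.T @ Sigma @ G            # block diagonal, with m x m diagonal blocks

Delta1 = U0 - U1                                # multiplicity v*(u-1) = 3
Delta2 = U0 + (u - 1) * U1 - u * U2             # multiplicity v-1 = 2
Delta3 = U0 + (u - 1) * U1 + u * (v - 1) * U2   # multiplicity 1
for s in range(u * v):
    blk = D[s * m:(s + 1) * m, s * m:(s + 1) * m]
    expect = Delta3 if s == 0 else (Delta2 if s % u == 0 else Delta1)
    assert np.allclose(blk, expect)
```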
k-SSCS Covariance Structure Is of the Jordan Algebra Type
The k-SSCS covariance structure is of the Jordan algebra type (Jordan et al. []). Let be the set of all k-SSCS matrices. It is clear that, under the usual matrix addition and scalar multiplication, is a subspace of the linear vector space of symmetric matrices. For any natural number , it is easy to prove the following proposition:
Proposition 1.
Therefore, we conclude that is a Jordan algebra. See Lemma 4.1 on page 10 in Malley [], which states that is a Jordan algebra if and only if for all . See Roy et al. [] and Kozioł et al. [] for proofs that the 2-SSCS and 3-SSCS covariance structures are of Jordan algebra type.
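As a numeric illustration of the closure property behind Proposition 1, the following sketch squares a 3-SSCS matrix and verifies that the square again has the 3-SSCS block pattern; the component matrices are the illustrative ones used earlier.

```python
# Sketch: the 3-SSCS class is closed under squaring (Jordan-algebra
# property): every m x m block of Sigma @ Sigma takes one of only three
# values, according to whether the eye/time indices coincide.
import numpy as np

m, u, v = 2, 2, 3
U0 = np.array([[4.0, 1.0], [1.0, 3.0]])
U1 = np.array([[1.5, 0.5], [0.5, 1.0]])
U2 = np.array([[0.8, 0.2], [0.2, 0.5]])
I_u, I_v = np.eye(u), np.eye(v)
J_u, J_v = np.ones((u, u)), np.ones((v, v))
Sigma = (np.kron(I_v, np.kron(I_u, U0 - U1))
         + np.kron(I_v, np.kron(J_u, U1 - U2))
         + np.kron(J_v, np.kron(J_u, U2)))
S2 = Sigma @ Sigma

def block(M, i, j):   # the m x m block in position (i, j)
    return M[i * m:(i + 1) * m, j * m:(j + 1) * m]

V0, V1, V2 = block(S2, 0, 0), block(S2, 0, 1), block(S2, 0, 2)
for s in range(u * v):
    for t in range(u * v):
        same_time = (s // u) == (t // u)
        expect = V0 if s == t else (V1 if same_time else V2)
        assert np.allclose(block(S2, s, t), expect)
```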
4. Estimators of the Eigenblocks
Let be random vectors partitioned into subvectors as follows:
The vectors are a random sample from a population with distribution where is a positive definite k-SSCS structured covariance matrix as given in (8) in Definition 3. Let be the -sample data matrix as follows:
with
In this section, we prove that certain unbiased estimators (to be defined) of the matrix parameters can be written as functions of the usual sample variance-covariance matrix as follows:
where is given in (2) with (3). Now the sample mean can be expressed as:
Thus, in can be expressed as:
Since is an unbiased estimator of , we have:
Therefore, to find a better unbiased estimator of , we average all the above random matrices that are unbiased estimators of the same . The unbiased estimators of for each are derived in Lemma 5 in Leiva and Roy [], with defined in Lemma 3 in Leiva and Roy [], as:
Unbiased estimators of the eigenblocks can be obtained from (13). Then, using (14), the unbiased estimators of can be obtained as the following orthogonal sums:
and if exists, it can be obtained from (15) as follows:
where is given in (5), for each and, for is given in (7).
The computation of the unbiased estimates of the component matrices for each is easy, as all of them have explicit solutions. At this point, we want to mention that, for the k-separable covariance structure, the estimates of the component matrices are not easy to obtain, as the MLEs satisfy implicit equations and are therefore not analytically tractable. Now, from Theorem 1 of Leiva and Roy [], we see that a multiple of the unbiased estimators of the eigenblocks , for each , have Wishart distributions as follows:
where given by (4) with (5) and (6) with (7), are independent and
From Corollary 1 of Leiva and Roy [], the 2-SSCS covariance matrix for second-order data or multivariate repeated measures data has two eigenblocks, and with multiplicity , and their distributions are as follows:
The 3-SSCS covariance matrix for third-order data has three eigenblocks, , with multiplicity and with multiplicity , and their distributions are as follows:
5. Test for the Mean
5.1. One Sample Test
Using the notation and assumptions in Section 4, let be a -dimensional data matrix formed from the random samples from . Let be the sample mean, then . We are interested in testing the following hypothesis:
for known . For testing hypothesis (21), we use the test statistic defined as:
5.1.1. Distribution of Test Statistic under
Now, let be the matrix as given in (16). We use here the following canonical transformation:
Therefore, . Then, according to (17) with (18), we have:
where, for each the diagonal - matrices are given by:
where is not taken into consideration, and the component vectors with are independent. The distribution of , under is given by:
Since
for we have:
and for we have:
Therefore, using (20), the statistic in (22) can be written as:
that is,
where for
and for , we assume
Note that the subsets of vectors involved in respectively, form a partition of the set of independent vectors . Therefore, are mutually independent. Moreover, since for
where
and
with
Therefore, given by (24) reduces to
and has a Lawley–Hotelling trace (LH-trace) distribution denoted by if . Note that, using (25), the case reduces to and . Then, has the LH-trace distribution if .
Thus, the distribution of given by (23) is the convolution of k independent LH-trace distributions:
The critical values of this distribution can be obtained using simulations. However, the LH-trace distribution is usually approximated by an F distribution, and we use here the second approximation suggested in McKeon []. For the jth case, i.e., for , let us use the notations , and
Then, the distribution
of can be approximated by where and
Finally, for , the distribution
is the usual Hotelling’s T², that is, distributed as an exact F distribution as follows:
This means that the distribution of can be approximated by the convolution of the above k distributions (k − 1 approximated F distributions and one exact F distribution), where its critical values are obtained by the method suggested by Dyer [].
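Since the null distribution is a convolution, a direct simulation is often the most transparent way to obtain critical values. The sketch below is ours: it adopts one common convention for the LH-trace variable, namely nu * tr(H E^{-1}) with H ~ W_p(q, I) and E ~ W_p(nu, I) independent, and the parameter triples are illustrative, not values from the paper.

```python
# Monte Carlo critical value for a sum of independent LH-trace variables.
import numpy as np
rng = np.random.default_rng(0)

def lh_trace(p, q, nu, rng):
    """One draw of an LH-trace statistic, nu * tr(H @ inv(E))."""
    Xh = rng.standard_normal((q, p))
    Xe = rng.standard_normal((nu, p))
    H, E = Xh.T @ Xh, Xe.T @ Xe
    return nu * np.trace(H @ np.linalg.inv(E))

def convolution_quantile(terms, alpha=0.05, reps=20_000, rng=rng):
    """Upper-alpha critical value of a sum of independent LH traces."""
    draws = np.zeros(reps)
    for (p, q, nu) in terms:
        draws += np.array([lh_trace(p, q, nu, rng) for _ in range(reps)])
    return np.quantile(draws, 1.0 - alpha)

# e.g., three independent terms, as in the third-order case (illustrative):
print(convolution_quantile([(2, 3, 29), (2, 2, 29), (2, 1, 29)]))
```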
Remark 1.
The statistic has an LH-trace distribution if . We also note that has the LH-trace distribution if . Now, for k-th order data, all for and . See Definition 1. Now, . Therefore, and then . Thus, . Since for all k-th order data, , we have when . Therefore, the only constraint needed on the sample size in order for all to follow LH-trace distributions is , i.e., , regardless of any . In essence, the minimum sample size needed to compute the test statistic is , although the minimum sample size needed to compute Hotelling’s T² test statistic is , where is the full dimension of the observations. For this reason, one cannot compute Hotelling’s T² test statistic for a small-sample dataset where , whereas our test statistic remains computable.
We will now discuss some special cases of the statistic in the following remark.
Remark 2.
For second-order data or multivariate repeated measures data, . Now, is distributed as LH-trace distribution , and is distributed as LH-trace distribution as follows
Thus, is distributed as LH-trace distribution .
So, we see that this test exactly matches the test obtained by Žežula et al. [] for multivariate repeated measures data (second-order data) with the 2-SSCS or BCS covariance structure. Therefore, we can say that the mean test statistic in this article generalizes Žežula et al.’s [] mean test statistic to k-th order data with the k-SSCS covariance structure.
We will now derive the mean test statistic for third-order data with 3-SSCS covariance structure. For third-order data, . Now, is distributed as LH-trace distribution , and is distributed as LH-trace distribution as follows:
that can be approximated by , where and , with and , and is distributed as an LH-trace distribution as follows:
that can be approximated by , where and , with and .
So, one can easily derive the test statistic for j-th order data for from our generalized statistic. The distribution of under for second-order data with 2-SSCS covariance structure is discussed in detail in Žežula et al. []. We will discuss the distribution of under for third-order data in detail in the following section.
5.1.2. Distribution of Statistic under for Third-order Data with 3-SSCS Covariance Structure
This section is adapted from the work of Žežula et al. []. However, we use a much simpler, more straightforward approach so that practitioners and analysts can appreciate and apply the method easily. Let be a matrix such that for each , is a Helmert matrix, that is, an orthogonal matrix with the first column proportional to the vector of 1’s. We use here the following canonical transformation:
Therefore, where
where, for each the diagonal -matrices are given by
and the component vectors with are independent, with distributions (under the null hypothesis) if
Therefore, particularizing , given by (23), for we have
Since the subsets of vectors involved in , respectively, form a partition of the set of independent vectors , these statistics are mutually independent. Moreover, since, for
has an LH-trace distribution denoted by if . Similarly, for
has an LH-trace distribution denoted by if . Note that the case reduces to , and then has the LH-trace distribution if .
Therefore, the distribution of is the following convolution of three independent LH-trace distributions:
if , for . The critical values of this distribution can be obtained using simulations. The LH-trace distribution is usually approximated by an F distribution as mentioned before; however, we use here the second approximation suggested in McKeon [].
For denoting by by and by
the distribution
of can be approximated by where and
For denoting by by and by
the distribution
of can be approximated by where and
Finally, for our last case corresponding to , the distribution is the usual Hotelling’s T², that is, an exact F distribution as follows:
This means that the distribution of can be approximated by the convolution of the above three distributions (two approximated F distributions and one exact F distribution), where its critical values are obtained by the method suggested by Dyer [].
Now, we need to perform the convolution of three distribution functions. Since convolution is associative, the order in which the three distribution functions are convolved does not matter, so we can dispense with the parentheses. A small Monte Carlo sketch of this convolution step is given below; in the following section, we present the unbiased estimates of the eigenblocks for a 3-SSCS covariance matrix.
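In code, the convolution step can be carried out by Monte Carlo: each component is drawn as a scaled F variable (the two McKeon-type approximations plus the exact Hotelling T² term, itself a scaled F), and the tail probability is estimated empirically. All constants below are placeholders to be computed from the formulas above.

```python
# Sketch: p-value of an observed statistic whose null distribution is a
# convolution of (approximately) scaled F variables.
import numpy as np
rng = np.random.default_rng(1)

def convolved_pvalue(components, observed, reps=200_000, rng=rng):
    """components: list of (scale, df1, df2); returns P(sum > observed)."""
    total = np.zeros(reps)
    for scale, df1, df2 in components:
        total += scale * rng.f(df1, df2, size=reps)
    return float(np.mean(total > observed))

# Illustrative placeholder constants only:
components = [(3.1, 5.2, 40.0), (2.4, 4.1, 35.0), (2.6, 2.0, 28.0)]
print(convolved_pvalue(components, observed=25.0))
```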
5.2. The Expressions of the ’s Estimators for the Case
- 1.
- From Lemma 5 in Leiva and Roy [], the unbiased estimators of for each are written as follows: and where are given in (19). Therefore, an unbiased estimator of is given by: Since the k-SSCS matrix in (9) is of Jordan algebra type, following Kozioł et al. [] one can show that the above estimate is the best unbiased, consistent, and complete estimator for .
- 2.
- For each , an unbiased estimator of is given by: where and , or equivalently: The above unbiased estimators admit the following expressions as functions of : for and , where and , and an unbiased estimator of is given by:
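The two items above translate directly into code. The sketch below (function names ours) implements the block-averaging estimators and then the one-sample statistic as we read (22) and (26) for k = 3, assuming an eye-within-time vectorization and the standard doubly exchangeable eigenblock formulas.

```python
import numpy as np
from scipy.linalg import helmert

def sscs3_component_estimates(S, m, u, v):
    """Unbiased estimates of U0, U1, U2 by averaging the m x m blocks of S."""
    U0 = np.zeros((m, m)); U1 = np.zeros((m, m)); U2 = np.zeros((m, m))
    c0 = c1 = c2 = 0
    for s in range(u * v):
        for t in range(u * v):
            B = S[s * m:(s + 1) * m, t * m:(t + 1) * m]
            if s == t:
                U0 += B; c0 += 1
            elif s // u == t // u:   # same time point, different eyes
                U1 += B; c1 += 1
            else:                    # different time points
                U2 += B; c2 += 1
    return U0 / c0, U1 / c1, U2 / c2

def sscs3_T2(X, mu0, m, u, v):
    """One-sample statistic n*(xbar-mu0)' Sigma_hat^{-1} (xbar-mu0),
    with Sigma_hat the 3-SSCS fit, evaluated eigenblock by eigenblock."""
    n = X.shape[0]
    U0h, U1h, U2h = sscs3_component_estimates(np.cov(X, rowvar=False), m, u, v)
    D1 = U0h - U1h                                # eigenblocks, cf. Section 4
    D2 = U0h + (u - 1) * U1h - u * U2h
    D3 = U0h + (u - 1) * U1h + u * (v - 1) * U2h
    Hu = helmert(u, full=True).T                  # first column prop. to ones
    Hv = helmert(v, full=True).T
    z = np.kron(Hv, np.kron(Hu, np.eye(m))).T @ (X.mean(axis=0) - mu0)
    T2 = 0.0
    for s in range(u * v):
        w = z[s * m:(s + 1) * m]
        D = D3 if s == 0 else (D2 if s % u == 0 else D1)
        T2 += n * w @ np.linalg.solve(D, w)
    return T2
```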
6. Test for the Equality of Two Means
6.1. Paired Observation Model
In this section, we consider that for each one of the n individuals, a -variate vector is measured at two different times (e.g., before and after a treatment). These measurements are k-th order (array-variate) measurements from each individual. To be more precise, for each , let and be the paired -dimensional vectors measured at the site of the rth individual, for . Let be the partitioned -variate vectors, where and , where and , with and , respectively, where and are the paired measurements taken from the rth individual, for . We assume that , where i.i.d. stands for independent and identically distributed, and , and is the partitioned -matrix
where
The matrices and account for the linear dependence among the considered paired measurements. Particular cases of could be of interest, e.g., , that is, (see, for example, Definition 2 on page 388 in Leiva []). Under this setup, we are interested in testing the following hypothesis:
If we define , the above hypothesis is equivalent to
as Moreover, are i.i.d. where and
Assuming is a positive definite matrix and that , one may consider the likelihood ratio test for the above hypothesis testing problem for k-level multivariate data, assuming the mean vectors and are unstructured. Note that this testing problem reduces to the one-sample mean case of the previous section, where . Therefore, all the results obtained in the previous section are valid for this case. Following the same logic as in Remark 1, the needed sample size for the test is , regardless of any .
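In code, the reduction is a one-line differencing of the paired data matrices; the commented call shows how the k = 3 one-sample sketch from Section 5.2 would be reused (placeholder data only).

```python
# Paired k-th order samples reduce to a one-sample zero-mean test on the
# differences d_r = x_r - y_r.
import numpy as np
rng = np.random.default_rng(2)

n, p = 30, 12                              # p = m*u*v for m=2, u=2, v=3
X = rng.standard_normal((n, p))            # placeholder paired data
Y = X + 0.5 * rng.standard_normal((n, p))
D = X - Y                                  # one sample with a k-SSCS covariance
# T2 = sscs3_T2(D, np.zeros(p), m=2, u=2, v=3)   # reuse the Section 5.2 sketch
```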
6.2. Independent Observation Model
In this section, we consider the case where we have two independent samples: one random sample of size of vectors with where , …, and where with being the measurements taken from the individual, for and another random sample of size of vectors , with where , where , …, and where with being the measurements taken from the individual, for for .
Our interest is in testing the following hypothesis:
under the assumption that is an unknown k-SSCS covariance matrix of the form (8). Let and denote the corresponding two sample data matrices. We know that the sample means and are independent of the covariance matrix estimators and , respectively. Therefore, they are also independent of the pooled unbiased estimator (a convex linear combination of unbiased estimators of ), which is given by:
where
Now
We know that under :
Due to the exchangeable form of it is clear that we again have:
Note that each of the following expressions is the arithmetic mean of all submatrices of , which, according to (31), have the same expectation. It is easy to prove that for each an unbiased estimator of for is given by:
and for after some algebraic simplification is given by:
where is given in (19). Therefore, we can use the following unbiased estimator of variance and covariance matrices
where, for each the diagonal -matrices are given by:
where is not taken into consideration and where
with , or equivalently:
The usual likelihood ratio test of (30) is to reject if:
Since ,
Nevertheless, we cannot use the above result, as in our case, is an estimator of . However, by Theorem 1 of Leiva and Roy [], we know that the random vectors and where are given by (4) with (5) and (6) with (7) are independent and
Since the estimators are functions of , they are independent of . Therefore, using a similar procedure as in the one-sample case, where we used the transformation , we now use the following transformation:
According to the previous result, where
where, for each the diagonal - matrices are given by:
Using a similar result as the one used in the one sample case, we obtain the statistic as follows:
Then, the distribution of is the convolution of k independent LH-trace distributions as follows:
The only condition needed on the sample size in order to have the above convolution of k independent LH-trace distributions is . However, LH-trace distributions are usually approximated by F distributions (we use here the second approximation suggested in McKeon []). This means that the distribution of can be approximated by the convolution of k F distributions, where its critical values are obtained by the method suggested by Dyer [].
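A self-contained sketch of the two-sample ingredients described above: the pooled covariance estimator (the convex combination of the two sample estimators) and the n1*n2/(n1+n2) scaling of the mean difference, with placeholder data; the eigenblock machinery of the Section 5.2 sketch is then reused unchanged.

```python
import numpy as np
rng = np.random.default_rng(3)

n1, n2, p = 30, 25, 12
X1 = rng.standard_normal((n1, p))   # placeholder sample 1
X2 = rng.standard_normal((n2, p))   # placeholder sample 2

S1 = np.cov(X1, rowvar=False)
S2 = np.cov(X2, rowvar=False)
S_pooled = ((n1 - 1) * S1 + (n2 - 1) * S2) / (n1 + n2 - 2)

diff = X1.mean(axis=0) - X2.mean(axis=0)
scale = n1 * n2 / (n1 + n2)         # replaces n from the one-sample case
# The statistic is the same eigenblock-wise quadratic form applied to
# sqrt(scale) * diff, with eigenblocks estimated from S_pooled.
```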
7. An Example
We apply our proposed extended test statistic to a third-order medical dataset as described in the Introduction, where the interest is in testing the equality of the mean of a population of glaucoma patients to a target mean from another population of glaucoma patients []. Several studies have shown that central corneal thickness (CCT) plays a major role in the diagnosis of glaucoma. Intraocular pressure (IOP) is positively correlated with CCT and may therefore affect diagnosis. Therefore, CCT should be measured along with IOP in all patients for verification of glaucoma. CCT and IOP vary from individual to individual, from right eye to left eye, and from time to time. We have a sample of 30 glaucoma patients. Measurements on IOP and CCT were taken from both eyes (sites) and were observed over three time points at an interval of three months. Clearly, then, this dataset is a third-order dataset with two response variables, two sites, and three time points. This dataset was studied by Leiva and Roy [] assuming a 3-SSCS covariance structure. Here, we also assume that this dataset has a 3-SSCS covariance structure. The 12-dimensional sample partitioned mean vector for our sample of 30 glaucoma patients is presented in Table 1.
Table 1.
The (2 × 1) dimensional sample partitioned mean vector in our sample of 30 glaucoma patients.
Additionally, using the Formulas (27)–(29) presented in Section 5.2, the unbiased estimates , , and are:
respectively. Using the above estimates, the unbiased estimate of is:
The diagonal blocks represent the estimate of the variance-covariance matrix of the two response variables IOP and CCT at any given eye and any given time point, whereas the off-diagonal blocks within a time point represent the estimate of the covariance matrix of IOP and CCT between the two eyes at that time point. The off-diagonal blocks between time points represent the covariance matrix of IOP and CCT between any two time points.
Iester et al. [] reported the mean and standard deviation (SD) of the IOP and CCT measurements for both eyes from 794 Italian Caucasian glaucoma patients (see Table 2). We treat these means as the means of IOP and CCT at the first time point and then randomly generate four samples within three standard errors (SDs of the mean) of these reported means of IOP and CCT to represent the means of IOP and CCT for the left and right eyes in the third and sixth months, respectively. These randomly generated means of IOP and CCT for the left and right eyes at three time points, in vector form, are reported in Table 3, and we take this mean vector as the targeted mean in (21). The sample mean vector in Table 1 appears to be very different from the targeted population mean vector in Table 3.
Table 2.
IOP and CCT measurements from 794 Italian Caucasian glaucoma patients.
Table 3.
The (2 × 1) dimensional targeted partitioned mean vector in the Italian Caucasian glaucoma patients.
The aim of our study is to see whether our sample of 30 glaucoma patients has the same mean vector as the Italian Caucasian glaucoma patients. Our main intention in analyzing our glaucoma dataset is to illustrate the use of our new hypothesis testing procedures rather than to give any insight into the dataset itself.
The calculated statistic (26) equals 317.2971. Its null distribution is a convolution of three independent LH-trace distributions, approximated in turn by two F distributions and one exact F distribution, and the corresponding p-value is 0. So, we reject the null hypothesis that the population mean of our dataset is equal to that of the Italian Caucasian glaucoma patients; this conclusion was expected from the data.
8. Conclusions and Discussion
We study tests of hypotheses of the equality of means for one population as well as for two populations for high-dimensional and higher-order data with the k-SSCS covariance structure. Such a structure is a natural and credible assumption in many research studies. The MLEs and the unbiased estimates of the matrix parameters of the k-SSCS covariance structure have closed-form solutions. On the other hand, the MLEs and the unbiased estimates of the matrix parameters of the separable covariance structure are not tractable and are computationally intensive. So, the k-SSCS covariance structure is a desirable covariance structure for k-th order data. Aghaian et al. [] examined differences in the CCT of 801 subjects, establishing that the CCT of Japanese participants was significantly lower than that of Caucasians, Chinese, Filipinos, and Hispanics, and greater than that of African Americans. African American individuals have thinner corneas compared to white individuals []. So, CCT and IOP in glaucoma patients vary with race, and our result confirms this fact. Our proposed new hypothesis testing procedures are well suited for high-dimensional array-variate data, which are ubiquitous in this century. In discriminant analysis [], the first step is to test the equality of means for the two populations. Therefore, our new method developed in this article will have important applications in the analysis of modern multivariate datasets with higher-order structure. Our new method can be extended to non-normal datasets. In addition, it can be extended to testing the equality of means for more than two populations and to simultaneous hypothesis testing in models with the k-SSCS covariance structure.
Author Contributions
All authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The authors gratefully acknowledge the funding support from the Department of Management Science and Statistics, Carlos Alvarez College of Business, The University of Texas at San Antonio, San Antonio, Texas for paying the APC. The authors are also thankful to the editor and three anonymous referees for their careful reading, valuable comments, and suggestions that led to a quite improved version of the manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Žežula, I.; Klein, D.; Roy, A. Testing of multivariate repeated measures data with block exchangeable covariance structure. Test 2018, 27, 360–378. [Google Scholar] [CrossRef]
- Dutilleul, P. The mle algorithm for the matrix normal distribution. J. Stat. Comput. Simul. 1999, 64, 105–123. [Google Scholar] [CrossRef]
- Gupta, A.K. On a multivariate statistical classification model. In Multivariate Statistical Analysis; Gupta, R.P., Ed.; North-Holland: Amsterdam, The Netherlands, 1980; pp. 83–93. [Google Scholar]
- Lu, N.; Zimmerman, D. The likelihood ratio test for a separable covariance matrix. Stat. Probab. Lett. 2005, 73, 449–457. [Google Scholar] [CrossRef]
- Kollo, T.; von Rosen, D. Advanced Multivariate Statistics with Matrices; Springer: Dordrecht, The Netherlands, 2005. [Google Scholar]
- Leiva, R. Linear discrimination with equicorrelated training vectors. J. Multivar. Anal. 2007, 98, 384–409. [Google Scholar] [CrossRef] [Green Version]
- Manceur, A.M.; Dutilleul, P. Maximum likelihood estimation for the tensor normal distribution: Algorithm, minimum sample size, and empirical bias and dispersion. J. Computat. Appl. Math. 2013, 239, 37–49. [Google Scholar] [CrossRef]
- Leiva, R.; Roy, A. Classification of higher-order data with separable covariance and structured multiplicative or additive mean models. Commun. Stat. Theory Methods 2014, 43, 989–1012. [Google Scholar] [CrossRef]
- Ohlson, M.; Ahmad, M.R.; von Rosen, D. The multilinear normal distribution: Introduction and some basic properties. J. Multivar. Anal. 2013, 113, 37–47. [Google Scholar] [CrossRef] [Green Version]
- Leiva, R.; Roy, A. Self Similar Compound Symmetry Covariance Structure. J. Stat. Theory Pract. 2021, 15, 70. [Google Scholar] [CrossRef]
- Rao, C.R. Familial correlations or the multivariate generalizations of the intraclass correlation. Curr. Sci. 1945, 14, 66–67. [Google Scholar]
- Rao, C.R. Discriminant functions for genetic differentiation and selection. Sankhya 1953, 12, 229–246. [Google Scholar]
- Olkin, I.; Press, S.J. Testing and estimation for a circular stationary model. Ann. Math. Stat. 1969, 40, 1358–1373. [Google Scholar] [CrossRef]
- Liang, Y.; von Rosen, D.; von Rosen, T. On estimation in hierarchical models with block circular covariance structures. Ann. Inst. Stat. Math. 2015, 67, 773–791. [Google Scholar] [CrossRef]
- Olkin, I. Inference for a Normal Population when the Parameters Exhibit Some Structure, Reliability and Biometry; SIAM: Philadelphia, PA, USA, 1974; pp. 759–773. [Google Scholar]
- Wilks, S.S. Sample criteria for testing equality of means, equality of variances, and equality of covariances in a normal multivariate distribution. Ann. Math. Stat. 1946, 17, 257–281. [Google Scholar] [CrossRef]
- Arnold, S.F. Application of the theory of products of problems to certain patterned covariance matrices. Ann. Stat. 1973, 1, 682–699. [Google Scholar] [CrossRef]
- Arnold, S.F. Linear models with exchangeably distributed errors. J. Am. Stat. Assoc. 1979, 74, 194–199. [Google Scholar] [CrossRef]
- Roy, A.; Leiva, R.; Žežula, I.; Klein, D. Testing of equality of mean vectors for paired doubly multivariate observations in blocked compound symmetric covariance matrix setup. J. Multivar. Anal. 2015, 137, 50–60. [Google Scholar] [CrossRef]
- Leiva, R.; Roy, A. Linear discrimination for three-level multivariate data with separable additive mean vector and doubly exchangeable covariance structure. Comput. Stat. Data Anal. 2012, 56, 1644–1661. [Google Scholar] [CrossRef]
- Roy, A.; Fonseca, M. Linear models with doubly exchangeable distributed errors. Commun. Stat. Theory Methods 2012, 41, 2545–2569. [Google Scholar] [CrossRef]
- Žežula, I.; Klein, D.; Roy, A. Mean Value Test for Three-Level Multivariate Observations with Doubly Exchangeable Covariance Structure. In Recent Developments in Multivariate and Random Matrix Analysis; Holgersson, T., Singull, M., Eds.; Springer Nature: Cham, Switzerland, 2020; pp. 335–349. [Google Scholar]
- Jordan, P.; von Neumann, J.; Wigner, E.P. On an algebraic generalization of the quantum mechanical formalism. Ann. Math. 1934, 35, 29–64. [Google Scholar] [CrossRef]
- Malley, J.D. Statistical Applications of Jordan Algebras. In Lecture Notes in Statistics; Springer: New York, NY, USA, 1994. [Google Scholar]
- Roy, A.; Zmyślony, R.; Fonseca, M.; Leiva, R. Optimal estimation for doubly multivariate data in blocked compound symmetric covariance structure. J. Multivar. Anal. 2016, 144, 81–90. [Google Scholar] [CrossRef]
- Kozioł, A.; Roy, A.; Zmyślony, R.; Leiva, R.; Fonseca, M. Best unbiased estimates for parameters of three-level multivariate data with doubly exchangeable covariance structure. Linear Algebra Appl. 2017, 535, 87–104. [Google Scholar] [CrossRef]
- McKeon, J.J. F approximations to the distribution of Hotelling’s T₀². Biometrika 1974, 61, 381–383. [Google Scholar]
- Dyer, D. The Convolution of Generalized F Distributions. J. Am. Stat. Assoc. 1982, 77, 184–189. [Google Scholar] [CrossRef]
- Iester, M.; Telani, S.; Frezzotti, P.; Manni, G.; Uva, M.; Figus, M.; Perdicchi, A. Differences in central corneal thickness between the paired eyes and the severity of the glaucomatous damage. Eye 2012, 26, 1424–1430. [Google Scholar] [CrossRef] [Green Version]
- Aghaian, E.; Choe, J.E.; Lin, S.; Stamper, R.L. Central corneal thickness of Caucasians, Chinese, Hispanics, Filipinos, African Americans, and Japanese in a glaucoma clinic. Ophthalmology 2004, 111, 2211–2219. [Google Scholar] [CrossRef] [PubMed]
- Brandt, J.D.; Beiser, J.A.; Kass, M.A.; Gordon, M.O.; Ocular Hypertension Treatment Study (OHTS) Group. Central corneal thickness in the Ocular Hypertension Treatment Study (OHTS). Ophthalmology 2001, 108, 1779–1788. [Google Scholar] [CrossRef]
- Johnson, R.A.; Wichern, D.W. Applied Multivariate Statistical Analysis, 6th ed.; Pearson Prentice Hall: Hoboken, NJ, USA, 2007. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).