Estimation and Sufficiency Under the Mixed Effects Extended Growth Curve Model with Compound Symmetry Covariance Structure

Filipiak, Katarzyna; Markiewicz, Augustyn; Krajewski, Paweł; Ćwiek-Kupczyńska, Hanna

doi:10.3390/sym17111901

Open AccessArticle

Estimation and Sufficiency Under the Mixed Effects Extended Growth Curve Model with Compound Symmetry Covariance Structure

¹

Institute of Mathematics, Poznań University of Technology, Piotrowo 3A, 60-965 Poznań, Poland

²

Department of Mathematical and Statistical Methods, Poznań University of Life Sciences, Wojska Polskiego 28, 60-637 Poznań, Poland

³

Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479 Poznań, Poland

⁴

Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Belval Campus 1, Boulevard du Jazz, L-4370 Belvaux, Luxembourg

^*

Author to whom correspondence should be addressed.

Symmetry 2025, 17(11), 1901; https://doi.org/10.3390/sym17111901

Submission received: 3 October 2025 / Revised: 31 October 2025 / Accepted: 4 November 2025 / Published: 7 November 2025

(This article belongs to the Special Issue Symmetrical and Asymmetrical Distributions in Statistics and Data Science II)

Download

Browse Figure

Versions Notes

Abstract

An extended growth curve model with fixed and random effects is considered. Under the assumption of multivariate normality, the maximum likelihood estimators of the fixed effects and the dispersion matrix are determined in a model with random nuisance parameters, both without any assumption on the covariance structure and under the assumption of compound symmetry. For this purpose, rules for differentiation of symmetric matrices are applied. Furthermore, when the experiments are designed in balanced complete blocks, particular symmetric matrices appear in the likelihood equations, allowing closed-form expressions for the estimators. It is also shown that the vector of sufficient statistics for the fixed effects extended growth curve model is also sufficient for the model with random nuisance parameters. The presented results are illustrated using a real data example.

Keywords:

extended growth curve model; fixed effects; random effects; dispersion matrix; estimation; sufficiency; compound symmetry; balanced incomplete block design

1. Introduction

In 1964, Potthoff and Roy [1] introduced the growth curve model (GCM) as an extension of the multivariate analysis of variance model (MANOVA), suitable for modeling data with repeated measurements, where underlying trends or growth curves can be observed. Such examples can be found, for instance, in environmental or medical research, as well as in biostatistics. Assuming mutually independent, multivariate normally distributed observation vectors, the maximum likelihood estimators (MLEs) of the matrix of expectation parameters and the dispersion matrix of observations can be found, for example, in [2]. Note that when the dispersion matrix is proportional to the identity matrix, the multivariate normal distribution is symmetric and is called a spherical distribution; when the dispersion matrix is diagonal, the distribution is symmetric with respect to the corresponding eigenvectors. Furthermore, ref. [3] considered the GCM with a particular dispersion structure, namely compound symmetry, where all diagonal entries are the same and all off-diagonal entries are equal. Since such a matrix has only two distinct eigenvalues, the distribution represents the simplest generalization of the spherical normal distribution, because the observation variances remain equal, and the observations are equicorrelated rather than independent. Note also that the explicit formula for the MLE under such a structure, as well as the sufficient statistics for the estimation of the expectation and covariance structure, are given in [3].

A generalization of the GCM is the extended growth curve model (EGCM), also known as the sum of profiles model, introduced by [4]. The MLEs of the expectation matrices in an EGCM with two components, as well as the MLE of the dispersion matrix, can be found, for example, in [2]; while results for an EGCM with three components are given, for instance, in [2,5]. It is worth noting that explicit forms of the estimators can be determined if the nested subspace conditions are satisfied (cf. [2] for the estimators and examples). Explicit formulas for unbiased estimators of the expectation parameters and the dispersion matrix can be found in [6].

In all the aforementioned papers, fixed effects models have been considered. Although fixed effects models provide a straightforward way to account for systematic differences among groups, they are limited when the grouping factors represent only a sample of all possible conditions. In such cases, treating these factors as fixed restricts inference to the specific observed levels and may reduce the model generalizability. Mixed effects models address this issue by incorporating random effects that capture variation across a broader population of conditions.

The first aim of this paper is to show that, under multivariate normality, the vector of statistics sufficient for a fixed effects EGCM remains sufficient when the nuisance parameters are treated as random. The second objective is to determine the maximum likelihood estimators (MLEs) of the unknown parameters in a mixed effects EGCM. We first consider an EGCM in which both the covariance matrices of the random effects and the errors are positive definite matrices. Subsequently, we determine the estimators of the unknown parameters of a multivariate random block effects model with specific restrictions (which represents a special case of the EGCM), and under the assumption of compound symmetry of the error dispersion matrix. For the purpose of estimation, the rules for differentiation of symmetric matrices are applied. Furthermore, when the experiments are designed in balanced complete blocks, particular symmetric matrices appear in the likelihood equations, allowing the closed-form expressions for the MLEs. It is worth noting that parameter estimation in a mixed effects EGCM has been studied previously in, e.g., [3]; however, the assumptions of compound symmetry of the dispersion matrices of random effects and errors were imposed in that work.

This paper is organized as follows. In Section 2, we introduce the EGCM, we determine the MLEs of the parameters under the mixed effects EGCM, and we show that the vector of sufficient statistics for a fixed effects EGCM remains sufficient for a mixed effects EGCM. In Section 3, we consider a special case of the mixed effects EGCM, namely the model of block experiments with the same random block effects for every observation, subject to particular restrictions on the random nuisance parameters, and assuming compound symmetry of the dispersion matrix of errors. Similarly to the general mixed effects EGCM, we determine the MLEs of the unknown parameters and show that the vector of sufficient statistics for the fixed effects EGCM with a compound symmetry dispersion matrix remains sufficient for the mixed effect EGCM. We also show that, in the special case when the block design is complete, there exists the best linear unbiased estimator of the matrix of treatment effects, and the MLE of the compound symmetry dispersion matrix can be expressed in explicit form. In Section 4 we illustrate the results with real data examples. The paper ends up with short discussion.

2. Estimation Under Mixed EGCM

Consider the extended growth curve model (EGCM)

Y \sim N_{n, q} (A_{1} B_{1} C_{1} + A_{2} B_{2} C_{2}, I_{n}, Σ),

(1)

where

Y \in R_{n \times q}

is a matrix of normally distributed observations with independent rows,

A_{i} \in R_{n \times n_{i}}

and

C_{i} \in R_{q_{i} \times q}

,

i = 1, 2

, are known matrices (the design and restrictions matrices, respectively), while

B_{i} \in R_{n_{i} \times q_{i}}

are matrices of unknown parameters,

Σ

is an unknown symmetric positive definite matrix of order q, i.e.,

Σ \in R_{q}^{>}

, such that

D (Y) = D (vec Y) = Σ \otimes I_{n}

, with “vec” denoting the vectorization operator, which transforms a matrix into a vector by stacking its columns one below the other, and ⊗ denotes the Kronecker product; cf. [2]. It is worth mentioning that the matrices

A_{i}

usually contain information about the experimental designs, whereas the matrices

C_{i}

represent restrictions related to growth. An EGCM of the form (1) is called a fixed effect EGCM.

In model (1), the best linear unbiased estimators (BLUEs) of estimable functions of

B_{1}

and

B_{2}

usually do not exist, and thus the MLEs of

B_{1}

and

B_{2}

, denoted by

{\tilde{B}}_{1}

and

{\tilde{B}}_{2}

, under normality of the distribution of observations are usually considered. The log-likelihood function has the form

\begin{matrix} L (B_{1}, B_{2}, Σ; Y) & = & - \frac{n q}{2} ln (2 π) - \frac{n}{2} ln | Σ | \\ - \frac{1}{2} tr [(Y - A_{1} B_{1} C_{1} - A_{2} B_{2} C_{2}) Σ^{- 1} \\ \times {(Y - A_{1} B_{1} C_{1} - A_{2} B_{2} C_{2})}^{⊤}], \end{matrix}

(2)

where

| \cdot |

denotes the determinant, and, following [5], the normal equations are given by

\begin{matrix} A_{1}^{⊤} (Y - A_{1} B_{1} C_{1} - A_{2} B_{2} C_{2}) Σ^{- 1} C_{1}^{⊤} = 0, \\ A_{2}^{⊤} (Y - A_{1} B_{1} C_{1} - A_{2} B_{2} C_{2}) Σ^{- 1} C_{2}^{⊤} = 0, \\ n Σ = {(Y - A_{1} B_{1} C_{1} - A_{2} B_{2} C_{2})}^{⊤} \cdot (Y - A_{1} B_{1} C_{1} - A_{2} B_{2} C_{2}) . \end{matrix}

To obtain explicit estimators of all parameters, one of the common assumptions is

C (C_{1}^{⊤}) \subseteq C (C_{2}^{⊤})

, where

C (\cdot)

denotes a column space. Using the notation for orthogonal projectors

P_{A} = A {(A^{⊤} A)}^{-} A^{⊤}

,

Q_{A} = I - P_{A}

, and

P_{A; B} = A {(A^{⊤} B A)}^{- 1} A^{⊤} B

,

Q_{A; B} = I - P_{A; B}

, the theorem below can be proved using the same techniques as in [5] (Theorem 4). The sketch of the proof is given in the Appendix A.

Theorem 1.

In a fixed effects EGCM (1) with

C (C_{1}^{⊤}) \subseteq C (C_{2}^{⊤})

and an unknown positive definite covariance matrix Σ, the maximum likelihood estimators of the unknown parameters are given by

\begin{matrix} A_{1} {\hat{B}}_{1} C_{1} & = & P_{A_{1}; Q_{A_{2}}} Y P_{C_{1}^{⊤}; S_{1}^{- 1}}^{⊤} \\ A_{2} {\hat{B}}_{2} C_{2} & = & P_{A_{2}} (Y P_{C_{2}^{⊤}; S_{2}^{- 1}}^{⊤} - P_{A_{1}; Q_{A_{2}}} Y P_{C_{1}^{⊤}; S_{1}^{- 1}}^{⊤}) \\ n \hat{Σ} & = & S_{2} + Q_{C_{2}^{⊤}; S_{2}^{- 1}} Y^{⊤} P_{A_{2}} Y Q_{C_{2}^{⊤}; S_{2}^{- 1}}^{⊤} \end{matrix}

where

S_{1} = Y^{⊤} Q_{(A_{1} : A_{2})} Y

and

S_{2} = S_{1} + Q_{C_{1}^{⊤}; S_{1}^{- 1}} Y^{⊤} P_{Q_{A_{2}} A_{1}} Y Q_{C_{1}^{⊤}; S_{1}^{- 1}}^{⊤}

.

It is known (see, e.g., [7]) that under the GMANOVA model—that is, the EGCM (1) with

C_{1} = C_{2} = I_{q}

– the vector

F (Y) = ({(A_{1} : A_{2})}^{⊤} Y, Y^{⊤} Q_{(A_{1} : A_{2})} Y)

(3)

represents a sufficient statistic for the estimation of

(B_{1}, B_{2}, Σ)

. It can be seen that this is also a sufficient statistic under model (1), as the log-likelihood function (2) can be expressed in terms of

(A_{1}^{⊤} Y, A_{2}^{⊤} Y, Y^{⊤} Y)

.

Assume now that in the EGCM (1), the nuisance parameter

B_{2}

is random, with

D (B_{2}) = D (vec B_{2}) = Ψ \otimes I_{n_{2}}

, where

Ψ \in R_{q_{2}}^{>}

is an unknown dispersion matrix, and which is independent of the random error of the model. Then the EGCM with

\begin{matrix} E (Y) = A_{1} B_{1} C_{1}, \\ D (Y) = C_{2}^{⊤} Ψ C_{2} \otimes A_{2} A_{2}^{⊤} + Σ \otimes I_{n} . \end{matrix}

(4)

is called a mixed effects EGCM.

Denote

y = vec Y

,

β_{1} = vec B_{1}

,

β_{2} = vec B_{2}

, and

v = y - (C_{1}^{⊤} \otimes A_{1}) β_{1}

. Let

D (Y) = Ω

. Moreover, let

{PTr}_{u} [\cdot]

denote the partial trace operator, which replaces each

u \times u

block of a partitioned

m u \times m u

matrix by its trace; cf. [8]. Then the MLEs of

B_{1}

,

Ψ

, and

Σ

can be determined as stated in the following theorem.

Theorem 2.

In a mixed effects EGCM (4) with unknown positive definite covariance matrices Ψ and Σ, the maximum likelihood estimators of the unknown parameters can be obtained by solving

\begin{matrix} (C_{1} \otimes A_{1}^{⊤}) Ω^{- 1} v = 0, \\ {PTr}_{n_{2}} [(C_{2} \otimes A_{2}^{⊤}) (Ω^{- 1} - Ω^{- 1} v v^{⊤} Ω^{- 1}) (C_{2}^{⊤} \otimes A_{2})] = 0, \\ {PTr}_{n} [Ω^{- 1} - Ω^{- 1} v v^{⊤} Ω^{- 1}] = 0 . \end{matrix}

Proof.

The log-likelihood function can be written as

\begin{matrix} L (B_{1}, Ω; Y) & = & - \frac{n q}{2} ln (2 π) - \frac{1}{2} ln | Ω | \\ - \frac{1}{2} {vec}^{⊤} [Y - A_{1} B_{1} C_{1}] Ω^{- 1} vec [Y - A_{1} B_{1} C_{1}] \\ = & - \frac{n q}{2} ln (2 π) - \frac{1}{2} ln | Ω | - \frac{1}{2} v^{⊤} Ω^{- 1} v . \end{matrix}

(5)

Note that

Ψ

and

Σ

(embedded in

Ω

) are symmetric. Therefore, differentiating the log-likelihood function with respect to symmetric matrices amounts to differentiating with respect to the lower triangles of these matrices, denoted respectively by

Ψ^{▵}

and

Σ^{▵}

. By the chain rule given in [9], and following the differentiation formulas presented in [10], we obtain

\begin{matrix} \frac{\partial L}{\partial β_{1}} & = & v^{⊤} Ω^{- 1} (C_{1}^{⊤} \otimes A_{1}), \\ \frac{\partial L}{\partial Ψ^{▵}} & = & [- \frac{1}{2} {vec}^{⊤} Ω^{- 1} + \frac{1}{2} (v^{⊤} \otimes v^{⊤}) (Ω^{- 1} \otimes Ω^{- 1})] \\ \times (C_{2}^{⊤} \otimes A_{1} \otimes C_{2}^{⊤} \otimes A_{2}) (I_{q_{2}} \otimes K_{n_{2}, q_{2}} \otimes I_{n_{2}}) (I_{q_{2}^{2}} \otimes vec I_{n_{2}}) D_{q_{2}}, \\ \frac{\partial L}{\partial Σ^{▵}} & = & [- \frac{1}{2} {vec}^{⊤} Ω^{- 1} + \frac{1}{2} (v^{⊤} \otimes v^{⊤}) (Ω^{- 1} \otimes Ω^{- 1})] \\ \times (I_{q} \otimes K_{n, q} \otimes I_{n}) (I_{q^{2}} \otimes vec I_{n}) D_{q}, \end{matrix}

(6)

where

K_{u, m}

denotes the

u m \times u m

commutation matrix (see e.g., [2]) and

D_{m}

denotes the

m^{2} \times m (m + 1) / 2

duplication matrix; cf. [9]. From [9] (Formula (14)), we have

b \otimes a = vec (a b^{⊤})

, and thus

{vec}^{⊤} Ω^{- 1} - (v^{⊤} \otimes v^{⊤}) (Ω^{- 1} \otimes Ω^{- 1}) = {vec}^{⊤} [Ω^{- 1} - Ω^{- 1} v v^{⊤} Ω^{- 1}] .

From [8] (Lemma 2.9), we have

(I_{m^{2}} \otimes {vec}^{⊤} I_{u}) (I_{m} \otimes K_{u, m} \otimes I_{u}) vec A = vec [{PTr}_{u} A],

which implies that the normal equations can be written as stated in the theorem. □

From the Fisher-Neyman factorization theorem, it can be seen that the sufficient statistic (3) remains sufficient for estimation of

B_{1}

,

Ψ

, and

Σ

also in the mixed EGCM (4), since the log-likelihood Function (5) can be expressed in terms of

(A_{1}^{⊤} Y, A_{2}^{⊤} Y, Y^{⊤} Y)

. Indeed, since

\begin{matrix} Ω^{- 1} & = & Σ^{- 1} \otimes I_{n} - (Σ^{- 1} C_{2}^{⊤} \otimes A_{2}) {(C_{2} Σ^{- 1} C_{2}^{⊤} \otimes A_{2}^{⊤} A_{2})}^{- 1} (C_{2} Σ^{- 1} \otimes A_{2}^{⊤}), \end{matrix}

the quadratic form in (5) can be represented as

\begin{matrix} v^{⊤} Ω^{- 1} v & = & v^{⊤} (Σ^{- 1} \otimes I_{n}) v - v (Σ^{- 1} C_{2}^{⊤} \otimes A_{2}) W (C_{2} Σ^{- 1} \otimes A_{2}^{⊤}) v \\ = & tr [V^{⊤} V Σ^{- 1}] - {vec}^{⊤} [A_{2}^{⊤} V Σ^{- 1} C_{2}] W vec [A_{2}^{⊤} V Σ^{- 1} C_{2}], \end{matrix}

where

W = {(C_{2} Σ^{- 1} C_{2}^{⊤} \otimes A_{2}^{⊤} A_{2})}^{- 1}

and

V = Y - A_{1} B_{1} C_{1}

. Similarly to the case of fixed effects EGCM,

V^{⊤} V

can be expressed in terms of

(A_{1}^{⊤} Y, Y^{⊤} Y)

, while

A_{2}^{⊤} V

can be expressed in terms of

A_{2}^{⊤} Y

.

In the next section, we consider a particular form of the mixed effects EGCM, that is, we assume that

B_{1}

is a matrix of treatment effects,

B_{2}

is a matrix of block effects (the same for all q characteristics), and that the dispersion matrix

Ω

belongs to a particular quadratic subspace.

3. Block Effects Model with Compound Symmetry Matrix $Σ$

Consider an experiment in which t treatments are arranged in b blocks, each of size k. Assume a mixed effects EGCM in which

B_{1}

is a

t \times q_{1}

matrix of treatment effects and

B_{2}

is a

b \times q_{2}

matrix of random block effects. The design matrix for treatments,

A_{1}

, represents the arrangement of treatments on experimental units. It is therefore a 0–1 matrix with

n = b k

rows and t columns, such that there is exactly one 1 in each row. The design matrix for block effects,

A_{2}

, represents the arrangement of experimental units in blocks, and hence

A_{2} = I_{b} \otimes 1_{k}

. Clearly,

A_{2}^{⊤} A_{2} = k I_{b}

. Under these assumptions, the inverse of

Ω

can be written as

\begin{matrix} Ω^{- 1} & = & Σ^{- 1} \otimes I_{b k} - (Σ^{- 1} C_{2}^{⊤} \otimes A_{2}) {(Ψ^{- 1} \otimes I_{b} + C_{2} Σ^{- 1} C_{2}^{⊤} \otimes A_{2}^{⊤} A_{2})}^{- 1} (C_{2} Σ^{- 1} \otimes A_{2}^{⊤}) \\ = & Σ^{- 1} \otimes I_{b k} - Σ^{- 1} C_{2}^{⊤} {(Ψ^{- 1} + k C_{2} Σ^{- 1} C_{2}^{⊤})}^{- 1} C_{2} Σ^{- 1} \otimes (I_{b} \otimes 1_{k} 1_{k}^{⊤}) \\ = & Σ^{- 1} \otimes I_{n} - Ξ \otimes (I_{b} \otimes 1_{k} 1_{k}^{⊤}), \end{matrix}

where

Ξ = Σ^{- 1} C_{2}^{⊤} {(Ψ^{- 1} + k C_{2} Σ^{- 1} C_{2}^{⊤})}^{- 1} C_{2} Σ^{- 1}

. Plugging this

Ω^{- 1}

into the system of equations in Theorem 2, and using the following properties of the partial trace operator:

{PTr}_{m} [vec A \cdot {vec}^{⊤} B] = A^{⊤} B, {PTr}_{m} [A \otimes B] = tr [B] \cdot A

(cf. [8] (Lemmas 2.8 and 2.13)), and noting that

Ω^{- 1} v = vec [V Σ^{- 1}] - vec [(I_{b} \otimes 1_{k} 1_{k}^{⊤}) V Ξ],

the maximum likelihood equations from Theorem 2 can be written in matrix form as

\begin{matrix} A_{1}^{⊤} V Σ^{- 1} C_{1}^{⊤} = A_{1}^{⊤} (I_{b} \otimes 1_{k} 1_{k}^{⊤}) V Ξ C_{1}^{⊤}, \\ C_{2} (Σ^{- 1} - k Ξ) V^{⊤} (I_{b} \otimes 1_{k} 1_{k}^{⊤}) V (Σ^{- 1} - k Ξ) C_{2}^{⊤} = b k C_{2} (Σ^{- 1} - k Ξ) C_{2}^{⊤}, \\ (Σ^{- 1} V^{⊤} - Ξ V^{⊤} (I_{b} \otimes 1_{k} 1_{k}^{⊤})) (V Σ^{- 1} - (I_{b} \otimes 1_{k} 1_{k}^{⊤}) V Ξ) = n Σ^{- 1} - b k Ξ . \end{matrix}

Assume now that the same block effect is observable for every characteristic, that is,

C_{2} = 1_{q}^{⊤}

. Then

B_{2} = β_{2}

, implying that

Ψ = ψ^{2}

. Assume also that the covariance matrix

Σ

has a compound symmetry structure, i.e.,

Σ = σ^{2} \cdot [(1 - ρ) I_{q} + ρ 1_{q} 1_{q}^{⊤}]

, and that

1_{q} \in C (C_{1}^{⊤})

. Then, we can formulate the following theorem.

Theorem 3.

In a block effects EGCM (4) with

A_{2} = I_{b} \otimes 1_{k}

,

C_{2} = 1_{q}^{⊤}

,

1_{q} \in C (C_{1}^{⊤})

, an unknown positive

ψ^{2}

and compound symmetry covariance matrix Σ, the maximum likelihood estimators of the unknown parameters are given by

\begin{matrix} vec B_{1} & = & {[(C_{1} \otimes A_{1}^{⊤}) Ω^{- 1} (C_{1}^{⊤} \otimes A_{1})]}^{- 1} (C_{1} \otimes A_{1}^{⊤}) Ω^{- 1} vec Y \\ Ω & = & \frac{v^{⊤} (P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}}) v}{b} (P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}}) \\ + \frac{v^{⊤} (P_{1_{q}} \otimes I_{b} \otimes Q_{1_{k}}) v}{b (k - 1)} (P_{1_{q}} \otimes I_{b} \otimes Q_{1_{k}}) \\ + \frac{v^{⊤} (Q_{1_{q}} \otimes I_{b} \otimes I_{k}) v}{(q - 1) b k} (Q_{1_{q}} \otimes I_{b} \otimes I_{k}) \end{matrix}

where

v = Q_{(C_{1}^{⊤} \otimes A_{1}); Ω^{- 1}} vec Y

.

Proof.

Observing that

Σ

can be represented as

λ_{1} Q_{1_{q}} + λ_{2} P_{1_{q}}

, where

λ_{1} = σ^{2} (1 - ρ)

and

λ_{2} = σ^{2} [1 + (q - 1) ρ]

, its inverse can be written as

Σ^{- 1} = \frac{1}{λ_{1}} Q_{1_{q}} + \frac{1}{λ_{2}} P_{1_{q}}

. Thus,

\begin{matrix} Ω & = & ψ^{2} \cdot 1_{q} 1_{q}^{⊤} \otimes (I_{b} \otimes 1_{k} 1_{k}^{⊤}) + (λ_{1} Q_{1_{q}} + λ_{2} P_{1_{q}}) \otimes I_{n} \\ = & (q k ψ^{2} + λ_{2}) (P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}}) + λ_{2} (P_{1_{q}} \otimes I_{b} \otimes Q_{1_{k}}) + λ_{1} (Q_{1_{q}} \otimes I_{b} \otimes I_{k}) . \end{matrix}

(7)

Note that the components of

Ω

are orthogonal, and thus

Ω^{- 1} = \frac{1}{q k ψ^{2} + λ_{2}} (P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}}) + \frac{1}{λ_{2}} (P_{1_{q}} \otimes I_{b} \otimes Q_{1_{k}}) + \frac{1}{λ_{1}} (Q_{1_{q}} \otimes I_{b} \otimes I_{k}) .

(8)

Therefore, to obtain the normal equation for

ψ^{2}

, it is sufficient to use the above formula for

Ω^{- 1}

in the second equation of the system in Theorem 2, and replace

A_{2}

and

C_{2}

with the considered matrices. This yields

tr [(P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}}) (Ω^{- 1} - Ω^{- 1} v v^{⊤} Ω^{- 1})] = 0 .

Differentiating

Σ

with respect to

λ_{1}

and

λ_{2}

gives

\frac{\partial Σ}{\partial λ_{1}} = vec Q_{1_{q}} and \frac{\partial Σ}{\partial λ_{2}} = vec P_{1_{q}},

and hence, the third normal equation in the system from Theorem 2 must be multiplied by the above giving

\begin{matrix} {vec}^{⊤} ({PTr}_{n} [Ω^{- 1} - Ω^{- 1} v v^{⊤} Ω^{- 1}]) vec Q_{1_{q}} = 0, \\ {vec}^{⊤} ({PTr}_{n} [Ω^{- 1} - Ω^{- 1} v v^{⊤} Ω^{- 1}]) vec P_{1_{q}} = 0, \end{matrix}

which, using

{vec}^{⊤} A vec B = tr (A^{⊤} B)

, reduces to

\begin{matrix} tr [{PTr}_{n} [Ω^{- 1} - Ω^{- 1} v v^{⊤} Ω^{- 1}] Q_{1_{q}}] & = & 0 \\ tr [{PTr}_{n} [Ω^{- 1} - Ω^{- 1} v v^{⊤} Ω^{- 1}] P_{1_{q}}] & = & 0 . \end{matrix}

Taking into account that

Q_{1_{q}} = I_{q} - P_{1_{q}}

and using the second equality, as well as [8] (Lemma 2.11) and the fact that

tr [PTr [\cdot]] = tr [\cdot]

, we may rewrite the above as

\begin{matrix} tr [Ω^{- 1} - Ω^{- 1} v v^{⊤} Ω^{- 1}] & = & 0, \\ tr [(Ω^{- 1} - Ω^{- 1} v v^{⊤} Ω^{- 1}) (P_{1_{q}} \otimes I_{n})] & = & 0 . \end{matrix}

Plugging

Ω^{- 1}

into the normal equations for

ψ^{2}

,

λ_{1}

and

λ_{2}

, and using orthogonality and idempotency of the components of

Ω^{- 1}

, the MLEs of the unknown dispersion parameters under the mixed effects EGCM with additional restrictions can be obtained as

\begin{matrix} ψ^{2} = \frac{v^{⊤} [(k - 1) (P_{q} \otimes I_{b} \otimes P_{k}) - (P_{q} \otimes I_{b} \otimes Q_{k})] v}{q b k (k - 1)} \\ λ_{1} = \frac{v^{⊤} (Q_{q} \otimes I_{b k}) v}{(q - 1) b k} \\ λ_{2} = \frac{v^{⊤} (P_{q} \otimes I_{b} \otimes Q_{1_{k}}) v}{b (k - 1)} . \end{matrix}

Substituting the above into

Ω

given in (7) we obtain the desired result. □

Note that the positive definiteness of the estimate of

Ω

follows from the positivity of all coefficients of the respective Kronecker products, which are orthogonal to each other and sum to the identity matrix.

It is known (see, e.g., [3,11]) that the sufficient statistics for estimation of

B_{1}

,

B_{2}

, and

Σ = Σ_{CS}

under the fixed effects EGCM (1), with

C_{2} = 1_{q}^{⊤}

and

1_{q} \in R (C_{1}^{⊤})

, is

(tr [Y^{⊤} Y], 1_{q}^{⊤} Y^{⊤} Y 1_{q}, A_{1}^{⊤} Y C_{1}^{⊤}, A_{2}^{⊤} Y 1_{q}) .

(9)

It can be shown that this is also a sufficient statistic for the estimation of

B_{1}

,

ψ^{2}

and

Σ = Σ_{CS}

under the considered mixed effects EGCM. Taking into account (5), it is enough to show that

v^{⊤} Ω^{- 1} v

, with

Ω^{- 1}

given in (8), is a function of (9). Note that

\begin{matrix} v^{⊤} (P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}}) v & = & tr [V^{⊤} (I_{b} \otimes P_{1_{k}}) V P_{1_{q}}] = \frac{1}{q} 1_{q}^{⊤} V^{⊤} (I_{b} \otimes P_{1_{k}}) V 1_{q} \\ v^{⊤} (P_{1_{q}} \otimes I_{b} \otimes Q_{1_{k}}) v & = & v^{⊤} (P_{1_{q}} \otimes I_{b k}) v - v^{⊤} (P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}}) v \\ = & tr [V^{⊤} V P_{1_{q}}] - tr [V^{⊤} (I_{b} \otimes P_{1_{k}}) V P_{1_{q}}] \\ = & \frac{1}{q} 1_{q}^{⊤} V^{⊤} V 1_{q} - \frac{1}{q} 1_{q}^{⊤} V^{⊤} (I_{b} \otimes P_{1_{k}}) V 1_{q} \\ = & \frac{1}{q} 1_{q}^{⊤} V^{⊤} V 1_{q} - \frac{1}{k q} 1_{q}^{⊤} V^{⊤} A_{2} A_{2}^{⊤} V 1_{q}, \end{matrix}

and

v^{⊤} (Q_{1_{q}} \otimes I_{b} \otimes I_{k}) v = v^{⊤} v - v^{⊤} (P_{1_{q}} \otimes I_{b k}) v = tr [V^{⊤} V] - \frac{1}{q} 1_{q}^{⊤} V^{⊤} V 1_{q} .

We therefore need to show that

1_{q}^{⊤} V^{⊤} V 1_{q}

,

1_{q}^{⊤} V^{⊤} A_{2} A_{2}^{⊤} V 1_{q}

, and

tr [V^{⊤} V]

are functions of (9). Consider

1_{q}^{⊤} V^{⊤} V 1_{q} = q tr [V^{⊤} V P_{1_{q}}]

. Since

1_{q} \in C (C_{1}^{⊤})

, we have

P_{1_{q}} = P_{C_{1}^{⊤}} P_{1_{q}} = P_{1_{q}} P_{C_{1}^{⊤}}

. Therefore

\begin{matrix} 1_{q}^{⊤} V^{⊤} V 1_{q} & = & 1_{q}^{⊤} Y^{⊤} Y 1_{q} - 2 q tr [C_{1} P_{1_{q}} Y^{⊤} A_{1} B_{1}] + 1_{q}^{⊤} C_{1}^{⊤} B_{1}^{⊤} A_{1}^{⊤} A_{1} B_{1} C_{1} 1_{q} \\ = & 1_{q}^{⊤} Y^{⊤} Y 1_{q} - 2 q tr [C_{1} P_{1_{q}} P_{C_{1}^{⊤}} Y^{⊤} A_{1} B_{1}] + 1_{q}^{⊤} C_{1}^{⊤} B_{1}^{⊤} A_{1}^{⊤} A_{1} B_{1} C_{1} 1_{q} \\ = & 1_{q}^{⊤} Y^{⊤} Y 1_{q} - 2 1_{q}^{⊤} C_{1}^{⊤} {(C_{1} C_{1}^{⊤})}^{-} C_{1} Y^{⊤} A_{1} B_{1} C_{1} 1_{q} + 1_{q}^{⊤} C_{1}^{⊤} B_{1}^{⊤} A_{1}^{⊤} A_{1} B_{1} C_{1} 1_{q} \end{matrix}

is a function of

1_{q}^{⊤} Y^{⊤} Y 1_{q}

and

A_{1}^{⊤} Y C_{1}^{⊤}

.

It can be easily seen that

\begin{matrix} 1_{q}^{⊤} V^{⊤} A_{2} A_{2}^{⊤} V 1_{q} & = & 1_{q}^{⊤} Y^{⊤} A_{2} A_{2}^{⊤} Y 1_{q} - 2 1_{q}^{⊤} Y^{⊤} A_{2} A_{2}^{⊤} A_{1} B_{1} C_{1} 1_{q} \\ + 1_{q}^{⊤} A_{1}^{⊤} B_{1}^{⊤} C_{1} ⊤ A_{2} A_{2}^{⊤} A_{1} B_{1} C_{1} 1_{q} \end{matrix}

is a function of

A_{2}^{⊤} Y 1_{q}

, and that

tr [V V^{⊤}] = tr [Y^{⊤} Y] - 2 tr [C_{1} Y^{⊤} A_{1} B_{1}] + tr [C_{1}^{⊤} B_{1}^{⊤} A_{1}^{⊤} A_{1} B_{1} C_{1}]

is a function of

tr [Y^{⊤} Y]

and

A_{1}^{⊤} Y C_{1}^{⊤}

, and thus sufficiency follows.

Finally, let us take a look at the design matrix for treatments,

A_{1}

, since in block experiments it contains information about the arrangement of treatments on experimental units. The most common block designs are balanced incomplete block designs (BIBDs), that is, binary block designs with

k \leq t

, such that each treatment appears in the same number of blocks, say

r_{1}

, and each pair of distinct treatments appears in the same number of blocks, say

r_{2}

. Since the design is binary, all treatments within a single block must be distinct; cf. [12]. An immediate consequence is that

A_{1}^{⊤} A_{1} = (r_{1} - r_{2}) I_{t} + r_{2} 1_{t} 1_{t}^{⊤}

. Moreover, in the particular case when

k = t

, the BIBD becomes complete, and

A_{1}

can be represented as a partitioned matrix consisting of b permutation matrices,

Π_{i}

,

i = 1, \dots, b

, that is,

A_{1} = {(Π_{1}, Π_{2}, \dots, Π_{b})}^{⊤} .

Obviously,

A_{1}^{⊤} A_{1} = b I_{t}

and

Π_{i} 1_{t} = Π_{i}^{⊤} 1_{t} = 1_{t}

. Since

P_{C_{1}^{⊤} \otimes A_{1}} = P_{C_{1}^{⊤}} \otimes P_{A_{1}}

and

1_{q} \in C (C_{1}^{⊤})

, for the dispersion matrix with compound symmetry structure of

Σ

we have

P_{C_{1}^{⊤} \otimes A_{1}} Ω = Ω P_{C_{1}^{⊤} \otimes A_{1}},

(10)

which means that the orthogonal projector onto the expectation space commutes with the dispersion matrix. Moreover, in such a case, the best linear unbiased estimator (BLUE) of

B_{1}

exists and coincides with the ordinary least squares estimator (OLSE); cf. [13]. Indeed, the normal equation obtained from the derivative with respect to

β_{1}

given in (6), that is

(C_{1} \otimes A_{1}^{⊤}) Ω^{- 1} v = 0

when multiplied on the left by

(C_{1}^{⊤} \otimes A_{1}) {[(C_{1} \otimes A_{1}^{⊤}) (C_{1}^{⊤} \otimes A_{1})]}^{- 1}

, this can be represented as

P_{C_{1}^{⊤} \otimes A_{1}} Ω^{- 1} v = 0 .

From the commutativity relation (10), we obtain an explicit solution for the maximum likelihood estimator of

B_{1}

in the form

A_{1} {\hat{B}}_{1} C_{1} = P_{A_{1}} Y P_{C_{1}^{⊤}},

which is also the BLUE of

(A_{1} B_{1} C_{1})

.

Moreover, since

Ω

belongs to a quadratic subspace, according to [14], the MLEs of the parameters of model (4) have an explicit form, and the estimator of the dispersion matrix can be represented as a projection of

S = Q_{C_{1}^{⊤} \otimes A_{1}} y y^{⊤} Q_{C_{1}^{⊤} \otimes A_{1}}

onto the space of compound symmetry matrices, that is,

\begin{matrix} \hat{Ω} & = & \frac{1}{tr (P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}})} tr [S (P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}})] \cdot (P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}}) \\ + \frac{1}{tr (P_{1_{q}} \otimes I_{b} \otimes Q_{1_{k}})} tr [S (P_{1_{q}} \otimes I_{b} \otimes P_{Q_{k}})] \cdot (P_{1_{q}} \otimes I_{b} \otimes Q_{1_{k}}) \\ + \frac{1}{tr (Q_{1_{q}} \otimes I_{b} \otimes I_{k})} tr [S (Q_{1_{q}} \otimes I_{b} \otimes I_{k})] \cdot (Q_{1_{q}} \otimes I_{b} \otimes I_{k}) . \end{matrix}

Using the commutativity relation (10) and the fact that

1_{q} \in C (C_{1}^{⊤})

, we finally obtain the MLEs of

B_{1}

and

Ω

for balanced complete block designs as

\begin{matrix} A_{1} {\hat{B}}_{1} C_{1} & = & P_{A_{1}} Y P_{C_{1}^{⊤}} \\ \hat{Ω} & = & \frac{1}{b} tr [P_{1_{q}} Y^{⊤} (I_{b} \otimes P_{1_{k}}) Q_{A_{1}} Y] \cdot (P_{1_{q}} \otimes I_{b} \otimes P_{1_{k}}) \\ + \frac{1}{b (k - 1)} tr [P_{1_{q}} Y^{⊤} (I_{b} \otimes Q_{1_{k}}) Q_{A_{1}} Y] \cdot (P_{1_{q}} \otimes I_{b} \otimes Q_{1_{k}}) \\ + \frac{1}{(q - 1) b k} tr [Q_{1_{q}} Y^{⊤} Y - Q_{1_{q}} Y^{⊤} P_{A_{1}} Y P_{C_{1}^{⊤}}] \cdot (Q_{1_{q}} \otimes I_{b} \otimes I_{k}) . \end{matrix}

(11)

Note that (11) can be also obtained by substituting

\hat{v} = vec [Y - P_{A_{1}} Y P_{C_{1}^{⊤}}]

instead of

v

in Theorem 3.

In many experiments, it is common to replicate observations. In such cases, complete block designs can be generalized to block designs, in which each treatment appears more than once. For example, if each treatment is replicated r times in every block (of size

k = r t

), and if the observations are arranged consecutively with respect to replications, then

A_{1} = {(Π_{11}, Π_{12}, \dots, Π_{1 r}, \dots, Π_{b 1}, Π_{b 2}, \dots, Π_{b r})}^{⊤}

. This implies that

A_{1}^{⊤} A_{1} = b r I_{t}

and

Π_{i j} 1_{t} = Π_{i j}^{⊤} 1_{t} = 1_{t}

,

i \in {1, \dots, b}

,

j \in {1, \dots, r}

. Note, however, that the commutativity condition (10) still holds, and the MLEs of

B_{1}

and

Σ

can be computed via (11) with

A_{1}

given above.

4. Applications

In this section, we illustrate the results of Section 3 by analyzing two subsets of a metabolomic dataset containing measurements of the concentrations of 104 secondary metabolites in 9 barley varieties (C—CamB1, G—Georgia, H—Harmal, L—Lubuski, M—Maresi, Md—MDingo, Mx—Morex, S—Sebastian, St—Stratus) under 4 different environmental conditions related to water shortage: two droughts (I and II), a combined drought (I + II), and a control condition. Each variety was observed at 7 time points (from T2 to T8), in 4 biological replicates (plants). The data were collected at the Institute of Plant Genetics, Polish Academy of Sciences, in Poznań, in a pilot study for a larger systems biology project aimed at investigating the effects of water shortage on the concentrations of secondary metabolites in the leaves of barley plants; cf. [15].

In the examples below, we focus on the fixed effects of variety, treating the drought effects as random nuisance parameters, since the three droughts and the control condition represent only a subset of possible environmental conditions. Two models that differ in the response structure but share the same random component are considered.

Example 1.

In this example, we analyze the concentration of the first metabolite in the dataset under three environmental conditions (II, I + II, and a control), at three time points (

q = 3

) at the end of the experiment (T6, T7, T8). All four biological observations (

r = 4

) of all nine varieties (

t = 9

) are taken into account. We are interested in the linear trend in metabolite concentration over time for each variety, taking into account that the environmental conditions may additionally influence the results. Considering these environmental conditions as random blocks (

b = 3

) of size

k = r t

, and assuming that the drought condition affects the response equally at each time point, we employ the random effects EGCM (4) with a

b k \times q

matrix of observations,

Y

. Specifically, the rows of the observation matrix

Y

are arranged first by block (drought condition) and, within each block, by replicate, with varieties nested within replicates. Thus, the observations corresponding to the first drought condition appear first, followed by those from the second and third conditions. Within each block, all nine varieties are recorded sequentially for the first replicate, then for the second, and so on. This ordering implies that

A_{1} = 1_{r b} \otimes I_{t} a n d A_{2} = I_{b} \otimes 1_{k} .

Considering the linear trend in metabolite concentration over time, we assume that

C_{1} = (\begin{matrix} 1 & 1 & 1 \\ 3 & 6 & 10 \end{matrix}) .

The second row represents the numerical coding of time: third, sixth and 10th day of the second part of the experiment. This coding implies that the coefficient associated with the second row of

C_{1}

in

B_{1}

represents the rate of change in metabolite concentration per time unit. The assumption of an equal drought effect across time points leads to

C_{2} = 1_{q}^{⊤},

reducing the covariance matrix of the random drought effects to a scalar,

ψ^{2}

. Finally, since all measurements were taken under comparable experimental conditions and within a relatively short time window, it is reasonable to assume that the correlations between time points are approximately equal. Therefore, we assume that metabolite concentrations at a single time point have equal variances and identical covariances at distinct time points, resulting in a compound symmetry structure of the

3 \times 3

matrix Σ.

Using Theorem 3 we obtain

\hat{B} = (\begin{matrix} 21.0097 & 0.0609 \\ 23.4933 & - 0.0042 \\ 23.4382 & - 0.0688 \\ 23.1798 & - 0.0659 \\ 21.7026 & 0.0243 \\ 22.1970 & 0.0379 \\ 22.3129 & 0.0945 \\ 23.4419 & - 0.0590 \\ 23.5034 & - 0.0067 \end{matrix})

and

\hat{Ω} = 3.4871 (P_{1_{3}} \otimes I_{3} \otimes P_{1_{36}}) + 1.2519 (P_{1_{3}} \otimes I_{3} \otimes Q_{1_{36}}) + 1.2570 (Q_{1_{3}} \otimes I_{108})

with the estimate of the variance of block effects given by

{\hat{ψ}}^{2} = 0.0207

and

\hat{Σ} = (\begin{matrix} 1.2553 & - 0.0017 & - 0.0017 \\ - 0.0017 & 1.2553 & - 0.0017 \\ - 0.0017 & - 0.0017 & 1.2553 \end{matrix}),

implying

{\hat{σ}}^{2} = 1.2553

and

\hat{ρ} = - 0.0014

.

The estimated fixed effects matrix

{\hat{B}}_{1}

indicates that the mean concentration of the first metabolite varies moderately across varieties, with estimated intercepts ranging from 21.0097 to 23.5034. The estimated slopes, corresponding to the linear trend over time, are close to zero for most varieties and vary in sign, suggesting the absence of a consistent overall increase or decrease in metabolite concentration during the experiment.

The estimated variance components are

{\hat{ψ}}^{2} = 0.0207

for the random drought effect and

{\hat{σ}}^{2} = 1.2553

for the residual variance. The small value of

{\hat{ψ}}^{2}

indicates that the environmental (drought) conditions contribute little to the total variability in metabolite concentration, confirming that differences among varieties dominate over drought effects.

The estimated correlation parameter

\hat{ρ} = - 0.0014

suggests that correlations between time points are negligible, which supports the assumption of weak temporal dependence within each variety.

Performed analysis suggests that the concentration of the metabolite remained stable over the experimental period, and that the drought treatments did not have a substantial impact compared with varietal differences.

Recall that

{\hat{B}}_{1}

is the best linear unbiased estimator of

B_{1}

, while the MLEs of

ψ^{2}

,

σ^{2}

, and

ϱ

are biased. To assess the accuracy of the estimates for different sample sizes, we conduct small simulation studies using parameters near the estimated ones. Specifically, we generate 10,000 data matrices from the matrix normal distribution with expectation

A_{1} {\hat{B}}_{1} C_{1}

and dispersion matrix

{\hat{ψ}}^{2} C_{2}^{⊤} C_{2} \otimes A_{2} A_{2}^{⊤} + \hat{Σ} \otimes I_{n},

where

\hat{Σ} = {\hat{σ}}^{2} [(1 - \hat{ρ}) I_{q} + \hat{ρ} 1_{q} 1_{q}^{⊤}],

and we compute the MLEs of

B_{1}

,

ψ^{2}

,

σ^{2}

, and

ρ

for each simulated dataset. Note that in such a demanding setup, the only parameter that can be varied is the number of replicates, r. In Table 1, we demonstrate the empirical bias of the estimated parameters obtained from 10,000 Monte Carlo runs for four different values of r.

Results presented in Table 1 indicate that the MLEs of the variance components are biased, particularly for small sample sizes. The bias of

{\hat{σ}}^{2}

is notably larger than that of

{\hat{ψ}}^{2}

and

\hat{ρ}

, but it decreases substantially as the number of replicates r increases. The estimator of the correlation parameter

\hat{ρ}

shows only a small negative bias even for

r = 1

, which becomes negligible for larger r. The estimator

{\hat{B}}_{1}

is theoretically unbiased, and the small deviations observed in the simulation results can be attributed to sampling variability inherent in the Monte Carlo procedure. Overall, the results confirm that the accuracy of the estimators improves with increasing sample size, as expected from the asymptotic properties of maximum likelihood estimators.

Example 2.

In this example, we analyze the concentrations of all 104 metabolites measured at the end of the experiment (T8) under the same three environmental conditions (b = 3) as in previous example. The dataset includes all nine barley varieties (t = 9), each represented by four biological replicates (r = 4). The aim is to investigate the effect of variety on metabolite concentrations while accounting for the variability introduced by the environmental conditions. Treating the drought conditions as random blocks of size

k = r t

, and assuming that the drought effect is constant across all metabolites, we employ the random effects EGCM (4) with a

b k \times q

matrix of observations,

Y

, where q = 104.

Similarly as in Example 1, the rows of the observation matrix

Y

are arranged first by block (drought condition) and, within each block, by replicate, with varieties nested within replicates. Consequently, the treatment and block design matrices are the same as before, that is

A_{1} = 1_{r b} \otimes I_{t} a n d A_{2} = I_{b} \otimes 1_{k} .

Since the analysis concerns metabolite concentrations observed at a single time point, the corresponding matrix is

C_{1} = I_{q},

with

q = 104

.

Assuming that the drought effect is identical for all metabolites leads to

C_{2} = 1_{q}^{⊤},

which reduces the covariance matrix of random drought effects to a scalar,

ψ^{2}

. Finally, we assume that all metabolites have equal variances and identical pairwise correlations, resulting in a compound symmetry structure of the

104 \times 104

matrix Σ. This assumption is further supported by the heatmap of pairwise correlations between metabolites (Figure 1). The plot reveals a relatively uniform correlation structure, with no distinct clusters or strong block patterns, suggesting that the correlations between metabolites are approximately constant across pairs.

Using Theorem 3, we obtain the estimate

{\hat{B}}_{1}

of size

9 \times 104

, which is therefore not presented in detail. However, Table 2 shows the estimated mean concentrations of three metabolites with minimal variation and three metabolites with maximal variation across the nine barley varieties. Metabolites vtrait[87], vtrait[84], and vtrait[9] show very small differences in their estimated means, indicating that their levels are largely unaffected by genotype. Such metabolites are likely stable markers and may not contribute to varietal differentiation. In contrast, metabolites vtrait[25], vtrait[31], and vtrait[30] exhibit substantial differences in estimated mean concentrations between varieties, suggesting a strong genotypic effect. These metabolites are likely important in distinguishing the metabolic profiles of barley varieties and could be prioritized in further studies of varietal traits.

The estimate of Ω is given by

\hat{Ω} = 20.4000 (P_{1_{104}} \otimes I_{3} \otimes P_{1_{36}}) + 16.0083 (P_{1_{104}} \otimes I_{3} \otimes Q_{1_{36}}) + 0.7131 (Q_{1_{104}} \otimes I_{108}),

which is positive definite. The small value of

{\hat{ψ}}^{2} = 0.0012

suggests that the environmental (drought) conditions contribute only marginally to the total variability in metabolite concentration. This indicates that differences among barley varieties dominate over drought-related effects, and the random block component has a negligible impact on the model.

The estimate of the

104 \times 104

compound symmetry covariance matrix of metabolite concentrations has diagonal elements equal to

0.8602

and off-diagonal elements equal to

0.1471

. Consequently,

{\hat{σ}}^{2} = 0.8602

and

\hat{ρ} = 0.1710

, which indicates a relatively weak positive correlation among metabolites measured under the same environmental conditions.

The two examples presented above illustrate the flexibility of the EGCM in handling data with different response structures while maintaining a common specification of random effects. In the first example, the model captured the linear trend of metabolite concentration over time for each barley variety, under the assumption of a random block effect representing environmental conditions. In the second example, the same random component structure was applied to a large-dimensional response comprising 104 metabolites measured at the end of the experiment. In both cases, the assumption of compound symmetry for the covariance matrix

Σ

provided a reasonable and interpretable representation of the dependence structure between observations. However, the near-zero estimate of

ψ^{2}

in the second example suggests that the random block effects may not play a substantial role in explaining the observed variability. Overall, these analyses demonstrate that the EGCM framework offers a unified approach for modeling complex experimental designs, accommodating both longitudinal and multivariate response structures within a coherent covariance formulation.

5. Conclusions

In this paper, maximum likelihood estimation in the mixed effects EGCM was investigated under the assumption of multivariate normality. The likelihood equations were determined for a general dispersion matrix as well as for models with specific assumptions on the nuisance parameters (block effects). It was demonstrated that, for a compound symmetry structure of the dispersion matrix, these equations simplify considerably. Moreover, for experiments designed in balanced complete blocks, the MLEs of the unknown parameters can be obtained in closed form.

In addition, we demonstrated an analogy between fixed and mixed effects EGCM with respect to sufficiency. This result implies that the original datasets can be reduced to sufficient statistics, which enables more efficient data analysis, as only matrices of substantially lower dimensions are involved in the estimation process. Furthermore, an additional advantage of using sufficient statistics lies in the facilitation of data storage and transfer.

The practical applicability of the proposed framework was illustrated through two examples involving metabolomic data from a barley drought experiment. These analyses confirmed that the EGCM approach can effectively accommodate both longitudinal and large-dimensional multivariate response structures within a unified modeling framework. The results also highlighted that, in some cases, the random block component may have negligible influence, suggesting that model simplification could be justified without loss of interpretability.

Overall, the EGCM framework provides a flexible and computationally tractable tool for modeling complex experimental designs, offering both theoretical clarity and practical efficiency.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/sym17111901/s1.

Author Contributions

Conceptualization, K.F., A.M. and P.K.; methodology, K.F. and A.M.; validation, A.M. and P.K.; formal analysis, K.F.; investigation, K.F. and A.M.; resources, P.K. and H.Ć.-K.; data curation, H.Ć.-K.; writing—original draft preparation, K.F.; writing—review and editing, A.M. and P.K.; supervision, P.K.; funding acquisition, K.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially funded by the grant no. 0213/SBAD/0122 from Poznań University of Technology (K. Filipiak).

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A. Sketch of the Proof of Theorem 1

Consider the fixed effects EGCM with two nuisance parameters, as in [5], where

C (C_{1}^{⊤}) \subseteq C (C_{3}^{⊤})

and

C (C_{2}^{⊤}) \subseteq C (C_{3}^{⊤})

. To apply the results of Theorem 4 in [5], it must additionally hold that

C (C_{2}^{⊤}) \subseteq C (C_{1}^{⊤})

. For

C_{2} = 0

, all the above assumptions are satisfied, and hence we can follow the steps of the proof of that theorem, focusing additionally on the estimation of

B_{3}

.

From the normal equations we have:

A_{3} B_{3} C_{3} = P_{A_{3}} {(Y - A_{1} B_{1} C_{1})}^{⊤} (Y - A_{1} B_{1} C_{1}) P_{C_{3}^{⊤}; Σ^{- 1}}^{⊤} .

(A1)

Since

C (C_{1}^{⊤}) \subseteq C (C_{3}^{⊤})

, replacing

A_{3} B_{3} C_{3}

in the normal equation for

B_{1}

, we get

A_{1}^{⊤} Q_{A_{3}} (Y - A_{1} B_{1} C_{1}) Σ^{- 1} C_{1} = 0,

which implies

A_{1} B_{1} C_{1} = P_{A_{1}; Q_{A_{3}}} Y P_{C_{1}^{⊤}; Σ^{- 1}}^{⊤};

(A2)

cf. [5] (Formula (16)).

Let

S_{1} = Y^{⊤} Q_{(A_{1} : A_{3})} Y

. Since

P_{(A_{1} : A_{3})} (A_{1} B_{1} C_{1} + A_{3} B_{3} C_{3}) = A_{1} B_{1} C_{1} + A_{3} B_{3} C_{3}

and

P_{(A_{1} : A_{3})} = P_{A_{3}} + P_{Q_{A_{3}} A_{1}}

, we can express

n Σ = {(Y - A_{1} B_{1} C_{1} - A_{3} B_{3} C_{3})}^{⊤} \cdot (Y - A_{1} B_{1} C_{1} - A_{3} B_{3} C_{3})

as

n Σ = S_{1} + {(P_{Q_{A_{3}} A_{1}} Y Q_{C_{1}^{⊤}; Σ^{- 1}}^{⊤} + P_{A_{3}} Y Q_{C_{3}^{⊤}; Σ^{- 1}}^{⊤})}^{⊤} (P_{Q_{A_{3}} A_{1}} Y Q_{C_{1}^{⊤}; Σ^{- 1}}^{⊤} + P_{A_{3}} Y Q_{C_{3}^{⊤}; Σ^{- 1}}^{⊤}) .

(A3)

Multiplying (A3) by

Σ^{- 1} C_{1}^{⊤}

and since for

i \in {1, 3}

the column-space condition implies

Q_{C_{i}^{⊤}; Σ^{- 1}} Σ^{- 1} C_{1}^{⊤} = 0

, we obtain

n C_{1}^{⊤} = S_{1} Σ^{- 1} C_{1}^{⊤},

and therefore

C_{1} Σ^{- 1} = n C_{1} S_{1}^{- 1}

.

Multiplying (A3) by

Σ^{- 1} C_{3}^{⊤}

and since

Q_{C_{3}^{⊤}; Σ^{- 1}} Σ^{- 1} C_{3}^{⊤} = 0

and

P_{3} P_{Q_{A_{3}} A_{1}} = 0

, we obtain

n C_{3}^{⊤} = S_{2} Σ^{- 1} C_{3}^{⊤},

with

S_{2} = S_{1} + Q_{C_{1}^{⊤}; S_{1}^{- 1}} Y^{⊤} P_{Q_{A_{3}} A_{1}} Y Q_{C_{1}^{⊤}; S_{1}^{- 1}}^{⊤}

, and therefore

C_{3} Σ^{- 1} = n C_{3} S_{2}^{- 1}

.

Substituting the above into (A1) and (A2), and replacing index “3” by “2”, we obtain the result stated in the theorem.

References

Potthoff, R.F.; Roy, S.N. A generalized multivariate analysis of variance model useful especially for growth curve problems. Biometrika 1964, 51, 313–326. [Google Scholar] [CrossRef]
Kollo, T.; von Rosen, D. Advanced Multivariate Statistics with Matrices; Springer: Dordrecht, The Netherlands, 2005. [Google Scholar]
Žežula, I. Special variance structures in the growth curve model. J. Multivar. Anal. 2006, 97, 606–618. [Google Scholar] [CrossRef]
Verbyla, A.P.; Venables, W.N. An extension of the growth curve model. Biometrika 1988, 75, 129–138. [Google Scholar] [CrossRef]
Filipiak, K.; Markiewicz, A.; Szczepańska, A. Optimal designs under a multivariate linear model with additional nuisance parameters. Stat. Papers 2009, 50, 761–778. [Google Scholar] [CrossRef]
Žežula, I. Remarks of unibased estimation of the sum-of-profiles model parameters. Tatra Mt. Math. Publ. 2008, 39, 45–52. [Google Scholar]
Arnold, S.F. The Theory of Linear Models and Multivariate Analysis; Wiley: New York, NY, USA, 1981. [Google Scholar]
Filipiak, K.; Klein, D.; Vojtkova, E. The properties of partial trace and block trace operators of partitioned matrices. Electron. J. Linear Algebra 2018, 33, 3–15. [Google Scholar] [CrossRef]
Magnus, J.R.; Neudecker, H. Symmetry, 0–1 matrices and Jacobians. Econom. Theory 1986, 2, 157–190. [Google Scholar] [CrossRef]
Fackler, P.L. Notes on Matrix Calculus. Available online: https://erho.weebly.com/uploads/2/7/8/4/27841631/matrixcalc.pdf (accessed on 10 October 2025).
Wu, Q.-G. Existence conditions of the uniformly minimum risk unbiased estimators in extended growth curve models. J. Statist. Plann. Inference 1998, 69, 101–114. [Google Scholar] [CrossRef]
Shah, K.R.; Sinha, B.K. Theory of Optimal Designs; Springer-Verlag: Berlin/Heidelberg, Germany, 1989. [Google Scholar]
Rao, C.R. Least squares theory using an estimated dispersion matrix and its application to measurement of signals. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, 21 June–18 July 1965, 27 December 1965–7 January 1966; Le Cam, L.M., Neyman, J., Eds.; University of California Press: Berkeley, CA, USA, 1967; Volume 1, pp. 355–372. [Google Scholar]
Filipiak, K.; John, M.; Markiewicz, A. Comments on maximum likelihood estimation and projections under multivariate statistical models. In Recent Developments in Multivariate and Random Matrix Analysis; Holgersson, T., Singull, M., Eds.; Springer: Cham, Germany, 2020; pp. 51–66. [Google Scholar]
Piasecka, A.; Sawikowska, A.; Krajewski, P.; Kachlicki, P. Combined mass spectrometric and chromatographic methods for in-depth analysis of phenolic secondary metabolites in barley leaves. J. Mass Spectrom. 2015, 50, 513–532. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Heatmap of pairwise correlations between the 104 metabolites measured at the T8 time point.

Table 1. Empirical bias of the estimated parameters for

r \in {1, 2, 4, 8}

obtained from 10,000 Monte Carlo runs.

Table 1. Empirical bias of the estimated parameters for

r \in {1, 2, 4, 8}

obtained from 10,000 Monte Carlo runs.

	$r = 1$	$r = 2$
$ψ^{2}$	−0.0062	−0.0104
$σ^{2}$	−0.2791	−0.1347
$ρ$	−0.0728	−0.0282
$B_{1}$	$(\begin{matrix} 0.0223 & - 0.0023 \\ - 0.0040 & 0.0005 \\ - 0.0126 & 0.0016 \\ - 0.0101 & 0.0011 \\ - 0.0163 & 0.0019 \\ - 0.0231 & 0.0024 \\ - 0.0012 & - 0.0006 \\ - 0.0030 & - 0.0007 \\ - 0.0040 & 0.0012 \end{matrix})$	$(\begin{matrix} 0.0022 & - 0.0001 \\ 0.0028 & - 0.0009 \\ - 0.0032 & - 0.0003 \\ 0.0120 & - 0.0015 \\ 0.0050 & - 0.0005 \\ - 0.0114 & 0.0014 \\ - 0.0007 & - 0.0004 \\ - 0.0006 & - 0.0004 \\ 0.0005 & - 0.0003 \end{matrix})$
	$r = 4$	$r = 8$
$ψ^{2}$	−0.0095	−0.0086
$σ^{2}$	−0.0667	−0.0337
$ρ$	−0.0133	−0.0064
$B_{1}$	$(\begin{matrix} - 0.0013 & 0.0003 \\ - 0.0053 & 0.0010 \\ - 0.0049 & 0.0004 \\ 0.0008 & - 0.0004 \\ - 0.0072 & 0.0008 \\ - 0.0022 & 0.0008 \\ 0.0000 & - 0.0003 \\ - 0.0062 & 0.0005 \\ 0.0022 & 0.0001 \end{matrix})$	$(\begin{matrix} 0.0000 & 0.0005 \\ - 0.0033 & 0.0003 \\ - 0.0038 & 0.0005 \\ 0.0034 & - 0.0007 \\ - 0.0040 & 0.0003 \\ - 0.0034 & 0.0004 \\ 0.0007 & - 0.0001 \\ - 0.0091 & 0.0010 \\ 0.0038 & - 0.0004 \end{matrix})$

Table 2. Estimated mean concentrations of selected metabolites across nine barley varieties. Left: metabolites with minimal variation between varieties. Right: metabolites with maximal variation between varieties.

variety	vtrait[9]	vtrait[84]	vtrait[87]	vtrait[25]	vtrait[30]	vtrait[31]
1	27.8508	22.8692	24.3817	24.8125	28.9858	26.8550
2	22.4475	23.2183	24.6992	21.7183	28.4525	29.4067
3	23.0925	23.5933	24.7342	22.5092	28.6792	28.9925
4	22.5083	23.1550	24.5750	21.6408	28.2775	28.8625
5	23.2758	23.0300	24.7800	24.9775	28.4075	27.7183
6	22.6392	23.1475	24.7942	21.9442	28.3717	28.9525
7	22.3358	22.7808	24.4058	21.8908	28.5150	29.3667
8	22.5292	23.1825	24.9267	21.9033	28.7233	29.3100
9	22.8033	22.6975	24.4250	21.6517	28.6667	29.8375

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Filipiak, K.; Markiewicz, A.; Krajewski, P.; Ćwiek-Kupczyńska, H. Estimation and Sufficiency Under the Mixed Effects Extended Growth Curve Model with Compound Symmetry Covariance Structure. Symmetry 2025, 17, 1901. https://doi.org/10.3390/sym17111901

AMA Style

Filipiak K, Markiewicz A, Krajewski P, Ćwiek-Kupczyńska H. Estimation and Sufficiency Under the Mixed Effects Extended Growth Curve Model with Compound Symmetry Covariance Structure. Symmetry. 2025; 17(11):1901. https://doi.org/10.3390/sym17111901

Chicago/Turabian Style

Filipiak, Katarzyna, Augustyn Markiewicz, Paweł Krajewski, and Hanna Ćwiek-Kupczyńska. 2025. "Estimation and Sufficiency Under the Mixed Effects Extended Growth Curve Model with Compound Symmetry Covariance Structure" Symmetry 17, no. 11: 1901. https://doi.org/10.3390/sym17111901

APA Style

Filipiak, K., Markiewicz, A., Krajewski, P., & Ćwiek-Kupczyńska, H. (2025). Estimation and Sufficiency Under the Mixed Effects Extended Growth Curve Model with Compound Symmetry Covariance Structure. Symmetry, 17(11), 1901. https://doi.org/10.3390/sym17111901

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Estimation and Sufficiency Under the Mixed Effects Extended Growth Curve Model with Compound Symmetry Covariance Structure

Abstract

1. Introduction

2. Estimation Under Mixed EGCM

3. Block Effects Model with Compound Symmetry Matrix $Σ$

4. Applications

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Sketch of the Proof of Theorem 1

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Estimation and Sufficiency Under the Mixed Effects Extended Growth Curve Model with Compound Symmetry Covariance Structure

Abstract

1. Introduction

2. Estimation Under Mixed EGCM

3. Block Effects Model with Compound Symmetry Matrix Σ

4. Applications

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Sketch of the Proof of Theorem 1

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3. Block Effects Model with Compound Symmetry Matrix $Σ$