Abstract
This paper provides a complete matrix analysis of the equivalence of estimation and inference results under a true multivariate linear model and a misspecified form of it with an augmentation part, through the cogent use of various algebraic formulas and facts from matrix analysis. The study covers the matrix derivations of the best linear unbiased estimators under the true and misspecified models, and establishes necessary and sufficient conditions for the different estimators to be equivalent under the model assumptions.
Keywords:
augmentation; BLUE; BLUP; Kronecker product; multivariate linear model; parameter matrix; rank; reduced model
MSC:
15A10; 62J05; 62H12
1. Introduction
Throughout this article, we use $\mathbb{R}^{m\times n}$ to stand for the collection of all $m\times n$ matrices with real entries; $\mathbf{A}'$, $r(\mathbf{A})$, and $\mathscr{R}(\mathbf{A})$ to stand for the transpose, the rank, and the range (column space) of a matrix $\mathbf{A}\in\mathbb{R}^{m\times n}$, respectively; and $\mathbf{I}_m$ to denote the identity matrix of order $m$. Two symmetric matrices $\mathbf{A}$ and $\mathbf{B}$ of the same size are said to satisfy the inequality $\mathbf{A}\succcurlyeq\mathbf{B}$ in the Löwner partial ordering if $\mathbf{A}-\mathbf{B}$ is positive semi-definite. The Kronecker product of any two matrices $\mathbf{A}=(a_{ij})\in\mathbb{R}^{m\times n}$ and $\mathbf{B}\in\mathbb{R}^{p\times q}$ is defined to be $\mathbf{A}\otimes\mathbf{B}=(a_{ij}\mathbf{B})\in\mathbb{R}^{mp\times nq}$. The vectorization operator applied to a matrix $\mathbf{A}=[\mathbf{a}_1,\ldots,\mathbf{a}_n]$ is defined to be $\operatorname{vec}(\mathbf{A})=[\mathbf{a}_1',\ldots,\mathbf{a}_n']'$. A well-known property of the vec operator applied to a triple matrix product is $\operatorname{vec}(\mathbf{A}\mathbf{B}\mathbf{C})=(\mathbf{C}'\otimes\mathbf{A})\operatorname{vec}(\mathbf{B})$. The Moore–Penrose inverse of $\mathbf{A}$, denoted by $\mathbf{A}^{+}$, is defined to be the unique solution $\mathbf{X}$ of the four matrix equations $\mathbf{A}\mathbf{X}\mathbf{A}=\mathbf{A}$, $\mathbf{X}\mathbf{A}\mathbf{X}=\mathbf{X}$, $(\mathbf{A}\mathbf{X})'=\mathbf{A}\mathbf{X}$, and $(\mathbf{X}\mathbf{A})'=\mathbf{X}\mathbf{A}$. We also denote by $\mathbf{P}_{\mathbf{A}}=\mathbf{A}\mathbf{A}^{+}$, $\mathbf{A}^{\perp}=\mathbf{E}_{\mathbf{A}}=\mathbf{I}_m-\mathbf{A}\mathbf{A}^{+}$, and $\mathbf{F}_{\mathbf{A}}=\mathbf{I}_n-\mathbf{A}^{+}\mathbf{A}$ the three orthogonal projectors induced from $\mathbf{A}$, which help to denote briefly the calculation processes related to generalized inverses of matrices. We also adopt the notation $[\mathbf{A},\mathbf{B}]$ for a block matrix composed of $\mathbf{A}$ and $\mathbf{B}$. Further information about the orthogonal projectors $\mathbf{P}_{\mathbf{A}}$, $\mathbf{E}_{\mathbf{A}}$, and $\mathbf{F}_{\mathbf{A}}$ and their applications in linear statistical models can be found, e.g., in [1,2,3].
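Since the conventions above drive all later derivations, the following numerical sketch (with arbitrary example matrices; NumPy's `pinv` plays the role of the Moore–Penrose inverse) checks the triple-product vec identity, the four Penrose equations, and the three induced projectors:

```python
# Sanity check of the notation: vec(ABC) = (C' kron A) vec(B), the four
# Penrose equations for A^+, and the projectors P_A, E_A, F_A.
# The matrices here are arbitrary examples, not quantities from the paper.
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 3))
B = rng.standard_normal((3, 5))
C = rng.standard_normal((5, 2))

# vec stacks the columns; NumPy is row-major, so flatten with order='F'.
vec = lambda M: M.flatten(order="F")

# Triple-product identity: vec(ABC) = (C' kron A) vec(B).
assert np.allclose(vec(A @ B @ C), np.kron(C.T, A) @ vec(B))

# A^+ satisfies the four Penrose equations.
Ap = np.linalg.pinv(A)
assert np.allclose(A @ Ap @ A, A)
assert np.allclose(Ap @ A @ Ap, Ap)
assert np.allclose((A @ Ap).T, A @ Ap)
assert np.allclose((Ap @ A).T, Ap @ A)

# The induced projectors are symmetric and idempotent; E_A annihilates A.
P_A = A @ Ap
E_A = np.eye(4) - P_A
F_A = np.eye(3) - Ap @ A
for P in (P_A, E_A, F_A):
    assert np.allclose(P, P.T) and np.allclose(P @ P, P)
assert np.allclose(E_A @ A, 0)
```

The `order="F"` flag matters: the column-stacking convention is what makes the Kronecker factor appear as $\mathbf{C}'\otimes\mathbf{A}$ rather than $\mathbf{A}\otimes\mathbf{C}'$.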
In this paper, we consider the multivariate general linear model
$$\mathbf{Y}=\mathbf{X}\mathbf{\Theta}+\mathbf{\Psi},\qquad E(\operatorname{vec}(\mathbf{\Psi}))=\mathbf{0},\qquad D(\operatorname{vec}(\mathbf{\Psi}))=\sigma^{2}\,\mathbf{\Sigma}_{2}\otimes\mathbf{\Sigma}_{1}, \tag{1}$$
where $\mathbf{Y}\in\mathbb{R}^{n\times m}$ is an observable random matrix (a longitudinal data set), $\mathbf{X}\in\mathbb{R}^{n\times p}$ is a known model matrix of arbitrary rank, $\mathbf{\Theta}\in\mathbb{R}^{p\times m}$ is a matrix of fixed but unknown parameters, $E(\operatorname{vec}(\mathbf{\Psi}))$ and $D(\operatorname{vec}(\mathbf{\Psi}))$ denote the expectation vector and the dispersion matrix of the random error matrix $\mathbf{\Psi}$, $\mathbf{\Sigma}_{1}\in\mathbb{R}^{n\times n}$ and $\mathbf{\Sigma}_{2}\in\mathbb{R}^{m\times m}$ are two known positive semi-definite matrices of arbitrary ranks, and $\sigma^{2}$ is an arbitrary positive scaling factor. As we know, a multivariate general linear model (for short, MGLM) such as (1) is a relatively direct extension of the familiar univariate general linear model.
The assumption in (1) is typical in estimation and statistical inference under a multivariate linear regression framework. In statistical practice, we may encounter the situation where a true regression model is misspecified in some other form for unforeseeable reasons, and we therefore face the task of comparing estimation and inference results and establishing links between them, so as to reasonably explain and utilize the misspecified regression model. In this light, one class of model misspecification problems arises from adding or deleting regressors in the model. For example, if we take (1) as the true model and mistakenly add a new regressor part $\mathbf{Z}\mathbf{\Gamma}$ to (1), we obtain an over-parameterized (over-fitted) form of it:
$$\mathbf{Y}=\mathbf{X}\mathbf{\Theta}+\mathbf{Z}\mathbf{\Gamma}+\mathbf{\Psi}, \tag{2}$$
where $\mathbf{Z}\in\mathbb{R}^{n\times q}$ is a known matrix of arbitrary rank, and $\mathbf{\Gamma}\in\mathbb{R}^{q\times m}$ is a matrix of fixed but unknown parameters. Given (1) and (2), we proposed and studied in [4] some research problems on the equivalence of inference results obtained from the two competing MGLMs.
As we know, a commonly used technique for handling a partitioned model is to multiply it by a certain annihilating matrix, transforming the model equation into a reduced form. As a new exploration of the equivalence problem, we introduce this technique into the study of (1) and (2). To do so, we pre-multiply both sides of the model equation in (2) by $\mathbf{Z}^{\perp}=\mathbf{I}_n-\mathbf{Z}\mathbf{Z}^{+}$ and note that $\mathbf{Z}^{\perp}\mathbf{Z}=\mathbf{0}$ to obtain the following reduced model:
$$\mathbf{Z}^{\perp}\mathbf{Y}=\mathbf{Z}^{\perp}\mathbf{X}\mathbf{\Theta}+\mathbf{Z}^{\perp}\mathbf{\Psi}. \tag{3}$$
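The reduction step can be illustrated numerically. In this sketch, `X`, `Z`, `Theta`, `Gamma`, and `Psi` are stand-ins for the model quantities described above (the concrete matrices are arbitrary), and `Zperp` is the annihilator $\mathbf{I}-\mathbf{Z}\mathbf{Z}^{+}$:

```python
# Illustration of the annihilator Z^perp = I - Z Z^+ used to form the
# reduced model: pre-multiplying the over-parameterized equation by Z^perp
# removes the augmentation term, since Z^perp Z = 0.
import numpy as np

rng = np.random.default_rng(1)
n, p, q, m = 8, 3, 2, 4
X = rng.standard_normal((n, p))
Z = rng.standard_normal((n, q))
Theta = rng.standard_normal((p, m))
Gamma = rng.standard_normal((q, m))
Psi = rng.standard_normal((n, m))          # a realized error matrix

Zperp = np.eye(n) - Z @ np.linalg.pinv(Z)
assert np.allclose(Zperp @ Z, 0)           # the augmentation part vanishes

# Y = X Theta + Z Gamma + Psi reduces to Z^perp Y = Z^perp X Theta + Z^perp Psi.
Y = X @ Theta + Z @ Gamma + Psi
assert np.allclose(Zperp @ Y, Zperp @ X @ Theta + Zperp @ Psi)
```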
It should be pointed out that the estimation and inference results derived from the triple models in (1)–(3) are not necessarily identical. Thus, it is a primary task to describe the links between the models and to propose and characterize possible equalities among estimation and inference results under the three MGLMs.
Before approaching the comparison problems for estimation and inference results under the triple models in (1)–(3), we mention a well-known and effective method that has been widely used in the investigation of multivariate general linear models. Recall that Kronecker products and vec operations of matrices are popular tools for handling matrix operations in relation to multivariate general linear models. Using these operations, we can alternatively represent the triple models in (1)–(3) as the following three standard linear statistical models:
$$\operatorname{vec}(\mathbf{Y})=(\mathbf{I}_m\otimes\mathbf{X})\operatorname{vec}(\mathbf{\Theta})+\operatorname{vec}(\mathbf{\Psi}), \tag{4}$$
$$\operatorname{vec}(\mathbf{Y})=(\mathbf{I}_m\otimes\mathbf{X})\operatorname{vec}(\mathbf{\Theta})+(\mathbf{I}_m\otimes\mathbf{Z})\operatorname{vec}(\mathbf{\Gamma})+\operatorname{vec}(\mathbf{\Psi}), \tag{5}$$
$$(\mathbf{I}_m\otimes\mathbf{Z}^{\perp})\operatorname{vec}(\mathbf{Y})=(\mathbf{I}_m\otimes\mathbf{Z}^{\perp}\mathbf{X})\operatorname{vec}(\mathbf{\Theta})+(\mathbf{I}_m\otimes\mathbf{Z}^{\perp})\operatorname{vec}(\mathbf{\Psi}). \tag{6}$$
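As a check on the vectorized representation, the following sketch (with arbitrary stand-in matrices) confirms that vectorizing the matrix model equation produces the standard linear-model form $\operatorname{vec}(\mathbf{Y})=(\mathbf{I}_m\otimes\mathbf{X})\operatorname{vec}(\mathbf{\Theta})+\operatorname{vec}(\mathbf{\Psi})$:

```python
# Vectorizing Y = X Theta + Psi yields a univariate-style linear model
# vec(Y) = (I_m kron X) vec(Theta) + vec(Psi), by the triple-product identity
# vec(X Theta I_m) = (I_m kron X) vec(Theta). Matrices are arbitrary examples.
import numpy as np

rng = np.random.default_rng(2)
n, p, m = 6, 3, 4
X = rng.standard_normal((n, p))
Theta = rng.standard_normal((p, m))
Psi = rng.standard_normal((n, m))
Y = X @ Theta + Psi

vec = lambda M: M.flatten(order="F")   # column-stacking vec operator
assert np.allclose(vec(Y), np.kron(np.eye(m), X) @ vec(Theta) + vec(Psi))
```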
As a common fact in statistical analysis, the first step in the inference of (1) is to estimate/predict certain functions of the unknown parameter matrices $\mathbf{\Theta}$ and $\mathbf{\Psi}$. Based on this consideration, it is of great interest to identify their estimators and predictors simultaneously. For this purpose, we construct a general parametric matrix $\mathbf{\Phi}$ that involves both $\mathbf{\Theta}$ and $\mathbf{\Psi}$ as follows:
$$\mathbf{\Phi}=\mathbf{K}\mathbf{\Theta}+\mathbf{J}\mathbf{\Psi}, \tag{7}$$
where $\mathbf{K}$ and $\mathbf{J}$ are $k\times p$ and $k\times n$ matrices, respectively. In this situation, we easily obtain that
$$E(\mathbf{\Phi})=\mathbf{K}\mathbf{\Theta}\quad\text{and}\quad \operatorname{vec}(\mathbf{\Phi})=(\mathbf{I}_m\otimes\mathbf{K})\operatorname{vec}(\mathbf{\Theta})+(\mathbf{I}_m\otimes\mathbf{J})\operatorname{vec}(\mathbf{\Psi}) \tag{8}$$
hold. Under the assumptions of the triple models in (1)–(3), the corresponding predictors of $\mathbf{\Phi}$ in (7) are not necessarily identical, even if we use the same optimality criterion to derive them under the three competing models. This fact leads us to propose and study a series of research problems on the comparison and equivalence of inference results obtained from the triple models. In order to obtain general results under (1)–(8), we do not require probability distributions of the random variables in the MGLMs, although such distributions are necessary for further discussion of identification and testing problems.
The purpose of this paper is to consider some concrete problems on the comparison of the best linear unbiased estimators derived from (1) with those derived from (2) and (3). Historically, there have been investigations on establishing possible equalities of estimators of unknown parameter matrices in two competing linear models; see, e.g., [5,6], while equalities of estimators of unknown parameter vectors under linear models with new regressors (augmentation by nuisance parameters) were approached in [7,8,9,10,11,12,13,14,15,16,17]. In particular, the present two authors studied in [4] the equivalences of estimation and inference results under (1), (2), (4), and (5). As an updated work on this subject, we introduce the two reduced models in (3) and (6) and carry out a new analysis of the equivalences of estimators under (1)–(6).
The remainder of this paper is organized as follows: In Section 2, we introduce some matrix analysis tools that can be used to characterize equalities involving algebraic operations of matrices and their generalized inverses. In Section 3, we present a standard procedure for describing the predictability and estimability of parametric matrices under the triple models in (1)–(3), and then show how to establish analytical expressions for the best linear unbiased predictors and best linear unbiased estimators of parametric matrices under these models. In Section 4, we discuss a group of problems on the equivalences of the BLUEs under (1)–(3).
2. Some Preliminaries
In order to establish the proposed mathematical equalities for predictors/estimators under the triple models in (1)–(3), we need a series of basic rank equalities, given in the following two lemmas:
Lemma 1
([18]). Let and Then,
In particular, the following results hold:
- (a)
- (b)
- (c)
- (d)
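A representative member of this family of rank equalities from [18] is $r[\mathbf{A},\mathbf{B}]=r(\mathbf{A})+r(\mathbf{E}_{\mathbf{A}}\mathbf{B})$, where $\mathbf{E}_{\mathbf{A}}=\mathbf{I}-\mathbf{A}\mathbf{A}^{+}$. The following sketch verifies it numerically on arbitrary example matrices (this is an illustration of the technique, not a restatement of the lemma's full list):

```python
# Numerical check of the Marsaglia–Styan column-block rank identity
# r[A, B] = r(A) + r(E_A B), with a deliberately rank-deficient A.
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((6, 3)) @ rng.standard_normal((3, 4))  # 6x4, rank 3
B = rng.standard_normal((6, 2))

E_A = np.eye(6) - A @ np.linalg.pinv(A)   # orthogonal projector onto R(A)^perp
r = np.linalg.matrix_rank
assert r(np.hstack([A, B])) == r(A) + r(E_A @ B)
```

Identities of this kind are exactly what turn the equivalence questions of Section 4 into checkable rank equalities.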
A special consequence of (14) is given below, which we shall use to simplify some complicated matrix expressions that involve generalized inverses in the sequel.
Lemma 2.
Assume that five matrices and of appropriate sizes satisfy the conditions and Then,
Hence,
Matrix rank formulas and their consequences, as displayed in Lemmas 1 and 2, are now widely recognized as useful techniques for constructing and characterizing various simple or complicated algebraic equalities for matrices and their operations. We refer the reader to [19] and the references therein for the matrix rank method in the investigation of various linear statistical models.
Lemma 3
([20]). The linear matrix equation $\mathbf{A}\mathbf{X}=\mathbf{B}$ is consistent if and only if $\mathbf{A}\mathbf{A}^{+}\mathbf{B}=\mathbf{B}$, or equivalently, $\mathscr{R}(\mathbf{B})\subseteq\mathscr{R}(\mathbf{A})$. In this case, the general solution of the equation can be written in the parametric form $\mathbf{X}=\mathbf{A}^{+}\mathbf{B}+(\mathbf{I}-\mathbf{A}^{+}\mathbf{A})\mathbf{U}$, where $\mathbf{U}$ is an arbitrary matrix of appropriate size.
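Lemma 3 is easy to check numerically: consistency of $\mathbf{A}\mathbf{X}=\mathbf{B}$ is equivalent to $\mathbf{A}\mathbf{A}^{+}\mathbf{B}=\mathbf{B}$, and every member of the parametric family $\mathbf{A}^{+}\mathbf{B}+(\mathbf{I}-\mathbf{A}^{+}\mathbf{A})\mathbf{U}$ solves the equation. A minimal sketch with arbitrary matrices:

```python
# Check of Lemma 3 (Penrose): consistency test and the general solution
# of A X = B, built so that the equation is consistent by construction.
import numpy as np

rng = np.random.default_rng(4)
A = rng.standard_normal((5, 3))
B = A @ rng.standard_normal((3, 2))   # B in the range of A => consistent
Ap = np.linalg.pinv(A)

# Consistency criterion: A A^+ B = B.
assert np.allclose(A @ Ap @ B, B)

# Every member of the family A^+ B + (I - A^+ A) U solves A X = B.
for _ in range(3):
    U = rng.standard_normal((3, 2))
    X = Ap @ B + (np.eye(3) - Ap @ A) @ U
    assert np.allclose(A @ X, B)
```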
Finally, we present the following established result on a constrained quadratic matrix-valued function minimization problem.
Lemma 4
([21,22]). Let
where , , are given, is positive semi-definite, and the matrix equation is consistent. Then, there always exists a solution of such that
holds for all solutions of . In this case, the matrix satisfying (18) is determined by the following consistent matrix equation:
while the general expression of and the corresponding are given by
where and is arbitrary.
3. The Precise Theory of Predictability, Estimability, and BLUP/BLUE
In this section, we present a standard procedure for establishing predictability, estimability, and BLUP theory under an MGLM, for the purpose of solving the comparison problems proposed in Section 1. Most of the material given below consists of routine illustrations of known concepts, definitions, and fundamental results on MGLMs; see, e.g., [4].
Definition 1.
Let be as given in (7). Then,
Definition 2.
Let be as given in (7). Then,
- (a)
- (b)
Recall that the unbiasedness of given predictors/estimators and the minimal covariance matrices formulated in (23) and (24) are intrinsic requirements in the statistical analysis of parametric regression models; they can be regarded as special cases of mathematical optimization problems on constrained quadratic matrix-valued functions in the Löwner partial ordering. Note from (1) and (7) that and can be rewritten as
Hence, the expectations of and can be expressed as
The dispersion matrix of can be expressed as
where .
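Dispersion calculations of this kind rest on the mixed-product rule $(\mathbf{A}\otimes\mathbf{B})(\mathbf{C}\otimes\mathbf{D})=(\mathbf{A}\mathbf{C})\otimes(\mathbf{B}\mathbf{D})$ for Kronecker products. The following sketch (arbitrary matrices; `S1` and `S2` stand in for the two covariance components of the model) checks the transformation pattern $(\mathbf{I}\otimes\mathbf{J})(\mathbf{\Sigma}_2\otimes\mathbf{\Sigma}_1)(\mathbf{I}\otimes\mathbf{J})'=\mathbf{\Sigma}_2\otimes(\mathbf{J}\mathbf{\Sigma}_1\mathbf{J}')$:

```python
# Mixed-product rule behind the dispersion formulas: transforming a
# Kronecker-structured covariance S2 kron S1 by I kron J preserves the
# Kronecker structure. S1, S2 are arbitrary psd examples.
import numpy as np

rng = np.random.default_rng(5)
n, m, k = 5, 3, 4
J = rng.standard_normal((k, n))
S1 = rng.standard_normal((n, n)); S1 = S1 @ S1.T   # psd stand-in for Sigma_1
S2 = rng.standard_normal((m, m)); S2 = S2 @ S2.T   # psd stand-in for Sigma_2

T = np.kron(np.eye(m), J)                # the transformation I_m kron J
lhs = T @ np.kron(S2, S1) @ T.T
rhs = np.kron(S2, J @ S1 @ J.T)
assert np.allclose(lhs, rhs)
```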
Concerning the predictability of in (7), we have the following known result.
Lemma 5
- (a)
- Φ is predictable by $\mathbf{Y}$ in (1).
- (b)
- (c)
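A classical way to test conditions of this kind in practice is the range criterion for estimability: $\mathbf{K}\mathbf{\Theta}$ is estimable iff $\mathscr{R}(\mathbf{K}')\subseteq\mathscr{R}(\mathbf{X}')$, equivalently $\mathbf{K}\mathbf{X}^{+}\mathbf{X}=\mathbf{K}$. This is the standard textbook criterion rather than the lemma's exact statement, and the matrices below are arbitrary examples:

```python
# Estimability check via the range criterion K X^+ X = K, using a
# rank-deficient design so that not every K Theta is estimable.
import numpy as np

rng = np.random.default_rng(6)
n, p = 7, 4
X = rng.standard_normal((n, 2)) @ rng.standard_normal((2, p))   # rank 2 < p
Xp = np.linalg.pinv(X)

K_est = rng.standard_normal((3, n)) @ X      # rows of K in the row space of X
assert np.allclose(K_est @ Xp @ X, K_est)    # estimable: K X^+ X = K

K_bad = rng.standard_normal((3, p))          # generic K with rank(X) < p
assert not np.allclose(K_bad @ Xp @ X, K_bad)   # not estimable
```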
Theorem 1.
Assume Φ in (7) is predictable. Then,
The matrix equation in (29), called the BLUP equation associated with $\mathbf{\Phi}$, is consistent as well, i.e.,
holds under Lemma 5(c), while the general expressions of and the corresponding can be written as
where is arbitrary. In particular,
where is arbitrary. Furthermore, the following results hold.
- (a)
- and
- (b)
- is unique if and only if
- (c)
- is unique if and only if holds with probability 1.
- (d)
- The expectation, the dispersion matrices of and as well as the covariance matrix between and are unique, and are given by
- (e)
- and satisfy
- (f)
- holds for any matrix
Proof.
We obtain from (28) that the constrained minimization problem in (23) is equivalent to
which is further reduced to
Since is a non-null nonnegative definite matrix, we apply Lemma 4 to (43) to yield the matrix equation
as required for (29). Equations (32) and (33) follow directly from (31). Result (a) is well known on the matrix ; see, e.g., [1,2].
Note that
by (10). Combining this fact with (31) leads to (b). Setting the term in (31) leads to (c).
From (1) and (31),
establishing (36). Combining (8) and (35) yields (37). Substituting (31) into (28) and simplifying, we obtain
Concerning the BLUEs of the mean matrix , and the BLUP of the error matrix in (1), we have the following results.
Corollary 1.
In this case, the matrix equation
is consistent, and the following results hold.
- (a)
- The general expression of can be written as with , where is arbitrary.
- (b)
- The general expression of can be written as with , where is arbitrary.
- (c)
- The general expression of can be written as with , where is arbitrary.
- (d)
- and satisfy
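When the covariance matrix happens to be nonsingular, the BLUE of the mean matrix admits the familiar generalized-least-squares form. The sketch below is an illustration under that extra nonsingularity assumption (not the paper's general projector-based expressions): it checks unbiasedness of the coefficient matrix and the reduction to the OLS projector when the covariance is the identity:

```python
# GLS form of the BLUE of the mean under a nonsingular covariance V:
# coefficient matrix G = X (X' V^{-1} X)^+ X' V^{-1}. V here is an arbitrary
# positive definite example; X is a full-column-rank example design.
import numpy as np

rng = np.random.default_rng(7)
n, p = 8, 3
X = rng.standard_normal((n, p))
V = rng.standard_normal((n, n)); V = V @ V.T + n * np.eye(n)  # positive definite

Vi = np.linalg.inv(V)
G = X @ np.linalg.pinv(X.T @ Vi @ X) @ X.T @ Vi   # coefficient matrix of the BLUE

# Unbiasedness: G X = X, so E(G y) = X beta for every beta.
assert np.allclose(G @ X, X)

# With V = I the same formula reduces to the OLS projector P_X = X X^+.
G_ols = X @ np.linalg.pinv(X.T @ X) @ X.T
assert np.allclose(G_ols, X @ np.linalg.pinv(X))
```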
The BLUEs under the over-parameterized model (2) can be formulated from the standard results on the BLUEs under the true model as follows.
Theorem 2.
In this case, the matrix equation
is consistent, and a BLUE of under (2) is
where is arbitrary. In particular,
where is arbitrary.
Proof.
Note that can be rewritten as under (2). Hence, is estimable under (2) if and only if by Lemma 5, or equivalently,
In addition, note that and by (10). Hence, (65) is further equivalent to , as required for (a). Let in (65). Then, we obtain from (65) that
Hence, (66) is equivalent to as required for (b). Equations (58)–(64) follow from the standard results on BLUEs in Corollary 1. □
The BLUEs of unknown parameter matrices under the transformed model in (3) can be formulated from the standard results on the BLUEs under the true model as follows.
4. The Equivalence Analysis of BLUEs under True and Misspecified Models
Concerning equalities between two linear statistics and , the following three possible situations should be addressed.
Definition 3.
Let be a random matrix.
- (a)
- The equality is said to hold definitely iff .
- (b)
- The equality is said to hold with probability 1 iff both and hold.
- (c)
- and are said to have the same expectation and covariance matrix iff both and hold.
Assume that $\mathbf{\Phi}$ is estimable under (2). Then, it can be seen from Corollary 1(i), Theorem 2(a), and Theorem 3(a) that $\mathbf{\Phi}$ is estimable under (1) and (3) as well. In this case, the BLUE of $\mathbf{\Phi}$ under (1) can be written as (46), while the BLUEs of $\mathbf{\Phi}$ under the misspecified models can be written as (59) and (68), respectively. These three BLUEs are not necessarily the same, and thus it is natural to consider the relations among them in the senses of Definition 3.
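As a concrete, hypothetical illustration of when the BLUEs under the true and over-parameterized models coincide, consider the special case where the added regressors satisfy $\mathscr{R}(\mathbf{Z})\subseteq\mathscr{R}(\mathbf{X})$: the two models then share the same mean space, so the GLS fitted values agree. This special case is only an instance of, not equivalent to, the general rank condition established in Theorem 4 (the sketch also assumes a nonsingular covariance):

```python
# If R(Z) is contained in R(X), models with designs X and [X, Z] have the
# same regression space, so the BLUEs of the mean coincide. All matrices
# are arbitrary examples constructed to satisfy R(Z) ⊆ R(X).
import numpy as np

rng = np.random.default_rng(8)
n, p, q = 9, 3, 2
X = rng.standard_normal((n, p))
Z = X @ rng.standard_normal((p, q))       # forces R(Z) ⊆ R(X)
V = rng.standard_normal((n, n)); V = V @ V.T + n * np.eye(n)  # pos. definite
Vi = np.linalg.inv(V)
y = rng.standard_normal(n)

def blue_of_mean(W):
    # GLS fitted values W (W' V^{-1} W)^+ W' V^{-1} y under design W;
    # this is the oblique projection of y onto R(W), invariant to rank
    # deficiency of W.
    return W @ np.linalg.pinv(W.T @ Vi @ W) @ W.T @ Vi @ y

assert np.allclose(blue_of_mean(X), blue_of_mean(np.hstack([X, Z])))
```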
Theorem 4.
Assume that $\mathbf{\Phi}$ is estimable under (2), and let the three BLUEs of $\mathbf{\Phi}$ under (1)–(3) be as given in (46), (59), and (68), respectively. In addition, let
Then, the following eight statements are equivalent:
- (a)
- holds definitely;
- (b)
- holds definitely;
- (c)
- holds with probability 1;
- (d)
- holds with probability 1;
- (e)
- (f)
- (g)
- (h)
Proof.
Combining (45) and (58), we obtain a new equation for :
This matrix equation has a solution for if and only if
Simplifying both sides by and elementary matrix operations, we obtain
by (11) and , and
Hence, (a) is equivalent to the rank equality in (h).
By Definition 3(a), the equality holds definitely iff the coefficient matrix in (68) satisfies (45),
which is further reduced to
Note that
Then, (73) is simplified by (10), (11), (13), and (14) as
Setting (74) equal to zero, we obtain that (b) is equivalent to the rank equality in (h).
By Definition 3(b), we see that holds with probability 1 if and only if
holds with probability 1 if and only if
The first equalities in (75) and (76) hold naturally. Thus, (c), (d), and (e) are equivalent. Furthermore, the two equalities in (e) are equivalent to
Notice that . Equations (77) and (78) are equivalent to
Applying (14) to the difference on the left-hand side of (79), and simplifying by elementary matrix operations, (11) and (13), we obtain
by (13). Similarly, we can show that
by (13). Setting the right-hand sides of (81) and (82) equal to zero, we obtain the equivalence of (e) and (h).
It follows from (48) and (61) that
Hence,
where a rank formula for the matrix difference in (84) is
see [13]. Substituting (85) into (84) yields
Similarly, we can obtain
Setting the right-hand side equal to zero, we obtain the equivalences of (f), (g), and (h). □
Finally, we give a special case of Theorem 4 for the mean matrix $\mathbf{X}\mathbf{\Theta}$ as follows.
Corollary 2.
Assume that the mean matrix $\mathbf{X}\mathbf{\Theta}$ is estimable under (2). Then, the following statements are equivalent:
- (a)
- holds definitely (with probability 1);
- (b)
- holds definitely (with probability 1);
- (c)
- (d)
- (e)
- (f)
Proof.
Setting $\mathbf{K}=\mathbf{X}$ and $\mathbf{J}=\mathbf{0}$ in Theorem 4(h) and simplifying by (13), we obtain
Hence, in Theorem 4(h) is reduced to the rank formula in (f). □
5. Conclusions
The comparison and equivalence analysis of statistical inference results under true and misspecified linear models can be approached from both theoretical and applied points of view, as illustrated in this article, and many mathematical methods and techniques are available to address problems of this kind under more or less restrictive statistical assumptions. As a concrete topic in this regard, we reconsidered in the preceding sections some equivalence analysis problems under a true multivariate linear model and its two misspecified forms. The key step of this study is to convert the equivalence analysis problems under the three models into certain algebraic matrix equalities or equations, and then to obtain the corresponding results from the three true and misspecified models by means of effective matrix analysis tools, including the matrix equation method and the matrix rank method. Because the conclusions in the preceding sections are all presented through explicit expressions and equalities, we believe that the contributions of this article are easy to understand and can serve as a group of theoretical references for the statistical analysis of various subsequent problems regarding MGLMs. Moreover, since all the formulas and facts in the preceding theorems are represented in analytical form, they can easily be reduced to various specific conclusions when the model matrices and covariance matrices in (1) are given in certain prescribed formulations. For example, let
the covariance matrices $\mathbf{\Sigma}_1$ and $\mathbf{\Sigma}_2$ in (1) take prescribed patterned forms, such as identity matrices, which are regularly assumed in various concrete MGLMs.
We believe that these fruitful studies on the equivalences of BLUPs/BLUEs provide significant advances in the algebraic methodology of the statistical analysis of MGLMs and will bring enabling methodological improvements to the field of multivariate analysis. Finally, we propose a further problem on the comparison and equivalence analysis of statistical inference results under the following two competing constrained MGLMs:
where is a consistent matrix equation for the unknown parameter matrix .
Author Contributions
Conceptualization, B.J.; methodology, Y.T.; investigation, B.J. and Y.T.; writing original draft, B.J.; writing, review and editing, Y.T. All authors have read and agreed to the published version of the manuscript.
Funding
This work was funded by the Shandong Provincial Natural Science Foundation #ZR2019MA065.
Acknowledgments
The authors are grateful to the three referees for their helpful comments on an earlier version of this article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Markiewicz, A.; Puntanen, S. All about the ⊥ with its applications in the linear statistical models. Open Math. 2015, 13, 33–50. [Google Scholar] [CrossRef]
- Puntanen, S.; Styan, G.P.H.; Isotalo, J. Matrix Tricks for Linear Statistical Models; Our Personal Top Twenty; Springer: Berlin, Germany, 2011. [Google Scholar]
- Rao, C.R.; Mitra, S.K. Generalized Inverse of Matrices and Its Applications; Wiley: New York, NY, USA, 1971. [Google Scholar]
- Jiang, B.; Tian, Y. On equivalence of predictors/estimators under a multivariate general linear model with augmentation. J. Korean Stat. Soc. 2017, 46, 551–561. [Google Scholar] [CrossRef]
- Nel, D.G. Tests for equality of parameter matrices in two multivariate linear models. J. Multivar. Anal. 1997, 61, 29–37. [Google Scholar] [CrossRef]
- Gamage, J.; Ananda, M.M.A. An exact test for testing the equality of parameter matrices in two multivariate linear models. Linear Algebra Appl. 2006, 418, 882–885. [Google Scholar] [CrossRef]
- Isotalo, J.; Puntanen, S.; Styan, G.P.H. Effect of adding regressors on the equality of the OLSE and BLUE. Int. J. Stat. Sci. 2010, 6, 193–201. [Google Scholar]
- Jammalamadaka, S.R.; Sengupta, D. Changes in the general linear model: A unified approach. Linear Algebra Appl. 1999, 289, 225–242. [Google Scholar] [CrossRef]
- Jammalamadaka, S.R.; Sengupta, D. Inclusion and exclusion of data or parameters in the general linear model. Stat. Probab. Lett. 2007, 77, 1235–1247. [Google Scholar] [CrossRef]
- Gan, S.; Sun, Y.; Tian, Y. Equivalence of predictors under real and over-parameterized linear models. Commun. Stat. Theor. Meth. 2017, 46, 5368–5383. [Google Scholar] [CrossRef]
- Jun, S.J.; Pinkse, J. Adding regressors to obtain efficiency. Econom. Theory 2009, 25, 298–301. [Google Scholar] [CrossRef][Green Version]
- Li, W.; Tian, Y.; Yuan, R. Statistical analysis of a linear regression model with restrictions and superfluous variables. J. Ind. Manag. Optim. 2023, 19, 3107–3127. [Google Scholar] [CrossRef]
- Lu, C.; Gan, S.; Tian, Y. Some remarks on general linear model with new regressors. Stat. Prob. Lett. 2015, 97, 16–24. [Google Scholar] [CrossRef]
- Magnus, J.R.; Durbin, J. Estimation of regression coefficients of interest when other regression coefficients are of no interest. Econometrica 1999, 67, 639–643. [Google Scholar] [CrossRef]
- Baksalary, J.K. A study of the equivalence between a Gauss–Markoff model and its augmentation by nuisance parameters. Statistics 1984, 15, 3–35. [Google Scholar]
- Bhimasankaram, P.; Jammalamadaka, S.R. Updates of statistics in a general linear model: A statistical interpretation and applications. Commun. Stat. Simul. Comput. 1994, 23, 789–801. [Google Scholar] [CrossRef]
- Haslett, S.J.; Puntanen, S. Effect of adding regressors on the equality of the BLUEs under two linear models. J. Stat. Plann. Inference 2010, 140, 104–110. [Google Scholar] [CrossRef]
- Marsaglia, G.; Styan, G.P.H. Equalities and inequalities for ranks of matrices. Linear Multilinear Algebra 1974, 2, 269–292. [Google Scholar] [CrossRef]
- Tian, Y. Matrix rank and inertia formulas in the analysis of general linear models. Open Math. 2017, 15, 126–150. [Google Scholar] [CrossRef]
- Penrose, R. A generalized inverse for matrices. Proc. Camb. Phil. Soc. 1955, 51, 406–413. [Google Scholar] [CrossRef]
- Tian, Y. A new derivation of BLUPs under random-effects model. Metrika 2015, 78, 905–918. [Google Scholar] [CrossRef]
- Tian, Y. A matrix handling of predictions under a general linear random-effects model with new observations. Electron. J. Linear Algebra 2015, 29, 30–45. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).