Non-Negativity of a Quadratic form with Applications to Panel Data Estimation, Forecasting and Optimization

Pochiraju, Bhimasankaram; Seshadri, Sridhar; Thomakos, Dimitrios D.; Nikolopoulos, Konstantinos

doi:10.3390/stats3030015

Open AccessArticle

Non-Negativity of a Quadratic form with Applications to Panel Data Estimation, Forecasting and Optimization

by

Bhimasankaram Pochiraju

^1,†,

Sridhar Seshadri

²,

Dimitrios D. Thomakos

³ and

Konstantinos Nikolopoulos

^4,*

¹

Indian School of Business, Gachibowli, Hyderabad, Telangana 500032, India

²

Gies College of Business, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA

³

Department of Economics, School of Management and Economics, University of Peloponnese, 22100 Peloponnese, Greece

⁴

Durham University Business School, Durham DH1 3LB, UK

^*

Author to whom correspondence should be addressed.

^†

Bhimasankaram Pochiraju was Professor at the Indian School of Business in India. He passed away on 1 April 2018. He and Sridhar Seshadri worked on this paper when they were at the Indian School of Business.

Stats 2020, 3(3), 185-202; https://doi.org/10.3390/stats3030015

Submission received: 26 May 2020 / Revised: 28 June 2020 / Accepted: 30 June 2020 / Published: 6 July 2020

Download Versions Notes

Abstract

:

For a symmetric matrix B, we determine the class of Q such that

Q^{t} BQ

is non-negative definite and apply it to panel data estimation and forecasting: the Hausman test for testing the endogeneity of the random effects in panel data models. We show that the test can be performed if the estimated error variances in the fixed and random effects models satisfy a specific inequality. If it fails, we discuss the restrictions under which the test can be performed. We show that estimators satisfying the inequality exist. Furthermore, we discuss an application to a constrained quadratic minimization problem with an indefinite objective function.

Keywords:

quadratic form; Non-negativity; Hausman test; optimization

JEL Classification:

C01; C02; C61

1. Introduction

Optimization of quadratic structures and corrections to the construction of covariance matrices has a long history in econometrics and financial economics. From the early stages of simultaneous equations modeling, to testing panel data models, to recent advances in handling volatilities and correlations of financial returns in large systems, to applications in portfolio management, all these issues have contained a component of either quadratic structures and covariance estimation or some form of optimization based on them. The importance of the above in forecasting cannot be overstated as they all relate to decision-making at some future time period: a good panel data model can be used in generating out-of-sample forecasts, a well-constructed covariance matrix can be used in optimizing the weights of a portfolio or for building a model for volatility and correlation forecasting. Although we rarely admit it, most models and mathematical operations are –in the end –geared for forecasting.

In this paper we present a novel mathematical approach for solving a particular class of quadratic optimization problems with applications in econometrics, statistics and portfolio construction. At the center of these mathematical derivations is a symmetric indefinite matrix, in applications a covariance matrix. The indefinite nature of this matrix can come from many sources but the common ground to all is rank deficiency or rank indeterminacy based on redundant information in the variables from which we compute that said matrix: a large number of variables involved or some deficiency in the structure of the underlying problem. The problem of this indeterminacy leads to other problems in different set-ups: in the context of a matrix-version of the well-known Hausman test in econometrics a difference in covariance matrices required for the application of the test might not be positive definite; in the context of a large portfolio optimization the covariance matrix of the financial returns might not be positive definite. Admittedly, there are many solutions that have been proposed for these problems; for example, the Hausman test can be computed with a regression approach so that no matrix inversion is required (or a generalized inverse can be used, as has been done in many parts of the relevant literature); in the context of a rank-deficient covariance there are a number of solutions that have been proposed that correct this. So, what are the new insights that this paper has to offer? First, in the context of the Hausman test the results offer a very clear rule as to when the test (in its matrix form) can be applied. This is useful as a specification tool, and is very simple to compute and report –so when the underlying conditions do not hold for the test to be performed in its matrix form some caution is warranted, both in model specification and model performance. Second, in the context of portfolio optimization the results suggest a particular procedure for optimizing a portfolio given a set of particular constraints without reference to the potential problems of rank indeterminacy of the covariance matrix. This therefore allows for a direct solution to the optimization problem even with sample covariance matrices, computed with a limited number of observations. Finally, it should be noted that the optimization problems discussed in this paper have potential for other applications as well. For example, one might consider the case of a least generalized least squares-like problem for which the covariance matrix has rank indeterminacy because of, say, less observations than variables. In that case, given constraints, the solution proposed here can be utilized. Of course such a model can then be used in forecasting.

2. Notations

In this paper, we obtain the class of all

Q

such that

Q^{t} BQ

is non-negative definite (nnd) where

B

is a given symmetric indefinite matrix (the terms ’non-negative’ and ’nonnegative’ are used interchangeably in the literature). Why is this important? Primarily because there is a very active interest in the topic in the literature recently [1,2,3], but even more importantly due to the vast array of applications in statistics, finance and economics, most notably in panel data econometrics and quadratic optimization, as elaborated in the subsequent sections.

(a) Let

θ

be a parametric vector of interest and

t

be an unbiased estimator of

θ

, the dispersion matrix of which depends on some parameters

γ

. The estimated dispersion matrix

\hat{D} (t)

based on

\hat{γ}

may turn out to be an indefinite matrix. It is of interest to find out the class of all linear parametric functions

Q^{t} θ

for which the estimated dispersion matrix of the unbiased estimator

Q^{t} t

is non-negative definite. Take a specific instance where

\hat{D} (t) = σ^{2} ((1 - \hat{ρ}) I + \hat{ρ} 11^{t})

and let

\hat{ρ} < - \frac{1}{n - 1}

where

t

is of order

n \times 1

. In this case

\hat{D} (t)

is indefinite.

(b) Again, let

θ

be a parametric vector of interest and let

t_{1}

and

t_{2}

be two unbiased estimators of

θ

. We say that

t_{1}

is superior to

t_{2}

if

\hat{D} (t_{2}) - \hat{D} (t_{1})

is non-negative definite or equivalently if

\hat{D} (t_{1})

is below

\hat{D} (t_{2})

under the Löwner order [4]. Suppose neither of

t_{1}

and

t_{2}

is superior to the other. It is of interest to find out sets of linear functions

Q^{t} θ

of

θ

such that

\hat{D} (t_{2}) - \hat{D} (t_{1})

is non-negative definite. For a specific case, we consider the fixed effects and random effects panel data models [5] to examine when the issue of endogeneity in the random effects model can be checked using the Hausman test. We study the following:

(i): Suppose we choose and fix the functional form of the estimators of the variance components. We obtain the class of all regressor matrices for which we can perform the Hausman test.
(ii): Suppose for given data on regressors the difference in the estimated dispersion matrices is indefinite. We obtain the class of linear compounds of the regression coefficient vector for which we can perform the Hausman test.
(iii): We note that there always exists an estimator of the variance of the white noise part of the error in the random effects model for which the difference in the estimated dispersion matrices of the fixed effects and random effects estimators of the regression coefficient vector is indeed non-negative definite. Reference [6]’s estimator is one such estimator of the variance component mentioned above. There can be others.
(iv): We extend the above results to the cases where either the random error or the random effects or both are heteroscadastic.

We are aware that there are alternatives to the traditional Hausman test such as in Reference [7] that incorporates the time-invariant parts of the regressors in the random effects model. However, the most popular test to date is the traditional Hausman test.

(c) Consider the problem of minimizing

x^{t} Bx

subject to

Ax = 0

. Clearly, this is equivalent to the unconstrained minimization problem

u^{t} (I - A^{+} A) B (I - A^{+} A) u

, which has a finite solution if and only if

(I - A^{+} A) B (I - A^{+} A)

is non-negative definite. (

A^{+}

denotes the Moore-Penrose inverse of

A

.) Thus, it is often of interest to explicitly obtain the class of all vectors x such that

x^{t} Bx \geq 0

, where B is an indefinite real symmetric matrix. Unfortunately, this class is not a subspace of

R^{n}

. It is also not a convex set. In this paper, we characterize the class of all matrices Q such that

Q^{t} BQ

is non-negative definite (nnd). We then study the problem of minimization of a quadratic form

x^{t} Bx

subject to

Ax = b

, where

B

is an indefinite matrix and

b

is in the column space of

A

. Given a matrix

A

, we characterize the class of all real symmetric matrices

B

and vectors

b

in the column space of

A

for which the aforementioned problem has a finite solution. It turns out that one of the key conditions for the minimization problem to have a finite solution is the non-negative definiteness of

Q^{t} BQ

for a suitable orthogonal projection matrix

Q

.

In Section 3, we state the results on non-negative definite matrices and generalized inverses which will be needed in the later sections. In Section 4, we obtain necessary and sufficient conditions for

Q^{t} BQ

to be non-negative definite (when B is a symmetric indefinite matrix). Based on this, we develop an algorithm to generate all such matrices Q. We then specialize to the cases where A has (i) just one negative eigenvalue and (ii) just one positive eigenvalue. As a special case of (i), we consider the intraclass correlation matrix which comes naturally as the dispersion matrix in random effects models. In Section 6, we study, in detail, the issues related to performing Hausman test mentioned in (b) above. In Section 7, we show that the problem of finding the class of all matrices

Q

such that

Q^{t} BQ

is nnd is equivalent to the solution of the quadratic optimization problem: Minimize

x^{t} Bx

subject to

Ax = 0

varying over matrices

A

. As we shall show, the connection between

Q

and

A

comes through the relationship, null space of

A

being equal to the column space of

Q

. Given

A

, we then determine the class of all matrices

Q

such that

N (A)

=

C (Q)

. Likewise, given

Q

, we also determine the class of all matrices

A

such that

N (A)

=

C (Q)

. In Section 8, we study, in some detail, the constrained optimization problem of minimizing

x^{t} Bx

subject to

Ax = b

where

b \in C (A)

and B is a symmetric indefinite matrix. We consider two cases: (i) the problem has a solution for some non-null vector

b \in C (A)

and (ii) the problem has a solution for every non-null

b \in C (A)

. Finally, Section 9 concludes.

We use real vectors and matrices in this paper and use the following notations. For a matrix A,

ρ (A)

, tr(A),

C (A)

,

N (A)

,

A^{t}

,

A^{-}

,

A^{+}

,

P_{A}

denote respectively the rank, trace, column space, null space, transpose, generalized inverse, Moore-Penrose inverse and orthogonal projector into the column space of a matrix A. For a positive integer r,

1_{r}

denotes a column vector with r components where each component is 1. Further,

{\bar{J}}_{r}

denotes the matrix of order

r \times r

each element of which is

\frac{1}{r}

. Clearly

{\bar{J}}_{r}

=

P_{1_{r}}

. The orthogonal projector

I_{r} - {\bar{J}}_{r}

is denoted by

E_{r}

. For matrices

A

and

B

,

A \otimes B

denotes the Kronecker product defined as

((a_{i j} B))

. A symmetric matrix A is said to be non-negative definite (nnd) if

x^{t} Ax \geq 0

for all vectors x. The symbol diag(A,B,C) denotes a block diagonal matrix

(\begin{matrix} A & 0 & 0 \\ 0 & B & 0 \\ 0 & 0 & C \end{matrix})

. For a random vector

ϵ

,

E (ϵ)

and

D (ϵ)

denote the expectation vector and the dispersion matrix of

ϵ

. Also

c o v (α, ξ)

denotes the covariance matrix of

α

with

ξ

.

3. Preliminaries

In this section, we provide a few results which are well-known and which will be used in the later sections of this paper.

Lemma 1.

Let

M = (\begin{matrix} P & Q \\ Q^{t} & S \end{matrix})

be a real symmetric matrix where

P

and

S

are real symmetric matrices. ThenM is nnd if and only if

(i): $P$ is nnd,
(ii): $C (Q) \subseteq C (P)$ , and
(iii): $S$ - $Q^{t} P^{-} Q$ is nnd.

(In view of (ii),

Q^{t} P^{-} Q

is invariant under choices of generalized inverses of P.)

The following lemma is well known. For a proof, please see Reference [8].

Lemma 2.

Let

A a n d B

be matrices of order

m \times n

. Then

{AA}^{t} = {BB}^{t}

if and only if

A = BT

, where

T

is an orthogonal matrix.

For the proofs of the following Lemmas 3–5, please see Rao and Mitra, 1971.

Lemma 3.

LetAbe a matrix of order

m \times n

. Let

b \in C (A)

. LetGbe a generalized inverse of

A

. Then

(i): $N (A) = C (I - GA)$ .
(ii): The class of all solutions to $Ax = b$ is given by $Gb + (I - GA) ζ$ , where ζ is arbitrary.

Lemma 4.

Let

A

be an

m \times n

matrix of rank r

(> 0)

. Let

A = U (\begin{matrix} Δ & 0 \\ 0 & 0 \end{matrix}) V^{t}

be a singular value decomposition of

A

where

U

and

V

are orthogonal matrices and Δ is a positive definite (pd) diagonal matrix of order

r \times r

. Then the class of the generalized inverses of

A

is given by

V (\begin{matrix} Δ^{- 1} & L \\ M & N \end{matrix}) U^{t}

where

L

,

M

,

N

are arbitrary. In particular, the Moore-Penrose inverse

A^{+}

of

A

is given by

V (\begin{matrix} Δ^{- 1} & 0 \\ 0 & 0 \end{matrix}) U^{t}

.

Lemma 5.

Let

C

and

D

be nnd matrices of the same order. Then there exists a nonsingular matrix

T

such that

T^{t} CT

and

T^{t} DT

are diagonal matrices.

The following lemma on quadratic optimization is well-known.

Lemma 6.

Let

C

be a real-symmetric matrix of order

n \times n

, then the function f

(x) = \frac{1}{2} x^{t} Cx - d^{t} x

has a minimum value if and only if

C

is nnd and

d \in C (C)

, in which case the minimum value is given by

- \frac{1}{2} d^{t} C^{+} d

. Furthermore, if

C = S^{t} (\begin{matrix} Λ & 0 \\ 0 & 0 \end{matrix}) S

is a spectral decomposition of

C

, where

S

is orthogonal and Λ is diagonal positive definite matrix of order

r \times r

, then the optimal value is achieved by all vectors

x

of the form,

x = C^{+} d + S^{t} (\begin{matrix} 0 \\ z \end{matrix})

for any

z \in R^{n - r}

.

The following result is well-known for researchers in parallel sums of matrices and shorted operators. For a proof, see Reference [4].

Lemma 7.

Let

A

and

B

be nnd matrices of the same order. Then

C (A) \cap C (B) = C (A {(A + B)}^{-} B)

, where

{(A + B)}^{-}

is any generalized inverse of

A + B

.

The following lemmas are well known. For a proof, please see Reference Rao and Bhimasankaram [8].

Lemma 8.

Let

A

and

B

be positive definite matrices of the same order. Then

B^{- 1} - A^{- 1}

is nnd if and only if

A - B

is nnd.

Lemma 9.

Let

A

be a nonsingular matrix of order

n \times n

and letuandvbe vectors of order

n \times 1

. Then

A + {uv}^{t}

is nonsingular if and only if 1 +

v^{t} A^{- 1} u

is not equal to 0. Also if 1 +

v^{t} A^{- 1} u

is not equal to 0, then

{(A + {uv}^{t})}^{- 1}

=

A^{- 1}

-

A^{- 1} {uv}^{t} A^{- 1}

/ (1 +

v^{t} A^{- 1} u)

.

Lemma 10.

Let

A

and

D

be nonsingular matrices of orders

n \times n

and

r \times r

respectively and let

B

and

C

are matrices of orders

n \times r

and

r \times n

respectively. Then

A + BDC

is nonsingular if and only if

W = D^{- 1} + {CA}^{- 1} B

is nonsingular. Also if

W

is nonsingular, then

{(A + BDC)}^{- 1} = A^{- 1} - A^{- 1} B {(D^{- 1} + {CA}^{- 1} B)}^{- 1} {CA}^{- 1}

.

Lemma 11.

Let

A

and

B

be matrices of orders

m \times n

and

n \times m

respectively Then the non-null eigenvalues of

AB

and

BA

are identical.

Lemma 12.

Let

A_{1}, \dots, A_{k}

be commuting real symmetric matrices of the same order. Then they have a simultaneous spectral decomposition.

4. Non-Negative Definiteness of $Q^{t} BQ$

Let B be a real symmetric indefinite matrix of order

n \times n

and let

Q

be a matrix with n rows. In this section, we investigate the conditions under which

Q^{t} BQ

is non-negative definite. We shall also give a method of constructing all such matrices Q.

Let B be a real symmetric indefinite matrix of order

n \times n

. Let a spectral decomposition of B be given by

B = (P_{1} : P_{2} : P_{3}) diag (Δ_{1}, - Δ_{2}, 0) {(P_{1} : P_{2} : P_{3})}^{t},

where

Δ_{i}

is a positive definite diagonal matrix of order

r_{i} \times r_{i}

, i = 1, 2, and

P = (P_{1} : P_{2} : P_{3})

is an orthogonal matrix,

P_{i}

being a matrix of

n \times r_{i}

, i = 1, 2, 3 such that

r_{1} + r_{2} + r_{3} = n

.

We prove

Theorem 1.

Let

B, P

,

Δ_{1}

, and

Δ_{2}

be as specified above. LetQbe a matrix of order

n \times s

. Write

R_{i} = Q^{t} P_{i}

, i = 1,2,3 and

R = (R_{1} : R_{2} : R_{3})

. Let

ρ (R_{i}) = w_{i}

, i = 1,2. Then

Q^{t} BQ

is nnd if and only if there exists a matrixLwith number of columns equal to

w_{1}

such that

{LL}^{t} = R_{1} Δ_{1} R_{1}^{t}

and

L Λ L^{t} = R_{2} Δ_{2} R_{2}^{t},

where Λ is a diagonal nnd matrix with exactly

w_{2}

diagonal elements in (0,1] and the rest are equal to 0.

Proof.

Notice that

Q = {PR}^{t}

and

Q^{t} BQ = {RP}^{t} P diag (Δ_{1} : - Δ_{2} : 0) P^{t} {PR}^{t} = R_{1} Δ_{1} R_{1}^{t} - R_{2} Δ_{2} R_{2}^{t}

.

‘If part’:

Q^{t} BQ = R_{1} Δ_{1} R_{1}^{t} - R_{2} Δ_{2} R_{2}^{t} = {LL}^{t} - L Λ L^{t} = L (I - Λ) L^{t}

is nnd since

I - Λ

is nnd.

‘Only if’ part:

Notice that

R_{1} Δ_{1} R_{1}^{t}

and

R_{2} Δ_{2} R_{2}^{t}

are both nnd. Since

Q^{t} BQ = R_{1} Δ_{1} R_{1}^{t} - R_{2} Δ_{2} R_{2}^{t}

is nnd, we have

C (R_{2}) \subseteq C (R_{1})

. By Lemma 5, there exists a nonsingular matrix T such that

R_{1} Δ_{1} R_{1}^{t} = T (\begin{matrix} Γ & 0 \\ 0 & 0 \end{matrix}) T^{t}

and

R_{2} Δ_{2} R_{2}^{t} = T (\begin{matrix} Ω_{1} & 0 \\ 0 & Ω_{2} \end{matrix}) T^{t}

where

Γ

is a positive definite diagonal matrix of order

ω_{1} \times ω_{1}

and diag (

Ω_{1}, Ω_{2}

) is a diagonal nnd matrix of rank

ω_{2}

(

Ω_{1}

is of order

ω_{1} \times ω_{1}

).

R_{1} Δ_{1} R_{1}^{t} - R_{2} Δ_{2} R_{2}^{t}

is nnd

\Leftrightarrow T (\begin{matrix} Γ & 0 \\ 0 & 0 \end{matrix}) T^{t} - T (\begin{matrix} Ω_{1} & 0 \\ 0 & Ω_{2} \end{matrix}) T^{t}

is nnd

\Leftrightarrow Γ - Ω_{1}

is nnd and

Ω_{2} = 0

. Writing

T = (T_{1} : T_{2})

where

T_{1}

has

w_{1}

columns, we have,

R_{1} Δ_{1} R_{1}^{t} = T_{1} Γ T_{1}^{t}

and

R_{2} Δ_{2} R_{2}^{t} = T_{1} Ω_{1} T_{1}^{t}

Writing

L = T_{1} Γ^{\frac{1}{2}}

, we have

R_{1} Δ_{1} R_{1}^{t} = {LL}^{t}

and

R_{2} Δ_{2} R_{2}^{t} = L Λ L^{t},

where

Λ = Γ^{- \frac{1}{2}} Ω_{1} {(Γ^{- \frac{1}{2}})}^{t}

. Further, since

Γ - Ω_{1}

is nnd, so is

I - Λ = Γ^{- \frac{1}{2}} (Γ - Ω_{1}) {(Γ^{- \frac{1}{2}})}^{t}

. Clearly, all diagonal elements of

Λ

are in [0, 1]. Since,

æ (Λ) = æ (Ω_{1}) = æ (R_{2}) = ω_{2}

, exactly

ω_{2}

diagonal elements of

Λ

lie in the interval (0, 1]. Q.E.D. □

Given a real symmetric matrix B, we now give a method of generating all matrices Q such that

Q^{t} BQ

is nnd. Let B be as specified just before Theorem 1. Clearly,

ω_{2} \leq ω_{1}

. Also,

ω_{2} \leq ρ (Δ_{2})

. Thus,

ω_{2} \leq l = m i n {ω_{1}, ρ (Δ_{2})}

.

We now prove that Algorithm 1 yields the class of all Q such that

Q^{t} BQ

is nnd. First, notice that for each Q obtained through Algorithm 1,

R_{1} Δ_{1} R_{1}^{t} = {LL}^{t}

and

\begin{matrix} R_{2} Δ_{2} R_{2}^{t} & = {LVS}_{1} (\begin{matrix} D_{1}^{\frac{1}{2}} & 0 \\ 0 & 0 \end{matrix}) U Δ_{2}^{- \frac{1}{2}} Δ_{2} Δ_{2}^{- \frac{1}{2}} U^{t} (\begin{matrix} D_{1}^{\frac{1}{2}} & 0 \\ 0 & 0 \end{matrix}) S_{1}^{t} V^{t} L^{t} \\ = {LVS}_{1} (\begin{matrix} D_{1} & 0 \\ 0 & 0 \end{matrix}) S_{1}^{t} V_{1}^{t} L^{t} \\ = L Λ L^{t}, \end{matrix}

where

Λ

is 0 or is orthogonally similar to a diagonal nnd matrix with at most

l = m i n {w_{1}, ρ (Δ_{2})}

diagonal elements in (0,1]. Hence, by Theorem 1,

Q^{t} BQ

is nnd.

Next, let

Q^{t} BQ

be nnd. Then, there exists

R_{1}, R_{2}

such that

Q^{t} BQ = R_{1} Δ_{1} R_{1}^{t} - R_{2} Δ_{2} R_{2}^{t}

and M with rank

ρ (R_{1})

and

Λ

with rank

ρ (R_{2})

and

Λ

such that

{MM}^{t} = R_{1} Δ_{1} R_{1}^{t}

and

M Λ M^{t} = R_{2} Δ_{2} R_{2}^{t}

where

Λ

and

I - Λ

are nnd.

Therefore, by Lemma 2,

R_{1} Δ_{1}^{\frac{1}{2}} = {MV}^{t}

for some orthogonal matrix V. Without loss of generality, define

L = {MV}^{t}

.

LV Λ V^{t} L^{t} = R_{2} Δ_{2} R_{2}^{t}

or

LVB (\begin{matrix} D & 0 \\ 0 & 0 \end{matrix}) S^{t} V^{t} L^{t} = R_{2} Δ_{2} R_{2}^{t}

where S is a permutation matrix. So,

R_{2} Δ_{2} R_{2}^{t} = {LVS}_{1} (\begin{matrix} D & 0 \\ 0 & 0 \end{matrix}) S_{1}^{t} V^{t} L^{t},

where

S_{1}

is a semi-permutation matrix with

ρ (Δ_{2})

columns. By Lemma 2,

R_{2} Δ_{2}^{\frac{1}{2}} = {LVS}_{1} (\begin{matrix} D^{\frac{1}{2}} & 0 \\ 0 & 0 \end{matrix}) U

where U is an orthogonal matrix. Q.E.D.

Algorithm 1 demonstrates how to construct the class of all Q such that

Q^{t} BQ

is nnd in an organized manner. However, it is clear that even when

R_{1}

is fixed, the class of all Q such that

Q^{t} BQ

is nnd is neither a subspace nor a convex set.

Algorithm 1: The Pochiraju algorithm.

Step 1: Choose

R_{1}

and

R_{3}

arbitrarily. (Once

R_{1}

and

R_{3}

are chosen and fixed, their ranks

ω_{1}

and

ω_{3}

automatically get fixed.)

Step 2: Construct

L = R_{1} Δ_{1}^{\frac{1}{2}}

.

Step 3: Choose

ω_{2}

arbitrarily such that

0 \leq ω_{2} \leq l = m i n {ω_{1}, ρ (Δ_{2})}

.

Step 4: Choose D = diag

(d_{1}, d_{2}, \dots, d_{ω_{2}}, 0, \dots, 0) = d i a g (D_{1} : 0)

where

d_{i}

is an arbitrary number in (0,1].

Step 5: Construct

R_{2} = R_{1} Δ_{1}^{\frac{1}{2}} T Δ_{2}^{- \frac{1}{2}}

where

T

is an arbitrary matrix of rank

ω_{2}

with singular values in [0,1]. (This is actually achieved as follows: Choose

S_{1}

to be an arbitrary semi-permutation matrix of order

ω_{1} \times ρ (Δ_{2})

,

U

be an arbitrary orthogonal matrix of order

ρ (Δ_{2}) \times ρ (Δ_{2})

and construct

R_{2} = {LVS}_{1} (\begin{matrix} D_{1}^{\frac{1}{2}} & 0 \\ 0 & 0 \end{matrix}) U Δ_{2}^{- \frac{1}{2}}

, where V is an orthogonal matrix.)

Step 6: Construct

Q = {PR}^{t}

where

R = (R_{1} : R_{2} : R_{3})

.

We now consider two special cases where the construction of the class of all Q such that

Q^{t} BQ

is nnd becomes simple: (i) B has just one negative eigenvalue and (ii) B has just one positive eigenvalue.

Case (i): B has just one negative eigenvalue.

Choose

R_{1}

and

R_{3}

arbitrarily. Let

{LL}^{t} = R_{1} Δ_{1} R_{1}^{t}

where the number of columns in L is

ρ (R_{1})

. Since there is only one negative eigenvalue, let us denote it by

- δ_{2}

and the corresponding matrix

R_{2}

by the vector

r_{2}

. Now

L Λ L^{t} = δ_{2} r_{2} r_{2}^{t}

. Clearly,

Λ

has exactly one nonzero (positive) diagonal element (say

λ

) which can appear in any of the diagonal elements. So

L Λ L^{t} = λ l_{i} l_{i}^{t}

where

0 \leq λ \leq 1

. So

r_{2} = \sqrt{\frac{λ}{δ_{2}}} l_{i}

. The class of all

r_{2}

is obtained by choosing

λ

arbitrarily such that

0 \leq λ \leq 1

and an arbitrary column (say ith column)

l_{i}

of L and constructing

r_{2} = \sqrt{\frac{λ}{δ_{2}}} l_{i}

. Then

Q = P (\begin{matrix} R_{1}^{t} \\ r_{2}^{t} \\ R_{3}^{t} \end{matrix})

.

As a special instance, we consider the estimated intraclass correlation matrix B where the estimated intraclass correlation coefficient

\hat{ρ} < \frac{- 1}{n - 1}

. We now obtain the class of all Q such that

Q^{t} BQ

is nnd.

Here

Δ_{1} = (1 - \hat{ρ}) I

,

Δ_{2} = - (1 + (n - 1) \hat{ρ})

,

P_{1} P_{1}^{t} = I - \frac{11^{t}}{n}

and

P_{2} P_{2}^{t} = \frac{11^{t}}{n}

and

P_{3}

does not exist since there is no zero eigenvalue.

Choose

R_{1}

arbitrarily and let L be a matrix of maximum column rank such that

{LL}^{t} = (1 - \hat{ρ}) R_{1} R_{1}^{t}

. Choose a column (arbitrarily), say,

l_{i}

and a number

λ

in the interval (0,1] (arbitrarily). Construct

r_{2} = \sqrt{\frac{λ}{- (1 + (n - 1) \hat{ρ})}} l_{i}

. Construct

Q = (P_{1} : P_{2}) (\begin{matrix} R_{1}^{t} \\ r_{2}^{t} \end{matrix})

. These are all the matrices Q such that

Q^{t} BQ

is nnd.

Case (ii): B has just one positive eigenvalue.

Let

δ_{1}

be the positive eigenvalue. Since

R_{1}

has just one column, we denote it by

r_{1}

. Choose

r_{1}

and

R_{3}

arbitrarily. Denote L by l since L has only one column. Then

{ll}^{t} = δ_{1} r_{1} r_{1}^{t}

. So,

l = \sqrt{δ_{1}} r_{1}

.

Since

Λ

is a

1 \times 1

matrix, we denote it by

λ

. As per Theorem 1,

0 \leq λ \leq 1

. Choose and fix

λ

such that

0 \leq λ \leq 1

. Then by Theorem 1

λ {ll}^{t} = R_{2} Δ_{2} R_{2}^{t} .

Notice that

ρ (R_{2}) = ρ (R_{2} Δ_{2} R_{2}^{t}) = ρ (λ {ll}^{t}) = ρ (l) = 1 .

Write

R_{2} = {uv}^{t}

, where u and v are column vectors.

λ {ll}^{t} = v^{t} Δ_{2} {vuu}^{t} .

Clearly, u is a scalar multiple of l. Choose v arbitrarily, and then

u = \sqrt{\frac{λ}{v^{t} Δ_{2} v}} l

. Construct

R_{2} = {uv}^{t}

and

Q = P (\begin{matrix} r_{1}^{t} \\ R_{2}^{t} \\ R_{3}^{t} \end{matrix})

.

It may be noted that even in these two simple cases, the class of all Q such that

Q^{t} BQ

is nnd is a complex structure. (Neither of them is an affine space).

5. Remarks

In Theorem 1, We have obtained a solution to the following problem in matrix partial orders:

Suppose two real symmetric matrices

C

and

D

are not related by Lowner order. What is the class of all matrices

Q

such that

Q^{t} CQ

is below

Q^{t} DQ

under the Lowner order?

Comparison of the estimators of vector valued parameters is quite common in sample surveys where no estimator is uniformly superior to the others as the difference in the estimated dispersion matrices of the estimators, say,

Δ

is indefinite (for details, see Section 6.1 of Reference [9]). The results this Section help in identifying the subsets of linear functions of such parameters for which one estimator is superior to the other by finding the class of all

Q

for which

Q^{t} Δ Q

is nnd.

6. Hausman Test

The usual Hausman Test—in order to test for endogeneity in the random effects model [5], cannot be performed if the difference between the estimated dispersion matrices of the regression coefficient estimators in the fixed effects and random effects models (with homoscedastic structures for the error in the fixed effects model and for the random error and also for the random effects in the random effects model) denoted by

Π

is not non-negative definite. In this section, we study the difference matrix

Π

in detail. Since we do not know the regressor matrix at the design stage, we first study when

Π

is nnd for every choice of the regressor matrix. It turns out that

Π

is nnd for all regressor matrices X if and only if the estimated variance of the error in the fixed effects model is at least as big as the estimator of the variance component of the random noise part in the error in the random effects model. When the difference in the estimated dispersion matrices of the errors in the fixed effects and the random effects models is not nnd, using Algorithm 1, we obtain explicitly the class of all regressor matrices X for which

Π

is nnd. Owing to this structure, we show that when the number of regressors is larger than the number of individuals,

Π

cannot be non-negative definite. Finally, for a given regressor matrix X, if

Π

is not nnd, we find an explicit expression for the class of all linear functions of regression coefficients for which the Hausman Test can be performed. We note that Reference [6]’s estimator of the variance component of the random noise part satisfies the property that the estimated variance of the error in the fixed effects model is at least as big as the estimator of the variance component of the random noise part in the error in the random effects model. Thus, with this choice of the estimators of the variance components, Hausman test can be performed for all regressor matrices X. We observe that we can always get estimators of variance components such that the difference in the dispersion matrices of the error structures in the fixed and random effects models is non-negative definite. Finally, we show that for a suitable choice of the variance component estimators, the difference between the estimated dispersion matrices of the regression coefficient estimators in the fixed effects and random effects models is non-negative definite even when there is heteroscedasticity in the random effects or the random error or both.

We introduce briefly the homoscedastic fixed and random effects panel data models. For details please see References [5,7]. Consider a balanced panel data

(y_{i t}, x_{i t}^{t}), t = 1, \dots, T; i = i, \dots, N

where

y_{i t}

is the response and

x_{i t}^{t}

is a

1 \times k

vector of regressor values on k regressors for the

i^{t h}

individual at time point t. Denote

Y_{i} = {(y_{i 1}, \dots, y_{i T})}^{t}

,

X_{i} = {(x_{i 1}, \dots, x_{i T})}^{t}

, and

X = {(X_{1}^{t}, \dots, X_{N}^{t})}^{t}

. Let

1

denote a column vector of appropriate order where each component is 1. Denote

F = d i a g (1, \dots, 1)

where

1

is of order

T \times 1

.

The fixed effects specification is given by

Y = F α + X β + ϵ

where

α = {(α_{1}, \dots, α_{N})}^{t}

is the vector of fixed effects (treated as non-stochastic),

β = {(β_{1}, \dots, β_{k})}^{t}

is the vector of regression coefficients (also treated as nonstochastic), and

ϵ

is a random error vector of order

N T \times 1

with

E (ϵ) = 0

and

D (ϵ) = σ_{F}^{2} I

. (In the fixed effects model, it is assumed that the observational errors are all uncorrelated and have the same variance, denoted by

σ_{F}^{2}

.)

The random effects specification is given by

Y = 1_{N T} μ + F α + X β + ξ

where

α, β, ξ

are as specified in the fixed effects model except that

α

is treated as random with

E (α) = 0

and

D (α) = σ_{α}^{2} I

,

c o v (α, ξ) = 0

and

D (ξ) = σ_{ξ}^{2} I

. We shall denote

σ_{1}^{2} = σ_{ξ}^{2} + T σ_{α}^{2} .

If we denote

η = F α + ξ

in the random effects model, we get

D (η) = Ω

=

σ_{α}^{2} (I_{N} \otimes T {\bar{J}}_{T}) + σ_{ξ}^{2} (I_{N} \otimes I_{T})

=

σ_{1}^{2} (I_{N} \otimes {\bar{J}}_{T}) + σ_{ξ}^{2} (I_{N} \otimes E_{T})

.

The usual fixed and random effects estimators of

β

, denoted by

\hat{β_{F}}

and

\hat{β_{R}}

are given by

\begin{matrix} \hat{β_{F}} = {(X^{t} (I_{N} \otimes E_{T}) X)}^{- 1} X^{t} (I_{N} \otimes E_{T}) Y \\ a n d \\ \hat{β_{R}} = {(X^{t} E_{1_{N T}} {(E_{1_{N T}} Ω E_{1_{N T}})}^{-} E_{1_{N T}} X)}^{- 1} X^{t} E_{1_{N T}} {(E_{1_{N T}} Ω E_{1_{N T}})}^{-} E_{1_{N T}} Y . \end{matrix}

Also,

\begin{matrix} D (\hat{β_{F}}) = {(X^{t} (I_{N} \otimes E_{T}) X)}^{- 1} σ_{F}^{2} \\ a n d \\ D (\hat{β_{R}}) = {(X^{t} E_{1_{N T}} {(E_{1_{N T}} Ω E_{1_{N T}})}^{-} E_{1_{N T}} X)}^{- 1} . \end{matrix}

Let

s_{F}^{2}, s_{α}^{2}, s_{ξ}^{2}, s_{1}^{2} (= s_{ξ}^{2} + T s_{α}^{2})

denote the estimators of

σ_{F}^{2}, σ_{α}^{2}, σ_{ξ}^{2}, σ_{1}^{2}

respectively and let

\hat{D} (\hat{β_{F}})

and

\hat{D} (\hat{β_{R}})

denote the estimators of

D (\hat{β_{F}})

and

D (\hat{β_{R}})

where

σ_{F}^{2}, σ_{α}^{2}, σ_{ξ}^{2}, σ_{1}^{2}

are replaced by their estimators

s_{F}^{2}, s_{α}^{2}, s_{ξ}^{2}, s_{1}^{2}

respectively. (

Ω

is replaced by

\hat{Ω}

obtained by plugging in the estimators of the variance components).

As a step towards checking when

\hat{D} (\hat{β_{F}}) - \hat{D} (\hat{β_{R}})

is nnd for all

X

, we obtain the spectral decomposition of

E_{1_{N T}} {(E_{1_{N T}} Ω E_{1_{N T}})}^{-} E_{1_{N T}}

in the following Lemma.

Lemma 13.

The spectral decomposition of

E_{1_{N T}} {(E_{1_{N T}} \hat{Ω} E_{1_{N T}})}^{-} E_{1_{N T}}

is given by

\frac{1}{s_{1}^{2}} (E_{N} \otimes {\bar{J}}_{T}) + \frac{1}{s_{ξ}^{2}} (I_{N} \otimes E_{T})

.

Proof.

Let us start with simplifying

(E_{1_{N T}} \hat{Ω} E_{1_{N T}})

.

First,

E_{1_{N T}} \hat{Ω} = \{(I_{N} \otimes I_{T}) - ({\bar{J}}_{N} \otimes {\bar{J}}_{T})\} \{s_{1}^{2} (I_{N} \otimes {\bar{J}}_{T}) + s_{ξ}^{2} (I_{N} \otimes E_{T})\}

=

s_{1}^{2} (I_{N} \otimes {\bar{J}}_{T}) + s_{ξ}^{2} (I_{N} \otimes E_{T}) - s_{1}^{2} ({\bar{J}}_{N} \otimes {\bar{J}}_{T})

,

(since,

({\bar{J}}_{N} \otimes {\bar{J}}_{T}) (I_{N} \otimes E_{T}) = {\bar{J}}_{N} \otimes {\bar{J}}_{T} E_{T} = 0

)

=

s_{1}^{2} (E_{N} \otimes {\bar{J}}_{T}) + s_{ξ}^{2} (I_{N} \otimes E_{T})

. Now,

\begin{matrix} E_{1_{N T}} \hat{Ω} E_{1_{N T}} = \{s_{1}^{2} (E_{N} \otimes {\bar{J}}_{T}) + s_{ξ}^{2} (I_{N} \otimes E_{T})\} \{(I_{N} \otimes I_{T}) - ({\bar{J}}_{N} \otimes {\bar{J}}_{T})\} \\ = s_{1}^{2} (E_{N} \otimes {\bar{J}}_{T}) + s_{ξ}^{2} (I_{N} \otimes E_{T}) . \end{matrix}

(1)

□

Notice that,

E_{N} \otimes {\bar{J}}_{T}

and

I_{N} \otimes E_{T}

are both orthogonal projectors and their product is

0

. Hence, (1) is the spectral decomposition of

E_{1_{N T}} \hat{Ω} E_{1_{N T}}

. One generalized inverse (in fact, the Moore-Penrose inverse) of

E_{1_{N T}} \hat{Ω} E_{1_{N T}}

is

\frac{1}{s_{1}^{2}} (E_{N} \otimes {\bar{J}}_{T}) + \frac{1}{s_{ξ}^{2}} (I_{N} \otimes E_{T})

.

Further, since

\hat{Ω}

is positive definite with probability 1,

C E_{1_{N T}} \subseteq C E_{1_{N T}} \hat{Ω} E_{1_{N T}})

with probability 1. Therefore,

E_{1_{N T}} {\{E_{1_{N T}} \hat{Ω} E_{1_{N T}}\}}^{-} E_{1_{N T}}

is invariant under the choices of generalized inverses of

E_{1_{N T}} \hat{Ω} E_{1_{N T}}

.

Now,

E_{1_{N T}} {\{E_{1_{N T}} \hat{Ω} E_{1_{N T}}\}}^{-} E_{1_{N T}}

=

\{(I_{N} \otimes I_{T}) - ({\bar{J}}_{N} \otimes {\bar{J}}_{T})\} \{\frac{1}{s_{1}^{2}} (E_{N} \otimes {\bar{J}}_{T}) + \frac{1}{s_{ξ}^{2}} (I_{N} \otimes E_{T})\} \{(I_{N} \otimes I_{T}) - ({\bar{J}}_{N} \otimes {\bar{J}}_{T})\}

=

\{\frac{1}{s_{1}^{2}} (E_{N} \otimes {\bar{J}}_{T}) + \frac{1}{s_{ξ}^{2}} (I_{N} \otimes E_{T})\} \{(I_{N} \otimes I_{T}) - ({\bar{J}}_{N} \otimes {\bar{J}}_{T})\}

=

\frac{1}{s_{1}^{2}} (E_{N} \otimes {\bar{J}}_{T}) + \frac{1}{s_{ξ}^{2}} (I_{N} \otimes E_{T})

. Q.E.D.

We are now ready to prove

Theorem 2.

The difference in the estimated dispersion matrices,

Π = \hat{D} (\hat{β_{F}}) - \hat{D} (\hat{β_{R}})

is nnd for all

X

if and only if

s_{F}^{2} \geq s_{ξ}^{2}

.

Proof.

\hat{D} (\hat{β_{F}}) - \hat{D} (\hat{β_{R}})

is nnd, if and only if

{[X^{t} (I_{N} \otimes E_{T}) X]}^{- 1} s_{F}^{2} - {[X^{t} E_{1_{N T}} {[E_{1_{N T}} \hat{Ω} E_{1_{N T}}]}^{- 1} E_{1_{N T}} X]}^{- 1}

is nnd

\Leftrightarrow X^{t} E_{1_{N T}} {[E_{1_{N T}} \hat{Ω} E_{1_{N T}}]}^{- 1} E_{1_{N T}} X - X^{t} (I_{N} \otimes E_{T}) X \frac{1}{s_{F}^{2}}

is nnd.

But,

E_{1_{N T}} {[E_{1_{N T}} \hat{Ω} E_{1_{N T}}]}^{- 1} E_{1_{N T}} - (I_{N} \otimes E_{T}) \frac{1}{s_{F}^{2}} = \frac{1}{s_{1}^{2}} (E_{N} \otimes {\bar{J}}_{T}) + \frac{1}{s_{ξ}^{2}} (I_{N} \otimes E_{T}) - \frac{1}{s_{F}^{2}} (I_{N} \otimes E_{T}) = \frac{1}{s_{1}^{2}} (E_{N} \otimes {\bar{J}}_{T}) + (\frac{1}{s_{ξ}^{2}} - \frac{1}{s_{F}^{2}}) (I_{N} \otimes E_{T})

(In fact, this is the spectral decomposition.) which is nnd if and only if

\frac{1}{s_{ξ}^{2}} - \frac{1}{s_{F}^{2}} \geq 0

or

s_{F}^{2} \geq s_{ξ}^{2} .

Q.E.D.

If a computed estimator

s_{ξ}^{2}

is larger than

s_{F}^{2}

, it is clear from Theorem 2 that

Π = \hat{D} (\hat{β_{F}}) - \hat{D} (\hat{β_{R}})

is not nnd at least for some

X

. We now determine the class of all

X

for which

Π

is nnd, so that the Hausman test can be performed for the entire

β

vector. Towards this end, as we already noted in the proof of Theorem 2, the spectral decomposition of

S

=

E_{1_{N T}} {[E_{1_{N T}} \hat{Ω} E_{1_{N T}}]}^{- 1} E_{1_{N T}} - (I_{N} \otimes E_{T}) \frac{1}{s_{F}^{2}}

is given by

\frac{1}{s_{1}^{2}} (E_{N} \otimes {\bar{J}}_{T}) + (\frac{1}{s_{ξ}^{2}} - \frac{1}{s_{F}^{2}}) (I_{N} \otimes E_{T})

. From the spectral decomposition of

S

it is clear that the distinct eigen-values of

S

are

\frac{1}{s_{1}^{2}}

,

\frac{1}{s_{ξ}^{2}} - \frac{1}{s_{F}^{2}}

and 0 with algebraic multiplicities

N - 1

,

N T - N

and 1 respectively. Now we can use Algorithm 1 to obtain the class of all

X

for which

Π

is nnd. □

We prove

Theorem 3.

Let

(C_{l}, C_{l}^{t})

be a rank factorization of

E_{l}

where l is a positive integer. Let

s_{ξ}^{2} > s_{F}^{2}

. Then the class of all

X

for which Π is nnd is given by

X

=

\frac{1}{\sqrt{T}} (C_{N} \otimes 1_{T}) R_{1}^{t} + (I_{N} \otimes C_{T}) R_{2}^{t} + \frac{1}{\sqrt{N T}} (1_{N} \otimes 1_{T}) R_{3}^{t}

where

(i): $R_{1}$ and $R_{3}$ are arbitrary,
(ii): $R_{2} = R_{1} W$ where $W$ is an arbitrary matrix, of rank not greater than that of $R_{1}$ , having singular values in the interval [0.θ],with $θ = s_{1}^{2} (\frac{1}{s_{F}^{2}} - \frac{1}{s_{ξ}^{2}})$ .

Proof.

Notice that

\frac{1}{\sqrt{T}} (C_{N} \otimes 1_{T})

,

I_{N} \otimes C_{T}

and

\frac{1}{\sqrt{N T}} (1_{N} \otimes 1_{T})

are orthonormal bases of the eigen spaces of

(I - P_{1_{N T}}) {[(I - P_{1_{N T}}) Ω (I - P_{1_{N T}})]}^{- 1} (I - P_{1_{N T}}) - (I_{N} \otimes E_{T}) \frac{1}{s_{F}^{2}}

corresponding to the eigen-values

\frac{1}{s_{1}^{2}}

,

\frac{1}{s_{ξ}^{2}} - \frac{1}{s_{F}^{2}}

and 0 respectively.

Since

S

has

N T - N

negative eigen-values,

R_{2}

in the expression for

X

in Theorem 3 is heavily restricted. Thus, for a large class of matrices

X

, we cannot perform the Hausman test for all linear parametric functions. We shall now concentrate on this situation (namely, the difference matrix

Π

is not nnd) and obtain the class of all linear parametric functions for which we can still perform the Hausman test.

Notice that the Hausman Test can be performed on estimable linear functions,

A β

, if and only if

A Π A^{t}

is non-negative definite. Also,

A β

are estimable if and only if

A

is of the form

A = Z (I_{N} \otimes E_{T}) X

for some

Z .

First observe that

\hat{D} (\hat{β_{F}}) - \hat{D} (\hat{β_{R}}) = {[X^{t} (I_{N} \otimes E_{T}) X]}^{- 1} s_{F}^{2} - {[X^{t} (\frac{1}{s_{1}^{2}} (E_{N} \otimes {\bar{J}}_{T}) + \frac{1}{s_{ξ}^{2}} (I_{N} \otimes E_{T})) X]}^{- 1}

=

{[X^{t} (I_{N} \otimes E_{T}) X]}^{- 1} (s_{F}^{2} - s_{ξ}^{2}) + {(X^{t} (I_{N} \otimes E_{T}) X)}^{- 1} X^{t} (I_{N} \otimes {\bar{J}}_{T})

{[\frac{s_{1}^{2}}{s_{ξ}^{2}} I_{N T} + (I_{N} \otimes {\bar{J}}_{T}) X {(X^{t} (I_{N} \otimes E_{T}) X)}^{- 1} X^{t} (I_{N} \otimes {\bar{J}}_{T})]}^{- 1} (I_{N} \otimes {\bar{J}}_{T}) X {(X^{t} (I_{N} \otimes E_{T}) X)}^{- 1}

(applying Lemma 10).

Consider

A β

. The class of all

A

such that the Hausman test can be performed for

A β

is completely determined by the class of all

Z

such that

A = Z (I_{N} \otimes E_{T}) X

. Write

C = (I_{N} \otimes E_{T}) X

and

D = (I_{N} \otimes {\bar{J}}_{T}) X

. In these expressions,

C

and

D

are the time-varying and time-invariant parts of

X

.

We want to determine the class of all

Z

such that

A Π A^{t}

is nnd.

Now,

A Π A^{t}

=

Z Φ Z^{t}

where

Φ

=

P_{C} (s_{F}^{2} - s_{ξ}^{2}) + C {(C^{t} C)}^{- 1} D^{t} {(\frac{s_{1}^{2}}{s_{ξ}^{2}} I + D {(C^{t} C)}^{- 1} D^{t})}^{- 1} D {(C^{t} C)}^{- 1} C^{t}

.

As before, we need to get the spectral decomposition of

Φ

in order to determine the class of all

Z

such that

Z Φ Z^{t}

is nnd. We first prove

Lemma 14.

(a) Let

s = ρ (D {(C^{t} C)}^{- 1} D^{t})

. Then

s \leq ρ (C)

.

(b) The non-null eigenvalues of Φ are

(s_{F}^{2} - s_{ξ}^{2}) + {(\frac{s_{1}^{2}}{s_{ξ}^{2}} + λ_{i})}^{- 1} λ_{i}, i = 1, \dots, s

and

s_{F}^{2} - s_{ξ}^{2}, i = s + 1, \dots, k

.

Proof.

(a) is trivial.

(b) follows from the following facts: (i) Non-null eigenvalues of

BF

and

FB

are the same including multiplicities; (ii)

B

and

I + B

commute. (iii) Eigen-values of

α I + B

are obtained by adding

α

to the eigen-values of

B

. (iv) If two real symmetric matrices commute they have simultaneous spectral decomposition. If

B

and

F

are two real symmetric matrices of the same order such that

C (B) \subseteq C (F)

, the null space of

F

is contained in that of

B

. Q.E.D. □

As a consequence we have the following

Theorem 4.

The spectral decomposition of Φ is given by

Φ = G Δ G^{t}

where the columns of

G

form an orthonormal basis of

C

and Δ is a diagonal matrix whose diagonal entries are the non-null eigen-values of Φ as detailed in Lemma 14.

Using the spectral decomposition of

Φ

we can determine the class of all

Z

such that

Z Φ Z^{t}

is nnd using Algorithm 1. From there we can get the class of all

A = Z (I_{N} \otimes E_{T}) X

such that we can perform the Hausman test for

A β

.

We now show that there is at least one good estimator of

{σ_{ξ}}^{2}

which satisfies Theorem 2.

Notice that

s_{F}^{2} = \frac{{R_{o}}^{2}}{N T - N - k}

where

R_{o}^{2}

is the sum of squared residuals in the fixed effects model. Amemiya’s estimator of

{σ_{ξ}}^{2}

, namely,

s_{ξ}^{2}

is

\frac{{R_{o}}^{2}}{N T - k - 1}

(see page 16 of Reference [7]). Clearly

s_{F}^{2} \geq s_{ξ}^{2}

for this choice. Amemiya (1971) obtains some optimal properties of this estimator. Thus we proved

Theorem 5.

For the choice of the estimators

s_{F}^{2} = \frac{{R_{o}}^{2}}{N T - N - k}

and

s_{ξ}^{2}

=

\frac{{R_{o}}^{2}}{N T - k - 1}

of the error variance in the fixed effects model and the variance of the random component of the random effects model respectively, where

R_{o}^{2}

is the sum of squared residuals in the fixed effects model, the difference in the estimated dispersion matrices of the regression coefficient estimators in the fixed effects and random effects models is non-negative definite.

So far, we considered the case where both the random effects and the random error are homoscedastic. We now examine the case where one or both of them are heteroscedastic. Specifically, we explore whether we can find estimators of the variance components whereby the difference in the dispersion matrices of the design parameter estimators corresponding to the fixed effects and random effects specifications respectively is non-negative definite for all

X

.

Let us first write down the fixed effects and random effects specifications with heteroscedasticity.

The fixed effects specification is given by

Y = F α + X β + ϵ

where

α = {(α_{1}, \dots, α_{N})}^{t}

is the vector of fixed effects (treated as non-stochastic),

β = {(β_{1}, \dots, β_{k})}^{t}

is the vector of regression coefficients (also treated as nonstochastic), and

ϵ

is a random error vector of order

N T \times 1

with

E (ϵ) = 0

and

D (ϵ) = d i a g (σ_{1}^{2}, σ_{2}^{2}, \dots, σ_{N}^{2}) \otimes I_{T}

.

The random effects specification is given by

Y = 1_{N T} μ + F α + X β + ξ

where

α, β, ξ

are as specified in the fixed effects model except that

α

is treated as random with

E (α) = 0

and

D (α) = d i a g ({w_{1}}^{2}, \dots, {w_{N}}^{2}) \otimes I_{T}

,

c o v (α, ξ) = 0

and

D (ξ) = d i a g ({r_{1}}^{2}, \dots, {r_{N}}^{2}) \otimes I_{T}

.

If we denote

η = F α + ξ

in the random effects model, we get

D (η) = Ω

=

(T D (α) \otimes {\bar{J}}_{T}) + (D (ξ) \otimes I_{T})

.

Consider the random effects model. Let the estimated dispersion matrices of the random effects and the random error be denoted by

D_{α} = d i a g ({\hat{w}}_{1}^{2}, \dots, {\hat{w}}_{N}^{2})

and

D_{ξ} \otimes I_{T})

where

D_{ξ} = d i a g ({\hat{r}}_{1}^{2}, \dots, {\hat{r}}_{N}^{2})

.

Hence the estimated error dispersion matrix is

\hat{Ω} = (T D_{α} \otimes {\bar{J}}_{T}) + (D_{ξ} \otimes I_{T})

.

Further, the estimated dispersion matrix of the error in the fixed effects specification is

\hat{D} (ϵ) = D_{F} \otimes I_{T}

where

D_{F} = d i a g ({\hat{σ}}_{1}^{2}, {\hat{σ}}_{2}^{2}, \dots, {\hat{σ}}_{N}^{2})

.

The fixed and random effects estimators of

β

, denoted by

\hat{β_{F}}

and

\hat{β_{R}}

are given by

\begin{matrix} \hat{β_{F}} = {(X^{t} (D_{F}^{- 1} \otimes E_{T}) X)}^{- 1} X^{t} (D_{F}^{- 1} \otimes E_{T}) Y \\ a n d \\ \hat{β_{R}} = {(X^{t} E_{1_{N T}} {(E_{1_{N T}} \hat{Ω} E_{1_{N T}})}^{-} E_{1_{N T}} X)}^{- 1} X^{t} E_{1_{N T}} {(E_{1_{N T}} \hat{Ω} E_{1_{N T}})}^{-} E_{1_{N T}} Y . \end{matrix}

Also,

\begin{matrix} \hat{D} (\hat{β_{F}}) = {(X^{t} (D_{F}^{- 1} \otimes E_{T}) X)}^{- 1} \\ a n d \\ \hat{D} (\hat{β_{R}}) = {(X^{t} E_{1_{N T}} {(E_{1_{N T}} \hat{Ω} E_{1_{N T}})}^{-} E_{1_{N T}} X)}^{- 1} . \end{matrix}

We now proceed to evaluate the difference in the estimated dispersion matrices of fixed effects and random effects. As before, the difference in the dispersion matrices of the design parameter estimators corresponding to the fixed effects and random effects specifications respectively is non-negative definite for all

X

if and only if

(E_{1_{N T}} {(E_{1_{N T}} \hat{Ω} E_{1_{N T}})}^{-} E_{1_{N T}}) - (D_{F}^{- 1} \otimes E_{T})

is nnd. We start with computing

E_{1_{N T}} \hat{Ω} E_{1_{N T}} = (I_{N} \otimes I_{T} - {\bar{J}}_{N} \otimes {\bar{J}}_{T}) (T D_{α} \otimes {\bar{J}}_{T} + D_{ξ} \otimes E_{T} + D_{ξ} \otimes {\bar{J}}_{T}) (I_{N} \otimes I_{t} - {\bar{J}}_{N} \otimes {\bar{J}}_{T})

= (T D_{α} \otimes {\bar{J}}_{T} + D_{ξ} \otimes E_{T} + D_{ξ} \otimes {\bar{J}}_{T} - T {\bar{J}}_{N} D_{α} \otimes {\bar{J}}_{T} - {\bar{J}}_{N} D_{ξ} \otimes {\bar{J}}_{T}) (I_{N} \otimes I_{T} - {\bar{J}}_{N} \otimes {\bar{J}}_{T})

= T D_{α} \otimes {\bar{J}}_{T} + D_{ξ} \otimes E_{T} + D_{ξ} \otimes {\bar{J}}_{T} - T {\bar{J}}_{N} D_{α} \otimes {\bar{J}}_{T} - {\bar{J}}_{N} D_{ξ} \otimes {\bar{J}}_{T} - T D_{α} {\bar{J}}_{N} \otimes {\bar{J}}_{T} - D_{ξ} {\bar{J}}_{T} \otimes {\bar{J}}_{T} + T {\bar{J}}_{N} D_{α} {\bar{J}}_{N} \otimes {\bar{J}}_{T} + {\bar{J}}_{N} D_{ξ} {\bar{J}}_{N} \otimes {\bar{J}}_{T}

= D_{ξ} \otimes E_{T} + (T D_{α} + D_{ξ} - T {\bar{J}}_{N} D_{α} - {\bar{J}}_{N} D_{ξ} - T D {\bar{J}}_{N} - D_{ξ} {\bar{J}}_{N} + T {\bar{J}}_{N} D {\bar{J}}_{N} + {\bar{J}}_{N} D_{R} {\bar{J}}_{N}) \otimes {\bar{J}}_{T}

= D_{ξ} \otimes E_{T} + (T (E_{N} D_{α} E_{N}) + E_{N} D_{ξ} E_{N}) \otimes \bar{J_{T}}

= D_{ξ} \otimes E_{T} + E_{N} (T D_{α} + D_{ξ}) E_{N} \otimes {\bar{J}}_{T}

Now,

E_{1_{N T}} {(E_{1_{N T}} \hat{Ω} E_{1_{N T}})}^{-} E_{1_{N T}} = {(I_{N} \otimes I_{T}) - ({\bar{J}}_{N} \otimes {\bar{J}}_{T})} D_{ξ}^{- 1} \otimes E_{T} + {(E_{N} (T D_{α} + D_{ξ}) E_{N})}^{-} \otimes {\bar{J}}_{T}} {I_{N} \otimes I_{T} - {\bar{J}}_{N} \otimes {\bar{J}}_{T}}

= D_{ξ}^{- 1} \otimes E_{T} + {(E_{N} (T D_{α} + D_{ξ}) E_{N})}^{-} \otimes {\bar{J}}_{T} - {\bar{J}}_{N} {(E_{N} (T D_{α} + D_{ξ}) E_{N})}^{-} \otimes {\bar{J}}_{T} \otimes {\bar{J}}_{T} - {(E_{N} (T D_{α} + D_{ξ}) E_{N})}^{-} {\bar{J}}_{N} \otimes {\bar{J}}_{T} + {\bar{J}}_{N} {(E_{N} (T D_{α} + D_{ξ}) E_{N})}^{-} {\bar{J}}_{N} \otimes {\bar{J}}_{T}

= D_{ξ}^{- 1} \otimes E_{T} + {(E_{N} (T D_{α} + D_{ξ}) E_{N})}^{-} E_{N} \otimes {\bar{J}}_{T}

.

Thus, we proved

Theorem 6.

\hat{D} ({\hat{β}}_{F}) - \hat{D} ({\hat{β}}_{R})

under heteroscedistic specification is nnd if and only if

(D_{ξ}^{- 1} - D_{F}^{- 1}) \otimes E_{T} + {(E_{N} (T D_{α} + D_{ξ}) E_{N})}^{-} E_{N} \otimes {\bar{J}}_{T}

is nnd.

We can always get a positive definite estimator of the dispersion matrix of random error, that is

D_{ξ}

. For the difference in the estimated dispersion matrices,

\hat{D} ({\hat{β}}_{F}) - \hat{D} ({\hat{β}}_{R})

to be nnd, we need both

(D_{ξ}^{- 1} - D_{F}^{- 1})

and

(E_{N} (T D_{α} + D_{ξ}) E_{N})

to be nnd. We can use Amemiya type estimator to make the first expression to be nnd. It is easy to see that for the second expression to be nnd, it is sufficient that

(T D_{α} + D_{ξ})

is nnd. This is indeed nnd. (Adapt equation 2.21 of Reference [7] for each individual.) Hence, we can always find error component estimators such that

\hat{D} ({\hat{β}}_{F}) - \hat{D} ({\hat{β}}_{R})

under heteroscedistic specification is nnd.

The cases where the random error alone or the random effects alone are heteroscedastic are simple special cases of the case we have discussed above. However, if there is heteroscedasticity in the random error, performing the Hausman test requires that T is large, for otherwise the large sample chi-square test will not be valid.

In the next section, we shall show that the problem of finding the class of all

Z

such that

Z Φ Z^{t}

is nnd is equivalent to solving a quadratic optimization problem.

7. A Quadratic Optimization Problem

In a previous section we obtained the class of all matrices

Q

such that

Q^{t} BQ

is nnd where

B

is a symmetric indefinite matrix. In this section, for a given symmetric indefinite matrix

B

, we establish the connection between the following two problems:

(a): When is $Q^{t} BQ$ nnd?
(b): When does $x^{t} Bx$ have a minimum subject to $Ax$ = $0$ .

We prove

Theorem 7.

Let

B

be a real symmetric indefinite matrix. Then

Q^{t} BQ

is nnd if and only if

x^{t} Bx

has a minimum subject to

Ax

=

0

where

N (A)

=

C (Q)

.

Proof.

We note that the orthogonal projectors into

N (A)

and

C (Q)

are

(I - A^{+} A)

and

{QQ}^{+}

respectively. Hence

N (A)

=

C (Q)

if and only if

(I - A^{+} A)

=

{QQ}^{+}

‘If part’:

x^{t} Bx

has a minimum subject to

Ax

=

0

⇒

(I - A^{+} A) B (I - A^{+} A)

is nnd

⇒

{QQ}^{+} {BQQ}^{+}

is nnd

⇒

Q^{t} BQ

=

Q^{t} {QQ}^{+} {BQQ}^{+} Q

is nnd.

‘Only if part’:

Q^{t} BQ

is nnd

⇒

Q {(Q^{t} Q)}^{+} Q^{t} BQ {(Q^{t} Q)}^{+} Q^{t}

is nnd

⇒

{QQ}^{+} {BQQ}^{+}

is nnd

⇒

x^{t} Bx

has a minimum subject to

Ax

=

0

.

Given

A

, the class

A_{A}

of all

Q

such that

N (A)

=

C (Q)

can be obtained as follows. Let

(C, C^{t})

be a rank factorization of

(I - A^{+} A)

.

Let

A_{A}

= {

Q : Q = CT

where

T

is an arbitrary full row-rank matrix}. Now,

Q \in A_{A} \Rightarrow C (Q) = C (C)

(since

T

is of full row-rank) =

C (I - A^{+} A))

=

N (A)

.

Conversely, let

N (A)

=

C (Q)

. Then

Q = CT

for some matrix

T

. Also

ρ (Q)

= dimension of

N (A)

= dimension of

C (I - A^{+} A))

=

ρ (C)

. Since

C

has a left inverse,

ρ (Q) = ρ (T)

. So,

(C, T)

is a rank factorization of

Q

. Hence

Q \in A_{A}

. Q.E.D.

Let

Q

be a given matrix. We now obtain the class of all matrices

A

such that

N (A)

=

C (Q)

. Let

(D, D^{t})

be a rank factorization of

(I - {QQ}^{+})

. Then the class

D_{Q}

of all matrices

A

such that

N (A)

=

C (Q)

is given by

D_{Q}

= {

A

:

A = {WD}^{t}

where

W

is an arbitrary full column-rank matrix}.

Proof follows along similar lines to the earlier case. □

8. A Quadratic Optimization Problem with Non-Homogeneous Linear Constraints

In this section, we consider the problem of minimization of a quadratic form

x^{t} Bx

subject to linear constraints

Ax = b

, where B is an

n \times n

symmetric matrix, A an

m \times n

matrix, and b an

m \times 1

vector. The case where B is a pd matrix is well-known [8]. The case where B is nnd is described in Reference [10]. In this section, for given matrices A and B where B is symmetric (not necessarily nnd), we study when the minimum exists in the following cases:

(i): For some non-null vector $b \in R^{m}$ .
(ii): For all non-null vectors $b \in R^{m}$ .

We shall notice that for a suitable matrix

Q

,

Q^{t} BQ

being nnd forms an important condition for the existence of a finite solution to the minimization problem. We shall then proceed to characterize the class of all matrices

B

and vectors

b

(given a matrix

A

), such that

x^{t} Bx

has a finite minimum subject to

Ax = b

.

We prove

Theorem 8.

Let

B

be a real symmetric matrix of order

n \times n

. Let

A

be an

m \times n

matrix and let

b \in C (A)

. Consider the minimization problem:

Minimize

x^{t} Bx

subject to

Ax = b

.

Write

H_{1} = (I - A^{+} A) B (I - A^{+} A)

and

H_{2} = (I - A^{+} A) B A^{+} A

.

(a) The problem has a finite solution for some non-null vector

b

if and only if

H_{1} i s n n d a n d

(2)

H_{1} (H_{1} + H_{2} H_{2}^{t}) H_{2} \neq 0 .

(3)

(b) The problem has a finite solution for every

b \in C (A)

if and only if (2) holds and

C (H_{2}) \subseteq C (H_{1}) .

(4)

Let

b = Au

. The minimum value in either case is given by

u^{t} A^{+} A B A^{+} A u - u^{t} H_{2}^{t} H_{1}^{+} H_{2} u,

(5)

and the minimum value is achieved at all vectors

x

of the form

H_{1}^{+} H_{2} u + S^{t} (\begin{matrix} 0 \\ 1 \end{matrix})

(6)

where

H_{1} = S^{t} (\begin{matrix} Γ & 0 \\ 0 & 0 \end{matrix}) S

is a spectral decomposition of

H_{1}

,

S

being orthogonal, Γ a diagonal positive definite matrix, ζ an arbitrary vector in

R^{n - r}

, r being the rank of

H_{1}

.

Proof.

By Lemma 3, the class of all

x

satisfying

Ax = b

is given by,

x = A^{+} b + (I - A^{+} A) ζ

(7)

where

ζ

is arbitrary. Invoking (7) into

x^{t} Bx

, we get

x^{t} Bx = b^{t} {(A^{+})}^{t} {BA}^{+} b + 2 b^{t} {(A^{+})}^{t} B (I - A^{+} A) ζ + ζ^{t} (I - A^{+} A) B (I - A^{+} A) ζ .

(8)

Thus, constrained minimization of

x^{t} Bx

subject to

Ax = b

is equivalent to unconstrained minimization of the right hand side of (8) over

ζ

. Since

b \in C (A)

, we can write

b = Au

for some

u

. Now, the theorem follows from Lemmas 6 and 7. Q.E.D. □

Let us identify (2)–(6) in terms of a singular value decomposition of

A

. Let us write

b = Au

. Let

A = U (\begin{matrix} Δ & 0 \\ 0 & 0 \end{matrix}) V^{t}

be a singular value decomposition of

A

.

Write

V^{t} BV = (\begin{matrix} R_{11} & R_{12} \\ R_{21} & R_{22} \end{matrix})

, where

R_{11}

is of the same order as

Δ

and

R_{22}

is a square matrix.

It is easy to see that

H_{1} = V (\begin{matrix} 0 & 0 \\ 0 & R_{22} \end{matrix}) V^{t}

and

H_{2} = V (\begin{matrix} 0 & 0 \\ R_{21} & 0 \end{matrix}) V^{t}

.

Hence, (2) is equivalent to saying that

R_{22}

is nnd, (3) is equivalent to saying that

C (R_{21}) \cap C (R_{22}) \neq \{0\}

, (4) is equivalent to the statement that

C (R_{21}) \subseteq C (R_{22})

, and (5) translates to the expression

u^{t} V (\begin{matrix} R_{11} - R_{12} R_{22}^{+} R_{21} & 0 \\ 0 & 0 \end{matrix}) V^{t} u

.

Let

ρ (R_{22}) = s

, and let

R_{22} = M (\begin{matrix} Γ & 0 \\ 0 & 0 \end{matrix}) M^{t}

be a spectral decomposition of

R_{22}

, where

M

is orthogonal and

Γ

is a diagonal positive definite matrix of order

s \times s

. Then a spectral decomposition of

H_{1}

is given by

V (\begin{matrix} I & 0 \\ 0 & M \end{matrix}) P (\begin{matrix} Γ & 0 \\ 0 & 0 \end{matrix}) P^{t} (\begin{matrix} I & 0 \\ 0 & M^{t} \end{matrix}) V^{t}

, where

P

is a suitable permutation matrix. In view of this, (6) translates to the expression

V (\begin{matrix} 0 & 0 \\ R_{22}^{+} R_{21} & 0 \end{matrix}) V^{t} u + V (\begin{matrix} I & 0 \\ 0 & M \end{matrix}) P (\begin{matrix} 0 \\ ζ \end{matrix})

, where

ζ

is an arbitrary vector in

R^{n - s}

.

We note that (2), namely

H_{1}

should be nnd, is a key factor for the existence of a finite solution to the constrained optimization problem under consideration, which falls into the line of investigation in Section 1.

Let

A

be a given

m \times n

matrix of rank r. The above identification helps us in characterizing the class of all real symmetric matrices

B

and the class of all vectors

b \in C (A)

such that the constrained optimization problem

Minimize

x^{t} B x

subject to

Ax = b

has a finite solution.

As before, let

A = U (\begin{matrix} Δ & 0 \\ 0 & 0 \end{matrix}) V^{t}

be singular value decomposition of

A

, where

U

and

V

are orthogonal matrices and

Δ

is a positive definite diagonal matrix of order

r \times r

. Write

V^{t} BV = (\begin{matrix} R_{11} & R_{12} \\ R_{21} & R_{22} \end{matrix})

, where

R_{11}

is of order

r \times r

and

R_{22}

is of order

(n - r) \times (n - r)

. Since,

b \in C (A)

, write

b = Au

. Characterizing

B

and

b

is equivalent to characterizing

R_{11}

,

R_{21}

,

R_{22}

, and

u

.

If the minimization problem should have a finite solution for every

b \in C (A)

, then the class of all

B

is given by

B = V (\begin{matrix} R_{11} & R_{21}^{t} \\ R_{21} & R_{22} \end{matrix}) V^{t}

.

where (a)

R_{11}

is an arbitrary real symmetric matrix of order

r \times r

(b)

R_{22}

is an arbitrary nnd matrix of order

(n - r) \times (n - r)

(c)

R_{21} = R_{22} D

, where

D

is an arbitrary matrix of order

(n - r) \times r

.

If the minimization problem should have a solution for some non-null vector

b \in C (A)

, then the class of all

B

is given by

B = V (\begin{matrix} R_{11} & R_{21}^{t} \\ R_{21} & R_{22} \end{matrix}) V^{t}

, satisfying (a) and (b) as above and

(d) ((

Fy : G

),

W

) is a rank-factorization of

R_{2}

, where

(i): ( $F, F^{t}$ ) is a rank factorization of $R_{22}$ ,
(ii): $y$ is arbitrary non-null vector,
(iii): $G$ is arbitrary such that ( $Fy : G$ ) is of full column rank, and
(iv): $W$ is an arbitrary full row rank matrix.

The class of all

b

is obtained as follows:

Using

B

as obtained above, compute

J = H_{1} {(H_{1} + H_{2} H_{2}^{t})}^{-} H_{2} H_{2}^{t}

. Let

w

be an arbitrary non-zero vector in

C (J)

. Let

u

be an arbitrary solution of

H_{2} u = w

. Compute

b = Au

. Notice that

b \neq 0

, for, if

b = 0

,

Au = 0

, and hence

H_{2} u = 0

, which is a contradiction, since

H_{2} u = w \neq 0

.

We prove

Theorem 9.

LetBbe a symmetric indefinite matrix of order

n \times n

, andAbe an

n \times n

matrix, andbbe an

m \times 1

vector in the column space ofA. If

x^{t} Bx

has a finite minimum subject to

Ax = b

for every

b \in C (A)

, then there exists a generalized inverse

G

ofAsuch that

(I - GA) B (I - GA) i s n n d

(9)

a n d BGA = {(GA)}^{t} B .

(10)

Proof.

Since

x^{t} Bx

has a finite minimum subject to

Ax = b

for every

b

, by Lemma 13, we have,

(I - A^{+} A) B (I - A^{+} A)

is nnd. Whenever,

G

is a generalized inverse of

A

,

(I - A^{+} A) (I - GA) = I - GA

. Now, it follows that,

{(I - GA)}^{t} (I - A^{+} A) B (I - A^{+} A) (I - GA)

=

{(I - GA)}^{t} B (I - GA)

is nnd. Further, from the discussion after Theorem 7, it follows that

R_{22}

is nnd and

C (R_{21}) \subseteq C (R_{22})

.

Since

R_{22}

is nnd,

{(I - GA)}^{t} B (I - GA) = V (\begin{matrix} - Δ M^{t} \\ - I \end{matrix}) R_{22} (\begin{matrix} M Δ & - I \end{matrix}) V^{t}

is nnd, whatever M be. Further, since

C (R_{21}) \subset C (R_{22})

, there exists a matrix T such that

R_{21} = R_{22} T

. Write

M = T Δ^{- 1}

. Now it is easy to verify that, for this choice of M,

G = V (\begin{matrix} Δ^{- 1} & L \\ M & N \end{matrix}) U^{t}

is a generalized inverse of A (where L and N are arbitrary) such that

BGA = {(GA)}^{t} B

. Q.E.D. □

Remark 1.

A^{+}

does not in general have the properties (9) and (10).

Is there anything special about a generalized inverse G of A satisfying (9) and (10)?

It turns out that every generalized inverse G of A satisfying (9) and (10) is in fact a minimum semi-norm generalized inverse of A under a suitable semi-inner product. To see this construct

S = V (\begin{matrix} k Δ^{- 2} - R_{11} & 0 \\ 0 & 0 \end{matrix}) V^{t}

where

k > tr (R_{12}^{t} R_{22}^{-} R_{21})

. Then,

B + A^{t} SA = V (\begin{matrix} kI & R_{21}^{t} \\ R_{21} & R_{22} \end{matrix}) V^{t}

is nnd, since

R_{22}

is nnd,

C (R_{21}) \subset C (R_{22})

and

kI - R_{21}^{t} R_{22}^{-} R_{21}

is nnd (by virtue of choice of k and Lemma 1). Also,

AGA = A

and

(B + A^{t} SA) GA = BGA + A^{t} SA = {(GA)}^{t} B + A^{t} SA,

and is thus symmetric. Hence,

G

is a minimum semi-norm generalized inverse of A under the semi-inner product

(x, y) = y^{t} (B + A^{t} SA) x

(see Theorem 1.4 of Reference [11]).

9. Conclusions, Limitations and Future Research

In this article we discussed extensively and provided new results for the optimization of quadratic structures and respective corrections to the construction of covariance matrices. This is an area has a long history in econometrics and financial economics with various applications in testing panel data models, handling volatility, to applications in portfolio management. Furthermore, the implications in forecasting are numerous as all relate to decision-making at some future time period: panel data models can be used to generate out-of-sample forecasts; a covariance matrix can be used in optimizing a portfolio or in a model for volatility and correlation forecasting. The economic implications of all the aforementioned are profound. We have presented a novel statistical approach for solving a particular class of quadratic optimization problems. At the center of the mathematical derivations is a symmetric indefinite matrix, the indefinite nature of which can come from many sources usually rank deficiency or rank indeterminacy driven from redundant information in the variables from which we compute that said matrix. The problem of this indeterminacy leads to subsequent problems in the context of: a matrix-version of the well-known Hausman test in econometrics; a large portfolio optimization the covariance matrix of the financial returns might not be positive definite. This is the body of literature and application we contribute to.

As any other statistical derivation, this paper comes with the usual limitations and caveats of any statistical analysis—we do provide solutions to well known problems, but for which there are alternative solutions [6], and as such no solution is universally better, and given collected samples the researcher may have to try an extensive array of available tools, to which we contribute emphatically one more here.

For future research, we leave the investigation of further application areas for our propositions, as well as simulations for a wide range of panel data, and optimization problems.

Author Contributions

Conceptualization, B.P.; methodology, B.P. and S.S.; software, D.D.T.; validation, B.P., S.S. and D.D.T.; formal analysis, B.P. and S.S.; investigation, B.P., S.S., D.D.T.; resources, B.P. and S.S.; data curation, S.S.; writing–original draft preparation, B.P.; writing–review and editing, S.S., D.D.T., and K.N.; supervision, B.P.; project administration, B.P. and K.N.; All authors have read and agreed to the published version of the manuscript.(S.S. and K.N. on behalf of B.P.

Funding

This research received no external funding.

Acknowledgments

Bhimasankaram Pochiraju has acknowledged and Sridhar Seshadri gratefully acknowledges the research support from the Applied Statistics and Computing Lab, Indian School of Business. Nikolopoulos gratefully acknowledges the research support from Indian School of Business during his two visits in the campuses in Hyderabad in 2017, and the extended stay in Mohali in early 2018 while delivering the PGP elective module on ‘Forecasting Analytics’.

Conflicts of Interest

The authors declare no conflict of interest.

References

Linton, O.; Tang, H. Estimation of the Kronecker Covariance Model by Quadratic Form; Cambridge Working Papers in Economics 2050; Faculty of Economics, University of Cambridge: Cambridge, UK, 2020. [Google Scholar]
Eriksson, A.; Preve, D.; Yu, J. Forecasting Realized Volatility Using a Nonnegative Semiparametric Modell. J. Risk Financ. Manag. 2019, 12, 139. [Google Scholar] [CrossRef] [Green Version]
Toloo, M.; Mensah, E. Robust optimization with nonnegative decision variables: A DEA approach. Comput. Ind. Eng. 2019, 127, 313–325. [Google Scholar] [CrossRef]
Mitra, S.; Bhimasankaram, P.; Malik, S. Matrix Partial Orders. Shorted Operators and Applications; World Scientific: Singapore, 2010. [Google Scholar]
Greene, W. Econometric Analysis, 7th ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2012. [Google Scholar]
Amemiya, T. The estimation of variance components in a variance components model. Int. Econ. Rev. 1971, 12, 1–13. [Google Scholar] [CrossRef]
Baltagi, B. Econometric Analysis of Panel Data, 5th ed.; Wiley: New York, NY, USA, 2013. [Google Scholar]
Rao, A.; Bhimasankaram, P. Linear Algebra, 2nd ed.; Hindustan Book Agency: New Delhi, India, 2000. [Google Scholar]
Goga, C. Variance Estimators in Survey Sampling. Available online: http://goga.perso.math.cnrs.fr/ChapVar1_coursBesan.pdf (accessed on 24 March 2008).
Kambo, N. Mathematical Programming Techniques; Affiliated East West Press: New Delhi, India, 1984. [Google Scholar]
Rao, C.; Mitra, S. Generalized Inverse of Matrices and Its Applications; Wiley: New York, NY, USA, 1971. [Google Scholar]

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pochiraju, B.; Seshadri, S.; Thomakos, D.D.; Nikolopoulos, K. Non-Negativity of a Quadratic form with Applications to Panel Data Estimation, Forecasting and Optimization. Stats 2020, 3, 185-202. https://doi.org/10.3390/stats3030015

AMA Style

Pochiraju B, Seshadri S, Thomakos DD, Nikolopoulos K. Non-Negativity of a Quadratic form with Applications to Panel Data Estimation, Forecasting and Optimization. Stats. 2020; 3(3):185-202. https://doi.org/10.3390/stats3030015

Chicago/Turabian Style

Pochiraju, Bhimasankaram, Sridhar Seshadri, Dimitrios D. Thomakos, and Konstantinos Nikolopoulos. 2020. "Non-Negativity of a Quadratic form with Applications to Panel Data Estimation, Forecasting and Optimization" Stats 3, no. 3: 185-202. https://doi.org/10.3390/stats3030015

APA Style

Pochiraju, B., Seshadri, S., Thomakos, D. D., & Nikolopoulos, K. (2020). Non-Negativity of a Quadratic form with Applications to Panel Data Estimation, Forecasting and Optimization. Stats, 3(3), 185-202. https://doi.org/10.3390/stats3030015

Article Menu

Non-Negativity of a Quadratic form with Applications to Panel Data Estimation, Forecasting and Optimization

Abstract

1. Introduction

2. Notations

3. Preliminaries

4. Non-Negative Definiteness of $Q^{t} BQ$

5. Remarks

6. Hausman Test

7. A Quadratic Optimization Problem

8. A Quadratic Optimization Problem with Non-Homogeneous Linear Constraints

9. Conclusions, Limitations and Future Research

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Non-Negativity of a Quadratic form with Applications to Panel Data Estimation, Forecasting and Optimization

Abstract

1. Introduction

2. Notations

3. Preliminaries

4. Non-Negative Definiteness of Q t BQ

5. Remarks

6. Hausman Test

7. A Quadratic Optimization Problem

8. A Quadratic Optimization Problem with Non-Homogeneous Linear Constraints

9. Conclusions, Limitations and Future Research

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4. Non-Negative Definiteness of $Q^{t} BQ$