Abstract
In a multiple linear regression model, the ordinary least squares estimator is inefficient when multicollinearity exists. Many authors have proposed different estimators to overcome the multicollinearity problem in linear regression models. This paper introduces a new regression estimator, called the Dawoud–Kibria estimator, as an alternative to the ordinary least squares estimator. Theory and simulation results show that this estimator performs better than other regression estimators under some conditions, according to the mean squared error criterion. Real-life datasets are used to illustrate the findings of the paper.
1. Introduction
Consider the following linear regression model:
$$ y = X\beta + \varepsilon, \quad (1) $$
where $y$ is an $n \times 1$ vector of the dependent variable, $X$ is a known $n \times p$ full rank matrix of explanatory variables, $\beta$ is a $p \times 1$ vector of unknown regression parameters, and $\varepsilon$ is an $n \times 1$ vector of disturbances with zero mean and variance–covariance matrix $\sigma^2 I_n$, where $I_n$ is an identity matrix of order $n \times n$. The ordinary least squares (OLS) estimator of $\beta$ in (1) is defined by
$$ \hat{\beta} = (X'X)^{-1}X'y. \quad (2) $$
Under the normality assumption of the disturbances, $\hat{\beta}$ follows the $N\left(\beta, \sigma^2(X'X)^{-1}\right)$ distribution.
In a multiple linear regression model, it is assumed that the explanatory variables are independent. However, in real-life situations, there may be strong or nearly strong linear relationships among the explanatory variables, which causes the problem of multicollinearity. In the presence of multicollinearity, it is difficult to estimate the unique effect of individual variables in the regression equation. Moreover, the OLS estimator becomes unstable or inefficient and may produce coefficient estimates with the wrong sign (see Hoerl and Kennard [1]). To overcome these problems, many authors have introduced different kinds of one- and two-parameter estimators: to mention a few, Stein [2], Massy [3], Hoerl and Kennard [1], Mayer and Willke [4], Swindel [5], Liu [6], Akdeniz and Kaçiranlar [7], Ozkale and Kaçiranlar [8], Sakallıoglu and Kaçıranlar [9], Yang and Chang [10], Roozbeh [11], Akdeniz and Roozbeh [12], Lukman et al. [13,14], and, very recently, Kibria and Lukman [15], among others.
The objective of this paper is to introduce a new class of two-parameter estimator for the regression parameter when the explanatory variables are correlated and then to compare the performance of the new estimator with the OLS estimator, the ordinary ridge regression (ORR) estimator, the Liu estimator, the Kibria–Lukman (KL) estimator, the two-parameter (TP) estimator proposed by Ozkale and Kaciranlar [8], and the new two-parameter (NTP) estimator proposed by Yang and Chang [10].
1.1. Some Alternative Biased Estimators and the Proposed Estimator
The canonical form of Equation (1) is as follows:
$$ y = Z\alpha + \varepsilon, \quad (3) $$
where $Z = XQ$ and $\alpha = Q'\beta$. Here, $Q$ is an orthogonal matrix such that $Q'X'XQ = Z'Z = \Lambda = \operatorname{diag}(\lambda_1, \lambda_2, \ldots, \lambda_p)$, where $\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_p > 0$ are the eigenvalues of $X'X$. The OLS estimator of $\alpha$ is as follows:
$$ \hat{\alpha} = \Lambda^{-1}Z'y, \quad (4) $$
and the mean squared error matrix (MSEM) of $\hat{\alpha}$ is given by
$$ \operatorname{MSEM}(\hat{\alpha}) = \sigma^2\Lambda^{-1}. \quad (5) $$
The ORR estimator of $\alpha$ [1] is given by
$$ \hat{\alpha}(k) = W\hat{\alpha}, \quad (6) $$
where $W = (\Lambda + kI_p)^{-1}\Lambda$, $k > 0$ is the biasing parameter, and
$$ \operatorname{MSEM}(\hat{\alpha}(k)) = \sigma^2 W\Lambda^{-1}W' + (W - I_p)\alpha\alpha'(W - I_p)'. \quad (7) $$
The Liu estimator of $\alpha$ [6] is given by
$$ \hat{\alpha}(d) = F\hat{\alpha}, \quad (8) $$
where $F = (\Lambda + I_p)^{-1}(\Lambda + dI_p)$, $0 < d < 1$ is the biasing parameter of the Liu estimator, and
$$ \operatorname{MSEM}(\hat{\alpha}(d)) = \sigma^2 F\Lambda^{-1}F' + (F - I_p)\alpha\alpha'(F - I_p)'. \quad (9) $$
The KL estimator of $\alpha$ [15] is given by
$$ \hat{\alpha}_{KL} = M\hat{\alpha}, \quad (10) $$
where $M = (\Lambda + kI_p)^{-1}(\Lambda - kI_p)$, $k > 0$, and
$$ \operatorname{MSEM}(\hat{\alpha}_{KL}) = \sigma^2 M\Lambda^{-1}M' + (M - I_p)\alpha\alpha'(M - I_p)'. \quad (11) $$
The two-parameter (TP) estimator of $\alpha$ (Ozkale and Kaçiranlar [8]) is given by
$$ \hat{\alpha}_{TP}(k,d) = T\hat{\alpha}, \quad (12) $$
where $T = (\Lambda + kI_p)^{-1}(\Lambda + kdI_p)$, $k > 0$ and $0 < d < 1$ are the biasing parameters, and
$$ \operatorname{MSEM}(\hat{\alpha}_{TP}(k,d)) = \sigma^2 T\Lambda^{-1}T' + (T - I_p)\alpha\alpha'(T - I_p)'. \quad (13) $$
The new two-parameter (NTP) estimator of $\alpha$ (Yang and Chang [10]) is given by
$$ \hat{\alpha}_{NTP}(k,d) = FW\hat{\alpha}, \quad (14) $$
with
$$ \operatorname{MSEM}(\hat{\alpha}_{NTP}(k,d)) = \sigma^2 FW\Lambda^{-1}W'F' + (FW - I_p)\alpha\alpha'(FW - I_p)'. \quad (15) $$
The proposed new class of two-parameter estimator of $\alpha$ is obtained by minimizing $(y - Z\alpha)'(y - Z\alpha)$, subject to $(\alpha + \hat{\alpha})'(\alpha + \hat{\alpha}) = c$, where $c$ is a constant:
$$ F = (y - Z\alpha)'(y - Z\alpha) + k\left[(\alpha + \hat{\alpha})'(\alpha + \hat{\alpha}) - c\right] + kd\left[(\alpha + \hat{\alpha})'(\alpha + \hat{\alpha}) - c\right]. \quad (16) $$
Here, $k$ and $kd$ are the Lagrangian multipliers.
The solution of minimizing the objective function $(y - Z\alpha)'(y - Z\alpha) + k\left[(\alpha + \hat{\alpha})'(\alpha + \hat{\alpha}) - c\right]$ was obtained by Kibria and Lukman [15] to get the KL estimator defined in Equation (10).
Now, the solution to (16) gives the proposed estimator as follows:
$$ \hat{\alpha}_{DK}(k,d) = M_{kd}\hat{\alpha}, \quad (17) $$
where $M_{kd} = (\Lambda + k(1+d)I_p)^{-1}(\Lambda - k(1+d)I_p)$, $k > 0$, and $0 < d < 1$.
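For completeness, here is a short derivation sketch (our addition) of (17) from the first-order condition of (16), using $Z'y = \Lambda\hat{\alpha}$ from (4):
$$ \frac{\partial F}{\partial \alpha} = -2Z'y + 2\Lambda\alpha + 2k(1+d)(\alpha + \hat{\alpha}) = 0 \;\Longrightarrow\; \left(\Lambda + k(1+d)I_p\right)\alpha = \left(\Lambda - k(1+d)I_p\right)\hat{\alpha}, $$
which yields $\hat{\alpha}_{DK}(k,d)$ in (17).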
The proposed estimator will be called the Dawoud–Kibria (DK) estimator and is denoted by $\hat{\alpha}_{DK}(k,d)$.
Moreover, the proposed DK estimator is also obtained by augmenting $-\sqrt{k(1+d)}\,\hat{\alpha} = \sqrt{k(1+d)}\,\alpha + \varepsilon'$ to (3) and then using the OLS estimate. The MSEM of the DK estimator is given by
$$ \operatorname{MSEM}(\hat{\alpha}_{DK}(k,d)) = \sigma^2 M_{kd}\Lambda^{-1}M_{kd}' + (M_{kd} - I_p)\alpha\alpha'(M_{kd} - I_p)'. \quad (18) $$
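To make the estimator concrete, the following is a minimal numerical sketch (our illustration, not code from the paper) of computing $\hat{\alpha}_{DK}(k,d)$ through the canonical form; the function name and the back-transformation to $\hat{\beta}$ are our own choices.

```python
import numpy as np

def dk_estimator(X, y, k, d):
    """Minimal sketch of the Dawoud-Kibria (DK) estimator, Equation (17).

    Works in the canonical form y = Z alpha + e, with Z = XQ and
    Q'X'XQ = Lambda, then maps the estimate back to beta = Q alpha.
    """
    lam, Q = np.linalg.eigh(X.T @ X)       # X'X = Q diag(lam) Q'
    Z = X @ Q                              # canonical regressors
    alpha_ols = (Z.T @ y) / lam            # OLS in canonical form, Equation (4)
    c = k * (1.0 + d)                      # the DK shrinkage constant k(1+d)
    alpha_dk = (lam - c) / (lam + c) * alpha_ols   # Equation (17), elementwise
    return Q @ alpha_dk                    # estimate of beta
```

Note that for $d = 0$ the sketch reduces to the KL estimator and for $k = 0$ to OLS, matching the special cases discussed below.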
The main differences between the KL estimator and the proposed DK estimator are as follows:
- The KL is a one-parameter estimator, while the proposed DK is a two-parameter estimator.
- The KL estimator is obtained from the objective function $(y - Z\alpha)'(y - Z\alpha) + k\left[(\alpha + \hat{\alpha})'(\alpha + \hat{\alpha}) - c\right]$, while the proposed DK estimator is obtained from a different objective function, namely (16).
- The KL estimator is a function of the shrinkage parameter $k$ only, while the proposed DK estimator is a function of both $k$ and $d$.
- Since the KL estimator has one parameter and the proposed DK estimator has two parameters, their MSEs are different.
- In the KL estimator, the shrinkage parameter $k$ needs to be estimated, while in the proposed DK estimator, both $k$ and $d$ need to be estimated.
- The KL estimator is a special case of the proposed DK estimator when $d = 0$, so the proposed DK estimator is the more general estimator.
The following lemmas will be used to make some theoretical comparisons among the estimators in the next section.
Lemma 1 [16].
Let $M$ and $N$ be two $n \times n$ matrices such that $M > 0$ and $N \geq 0$ (or $N > 0$). Then $M > N$ if and only if $\lambda_{\max}(NM^{-1}) < 1$, where $\lambda_{\max}(NM^{-1})$ is the maximum eigenvalue of the matrix $NM^{-1}$.
Lemma 2 [17].
Let $M$ be an $n \times n$ positive definite matrix, that is, $M > 0$, and let $\alpha$ be some vector. Then $M - \alpha\alpha' \geq 0$ if and only if $\alpha'M^{-1}\alpha \leq 1$.
Lemma 3 [18].
Let $\hat{\beta}_j = A_j y$, $j = 1, 2$, be two linear estimators of $\beta$. Suppose that $D = \operatorname{Cov}(\hat{\beta}_1) - \operatorname{Cov}(\hat{\beta}_2) > 0$, where $\operatorname{Cov}(\hat{\beta}_j)$ is the covariance matrix of $\hat{\beta}_j$, and $b_j = \operatorname{Bias}(\hat{\beta}_j) = (A_j X - I)\beta$, $j = 1, 2$. Consequently,
$$ \Delta(\hat{\beta}_1, \hat{\beta}_2) = \operatorname{MSEM}(\hat{\beta}_1) - \operatorname{MSEM}(\hat{\beta}_2) = \sigma^2 D + b_1 b_1' - b_2 b_2' > 0 $$
if and only if $b_2'\left[\sigma^2 D + b_1 b_1'\right]^{-1} b_2 < 1$, where $\operatorname{MSEM}(\hat{\beta}_j) = \operatorname{Cov}(\hat{\beta}_j) + b_j b_j'$.
The rest of this article is organized as follows: In Section 2, we give the theoretical comparisons among the abovementioned estimators and derive the biasing parameters of the proposed DK estimator. A simulation study is conducted in Section 3. Two numerical examples are illustrated in Section 4. Finally, some concluding remarks are given in Section 5.
2. Comparison among the Estimators
2.1. Theoretical Comparisons among the Proposed DK Estimator and the OLS, ORR, Liu, KL, TP, and NTP Estimators
Theorem 1.
The proposed estimator $\hat{\alpha}_{DK}(k,d)$ is superior to the estimator $\hat{\alpha}$ if and only if
$$ b_{DK}'\left[\sigma^2\left(\Lambda^{-1} - M_{kd}\Lambda^{-1}M_{kd}'\right)\right]^{-1} b_{DK} < 1, \quad (19) $$
where $b_{DK} = (M_{kd} - I_p)\alpha$.
Proof.
The difference of the dispersion matrices is given by
$$ D(\hat{\alpha}) - D(\hat{\alpha}_{DK}(k,d)) = \sigma^2\left(\Lambda^{-1} - M_{kd}\Lambda^{-1}M_{kd}'\right), \quad (20) $$
where
$$ \Lambda^{-1} - M_{kd}\Lambda^{-1}M_{kd}' = \operatorname{diag}\left\{\frac{(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2}{\lambda_i(\lambda_i + k(1+d))^2}\right\}_{i=1}^{p}. \quad (21) $$
$\Lambda^{-1} - M_{kd}\Lambda^{-1}M_{kd}'$ will be positive definite (pd) if and only if $(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2 > 0$.
We observed that for $k > 0$ and $0 < d < 1$, $(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2 = 4\lambda_i k(1+d) > 0$.
Consequently, $D(\hat{\alpha}) - D(\hat{\alpha}_{DK}(k,d))$ is positive definite, and the stated condition follows from Lemma 2. □
Theorem 2.
When $\lambda_{\max}\left(M_{kd}\Lambda^{-1}M_{kd}'\left(W\Lambda^{-1}W'\right)^{-1}\right) < 1$, the proposed estimator $\hat{\alpha}_{DK}(k,d)$ is superior to the estimator $\hat{\alpha}(k)$ if and only if
$$ b_{DK}'\left[\sigma^2\left(W\Lambda^{-1}W' - M_{kd}\Lambda^{-1}M_{kd}'\right) + b_{ORR}b_{ORR}'\right]^{-1} b_{DK} < 1, \quad (22) $$
where $b_{ORR} = (W - I_p)\alpha$.
Proof.
It is clear that for $k > 0$ and $0 < d < 1$, $W\Lambda^{-1}W' > 0$ and $M_{kd}\Lambda^{-1}M_{kd}' > 0$. By Lemma 1, $W\Lambda^{-1}W' - M_{kd}\Lambda^{-1}M_{kd}' > 0$ if and only if
$$ \lambda_{\max}\left(M_{kd}\Lambda^{-1}M_{kd}'\left(W\Lambda^{-1}W'\right)^{-1}\right) < 1, \quad (23) $$
where $\lambda_{\max}\left(M_{kd}\Lambda^{-1}M_{kd}'\left(W\Lambda^{-1}W'\right)^{-1}\right)$ is the maximum eigenvalue of the matrix $M_{kd}\Lambda^{-1}M_{kd}'\left(W\Lambda^{-1}W'\right)^{-1}$. Consequently, $\sigma^2\left(W\Lambda^{-1}W' - M_{kd}\Lambda^{-1}M_{kd}'\right)$ is positive definite, and the result follows from Lemma 3. □
Theorem 3.
The proposed estimator $\hat{\alpha}_{DK}(k,d)$ is superior to the estimator $\hat{\alpha}(d)$ if and only if
$$ b_{DK}'\left[\sigma^2\left(F\Lambda^{-1}F' - M_{kd}\Lambda^{-1}M_{kd}'\right) + b_{Liu}b_{Liu}'\right]^{-1} b_{DK} < 1, \quad (24) $$
where $b_{Liu} = (F - I_p)\alpha$.
Proof.
Using the difference between the dispersion matrices,
$$ D(\hat{\alpha}(d)) - D(\hat{\alpha}_{DK}(k,d)) = \sigma^2\operatorname{diag}\left\{\frac{(\lambda_i + d)^2}{\lambda_i(\lambda_i + 1)^2} - \frac{(\lambda_i - k(1+d))^2}{\lambda_i(\lambda_i + k(1+d))^2}\right\}_{i=1}^{p}, \quad (25) $$
where $D(\hat{\alpha}(d)) - D(\hat{\alpha}_{DK}(k,d))$ will be pd if and only if
$(\lambda_i + d)^2(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2(\lambda_i + 1)^2 > 0$.
So, if $k > 0$ and $0 < d < 1$, $(\lambda_i + d)^2(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2(\lambda_i + 1)^2 > 0$. Consequently,
$D(\hat{\alpha}(d)) - D(\hat{\alpha}_{DK}(k,d))$ is positive definite. □
Theorem 4.
The proposed estimator $\hat{\alpha}_{DK}(k,d)$ is superior to the estimator $\hat{\alpha}_{KL}$ if and only if
$$ b_{DK}'\left[\sigma^2\left(M\Lambda^{-1}M' - M_{kd}\Lambda^{-1}M_{kd}'\right) + b_{KL}b_{KL}'\right]^{-1} b_{DK} < 1, \quad (26) $$
where $b_{KL} = (M - I_p)\alpha$.
Proof.
Using the difference between the dispersion matrices,
$$ D(\hat{\alpha}_{KL}) - D(\hat{\alpha}_{DK}(k,d)) = \sigma^2\operatorname{diag}\left\{\frac{(\lambda_i - k)^2}{\lambda_i(\lambda_i + k)^2} - \frac{(\lambda_i - k(1+d))^2}{\lambda_i(\lambda_i + k(1+d))^2}\right\}_{i=1}^{p}, \quad (27) $$
where $D(\hat{\alpha}_{KL}) - D(\hat{\alpha}_{DK}(k,d))$ will be pd if and only if
$(\lambda_i - k)^2(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2(\lambda_i + k)^2 > 0$.
Obviously, for $k > 0$ and $0 < d < 1$,
$(\lambda_i - k)^2(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2(\lambda_i + k)^2 > 0$.
Consequently,
$D(\hat{\alpha}_{KL}) - D(\hat{\alpha}_{DK}(k,d))$ is positive definite. □
Theorem 5.
The proposed estimator $\hat{\alpha}_{DK}(k,d)$ is superior to the estimator $\hat{\alpha}_{TP}(k,d)$ if and only if
$$ b_{DK}'\left[\sigma^2\left(T\Lambda^{-1}T' - M_{kd}\Lambda^{-1}M_{kd}'\right) + b_{TP}b_{TP}'\right]^{-1} b_{DK} < 1, \quad (28) $$
where $b_{TP} = (T - I_p)\alpha$.
Proof.
The difference between the dispersion matrices is
$$ D(\hat{\alpha}_{TP}(k,d)) - D(\hat{\alpha}_{DK}(k,d)) = \sigma^2\operatorname{diag}\left\{\frac{(\lambda_i + kd)^2}{\lambda_i(\lambda_i + k)^2} - \frac{(\lambda_i - k(1+d))^2}{\lambda_i(\lambda_i + k(1+d))^2}\right\}_{i=1}^{p}, \quad (29) $$
which will be positive definite if and only if $(\lambda_i + kd)^2(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2(\lambda_i + k)^2 > 0$.
Clearly, for $k > 0$ and $0 < d < 1$, $(\lambda_i + kd)^2(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2(\lambda_i + k)^2 > 0$. Consequently, $D(\hat{\alpha}_{TP}(k,d)) - D(\hat{\alpha}_{DK}(k,d))$ is pd. □
Theorem 6.
The proposed estimator $\hat{\alpha}_{DK}(k,d)$ is superior to the estimator $\hat{\alpha}_{NTP}(k,d)$ if and only if
$$ b_{DK}'\left[\sigma^2\left(FW\Lambda^{-1}W'F' - M_{kd}\Lambda^{-1}M_{kd}'\right) + b_{NTP}b_{NTP}'\right]^{-1} b_{DK} < 1, \quad (30) $$
where $b_{NTP} = (FW - I_p)\alpha$.
Proof.
The difference between the dispersion matrices is
$$ D(\hat{\alpha}_{NTP}(k,d)) - D(\hat{\alpha}_{DK}(k,d)) = \sigma^2\operatorname{diag}\left\{\frac{\lambda_i(\lambda_i + d)^2}{(\lambda_i + 1)^2(\lambda_i + k)^2} - \frac{(\lambda_i - k(1+d))^2}{\lambda_i(\lambda_i + k(1+d))^2}\right\}_{i=1}^{p}, \quad (31) $$
which will be pd if and only if $\lambda_i^2(\lambda_i + d)^2(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2(\lambda_i + 1)^2(\lambda_i + k)^2 > 0$.
Clearly, for $k > 0$ and $0 < d < 1$,
$\lambda_i^2(\lambda_i + d)^2(\lambda_i + k(1+d))^2 - (\lambda_i - k(1+d))^2(\lambda_i + 1)^2(\lambda_i + k)^2 > 0$.
Consequently, $D(\hat{\alpha}_{NTP}(k,d)) - D(\hat{\alpha}_{DK}(k,d))$ is positive definite. □
2.2. Determination of the Parameters k and d
Since both biasing parameters $k$ and $d$ are unknown and need to be estimated from the observed data, we give a short discussion of the estimation of these parameters in this subsection. The biasing parameter $k$ in the ORR estimator and the biasing parameter $d$ in the Liu estimator were derived by Hoerl and Kennard [1] and Liu [6], respectively. Different authors have proposed different estimators of $k$ and $d$ for different kinds of models: to mention a few, Hoerl et al. [19], Kibria [20], Kibria and Banik [21], Lukman and Ayinde [22], Mansson et al. [23], and Khalaf and Shukur [24], among others.
Now, we will discuss the estimation of the optimal values of $k$ and $d$ for the proposed DK estimator. First, we assume that $d$ is fixed; then the optimal value of $k$ can be obtained by minimizing
$$ \operatorname{MSE}(\hat{\alpha}_{DK}(k,d)) = \sum_{i=1}^{p}\frac{\sigma^2(\lambda_i - k(1+d))^2}{\lambda_i(\lambda_i + k(1+d))^2} + \sum_{i=1}^{p}\frac{4k^2(1+d)^2\alpha_i^2}{(\lambda_i + k(1+d))^2}. \quad (32) $$
Differentiating $\operatorname{MSE}(\hat{\alpha}_{DK}(k,d))$ with respect to $k$ and setting $\partial\operatorname{MSE}(\hat{\alpha}_{DK}(k,d))/\partial k = 0$, we obtain
$$ k_i = \frac{\lambda_i\sigma^2}{(1+d)\left(\sigma^2 + 2\lambda_i\alpha_i^2\right)}. \quad (33) $$
Since the optimal value of $k$ in (33) depends on the unknown parameters $\sigma^2$ and $\alpha_i^2$, we replace them with their corresponding unbiased estimators $\hat{\sigma}^2$ and $\hat{\alpha}_i^2$. Consequently, we have
$$ \hat{k}_i = \frac{\lambda_i\hat{\sigma}^2}{(1+d)\left(\hat{\sigma}^2 + 2\lambda_i\hat{\alpha}_i^2\right)} \quad (34) $$
and
$$ \hat{k} = \min\left\{\frac{\lambda_i\hat{\sigma}^2}{(1+d)\left(\hat{\sigma}^2 + 2\lambda_i\hat{\alpha}_i^2\right)}\right\}. \quad (35) $$
Furthermore, the optimal value of $d$ can be obtained by differentiating $\operatorname{MSE}(\hat{\alpha}_{DK}(k,d))$ with respect to $d$ for a fixed $k$ and setting $\partial\operatorname{MSE}(\hat{\alpha}_{DK}(k,d))/\partial d = 0$; since the MSE in (32) depends on $k$ and $d$ only through the product $k(1+d)$, the same optimum gives
$$ d_i = \frac{\lambda_i\sigma^2}{k\left(\sigma^2 + 2\lambda_i\alpha_i^2\right)} - 1, \quad (36) $$
where $k > 0$.
Additionally, the optimal $d$ with known parameters is
$$ d = \min\left\{\frac{\lambda_i\sigma^2}{k\left(\sigma^2 + 2\lambda_i\alpha_i^2\right)} - 1\right\}, \quad (37) $$
where $k > 0$.
In addition, replacing the unknown $\sigma^2$ and $\alpha_i^2$ with their unbiased estimators gives
$$ \hat{d} = \min\left\{\frac{\lambda_i\hat{\sigma}^2}{\hat{k}\left(\hat{\sigma}^2 + 2\lambda_i\hat{\alpha}_i^2\right)} - 1\right\}. \quad (38) $$
The determination of the parameters $k$ and $d$ of $\hat{\alpha}_{DK}(k,d)$ is obtained iteratively as follows (see the sketch after these steps):
Step 1: Obtain an initial estimate of $d$ using $\hat{d} = \min\left\{\hat{\alpha}_i^2\big/\left(\hat{\sigma}^2/\lambda_i + \hat{\alpha}_i^2\right)\right\}$.
Step 2: Obtain $\hat{k}$ from (35) using $\hat{d}$ in Step 1.
Step 3: Estimate $\hat{d}$ in (38) by using $\hat{k}$ in Step 2.
Step 4: In case $\hat{d}$ is not between 0 and 1, use $\hat{d}$ in Step 1.
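A minimal numerical sketch of Steps 1–4 follows (our illustration; the initial estimator of $d$ in Step 1 and the use of the residual mean square for $\hat{\sigma}^2$ are assumptions on our part):

```python
import numpy as np

def estimate_k_d(X, y):
    """Iterative estimation of the DK biasing parameters (Steps 1-4)."""
    n, p = X.shape
    lam, Q = np.linalg.eigh(X.T @ X)
    Z = X @ Q
    alpha = (Z.T @ y) / lam                     # OLS in canonical form
    resid = y - Z @ alpha
    sigma2 = (resid @ resid) / (n - p)          # residual mean square (assumed)

    # Step 1: initial d (Ozkale-Kaciranlar-type estimator, our assumption)
    d0 = np.min(alpha**2 / (sigma2 / lam + alpha**2))
    # Step 2: k from Equation (35), using the initial d
    k = np.min(lam * sigma2 / ((1 + d0) * (sigma2 + 2 * lam * alpha**2)))
    # Step 3: d from Equation (38), using k from Step 2
    d = np.min(lam * sigma2 / (k * (sigma2 + 2 * lam * alpha**2)) - 1)
    # Step 4: fall back to the initial d if d leaves the interval (0, 1)
    if not (0 < d < 1):
        d = d0
    return k, d
```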
Additionally, Hoerl et al. [19] defined the biasing parameter $k$ for the ORR estimator as
$$ \hat{k}_{HKB} = \frac{p\hat{\sigma}^2}{\sum_{i=1}^{p}\hat{\alpha}_i^2}. \quad (39) $$
The biasing parameter $d$ given by Ozkale and Kaciranlar [8] is adopted for the Liu estimator:
$$ \hat{d} = \min\left\{\frac{\hat{\alpha}_i^2}{\hat{\sigma}^2/\lambda_i + \hat{\alpha}_i^2}\right\}. \quad (40) $$
Then, Kibria and Lukman [15] found the biasing parameter estimator for the KL estimator as
$$ \hat{k}_{KL} = \min\left\{\frac{\hat{\sigma}^2}{2\hat{\alpha}_i^2 + \hat{\sigma}^2/\lambda_i}\right\}. \quad (41) $$
In addition, $\hat{k}_{KL}$ of the KL estimator is also obtained by setting $d = 0$ in the derived biasing parameter estimator (35) for the proposed DK estimator.
3. Simulation Study
To support the theoretical comparisons, a Monte Carlo simulation study is conducted in this section to compare the performance of the estimators. As such, this section contains (i) the simulation technique and (ii) a discussion of the results.
3.1. Simulation Technique
Following Gibbons [25] and Kibria [20], we generated the explanatory variables using the following equation:
$$ x_{ij} = (1 - \rho^2)^{1/2} z_{ij} + \rho z_{i,p+1}, \quad i = 1, 2, \ldots, n, \quad j = 1, 2, \ldots, p, \quad (42) $$
where $z_{ij}$ are independent standard normal pseudo-random numbers, and $\rho$ represents the correlation between any two explanatory variables and is considered here to be 0.90 and 0.99. Both smaller and larger numbers of explanatory variables $p$ are considered in the simulation. These variables are standardized so that $X'X$ and $X'y$ are in correlation forms. The $n$ observations for the dependent variable $y$ are determined by the following equation:
$$ y_i = \beta_1 x_{i1} + \beta_2 x_{i2} + \cdots + \beta_p x_{ip} + \varepsilon_i, \quad i = 1, 2, \ldots, n, \quad (43) $$
where $\varepsilon_i$ are i.i.d. $N(0, \sigma^2)$. The values of $\beta$ are chosen such that $\beta'\beta = 1$ [26]. Since we aimed to compare the performance of the DK estimator with the OLS, ORR, Liu, KL, TP, and NTP estimators, we chose $k = 0.3, 0.6, 0.9$ between 0 and 1, as did Wichern and Churchill [27] and Kan et al. [28], where ORR gives better results, and $d = 0.2, 0.5, 0.8$. The simulation is replicated 1000 times for the sample sizes $n = 50$ and 100 and $\sigma^2 = 1$, 25, and 100. For each replicate, we computed the mean square error (MSE) of the estimators by using the equation below:
$$ \operatorname{MSE}(\hat{\alpha}^*) = \frac{1}{1000}\sum_{j=1}^{1000}\left(\hat{\alpha}_j^* - \alpha\right)'\left(\hat{\alpha}_j^* - \alpha\right), \quad (44) $$
where $\hat{\alpha}_j^*$ is the value of the considered estimator in the $j$th replication and $\alpha$ is the true parameter value. The estimated MSEs of the estimators are shown in Table 1, Table 2, Table 3 and Table 4 (a runnable sketch of this design is given after the table captions below).
Table 1.
Estimated MSE for ordinary least squares estimator (OLS), ordinary ridge regression (ORR), Liu, Kibria–Lukman (KL), two-parameter (TP), new two-parameter (NTP), and Dawoud–Kibria (DK).
Table 2.
Estimated MSE for OLS, ORR, Liu, KL, TP, NTP, and DK.
Table 3.
Estimated MSE for OLS, ORR, Liu, KL, TP, NTP, and DK.
Table 4.
Estimated MSE for OLS, ORR, Liu, KL, TP, NTP, and DK.
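To make the design concrete, the following is a minimal sketch of one cell of the simulation (our illustration; the seed, $p = 4$, and the restriction to the OLS and DK estimators are our assumptions, not values fixed by the paper):

```python
import numpy as np

rng = np.random.default_rng(2020)  # arbitrary seed, our choice

def simulate_mse(n=100, p=4, rho=0.99, sigma=10.0, k=0.9, d=0.8, reps=1000):
    """One cell of the Section 3.1 design: estimated MSEs, Equation (44),
    for the OLS and DK estimators under Equations (42) and (43).
    (The paper additionally standardizes X so that X'X is in correlation
    form; that step is omitted here for brevity.)"""
    beta = np.ones(p) / np.sqrt(p)                 # beta'beta = 1, as in [26]
    sum_ols = sum_dk = 0.0
    for _ in range(reps):
        z = rng.standard_normal((n, p + 1))
        X = np.sqrt(1 - rho**2) * z[:, :p] + rho * z[:, [p]]   # Equation (42)
        y = X @ beta + sigma * rng.standard_normal(n)          # Equation (43)
        lam, Q = np.linalg.eigh(X.T @ X)
        alpha = Q.T @ beta                         # true canonical parameter
        a_ols = ((X @ Q).T @ y) / lam              # OLS in canonical form
        c = k * (1 + d)
        a_dk = (lam - c) / (lam + c) * a_ols       # DK estimator, Equation (17)
        sum_ols += (a_ols - alpha) @ (a_ols - alpha)
        sum_dk += (a_dk - alpha) @ (a_dk - alpha)
    return sum_ols / reps, sum_dk / reps           # Equation (44)

# Example usage: mse_ols, mse_dk = simulate_mse()
```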
3.2. Simulation Results Discussion
From Table 1, Table 2, Table 3 and Table 4, it appears that as $\rho$ and $\sigma$ increase, the estimated MSE values increase, while as $n$ increases, the estimated MSE values decrease. As expected, when the multicollinearity problem exists, the OLS estimator gives the highest MSE values and performs the worst among all estimators. Additionally, the results show that the proposed DK estimator performs better than the rest of the estimators, followed by the NTP and KL estimators, most of the time for all conditions. The NTP estimator gives better results in terms of MSE values when $k$ and $d$ are near zero. The proposed DK estimator always performs better than the KL estimator. The NTP estimator's performance lies between those of the KL and DK estimators most of the time, while the KL estimator's performance lies between those of the NTP estimator and the proposed DK estimator some of the time. Thus, the simulation results are consistent with the theoretical results.
To see the effect of various parameters on MSE, we plotted MSE vs. the parameters in Figure 1, Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6.
Figure 1.
MSE values versus $\rho$ values.
Figure 2.
MSE values versus $n$ values.
Figure 3.
MSE values versus $p$ values.
Figure 4.
MSE values versus $d$ values when k = 0.3.
Figure 5.
MSE values versus $k$ values when d = 0.5.
Figure 6.
MSE values versus $k$ values when d = 0.8.
It appears from Figure 1 that as $\rho$ increases, the MSE values of the estimators increase for σ = 10, n = 50, k = 0.9, and d = 0.8, and the proposed DK estimator has the smallest MSE value among all estimators.
Figure 2 shows that as $n$ increases, the MSE values of the estimators decrease for σ = 10, ρ = 0.99, k = 0.9, and d = 0.8, and the proposed DK estimator has the smallest MSE value among all estimators.
Figure 3 shows the behavior of the estimators as $p$ varies: as $p$ increases, the MSE values of the estimators increase for n = 100, ρ = 0.99, k = 0.9, and d = 0.8, as well as for other values of these factors.
Figure 4 shows the behavior of the estimators for different values of $d$ when $k = 0.3$. It is evident from Figure 4 that the proposed DK estimator gives the smallest MSE values when d is greater than 0.3, while the NTP estimator gives better results when d is less than 0.3, for n = 100, ρ = 0.99, and σ = 10, as well as for other values of these factors.
Figure 5 shows the behavior of the estimators for different values of $k$ when $d = 0.5$; the proposed DK estimator gives the smallest MSE values among all other estimators for n = 100, ρ = 0.99, and σ = 10, as well as for other values of these factors.
Figure 6 shows the behavior of the estimators for different values of $k$ when $d = 0.8$; the proposed DK estimator again gives the smallest MSE values among all other estimators for n = 100, ρ = 0.99, and σ = 10, as well as for other values of these factors.
4. Application
4.1. Portland Cement Data
We use the Portland cement data, which was originally adopted by Woods et al. [29] to explain their theoretical results. The data were analyzed by various researchers: to mention a few, Kaciranlar et al. [30], Li and Yang [31], Lukman et al. [13], and, recently, Kibria and Lukman [15], among others.
The regression model for these data is defined as
$$ y_i = \beta_1 x_{i1} + \beta_2 x_{i2} + \beta_3 x_{i3} + \beta_4 x_{i4} + \varepsilon_i. \quad (45) $$
For more details about these data, see Woods et al. [29].
The variance inflation factors are $\mathrm{VIF}_1 = 38.50$, $\mathrm{VIF}_2 = 254.42$, $\mathrm{VIF}_3 = 46.87$, and $\mathrm{VIF}_4 = 282.51$. The eigenvalues of $X'X$ are 44,676.206, 5965.422, 809.952, and 105.419, and the condition number of $X'X$, $\sqrt{\lambda_{\max}/\lambda_{\min}}$, is approximately 20.58. The VIFs, the eigenvalues, and the condition number all indicate that severe multicollinearity exists. The estimated parameters and the MSE values of the estimators are presented in Table 5 (a code sketch of these diagnostics follows the table caption below). It appears from Table 5 that the proposed DK estimator performs the best among the mentioned estimators, as it gives the smallest MSE value.
Table 5.
The results of regression coefficients and the corresponding MSE values.
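The multicollinearity diagnostics reported above can be reproduced along the following lines (a sketch; the data-loading step is omitted, and the VIF formula assumes the usual computation from the regressor correlation matrix):

```python
import numpy as np

def collinearity_diagnostics(X):
    """Sketch of the diagnostics used in Section 4: VIFs, eigenvalues of X'X,
    and the condition number sqrt(lambda_max / lambda_min)."""
    R = np.corrcoef(X, rowvar=False)       # correlation matrix of the regressors
    vif = np.diag(np.linalg.inv(R))        # VIF_j is the j-th diagonal of R^{-1}
    eig = np.linalg.eigvalsh(X.T @ X)      # eigenvalues of X'X
    cond = np.sqrt(eig.max() / eig.min())  # condition number
    return vif, eig, cond
```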
4.2. Longley Data
Longley data were originally used by Longley [32] and then by other authors (Yasin and Murat [33]; Lukman and Ayinde [22]). The regression model for these data is defined as
$$ y_i = \beta_1 x_{i1} + \beta_2 x_{i2} + \beta_3 x_{i3} + \beta_4 x_{i4} + \beta_5 x_{i5} + \beta_6 x_{i6} + \varepsilon_i. \quad (46) $$
For more details about these data, see Longley [32].
The variance inflation factors of the explanatory variables are very large. The eigenvalues of $X'X$ are 2.76779 × 10¹², 7,039,139,179, 11,608,993.96, 2,504,761.021, 1738.356, and 13.309, and the condition number of $X'X$ is approximately 456,070. The VIFs, the eigenvalues, and the condition number all indicate that severe multicollinearity exists. The estimated parameters and the MSE values of the estimators are presented in Table 6. It appears from Table 6 that the proposed DK estimator performs the best among the mentioned estimators, as it gives the smallest MSE value.
Table 6.
The results of regression coefficients and the corresponding MSE values.
5. Summary and Concluding Remarks
In this paper, we introduced a new class of two-parameter estimator, namely, the Dawoud–Kibria (DK) estimator, to solve the multicollinearity problem for linear regression models. We theoretically compared the proposed DK estimator with several existing estimators: the ordinary least squares (OLS) estimator, the ordinary ridge regression (ORR) estimator, the Liu (1993) estimator, the new ridge-type estimator of Kibria and Lukman (KL; 2020), the two-parameter (TP) estimator of Ozkale and Kaciranlar (2007), and the new two-parameter (NTP) estimator of Yang and Chang (2010), and we derived the biasing parameters $k$ and $d$ of the proposed DK estimator. A simulation study was conducted to compare the performance of the OLS, ORR, Liu, KL, TP, NTP, and proposed DK estimators. It is evident from the simulation results that the proposed DK estimator gives better results than the rest of the estimators under some conditions. Real-life datasets were analyzed to illustrate the findings of the paper. Hopefully, the paper will be useful for practitioners in various fields.
Author Contributions
I.D.: conceptualization, methodology, original draft preparation. B.M.G.K.: conceptualization, results discussion, review and editing. Both authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Acknowledgments
Authors are grateful to three anonymous referees and the editor for their valuable comments and suggestions, which certainly improved the presentation and quality of the paper.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Hoerl, A.E.; Kennard, R.W. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics 1970, 12, 55–67. [Google Scholar] [CrossRef]
- Stein, C. Inadmissibility of the usual estimator for the mean of a multivariate normal distribution. In Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability; University of California Press: Berkeley, CA, USA, 1956; Volume 1, pp. 197–206. [Google Scholar]
- Massy, W.F. Principal components regression in exploratory statistical research. J. Am. Stat. Assoc. 1965, 60, 234–256. [Google Scholar] [CrossRef]
- Mayer, L.S.; Willke, T.A. On biased estimation in linear models. Technometrics 1973, 15, 497–508. [Google Scholar] [CrossRef]
- Swindel, B.F. Good ridge estimators based on prior information. Commun. Stat. Theory Methods 1976, 5, 1065–1075. [Google Scholar] [CrossRef]
- Liu, K. A new class of biased estimate in linear regression. Commun. Stat. Theory Methods 1993, 22, 393–402. [Google Scholar]
- Akdeniz, F.; Kaçiranlar, S. On the almost unbiased generalized liu estimator and unbiased estimation of the bias and mse. Commun. Stat. Theory Methods 1995, 24, 1789–1797. [Google Scholar] [CrossRef]
- Ozkale, M.R.; Kaçiranlar, S. The restricted and unrestricted two-parameter estimators. Commun. Stat. Theory Methods 2007, 36, 2707–2725. [Google Scholar] [CrossRef]
- Sakallıoglu, S.; Kaçıranlar, S. A new biased estimator based on ridge estimation. Stat. Pap. 2008, 49, 669–689. [Google Scholar] [CrossRef]
- Yang, H.; Chang, X. A new two-parameter estimator in linear regression. Commun. Stat. Theory Methods 2010, 39, 923–934. [Google Scholar] [CrossRef]
- Roozbeh, M. Optimal QR-based estimation in partially linear regression models with correlated errors using GCV criterion. Comput. Stat. Data Anal. 2018, 117, 45–61. [Google Scholar] [CrossRef]
- Akdeniz, F.; Roozbeh, M. Generalized difference-based weighted mixed almost unbiased ridge estimator in partially linear models. Stat. Pap. 2019, 60, 1717–1739. [Google Scholar] [CrossRef]
- Lukman, A.F.; Ayinde, K.; Binuomote, S.; Clement, O.A. Modified ridge-type estimator to combat multicollinearity: Application to chemical data. J. Chemom. 2019, 33, e3125. [Google Scholar] [CrossRef]
- Lukman, A.F.; Ayinde, K.; Sek, S.K.; Adewuyi, E. A modified new two-parameter estimator in a linear regression model. Model. Simul. Eng. 2019, 2019, 6342702. [Google Scholar] [CrossRef]
- Kibria, B.M.G.; Lukman, A.F. A New Ridge-Type Estimator for the Linear Regression Model: Simulations and Applications. Scientifica 2020, 2020, 9758378. [Google Scholar] [CrossRef]
- Wang, S.G.; Wu, M.X.; Jia, Z.Z. Matrix Inequalities; Science Press: Beijing, China, 2006. [Google Scholar]
- Farebrother, R.W. Further results on the mean square error of ridge regression. J. R. Stat. Soc. B 1976, 38, 248–250. [Google Scholar] [CrossRef]
- Trenkler, G.; Toutenburg, H. Mean squared error matrix comparisons between biased estimators-an overview of recent results. Stat. Pap. 1990, 31, 165–179. [Google Scholar] [CrossRef]
- Hoerl, A.E.; Kannard, R.W.; Baldwin, K.F. Ridge regression: Some simulations. Commun. Stat. 1975, 4, 105–123. [Google Scholar] [CrossRef]
- Kibria, B.M.G. Performance of some new ridge regression estimators. Commun. Stat. Simul. Comput. 2003, 32, 419–435. [Google Scholar] [CrossRef]
- Kibria, B.M.G.; Banik, S. Some ridge regression estimators and their performances. J. Mod. Appl. Stat. Methods 2016, 15, 206–238. [Google Scholar] [CrossRef]
- Lukman, A.F.; Ayinde, K. Review and classifications of the ridge parameter estimation techniques. Hacet. J. Math. Stat. 2017, 46, 953–967. [Google Scholar] [CrossRef]
- Månsson, K.; Kibria, B.M.G.; Shukur, G. Performance of some weighted Liu estimators for logit regression model: An application to Swedish accident data. Commun. Stat. Theory Methods 2015, 44, 363–375. [Google Scholar] [CrossRef]
- Khalaf, G.; Shukur, G. Choosing ridge parameter for regression problems. Commun. Stat. Theory Methods 2005, 21, 2227–2246. [Google Scholar] [CrossRef]
- Gibbons, D.G. A simulation study of some ridge estimators. J. Am. Stat. Assoc. 1981, 76, 131–139. [Google Scholar] [CrossRef]
- Newhouse, J.P.; Oman, S.D. An evaluation of ridge estimators. In A Report Prepared for United States Air Force Project; RAND: Santa Monica, CA, USA, 1971. [Google Scholar]
- Wichern, D.W.; Churchill, G.A. A comparison of ridge estimators. Technometrics 1978, 20, 301–311. [Google Scholar] [CrossRef]
- Kan, B.; Alpu, O.; Yazıcı, B. Robust ridge and robust Liu estimator for regression based on the LTS estimator. J. Appl. Stat. 2013, 40, 644–655. [Google Scholar] [CrossRef]
- Woods, H.; Steinour, H.H.; Starke, H.R. Effect of composition of Portland cement on heat evolved during hardening. J. Ind. Eng. Chem. 1932, 24, 1207–1214. [Google Scholar] [CrossRef]
- Kaciranlar, S.; Sakallioglu, S.; Akdeniz, F.; Styan, G.P.H.; Werner, H.J. A new biased estimator in linear regression and a detailed analysis of the widely-analysed dataset on portland cement. Sankhya Indian J. Stat. B 1999, 61, 443–459. [Google Scholar]
- Li, Y.; Yang, H. A new Liu-type estimator in linear regression model. Stat. Pap. 2012, 53, 427–437. [Google Scholar] [CrossRef]
- Longley, J.W. An appraisal of least squares programs for electronic computer from the point of view of the user. J. Am. Stat. Assoc. 1967, 62, 819–841. [Google Scholar] [CrossRef]
- Yasin, A.; Murat, E. Influence Diagnostics in Two-Parameter Ridge Regression. J. Data Sci. 2016, 14, 33–52. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).