On the Estimation of the Binary Response Model

Muhammad Amin; Muhammad Nauman Akram; B. M. Golam Kibria; Huda M. Alshanbari; Nahid Fatima; Ahmed Elhassanein

doi:10.3390/axioms12020175

Abstract

The binary logistic regression model (LRM) is practical in situations when the response variable (RV) is dichotomous. The maximum likelihood estimator (MLE) is generally considered to estimate the LRM parameters. However, in the presence of multicollinearity (MC), the MLE is not the correct choice due to its inflated standard deviation (SD) and standard errors (SE) of the estimates. To combat MC, commonly used biased estimators, i.e., the Ridge estimators (RE) and Liu estimators (LEs), are preferred. However, most of the time, the traditional LE attains a negative value for its Liu parameter (LP), which is considered to be a major drawback. Therefore, to overcome this issue, we proposed a new adjusted LE for the binary LRM. Owing to numerical evaluation purposes, Monte Carlo simulation (MCS) study is performed under different conditions where bias and mean squared error are the performance criteria. Findings showed the superiority of our proposed estimator in comparison with the other estimation methods due to the existence of high but imperfect multicollinearity, which clearly means that it is consistent when the regressors are multicollinear. Furthermore, the findings demonstrated that whenever there is MC, the MLE is not the best choice. Finally, a real application is being considered to be evidence for the advantage of the intended estimator. The MCS and the application findings pointed out that the considered adjusted LE for the binary logistic regression model is a more efficient estimation method whenever the regressors are highly multicollinear.

Keywords:

bias; binary logistic regression model; Liu estimator; multicollinearity; maximum likelihood estimator; ridge regression

MSC:

62J02; 62J05; 62J07; 62J20

1. Introduction

The binary LRM (BLRM) is a common RM, and it is preferred in situations where the RV is dichotomous or binary. Forecasting a dichotomous RV is vital in epidemiology and medicine [1,2]. For example, a set of risk factors such as cholesterol and blood pressure effect on the possibility of undergoing a heart attack. The method of MLE is commonly employed to estimate the binary LRM [2,3]. In the case of MC, the variance becomes inflated, and the standard errors become high, and consequently, the t-statistic and F-ratios become insignificant [3,4,5]. To overcome the issue of MC, different biased techniques are available in the writings. One of them constructing on ridge regression (RR), see [6], for the linear RM. The logistic RR was first introduced in [7] via the results of [6]. For the same purpose three biased estimators are considered [3]. Lee and Silvapulle [8] introduced the two ridge parameters for the logistic RR estimator (LRRE). As in the LRRE, the ridge parameter plays a very important role. After these studies, many researchers focused on the collection of the ridge parameter in the LRRE [9,10,11,12,13,14]. Liu [15] discussed some limitations of the RRE and proposed another alternative estimator called the LE for handling the issue of multicollinearity in a better way. The LE for the BLRM was adapted and called the logistic LE (LLE), [16]. The theoretical and numerical properties of the LLE and its comparison with the MLE and LRRE have been discussed.

In the LLE, the LP (d) plays a main character in the estimation of the LLE and should lie between 0 and 1, i.e.,

0 < d < 1 .

Sometimes, the Liu parameter produces negative and zero values that affect the efficiency of the LLE [17]. Several authors introduced various types of Liu estimators in the BLRM to improve the efficiency of the LE [18,19,20,21,22]. These modified Liu estimators also often produced zero values for the Liu parameter. To overcome the limitations of the available LEs, Lukman et al. [23] introduced the modified one-parameter LE for the linear RM.

In this article, we propose a new adjusted LLE (ALLE) for the BLRM. Its theoretical properties are investigated. Furthermore, the properties of the new estimator are assessed via a theoretical comparison and compared with other estimation methods. The matrix mean squared error (MMSE) and the scalar mean squared error (MSE) are used as the performance evaluation criteria. To investigate the new estimator a MCS study has been performed. The forthcoming sections are organized as follows: Section 2 presents the construction of the proposed estimator for the BLRM with a theoretical comparison. A MCS study is presented in Section 3. Section 4 is devoted to analyzing prostate cancer data. Section 5 ends with some conclusions.

2. Statistical Method

Following [7,24,25], the BLRM is defined as

y_{i} = δ_{i} + ε_{i}, i = 1, \dots, n,

where

ε_{i}

has zero mean and variance

ω_{i} = δ_{i} (1 - δ_{i})

with expectation

δ_{i}

of

y_{i}

and are independent for each

i

. Considering, the ith value of RV to be distributed as Bernoulli

B e (δ_{i})

, with

δ_{i} = \frac{e x p (x_{i}^{t} ξ)}{1 + e x p (x_{i}^{t} ξ)}, i = 1, \dots, n,

(1)

where

δ = {(δ_{i})}_{i = 1, 2, \dots, n}

, and

X = {(x_{i})}_{i = 1, 2, \dots, n}^{t}

is the

n \times p

data matrix with

x_{i} = {(1, x_{1 i}, \dots, x_{p i})}^{t}

; and

ξ = {(ξ_{j})}_{j = 0, 1, \dots p}^{t}

is considered to be the

(p + 1)

vector of regression coefficients. The regression coefficients vector

ξ

is usually estimated via the MLE method. The logarithm of the likelihood function for Equation (1) is computed as

H = \sum_{i = 1}^{n} y_{i} l o g (δ_{i}) + (1 - y_{i}) l o g (1 - δ_{i}) .

(2)

Let

{\hat{δ}}^{T} = {({\hat{δ}}_{i}^{T})}_{i = 1, 2, \dots, n}

be the estimates of

δ

at the T^th step, using

{\hat{ξ}}^{T}

and

{\hat{V}}^{T} = d i a g ({\hat{δ}}_{i}^{T} (1 - {\hat{δ}}_{i}^{T})) .

The following iterative algorithm is well define [26],

{\hat{ξ}}^{T + 1} = {\hat{ξ}}^{T} + {(X^{t} \hat{V} X)}^{- 1} X^{T} {\hat{V}}^{T} (y - {\hat{δ}}^{T}),

(3)

The Equation (3) can be written as

{\hat{ξ}}_{M L E} = {(Υ)}^{- 1} X^{t} \hat{V} \hat{z},

(4)

with

Υ = X^{t} \hat{V} X

and

\hat{z} = {(z_{i})}_{i = 1, 2, \dots, n}

where

η_{i} = x_{i}^{t} ξ

, while

z_{i} = η_{i} + (y_{i} - δ_{i}) (\partial η_{i} / \partial δ_{i})

.

{\hat{ξ}}_{M L E}

has asymptotic normal distribution as

{\hat{ξ}}_{M L E} ~ N (β, {(Υ)}^{- 1})

[8].

Let

ℚ

be the orthogonal matrix that has the eigenvectors of

Υ

as columns. Then the covariance and MMSE of the

{\hat{ξ}}_{M L E}

are

C o v ({\hat{ξ}}_{M L E}) = Υ^{- 1} and M M S E ({\hat{ξ}}_{M L E}) = ℚ Γ^{- 1} ℚ^{t} .

(5)

The scalar MSE of the

{\hat{ξ}}_{M L E}

is

M S E ({\hat{ξ}}_{M L E}) = E {({\hat{ξ}}_{M L E} - ξ)}^{t} ({\hat{ξ}}_{M L E} - ξ) = t r (ℚ Γ^{- 1} ℚ^{t}) = \sum_{j = 1}^{r} \frac{1}{μ_{j}},

(6)

where

Γ = d i a g (μ_{1}, μ_{2}, \dots, μ_{r})

.

In the case of high correlated regressors, the matrix

Υ

becomes ill-conditioned, which causes the problem of multicollinearity. The LRRE is defined by Schaefer et al. [7] as a simple extension of Hoerl and Kennard [6] to treat multicollinearity effects as

{\hat{ξ}}_{L R R E} = R_{k} {\hat{ξ}}_{M L E},

(7)

where

R_{k} = {(Υ + k I_{r})}^{- 1} (Υ)

with the ridge parameter k

(k > 0)

and identity matrix

I_{r}

. The bias vector (BV), covariance matrix (CM) and MMSE of (7) are

B i a s ({\hat{ξ}}_{L R R E}) = - k ℚ Γ_{k}^{- 1} ξ,

(8)

C o v ({\hat{ξ}}_{L R R E}) = ℚ Γ_{k}^{- 1} {Γ Γ}_{k}^{- 1} ℚ^{t}

(9)

and

M M S E ({\hat{ξ}}_{L R R E}) = R_{k} {(Υ)}^{- 1} R_{k}^{t} + B i a s ({\hat{ξ}}_{L R R E}) B i a s {({\hat{ξ}}_{L R R E})}^{t} = ℚ Γ_{k}^{- 1} {Γ Γ}_{k}^{- 1} ℚ^{t} + k^{2} ℚ Γ_{k}^{- 1} ξ ξ^{t} Γ_{k}^{- 1} ℚ^{t},

(10)

where

Γ_{k} = d i a g (μ_{1} + k, μ_{2} + k, \dots, μ_{r} + k)

. From (10) by applying the

t r (.)

operator, we obtain the scalar MSE of the LRRE as

M S E ({\hat{ξ}}_{L R R E}) = t r \{M M S E ({\hat{ξ}}_{L R R E})\} = \sum_{j = 1}^{r} \frac{μ_{j}}{{(μ_{j} + k)}^{2}} + k^{2} \sum_{j = 1}^{r} \frac{τ_{j}^{2}}{{(μ_{j} + k)}^{2}},

(11)

where

τ = ℚ^{t} ξ

and k (k > 0) is the Hoerl and Kennard [6] ridge parameter. Optimizing the Equation (11) with respect to k yields

k = \frac{1}{\sum_{j = 1}^{r} {\hat{τ}}_{j}^{2}} .

(12)

Another estimator that treats the multicollinearity better than the LRRE was given by Liu [25] and Mansson et al. [11] for the BRLM that is named as the LLE and is defined by

{\hat{ξ}}_{L L E} = F_{d} {\hat{ξ}}_{M L E},

(13)

where

F_{d} = {(Υ + I_{r})}^{- 1} (Υ + d I_{r})

, and

d (0 \leq d < 1)

is the LP. The BV and CM of (13) are given respectively by

B i a s ({\hat{ξ}}_{L L E}) = ℚ (d - 1) Γ_{I}^{- 1} ξ .

(14)

C o v ({\hat{ξ}}_{L L E}) = ℚ Γ_{I}^{- 1} Γ_{d} Γ^{- 1} Γ_{d} Γ_{I}^{- 1} ℚ^{t} .

(15)

By using (14) and (15), the MMSE is defined as

\begin{matrix} M M S E ({\hat{ξ}}_{L L E}) = F_{d} Υ^{- 1} F_{d}^{t} + B i a s ({\hat{ξ}}_{L L E}) B i a s {({\hat{ξ}}_{L L E})}^{t} \\ = ℚ Γ_{I}^{- 1} Γ_{d} Γ^{- 1} Γ_{d} Γ_{I}^{- 1} ℚ^{t} + {(d - 1)}^{2} ℚ Γ_{I}^{- 1} ξ ξ^{t} Γ_{I}^{- 1} ℚ^{t}, \end{matrix}

(16)

where

Γ_{I} = d i a g (μ_{1} + 1, μ_{2} + 1, \dots, μ_{r} + 1)

, and

Γ_{d} = d i a g (μ_{1} + d, μ_{2} + d, \dots, μ_{r} + d)

. The scalar MSE of the LLE is given by

M S E ({\hat{ξ}}_{L L E}) = t r \{M M S E ({\hat{ξ}}_{L L E})\} = \sum_{j = 1}^{r} \frac{{(μ_{j} + d)}^{2}}{μ_{j} {(μ_{j} + 1)}^{2}} + {(d - 1)}^{2} \sum_{j = 1}^{r} \frac{τ_{j}^{2}}{{(μ_{j} + 1)}^{2}},

(17)

where d is the LP. Optimizing the Equation (17) with respect to d yields

d = \sum_{j = 1}^{r} (τ_{j}^{2} - 1) / (τ_{j}^{2} + 1 / μ_{j})

. For

d = 1

, we obtain

{\hat{ξ}}_{L L E} = {\hat{ξ}}_{M L E}

.

2.1. The Proposed Estimator

In the LLE, most of the time, the value of d is negative, which affects the BLRM estimation under multicollinearity. This estimator has been proposed for the Poisson RM [27]. Therefore, following [27], we define the adjusted logistic Liu estimator (ALLE), which is given as

{\hat{ξ}}_{A L L E} = A_{d} {\hat{ξ}}_{M L E},

(18)

where

A_{d} = {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}),

and

0 < d_{0} < 1

is the Liu parameter for the ALLE. The new estimator gives a genuine refinement in the efficacy of the BLRM coefficients. The BV, CM, MSE matrix and scalar MSE of the ALLE are given in Equations (19)–(22), respectively.

B i a s ({\hat{ξ}}_{A L L E}) = - (d_{0} - 1) {(Υ + I_{r})}^{- 1} ξ .

(19)

C o v ({\hat{ξ}}_{A L L E}) = {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}) {(Υ)}^{- 1} (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1} .

(20)

\begin{matrix} M M S E ({\hat{ξ}}_{A L L E}) = C o v ({\hat{ξ}}_{A L L E}) + B i a s ({\hat{ξ}}_{A L L E}) B i a s {({\hat{ξ}}_{A L L E})}^{t} \\ = {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}) {(Υ)}^{- 1} (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1} + {(d_{0} - 1)}^{2} {(Υ + I_{r})}^{- 1} β β^{t} {(Υ + I_{r})}^{- 1} . \end{matrix}

(21)

M S E ({\hat{ξ}}_{A L L E}) = t r \{M M S E ({\hat{ξ}}_{A L L E})\} = \sum_{j = 1}^{r} \frac{{(μ_{j} - d_{0})}^{2}}{μ_{j} {(μ_{j} + 1)}^{2}} + {(d_{0} - 1)}^{2} \sum_{j = 1}^{r} \frac{τ_{j}^{2}}{{(μ_{j} + 1)}^{2}} .

(22)

2.2. Theoretical Comparison

Proposition 2.1.

Let M > 0, i.e., M is a positive definite (p,d) matrix. Then

M - τ τ^{t} \geq 0

if and only if

τ^{t} M^{- 1} τ \leq 1

for some vector

τ

[28].

Proposition 2.2.

Let

{\hat{θ}}_{1} = B_{1} y

and

{\hat{θ}}_{2} = B_{2} y

be estimators of

θ

. Suppose that

D = C o v ({\hat{θ}}_{1}) - C o v ({\hat{θ}}_{2}) > 0

, where

C o v ({\hat{θ}}_{1})

and

C o v ({\hat{θ}}_{2})

represents the CM of

{\hat{θ}}_{1}

and

{\hat{θ}}_{2}

, respectively. Then,

M M S E ({\hat{θ}}_{1}) - M M S E ({\hat{θ}}_{2}) > 0

if and only if

c_{2}^{t} {(D + c_{2} c_{2}^{t})}^{- 1} c_{2} < 1,

where

c_{2}

is the bias whereas

M M S E ({\hat{θ}}_{j}) = C o v ({\hat{θ}}_{j}) + c_{j} c_{j}^{t},

where

c_{j}

is the BV of

{\hat{θ}}_{j}

[3].

Theorem 2.1.

For the BLRM, if

0 < d_{0} < 1

, then

{\hat{ξ}}_{A L L E}

is better than

{\hat{ξ}}_{M L E}

; that is,

Δ_{1} = M M S E ({\hat{ξ}}_{M L E}) - M M S E ({\hat{ξ}}_{A L L E}) > 0

if and only if

b_{A L L E}^{t} {[(Υ^{- 1} - {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}) Υ^{- 1} (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1})]}^{- 1} b_{A L L E} < 1

, where

b_{A L L E} = - (d_{0} - 1) {(Υ + I_{r})}^{- 1} ξ

.

Proof:

See Appendix A. □

Theorem 2.2.

Under the BLRM, if

k > 0

and

0 < d_{0} < 1

, then

{\hat{ξ}}_{A L L E}

is better than

{\hat{ξ}}_{L R R E}

; that is,

Δ_{2} = M M S E ({\hat{ξ}}_{L R R E}) - M M S E ({\hat{ξ}}_{A L L E}) > 0

if and only if

b_{A L L E}^{t} [(\begin{matrix} {(Υ + k I_{r})}^{- 1} Υ {(Υ + k I_{r})}^{- 1} - {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}) \\ Υ^{- 1} (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1} \end{matrix})] b_{A L L E} < 1,

where

b_{A L L E} = - (d_{0} - 1) {(Υ + I_{r})}^{- 1} ξ

.

Proof:

See Appendix A. □

Theorem 2.3.

Under the BLRM, if

0 \leq d < 1

and

0 < d_{0} < 1

, then

{\hat{ξ}}_{A L L E}

is better than

{\hat{ξ}}_{L L E}

; that is,

Δ_{3} = M M S E ({\hat{ξ}}_{L L E}) - M M S E ({\hat{ξ}}_{A L L E}) > 0

if and only if

b_{A L L E}^{t} [(\begin{matrix} {(Υ + I_{r})}^{- 1} (Υ + d I_{r}) {(Υ)}^{- 1} (Υ + d I_{r}) {(Υ + I_{r})}^{- 1} \\ - {(Υ + I_{r})}^{- 1} \\ (Υ - d_{0} I_{r}) Υ^{- 1} (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1} \end{matrix})] b_{A L L E} < 1

, where

b_{A L L E} = - (d_{0} - 1) {(Υ + I_{r})}^{- 1} ξ

.

Proof:

See Appendix A. □

2.3. Parameter Estimation

In the literature different methods are employed to optimize the biasing parameters; see [17,20,29,30]. Computing the critical points of the Equation (22) with respect to

d_{0}

, we obtain the jth term as

d_{0}^{j} = \frac{μ_{j} (1 + τ_{j}^{2})}{1 + τ_{j}^{2} μ_{j}} .

(23)

In (23), replacing

{\hat{μ}}_{j}

and

{\hat{τ}}_{j}

by their unbiased estimators, we obtain

{\hat{d}}_{0}^{j} = \frac{{\hat{μ}}_{j} (1 + {\hat{τ}}_{j}^{2})}{1 + {\hat{τ}}_{j}^{2} {\hat{μ}}_{j}} .

(24)

For practical considerations, we consider the minimum value of (24) as

{\hat{d}}_{0}^{m i n} = m i n (\frac{{\hat{μ}}_{j} (1 + {\hat{τ}}_{j}^{2})}{1 + {\hat{τ}}_{j}^{2} {\hat{μ}}_{j}}) .

(25)

3. Simulation

3.1. Design

Here the simulation study is designed, and general layout is given. The comparison criteria is the simulated MSE of the estimators, i.e., MLE, LRRE, LLE and ALLE, when the regressors are multicollinear. Considering

z_{i j}

to be the independent standard normal pseudo-random numbers and following [2,14,23], the EVs with various levels of correlation can be generated via the following expression:

x_{i j} = {(1 - ρ^{2})}^{1 / 2} z_{i j} + ρ z_{i (j + 1)}, i = 1, \dots, n; j = 1, \dots, r,

(26)

where

ρ

is select so as

ρ^{2}

be the correlation between any two explanatories. For simulation, we select four different values of

ρ

: 0.80; 0.90; 0.95; 0.99. Four different sample size are considered

n : 50; 100; 200

; 400. Moreover, we also consider different number of EVs

p : 3; 6; 12

, and

β

is selected to be the eigenvector of the matrix

F

corresponding to the largest eigenvalue [14].

The Bernoulli distribution

B e (δ_{i})

is used to generate observations, where

δ_{i} = \frac{e x p (x_{i}^{t} β)}{1 + e x p (x_{i}^{t} β)}, i = 1, \dots, n

such that the data matrix

X = {(x_{i}^{t})}_{i = 1, 2, \dots, n}

. The experiment is repeated 1000 times by generating

z_{i j}

. The bias and MSE values of the estimators are computed via the equation:

B i a s (\hat{β}) = \sum_{i = 1}^{R} \frac{({\hat{ξ}}_{i} - ξ)}{R}; M S E (\hat{β}) = \frac{\sum_{i = 1}^{R} {({\hat{ξ}}_{i} - ξ)}^{t} ({\hat{ξ}}_{i} - ξ)}{R} .

(27)

where

({\hat{ξ}}_{i} - ξ)

is the deviation between the parameter and its estimate for each

i

of the BLRM estimators and R is the number replications. The R Language is used for computations.

3.2. Discussion

The estimated biases and MSEs of the BLRM estimators for changed controlled conditions are summarized in Table 1, Table 2 and Table 3 for p = 3, 6 and 12, respectively. The simulated results in Table 1, Table 2 and Table 3 clearly demonstrated that the constructed ALLE surpassed the other estimators in terms of minimum bias and MSE. In all cases, the performance of the MLE is the worst since the estimated MSE is larger as compared to the other estimators. Based on the findings of the simulation, it can be seen that there is a direct relationship between the scalar MSEs and various levels of multicollinearity. As we change the multicollinearity level from mild to severe, the estimated MSEs increase for a given sample size and number of explanatory variables (EV). However, it is clearly observed that the proposed ALLE for the BLRM is a unique estimation method due to its minimum bias as well as minimum estimated MSEs. We also provide a graphical representation for the readers to see a clear image of the constructed and other considered estimators. Figure 1a–d demonstrates the effect of collinearity on the performance of the under-studied estimators with different sizes. Figure 1 clearly shows that the proposed ALLE performed the best for all the levels of multicollinearity. In addition, when the results are compared with respect to the sample size, then it can be noticed that the estimated MSE values of all the estimators decrease as n increases. For more details, see Figure 2a–d. When the results are interpreted with regard to the number of EVs, then it is observed from Table 1, Table 2 and Table 3 that the estimated MSEs increase by increasing the value of p. Based on the results, ALLE is the most suitable and consistent option whenever the RV is binary, and the EVs are correlated due to its minimum bias and estimated MSEs. Undoubtedly, other biased estimators also attain a lower MSE in contrast to the MLE, but the variation in the proposed ALLE is quite lower than the other estimators for all evaluated states.

Table 1. Bias and MSE values of the estimators for p = 3.

Table 2. Bias and MSE values of the estimators for p = 6.

Table 3. Bias and MSE values of the estimators for p = 12.

Figure 1. The BLRM estimators in relation to multicollinearity: (a).

n = 50

, (b).

n = 100

, (c).

n = 200

, (d).

n = 400

.

Figure 2. The BLRM estimators for various

n

: (a).

ρ = 0.80

, (b).

ρ = 0.90

, (c).

ρ = 0.95

, (d).

ρ = 0.99

.

4. Application

Here, a real application is considered to discuss the performance of the constructed estimator. For this purpose, prostate cancer data have been considered, which were taken from Kutner et al. [31]. However, the RV

y

is the seminal vesicle invasion, i.e., presence or absence of seminal vesicle invasion: the binary RV is 1 if yes; 0 otherwise. The descriptions of seven regressors are given in Table 4.

Table 4. Explanation of the respected regressors.

The eigenvalues of

F

matrix are found to be:

μ_{1} = 40,669.86

,

μ_{2} = 1960.62

,

μ_{3} = 1868.34

,

μ_{4} = 176.214

,

μ_{5} = 45.520

,

μ_{6} = 18.943

and

μ_{7} = 27.4104

. The multicollinearity of the regressors is assessed via a condition index (CI). The CI is evaluated by the eigenvalues of the

X^{t} \hat{V} X

matrix without an intercept term, while the eigenvalues of the

F

matrix include the intercept term. We observed that the

C I = \sqrt{μ_{m a x} / μ_{m i n}} = 121.97 > 30

. This confirms that there is a multicollinearity issue among the regressors. Moreover, we also compute the correlations among the regressors, and the results are displayed in Figure 3. It declares the moderate correlation among CV and PSA. level, CP and PSA. level and CV and CP.

Figure 3. Correlation matrix among the seven regressors of a prostate cancer data.

The estimated coefficients (EC) and the MSE values are presented in Table 5. The EC of the MLE, LRRE, LLE and ALLE are obtained from (4), (7), (14) and (19), respectively. However, the scalar MSE values of the MLE, LRRE, LLE and ALLE are calculated using equations (6), (12), (18) and (23), respectively. The LRRE involves a shrinkage ridge parameter

(k)

, which is found to be 0.077. Whereas the Liu parameter

(d)

for the LLE was found to be 0.5689. However, the shrinkage parameter

d_{0}

of the proposed ALLE was found to be 0.0693. Table 5 reveals that ALLE decreases the EC and MSE value in a better way in comparison with the competitive estimators. It is also noticed that the MLE attains the largest MSE, which clearly indicates that it is the most sensitive estimator when there exists a multicollinearity among the regressors. In addition, the proposed ALLE is the most consistent option in the case of multicollinearity. Further, it can be noticed that the performance of LRRE is comparatively better than that of LLE and MLE. Further, it is also observed that the ALLE’s performance is better as compared to the other biased estimators as well as the MLE. Therefore, it is recommended to use the ALLE estimator when one considers the BLRM with correlated regressors because of its consistent and robust behavior against multicollinearity.

Table 5. Regression estimates and MSEs of the logistics regression estimators for the prostate cancer data.

5. Concluding Remarks

We constructed a new adjusted logistic LE (ALLE) for the binary LRM (BLRM) to handle the multicollinearity issue. The MLE method is not a good choice due to its inflated variance and standard error whenever the regressors are multicollinear. The efficacy of the constructed estimator is judged via a MCS under various controlled conditions. The investigation has been performed for different values of correlation, sample sizes and the number of EVs. Findings showed that the ALLE’s performance is better compared to competitive ones and the MLE. Further, the MSE values of the MLE, LRRE, LLE and ALLE increase as the multicollinearity level increases. This phenomenon is particularly sharp for small sample sizes and whenever the correlation is high. However, this increase is quite smaller for the proposed ALLE. The superiority of the proposed estimator over others is also proved via a real application. The findings from both simulation and application clearly support our constructed estimator, which is a good and robust estimation method whenever there exists an imperfect but high multicollinearity among the regressors. As a result, we strongly advise practitioners to use the new estimator when estimating the unknown BLRM regression coefficients in the presence of multicollinearity.

Author Contributions

Authors contributed equally. All authors have read and agreed to the published version of the manuscript.

Funding

The author Nahid Fatima would like to acknowledge the support of Prince Sultan University for its support and help for paying the Article Processing Charges (APC) of this publication. Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R 299), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Data Availability Statement

The data is included with in the article.

Acknowledgments

The author Nahid Fatima would like to acknowledge the support of Prince Sultan University for its support and help for paying the Article Processing Charges (APC) of this publication. The authors are thankful to Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R 299), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Conflicts of Interest

There is no conflict of interest.

Appendix A

Proof of Theorem 2.1

Δ_{1} = Υ^{- 1} - {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}) Υ^{- 1} (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1} - b_{A L L E} b_{A L L E}^{t} .

(A1)

= Υ^{- 1} [I_{r} - {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}) (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1}] - b_{A L L E} b_{A L L E}^{t} .

(A2)

However, the scalar MSE of (24) can be written as

\begin{matrix} M S E ({\hat{ξ}}_{M L E}) & - M S E ({\hat{ξ}}_{A L L E}) = ℚ d i a g {\{\frac{1}{μ_{j}} - \frac{{(μ_{j} - d_{0})}^{2}}{μ_{j} {(μ_{j} + 1)}^{2}}\}}_{j = 1}^{r} ℚ^{t} - b_{A L L E}^{t} b_{A L L E} \\ = ℚ d i a g {\{\frac{2 μ_{j} (d_{0} + 1) - (d_{0}^{2} - 1)}{μ_{j} {(μ_{j} + 1)}^{2}}\}}_{j = 1}^{r} ℚ^{t} - b_{A L L E}^{t} b_{A L L E}, \end{matrix}

(A3)

where

b_{A L L E}

is the bias of our proposed estimation method. We observed from Theorem 2.1 that

Υ^{- 1} - {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}) Υ^{- 1} (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1}

is positive definite.

Υ^{- 1} [I_{r} - {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}) (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1}]

is positive definite if

2 μ_{j} (d_{0} + 1) - (d_{0}^{2} - 1) > 0 \forall j = 1, \dots, r

. Thus, by using proposition 2.1 and 2.2, we conclude that

M S E ({\hat{ξ}}_{M L E}) > M S E ({\hat{ξ}}_{A L L E});

the proof is ended by proposition 2.1 and 2.2. □

Proof of Theorem 2.2

Δ_{2} = {(Υ + k I_{r})}^{- 1} Υ {(Υ + k I_{r})}^{- 1} - {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}) Υ^{- 1} (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1} + b_{L R R E} b_{L R R E}^{t} - b_{A L L E} b_{A L L E}^{t},

(A4)

where

b_{L R R E} = - k {(Υ + k I_{r})}^{- 1}

. While the difference in scalar MSEs of (26) can be written as

M S E ({\hat{ξ}}_{L R R E}) - M S E ({\hat{ξ}}_{A L L E}) = ℚ d i a g {\{\frac{μ_{j}}{{(μ_{j} + k)}^{2}} - \frac{{(μ_{j} - d_{0})}^{2}}{μ_{j} {(μ_{j} + 1)}^{2}}\}}_{j = 1}^{r} ℚ^{t} + b_{L R R E}^{t} b_{L R R E} - b_{A L L E}^{t} b_{A L L E}

(A5)

= ℚ d i a g {\{\frac{μ_{j}^{2} {(μ_{j} + 1)}^{2} - {(μ_{j} - d_{0})}^{2} {(μ_{j} + k)}^{2}}{μ_{j} {(μ_{j} + k)}^{2} {(μ_{j} + 1)}^{2}}\}}_{j = 1}^{r} ℚ^{t} + b_{L R R E}^{t} b_{L R R E} - b_{A L L E}^{t} b_{A L L E},

(A6)

where

b_{L R R E}

and

b_{A L L E}

is the bias of LRRE and our proposed (ALLE) estimation method. We observed from Theorem 2.2 that

{(Υ + k I_{r})}^{- 1} Υ {(Υ + k I_{r})}^{- 1} - {(Υ + I_{r})}^{- 1} (Υ - d_{0} I_{r}) Υ^{- 1} (Υ - d_{0} I_{r}) {(Υ + I_{r})}^{- 1}

is positive definite if

μ_{j}^{2} {(μ_{j} + 1)}^{2} - {(μ_{j} - d_{0})}^{2} {(μ_{j} + k)}^{2} > 0 \forall j = 1, \dots, r

. Thus, by using proposition 2.1 and 2.2, we conclude that

M S E ({\hat{ξ}}_{L R R E}) > M S E ({\hat{ξ}}_{A L L E});

the proof is ended by proposition 2.1 and 2.2. □

References

Lin, E.; Lin, C.-H.; Lane, H.-Y. Logistic ridge regression to predict bipolar disorder using mRNA expression levels in the N-methyl-D-aspartate receptor genes. J. Affect. Disord. 2022, 297, 309–313. [Google Scholar] [CrossRef] [PubMed]
Afzal, N.; Amanullah, M. Dawoud–Kibria Estimator for the Logistic Regression Model: Method, Simulation and Application. Iran. J. Sci. Tech. Trans. A Sci. 2022, 46, 1483–1493. [Google Scholar] [CrossRef]
Schaefer, R.L. Alternative estimators in logistic regression when the data are collinear. J. Stat. Comput. Simul. 1986, 25, 75–91. [Google Scholar] [CrossRef]
Lukman, A.F.; Ayinde, K. Review and classifications of the ridge parameter estimation techniques. Hacet. J. Math. Stat. 2017, 46, 953–967. [Google Scholar] [CrossRef]
Frisch, R. Statistical confluence analysis by means of complete regression systems. Econ. J. 1934, 45, 741–742. [Google Scholar]
Hoerl, A.E.; Kennard, R.W. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics 1970, 12, 55–67. [Google Scholar] [CrossRef]
Schaefer, R.L.; Roi, L.D.; Wolfe, R.A. A ridge logistic estimator. Commun. Stat.—Theor. Methods 1984, 13, 99–113. [Google Scholar] [CrossRef]
Lee, A.H.; Silvapulle, M.J. Ridge estimation in logistic regression. Commun. Stat. Simul. Comput. 1988, 17, 1231–1257. [Google Scholar] [CrossRef]
Cessie, S.L.; Houwelingen, J.C.V. Ridge estimators in logistic regression. J. R. Stat. Soc. C 1992, 41, 191–201. [Google Scholar] [CrossRef]
Segerstedt, B. On ordinary ridge regression in generalized linear models. Commun. Stat. Theor. Method. 1992, 21, 2227–2246. [Google Scholar] [CrossRef]
Akram, M.N.; Amin, M.; Elhassanein, A.; Aman Ullah, M. A new modified ridge-type estimator for the beta regression model: Simulation and application. AIMS Math. 2022, 7, 1035–1057. [Google Scholar] [CrossRef]
Kibria, B.M.G.; Månsson, K.; Shukur, G. Performance of some logistic ridge regression estimators. Comput. Econ. 2012, 40, 401–414. [Google Scholar] [CrossRef]
Asar, Y. Some new methods to solve multicollinearity in logistic regression. Commun. Stat. Simul. Comput. 2017, 46, 2576–2586. [Google Scholar] [CrossRef]
Hadia, M.; Amin, M.; Akram, M.N. Comparison of link functions for the estimation of logistic ridge regression: An application to urine data. Commun. Stat. Simul. Comput. 2022. [Google Scholar] [CrossRef]
Liu, K. A new class of biased estimate in linear regression. Commun. Stat. 1993, 22, 393–402. [Google Scholar]
Mansson, K.; Kibria, B.M.G.; Shukur, G. On Liu estimators for the logit regression model. Econ. Model. 2012, 29, 1483–1488. [Google Scholar] [CrossRef]
Qasim, M.; Amin, M.; Amanullah, M. On the performance of some new Liu parameters for the gamma regression model. J. Stat. Comput. Simul. 2018, 88, 3065–3080. [Google Scholar] [CrossRef]
İnan, D.; Erdoğan, B.E. Liu-type logistic estimator. Commun. Stat. Simul. Comput. 2013, 42, 1578–1586. [Google Scholar] [CrossRef]
Şiray, G.Ü.; Toker, S.; Kaçiranlar, S. On the restricted Liu estimator in the logistic regression model. Commun. Stat. Simul. Comput. 2015, 44, 217–232. [Google Scholar] [CrossRef]
Asar, Y.; Genc, A. New shrinkage parameters for the Liu-type logistic estimators. Commun. Stat. Simul. Comput. 2016, 45, 1094–1103. [Google Scholar] [CrossRef]
Wu, J. Modified restricted Liu estimator in logistic regression model. Comput. Stat. 2016, 31, 1557–1567. [Google Scholar] [CrossRef]
Wu, J.; Asar, Y. More on the restricted Liu estimator in the logistic regression model. Commun. Statist. Simul. Comput. 2017, 46, 3680–3689. [Google Scholar] [CrossRef]
Lukman, A.F.; Kibria, B.M.G.; Ayinde, K.; Jegede, S.L. Modified one-parameter Liu estimator for the linear regression model. Model. Simul. Eng. 2020, 2020, 9574304. [Google Scholar] [CrossRef]
Wu, J.; Asar, Y.; Arashi, M. On the restricted almost unbiased Liu estimator in the logistic regression model. Commun. Stat. Theor. Method. 2018, 47, 4389–4401. [Google Scholar] [CrossRef]
Varathan, N.; Wijekoon, P. Logistic Liu estimator under stochastic linear restrictions. Stat. Pap. 2019, 60, 595–612. [Google Scholar] [CrossRef]
Pregibon, D. Logistic regression diagnostics. Ann. Stat. 1981, 9, 705–724. [Google Scholar] [CrossRef]
Li, Y.; Asar, Y.; Wu, J. On the stochastic restricted Liu estimator in logistic regression model. J. Stat. Comput. Simul. 2020, 90, 2766–2788. [Google Scholar] [CrossRef]
Amin, M.; Akram, M.N.; Kibria, B.M.G. A new adjusted Liu estimator for the Poisson regression model. Concurr. Comput. Pract. Exper. 2021, 33, e6340. [Google Scholar] [CrossRef]
Farebrother, R.W. Further results on the mean square error of ridge regression. J. R. Stat. Soc. 1976, 38, 248–250. [Google Scholar]
Mustafa, S.; Amin, M.; Akram, M.N.; Afzal, N. On the performance of link functions in the beta ridge regression model: Simulation and application. Concurr. Comput. Pract. Exper. 2022, 34, e7005. [Google Scholar] [CrossRef]
Kutner, M.H.; Nachtsheim, C.J.; Neter, J.; Li, W. Applied Linear Statistical Models, 5th ed.; McGraw Hill: New York, NY, USA, 2005. [Google Scholar]

Figure 1. The BLRM estimators in relation to multicollinearity: (a).

n = 50

, (b).

n = 100

, (c).

n = 200

, (d).

n = 400

.

Figure 1. The BLRM estimators in relation to multicollinearity: (a).

n = 50

, (b).

n = 100

, (c).

n = 200

, (d).

n = 400

.

Figure 2. The BLRM estimators for various

n

: (a).

ρ = 0.80

, (b).

ρ = 0.90

, (c).

ρ = 0.95

, (d).

ρ = 0.99

.

Figure 2. The BLRM estimators for various

n

: (a).

ρ = 0.80

, (b).

ρ = 0.90

, (c).

ρ = 0.95

, (d).

ρ = 0.99

.

Figure 3. Correlation matrix among the seven regressors of a prostate cancer data.

Table 1. Bias and MSE values of the estimators for p = 3.

		Bias			MSE
n	$ρ$	LRRE	LLU	ALLE	MLE	LRRE	LLU	ALLE
50	0.80	−0.1164	−0.0706	−0.0181	297.24	158.62	109.11	22.15
	0.90	0.1104	0.0454	−0.0059	495.09	255.09	187.01	40.57
	0.95	−0.0601	−0.0047	−0.0572	970.93	498.44	393.88	95.04
	0.99	0.2172	0.1165	−0.0585	4195.40	1971.66	1634.97	431.22
100	0.80	0.2245	0.1493	−0.0182	141.16	76.78	50.44	9.45
	0.90	0.1207	0.0485	−0.0047	224.26	115.77	80.56	16.53
	0.95	0.1288	0.0948	−0.0192	410.35	202.71	151.40	32.77
	0.99	−0.0330	−0.0106	−0.0178	1858.28	873.64	698.43	166.81
200	0.80	0.0348	0.0297	−0.0201	65.68	35.79	23.81	5.17
	0.90	0.1545	0.0914	−0.0048	105.27	54.22	38.29	9.97
	0.95	0.1235	0.0583	0.0107	194.05	95.79	72.44	19.80
	0.99	0.2043	0.0872	0.0328	925.09	423.63	348.39	118.45
400	0.80	0.0584	0.0394	0.0067	30.47	16.65	10.46	2.32
	0.90	0.2181	0.1382	0.0566	52.09	27.34	17.83	4.48
	0.95	0.1036	0.0689	0.0183	91.95	47.21	30.58	7.20
	0.99	0.0694	0.0341	0.0212	435.24	213.83	142.51	37.29

Table 2. Bias and MSE values of the estimators for p = 6.

		Bias			MSE
n	$ρ$	LRRE	LLU	ALLE	MLE	LRRE	LLU	ALLE
50	0.80	0.2889	0.1538	−0.0200	983.86	547.58	350.62	21.41
	0.90	0.0248	0.0232	−0.0343	1820.37	982.09	656.32	44.56
	0.95	0.2548	0.1422	0.0029	3505.32	1879.78	1249.99	67.30
	0.99	0.4358	0.2714	−0.0154	17,361.14	9256.68	6295.43	407.18
100	0.80	0.1648	0.1086	−0.0137	349.73	200.38	114.25	6.74
	0.90	0.1927	0.0948	0.0303	617.50	339.62	195.78	12.17
	0.95	0.2478	0.1529	0.0361	1224.53	666.51	392.01	24.99
	0.99	0.0427	0.0111	0.0100	5899.93	3172.73	1845.47	120.22
200	0.80	0.2858	0.2014	0.0484	152.74	89.51	48.15	3.12
	0.90	0.3155	0.1970	0.0865	273.10	155.04	84.30	4.79
	0.95	0.3309	0.2162	0.0968	516.23	285.45	156.09	8.09
	0.99	0.2304	0.1557	0.0749	2529.83	1393.39	756.02	37.58
400	0.80	0.1762	0.1265	0.0719	73.58	43.07	22.74	2.16
	0.90	0.1934	0.1410	0.0957	135.46	76.29	40.17	3.36
	0.95	0.2236	0.1625	0.1109	252.20	139.15	72.36	5.01
	0.99	0.2450	0.1807	0.1292	1203.21	652.75	330.89	18.81

Table 3. Bias and MSE values of the estimators for p = 12.

		Bias			MSE
n	$ρ$	LRRE	LLU	ALLE	MLE	LRRE	LLU	ALLE
50	0.80	0.6909	0.5045	−0.0052	3448.20	1847.27	1398.01	17.48
	0.90	0.5428	0.3454	0.0074	1,049,972.88	3275.03	2453.74	29.13
	0.95	0.6166	0.4575	0.0142	884,816.47	6229.80	4808.46	63.91
	0.99	0.4064	0.2876	−0.0199	2,162,403.68	30,159.23	23,554.65	590.93
100	0.80	0.4906	0.3261	0.0717	818.31	495.73	265.91	3.52
	0.90	0.2566	0.1681	0.0480	1447.22	861.72	454.04	5.08
	0.95	0.3170	0.2055	0.0675	2849.43	1697.90	906.36	8.21
	0.99	0.0922	0.0587	0.0131	13,816.37	8172.86	4350.73	30.44
200	0.80	0.2717	0.1904	0.1051	322.54	200.00	94.28	3.01
	0.90	0.1694	0.1185	0.0773	586.96	357.99	169.71	3.78
	0.95	0.3037	0.2306	0.1461	1144.84	698.86	330.32	4.79
	0.99	0.2496	0.1941	0.1276	5603.54	3422.06	1606.92	16.26
400	0.80	0.3649	0.2926	0.2101	143.93	91.70	41.90	2.77
	0.90	0.2928	0.2394	0.1839	257.25	160.64	71.30	2.82
	0.95	0.2859	0.2350	0.1873	498.05	309.52	136.67	3.23
	0.99	0.2874	0.2443	0.1925	2399.74	1479.83	647.96	6.25

Table 4. Explanation of the respected regressors.

N	Regressor Call	Explanation
1	PSA level	Serum prostate- specific antigen level (mg/mL).
2	Cancer volume (CV)	Estimate of prostate cancer volume (cc).
3	Weight	Prostate weight (gm)
4	Age	Age of patients (years)
5	Benign prostatic hyperplasia (BPH)	Amount of benign prostatic hyperplasia (cm²)
6	Capsular penetration (CP)	Degree of capsular penetration (cm)
7	Gleason score (GS)	Pathologically determined grade of disease using total score of two patterns (summed scores were either 6, 7, or 8 with higher scores indicating more prognosis).

Source: Adapted in part from: Hastie, T. J.; R. J. Tibshirani; and J. Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, New York; Springer. Verlag, 2001.

Table 5. Regression estimates and MSEs of the logistics regression estimators for the prostate cancer data.

	MLE	LRRE	LLE	ALLE
(Intercept)	−10.1574	−2.3825	−5.8266	0.4431
x1	0.1189	0.1090	0.1134	0.0985
x2	−0.1345	−0.1146	−0.1234	−0.0915
x3	0.0001	0.0002	0.0001	−0.0001
x4	0.0973	0.0220	0.0553	−0.0169
x5	−0.2281	−0.2185	−0.2228	−0.2021
x6	0.6487	0.6513	0.6501	0.5976
x7	−0.0673	−0.4491	−0.2799	−0.4555
MSE	45.6873	4.0569	15.9075	2.2935

Note: The best value is in bold font.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.