Bias Reduction for the Marshall-Olkin Extended Family of Distributions with Application to an Airplane’s Air Conditioning System and Precipitation Data

Magalhães, Tiago M.; Gómez, Yolanda M.; Gallardo, Diego I.; Venegas, Osvaldo

doi:10.3390/sym12050851

Open AccessArticle

Bias Reduction for the Marshall-Olkin Extended Family of Distributions with Application to an Airplane’s Air Conditioning System and Precipitation Data

¹

Department of Statistics, Institute of Exact Sciences, Federal University of Juiz de Fora, Juiz de Fora MG 36036-900, Brazil

²

Departamento de Matemáticas, Facultad de Ingeniería, Universidad de Atacama, Copiapó 1530000, Chile

³

Departamento de Ciencias Matemáticasy Físicas, Facultad de Ingeniería, Universidad Católica de Temuco, Temuco 4780000, Chile

^*

Author to whom correspondence should be addressed.

Symmetry 2020, 12(5), 851; https://doi.org/10.3390/sym12050851

Submission received: 13 April 2020 / Revised: 8 May 2020 / Accepted: 9 May 2020 / Published: 22 May 2020

(This article belongs to the Special Issue Symmetric and Asymmetric Distributions: Theoretical Developments and Applications II)

Download

Browse Figures

Versions Notes

Abstract

The Marshall-Olkin extended family of distributions is an alternative for modeling lifetimes, and considers more or less asymmetry than its parent model, achieved by incorporating just one extra parameter. We investigate the bias of maximum likelihood estimators and use it to develop an estimator with less bias than traditional estimators, by a modification of the score function. Unlike other proposals, in this paper, we consider a bias reduction methodology that can be applied to any member of the family and not necessarily to any particular distribution. We conduct a Monte Carlo simulation in order to study the performance of the corrected estimators in finite samples. This simulation shows that the maximum likelihood estimator is quite biased and the proposed estimator is much less biased; in small sample sizes, the bias is reduced by around 50 percent. Two applications, related to the air conditioning system of an airplane and precipitations, are presented to illustrate the results. In those applications, the bias reduction for the shape parameters is close to 25% and the bias reduction also reduces, among others things, the width of the 95% confidence intervals for quantiles lower than 0.594.

Keywords:

asymmetric distributions; bias correction; Marshall-Olkin extended family; maximum likelihood estimators

1. Introduction

Marshall and Olkin [1] proposed a way of introducing a parameter in a family of distributions to compete with such commonly used distributions as the Weibull, gamma and lognormal distributions. Let G and g be the cumulative distribution function (cdf) and the probability density function (pdf) respectively, indexed by the vector of parameter

λ \in Λ

. The pdf of the Marshall-Olkin extended model is given by

\begin{matrix} f (x; α, λ) = \frac{α g (x; λ)}{{1 - \bar{α} \bar{G} (x; λ)}^{2}}, x \in X, α > 0, λ \in Λ, \end{matrix}

(1)

where

X

is the sample space defined for g and G,

\bar{G} = 1 - G

and

\bar{α} = 1 - α

. Henceforth, we use the notation

X \sim M O E_{g} (α, λ)

to refer to a random variable with density provided in (1). Marshall and Olkin [2] called

α

the “tilt” parameter because the hazard rate of

M O E_{g} (α, λ)

is shifted below (

α \geq 1

) or above (

0 < α \leq 1

) the baseline hazard rate related to g. Castellares and Lemonte [3] also showed an interpretation of

M O E_{g} (α, λ)

based on the distribution of order statistics. Specifically, let

{X_{n}, n \in N}

be a sequence of independent, identically distributed (iid) random variables in which each

X_{n}

has baseline cumulative function G. Also, let

N_{1} \sim G e o (α)

, for

0 < α < 1

and

N_{2} \sim G e o (α^{- 1})

, for

α > 1

, with

G e o (p)

denoting the geometric distribution with mean

1 / p

. We have that

\{\begin{matrix} Y_{N_{1}} = min (X_{1}, \dots, X_{N_{1}}) \sim M O E_{g} (α, λ), & if 0 < α \leq 1, and \\ Z_{N_{2}} = max (X_{1}, \dots, X_{N_{2}}) \sim M O E_{g} (α, λ), & if α > 1 . \end{matrix}

In their initial work, the authors considered the case where G and g came from the exponential and Weibull models. Other recent proposals discussed in the literature include the Pareto [4], extended Weibull [5], gamma [6], Lomax [7], linear failure-rate [8], Burr type XII [9], normal [10], geometric [11], Birnbaum-Saunders [12], extended Weibull [13,14], modified Weibull [15], beta [16], generalized exponential [17], extended generalized Lindley [18], additive Weibull [19], Kappa [20] and logistic-exponential [21] distributions, among others. [22] presented a generalization based on the exponentiated method discussed in [23], named the Marshall-Olkin generalized-G family, which included as a particular case the Marshall-Olkin extended model in [1].

From a statistical point of view, interpretations for some of those extensions are as follows: for

α \geq 1

, Marshall and Olkin [1] interpreted the Marshall-Olkin extended exponential model as the conditional distribution, given the variable in the positive axis, of a random variable with logistic survival function. Ristic et al. [6] interpreted the Marshall-Olkin extended gamma model as a minification process, useful in a time series context. Ghitany et al. [7] interpreted the Marshall-Olkin extended Lomax model as a compounding process with exponential mixing model. A similar interpretation was presented in Ghitany and Kotz [8] for the Marshall-Olkin extended linear failure-rate. Gómez-Déniz [11] discretize the Marshall-Olkin extended exponential model in Marshall and Olkin [1] to obtain a generalized version of the geometric distribution. This model also can be seen as an infinite mixture of geometric distributions.

Applications to real data sets for some of those extensions include remission times in bladder cancer patients [7] and cancers in general [24], reliability analyses of electronic devices [15] and mechanical components [20], stress-rupture life of kevlar 49/epoxy strands [25], strengths of glass fibers [25], solid epoxy electrical-insulation in an accelerated voltage life test [25], flood peaks in a river [21], lifetimes of front disk brake pads [21], annual salaries of baseball players in Major League Baseball [26], stream flow amounts [20], etc.

We note that in many simulation studies presented in those works, the bias for the estimator of

α

is greater than the bias for the estimator of

λ

(the vector for the baseline model). If the estimators are biased, this implies that they are inconsistent; consequently, any function of these estimators will be inconsistent. In particular, any measurement of interest, such as mean, median, quantile, etc. will be inconsistent.

For illustrative purposes, we considered the Marshall-Olkin extended exponential (MOEE) with pdf given by

\begin{matrix} f (x; α, θ) = \frac{α θ e^{- θ x}}{{(1 - \bar{α} e^{- θ x})}^{2}}, x > 0, α > 0, θ > 0, \end{matrix}

(2)

and the the Marshall-Olkin extended Rayleigh (MOER), a submodel in the class proposed by Alshangiti et al. [15], for which pdf is

\begin{matrix} f (x; α, θ) = \frac{2 α θ x e^{- θ x^{2}}}{{(1 - \bar{α} e^{- θ x^{2}})}^{2}}, x > 0, α > 0, θ > 0 . \end{matrix}

(3)

Note that in both models, the dimension of

λ

is 1. Moreover,

λ = θ

. Figure 1 shows the density plot for different choices of

α

and

θ

in MOEE and MOER models, respectively.

Figure 2 and Figure 3 show the estimated bias based on 5000 replicates for the MOEE and MOER models and different sample sizes for the maximum likelihood estimators (MLE). These figures illustrate that the bias for the estimator of

α

is considerably greater (in relative terms) than the bias of

θ

in the same models. For this reason, we propose to study a methodology to reduce the bias for the MLE of

α

in the general class of model defined in (1), which can be applied to any member of the class, i.e., with any considered g and G in the baseline model.

In a frequentist context, in general, the maximum likelihood method is used to estimate the parameters of the model. The inferences depend strongly on asymptotic properties of the MLE, for instance, let

\hat{α}

be the MLE of

α

. Among these properties, we have that the MLE is approximately non-biased, i.e.,

E (\hat{α} - α) = 0

and follows a normal distribution when the sample size is large enough. However, likelihood inferences based on asymptotic approximation, when samples are of small or moderate size, may not be reliable.

The study of the behavior of the bias of MLE in small samples is an important area of research. There are several works in the literature related with bias correction; they can be divided into two main approaches: the corrective and the preventive, proposed by Cox and Snell [27] and Firth [28], respectively. In the first method, the bias is corrected after the MLE calculation; in the second, with a modification in the score function, the procedure already computes a less biased estimator than the regular MLE. The two methods are comparable, however Firth’s procedure has gained more popularity in recent years. Assuming g free of parameters in

M O E_{g} (α, λ)

, the aim of our paper is to obtain, through the preventive method, a bias-corrected maximum likelihood estimator (BCE) for

α

that is less biased and shows the useful side-effect for MLE for the components of the vector

λ

.

The work is organized as follows. In Section 2 we develop the procedure to estimate a bias-corrected parameter in

M O E_{g} (α, λ)

. Monte Carlo simulation experiments are presented in Section 3 to discuss the importance of the expression obtained in the previous section, which produces much less biased estimates than the traditional procedure. In Section 4, we consider two empirical examples. We conclude in Section 5 with some final remarks. In the Appendix A we present details of the quantities needed in our work.

2. Bias Correction of MLE

The first known method of bias correction was proposed by Cox and Snell [27]. Specifically, for the

M O E_{g} (α; λ)

model this method requires algebraic manipulations for each specific g adopted. As our main goal is to apply a methodology to reduce the bias in the estimator of

α

for any considered g, we discarded this approach. The same logic is applied to discard the preventive approach of [28], at least in its original form. We added an extra supposition.

Following the idea of Sartori [29], we considered first that g is free of parameters (i.e., the vector

λ

is known). For the univariate case, Firth’s method consisted of modifying the score function, say

S (α)

, by

\begin{matrix} S_{M} (α) = S (α) + M (α), \end{matrix}

(4)

where

M (α) = \frac{1}{2} I {(α)}^{- 1} (κ_{α, α, α} + κ_{α, α α})

,

I (α)

is the information matrix for the model,

κ_{α, α, α} = E [{(S (α))}^{3}]

and

κ_{α, α α} = E [S (α) \frac{\partial S (α)}{\partial α}]

, where

E

is the expectation operator. The solution of the modified likelihood equation

S_{M} (α) = 0

produces the modified MLE, say

{\hat{α}}_{M}

. Firth shows that the order of the bias of

{\hat{α}}_{M}

is reduced from

O (n^{- 1})

to

O (n^{- 2})

when compared with the ordinary MLE. Moreover, the asymptotic distribution of

{\hat{α}}_{M}

coincides with that of

\hat{α}

, i.e.,

\sqrt{n} ({\hat{α}}_{M} - α) \to N (0, I {(α)}^{- 1}), a s n \to \infty,

for more details on bias reduction, see Cordeiro and Cribari-Neto [30].

When

λ

is known, the likelihood function for the

M O E_{g} (α)

distribution is

ℓ (α) = n log α + \sum_{i = 1}^{n} log g (x_{i}) - 2 \sum_{i = 1}^{n} log \{1 - (1 - α) [1 - G (x_{i})]\} .

(5)

It can be verified that

\begin{matrix} I (α) = \frac{n}{3 α^{2}}, κ_{α, α, α} = 0 a n d κ_{α, α α} = - \frac{n}{3 α^{3}}, \end{matrix}

see details in Appendix A. Then, for the

M O E_{g} (α)

model we have that

M (α) = - {(2 α)}^{- 1}

. Therefore,

S_{M} (α) = \frac{(n - 1 / 2)}{α} - 2 \sum_{i = 1}^{n} \frac{(1 - G (x_{i}))}{1 - (1 - α) (1 - G (x_{i}))}

. Solving

S_{M} (α) = 0

, we obtain the BCE.

For the case where

λ

is unknown, the log-likelihood function for

ψ = (α, λ)

is given by

\begin{matrix} ℓ (ψ) = n log α + \sum_{i = 1}^{n} log g (x_{i}; λ) - 2 \sum_{i = 1}^{n} log \{1 - (1 - α) [1 - G (x_{i}; λ)]\} . \end{matrix}

Our proposal is to consider a bias correction methodology only for

α

and not for

λ

. In the introductory section, we justified this fact in some models, such as MOEE and MOER, because the bias for

α

is considerable in small and moderate sample sizes and lower for the components of

λ

, as presented in Figure 2 and Figure 3. The second reason is because with more than one parameter, the form of the Fisher Information matrix, among other cumulants, is not closed. Moreover, these terms required for the application of Firth’s methodology need to be computed for each member of the

M O E_{g} (α; λ)

family. Therefore, an alternative successfully applied in other models such as [29,31], is (i) first compute the constrained MLE

\hat{λ} (α)

for a fixed

α

, and then (ii) apply Firth’s method to the profile score function of

α

, which produces the modified estimator obtained from the non-linear equation

\begin{matrix} \frac{(n - 1 / 2)}{{\hat{α}}_{M}} - 2 \sum_{i = 1}^{n} \frac{(1 - G (x_{i}; λ))}{1 - (1 - {\hat{α}}_{M}) (1 - G (x_{i}; λ))} = 0 . \end{matrix}

(6)

In short, the estimation procedure can be described as

Step 0: choose an initial value for $ψ = (α, λ)$ , say ${\hat{ψ}}^{(0)}$ . A possible value can be ${\hat{ψ}}^{(0)} = (1, {\hat{λ}}^{(0)})$ , where ${\hat{λ}}^{(0)}$ is the MLE for $λ$ considering that $X_{1}, \dots, X_{n}$ are iid from $G (\cdot; λ)$ .
Step 1: For $k = 1, 2, \dots,$ choose ${\hat{λ}}^{(k)}$ as the vector that maximizes the profile log-likelihood function $ℓ ({\hat{α}}^{(k - 1)}; λ)$ in relation to $λ$ .
Step 2: For $k = 1, 2, \dots,$ do ${\hat{α}}_{M}^{(k)}$ as the solution for $α$ in (6) considering $λ = {\hat{λ}}^{(k)}$ .

Steps 1 and 2 are repeated until a convergence rule is satisfied. For instance,

| | ψ^{(k + 1)} - ψ^{(k)} | |

is less than a tolerance value, where

| | x | |

is the Euclidean norm of x.

3. Numerical Results

In this section, we present a simulation study to illustrate the performance of our methodology compared with the traditional MLE in the MOEE and MOER models. Additionally, we present a simulated data set to illustrate the gains obtained when our proposal is applied.

3.1. Reducing the Bias in the MOEE and MOER Models

We evaluate the performance of the MLE and BCE, described in Section 2, through a Monte Carlo simulation. The sample sizes considered are

n = 10, 20, \dots, 100

and the total number of replications was set at 5000. All simulations were performed using the R software [32].

The MOEE model with true parameter vectors

(α = 1.5, θ = 1.5)

and

(α = 2.5, θ = 1.5)

and MOER model with true parameter vectors

(α = 0.4, θ = 0.1)

and

(α = 0.8, θ = 0.05)

were used as an illustration. However, with other sets of

(α, θ)

, the conclusions obtained in this study were similar.

In each of the 5000 replications, for each scenario, the data were drawn from the respective model and we computed the MLE (say

\hat{α}

and

\hat{θ}

) and the BCE (say

{\hat{α}}_{M}

and

{\hat{θ}}_{M}

). Additionally, we computed the estimated absolute relative bias (ARB) and the estimated root mean square errors (RMSE) which are defined, respectively, as

|E (\hat{α}) - α| / α

and

\sqrt{E [{(\hat{α} - α)}^{2}]}

for the MLE of

α

(for the BCE, we replace

\hat{α}

by

{\hat{α}}_{M}

and the terms are analogous for the estimators of

θ

). In Table 1, we present the results using MOEE for

n = 10, \dots, 50

. Since the results were the same using MOER, we preferred to present them through Figure 4 and Figure 5 because it is more interesting for large sample sizes.

We can make some observations from Table 1 and Figure 4 and Figure 5. First, the ARB is not negligible for small samples, but the BCE

{\hat{α}}_{M}

has a smaller ARB than

\hat{α}

. In small sample sizes especially (say

n = 10, 20, 30

), the ARB is half the ARB for

\hat{α}

. In larger sample sizes, the ARB, hence the bias, of the MLE continues larger than the BCE, but as expected, the two terms become closer as n increases. We also note that the RMSE of

{\hat{α}}_{M}

is smaller than the RMSE of

\hat{α}

. On the other hand, the original proposal only considered a bias reduction methodology for

{\hat{α}}_{M}

; this also produces an improvement in the behavior of

{\hat{θ}}_{M}

in comparison with

\hat{θ}

, provided that the bias and RMSE for

{\hat{θ}}_{M}

is less than the bias and RMSE for

\hat{θ}

, in both MOEE and MOER. This suggests that in models where g is indexed by a vector

λ

with one or more parameters, the improvement in terms of bias and RMSE should benefit the BCE of

α

and

λ

jointly, as we will illustrate in Section 4. The histograms of the estimates for

{\hat{α}}_{M}

and

{\hat{θ}}_{M}

show that their respective distributions are less asymmetrical than

\hat{α}

and

\hat{θ}

; these histograms were omitted for the sake of brevity. As there is no theoretical explanation that justifies this behavior, we can interpret it as a side-effect of the BCE procedure.

3.2. A Simulated Example with Outliers

In this simulation study, we show the impact of the bias in an estimation of the MOEE distribution in presence of an outlier. From the model with

α = 2.5

and

θ = 1.5

, we sampled 10 observations:

x = (0.554, 0.960, 1.099, 1.200, 0.424, 0.223, 2.883, 0.938, 0.276, 1.545)

. To illustrate the robustness of the method, we replaced the first observation

x_{1}

by

x_{1} + 1.5 \times sd (x) = 1.732

. This perturbation scheme is usually used in local influence. In Table 2, we present the estimate for the original data set and the data set with presence of the outlier. Please note that the impact on the estimators of

α

differs further for the MLE than for the BCE. Additionally, as expected, the bias for the MLE is considerably smaller than the bias for the BCE. On the other hand, the estimations of

θ

seem not to be much impacted by the presence of the outlier. Figure 6 shows the MOEE pdf of this artificial data set compared with the pdf estimated by MLE and BCE. We also note that the pdf estimated by BCE is closer to the pdf with the original parameters than the pdf estimated by MLE.

4. Two Applications

In this section, we present two real data applications illustrating the gains in bias reduction in the MOEE and MOER models.

4.1. Air Conditioning System of an Airplane Data Set

In this section, we analyze the dataset presented by Linhart and Zucchini [33] (p. 69) to illustrate our method of estimation in MOEE. The data are failure times of the air conditioning system of an airplane: 23, 261, 87, 7, 120, 14, 62, 47, 225, 71, 246, 21, 42, 20, 5, 12, 120, 11, 3, 14, 71, 11, 14, 11, 16, 90, 1, 16, 52, 95. We use the MOEE distribution to fit this data. Table 3 presents the parameter estimates using the MLE and the preventive method of estimation. In agreement with our simulation study, the MLE overestimated parameter vector

λ

, especially for parameter

α

. In this case, besides the benefits related to bias reduction of BCE in comparison with MLE, we observe that the estimated standard error for

α

is lower with the BCE than with the MLE. Figure 7 shows the empirical cdf for this data set compared with the estimated cdf for MOEE in the MLE and BCE. We remark that the Kolmogorov-Statistic (i.e., the maximum distance between the empirical and estimated cdf) is reduced from 0.1284 to 0.1185 considering MLE versus BCE. Finally, Figure 8 presents the randomized quantile residuals [34] for MLE and BCE. If the model is correct, those models are a random sample from the standard normal distribution. Several normality test are presented in the figure. Please note that all p-values related to those tests are greater for the BCE than for the MLE, suggesting a better fit of the BCE estimates.

4.2. Precipitation Data Set

This data set is obtained from Hinkley [35]. It consists of thirty successive values of March precipitation (measured in inches) in Minneapolis/St Paul. We consider this illustration of our method of estimation in MOER. Table 4 presents the parameter estimates using maximum likelihood estimation and the preventive method of estimation. MLE overestimated parameter vector

λ

in comparison with BCE. Also, the bias (estimated via bootstrap) is reduced in both parameters, as well the standard errors. Figure 9 shows the histogram for this data set compared with the estimated pdf for MOER in MLE and BCE. Table 5 shows the approximated 95% confidence interval (CI) for the q-th quantile (say,

x_{q}

) in this model based on MLE and BCE considering different values for q (see Appendix A for details of the construction of this approximated CI). Finally, Figure 10 shows the width of these CI’s based on the two methods. Please note that BCE provides a lower width for

q \leq 0.592

and MLE provides a lower width for

q > 0.592

.

5. Concluding Remarks

We derived an expression for the bias of MLE related to the

α

parameter in the Marshall-Olkin extended family. The expression found allowed us to construct a procedure, using penalized likelihood, which generates a modified MLE with reduced bias. This is a useful development. In many cases, it is very difficult or even impossible to find bias-corrected maximum likelihood estimators for specific distribution families; we were able to find them for the Marshall-Olkin extended family. The reduced-bias estimator presents better performance than the corresponding procedures based on the initial estimator; for sample sizes equal to 10 or 20, the bias reduction was approximately 50%. The mean square error was also reduced. The scheme to use BCE is quite simple to implement in statistical softwares such as R. As mentioned by Firth [28]: “the merits of bias reduction in any particular problem will depend on several factors, including the skewness of the maximum likelihood estimator.” Since the MLE of the parameters in the Marshall-Olkin extended family is quite biased, this is an indication that its distribution is asymmetrical. For a future work, we suggest the study of the skewness of the MLE in the

M O E_{g}

, as, recently performed by Magalhães et al. [36] for the varying beta regression model (BRM) and Magalhães et al. [37] for Weibull censored data (WCD). In the first work, the authors showed that the MLE of the precision parameter is quite asymmetrical, while in the second, the MLE of the parameters is close to symmetry. This agrees with bias literature for the respective models, which showed that estimates of the precision parameters in the varying beta regression model are highly biased while the estimates in WCD are little biased. Since there is no closed-form for the skewness coefficient of the MLE of the

M O E_{g}

, it will be a greater contribution to this family of distributions.

Author Contributions

Conceptualization, T.M.M. and D.I.G.; Formal analysis, T.M.M., Y.M.G. and D.I.G.; Investigation, T.M.M. and Y.M.G.; Methodology, D.I.G. and O.V.; Resources, O.V.; Software, Y.M.G. and D.I.G.; Writing—original draft, D.I.G. and Os. V.; Writing—review & editing, O.V. All authors have read and agreed to the published version of the manuscript.

Funding

The research of O. Venegas was supported by Vicerrectoría de Investigación y Postgrado de la Universidad Católica de Temuco, Projecto interno FEQUIP 2019-INRN-03.

Acknowledgments

We acknowledge the referees’ suggestions that helped us improve this work.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Details of the Computation of M(α)

In this Section we present details of the computation of

I (α)

,

κ_{α, α, α}

and

κ_{α, α α}

, required to compute the term

M (α)

in (4). First, the survival function of the

M O E_{g}

model is given by

S_{M O E_{g}} (x; α) = \frac{α \bar{G} (x)}{1 - \bar{α} \bar{G} (x)}

Also, deriving (5) in relation to

α

we obtain

S (α) = \frac{\partial ℓ (α)}{\partial α} = \frac{1}{α} \{n - 2 \sum_{i = 1}^{n} \frac{α [1 - G (X_{i})]}{1 - (1 - α) [1 - G (X_{i})]}\} = \frac{1}{α} (n - 2 \sum_{i = 1}^{n} U_{i}),

where

U_{i} = \frac{α [1 - G (X_{i})]}{1 - (1 - α) [1 - G (X_{i})]}

. As

U_{i} = S_{M O E_{g}} (X_{i}; α)

, by the inverse transform method it follows that

U_{i} \sim U (0, 1)

and then,

E (U_{i}^{r}) = {(r + 1)}^{- 1}

, for

r \neq - 1

. It is also valid to rewrite the last expression as

S (α) = \frac{1}{α} \sum_{i = 1}^{n} V_{i},

where

V_{i} = 1 - 2 U_{i} \sim U (- 1, 1)

. Please note that

\sum_{i = 1}^{n} V_{i}

is a symmetric random variable around zero; it is then immediate that

κ_{α, α, α} = E [{(S (α))}^{3}] = 0

. On the other hand,

\frac{\partial S (α)}{\partial α} = - \frac{1}{α^{2}} (n - 2 \sum_{i = 1}^{n} \frac{α^{2} {[1 - G (X_{i})]}^{2}}{{1 - (1 - α) [1 - G (X_{i})]}^{2}}) = - \frac{1}{α^{2}} (n - 2 \sum_{i = 1}^{n} U_{i}^{2}) .

Therefore,

I (α) = E [- \frac{\partial S (α)}{\partial α}] = \frac{1}{α^{2}} (n - \frac{2 n}{3}) = \frac{n}{3 α^{2}} .

Finally,

\begin{matrix} κ_{α, α α} & = & E [S (α) \frac{\partial S (α)}{\partial α}] = - \frac{1}{α^{3}} E [(n - 2 \sum_{i = 1}^{n} U_{i}) (n - 2 \sum_{i = 1}^{n} U_{i}^{2})] \\ = & - \frac{1}{α^{3}} [n^{2} - 2 n \sum_{i = 1}^{n} E (U_{i}) - 2 n \sum_{i = 1}^{n} E (U_{i}^{2}) + 4 E (\sum_{i = 1}^{n} U_{i} \sum_{j = 1}^{n} U_{j}^{2})] . \end{matrix}

As

U_{1}, \dots, U_{n}

are iid, we have that

E (\sum_{i = 1}^{n} U_{i} \sum_{j = 1}^{n} U_{j}^{2}) = n E (U_{1}^{3}) + n (n - 1) E (U_{1}) E (U_{1}^{2}) .

Reducing terms, we obtain that

κ_{α, α α} = - n / (3 α^{3})

.

Details of the Asymptotic CI Used in Application 2

For the MOER model, the q-th quantile of the distribution, say

x_{q}

, is given by

x_{q} = {[\frac{1}{θ} log (1 - α + \frac{α}{1 - q})]}^{\frac{1}{2}} .

Thus,

\nabla {(x_{q})}^{⊤} = \frac{\partial x_{q}}{\partial ψ} = \frac{1}{2 θ^{3 / 2}} {[log (1 - α + \frac{α}{1 - q})]}^{- \frac{1}{2}} [\frac{q θ}{(1 - q + q α)}, - log (1 - α + \frac{α}{1 - q})] .

From asymptotic properties of the MLE (which also is valid for the BCE), we have that

\sqrt{n} {[H (\hat{ψ})]}^{- 1} (\hat{ψ} - ψ) \overset{D}{\to} N (0, I_{2}),

where

I_{2}

is the identity matrix with dimension 2 and

H (\cdot)

is the hessian matrix of the model. Therefore, using the delta method it follows that

\sqrt{n} \frac{(\hat{x_{q}} - x_{q})}{{\hat{σ}}_{x_{q}}} \overset{D}{\to} N (0, 1),

where

{\hat{σ}}_{x_{q}}^{2} = \nabla {(\hat{x_{q}})}^{⊤} H (\hat{ψ}) \nabla (\hat{x_{q}})

. Therefore, an asymptotic

100 (1 - p) %

confidence interval based on the delta method for

x_{q}

is given by

I C (x_{q}; 100 (1 - p) %) = \hat{x_{q}} \mp z_{1 - p / 2} {\hat{σ}}_{x_{q}},

with

z_{q}

denoting the q-th quantile of the normal standard model.

References

Marshall, A.W.; Olkin, I. A new method for adding a parameter to a family of distributions with application to the exponential and weibull families. Biometrika 1997, 84, 641–652. [Google Scholar] [CrossRef]
Marshall, A.W.; Olkin, I. Life Distributions. Structure of Nonparametric, Semipara- Metric and Parametric Families; Springer: New York, NY, USA, 2007. [Google Scholar]
Castellares, F.; Lemonte, A.J. On the Marshall-Olkin extended distributions. Commun. Stat. Theory Methods 2016, 45, 4537–4555. [Google Scholar] [CrossRef]
Ghitany, M.E. Marshall-Olkin extended pareto distribution and its application. Int. J. Appl. Math. 2005, 18, 17–32. [Google Scholar]
Ghitany, M.E.; Al-Hussaini, E.K.; Jarallah, R.A. Marshall-Olkin extended Weibull distribution and its application to censored data. J. Appl. Stat. 2005, 32, 1025–1034. [Google Scholar] [CrossRef]
Ristic, M.M.; Kanichukattu, J.; Joseph, A. A marshall-olkin gamma distribution and minification process. STARS Int. J. Sci. 2007, 1, 107–117. [Google Scholar]
Ghitany, M.E.; Al-Awadhi, F.A.; Alkhalfan, L.A. Marshall-olkin extended lomax distribution and its application to censored data. Commun. Stat. Theory Methods 2007, 36, 1855–1866. [Google Scholar] [CrossRef]
Ghitany, M.E.; Kotz, S. Reliability properties of extended linear failure-rate distributions. Probab. Eng. Inf. Sci. 2007, 21, 441–450. [Google Scholar] [CrossRef]
Jayakumar, K.; Mathew, T. On a generalization to Marshall–Olkin scheme and its application to Burr type XII distribution. Stat. Pap. 2008, 49, 421–439. [Google Scholar] [CrossRef]
García, V.; Gómez, E.; Vásquez-Polo, F. A new skew generalization of the normal distribution: Properties and applications. Comput. Stat. Data Anal. 2010, 54, 2021–2034. [Google Scholar] [CrossRef]
Gómez-Déniz, E. Another generalization of the geometric distribution. Test 2010, 19, 399–415. [Google Scholar] [CrossRef]
Lemonte, A. A new extension of the Birnbaum-Saunders distribution. Braz. J. Probab. Stat. 2013, 27, 133–149. [Google Scholar] [CrossRef]
Cordeiro, G.; Lemonte, A. On the Marshall-Olkin extended Weibull distribution. Stat. Pap. 2013, 54, 333–353. [Google Scholar] [CrossRef]
Cordeiro, G.; Lemonte, A.; Ortega, E. The Marshall-Olkin family of distributions: Mathematical properties and new models. J. Stat. Theory Pract. 2013, 8, 343–366. [Google Scholar] [CrossRef]
Alshangiti, A.M.; Kayid, M.; Alarfaj, B. A new family of Marshall-Olkin extended distributions. J. Comput. Appl. Math. 2014, 271, 369–379. [Google Scholar] [CrossRef]
Alizadeh, M.; Cordeiro, G.M.; De Brito, E.; Demétrio, C.G.B. The beta Marshall-Olkin family of distributions. J. Stat. Distrib. Appl. 2015, 2, 1–18. [Google Scholar] [CrossRef]
Ristić, M.; Kundu, D. Marshall-Olkin generalized exponential distribution. Metron 2015, 73, 317–333. [Google Scholar] [CrossRef]
Benkhelifa, L. The Marshall-Olkin extended generalized Lindley distribution: Properties and applications. Commun. Stat. Simul. Computation. 2017, 46, 8306–8330. [Google Scholar] [CrossRef]
Afify, A.Z.; Cordeiro, G.M.; Yousof, H.M.; Saboor, A.; Ortega, E.M.M. The Marshall-Olkin additive Weibull distribution with variable shapes for the hazard rate. Hacet. J. Math. Stat. 2018, 47, 365–381. [Google Scholar] [CrossRef]
Javed, M.; Nawaz, T.; Irfan, M. The Marshall-Olkin Kappa distribution: Properties and applications. J. King Saud Univ. Sci. 2019, 31, 684–691. [Google Scholar] [CrossRef]
Mansoor, M.; Tahir, M.H.; Cordeiro, G.M.; Provost, S.B.; Alzaatreh, A. The Marshall-Olkin logistic-exponential distribution. Commun. Stat. Theory Methods 2019, 48, 220–234. [Google Scholar] [CrossRef]
Yousof, H.M.; Afify, A.Z.; Alizadeh, M.; Nadarajah, S.; Aryal, G.; Hamedani, G. The Marshall-Olkin generalized-G family of distributions with applications. Statistica 2018, 78, 273–295. [Google Scholar]
Durrans, S.R. Distributions of fractional order statistics in hydrology. Water Resour. Res. 1992, 28, 1649–1655. [Google Scholar] [CrossRef]
Guney, Y.; Tuac, Y.; Arslan, O. Marshall-Olkin distribution: Parameter estimation and application to cancer data. J. Appl. Stat. 2017, 44, 2238–2250. [Google Scholar] [CrossRef]
Korkmaz, M.Ç.; Cordeiro, G.M.; Yousof, H.M.; Pescim, R.R.; Afify, A.Z.; Nadarajah, S. The Weibull Marshall-Olkin family: Regression model and application to censored data. Commun. Stat. Theory Methods 2019, 48, 4171–4194. [Google Scholar] [CrossRef]
Nassar, M.; Kumar, D.; Dey, S.; Cordeiro, G.M.; Afify, A.Z. The Marshall-Olkin alpha power family of distributions with applications. J. Comput. Appl. Math. 2019, 1, 41–53. [Google Scholar] [CrossRef]
Cox, D.R.; Snell, E.J. A general definition of residuals. J. R. Society. Ser. B Methodol. 1968, 30, 248–275. [Google Scholar] [CrossRef]
Firth, D. Bias reduction of maximum likelihood estimates. Biometrika 1993, 80, 27–38. [Google Scholar] [CrossRef]
Sartori, N. Bias prevention of maximum likelihood estimates for scalar skew normal and skew t distributions. J. Stat. Plan. Inference 2006, 136, 4259–4275. [Google Scholar] [CrossRef]
Cordeiro, G.M.; Cribari-Neto, F. An introduction to Bartlett Corrections and Bias Reduction; Springer: Berlin/Heidelberg, Geramny, 2014. [Google Scholar]
Arrué, J.; Arellano-Valle, R.B.; Gómez, H.W. Bias reduction of maximum likelihood estimates for a modified skew normal distribution. J. Stat. Comput. Simul. 2016, 86, 2967–2984. [Google Scholar] [CrossRef]
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2017. [Google Scholar]
Linhart, H.; Zucchini, W. Model Selection; John Wiley and Sons: New York, NY, USA, 1986. [Google Scholar]
Dunn, P.; Smyth, G. Randomized quantile residuals. J. Comput. Graph. Stat. 1996, 5, 236–244. [Google Scholar]
Hinkley, D. On quick choice of power transformation. Am. Stat. 1977, 26, 67–69. [Google Scholar] [CrossRef]
Magalhães, T.M.; Botter, D.A.; Sandoval, M.C.; Pereira, G.H.A.; Cordeiro, G.M. Skewness of maximum likelihood estimators in the varying dispersion beta regression model. Commun. Stat. Theory Methods 2019, 48, 4250–4260. [Google Scholar] [CrossRef]
Magalhães, T.M.; Gallardo, D.I.; Gómez, H.W. Skewness of maximum likelihood estimators in the weibull censored data. Symmetry 2019, 11, 1351. [Google Scholar] [CrossRef]

Figure 1. Pdf for MOEE and MOER models under different combinations of parameters.

Figure 2. Estimated bias for the MLE of

α

and

θ

in the MOEE(

α

,

θ

) under different scenarios based on 5000 replicates.

Figure 2. Estimated bias for the MLE of

α

and

θ

in the MOEE(

α

,

θ

) under different scenarios based on 5000 replicates.

Figure 3. Estimated bias for the MLE of

α

and

θ

in the MOER(

α

,

θ

) under different scenarios based on 5000 replicates.

Figure 3. Estimated bias for the MLE of

α

and

θ

in the MOER(

α

,

θ

) under different scenarios based on 5000 replicates.

Figure 4. Estimated bias for the MLE and BCE of

α

and

θ

in the MOER distribution.

Figure 4. Estimated bias for the MLE and BCE of

α

and

θ

in the MOER distribution.

Figure 5. Estimated RMSE for the MLE and BCE of

α

and

θ

in the MOER distribution.

Figure 5. Estimated RMSE for the MLE and BCE of

α

and

θ

in the MOER distribution.

Figure 6. Estimated RMSE for the MLE and BCE of

α

and

θ

in the MOEE.

Figure 6. Estimated RMSE for the MLE and BCE of

α

and

θ

in the MOEE.

Figure 7. Empirical cdf and cdf estimated by MLE and BCE in the air conditioning system of an airplane data set considering the MOEE model.

Figure 8. Randomized quantile residuals for MOEE in the air conditioning system of an airplane. Left panel: MLE. Right panel: BCE. The p-values for the following normality tests are also presented: Kolmogorov-Smirnov (KS), Shapiro-Wilks (SW), Anderson-Darling (AD) and Cramer-von Mises (CVM)

Figure 9. Histogram and estimated pdf for MLE and BCE considering the MOER distribution in the precipitation data set.

Figure 10. Width of 95% CI for quantiles of MOER model considering the MLE (continuous line) and BCE (dashed line).

Table 1. Absolute value of relative biases and the RMSE (in parentheses) in the MOEE distribution.

	$α = 1.5$ , $θ = 1.5$				$α = 2.5$ , $θ = 1.5$
n	Estimator of $α$		Estimator of $θ$		Estimator of $α$		Estimator of $θ$
	MLE	BCE	MLE	BCE	MLE	BCE	MLE	BCE
10	2.329	1.374	0.385	0.062	2.481	1.588	0.297	0.116
10	(0.855)	(0.656)	(1.283)	(1.104)	(0.851)	(0.642)	(1.036)	(0.924)
20	0.711	0.311	0.184	0.033	0.715	0.385	0.144	0.011
20	(0.780)	(0.693)	(0.743)	(0.670)	(0.780)	(0.664)	(0.619)	(0.578)
30	0.414	0.170	0.122	0.030	0.408	0.193	0.096	0.017
30	(0.749)	(0.699)	(0.564)	(0.520)	(0.749)	(0.700)	(0.474)	(0.447)
40	0.295	0.120	0.089	0.021	0.290	0.130	0.070	0.014
40	(0.732)	(0.692)	(0.465)	(0.435)	(0.736)	(0.700)	(0.394)	(0.376)
50	0.233	0.098	0.072	0.019	0.228	0.099	0.057	0.012
50	(0.713)	(0.686)	(0.417)	(0.395)	(0.717)	(0.690)	(0.354)	(0.339)

Table 2. Estimates for the parameters in a simulated data set.

Parameter	True	Original		with Outlier
Parameter	Value	MLE	BCE	MLE	BCE
$α$	2.5	3.873	2.174	5.223	2.866
$θ$	1.5	1.839	1.464	1.817	1.467

Table 3. Estimates for MOEE distribution in the air conditioning system of an airplane. Standard errors are presented in parenthesis.

Parameter	MLE	BCE
$α$	0.380 (0.272)	0.285 (0.225)
$θ$	0.010 (0.005)	0.008 (0.005)

Table 4. Estimates, standard errors (SE) and bias (estimated via bootstrap) for MOER distribution in precipitation data set.

Parameter	MLE			BCE
Parameter	Estimate	SE	Bias	Estimate	SE	Bias
$α$	0.514	0.351	0.255	0.394	0.270	0.213
$θ$	0.185	0.087	0.041	0.157	0.080	0.039

Table 5. Approximated 95% CI for the q-th quantile of the MOER model in precipitation data set based on MLE and BCE.

q	MLE		BCE
q	95% IC	Width	95% IC	Width
0.10	(0.5313–0.5629)	0.0316	(0.5106–0.5361)	0.0256
0.25	(0.8845–0.9619)	0.0773	(0.8553–0.9210)	0.0657
0.50	(1.4225–1.5678)	0.1452	(1.3882–1.5261)	0.1379
0.75	(2.1286–2.3566)	0.2281	(2.1067–2.3593)	0.2526
0.90	(2.8078–3.2958)	0.4880	(2.8108–3.4102)	0.5994
0.99	(3.5118–5.7177)	2.2059	(3.3545–6.3541)	2.9996

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Magalhães, T.M.; Gómez, Y.M.; Gallardo, D.I.; Venegas, O. Bias Reduction for the Marshall-Olkin Extended Family of Distributions with Application to an Airplane’s Air Conditioning System and Precipitation Data. Symmetry 2020, 12, 851. https://doi.org/10.3390/sym12050851

AMA Style

Magalhães TM, Gómez YM, Gallardo DI, Venegas O. Bias Reduction for the Marshall-Olkin Extended Family of Distributions with Application to an Airplane’s Air Conditioning System and Precipitation Data. Symmetry. 2020; 12(5):851. https://doi.org/10.3390/sym12050851

Chicago/Turabian Style

Magalhães, Tiago M., Yolanda M. Gómez, Diego I. Gallardo, and Osvaldo Venegas. 2020. "Bias Reduction for the Marshall-Olkin Extended Family of Distributions with Application to an Airplane’s Air Conditioning System and Precipitation Data" Symmetry 12, no. 5: 851. https://doi.org/10.3390/sym12050851

APA Style

Magalhães, T. M., Gómez, Y. M., Gallardo, D. I., & Venegas, O. (2020). Bias Reduction for the Marshall-Olkin Extended Family of Distributions with Application to an Airplane’s Air Conditioning System and Precipitation Data. Symmetry, 12(5), 851. https://doi.org/10.3390/sym12050851

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Bias Reduction for the Marshall-Olkin Extended Family of Distributions with Application to an Airplane’s Air Conditioning System and Precipitation Data

Abstract

1. Introduction

2. Bias Correction of MLE

3. Numerical Results

3.1. Reducing the Bias in the MOEE and MOER Models

3.2. A Simulated Example with Outliers

4. Two Applications

4.1. Air Conditioning System of an Airplane Data Set

4.2. Precipitation Data Set

5. Concluding Remarks

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A. Details of the Computation of M(α)

Details of the Asymptotic CI Used in Application 2

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI