Generalized Modified Slash Birnbaum–Saunders Distribution

Jimmy Reyes; Inmaculada Barranco-Chamorro; Diego I. Gallardo; Héctor W. Gómez

doi:10.3390/sym10120724

,

and

¹

Departamento de Matemáticas, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile

²

Departamento de Estadística e Investigación Operativa, Universidad de Sevilla, 41000 Sevilla, Spain

³

Departamento de Matemática, Facultad de Ingeniería, Universidad de Atacama, Copiapó 1530000, Chile

^*

Author to whom correspondence should be addressed.

Symmetry2018, 10(12), 724;https://doi.org/10.3390/sym10120724

Version Notes

Order Reprints

Abstract

In this paper, a generalization of the modified slash Birnbaum–Saunders (BS) distribution is introduced. The model is defined by using the stochastic representation of the BS distribution, where the standard normal distribution is replaced by a symmetric distribution proposed by Reyes et al. It is proved that this new distribution is able to model more kurtosis than other extensions of BS previously proposed in the literature. Closed expressions are given for the pdf (probability density functio), along with their moments, skewness and kurtosis coefficients. Inference carried out is based on modified moments method and maximum likelihood (ML). To obtain ML estimates, two approaches are considered: Newton–Raphson and EM-algorithm. Applications reveal that it has potential for doing well in real problems.

Keywords:

Birnbaum–Saunders distribution; generalized modified slash distribution; kurtosis; maximum likelihood; EM-algorithm

1. Introduction

The BS distribution was introduced by Birnbaum and Saunders [1,2]. The aim of this distribution is to model the fatigue in lifetime of certain materials. Nowadays, its use has spread to other contexts such as economic and environmental data. In these new applications, it is quite common to find real datasets in which a BS model with heavier tails would be suitable. Slash models are a good option to deal with this kind of situations, in which heavy tails are a serious problem for the data analyst. This is the main reason slash distributions have received a great deal of attention during the last decades. In this context, we face the problem of improving BS distribution by introducing a generalization able to model more kurtosis than other slash extensions previously proposed in the literature. In these extensions, the emphasis is on kurtosis because, as Moors [3] pointed out, the presence of heavy tails produces high kurtosis. Next, we briefly describe the BS-model and the most relevant slash precedents of our proposal.

1.1. Birnbaum–Saunders Distribution

If a random variable (rv)

T > 0

follows a BS distribution with shape parameter

α > 0

and scale parameter

β > 0

,

T \sim B S (α, β)

, then T can be expressed as

\begin{matrix} T = β {(\frac{α}{2} Z + \sqrt{{(\frac{α}{2} Z)}^{2} + 1})}^{2} \end{matrix}

(1)

where

Z \sim N (0, 1)

. From Equation (1), T is a monotone transformation of Z, and its cumulative distribution function (cdf)

F_{T}

is

F_{T} (t) = Φ (w (t))

(2)

with

Φ (\cdot)

the cdf of a

N (0, 1)

distribution and

w (t) = w_{α, β} (t) = \frac{1}{α} (\sqrt{\frac{t}{β}} - \sqrt{\frac{β}{t}}) = \frac{2}{α} s i n h (ln \sqrt{\frac{t}{β}}), t > 0 .

(3)

The probability density function (pdf) of T is

f_{T} (t; α, β) = \frac{t^{- \frac{3}{2}} (t + β)}{2 α β^{\frac{1}{2}}} ϕ (w (t))

(4)

where

ϕ (\cdot)

is the pdf of a

N (0, 1)

distribution (Johnson et al. [4]).

As properties (see, for instance, Leiva [5]), we highlight that the BS distribution is continuous, unimodal and positively skewed (asymmetry to right).

β

is the median of the distribution.

α

is a shape parameter that modifies the skewness and kurtosis of the distribution. As

α

tends to zero, the BS distribution tends to be symmetrical around

β

and its variability decreases. On the other hand, as

α

increases, the BS distribution exhibits heavier tails.

1.2. Slash Methodology

To use the BS distribution for modeling data with outliers, Gómez et al. [6] and Reyes et al. [7] proposed extensions of BS model based on the slash (S) and modified slash (MS) distribution. In this way, they got extensions of BS distribution with a high kurtosis coefficient.

The canonic slash distribution was introduced by Rogers and Tukey [8]. This model is defined as the ratio of a

N (0, 1)

and an independent uniform

U (0, 1)

distribution. It is proposed as a model for bell-shaped data with heavier tails than the corresponding normal distribution. Their theoretical properties can be seen, for instance, in Rogers and Tukey [8] or Johnson et al. [4]. The slash model, denoted as S, in which a kurtosis parameter q is introduced, is defined as

\begin{matrix} S = \frac{Z}{U^{\frac{1}{q}}} \end{matrix}

(5)

with

Z \sim N (0, 1)

independent of

U \sim U (0, 1)

and

q > 0

. Based on the representation given in Equation (5), Reyes et al. [7] proposed the modified slash (MS) distribution in which the variable at the denominator of Equation (5) is replaced by an exponential distribution of parameter 2, that is,

U \sim E x p (2)

. The MS model exhibits greater kurtosis than the S model. A new extension, called generalized modified slash (GMS) distribution, was introduced recently by Reyes et al. [9]. These authors proposed a new slash model where the denominator in Equation (5) is a Gamma distribution of parameters (

2 q, q

) with

q > 0

. The GMS model generalizes the MS model. As main features of the GMS model, we highlight that is a bell-shaped distribution, symmetrical with respect to zero, and exhibits a greater level of kurtosis than its predecessors, thus it can be of interest to study the distribution of the BS extension obtained when

Z \sim N (0, 1)

in Equation (1) is replaced by a GMS distribution with kurtosis parameter

q > 0

. This proposal is a generalization of the papers by Gómez et al. [6] and Reyes et al. [7] where slash versions of the BS distribution were considered based on the slash and modified slash distribution, called slash Birnbaum–Saunders (SBS) and modified slash Birnbaum–Saunders (MSBS), respectively.

This paper is outlined as follows. In Section 2, the stochastic representation of the generalized modified slash Birnbaum–Saunders (GMSBS) distribution is introduced, and its probability density function, properties, moments, skewness and kurtosis coefficients are obtained. Section 3 is devoted to estimation methods: modified moment and maximum likelihood estimation (an iterative method and the EM-algorithm are proposed). Section 4 assesses the performance of the MLE using the EM algorithm via a simulation study. Two practical applications are given in Section 5.

2. GMSBS Distributions

In this section, the stochastic representation of a GMSBS distribution is introduced. A closed expression for its pdf is obtained and its properties are studied in depth. Motivated by Equation (1), the stochastic expression proposed for a GMSBS distribution is

T = β {(\frac{α}{2} X + \sqrt{{(\frac{α}{2} X)}^{2} + 1})}^{2}, α > 0, β > 0,

(6)

where X follows a Generalized Modified Slash distribution,

X \sim G M S (0, 1, q)

,

q > 0

. It is then said that T follows a GMSBS distribution with parameters

α

,

β

, and q,

T \sim G M S B S (α, β, q)

. Similar to the BS distribution,

α

is a shape parameter and

β

is a scale parameter. It is shown below that the new parameter q allows us to control the kurtosis and skewness of this new model and to obtain distributions with greater level of kurtosis than other slash Birnbaum–Saunders models. This fact allows us to model real datasets in which a BS-model can be appropriate but we have heavy tails, especially on the right.

2.1. Probability Density Function

Since T, introduced in Equation (6), is given as a function X with

X \sim G M S (0, 1, q)

, to obtain the distribution of T, we need the pdf of X, which is given in next lemma.

Lemma 1.

Let

X \sim G M S (0, 1, q)

be defined as

X = Z / V

with

Z \sim N (0, 1)

independent of

V \sim G a (2 q, q)

,

q > 0

. Then, the pdf of X is

f_{X} (x; q) = \frac{{(2 q)}^{q}}{Γ (q)} \int_{0}^{\infty} v^{q} e^{- 2 q v} ϕ (x v) d v, x \in R

(7)

where

ϕ ()

denotes the pdf of a

N (0, 1)

distribution.

Proof.

It can be seen in Reyes et al. [9]. ☐

Lemma 2.

The following closed expression for Equation (7) can be given

f_{X} (x; q) = \{\begin{matrix} \frac{q}{\sqrt{8 π}}, & if x = 0 \\ q \frac{2^{q / 2}}{\sqrt{2 π}} \frac{1}{{| x |}^{q + 2}} U (1 + \frac{q}{2}, \frac{3}{2}, \frac{2}{x^{2}}), & if x \neq 0 \end{matrix}

(8)

where

U (\cdot)

denotes the confluent hypergeometric function of the second kind (Abramowitz and Stegun [10], p. 505).

Proof.

It can be seen in Reyes et al. [9]. ☐

Proposition 1.

Let

T \sim G M S B S (α, β, q)

. Then, the pdf of T is

f_{T} (t; α, β, q) = \frac{2^{q - 1} q^{q} t^{- 3 / 2} (t + β)}{α β^{1 / 2} Γ (q)} \int_{0}^{\infty} v^{q} e^{- 2 q v} ϕ (w (t) v) d v,

(9)

with

w (t) = \frac{1}{α} (\sqrt{\frac{t}{β}} - \sqrt{\frac{β}{t}})

and

t > 0 .

Proof.

From Equation (6)

F_{T} (t) = F_{X} (w (t))

(10)

where

F_{T} (\cdot)

and

F_{X} (\cdot)

denote the cdfs of T and X, respectively, and

w (t)

is given in Equation (3).

From Equation (10), the following relationship for the pdf’s of T and X follows

f_{T} (t) = f_{X} (w (t)) w^{'} (t),

(11)

where

f_{X} (\cdot)

denotes the pdf of a

X \sim G M S (0, 1, q)

.

Finally, from Equation (11), Lemma 1 and Equation (3), the expression proposed in Equation (9) is obtained. ☐

Corollary 1.

From Equation (11) and (8), we have the following closed expression for

f_{T} ()

f_{T} (t; α, β, q) = \{\begin{matrix} \frac{1}{α β \sqrt{8 π}}, & if t = β \\ \frac{2^{q / 2} q^{q + 2}}{\sqrt{8 π}} \frac{(β + t)}{{| t - β |}^{q + 2}} α^{q + 1} β^{(q + 1) / 2} t^{(q - 1) / 2} U (1 + \frac{q}{2}, \frac{3}{2}, \frac{2 q^{2} α^{2} β t}{{(t - β)}^{2}}), & if t \neq β \end{matrix}

(12)

with

t > 0

,

α > 0

,

β > 0

and

q > 0

.

The next corollary relates the new model, proposed in Equation (6), to other slash models previously introduced in the literature.

Corollary 2.

For

q = 1

, the pdf given in Equation (12) reduces to the pdf of a modified slash Birnbaum–Saunders distributions,

M S B S (α, β, 1)

proposed in Reyes el al. [11].

Proof.

This corollary follows from the fact that, for

q = 1

, a

G a (2, 1)

distribution reduces to an exponential,

E x p (2)

, and the stochastic representation proposed in Equation (6). ☐

Figure 1 illustrates the effect of the parameter q on the tails of our proposal. Plots given in this figure compare the pdfs of several

G M S B S

models for different values of q. Specifically, the pdfs of a

G M S B S (0.3, 2, q)

distribution for

q = 8, 3, 1

are given. Note that a greater level of kurtosis is observed for small values of q. These appreciations are formalized in Section 2.3 where moments are obtained.

Figure 1. GMSMS

(α = 0.3, β = 2, q)

pdfs for different values of q.

2.2. Properties

In this subsection, some properties of GMSBS distributions are deduced.

Proposition 2.

Let

T \sim G M S B S (α, β, q)

, with

α > 0

,

β > 0

,

q > 0

. Then,

1.: Let $t_{p}$ be the pth quantile of T, $0 < p < 1$ .

$t_{p} = β {(\frac{α}{2} x_{p} + \sqrt{{(\frac{α}{2} x_{p})}^{2} + 1})}^{2}$

(13)

where $x_{p}$ denotes the pth quantile of $X \sim G M S (0, 1, q) .$
In particular, the median of T is β, $t_{0.5} = β$ .
2.: $\forall b > 0$ , $b T \sim G M S B S (α, b β, q)$ .
3.: $T^{- 1} \sim G M S B S (α, β^{- 1}, q)$ .

Proof.

(1). Equation (13) follows from the fact that Equation (6) is a one-to-one transformation from

R

to

R^{+}

.

On the other hand,

t_{0.5} = β

since

X \sim G M S (0, 1, q)

is a symmetric distribution around zero, and therefore

x_{0.5} = 0

.

(2) and (3) are immediate from Proposition 1 by properly using the change-of-variable technique. ☐

Next, we show that if

q \to \infty

then

G M S B S (α, β, q)

model approaches to a Birnbaum–Saunders distribution. The subscript q is included in the notation to highlight this fact.

Proposition 3.

Let

T_{q} \sim G M S B S (α, β, q)

. Then,

T_{q}

converges in law to

T \sim B S (2 α, β)

as

q \to \infty

, that is

T_{q} \overset{L}{⟶} T, q \to \infty, where T \sim B S (2 α, β) .

(14)

Proof.

See Appendix A, Proof of Proposition 3. ☐

Proposition 3 means that, for large q,

G M S B S (α, β, q)

model can be approached by a Birnbaum–Saunders distribution.

2.3. Moments

Formulae for the moments of order r,

r \in Z^{+}

, in a GMSBS distribution are given next.

Proposition 4.

Let

T \sim G M S B S (α, β, q)

. For

r \in Z^{+}

,

E [T^{r}]

exists if and only if

q > 2 r

and

E [T^{r}] = β^{r} \sum_{y = 0}^{r} (\binom{2 r}{2 y}) \sum_{s = 0}^{y} (\binom{y}{s}) {(\frac{α}{2})}^{2 (r + s - y)} \frac{2^{(r + s - y)} q^{2 (r + s - y)} [2 (r + s - y)]! Γ (q - 2 (r + s - y))}{(r + s - y)! Γ (q)} .

(15)

Proof.

See Appendix A, Proof of Proposition 4. ☐

Remark 1.

From Equation (15), note that

E [T^{r}]

is a polynomial in β of degree r, in α of degree

2 r

(only even powers are obtained), and coefficients that involve rational functions of q (with numerator and denominator of the same degree).

Next, some non-central moments for the GMSBS distribution are given. These expressions involve the Pochhammer symbol or rising factorial, defined for

a > 0

and

k \in Z^{+}

as

{(a)}_{k} = a (a + 1) (a + 2) \dots (a + k - 1) = \frac{Γ (a + k)}{Γ (a)} .

(16)

Corollary 3.

Let

T \sim G M S B S (α, β, q)

and

μ_{r} = E [T^{r}]

. Then

μ_{1} = β [2 α^{2} \frac{q^{2}}{{(q - 2)}_{2}} + 1], q > 2

(mean or expected value of T),

μ_{2} = β^{2} [24 α^{4} \frac{q^{4}}{{(q - 4)}_{4}} + 8 α^{2} \frac{q^{2}}{{(q - 2)}_{2}} + 1], q > 4

,

μ_{3} = β^{3} [480 α^{6} \frac{q^{6}}{{(q - 6)}_{6}} + 144 α^{4} \frac{q^{4}}{{(q - 4)}_{4}} + 18 α^{2} \frac{q^{2}}{{(q - 2)}_{2}} + 1], q > 6

,

μ_{4} = β^{4} [13440 α^{8} \frac{q^{8}}{{(q - 8)}_{8}} + 3840 α^{6} \frac{q^{6}}{{(q - 6)}_{6}} + 480 α^{4} \frac{q^{4}}{{(q - 4)}_{4}} + 32 α^{2} \frac{q^{2}}{{(q - 2)}_{2}} + 1], q > 8

.

Proof.

The proposed results follow from Proposition 4 and Equation (16). Aditional details can be seen in Appendix A, Proof of Corollary 3. ☐

From Corollary 3, it follows that the variance of T is

V a r (T) = 4 β^{2} [α^{2} c_{2} (q) + α^{4} c_{4} (q)], q > 4

(17)

where

\begin{matrix} c_{2} (q) & = & \frac{q^{2}}{{(q - 2)}_{2}} \\ c_{4} (q) & = & \frac{6 q^{4}}{{(q - 4)}_{4}} - \frac{q^{4}}{{{(q - 2)}_{2}}^{2}} . \end{matrix}

The skewness coefficient,

\sqrt{β_{1}}

, and the kurtosis coefficient,

β_{2}

, can be computed by using the previous expressions and the relationships

\sqrt{β_{1}} = \frac{μ_{3} - 3 μ_{1} μ_{2} + 2 μ_{1}^{3}}{{(μ_{2} - μ_{1}^{2})}^{3 / 2}} .

β_{2} = \frac{μ_{4} - 4 μ_{1} μ_{3} + 6 μ_{1}^{2} μ_{2} - 3 μ_{1}^{4}}{{(μ_{2} - μ_{1}^{2})}^{2}} .

Next, the behavior of

\sqrt{β_{1}}

and

β_{2}

as functions of the kurtosis parameter q is studied.

Although the convergence in law, in general, does not imply the convergence in moments, in this case, we have such convergence as

q \to \infty

. The notation

β_{1} (q)

and

β_{2} (q)

is used. The next corollary states explicit results for

\sqrt{β_{1} (q)}

and

β_{2} (q)

, if

q \to \infty

, along with others that help us to understand the behavior of these features. The explicit expressions of

\sqrt{β_{1} (q)}

and

β_{2} (q)

, given in Appendix A, Equation (A12), are used.

Corollary 4.

(1) Limit behavior of skewness coefficient

\begin{matrix} lim_{q \to 6} \sqrt{β_{1} (q)} & = & \infty \\ lim_{q \to \infty} \sqrt{β_{1} (q)} & = & \frac{48 α + 352 α^{4}}{{4 (1 + 5 α^{2})}^{3 / 2}} \end{matrix}

that is, if

q \to \infty

then the skewness coefficient of a

G M S B S (α, β, q)

tends to the skewness coefficient of a

B S (2 α, β)

distribution.

(2) Limit behavior of kurtosis coefficient

\begin{matrix} lim_{q \to 8} β_{2} (q) & = & \infty \\ lim_{q \to \infty} β_{2} (q) & = & 3 + \frac{24 α^{2} + (372 α^{2} + 41)}{{(20 α^{2} + 4)}^{2}} . \end{matrix}

that is, if

q \to \infty

then the kurtosis coefficient of a

G M S B S (α, β, q)

tends to the kurtosis coefficient of a

B S (2 α, β)

distribution.

Proof.

The proposed results follow from expressions for

β_{1} (q)

and

β_{2} (q)

given in Appendix A, Equation (A12), and the moments

μ_{r}

, given in Equation (15). ☐

Remark 2.

Interpretation of parameters in a

G M S B S (α, β, q)

model.

(i) In the

G M S B S

model, as in the Birnbaum–Saunders distribution,

β > 0

is a scale parameter, which is also the median of the distribution (see Equation (6) and Proposition 2).

(ii) It can be seen in Leiva [5] that in the Birnbaum–Saunders distribution

α > 0

is a shape parameter that modifies the skewness and kurtosis of the distribution. As α tends to zero, the BS distribution tends to be more symmetrical around its median β and its variability decreases. The expressions of the skewness coefficient

\sqrt{β_{1}}

, given in Equations (A12) and (17), suggest that α has a similar interpretation in the

G M S B S

model.

(iii) As for the parameter

q > 0

, it is proven through this paper that controls the kurtosis and skewness coefficient in the

G M S B S

model, in such a way that allows us to obtain models with greater level of kurtosis than other slash BS distributions, previously introduced in the literature.

As graphical aid, to show the way in which

α

and q determine the asymmetry and kurtosis of a

G M S B S (α, β, q)

model, see plots in Figure 2. Without loss of generality, the scale parameter is taken equal to one,

β = 1

. They illustrate the way in which the asymmetry and kurtosis coefficients depend on both parameters. Plots in Figure 2 suggest that, on the one hand, for increasing values of

α

, the asymmetry and kurtosis increase. On the other hand, if

α

is fixed, asymmetry and kurtosis coefficients are decreasing functions of q.

Figure 2. Skewness and kurtosis coefficients for

G M S B S (α, β = 1, q)

model as function of q taking

α = 0.25, 1

and 5.

These considerations motivate that GMSBS distribution can be used for modeling more kurtosis than other slash Birnbaum–Saunders distributions previously introduced in the literature such as SBS and MSBS densities. Figure 3 displays the GMSBS pdf plot along with MSBS and SBS densities. Note that the right tail of the GMSBS distribution is heavier than the tails of the other ones.

Figure 3. Comparison of right tails of densities for GMSBS, MSBS and SBS models for the same value for parameters

α, β

and q.

3. Estimation

Let

T_{1}, \dots, T_{n}

be a simple random sample (srs) from

T \sim G M S B S (α, β, q)

,

n > 3

. In this section, we face the problem of estimating

(α, β, q)

. Next, we propose a couple of techniques to tackle this problem.

3.1. Modified Moment Estimation

Following Ng et al. [12], a modified method moment based on Property (3) given in Proposition 2 is next introduced. Thus, we propose to equal

E [T]

,

E [T^{2}]

, and

E [1 / T]

to their corresponding sample moments, that is

\begin{matrix} E [T] & = & β [1 + 2 α^{2} \frac{q^{2}}{{(q - 2)}_{2}}] = \bar{T} \end{matrix}

(18)

\begin{matrix} E [T^{2}] & = & β^{2} [1 + 8 α^{2} \frac{q^{2}}{{(q - 2)}_{2}} + 24 α^{2} \frac{q^{4}}{{(q - 4)}_{4}}] = m_{2} \end{matrix}

(19)

\begin{matrix} E [\frac{1}{T}] & = & \frac{1}{β} [1 + 2 α^{2} \frac{q^{2}}{{(q - 2)}_{2}}] = R \end{matrix}

(20)

where

\bar{T} = \frac{\sum_{i = 1}^{n} T_{i}}{n}

,

m_{2} = \frac{\sum_{i = 1}^{n} T_{i}^{2}}{n}

, and

R = \frac{\sum_{i = 1}^{n} \frac{1}{T_{i}}}{n}

.

Note that

R = H^{- 1}

with

H = \frac{n}{\sum_{i = 1}^{n} \frac{1}{T_{i}}}

the sample harmonic mean.

The solutions of previous equations for

q > 4

and

α > 0

are called the modified moment (MM) estimators, denoted as

{\hat{α}}_{M M}

,

{\hat{β}}_{M M}

, and

{\hat{q}}_{M M}

.

3.2. Maximum Likelihood Estimation

Given a srs

T_{1}, T_{2}, \dots, T_{n}

from a

G M S B S (α, β, q)

distribution and

t_{1}, t_{2}, \dots, t_{n}

their observations, by applying Equation (9), the log-likelihood function is

l (α, β, q) = \sum_{i = 1}^{n} log f_{T} (t_{i}; α, β, q)

= n (q - 1) l o g 2 + n q log q - \frac{3}{2} \sum_{i = 1}^{n} log t_{i} + \sum_{i = 1}^{n} log (t_{i} + β) - n log Γ (q) - n log α - \frac{n}{2} log β

+ \sum_{i = 1}^{n} log G (w_{i})

with

G (w_{i}) = \int_{0}^{\infty} v^{q} e^{- 2 q v} ϕ (w_{i} v) d v

and

w_{i} = \frac{1}{α} (\sqrt{\frac{t_{i}}{β}} - \sqrt{\frac{β}{t_{i}}})

.

To maximize

l (α, β, q)

in

(α, β, q)

, consider the first derivatives of

l (α, β, q)

with respect to

α

,

β

and q, denoted as

{\dot{l}}_{α}

,

{\dot{l}}_{β}

and

{\dot{l}}_{q}

, respectively. From

{\dot{l}}_{α} = 0

,

{\dot{l}}_{β} = 0

and

{\dot{l}}_{q} = 0

, we obtain the likelihood equations, whose expressions are given in Appendix B, and can be solved by using iterative Newton–Raphson methods.

Let us denote by

d (t_{i}) = \frac{\int_{0}^{\infty} v^{q + 2} e^{- 2 q v} ϕ (w_{i} v) d v}{G (w_{i})}

. Then, the following iterative process can be proposed for

k \geq 0

\begin{matrix} {\hat{α}}^{(k + 1)} & = & \frac{1}{n} {\{\sum_{i = 1}^{n} (\frac{t_{i}}{{\hat{β}}^{(k)}} + \frac{{\hat{β}}^{(k)}}{t_{i}} - 2) d^{(k)} (t_{i})\}}^{1 / 2} \end{matrix}

(21)

\begin{matrix} {\hat{β}}^{(k + 1)} & = & {\{\frac{\frac{1}{2 {({\hat{α}}^{(k)})}^{2}} \sum_{i = 1}^{n} t_{i} d^{(k)} (t_{i})}{\frac{n}{2 {\hat{β}}^{(k)}} - \sum_{i = 1}^{n} \frac{1}{t_{i} + {\hat{β}}^{(k)}} + \frac{1}{2 {({\hat{α}}^{(k)})}^{2}} \sum_{i = 1}^{n} \frac{1}{t_{i}} d^{(k)} (t_{i})}\}}^{1 / 2} \end{matrix}

(22)

\begin{matrix} {\hat{q}}^{(k + 1)} & = & exp \{ψ (q^{(k)}) - 1 - ln 2 - \frac{1}{n} \sum_{i = 1}^{n} \sum_{i = 1}^{n} \frac{G_{3}^{(k)} (w_{i})}{G^{(k)} (w_{i})}\} \end{matrix}

(23)

which needs starting values

{\hat{α}}^{(0)}

,

{\hat{β}}^{(0)}

and

{\hat{q}}^{(0)}

to start the recursion. As initial values, the modified moment estimators, previously proposed, can be considered.

Remark 3.

(1) In Equations (21)–(23),

d^{(k)} (\cdot)

,

G^{(k)} (\cdot)

and

G_{3}^{(k)} (\cdot)

denote these expressions evaluated at

{\hat{α}}^{(k)}

,

{\hat{β}}^{(k)}

and

{\hat{q}}^{(k)}

. The expression of

G_{3} (\cdot)

can be seen in Appendix B.

(2) It can be seen in Leiva [5] p. 41 that in the Birnbaum–Saunders model,

B S (α, β)

, the iterative equations for the MLEs of

\hat{α}

and

\hat{β}

are

\begin{matrix} {\hat{α}}^{(k + 1)} & = & \frac{1}{n} {\{\sum_{i = 1}^{n} (\frac{t_{i}}{{\hat{β}}^{(k)}} + \frac{{\hat{β}}^{(k)}}{t_{i}} - 2)\}}^{1 / 2} \end{matrix}

(24)

\begin{matrix} {\hat{β}}^{(k + 1)} & = & {\{\frac{\frac{1}{2 {({\hat{α}}^{(k)})}^{2}} \sum_{i = 1}^{n} t_{i}}{\frac{n}{2 {\hat{β}}^{(k)}} - \sum_{i = 1}^{n} \frac{1}{t_{i} + {\hat{β}}^{(k)}} + \frac{1}{2 {({\hat{α}}^{(k)})}^{2}} \sum_{i = 1}^{n} \frac{1}{t_{i}}}\}}^{1 / 2} . \end{matrix}

(25)

The effect of introducing the generalized modified slash variable on the

B S (α, β)

model can be appreciated by comparing Equations (21) and (22) to Equations (24) and (25).

3.3. ML Estimation Using EM-Algorithm

Taking advantage of the stochastic representation of the GMSBS model, we can develop a more attractive iterative method to find the MLEs based on the EM algorithm (Dempster et al. [13]). This is a well-known tool when unobserved (missing) data or latent variables are present while modeling. This algorithm enables the computationally efficient determination of the ML estimates when iterative procedures are required. Looking at the stochastic representation of a generalized modified slash distribution given in Equation (6), we note that the scale factor V depends on the parameter q, thus we consider a re-parameterization to get the EM-algorithm in the GMSBS model. Then, the resulting stochastic representation for T can be expressed as

\begin{matrix} T = β {(\frac{α}{2} X + \sqrt{{(\frac{α}{2} X)}^{2} + 1})}^{2}, \end{matrix}

(26)

where

X = U^{- 1 / 2} Z

, with

Z \sim N (0, 1)

independent of

U \sim G G (q, 2 q, 2)

, i.e., the generalized gamma distribution whose pdf can be expressed as

h (u) = 2^{q - 1} q^{q} u^{q / 2 - 1} exp {- 2 q u^{- 1 / 2}} / Γ (q), u > 0 .

Under the new parameterization, we have the conditional distribution of T, given

U = u

, follows the

B S (α / \sqrt{u}, β)

distribution. Consequently, the pdf of the T reduces to

\begin{matrix} f_{T} (t) = \frac{t^{- 3 / 2} (t + β) 2^{q - 1} q^{q}}{α β^{1 / 2} Γ (q)} \int_{0}^{\infty} u^{q / 2 - 1} exp {- 2 q u^{- 1 / 2}} ϕ (\sqrt{u} a_{t} (α, β)) d u, t > 0, \end{matrix}

(27)

where

ϕ (\cdot)

is the pdf of

N (0, 1)

distribution.

Let

T_{1}, \dots, T_{n}

be a simple random sample of size n of

T \sim GMSBS (α, β, q)

. Here, the parameter vector is

θ = {(α, β, q)}^{⊤}

, with

θ \in Θ \subseteq R_{+}^{3}

. Let

ℓ_{c} (θ | t_{c})

and

Q (θ | \hat{θ}) = E [ℓ_{c} (θ | t_{c}) | t, \hat{θ}]

denote the complete-data log-likelihood function and its expected value, respectively. Each iteration of the EM algorithm involves two steps. Note that the above setup can be represented through a hierarchical representation given by

\begin{matrix} T_{i} | (U_{i} = u_{i}) & \sim & BS (α / \sqrt{u_{i}}, β), \end{matrix}

(28)

\begin{matrix} U_{i} & \sim & G G (q, 2 q, 2), i = 1, \dots, n . \end{matrix}

(29)

Let

t = {[t_{1}, \dots, t_{n}]}^{⊤}

and

u = {[u_{1}, \dots, u_{n}]}^{⊤}

be observed and unobserved data, respectively. The complete data

t_{c} = {[t^{⊤}, u^{⊤}]}^{⊤}

corresponds to the original data

t

augmented with

u

. We now detail the implementation of the ML estimation of parameters of GMSBS distributions by using the EM-algorithm. In this section, the hierarchical representation given in Equations (28) and (29) is useful to obtain the complete log-likelihood function associated with

t_{c}

, which can be expressed as

\begin{matrix} ℓ_{c} (θ | t_{c}) & \propto & - n log (α) - \frac{n}{2} log (β) - \frac{1}{2 α^{2}} \sum_{i = 1}^{n} u_{i} [\frac{t_{i}}{β} + \frac{β}{t_{i}} - 2] + \sum_{i = 1}^{n} log (t_{i} + β) \\ + & ℓ_{c} (q | t_{c}), \end{matrix}

(30)

where

ℓ_{c} (q | t_{c}) = n [(q - 1) log (2) + q log q - log Γ (q)] + (q / 2 - 1) \sum_{i = 1}^{n} log (u_{i}) - 2 q \sum_{i = 1}^{n} u_{i}^{- 1 / 2}

.

Letting

{\hat{u}}_{i} = E [U_{i} | t_{i}, θ = \hat{θ}]

, it follows that the conditional expectation of the complete log-likelihood function has the form

\begin{matrix} Q (θ | \hat{θ}) & \propto & - n log (α) - \frac{n}{2} log (β) - \frac{1}{2 α^{2}} \sum_{i = 1}^{n} {\hat{u}}_{i} [\frac{t_{i}}{β} + \frac{β}{t_{i}} - 2] + \sum_{i = 1}^{n} log (t_{i} + β) \\ + & Q (q | \hat{θ}), \end{matrix}

(31)

where

Q (q | \hat{θ}) = n [(q - 1) log (2) + q log q - log Γ (q)] + (q / 2 - 1) S_{1 n} - 2 q S_{2 n}

, with

S_{1 n} = \sum_{i = 1}^{n} E [log (U_{i}) | t_{i}]

and

S_{2 n} = \sum_{i = 1}^{n} E [U_{i}^{- 1 / 2} | t_{i}]

. As both quantities

S_{1 n}

and

S_{2 n}

have no explicit forms in the context of our model, they have to be computed numerically. Thus, to compute

Q (q | \hat{θ})

, we use a similar approach to that by Lee and Xu (2004, Section 3.1) [14]. Specifically, let

{u_{r}; r = 1, \dots, R}

be a sample randomly drawn from the conditional distribution

U | (T = t, θ = \hat{θ})

, so the quantity

Q (q | \hat{θ})

can be approximated as follows:

Q (q | \hat{θ}) \approx \frac{1}{R} \sum_{r = 1}^{R} ℓ_{c} (q | u_{r}) .

We then have the EM-algorithm for the ML estimation of the parameters of the GMSBS distributions as follows:

E-step. Given

θ = {\hat{θ}}^{(k)} = {({\hat{α}}^{(k)}, {\hat{β}}^{(k)}, {\hat{q}}^{(k)})}^{⊤}

, compute

{\hat{u_{i}}}^{(k)}

, for

i = 1, \dots, n

.

CM-step I: Update

{\hat{α}}^{(k)}

by maximizing

Q (θ | {\hat{θ}}^{(k)})

over

α

, which leads to the expression:

\begin{matrix} {\hat{α}}^{2 (k + 1)} & = & \frac{S_{u}^{(k)}}{{\hat{β}}^{(k)}} + \frac{{\hat{β}}^{(k)}}{R_{u}^{(k)}} - 2 {\bar{u}}^{(k)}, \end{matrix}

CM-step II: Obtain

{\hat{β}}^{(k + 1)}

as the solution of

{\hat{β}}^{2 (k + 1)} - {\hat{β}}^{(k + 1)} [K ({\hat{β}}^{(k + 1)}) + 2 {\bar{u}}^{(k)} R_{u}^{(k)}] + R_{u} [{\bar{u}}^{(k)} K ({\hat{β}}^{(k + 1)}) + S_{u}^{(k)}] = 0 .

CM-step III: Fix

α = {\hat{α}}^{(k + 1)}

and

β = {\hat{β}}^{(k + 1)}

, update

q^{(k)}

by optimizing

\begin{matrix} {\hat{q}}^{(k + 1)} & = & \arg \max_{q} Q ({\hat{α}}^{(k + 1)}, {\hat{β}}^{(k + 1)}, q | {\hat{θ}}^{(k)}) . \end{matrix}

where

{\bar{u}}^{(k)} = \frac{1}{n} \sum_{i = 1}^{n} {\hat{u}}_{i}^{(k)}, S_{u}^{(k)} = \frac{1}{n} \sum_{i = 1}^{n} {\hat{u}}_{i}^{(k)} t_{i}, and R_{u}^{(k)} = \frac{1}{\frac{1}{n} \sum_{i = 1}^{n} (\frac{{\hat{u}}_{i}^{(k)}}{t_{i}})},

with

K (x) = {\{\frac{1}{n} \sum_{i = 1}^{n} (\frac{1}{x + t_{i}})\}}^{- 1}

. The iterations are repeated until a suitable convergence rule is satisfied, say

| ℓ ({\hat{θ}}^{(k + 1)}) - ℓ ({\hat{θ}}^{(k)}) |

sufficiently small. Useful starting values required to implement this algorithm are those obtained under the normality assumption or by using the modified moment estimates

{\hat{α}}_{M M}

,

{\hat{β}}_{M M}

and

{\hat{q}}_{M M}

.

Remark 4.

(1) Note that, if q tends to ∞, then the estimates of α and β in M-step reduce to those when the BS distribution is used.

(2) Note that CM-Step II requires a one-dimensional search for the root of β, respectively, which can easily be achieved by using the “uniroot" function built in R. On the other hand, CM-Step III can be very slow. An alternative is to use the idea by Lin and Liu [15] (Section 3), and it can be defined as:

CML-step: Update

q^{(k)}

by optimizing the following constrained actual log-likelihood function

\begin{matrix} {\hat{q}}^{(k + 1)} & = & \arg \max_{q} ℓ ({\hat{α}}^{(k + 1)}, {\hat{β}}^{(k + 1)}, q) . \end{matrix}

The corresponding standard errors (s.e.) are calculated from the observed information matrix.

4. Simulation

In this section, a simulation study is carried out to illustrate the behavior of EM algorithm to obtain MLEs of the parameters. By using the representation given in Equation (6), it is possible to generate random numbers for the

G M S B S (α, β, q)

distribution, which leads to the following algorithm.

Simulate $Z_{i} \sim N (0, 1), i = 1, 2, \dots, n .$
Simulate $V_{i} \sim G a (2 q, q), i = 1, 2, \dots, n,$ with $q > 0$ .
Compute $X_{i} = \frac{Z_{i}}{V_{i}} .$ Then, $X_{i} \sim G M S (0, 1, q)$ , $i = 1, 2, \dots, n .$
Compute $T_{i} = β {(\frac{α}{2} X_{i} + \sqrt{{(\frac{α}{2} X_{i})}^{2} + 1})}^{2}$ , $α > 0$ , $β > 0$ . $T_{i} \sim G M S B S (α, β, q)$ for $i = 1, 2, \dots, n .$

Table 1 shows results of simulation studies, which illustrate the behavior of the MLEs for 1000 samples of sizes

n = 50

, 100, and 200 generated from a population distributed as GMSBS(

α, β, q

) for different values of

α

,

β

and q. For each generated sample, MLEs were computed numerically using a EM-algorithm previously proposed. Bias, standard error (s.e) and

\sqrt{M S E}

of estimates are reported.

Table 1. Empirical bias, standard error and

\sqrt{M S E}

for the MLEs of

α

,

β

, and q using the EM-algorithm.

Results in Table 1 show that, when the sample size increases, MLEs’ bias tends to zero, and their standard errors and

{\sqrt{M S E}}^{'} s

decrease. Therefore, they are consistent.

5. Applications

Next, the model is illustrated with two datasets collected by the Department of Mines of the University of Atacama, Chile, representing Neodymium and Nickel levels in samples of minerals.

5.1. Neodymium Dataset

The descriptive summaries are given in Table 2 where

\bar{t}

denotes the sample mean,

S_{t}

the sample standard deviation,

g_{1}

the sample skewness coefficient, and

g_{2}

the sample kurtosis coefficient. GMSBS, MSBS and SBS distributions are fitted to this dataset, the parameters are estimated via maximum likelihood (EM-algorithm), abd their corresponding standard errors are given in parentheses in Table 3. As goodness of fit criteria, the Akaike Information Criterion (AIC) and QQ-plots are considered. Recall that AIC =

- 2 ln (l i k e l i h o o d) + 2 p

where p is the number of parameters to be estimated [16]. The AIC values we obtained are given in Table 3. They suggest that GMSBS model provides the best fit to these data since this model exhibits less AIC.

Table 2. Summary of Neodymium dataset.

Table 3. MLEs for Neodymium dataset, their standard errors (in parenthesis) and AIC values.

Figure 4 depicts the histogram for the data with the fitted density and the empirical cdf along with the cdf estimated by GMSBS model, as well QQ-plots given in Figure 5; these also show the good agreement of the GMSBS model for the Neodymium data.

Figure 4. (left) Histogram of the Neodymium data with estimated pdf of GMSBS distribution; and (right) empirical cdf (dotted lines) with estimated cdf of GMSBS model.

Figure 5. Q-Q plots in Neodymium dataset for: SBS model (left); MSBS model (middle); and GMSBS model (right)

5.2. Nickel Dataset

The descriptive summaries are given in Table 4. GMSBS, MSBS and SBS distributions are fitted to this dataset, the parameters are estimated via maximum likelihood (EM-algorithm), and their corresponding standard errors are given in parentheses in Table 5. The AIC values we obtained are given in Table 5. They suggest that GMSBS model provides the best fit to these data since this model exhibits less AIC.

Table 4. Summary of Nickel dataset.

Table 5. MLEs for Nickel dataset, their standard errors (in parenthesis) and AIC values.

Figure 6 depicts the histogram for the data with the fitted density and the empirical cdf along with the cdf estimated by GMSBS model. QQ-plots are given in Figure 7. All of them show the good agreement of the GMSBS model for the Nickel data.

Figure 6. (left) Histogram of the Nickel data with estimated pdf of GMSBS distribution; and (right) empirical cdf (dotted lines) with estimated cdf of GMSBS model.

Figure 7. Q-Q plots in Nickel dataset for: SBS model (left); MSBS model (middle); and GMSBS model (right)

Author Contributions

All the authors contributed significantly to this research article.

Funding

This research received no external funding.

Acknowledgments

The research of J. Reyes and H.W. Gómez was supported by Grant SEMILLERO UA-2018 (Chile). The research of I. Barranco-Chamorro was supported by Grant CTM2015-68276-R (Spain).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Some Proofs of Results Given in Section 2

In this appendix, details about results dealing with the convergence in law of a

G M S B S (α, β, q)

model to a BS distribution (

q \to \infty

), moments, skewness and kurtosis coefficients for the

G M S B S (α, β, q)

are given.

Proof of Proposition 3.

To obtain the result proposed in Proposition 3, we must prove that

{lim}_{q \to \infty} F_{T_{q}} (t) = F_{T} (t)

, with

F_{T} ()

the cdf of a

B S (2 α, β)

model (see, for instance, Rohatgi and Ehsanes Saleh, [17]).

Since

T_{q} \sim G M S B S (α, β, q)

, then we can write

T_{q} = h (X_{q})

where

X_{q} \sim G M S (0, 1, q)

and

h (\cdot)

was given in Equation (6). Recall that we have the following relationship for the cdf of

T_{q}

F_{T_{q}} (t) = F_{X_{q}} (w (t)) with w (t) = \frac{1}{α} (\sqrt{\frac{t}{β}} - \sqrt{\frac{β}{t}}) .

(A1)

It can be seen in Reyes et al. [9] Proposition 3 that, given

X_{q} \sim G M S (0, 1, q)

, then

X_{q} \overset{L}{⟶} X

as

q \to \infty

where

X \sim N (0, 2)

, that is,

lim_{q \to \infty} F_{X_{q}} (w) = Φ (\frac{w}{2}), with Φ () : cdf of a N (0, 1) .

So, taking the limit in Equation (A1), we have

lim_{q \to \infty} F_{T_{q}} (t) = lim_{q \to \infty} F_{X_{q}} (w (t)) = Φ (\frac{w (t)}{2}) = Φ (\frac{1}{2 α} (\sqrt{\frac{t}{β}} - \sqrt{\frac{β}{t}}))

(A2)

that corresponds to the cdf of a

B S (2 α, β)

distribution. Thus, we obtain the proposed result. ☐

Proof of Proposition 4.

By using the stochastic representation given in Equation (6), we have

E [T^{r}] = β^{r} E [{(\frac{α}{2} X + \sqrt{{(\frac{α}{2} X)}^{2} + 1})}^{2 r}],

with

X \sim G M S (0, 1, q)

.

By using the binomial formula

E [T^{r}] = β^{r} \sum_{k = 0}^{2 r} (\binom{2 r}{k}) E [{({(\frac{α}{2} X)}^{2} + 1)}^{(k / 2)} {(\frac{α}{2} X)}^{2 r - k}],

(A3)

and therefore

E [T^{r}]

exists iff

E [X^{2 r}]

exists, that is, iff

2 r < q

(Reyes et al. [9], Proposition 4). Next we also show that Equation (A3) allows us to obtain the explicit expression of

E [T^{r}]

given in Equation (15).

Note that for odd s

E [{({(\frac{α}{2} X)}^{2} + 1)}^{t} {(\frac{α}{2} X)}^{s}] = 0,

and therefore for

y = \frac{k}{2}

we can write

\begin{matrix} E [T^{r}] & = & β^{r} \sum_{y = 0}^{r} (\binom{2 r}{2 y}) E [{({(\frac{α}{2} X)}^{2} + 1)}^{y} {(\frac{α}{2} X)}^{2 (r - y)}] \\ = & β^{r} \sum_{y = 0}^{r} (\binom{2 r}{2 y}) \sum_{s = 0}^{y} (\binom{y}{s}) {(\frac{α}{2})}^{2 (r + s - y)} E [X^{2 (r + s - y)}] \end{matrix}

where

X \sim G M S (0, 1, q)

is such that

E [X^{2 j}] = \frac{2^{j} q^{2 j} (2 j)! Γ (q - 2 j)}{Γ (q)}

for

q > 2 j

, as can be seen in Reyes et al. [9]. Taking

j = r + s - y

, the result proposed in Equation (15) is obtained. ☐

Proof of Corollary 3.

From Proposition 4, it is straightforward that

μ_{1} = β [2 α^{2} \frac{q^{2} Γ (q - 2)}{Γ (q)} + 1], q > 2

μ_{2} = β^{2} [24 α^{4} \frac{q^{4} Γ (q - 4)}{Γ (q)} + 8 α^{2} \frac{q^{2} Γ (q - 2)}{Γ (q)} + 1], q > 4

μ_{3} = β^{3} [480 α^{6} \frac{q^{6} Γ (q - 6)}{Γ (q)} + 144 α^{4} \frac{q^{4} Γ (q - 4)}{Γ (q)} + 18 α^{2} \frac{q^{2} Γ (q - 2)}{Γ (q)} + 1], q > 6

μ_{4} = β^{4} [13440 α^{8} \frac{q^{8} Γ (q - 8)}{Γ (q)} + 3840 α^{6} \frac{q^{6} Γ (q - 6)}{Γ (q)} + 480 α^{4} \frac{q^{4} Γ (q - 4)}{Γ (q)} + 32 α^{2} + \frac{q^{2} Γ (q - 2)}{Γ (q)} + 1], q > 8,

and, thus, the proposed results follow by using the notation introduced in Equation (16). ☐

Corollary A1 (Central moments).

Let

T \sim G M S B S (α, β, q)

. Then,

(1) The variance of T,

V a r (T)

, was given in Equation (17).

(2) The central moment of order 3 is

E [{(T - μ_{1})}^{3}] = β^{3} [α^{4} d_{4} (q) + α^{6} d_{6} (q)], q > 6,

(A4)

where

\begin{matrix} d_{4} (q) & = & 72 \frac{q^{4}}{{(q - 4)}_{4}} - 24 \frac{q^{4}}{{{(q - 2)}_{2}}^{2}} \end{matrix}

(A5)

\begin{matrix} d_{6} (q) & = & 480 \frac{q^{6}}{{(q - 6)}_{6}} - 144 \frac{q^{6}}{{(q - 4)}_{4} {(q - 2)}_{2}} + 16 \frac{q^{6}}{{{(q - 2)}_{2}}^{3}} . \end{matrix}

(A6)

(3) The central moment of order 4 is

E [{(T - μ_{1})}^{4}] = β^{8} [α^{4} f_{4} (q) + α^{6} f_{6} (q) + α^{8} f_{8} (q)], q > 8,

(A7)

where

\begin{matrix} f_{4} (q) & = & 48 \frac{q^{4}}{{(q - 4)}_{4}} \end{matrix}

(A8)

\begin{matrix} f_{6} (q) & = & 1920 \frac{q^{6}}{{(q - 6)}_{6}} - 576 \frac{q^{6}}{{(q - 4)}_{4} {(q - 2)}_{2}} + 96 \frac{q^{6}}{{{(q - 2)}_{2}}^{3}} \end{matrix}

(A9)

\begin{matrix} f_{8} (q) & = & 13440 \frac{q^{8}}{{(q - 8)}_{8}} - 3840 \frac{q^{8}}{{(q - 6)}_{6} {(q - 2)}_{2}} + 576 \frac{q^{8}}{{(q - 4)}_{4} {{(q - 2)}_{2}}^{2}} \end{matrix}

(A10)

\begin{matrix} - & 48 \frac{q^{8}}{{{(q - 2)}_{2}}^{4}} . \end{matrix}

(A11)

Proof.

They are obtained by considering the results given in Corollary 3 and the following relationships

\begin{matrix} V a r (T) & = & μ_{2} - μ_{1}^{2} \\ E [{(T - μ_{1})}^{3}] & = & μ_{3} - 3 μ_{1} μ_{2} + 2 μ_{1}^{3} \\ E [{(T - μ_{1})}^{4}] & = & μ_{4} - 4 μ_{1} μ_{3} + 6 μ_{1}^{2} μ_{2} - 3 μ_{1}^{4} \end{matrix}

☐

Proposition A1 (Skewness and kurtosis coefficient).

For

T \sim G M S B S (α, β, q)

distribution the skewness,

\sqrt{β_{1}}

, and kurtosis,

β_{2}

, coefficients can be calculated as

\begin{matrix} \sqrt{β_{1}} & = & \frac{μ_{3} - 3 μ_{1} μ_{2} + 2 μ_{1}^{3}}{{(μ_{2} - μ_{1}^{2})}^{3 / 2}} \\ β_{2} & = & \frac{μ_{4} - 4 μ_{1} μ_{3} + 6 μ_{1}^{2} μ_{2} - 3 μ_{1}^{4}}{{(μ_{2} - μ_{1}^{2})}^{2}} \end{matrix}

From previous expressions, we have that

\begin{matrix} \sqrt{β_{1}} & = & \frac{α}{8} \frac{d_{4} (q) + α^{2} d_{6} (q)}{{c_{2} (q) + α^{2} c_{4} (q)}^{3 / 2}} \end{matrix}

(A12)

\begin{matrix} β_{2} & = & \frac{1}{16} \frac{f_{4} (q) + α^{2} f_{6} (q) + α^{4} f_{8} (q)}{{c_{2} (q) + α^{2} c_{4} (q)}^{2}} \end{matrix}

(A13)

Appendix B. Likelihood Equations

The likelihood equations are

\begin{matrix} \sum_{i = 1}^{n} \frac{G_{1} (w_{i})}{G (w_{i})} & = & \frac{n}{α} \end{matrix}

(A14)

\begin{matrix} \sum_{i = 1}^{n} \frac{G_{2} (w_{i})}{G (w_{i})} & = & \frac{n}{2 β} - \sum_{i = 1}^{n} \frac{1}{t_{i} + β} \end{matrix}

(A15)

\begin{matrix} \sum_{i = 1}^{n} \frac{G_{3} (w_{i})}{G (w_{i})} & = & - n (1 + log 2 + log q) + n ψ (q) \end{matrix}

(A16)

where

\begin{matrix} G_{1} (w_{i}) = \frac{\partial G (w_{i})}{\partial α} & = & \frac{1}{α^{3}} (\frac{t_{i}}{β} + \frac{β}{t_{i}} - 2) \int_{0}^{\infty} v^{q + 2} e^{- 2 q v} ϕ (w_{i} v) d v \\ G_{2} (w_{i}) = \frac{\partial G (w_{i})}{\partial β} & = & \frac{1}{2 α^{2} β^{2}} \frac{1}{t_{i}} (t_{i}^{2} - β^{2}) \int_{0}^{\infty} v^{q + 2} e^{- 2 q v} ϕ (w_{i} v) d v \end{matrix}

\begin{matrix} G_{3} (w_{i}) = \frac{\partial G (w_{i})}{\partial q} & = & \int_{0}^{\infty} (ln v - 2 v) v^{q} e^{- 2 q v} ϕ (w_{i} v) d v \end{matrix}

and

ψ (q) = Γ^{'} (q) / Γ (q)

is the digamma function.

References

Birnbaum, Z.W.; Saunders, S.C. A New Family of Life Distributions. J. Appl. Probab. 1969, 6, 319–327. [Google Scholar] [CrossRef]
Birnbaum, Z.W.; Saunders, S.C. Estimation for a family of life distributions with applications to fatigue. J. Appl. Probab. 1969a, 6, 328–347. [Google Scholar] [CrossRef]
Moors, J.J.A. A quantile alternative for kurtosis. J. R. Stat. Soc. Ser. D 1988, 37, 25–32. [Google Scholar] [CrossRef]
Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; Wiley: New York, NY, USA, 1995; Volume 1. [Google Scholar]
Leiva, V. The Birnbaum–Saunders Distribution; Academic Press: New York, NY, US, 2016. [Google Scholar]
Gómez, H.W.; Olivares-Pacheco, J.F.; Bolfarine, H. An extension of the generalized Birnbaum–Saunders distribution. Stat. Probab. Lett. 2009, 79, 331–338. [Google Scholar] [CrossRef]
Reyes, J.; Gómez, H.W.; Bolfarine, H. Modified slash distribution. Stat. J. Theor. App. Stat. 2013, 47, 929–941. [Google Scholar] [CrossRef]
Rogers, W.H.; Tukey, J.W. Understanding some long-tailed symmetrical distributions. Stat. Neerl. 1972, 26, 211–226. [Google Scholar] [CrossRef]
Reyes, J.; Barranco-Chamorro, I.; Gómez, H.W. Generalized modified slash distribution. 2018. Submitted. [Google Scholar]
Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions, 10th ed.; Handbook of Mathematical Functions: Dover, NY, USA, 1972. [Google Scholar]
Reyes, J.; Vilca, F.; Gallardo, D.I.; Gómez, H.W. Modified slash Birnbaum–Saunders distribution. Hacettepe J. Math. Stat. Ser. B 2017, 46, 969–984. [Google Scholar] [CrossRef]
Ng, H.; Kundu, D.; Balakrishnan, N. Modified moment estimation for the two-parameter Birnbaum–Saunders distribution. Comput. Stat. Data Anal. 2003, 43, 283–298. [Google Scholar] [CrossRef]
Dempster, A.P.; Rubin, D.B.; Laird, N.M. Maximum likelihood from incomplete data via the EM algorithm (with discussion). J. R. Stat. Soc. B 1977, 39, 1–38. [Google Scholar]
Lee, S.Y.; Xu, L. Influence analyses of nonlinear mixed-effects model. Comput. Stat. Data Anal. 2004, 45, 321–341. [Google Scholar] [CrossRef]
Lin, X.S.; Liu, X. Markow aging process and phase-type law of mortality. N. Am. Actuar. J. 2007, 11, 92–109. [Google Scholar] [CrossRef]
Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
Rohatgi, V.K.; Ehsanes Saleh, A.K.M.D. An Introduction to Probability and Statistics, 3rd ed.; John Wiley and Sons: New York, NY, USA, 2001. [Google Scholar]

Figure 1. GMSMS

(α = 0.3, β = 2, q)

pdfs for different values of q.

Figure 2. Skewness and kurtosis coefficients for

G M S B S (α, β = 1, q)

model as function of q taking

α = 0.25, 1

and 5.

Figure 3. Comparison of right tails of densities for GMSBS, MSBS and SBS models for the same value for parameters

α, β

and q.

Figure 4. (left) Histogram of the Neodymium data with estimated pdf of GMSBS distribution; and (right) empirical cdf (dotted lines) with estimated cdf of GMSBS model.

Figure 5. Q-Q plots in Neodymium dataset for: SBS model (left); MSBS model (middle); and GMSBS model (right)

Figure 6. (left) Histogram of the Nickel data with estimated pdf of GMSBS distribution; and (right) empirical cdf (dotted lines) with estimated cdf of GMSBS model.

Figure 7. Q-Q plots in Nickel dataset for: SBS model (left); MSBS model (middle); and GMSBS model (right)

Table 1. Empirical bias, standard error and

\sqrt{M S E}

for the MLEs of

α

,

β

, and q using the EM-algorithm.

Table 1. Empirical bias, standard error and

\sqrt{M S E}

for the MLEs of

α

,

β

, and q using the EM-algorithm.

True Value				$n = 50$			$n = 100$			$n = 200$
$α$	$β$	$q$	$\hat{θ}$	bias	s.e.	$\sqrt{MSE}$	bias	s.e.	$\sqrt{MSE}$	bias	s.e.	$\sqrt{MSE}$
1	2	2	$\hat{α}$	0.0578	0.2267	0.2327	0.0455	0.1596	0.1588	0.0373	0.1110	0.1113
			$\hat{β}$	0.1191	0.6707	0.7412	0.0702	0.4794	0.4953	0.0367	0.3273	0.3352
			$\hat{q}$	1.0222	1.8522	3.5878	0.3839	0.7437	1.2601	0.1959	0.4183	0.4789
2			$\hat{α}$	0.1373	0.4633	0.4914	0.1012	0.3235	0.3282	0.0753	0.2290	0.2279
			$\hat{β}$	0.1871	0.9125	1.0495	0.0635	0.6299	0.6729	0.0489	0.4542	0.4677
			$\hat{q}$	1.3715	2.2452	5.0045	0.3781	0.7066	0.9840	0.1812	0.4254	0.4560
3			$\hat{α}$	0.2858	0.7062	0.7504	0.1563	0.4872	0.4798	0.1264	0.3398	0.3385
			$\hat{β}$	0.2776	0.9688	1.1139	0.1234	0.6537	0.7049	0.0535	0.4630	0.4451
			$\hat{q}$	1.2999	2.0412	4.3190	0.3930	0.7107	0.9874	0.1928	0.4161	0.4836
1	1	1	$\hat{α}$	0.1946	0.3061	0.6894	0.1402	0.2041	0.2457	0.1535	0.1373	0.3721
			$\hat{β}$	0.2621	0.4270	4.8105	0.0401	0.2711	0.2847	0.0333	0.1870	0.1996
			$\hat{q}$	0.3416	0.4464	0.6948	0.2206	0.2528	0.3243	0.1723	0.1647	0.2305
	2		$\hat{α}$	0.1741	0.3155	0.3550	0.1443	0.2088	0.2595	0.1461	0.1326	0.3455
			$\hat{β}$	0.1844	0.8233	0.9462	0.0805	0.5533	0.5930	0.0465	0.3566	0.3866
			$\hat{q}$	0.3206	0.4777	0.8124	0.2104	0.2537	0.3223	0.1745	0.1609	0.2323
	3		$\hat{α}$	0.1689	0.3042	0.4005	0.1405	0.2041	0.3346	0.1351	0.1348	0.2716
			$\hat{β}$	0.3346	1.2332	2.3262	0.1246	0.7904	0.8285	0.0445	0.5271	0.5435
			$\hat{q}$	0.3351	0.4584	0.6384	0.2132	0.2594	0.3229	0.1701	0.1608	0.2274
0.5	1	1	$\hat{α}$	0.0734	0.1447	0.1658	0.0712	0.0987	0.1625	0.0667	0.0659	0.1430
			$\hat{β}$	0.0314	0.1952	0.2170	0.0091	0.1306	0.1365	0.0064	0.0889	0.1240
			$\hat{q}$	0.2724	0.4067	0.5825	0.2005	0.2432	0.3146	0.1656	0.1588	0.2270
		2	$\hat{α}$	0.0196	0.1174	0.1117	0.0173	0.0783	0.0732	0.0183	0.0566	0.0570
			$\hat{β}$	0.0193	0.1848	0.1798	0.0080	0.1237	0.1215	0.0041	0.0879	0.0836
			$\hat{q}$	0.9310	1.7205	4.1727	0.2917	0.6519	0.7662	0.1691	0.4138	0.4612
		3	$\hat{α}$	0.0138	0.0993	0.0995	0.0148	0.0695	0.0660	0.0110	0.0488	0.0482
			$\hat{β}$	0.0118	0.1663	0.1742	0.0133	0.1207	0.1182	0.0009	0.0829	0.0832
			$\hat{q}$	2.4498	4.3431	7.8277	0.7645	1.4680	3.0066	0.2994	0.7723	0.9407

Table 2. Summary of Neodymium dataset.

n	$\bar{t}$	$S_{t}$	$g_{1}$	$g_{2}$
86	35.02	35.2307	3.648	18.216

Table 3. MLEs for Neodymium dataset, their standard errors (in parenthesis) and AIC values.

Parameter	SBS	MSBS	GMSBS
$α$	0.289 (0.064)	0.290 (0.105)	0.2087 (0.0366)
$β$	27.247 (1.592)	27.683 (5.983)	28.1102 (1.4994)
q	1.578 (0.426)	2.009 (0.570)	2.5661 (0.9489)
AIC	743.9906	741.3566	739.9394

Table 4. Summary of Nickel dataset.

n	$\bar{t}$	$S_{t}$	$g_{1}$	$g_{2}$
85	21.59	16.5732	2.3922	11.325

Table 5. MLEs for Nickel dataset, their standard errors (in parenthesis) and AIC values.

Parameter	SBS	MSBS	GMSBS
$α$	0.3877 (0.0918)	0.3266 (0.0852)	0.2490 (0.0330)
$β$	17.7982 (1.2464)	17.6017 (1.1027)	17.4961 (1.0622)
q	2.0118 (0.6967)	2.0932 (0.6284)	3.1927 (1.2341)
AIC	670.742	668.251	666.571

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Generalized Modified Slash Birnbaum–Saunders Distribution

Abstract

1. Introduction

1.1. Birnbaum–Saunders Distribution

1.2. Slash Methodology

2. GMSBS Distributions

2.1. Probability Density Function

2.2. Properties

2.3. Moments

3. Estimation

3.1. Modified Moment Estimation

3.2. Maximum Likelihood Estimation

3.3. ML Estimation Using EM-Algorithm

4. Simulation

5. Applications

5.1. Neodymium Dataset

5.2. Nickel Dataset

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A. Some Proofs of Results Given in Section 2

Appendix B. Likelihood Equations

References

Article Metrics

Citations

Article Access Statistics