Survival and Reliability Analysis with an Epsilon-Positive Family of Distributions with Applications

Perla Celis; Rolando de la Cruz; Claudio Fuentes; Héctor W. Gómez

doi:10.3390/sym13050908

,

and

¹

Facultad de Ingeniería y Ciencias, Universidad Adolfo Ibáñez, Diagonal Las Torres 2640, Peñalolén, Santiago 7941169, Chile

²

Department of Statistics, Oregon State University, 217 Weniger Hall, Corvallis, OR 97331, USA

³

Departamento de Matemática, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile

^*

Author to whom correspondence should be addressed.

Symmetry2021, 13(5), 908;https://doi.org/10.3390/sym13050908

This article belongs to the Special Issue Symmetric and Asymmetric Distributions: Theoretical Developments and Applications II

Version Notes

Order Reprints

Abstract

We introduce a new class of distributions called the epsilon–positive family, which can be viewed as generalization of the distributions with positive support. The construction of the epsilon–positive family is motivated by the ideas behind the generation of skew distributions using symmetric kernels. This new class of distributions has as special cases the exponential, Weibull, log–normal, log–logistic and gamma distributions, and it provides an alternative for analyzing reliability and survival data. An interesting feature of the epsilon–positive family is that it can viewed as a finite scale mixture of positive distributions, facilitating the derivation and implementation of EM–type algorithms to obtain maximum likelihood estimates (MLE) with (un)censored data. We illustrate the flexibility of this family to analyze censored and uncensored data using two real examples. One of them was previously discussed in the literature; the second one consists of a new application to model recidivism data of a group of inmates released from the Chilean prisons during 2007. The results show that this new family of distributions has a better performance fitting the data than some common alternatives such as the exponential distribution.

Keywords:

censored data; EM algorithm; epsilon–exponential distribution; exponential distribution; maximum likelihood; reliability analysis; survival analysis; stress-strength parameter

1. Introduction

The statistical analysis of reliability and survival data is an important topic in several areas, including medicine, epidemiology, biology, economics, engineering, and environmental sciences, to name a few. When using a parametric approach, one of the first steps for modeling the data is to choose a suitable distribution that can capture relevant features of the observations of interest. In this context, the gamma and Weibull distributions have become popular choices due to their flexibility that allows for a non-constant hazard rate function and to model skewed data. Although several alternatives were considered to accommodate different cases, researchers have continued to develop extensions and modifications of the standard distributions to increase the flexibility of the models see [1,2,3] for a few examples.

In this paper, we consider a generalization of the distributions with positive support and propose a new family of distributions, called the epsilon–positive family, whose construction is motivated by the ideas behind the generation of skew distributions using symmetric kernels. Specifically, we build upon the ideas from [4], where the authors start with a symmetric around zero distribution f, and define a family of distributions indexed by a parameter

γ > 0

as the set of densities of the form

h (x; γ) = \frac{2}{γ + γ^{- 1}} (f (x / γ) 1_{{x \geq 0}} + f (γ x) 1_{{x < 0}}),

(1)

where

1_{A}

denotes the indicator function of the set A. Some extensions of this family include epsilon-skew-normal family introduced by [5] and the epsilon–skew–symmetric family introduced by [6], both discussed in some details in [7].

Here, starting with a probability density function g with positive support, we obtain a general class that extends the family of distributions with positive support, and that contains the Weibull, gamma and exponential distributions as special cases, depending on the choice of g. Furthermore, we discuss a stochastic representation and how to obtain maximum likelihood estimators for the members of this family. We also derive the corresponding survival and hazard functions and note that one interesting feature of this new class is that the hazard function is not necessarily constant.

The rest of the paper is organized as follows: in Section 2 we define the epsilon–positive family and obtain the hazard and survival functions, mean residual life and stress-strength parameters for this family. In addition, we discuss maximum likelihood estimation and how to obtain such estimates using an EM-type algorithm for the general case. In Section 3, we focus on one specific member of the family introduced in Section 2, namely epsilon–exponential distribution, and discuss its applicability in the analysis of survival data. In Section 4 we discuss two real data examples and we finish with a brief discussion in Section 5. We include Appendix A and Appendix B with some of the technical details.

2. The Epsilon–Positive Family

Let

g (\cdot) = g_{Y} (\cdot; Ψ)

be a probability density function (pdf) with positive support and parameters

Ψ \in ℜ^{p}

. Then, for

0 < ε < 1

, the corresponding epsilon–positive (EP) family of distributions is defined as

f_{X} (x; Ψ, ε) = \frac{1}{2} [g (\frac{x}{1 + ε}) + g (\frac{x}{1 - ε})], x > 0 .

(2)

If a random variable X has the density given in (2), we say that X has an epsilon–positive distribution and write

X \sim E P (Ψ, ε)

.

Observe that as

ε ↓ 0

,

f_{X} (x; Ψ, ε) \to g (x) 1_{{x > 0}}

and therefore the distribution

g_{Y} (\cdot; Ψ)

can be seen as a particular member of the family.

The rth moment of

X \sim E P (Ψ, ε)

,

r = 1, 2, \dots,

is given by

E (X^{r}) = (\frac{{(1 + ε)}^{r + 1}}{2} + \frac{{(1 - ε)}^{r + 1}}{2}) E (Y^{r}),

(3)

where

E (Y^{r})

is the rth moment of

Y \sim g_{Y} (\cdot; Ψ)

. From (3) we obtain that the mean, variance, skewness (CS) and kurtosis (CK) coefficients are (respectively)

E (X) = (1 + ε^{2}) E (Y)

V a r (X) = (1 + 3 ε^{2}) E (Y^{2}) - {(1 + ε^{2})}^{2} E^{2} (Y)

C S = \frac{(1 + 6 ε^{2} + ε^{4}) E (Y^{3}) - 3 (1 + 4 ε^{2} + 3 ε^{4}) E (Y) E (Y^{2}) + 2 {(1 + ε^{2})}^{3} E^{3} (Y)}{{((1 + 3 ε^{2}) E (Y^{2}) - {(1 + ε^{2})}^{2} E^{2} (Y))}^{3 / 2}}

and

C K = \frac{A}{{((1 + 3 ε^{2}) E (Y^{2}) - {(1 + ε^{2})}^{2} E^{2} (Y))}^{2}},

\begin{matrix} A = (1 + 10 ε^{2} + 5 ε^{4}) E (Y^{4}) - 4 (1 + 7 ε^{2} + 7 ε^{4} + ε^{6}) E (Y) E (Y^{3}) \\ + 6 (1 + 5 ε^{2} + 7 ε^{4} + 3 ε^{6}) E^{2} (Y) E (Y^{2}) - 3 (1 + 4 ε^{2} + 6 ε^{4} + 4 ε^{6} + ε^{8}) E^{4} (Y), \end{matrix}

where

E (Y^{r})

,

r = 1, 2, 3, 4

, are the first four moments of the random variable

Y \sim g_{Y} (\cdot; Ψ)

.

To draw observations from an epsilon–positive distribution, we first notice that for

0 < ε < 1

, if

Y \sim g_{Y} (\cdot; Ψ)

and

U_{ε}

(independent from Y) satisfies

P (U_{ε} = 1 + ε) = 1 - P (U_{ε} = 1 - ε) = (1 + ε) / 2

, then

X = U_{ε} Y \sim E P (Ψ, ε)

.

From this stochastic representation, it follows that we can generate EP random variables according to the Algorithm 1:

Algorithm 1 Algorithm to generate observations from an epsilon–positive distribution.

Require: Initialize the algorithm fixing

Ψ

and

ε

1: Generate Y from

g_{Y} (\cdot)

and U from

B e r (p = \frac{1 + ε}{2})

2: if

U = 1

then

3:

U_{ε} \leftarrow 1 + ε

4: else

5:

U_{ε} \leftarrow 1 - ε

6: end if

7: return

X = U_{ε} Y .

Finally, observe that the definition in (2) can be easily extended so we can represent the epsilon–positive family as a finite scale mixture of positive distributions. In fact, for any

0 < ε < 1

we can write

f_{X} (x; Ψ, ε) = \sum_{ξ \in J (ε)} π_{ξ} (ε) \frac{1}{ξ} g (\frac{x}{ξ}),

(4)

where

x > 0

,

π_{ξ} (ε) > 0

are mixing proportions satisfying

\sum_{ξ \in J (ε)} π_{ξ} (ε) = 1

, and

J (ε)

is some finite subset of ℜ that will typically depend on

ε

. For instance, taking

J (ε) = {1 - ε, 1 + ε}

and

π_{ξ} (ε) = (1 \pm ε) / 2

we recover the expression in (2). This representation will be particularly useful in order to obtain maximum likelihood estimates using EM–type algorithms, as we discuss in Section 2.5.

2.1. Reliability Properties

From the definition, we obtain that the survival function

S_{X} (x; Ψ, ε) = P (X > x)

for this family is given by

S_{X} (x; Ψ, ε) = (\frac{1 + ε}{2}) S_{Y} (\frac{x}{1 + ε}) + (\frac{1 - ε}{2}) S_{Y} (\frac{x}{1 - ε}),

(5)

where

S_{Y} (\cdot)

is the survival function associated with the density

g_{Y} (\cdot; Ψ)

. Similarly, the hazard function

λ_{X} (x; Ψ, ϵ) = f_{X} (x; Ψ, ε) / S_{X} (x; Ψ, ε)

is given by

λ_{X} (x; Ψ, ϵ) = \frac{1 + r (x)}{(1 + ε) R (\frac{x}{1 + ε}) + (1 - ε) R (\frac{x}{1 - ε}) r (x)},

(6)

where

r (x) = g_{Y} (\frac{x}{1 - ε}) / g_{Y} (\frac{x}{1 + ε})

, and

R (\cdot) = S_{Y} (\cdot) / g_{Y} (\cdot)

is the Mills ratio.

Table 1 shows some examples of the densities that can be extended using the definition of the epsilon–positive family, with the corresponding densities, survival and hazard functions. Figure 1 and Figure 2 show the pdf, survival and hazard functions of the epsilon-exponential, epsilon-Weibull, epsilon-log-logistic and epsilon-gamma distributions. We can see that in the case of the epsilon-Weibull, epsilon-log-logistic and epsilon-gamma distributions a bimodal shape is obtained when the value of the parameter

ε

is

0.9

.

Table 1. Hazard rate,

λ (\cdot)

, Survival,

S (\cdot)

, and density,

f (\cdot)

, functions of some probability models that can be generalized using the definition in (2) In the table

I (a, β) = \int_{0}^{a} Γ {(β)}^{- 1} u^{β - 1} e^{- u} d u

.

Figure 1. Examples of the probability density

f (x)

, survival

S (x)

and hazard

λ (x)

functions of epsilon-exponential distribution,

E E (σ, ε)

, and epsilon-Weibull distribution,

E W (α, σ, ε)

. Please note that the exponential and Weibull distributions correspond to the case

ε = 0

.

Figure 2. Examples of the probability density

f (x)

, survival

S (x)

and hazard

λ (x)

functions of epsilon-log-logistic distribution,

E L L (σ, ε)

, and epsilon-gamma distribution,

E G (α, σ, ε)

. Please note that the log-logistic and gamma distributions correspond to the case

ε = 0

.

2.2. Mean Residual Life

The mean residual life or life expectancy is an important characteristic of the model. It gives the expected additional lifetime given that a component has survived until time t. For a non-negative continuous random variable

X \sim E P (Ψ, ε)

the mean residual life (

m l r

) function is defined as

m r l (t) = E (X - t | X > t) = E (X | X > t) - t

(7)

where

t > 0

. The above conditional expectation is given by

E (X | X > t) = \int_{t}^{\infty} \frac{x f_{X} (x)}{P (X > t)} d x = \int_{t}^{\infty} \frac{x f_{X} (x)}{1 - F_{X} (t)} d x = \frac{1}{S_{X} (t)} \int_{t}^{\infty} x f_{X} (x) d x .

(8)

Calculation of the integral in (8) is done in the same way as the calculation of the mean. Thus,

I = \int_{t}^{\infty} x f_{X} (x) d x = \int_{t}^{\infty} x \frac{1}{2} [g_{Y} (\frac{x}{1 + ε}) + g_{Y} (\frac{x}{1 - ε})] d x .

Making the changes of variables

z = \frac{x}{1 + ε}

and

u = \frac{x}{1 - ε}

we have

\begin{matrix} I & = \frac{{(1 + ε)}^{2}}{2} \int_{t / (1 + ε)}^{\infty} z g_{Y} (z) d z + \frac{{(1 - ε)}^{2}}{2} \int_{t / (1 - ε)}^{\infty} u g_{Y} (u) d u \\ = \frac{{(1 + ε)}^{2}}{2} S_{Y} (t_{1}) \int_{t_{1}}^{\infty} \frac{z g_{Y} (z)}{S_{Y} (t_{1})} d z + \frac{{(1 - ε)}^{2}}{2} S_{Y} (t_{2}) \int_{t_{2}}^{\infty} \frac{u g_{Y} (u)}{S_{Y} (t_{2})} d u \\ = \frac{{(1 + ε)}^{2}}{2} S_{Y} (t_{1}) E (Y | Y > t_{1}) + \frac{{(1 - ε)}^{2}}{2} S_{Y} (t_{2}) E (Y | Y > t_{2}) \\ = \frac{{(1 + ε)}^{2}}{2} S_{Y} (t_{1}) (E (Y - t_{1} | Y > t_{1}) + t_{1}) + \frac{{(1 - ε)}^{2}}{2} S_{Y} (t_{2}) (E (Y - t_{2} | Y > t_{2}) + t_{2}) \\ = \frac{{(1 + ε)}^{2}}{2} S_{Y} (t_{1}) (m r l_{Y} (t_{1}) + t_{1}) + \frac{{(1 - ε)}^{2}}{2} S_{Y} (t_{2}) (m r l_{Y} (t_{2}) + t_{2}), \end{matrix}

where

t_{1} = \frac{t}{1 + ε}

,

t_{2} = \frac{t}{1 - ε}

, and

m r l_{Y} (t_{i}) = E (Y - t_{i} | Y > t_{i})

,

i = 1, 2

corresponds to the mean residual life of the random variable

Y \sim g_{Y} (\cdot)

. Finally, Equation (7) can be written as

m r l (t) = \frac{\frac{{(1 + ε)}^{2}}{2} S_{Y} (\frac{t}{1 + ε}) (m l r_{Y} (t_{1}) + t_{1}) + \frac{{(1 - ε)}^{2}}{2} S_{Y} (\frac{t}{1 - ε}) (m r l_{Y} (t_{2}) + t_{2})}{(\frac{1 + ε}{2}) S_{Y} (\frac{t}{1 + ε}) + (\frac{1 - ε}{2}) S_{Y} (\frac{t}{1 - ε})} - t .

(9)

2.3. Stress-Strength Parameter

An important concept in reliability theory is the stress-strength parameter. Let

X_{1}

denote the strength of a system or component with a stress

X_{2}

. Then, the stress-strength parameter is defined as

R = P (X_{2} < X_{1})

, which can be viewed as a measure of the system performance. In the next theorem, we look at this quantity when

X_{1}

and

X_{2}

are independent random variables with epsilon-positive distributions.

Theorem 1.

Suppose

X_{1}

and

X_{2}

are random variables independently distributed as

X_{1} \sim E P (Ψ_{1}, ε_{1})

and

X_{2} \sim E P (Ψ_{2}, ε_{2})

, the reliability of the system with stress variable (

X_{2}

) and strength variable (

X_{1}

) is given by

\begin{matrix} R = P (X_{2} < X_{1}) & = \frac{{(1 + ε_{2})}^{2}}{4} (a P (Y_{2} < a Y_{1}) + b P (Y_{2} < b Y_{1})) \\ + \frac{{(1 - ε_{2})}^{2}}{4} (c P (Y_{2} < c Y_{1}) + d P (Y_{2} < d Y_{1})), \end{matrix}

(10)

where

a = \frac{1 + ε_{1}}{1 + ε_{2}}

,

b = \frac{1 - ε_{1}}{1 + ε_{2}}

,

c = \frac{1 + ε_{1}}{1 - ε_{2}}

,

d = \frac{1 - ε_{1}}{1 - ε_{2}}

, and

Y_{i} \sim g_{Y_{i}} (\cdot; Ψ_{i})

,

i = 1, 2

, with

Y_{1}

independent of

Y_{2}

.

Proof of Theorem 1.

Making the changes of variables

z = \frac{x_{1}}{1 + ε}

and

u = \frac{x_{1}}{1 - ε}

we have

\begin{matrix} P (X_{2} < X_{1}) & = \int_{0}^{\infty} F_{X_{2}} (x_{1}) f_{X_{1}} (x_{1}) d x_{1} \\ = \int_{0}^{\infty} [(\frac{1 + ε_{2}}{2}) G_{Y_{2}} (\frac{x_{1}}{1 + ε_{2}}) + (\frac{1 - ε_{2}}{2}) G_{Y_{2}} (\frac{x_{1}}{1 - ε_{2}})] \\ \times [\frac{1}{2} g_{Y_{1}} (\frac{x_{1}}{1 + ε_{1}}) + \frac{1}{2} g_{Y_{1}} (\frac{x_{1}}{1 - ε_{1}})] d x_{1} \\ = \frac{{(1 + ε_{2})}^{2}}{4} a \int_{o}^{\infty} G_{Y_{2}} (a z) g_{Y_{1}} (z) d z + \frac{{(1 + ε_{2})}^{2}}{4} b \int_{o}^{\infty} G_{Y_{2}} (b u) g_{Y_{1}} (u) d u \\ \times \frac{{(1 - ε_{2})}^{2}}{4} c \int_{o}^{\infty} G_{Y_{2}} (c z) g_{Y_{1}} (z) d z + \frac{{(1 - ε_{2})}^{2}}{4} d \int_{o}^{\infty} G_{Y_{2}} (d u) g_{Y_{1}} (u) d u \\ = \frac{{(1 + ε_{2})}^{2}}{4} a P (Y_{2} < a Y_{1}) + \frac{{(1 + ε_{2})}^{2}}{4} b P (Y_{2} < b Y_{1}) + \\ \frac{{(1 - ε_{2})}^{2}}{4} c P (Y_{2} < c Y_{1}) + \frac{{(1 - ε_{2})}^{2}}{4} d P (Y_{2} < d Y_{1}) . \end{matrix}

□

Observe that the same concept can be used to make comparisons between two systems. For example, if

X_{1}

and

X_{2}

denote instead the lifetimes of systems

S_{1}

and

S_{2}

respectively, then, a probability

P (X_{1} < X_{2}) > 0.5

would indicate that the system

S_{2}

is better than the system

S_{1}

in a stochastic sense.

2.4. Maximum Likelihood Estimation

Let

{\tilde{X}}_{n} = (X_{1}, \dots, X_{n})

be a random sample from an

E P (Ψ, ε)

distribution. Then, the maximum likelihood estimator (MLE) of

θ = {(Ψ, ε)}^{'}

is given by

θ_{M L E} = {(\hat{Ψ}, \hat{ε})}_{M L E} = arg max_{Ψ, ε} ℓ (Ψ, ε; {\tilde{X}}_{n}),

(11)

where

ℓ (Ψ, ε; {\tilde{X}}_{n}) = \sum_{i = 1}^{n} log f_{X_{i}} (x_{i}; Ψ, ε)

is the log–likelihood.

Although the MLE for the

E P

family is conceptually straightforward, typically closed form solutions are not available and the MLE need to be obtained numerically. One possibility is the Newton–Raphson algorithm, with iteration equation

{\hat{θ}}^{(k + 1)} = {\hat{θ}}^{(k)} - {[H ({\hat{θ}}^{(k)})]}^{- 1} u ({\hat{θ}}^{(k)}),

(12)

where

θ^{(k)}

be the current estimate of

θ

,

u (θ)

denote the vector of first derivatives of

ℓ (θ; {\tilde{X}}_{n})

, and

H (θ)

.

A disadvantage of this approach is that it requires redthe calculation of the second derivatives of the likelihood function and repeated inversion of potentially large matrices, which can be computationally intensive. Instead, we can consider an expectation–maximization (EM) approach see [8] as a general iterative method for data sets with missing (or incomplete) data.

The mixture representation proposed in (4) is particularly useful in order to use an EM–type algorithm to estimate the model parameters, since it provides a hierarchical scheme for the

E P

family. Next, we show how to implement maximum likelihood estimation using an EM–type algorithm for the

E P

family.

2.5. MLE via the EM Algorithm

From (4), the log–likelihood takes the form,

ℓ (Ψ, ε; {\tilde{X}}_{n}) = \sum_{i = 1}^{n} log (\sum_{ξ \in J (ε)} π_{ξ} (ε) \frac{1}{ξ} g (\frac{x_{i}}{ξ})),

where the derivatives with respect to

Ψ

and

ε

typically lead to a system of equations with no closed form solution. To address this problem, we can “augment” the data

{\tilde{X}}_{n}

using an unobservable matrix

W = (w_{i j})

,

i = 1, \dots, n; j = 1, \dots, m = | J (ε) |

, with elements

w_{i j}

defined as

w_{i j} = \{\begin{matrix} 1, if observation x_{i} comes from the distribution \frac{1}{ξ_{j}} g (\frac{x}{ξ_{j}}) \\ 0, otherwise \end{matrix},

where

ξ_{1}, \dots, ξ_{m}

denote the distinct elements of

J (ε)

. This way, each row of W contains only one 1 and zero 0, and the (complete) log-likelihood for the augmented data

Y = ({\tilde{X}}_{n}, W)

is given by

ℓ_{c} (Ψ, ε; Y) = \sum_{i = 1}^{n} \sum_{j = 1}^{m} w_{i j} (log (\frac{π_{ξ_{j}} (ε)}{ξ_{j}}) + log g (\frac{x_{i}}{ξ_{j}})) .

Then, if we denote by

{\hat{θ}}^{(s)} = (Ψ^{(s)}, ε^{(s)})

the estimate of

θ = (Ψ, ε)

at iteration s, and by

Q (θ, {\hat{θ}}^{(s)})

, the conditional expectation of

ℓ_{c} (θ; Y)

given

{\tilde{X}}_{n}

and

{\hat{θ}}^{(s)}

, we obtain

\begin{matrix} Q (θ, {\hat{θ}}^{(s)}) & = E (ℓ_{c} (θ; Y) | {\tilde{X}}_{n}, {\hat{θ}}^{(s)}) \\ = \sum_{i = 1}^{n} \sum_{j = 1}^{m (s)} w_{i j}^{(s)} (log (\frac{π_{ξ_{j}} (ε)}{ξ_{j}}) + log g (\frac{x_{i}}{ξ_{j}})), \end{matrix}

where

m (s) = | J (ε^{(s)}) |

, and

w_{i j}^{(s)} = \frac{π {(ε^{(s)})}_{j} \frac{1}{ξ_{j}} g (\frac{x_{i}}{ξ_{j}})}{\sum_{j = 1}^{m (s)} π_{ξ_{j}} (ε^{(s)}) \frac{1}{ξ_{j}} g (\frac{x_{i}}{ξ_{j}})} .

From here, it follows that for

J (ε) = {1 - ε, 1 + ε}

, the iteration s of the EM algorithm takes the form:

E–step: For $i = 1, \dots, n$ , compute

$w_{i j}^{(s)} = \frac{g (\frac{x_{i}}{1 + ε^{(s)}})}{g (\frac{x_{i}}{1 + ε^{(s)}}) + g (\frac{x_{i}}{1 - ε^{(s)}})}, ξ_{j} = 1 + ε .$
M–step: Given $ε^{(s)}$ and $Ψ^{(s)}$ , compute

$\begin{matrix} ε^{(s + 1)} & = \frac{2}{n} \sum_{i = 1}^{n} w_{i j}^{(s)} - 1, j = 1 + ε, \\ Ψ^{(s + 1)} & = arg max_{Ψ} Q (θ, {\hat{θ}}^{(s)}) . \end{matrix}$

The E and M steps are alternated repeatedly until a convergence criteria is satisfied. For the variance estimation of the MLEs we consider the bootstrapping method suggested in [9].

3. The Epsilon–Exponential Distribution

If we take

g_{Y} (y; Ψ = σ) = \frac{1}{σ} e^{- y / σ} 1_{{y > 0}}

, the pdf of an exponential distribution, the expression in (2) becomes

f_{X} (x; σ, ε) = \frac{1}{2 σ} [e^{- x / (1 + ε) σ} + e^{- x / (1 - ε) σ}], x > 0,

(13)

where

σ > 0

and

0 < ε < 1

. We say that a random variable X has epsilon–exponential (EE) distribution with scale parameter

σ

and shape parameter

ε

if its density has the form in (13), and we write

X \sim E E (σ, ε)

.

Recall that the rth moment,

r = 1, 2, \dots,

of

Y \sim E x p (σ)

is

E (Y^{r}) = r! σ^{r}

. From (3), when

X \sim E E (σ, ε)

, we obtain that the mean, variance, skewness (CS) and kurtosis (CK) coefficients are, respectively,

\begin{matrix} E (X) & = (1 + ε^{2}) σ \\ V a r (X) & = σ^{2} (1 + 4 ε^{2} - ε^{4}) \\ C S & = \frac{(2 + 18 ε^{2} - 6 ε^{4} + 2 ε^{6})}{{(1 + 4 ε^{2} - ε^{4})}^{3 / 2}} \\ C K & = \frac{(9 + 120 ε^{2} + 18 ε^{4} - 3 ε^{8})}{{(1 + 4 ε^{2} - ε^{4})}^{2}} . \end{matrix}

Please note that for any value of

ε \in (0, 1)

,

C S > 0

and

C K > 0

. It can be seen that

2 < C S < 2.3

and

9 < C K < 11.023

. Figure 3 depicts the behavior of the skewness (CS) and kurtosis (CK) coefficients as a function of

ε

. In the figures we observe that the maximum skewness is attained ar

ε = 0.4

, while the maximum kurtosis coefficient is obtained when

ε = 0.37

.

Figure 3. Skewness (CS) and kurtosis (CK) coefficientes for

X \sim E E (σ, ε)

.

Recall that the survival function of the exponential distribution is of the form

S_{Y} (y) = e^{- y / σ} 1_{{y > 0}}

and the mean residual life is of the form

m r l_{Y} (t) = σ

. Then, it follows from (5), (6) and (9) that if

X \sim E E (σ, ε)

, then the survival and hazard functions and mean residual life are (respectively)

\begin{matrix} S (x; σ, ε) & = \frac{(1 + ε)}{2} e^{- x / (1 + ε) σ} + \frac{(1 - ε)}{2} e^{- x / (1 - ε) σ}, \\ λ (x; σ, ε) & = \frac{1}{σ} [\frac{e^{- x / (1 + ε) σ} + e^{- x / (1 - ε) σ}}{(1 + ε) e^{- x / (1 + ε) σ} + (1 - ε) e^{- x / (1 - ε) σ}}] and \\ m r l (t) & = σ [\frac{{(1 + ε)}^{2} e^{- t / (1 + ε) σ} + {(1 - ε)}^{2} e^{- t / (1 - ε) σ}}{(1 + ε) e^{- t / (1 + ε) σ} + (1 - ε) e^{- t / (1 - ε) σ}}] . \end{matrix}

Interestingly, in contrast to the exponential distribution, it can be shown that the hazard function

λ (x; σ, ε)

of the EE distribution is not constant, but decreasing in x. This feature can be easily observed in Figure 1 (top panel), where we show the pdf, survival and hazard functions of the EE distribution for different values of the parameter

ε

when

σ = 1

. Please note that

λ (x; σ, ε) ⟶ λ_{Y} (x; σ) = 1 / σ

as

ε ⟶ 0

. Additionally,

m r l (t) ⟶ m r l_{Y} (t) = σ

as

ε ⟶ 0

.

Suppose

X_{1}

and

X_{2}

are random variables independently distributed as

X_{1} \sim E E (σ_{1}, ε_{1})

and

X_{2} \sim E E (σ_{2}, ε_{2})

, using Theorem 1 the reliability of the system with stress variable (

X_{2}

) and strength variable (

X_{1}

) is given by

R = \frac{{(1 + ε_{2})}^{2}}{4} (\frac{a^{2} σ_{1}}{a σ_{1} + σ_{2}} + \frac{b^{2} σ_{1}}{b σ_{1} + σ_{2}}) + \frac{{(1 - ε_{2})}^{2}}{4} (\frac{c^{2} σ_{1}}{c σ_{1} + σ_{2}} + \frac{d^{2} σ_{1}}{d σ_{1} + σ_{2}}) .

Please note that when

(ε_{1}, ε_{2}) ⟶ (0, 0)

,

R ⟶ \frac{σ_{1}}{σ_{1} + σ_{2}}

which corresponds to the stress-strength reliability model of the exponential distributions with parameter

σ_{1}

for

X_{1}

(strength), and with parameter

σ_{2}

for

X_{2}

(stress), respectively.

Maximum likelihood estimations for the parameters

σ

and

ε

of the epsilon-exponential distribution, can be obtained following the strategy described in Section 2.4 and Section 2.5 (see the Appendix B for details). Let

X_{1}, X_{2}, \dots, X_{n}

and

Y_{1}, Y_{2}, \dots, Y_{m}

be random samples from

E E (σ_{1}, ε_{1})

and

E E (σ_{2}, ε_{2})

, respectively. Having estimates of

(σ_{1}, ε_{1}, σ_{2}, ε_{2})

, say

({\hat{σ}}_{1}, {\hat{ε}}_{1}, {\hat{σ}}_{2}, {\hat{ε}}_{2})

, by the invariance property of the MLE, the MLE of R becomes

\hat{R} = \frac{{(1 + {\hat{ε}}_{2})}^{2}}{4} (\frac{{\hat{a}}^{2} {\hat{σ}}_{1}}{\hat{a} {\hat{σ}}_{1} + {\hat{σ}}_{2}} + \frac{{\hat{b}}^{2} {\hat{σ}}_{1}}{\hat{b} {\hat{σ}}_{1} + {\hat{σ}}_{2}}) + \frac{{(1 - {\hat{ε}}_{2})}^{2}}{4} (\frac{{\hat{c}}^{2} {\hat{σ}}_{1}}{\hat{c} {\hat{σ}}_{1} + {\hat{σ}}_{2}} + \frac{{\hat{d}}^{2} {\hat{σ}}_{1}}{\hat{d} {\hat{σ}}_{1} + {\hat{σ}}_{2}}) .

Numerical Experiments

To illustrate the properties of the estimators we performed a small simulation study considering 5000 simulated datasets for different pair of values of

σ

and

ε

using Algorithm 1. The goal of the study is to observe the behavior of the MLEs for the model parameters using our proposed EM algorithm considering different sample sizes.

Table 2 summarizes the simulation results. In the table, the columns labeled as “estimate” show the average of the estimators obtained in the simulations, and the columns labeled “SD” show the sample standard deviation of the corresponding estimators. To obtain the standard errors we used the bootstrap method with

B = 150

samples. We observe that the estimates are quite stable and fairly accurate, reporting (on average) numbers close to the true values of the parameters in all cases. Please note that as expected, the precision of the estimates improves as the sample size increase.

Table 2. Summary of simulation results.

4. Survival and Reliability Analysis

Let

T \geq 0

represent the survival time until the occurrence of a “death” event. In this context, suppose we have n subjects with lifetimes determined by a survival function

S (t)

, and that the ith subject is observed for a time

t_{i}

. If the individual dies at time

t_{i}

, its contribution to the likelihood function is given by

L_{i} = f (t_{i})

, where

f (t) = - S^{'} (t)

is the event density associated with

S (t)

, or equivalently,

L_{i} = S (t_{i}) λ (t_{i})

, where

λ (t)

is the corresponding hazard function. On the other hand, if the ith individual is still alive at time

t_{i}

and therefore, under non–informative censoring, all we can say is that their lifetime exceeds

t_{i}

. It follows that the contribution of a censored observation to the likelihood is simply given by

L_{i} = S (t_{i})

. Notice that in either case we evaluate the survival function at time

t_{i}

, because in both cases the ith subject was alive until (at least) time

t_{i}

. A death will multiply this contribution by the hazard

λ (t_{i})

, but a censored observation will not.

We can combine these contributions into a single expression using a death indicator

d_{i}

, taking the value one if individual i died and the value zero otherwise. The resulting likelihood function

L

is of the form

L = \prod_{i = 1}^{n} λ {(t_{i})}^{d_{i}} S (t_{i}) = \prod_{i = 1}^{n} f {(t)}^{d_{i}} S {(t)}^{1 - d_{i}} .

In the next section we will assume that the random variable T follows an epsilon–positive family, and show how estimate the model parameters using the EM algorithm.

4.1. Estimation Using the EM Algorithm

Let

T_{1}, \dots, T_{n}

denote the survival times,

T_{i} \sim E P (Ψ, ε)

. Using the notation introduced in the previous section, the observed data are a collection of pairs

{\tilde{X}}_{n} = {(T_{1}, d_{1}), \dots, (T_{n}, d_{n})}

, where the

d_{i}

,

i = 1, \dots, n

, are the censoring indicators.

In order implement the EM algorithm, we augment the observed data

{\tilde{X}}_{n}

with the unobservable matrix W defined in Section 2.5, and obtain the (complete) likelihood is given by

\begin{matrix} L_{c} (θ; {\tilde{X}}_{n}, W) & = \prod_{i = 1}^{n} \prod_{j = 1}^{m} {\{π_{ξ_{j}} (ε) {[\frac{1}{ξ_{j}} g (\frac{t_{i}}{ξ_{j}})]}^{d_{i}} {[S (\frac{t_{i}}{ξ_{j}})]}^{1 - d_{i}}\}}^{w_{i j}} \end{matrix}

with corresponding (complete) log–likelihood

ℓ_{c} = log L_{c}

.

Then, if

{\hat{θ}}^{(s)} = {(Ψ^{(s)}, ε^{(s)})}^{'}

be the estimate of

θ = {(Ψ, ε)}^{'}

at iteration s, and we denote by

Q (θ, {\hat{θ}}^{(s)})

the conditional expectation of

ℓ_{c} (θ; {\tilde{X}}_{n}, W)

given the observed data

{\tilde{X}}_{n}

and

{\hat{θ}}^{(s)}

, we obtain

\begin{matrix} Q (θ, {\hat{θ}}^{(s)}) & = E (ℓ_{c} (θ; {\tilde{X}}_{n}, W) | {\tilde{X}}_{n}, {\hat{θ}}^{(s)}) \\ = \sum_{i = 1}^{n} \sum_{j = 1}^{m (s)} w_{i j}^{(s)} \{log π_{ξ_{j}} (ε) + d_{i} log [\frac{1}{ξ_{j}} g (\frac{t_{i}}{ξ_{j}})] + (1 - d_{i}) log S (\frac{t_{i}}{ξ_{j}})\}, \end{matrix}

where

w_{i j}^{(s)} = \frac{π_{ξ_{j}} (ε) {[\frac{1}{ξ_{j}} g (\frac{t_{i}}{ξ_{j}})]}^{d_{i}} {[S (\frac{t_{i}}{ξ_{j}})]}^{1 - d_{i}}}{\sum_{j = 1}^{m (s)} π_{ξ_{j}} (ε) {[\frac{1}{ξ_{j}} g (\frac{t_{i}}{ξ_{j}})]}^{d_{i}} {[S (\frac{t_{i}}{ξ_{j}})]}^{1 - d_{i}}} .

Then, for

J (ε) = {1 - ε, 1 + ε}

, the iteration s of the algorithm takes the form:

E–step: For $i = 1, \dots, n$ , compute

$w_{i j}^{(s)} = \frac{w_{i, +}}{w_{i, +} + w_{i, -}}, f o r ξ_{j} = 1 + ε,$

where

$w_{i, +} = {[g (\frac{t_{i}}{1 + ε^{(s)}})]}^{d_{i}} {[(1 + ε^{(s)}) S (\frac{t_{i}}{1 + ε^{(s)}})]}^{1 - d_{i}}$

and

$w_{i, -} = {[g (\frac{t_{i}}{1 - ε^{(s)}})]}^{d_{i}} {[(1 - ε^{(s)}) S (\frac{t_{i}}{1 - ε^{(s)}})]}^{1 - d_{i}} .$
M–step: Given $ε^{(s)}$ and $Ψ^{(s)}$ , compute

$\begin{matrix} ε^{(s + 1)} & = \frac{2}{n} \sum_{i = 1}^{n} w_{i j}^{(s)} - 1, ξ_{j} = 1 + ε, \\ Ψ^{(s + 1)} & = arg max_{Ψ} Q (θ, {\hat{θ}}^{(s)}) . \end{matrix}$

EM Algorithm for the Epsilon-Exponential Distribution

Suppose that the survival times

T_{i} \sim E E (σ, ε)

. Then the EM algorithm takes the form:

E–step: For $i = 1, \dots, n$ , compute

$w_{i j}^{(s)} = \frac{w_{i, +}}{w_{i, +} + w_{i, -}}, for ξ_{j} = 1 + ε,$

where

$w_{i, +} = {[\frac{1}{σ} e^{- t_{i} / (1 + ε^{(s)}) σ^{(s)}}]}^{d_{i}} {[(1 + ε^{(s)}) e^{- t_{i} / (1 + ε^{(s)}) σ^{(s)}}]}^{1 - d_{i}}$

and

$w_{i, -} = {[\frac{1}{σ} e^{- t_{i} / (1 - ε^{(s)}) σ^{(s)}}]}^{d_{i}} {[(1 - ε^{(s)}) e^{- t_{i} / (1 - ε^{(s)}) σ^{(s)}}]}^{1 - d_{i}} .$
M–step: Given $ε^{(s)}$ and $σ^{(s)}$ , compute

$\begin{matrix} ε^{(s + 1)} & = \frac{2}{n} \sum_{i = 1}^{n} w_{i j}^{(s)} - 1, ξ_{j} = 1 + ε, \\ σ^{(s + 1)} & = \sum_{i = 1}^{n} \sum_{j \in J (ε^{(s)})} w_{i j}^{(s)} (\frac{t_{i}}{j}) /\sum_{i = 1}^{n} \sum_{j \in J (ε^{(s)})} w_{i j}^{(s)} d_{i} . \end{matrix}$

5. Real Data Examples

In this section, we use two examples to illustrate the proposed distributions using (un)censored data sets.

5.1. Example 1: Maintenance Data

First, we consider a real data set originally analyzed by [10]. The complete data set correspond active repair times (in hours) for an airborne communication transceiver, and can be found in Table 3.

Table 3. Repair times (in hours) of 46 transceivers.

Using the EM algorithm described in Section 4 we fit an epsilon–exponential (EE) distribution to the active repair times. We obtain that the maximized log–likelihood value

- 103.806

. Alternatively, we also fit an exponential (Exp), exponentiated–exponential (EExp) and Weibull (Wei) distribution, obtaining the maximized log–likelihood values of

- 105.006

,

- 104.983

and

- 104.470

, respectively. For model comparison, we use the Akaike information criterion (AIC) introduced in [11], given by

A I C = - 2 \hat{l} + 2 k

where

\hat{l}

is the maximized log–likelihood and k is the number of parameters of the distribution under consideration.

The best model is deemed to be the one with the smallest AIC. For the data set in the example, we obtain

A I C_{E E} = 211.611

,

A I C_{E x p} = 212.012

,

A I C_{E E x p} = 213.966

, and

A I C_{W e i} = 212.939

. It follows that in terms of the AIC criteria, the epsilon–exponential shows the best performance when fitting these data. Figure 4 shows the fit of the different models used in the example. Figure 5 displays the three estimated survival functions for this data set.

Figure 4. The density functions of the fitted epsilon exponential, exponential, Weibull and exponentiated exponential distributions.

Figure 5. Fit of the survival functions: Kaplan–Meier estimator (solid line), exponential (dashed line red) and epsilon–exponential (dashed line blue) distributions.

5.2. Example 2: Recidivism Data

For the second example we use real data obtained from the official records of Gendarmerie of Chile on all inmates released from the Chilean prisons during 2007 after serving a sentence of imprisonment by robbery.

The data set contains records of 9477 inmates and the follow–up period from release until 30 April 2012. In this study, recidivism is defined to occur when a released prisoner goes back to prison for the original or any other offense.

Overall, 52.2% of the inmates in the cohort were convicted of one or more offenses and returned to prison within 64 months of release. About 11.8% of the cohort returned to prison within three months of release, and 30% returned within a year of release.

Table 4 shows the observed proportion of the cohort returning to prison within 1, 3, 6, 12, 18, 24, 36, 48 and 64 months of release. We observe that the cumulative proportion of recidivism grew quickly over the first 12 months after release, increasing by more than 7% every 3 months. After that, the percent increases were smaller and over longer periods.

Table 4. Observed recidivism by time after release.

To analyze the time to recidivism, we determined the number of days between an inmate’s release and his return to prison. Because some inmates did not reoffend, we have censored data and we used the EM algorithm described in Section 4 to fit an epsilon–exponential distribution to the time to recidivism. The maximized log–likelihood value for an assumed epsilon–exponential distribution is easily calculated to be −42,067.84. In comparison, we also fit an exponential distribution yielding a maximized log–likelihood value of −42,632.81.

Looking at the AIC values, we obtain

A I C_{E E}

= 84,139.68 and

A I C_{E x p}

= 85,267.62 for the epsilon–exponential and exponential model respectively, and therefore we conclude, the epsilon–exponential is a better model for these data, based on this criteria.

Finally, we also analyzed the survival time using the Kaplan–Meier estimator. Figure 5 displays the three estimated survival functions for this data set. We observe a close agreement between the Kaplan–Meier survival curve and the epsilon–exponential distribution.

6. Discussion

We introduced a new class of distributions with positive support called epsilon–positive which are generated from any distributions with positive support. This new class of distributions includes the exponential, Weibull, log–normal, etc. ones as special cases. We discussed a stochastic representation for this family, as well as parameter estimation, using the maximum likelihood approach via the Newton–Raphson. In addition, we showed that the elements of this new family can be expressed as a finite scale mixture of positive distributions, which facilitates the implementation of EM-type of algorithms.

We then focused on particular member of this family, called epsilon–exponential distribution, and discuss its applicability in the analysis of survival and reliability data. In this context, we considered the censored data case, and we show how we can use this new family to analyze this type of data sets. For the new class of distributions and, in particular, for the epsilon–exponential distribution we estimate the model parameters using the EM algorithm. An interesting feature of the hazard function of the epsilon–exponential distribution is that is not constant at difference of the exponential one. This feature increases the flexibility of the models allowing its use in a broader range of scenarios.

This greater flexibility is corroborated in the two examples considered in this paper where the AIC criteria shows a better performance our proposed epsilon–exponential model when compared to commonly used alternatives such as the exponential one. The results suggests that the epsilon–exponential distribution should be considered to be a legitimate alternative for the analysis of survival and reliability data in both the censored and uncensored case.

Author Contributions

Conceptualization, R.d.l.C., H.W.G.; methodology, P.C., R.d.l.C., C.F. and H.W.G.; software, P.C.; validation, R.d.l.C. and H.W.G.; formal analysis, P.C.; investigation, P.C., R.d.l.C., C.F. and H.W.G.; resources, R.d.l.C. and H.W.G.; data curation, P.C. and R.d.l.C.; writing–original draft preparation, P.C., R.d.l.C., C.F. and H.W.G.; writing–review and editing, P.C., R.d.l.C., C.F. and H.W.G.; visualization, P.C.; supervision, R.d.l.C. and H.W.G.; project administration, R.d.l.C. and H.W.G.; funding acquisition, R.d.l.C. and H.W.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by ANID FONDECYT grant number 1181662 & Anillo ACT–87 project, and grant SEMILLERO UA-2021 (Chile).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

The authors would like to thank Gendarmerie of Chile for sharing the data on recidivism used in the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. The EE–MLE

For a sample of n independent identically distributed (i.i.d.) observations

{\tilde{X}}_{n} = (X_{1}, \dots, X_{n})

from

E E (σ, ε)

,

θ = {(σ, ε)}^{'}

, has to be estimated from the data. The log–likelihood function is given by

\begin{matrix} ℓ = ℓ (σ, ε; {\tilde{X}}_{n}) & = - n log (2 σ) \\ + \sum_{i = 1}^{n} log (e^{- x_{i} / (1 + ε) σ} + e^{- x_{i} / (1 - ε) σ}) . \end{matrix}

(A1)

Differentiating (A1) with respect to

σ

and

ε

and equating to 0 respectively, we obtain the likelihood equations

\begin{matrix} \frac{\partial ℓ}{\partial σ} & = & \frac{- n}{σ} + \frac{1}{σ^{2} (1 + ε) (1 - ε)} \sum_{i = 1}^{n} x_{i} a_{i} (σ, ε) = 0 \\ \frac{\partial ℓ}{\partial ε} & = & \frac{1}{σ {(1 + ε)}^{2} {(1 - ε)}^{2}} \sum_{i = 1}^{n} x_{i} b_{i} (σ, ε) = 0, \end{matrix}

where

a_{i} (σ, ε) = \frac{(1 - ε) e^{- x_{i} / (1 + ε) σ} + (1 + ε) e^{- x_{i} / (1 - ε) σ}}{e^{- x_{i} / (1 + ε) σ} + e^{- x_{i} / (1 - ε) σ}}

(A2)

and

b_{i} (σ, ε) = \frac{{(1 - ε)}^{2} e^{- x_{i} / (1 + ε) σ} + {(1 + ε)}^{2} e^{- x_{i} / (1 - ε) σ}}{e^{- x_{i} / (1 + ε) σ} + e^{- x_{i} / (1 - ε) σ}} .

(A3)

Please note that no closed form solutions are available to obtain the MLEs of

σ

and

ε

, respectively. Therefore, the Newton–Raphson algorithm can be implemented. Let

u (θ) = \frac{\partial ℓ}{\partial θ}

and

H_{2 \times 2} (θ) = \frac{\partial^{2} ℓ}{\partial θ \partial θ^{'}}

for the epsilon–exponential distribution. Let

u (θ) = {(\partial ℓ / \partial σ, \partial ℓ / \partial ε)}^{'}

the vector of first derivatives. Define

H_{11} = \frac{\partial^{2} ℓ}{\partial σ^{2}}, H_{12} = \frac{\partial^{2} ℓ}{\partial σ \partial ε} and H_{22} = \frac{\partial^{2} ℓ}{\partial ε^{2}} .

The entries

H_{r s}

,

r, s = 1, 2

, of the symmetric matrix of second partial derivatives for the epsilon–exponential distribution are

\begin{matrix} H_{11} = \frac{\partial^{2} ℓ}{\partial σ^{2}} & = & \frac{n}{σ^{2}} - \frac{2}{σ^{3} (1 + ε) (1 - ε)} \sum_{i = 1}^{n} x_{i} a_{i} (σ, ε) \\ - \frac{1}{σ^{4} {(1 + ε)}^{2} {(1 - ε)}^{2}} \sum_{i = 1}^{n} x_{i}^{2} a_{i}^{2} (σ, ε) + \sum_{i = 1}^{n} x_{i} b_{i} (σ, ε) . \end{matrix}

\begin{matrix} H_{22} = \frac{\partial^{2} ℓ}{\partial ε^{2}} & = & \frac{4 ε}{σ {(1 + ε)}^{3} {(1 - ε)}^{3}} \sum_{i = 1}^{n} x_{i} b_{i} (σ, ε) \\ + \frac{1}{σ^{2} {(1 + ε)}^{4} {(1 - ε)}^{4}} \sum_{i = 1}^{n} x_{i}^{2} h_{i} (σ, ε) \\ - \frac{1}{σ (1 + ε) (1 - ε)} \sum_{i = 1}^{n} x_{i}^{2} b_{i}^{2} (σ, ε) . \end{matrix}

\begin{matrix} H_{12} = \frac{\partial^{2} ℓ}{\partial ε \partial σ} & = & - \frac{1}{σ^{2} {(1 + ε)}^{2} {(1 - ε)}^{2}} \sum_{i = 1}^{n} x_{i} b_{i} (σ, ε) \\ + \frac{1}{σ^{2} {(1 + ε)}^{4} {(1 - ε)}^{4}} \sum_{i = 1}^{n} x_{i} c_{i} (σ, ε) \\ - \frac{1}{σ^{3} {(1 + ε)}^{6} {(1 - ε)}^{6}} \sum_{i = 1}^{n} x_{i}^{2} b_{i}^{2} (σ, ε), \end{matrix}

where

c_{i} (σ, ε) = \frac{{(1 - ε)}^{3} e^{- x_{i} / (1 + ε) σ} + {(1 + ε)}^{3} e^{- x_{i} / (1 - ε) σ}}{e^{- x_{i} / (1 + ε) σ} + e^{- x_{i} / (1 - ε) σ}},

and

h_{i} (σ, ε) = \frac{{(1 - ε)}^{4} e^{- x_{i} / (1 + ε) σ} + {(1 + ε)}^{4} e^{- x_{i} / (1 - ε) σ}}{e^{- x_{i} / (1 + ε) σ} + e^{- x_{i} / (1 - ε) σ}},

and the quantities

a_{i} (σ, ε)

and

b_{i} (σ, ε)

are defined in Equations (A2) and (A3), respectively.

The functions

u (θ)

and

H_{2 \times 2} (θ)

define the terms of the Newton–Raphson iteration equation given in (12). To implement the Newton–Raphson algorithm, we can use the moments estimates for

σ

and

ε

as starting values.

Next, we show that the EM–type algorithm describe in Section 2.5 can be implemented to find the MLEs of the parameters of the epsilon–exponential distribution.

Appendix B. An EM–Type Algorithm for the EE–MLE

In order to implement the EM algorithm to estimate the model parameters of the epsilon–exponential distribution we need to choose

g (u) = \frac{1}{σ} e^{- u / σ}

in the EM algorithm described in Section 2.5. Let

{\tilde{X}}_{n} = (X_{1}, \dots, X_{n})

be a random sample from

E E (σ, ε)

. The complete log–likelihood is

ℓ_{c} (σ, ε; Y) = \sum_{i = 1}^{n} \sum_{j = 1}^{m} w_{i j} (log (\frac{π_{ξ_{j}} (ε)}{ξ_{j}}) - \frac{x_{i}}{ξ_{j} σ} - log σ) .

Let

{\hat{θ}}^{(s)} = {(σ^{(s)}, ε^{(s)})}^{'}

be the estimate of

θ = {(σ, ε)}^{'}

at iteration s, and denote by

Q (θ, {\hat{θ}}^{(s)})

the conditional expectation of

ℓ_{c} (θ; Y)

given the observed data

{\tilde{X}}_{n}

and

{\hat{θ}}^{(s)}

. We obtain

\begin{matrix} Q (θ, {\hat{θ}}^{(s)}) & = E (ℓ_{c} (θ; Y) | {\tilde{X}}_{n}, {\hat{θ}}^{(s)}) \\ = \sum_{i = 1}^{n} \sum_{j = 1}^{m (s)} w_{i j}^{(s)} (log (\frac{π_{ξ_{j}} (ε)}{ξ_{j}}) - \frac{x_{i}}{ξ_{j} σ}) - n log σ, \end{matrix}

where

w_{i j}^{(s)} = \frac{π_{ξ_{j}} (ε^{(s)}) \frac{1}{ξ_{j}} e^{- x_{i} / ξ_{j} σ^{(s)}}}{\sum_{j = 1}^{m (s)} π_{ξ_{j}} (ε^{(s)}) \frac{1}{ξ_{j}} e^{- x_{i} / ξ_{j} σ^{(s)}}}

. Therefore, the iteration s of the algorithm takes the form:

E–step: For $i = 1, \dots, n$ , compute

$w_{i j}^{(s)} = \frac{e^{- x_{i} / (1 + ε^{(s)}) σ^{(s)}}}{e^{- x_{i} / (1 + ε^{(s)}) σ^{(s)}} + e^{- x_{i} / (1 - ε^{(s)}) σ^{(s)}}}, ξ_{j} = 1 + ε .$
M–step: Given $ε^{(s)}$ and $σ^{(s)}$ , compute

$\begin{matrix} ε^{(s + 1)} & = \frac{2}{n} \sum_{i = 1}^{n} w_{i j}^{(s)} - 1, ξ_{j} = 1 + ε, \\ σ^{(s + 1)} & = \frac{1}{n} \sum_{i = 1}^{n} \sum_{j = 1}^{m (s)} w_{i j}^{(s)} (\frac{x_{i}}{ξ_{j}}) . \end{matrix}$

References

Mudholkar, G.S.; Srivastava, D.K.; Kollia, G.D. A generalization of the weibull distribution with application to the analysis of survival data. J. Am. Stat. Assoc. 1996, 91, 1575–1583. [Google Scholar] [CrossRef]
Gupta, R.D.; Kundu, D. Exponentiated exponential family: An alternative to gamma and weibull distributions. Biom. J. 2001, 43, 117–130. [Google Scholar] [CrossRef]
Cooray, K. Analyzing lifetime data with long-tailed skewed distribution: The logistic-sinh family. Stat. Model. 2005, 5, 343–358. [Google Scholar] [CrossRef]
Fernández, C.; Steel, M.F. On Bayesian modeling of fat tails and skewness. J. Am. Stat. Assoc. 1998, 93, 359–371. [Google Scholar]
Mudholkar, G.S.; Hutson, A.D. The epsilon skew normal distribution for analyzing near normal data. J. Statist. Plann. Inference 2000, 83, 291–309. [Google Scholar] [CrossRef]
Arellano-Valle, R.B.; Gómez, H.W.; Quintana, F.A. Statistical inference for a general class of asymmetric distributions. J. Statist. Plann. Inference 2005, 128, 427–443. [Google Scholar] [CrossRef]
Jones, M. A note on rescalings, reparametrizations and classes of distributions. J. Statist. Plann. Inference 2006, 136, 3730–3733. [Google Scholar] [CrossRef]
Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum likelihood from incomplete data via the em algorithm. J. R. Stat. Soc. Ser. B Stat. Methodol. 1977, 39, 1–38. [Google Scholar]
Givens, G.H.; Hoeting, J.A. Computational Statistics, 2nd ed.; John Wiley & Sons: New York, NY, USA, 2012. [Google Scholar]
Chhikara, R.S.; Folks, J.L. The inverse gaussian distribution as a lifetime model. Technometrics 1977, 19, 461–468. [Google Scholar] [CrossRef]
Akaike, H. A new look at the statistical model identification. IEEE Trans. Automat. Contr. 1974, 19, 716–723. [Google Scholar] [CrossRef]

Figure 1. Examples of the probability density

f (x)

, survival

S (x)

and hazard

λ (x)

functions of epsilon-exponential distribution,

E E (σ, ε)

, and epsilon-Weibull distribution,

E W (α, σ, ε)

. Please note that the exponential and Weibull distributions correspond to the case

ε = 0

.

Figure 2. Examples of the probability density

f (x)

, survival

S (x)

and hazard

λ (x)

functions of epsilon-log-logistic distribution,

E L L (σ, ε)

, and epsilon-gamma distribution,

E G (α, σ, ε)

. Please note that the log-logistic and gamma distributions correspond to the case

ε = 0

.

Figure 3. Skewness (CS) and kurtosis (CK) coefficientes for

X \sim E E (σ, ε)

.

Figure 4. The density functions of the fitted epsilon exponential, exponential, Weibull and exponentiated exponential distributions.

Figure 5. Fit of the survival functions: Kaplan–Meier estimator (solid line), exponential (dashed line red) and epsilon–exponential (dashed line blue) distributions.

Table 1. Hazard rate,

λ (\cdot)

, Survival,

S (\cdot)

, and density,

f (\cdot)

, functions of some probability models that can be generalized using the definition in (2) In the table

I (a, β) = \int_{0}^{a} Γ {(β)}^{- 1} u^{β - 1} e^{- u} d u

.

Table 1. Hazard rate,

λ (\cdot)

, Survival,

S (\cdot)

, and density,

f (\cdot)

, functions of some probability models that can be generalized using the definition in (2) In the table

I (a, β) = \int_{0}^{a} Γ {(β)}^{- 1} u^{β - 1} e^{- u} d u

.

	$λ (y)$	$S (y)$	$f (y)$
Exponential	$\frac{1}{σ} (> 0)$	$exp (- \frac{y}{σ})$	$\frac{1}{σ} exp (- \frac{y}{σ})$
Weibull	$(\frac{β}{σ^{β}}) y^{(β - 1)} (β, σ > 0)$	$exp (- {[\frac{y}{σ}]}^{β})$	$\frac{β y^{(β - 1)}}{σ^{β}} exp (- {[\frac{y}{σ}]}^{β})$
Log-logistic	$\frac{(\frac{β}{σ}) {(\frac{y}{σ})}^{β - 1}}{1 + {(\frac{y}{σ})}^{β}} (β, σ > 0)$	${(1 + {[\frac{y}{σ}]}^{β})}^{- 1}$	$\frac{(\frac{β}{σ}) {(\frac{y}{σ})}^{β - 1}}{{(1 + {(\frac{y}{σ})}^{β})}^{2}}$
Gamma	$\frac{f (y)}{S (y)}$	$1 - I (y / σ, β)$	$\frac{y^{(β - 1)} exp (- \frac{y}{σ})}{σ^{β} Γ (β)}$

Table 2. Summary of simulation results.

True Value		n	$σ$		$ε$
$σ$	$ε$	n	Estimate	SD	Estimate	SD
		n = 50	0.286	0.047	0.376	0.126
		n = 100	0.292	0.037	0.350	0.127
	0.3	n = 200	0.292	0.030	0.335	0.118
		n = 500	0.295	0.022	0.317	0.106
		n = 1000	0.299	0.017	0.302	0.089
		n = 50	0.287	0.054	0.558	0.133
		n = 100	0.290	0.043	0.542	0.129
0.3	0.5	n = 200	0.294	0.037	0.527	0.123
		n = 500	0.297	0.030	0.512	0.111
		n = 1000	0.298	0.023	0.506	0.091
		n = 50	0.299	0.059	0.818	0.126
		n = 100	0.299	0.047	0.809	0.122
	0.8	n = 200	0.301	0.038	0.801	0.108
		n = 500	0.303	0.029	0.794	0.088
		n = 1000	0.302	0.022	0.794	0.069
		n = 50	0.476	0.079	0.379	0.125
		n = 100	0.484	0.062	0.355	0.125
	0.3	n = 200	0.486	0.049	0.337	0.121
		n = 500	0.494	0.037	0.316	0.107
		n = 1000	0.497	0.029	0.303	0.088
		n = 50	0.477	0.088	0.558	0.133
		n = 100	0.484	0.074	0.542	0.131
0.5	0.5	n = 200	0.488	0.059	0.528	0.124
		n = 500	0.494	0.048	0.512	0.109
		n = 1000	0.498	0.039	0.502	0.091
		n = 50	0.484	0.104	0.838	0.129
		n = 100	0.499	0.076	0.809	0.119
	0.8	n = 200	0.503	0.064	0.799	0.110
		n = 500	0.505	0.049	0.793	0.089
		n = 1000	0.504	0.038	0.793	0.070
		n = 50	0.762	0.125	0.374	0.124
		n = 100	0.771	0.099	0.356	0.125
	0.3	n = 200	0.779	0.080	0.336	0.120
		n = 500	0.789	0.058	0.314	0.105
		n = 1000	0.796	0.046	0.301	0.089
		n = 50	0.765	0.139	0.555	0.134
		n = 100	0.771	0.116	0.543	0.134
0.8	0.5	n = 200	0.782	0.096	0.528	0.123
		n = 500	0.789	0.076	0.515	0.110
		n = 1000	0.797	0.062	0.503	0.091
		n = 50	0.791	0.153	0.815	0.130
		n = 100	0.800	0.126	0.804	0.122
	0.8	n = 200	0.798	0.103	0.798	0.106
		n = 500	0.806	0.079	0.794	0.089
		n = 1000	0.807	0.060	0.793	0.070

Table 3. Repair times (in hours) of 46 transceivers.

0.2	0.3	0.5	0.5	0.5	0.5	0.6	0.6	0.7	0.7
0.7	0.8	0.8	1.0	1.0	1.0	1.0	1.1	1.3	1.5
1.5	1.5	1.5	2.0	2.0	2.2	2.5	2.7	3.0	3.0
3.3	3.3	4.0	4.0	4.5	4.7	5.0	5.4	5.4	7.0
7.5	8.8	9.0	10.3	22.0	24.5

Table 4. Observed recidivism by time after release.

Time after Release	% Observed Recidivists
1 month	4.7%
3 month	11.8%
6 month	19.9%
12 month	30.0%
18 month	35.9%
24 month	40.8%
36 month	46.6%
48 month	50.4%
64 month	52.2%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Survival and Reliability Analysis with an Epsilon-Positive Family of Distributions with Applications

Abstract

1. Introduction

2. The Epsilon–Positive Family

2.1. Reliability Properties

2.2. Mean Residual Life

2.3. Stress-Strength Parameter

2.4. Maximum Likelihood Estimation

2.5. MLE via the EM Algorithm

3. The Epsilon–Exponential Distribution

Numerical Experiments

4. Survival and Reliability Analysis

4.1. Estimation Using the EM Algorithm

EM Algorithm for the Epsilon-Exponential Distribution

5. Real Data Examples

5.1. Example 1: Maintenance Data

5.2. Example 2: Recidivism Data

6. Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. The EE–MLE

Appendix B. An EM–Type Algorithm for the EE–MLE

References

Article Metrics

Citations

Article Access Statistics