Slash Truncation Positive Normal Distribution and Its Estimation Based on the EM Algorithm

Héctor J. Gómez; Diego I. Gallardo; Karol I. Santoro

doi:10.3390/sym13112164

,

and

¹

Departamento de Ciencias Matemáticas y Físicas, Facultad de Ingeniería, Universidad Católica de Temuco, Temuco 4780000, Chile

²

Departamento de Matemática, Facultad de Ingeniería, Universidad de Atacama, Copiapó 1530000, Chile

³

Departamento de Matemática, Facultad de Ciencias, Universidad Católica del Norte, Antofagasta 1240000, Chile

^*

Author to whom correspondence should be addressed.

Symmetry2021, 13(11), 2164;https://doi.org/10.3390/sym13112164

This article belongs to the Special Issue Symmetric and Asymmetric Distributions: Theoretical Developments and Applications Ⅲ

Version Notes

Order Reprints

Abstract

In this paper, we present an extension of the truncated positive normal (TPN) distribution to model positive data with a high kurtosis. The new model is defined as the quotient between two random variables: the TPN distribution (numerator) and the power of a standard uniform distribution (denominator). The resulting model has greater kurtosis than the TPN distribution. We studied some properties of the distribution, such as moments, asymmetry, and kurtosis. Parameter estimation is based on the moments method, and maximum likelihood estimation uses the expectation-maximization algorithm. We performed some simulation studies to assess the recovery parameters and illustrate the model with a real data application related to body weight. The computational implementation of this work was included in the tpn package of the R software.

Keywords:

slash distribution; half-normal distribution; EM algorithm; tpn package

1. Introduction

The modeling of non-negative data has grown exponentially, since many datasets have this characteristic. Distributions with support in the positive line are used widely in the engineering and reliability fields related to failure time (also known as lifetime data). The half-normal distribution (HN) is a very well-known model for non-negative data, discussed extensively in the literature. For instance, Rafiqullah et al. [1] used the HN model to analyze survival data related to breast cancer in Hispanic black and non-Hispanic black women. Bosch-Badia et al. [2] studied the applicability of the HN distribution to risk analysis traditionally performed using risk matrices. Tsizhmovska et al. [3] analyzed the length of sentences where one of their distributions was the HN.

Olmos et al. [4] generated an extension of the HN distribution, obtaining a distribution that captures atypical data, but with little flexibility, called the slashed half-normal distribution (SHN). Cooray and Ananda [5] generalized the HN distribution, obtaining a new flexible model, which they denominated the generalized half-normal (GHN) distribution, which includes the HN model as a particular case. Despite the flexibility offered, a major difficulty appears, commonly related to limitations on the use of atypical data. To solve the obstacle, Olmos et al. [6] proposed an extension of the GHN model, named the slashed generalized half-normal (SGHN) distribution. The main aim of the authors was to generate a model with a higher kurtosis that allows better modeling of positive data in the presence of outliers. Other authors have worked on a similar idea, e.g., Iriarte et al. [7], Reyes et al. [8], Olmos et al. [9], Segovia et al. [10], and Astorga et al. [11].

Gómez et al. [12] truncated the normal distribution, conditioning it to positive values, i.e., if X has a normal distribution, the authors studied

X | X > 0

(see Jonhson et al. [13]), creating a distribution that they named the truncated positive normal distribution (TPN). A random variable (rv) Z follows a TPN distribution, denoted by

Z \sim T P N (σ, λ)

, if its probability density function (pdf) is given by:

\begin{matrix} f (z; σ, λ) & = & \frac{1}{σ Φ (λ)} ϕ (\frac{z}{σ} - λ), z > 0, \end{matrix}

(1)

where

ϕ

denotes the pdf of the standard normal model,

σ > 0

is a scale parameter, and

λ \in R

is a shape parameter.

On the other hand, the slash distribution is defined stochastically as the quotient between two independent rv, let us say Z and U, as follows:

X = \frac{Z}{U^{\frac{1}{q}}},

(2)

where

Z \sim N (0, 1)

and

U \sim U (0, 1)

are independent and

q > 0

is a shape parameter.

Olmos et al. [6] used this idea to propose an extension to the half-normal generalized model of Cooray and Ananda [1], called the slashed generalized half-normal distribution (SGHN). The density function of this rv is as follows:

f (z; σ, α, q) = \frac{q \sqrt{\frac{2^{q / α}}{π}} σ^{q} Γ (\frac{q + α}{2 α})}{z^{q + 1}} G (z^{2 α}, \frac{q + α}{2 α}, \frac{1}{2 σ^{2 α}}), z > 0,

(3)

where

σ, α, q > 0

and

G (\cdot; a)

is the cumulative distribution function (cdf) of the gamma distribution with shape parameter a and rate parameter one. We denote

Z \sim S G H N (σ, α, q)

.

The objective of this paper is to propose an extension of the model proposed by Gómez et al. [12] using the “slash” procedure, utilizing a TPN

(σ, λ)

rv in the numerator. Thus, the new model, which we call the slash truncated positive normal (STPN), will become a direct competitor model for SGHN, since it creates heavier tails and, moreover, allows the fitting of atypical data.

The paper is organized as follows. Section 2 presents the pdf of the STPN distribution and some properties such as moments, the hazard function, and the kurtosis coefficient. Section 3 studies the inference for the proposed model. In particular, we discuss the moments estimator and the expectation-maximization (EM) [14] algorithm to find the maximum likelihood estimator. In addition, we offer the observed Fisher information using Louis’ method [15]. Section 4 shows a simulation study to assess the recovery parameters. Section 5 conducts a real data application, where the STPN is compared with other proposals in the literature. Finally, Section 6 presents the conclusions of the manuscript.

2. The Slash Truncation Positive Normal Model

In this section, we describe the stochastic representation of the STPN model, its pdf, and some basic properties of the model.

2.1. Stochastic Representation and Particular Cases

Definition 1.

An rv Y has an STPN distribution with parameters σ, λ, and q if it can be represented as the ratio:

\begin{matrix} Y & = & \frac{Z}{U^{\frac{1}{q}}} \end{matrix}

(4)

where

U \sim U (0, 1)

and

Z \sim T P N (σ, λ)

are independent rvs,

σ > 0

,

λ \in R

, and

q > 0

. We denote it as

Y \sim S T P N (σ, λ, q) .

By construction, the following models are particular cases for the STPN distribution:

STPN $(σ, λ, q \to \infty) \equiv$ TPN $(σ, λ)$ ;
STPN $(σ, λ = 0, q) \equiv$ SHN $(σ, q)$ ;
STPN $(σ, λ = 0, q \to \infty) \equiv$ HN $(σ)$ .

Figure 1 summarizes the relationships among the STPN and its particular cases.

Figure 1. Particular cases for the STPN distribution.

2.2. Density Function

Proposition 1.

Let

Y \sim S T P N (σ, λ, q)

. Then, the pdf of Y is given by:

f_{Y} (y; σ, λ, q) = \frac{q}{σ Φ (λ)} \int_{0}^{1} w^{q} ϕ (\frac{y w}{σ} - λ) d w, y > 0,

(5)

where

σ > 0

is a scale parameter,

λ \in R

is a shape parameter, and

q > 0

is a parameter related to the kurtosis of the distribution.

Proof.

Using the representation in (4) and computing the Jacobian of the transformation for

Y = Z / U^{1 / q}

and

W = U^{1 / q}

, we obtain:

\begin{matrix} Z = Y W \\ U = W^{q} \end{matrix}\} \Rightarrow J = |\begin{matrix} \frac{\partial Z}{\partial Y} & \frac{\partial Z}{\partial W} \\ \frac{\partial U}{\partial Y} & \frac{\partial U}{\partial W} \end{matrix}| = |\begin{matrix} w & z \\ 0 & q w^{q - 1} \end{matrix}| = q w^{q} .

Therefore,

\begin{matrix} f_{Y, W} (y, w) & = | J | f_{Z, U} (y w, w^{q}) = q w^{q} f_{X} (y w) f_{U} (w^{q}), \\ = \frac{q}{σ Φ (λ)} w^{q} ϕ (\frac{y w}{σ} - λ), 0 < w < 1, z > 0 . \end{matrix}

Marginalizing with respect to variable W, we obtain the density function corresponding to the rv Y, that is,

\begin{matrix} f_{Y} (y; σ, λ, q) & = & \frac{q}{σ Φ (λ)} \int_{0}^{1} w^{q} ϕ (\frac{y w}{σ} - λ) d w . \end{matrix}

An alternative way to obtain this pdf is by substituting

u = \frac{y w}{σ} - λ

, obtaining:

f_{Y} (y; σ, λ, q) = \frac{q σ^{q}}{{(2 π)}^{1 / 2} Φ (λ) y^{q + 1}} \int_{- λ}^{y / σ - λ} {(u + λ)}^{q} e^{- u^{2} / 2} d u .

With

t = \frac{u^{2}}{2}

in the last expression, we obtain:

f_{Y} (y; σ, λ, q) = \frac{q σ^{q}}{{(2 π)}^{\frac{1}{2}} Φ (λ) y^{q + 1}} \sum_{K = 0}^{\infty} (\binom{q}{k}) λ^{q - k} 2^{\frac{k}{2} - \frac{1}{2}} Γ (\frac{k + 1}{2}) [G ({(\frac{y}{σ} - λ)}^{2}, \frac{k + 1}{2}, 1) - G (\frac{λ^{2}}{2}, \frac{k + 1}{2}, 1)] .

□

2.3. Some Properties

In this section, we study some basic properties of the STPN distribution.

Proposition 2.

Let

Y \sim S T P N (σ, λ, q)

. Then, the cdf of Y is given by:

F_{Y} (y; σ, λ, q) = \frac{q}{Φ (λ)} \int_{0}^{1} w^{q - 1} (Φ (\frac{y w}{σ} - λ) + Φ (λ) - 1) d w, y > 0

Proof.

It is immediate from the definition. □

Proposition 3.

Let

Y \sim S T P N (σ, λ, q)

. Then, the hazard function is given by:

H_{Y} (y; σ, λ, q) = \frac{\frac{q}{σ Φ (λ)} \int_{0}^{1} w^{q} ϕ (\frac{y w}{σ} - λ) d w}{1 - \frac{q}{Φ (λ)} \int_{0}^{1} w^{q - 1} (Φ (\frac{y w}{σ} - λ) + Φ (λ) - 1) d w}, y > 0

Figure 2 shows the pdf, cdf, and hazard function for the STPN model with different combinations of parameters.

Figure 2. pdf, cdf, and hazard function for the STPN

(σ = 1, λ = 2, q)

model with different combinations of q and the STPN

(σ = 1, λ, q = 2)

model with different combinations of

λ

. (a) pdf of STPN

(σ = 1, λ = 2, q)

. (b) pdf of STPN

(σ = 1, λ, q = 2)

. (c) cdf of STPN

(σ = 1, λ = 2, q)

. (d) cdf of STPN

(σ = 1, λ, q = 2)

. (e) hazard function of STPN

(σ = 1, λ = 2, q)

. (f) hazard function of STPN

(σ = 1, λ, q = 2)

.

Proposition 4.

Let

Y \sim S T P N (σ, λ, q)

. If

q \to + \infty

, then Y strongly converges to the rv

Z \sim T P N (σ, λ)

.

Proof.

Let

Y \sim STPN (σ, λ, q)

. Then, Y can be written as

Y = Z / U^{1 / q}

, where

Z \sim TPN (σ, λ)

and

U \sim U (0, 1)

. First, we studied the convergence in the probability of

U^{1 / q}

. It is clear that

W = U^{1 / q}

, then

W \sim B e t a (q, 1)

, so that

E {(W - 1)}^{2} = \frac{2}{(q + 1) (q + 2)}

. If

q \to + \infty

, then

E {(W - 1)}^{2} \to 0

. Therefore,

W = U^{1 / q} \overset{P}{\to} 1, as q \to + \infty,

where

\overset{P}{\to}

denotes convergence in probability. Then, applying Slutsky’s theorem [16] to

Y = Z / U^{1 / q}

, we have:

Y \overset{D}{\to} Z, as q \to + \infty,

where

\overset{D}{\to}

denotes convergence in the distribution. In other words, for greater q values, Y strongly converges to the

T P N (σ, λ)

distribution.

□

Proposition 5.

If

Y | T = t \sim T P N (σ t^{- 1 / q}, λ)

and

T \sim U (0, 1)

, then

Y \sim S T P N (σ, λ, q) .

Proof.

The marginal distribution of Y can be computed as:

f_{Y} (y; σ, λ, q) = \int_{0}^{1} f_{Y | T} (y | t) f_{T} (t) d t = \int_{0}^{1} \frac{1}{σ Φ (λ) t^{- 1 / q}} ϕ (\frac{y}{σ t^{- 1 / q}} - λ) d t .

With the transformation

w = t^{1 / q}

, we obtain Equation (5). □

Remark 1.

Proposition 4 implies that for

q \to + \infty

, the pdf of the STPN distribution converges to the pdf of the TPN model. Proposition 5 shows that the STPN distribution can also be seen as a scale mixture of the TPN model. This property is very important for obtaining random values from this model and for the application of an EM-type algorithm to estimate the parameters of the model.

2.4. Moments

The following proposition provides the moments of the STPN distribution.

Proposition 6.

Let

Y \sim S T P N (σ, λ, q)

. Therefore, for

r = 1, 2, \dots

and

q > r

, the r-th moment of Y is given by:

μ_{r} = E (Y^{r}) = \frac{q σ^{r}}{q - r} κ_{r} (λ),

where

κ_{r} (λ) = \frac{1}{\sqrt{2 π} Φ (λ)} \sum_{k = 0}^{t} (\binom{r}{k}) λ^{r - k} 2^{(k - 1) / 2} Γ ((k + 1), λ^{2} / 2)

.

Proof.

Using the stochastic representation given in Equation (4), we have that:

μ_{r} = E (Y^{r}) = E (Z^{r} U^{- r / q}) = E (Z^{r}) E (U^{- r / q}),

where

E (U^{- r / q}) = \frac{q}{q - r}

,

q > r

, and

E (Z^{r}) = \frac{σ^{r}}{\sqrt{2 π} Φ (λ)} \sum_{k = 0}^{t} (\binom{r}{k}) λ^{r - k} 2^{(k - 1) / 2} Γ ((k + 1), λ^{2} / 2)

are the moments of the TPN

(σ, λ)

model. □

Corollary 1.

If

Y \sim S T P N (σ, λ, q)

, then its first four moments are determined as follows:

$μ_{1} = E (Y) = \frac{q σ}{q - 1} κ_{1} (λ), q > 1;$
$μ_{2} = E (Y^{2}) = \frac{q σ^{2}}{q - 2} κ_{2} (λ), q > 2;$
$μ_{3} = E (Y^{3}) = \frac{q σ^{3}}{q - 3} κ_{3} (λ), q > 3;$
$μ_{4} = E (Y^{4}) = \frac{q σ^{4}}{q - 4} κ_{4} (λ), q > 4 .$

V a r (Y) = σ^{2} q (\frac{1}{q - 2} κ_{2} (λ) - \frac{q}{{(q - 1)}^{2}} κ_{1}^{2} (λ)), q > 2 .

(6)

Proof.

It is immediate from Proposition 6. □

Corollary 2.

Let

Y \sim S T P N (σ, λ, q)

, then the asymmetry coefficient

(\sqrt{β_{1}})

and the kurtosis coefficient

(β_{2})

are:

\sqrt{β_{1}} = \frac{\sqrt{q - 2} {{(q - 1)}^{3} (q - 2) κ_{3} - 3 q κ_{1} κ_{2} {(q - 1)}^{2} (q - 3) + 2 q^{2} (q - 2) (q - 3) κ_{1}^{3}}}{\sqrt{q} (q - 3) {{(q - 1)}^{2} κ_{2} - q (q - 2) κ_{1}^{2}}^{3 / 2}}, q > 3,

(7)

β_{2} = \frac{{(q - 1)}^{3} {(q - 2)}^{2} A + 3 (q - 2) (q - 3) (q - 4) q^{2} B}{q (q - 3) (q - 4) {{(q - 1)}^{2} κ_{2} - q (q - 2) κ_{1}^{2}}^{2}}, q > 4

(8)

where

A = (q - 3) (q - 1) κ_{4} - 4 q (q - 1) (q - 4) κ_{1} κ_{3}

and

B = 2 {(q - 1)}^{2} κ_{1}^{2} κ_{2} - q (q - 2) κ_{1}^{4}

.

Proof.

By the definition of the asymmetry and kurtosis coefficients, we have:

\sqrt{β_{1}} = \frac{μ_{3} - 3 μ_{2} μ_{1} + 2 μ_{1}^{3}}{{(μ_{2} - μ_{1}^{2})}^{3 / 2}} a n d β_{2} = \frac{μ_{4} - 4 μ_{1} μ_{3} + 6 μ_{1}^{2} μ_{2} - 3 μ_{1}^{4}}{{(μ_{2} - μ_{1}^{2})}^{2}} .

Replacing

μ_{1}, μ_{2}, μ_{3}

, and

μ_{4}

obtained in Corollary 1, we have the result. □

Remark 2.

Proposition 6 shows that the moments of the

S T P N

distribution depend essentially on the moments of the

T P N

distribution. Equations (6) and (8) show the effect of parameter q on the model; a lower value of q produces greater variance and kurtosis. Table 1 shows some values of the kurtosis coefficient of the

S T P N

distribution for different values of λ and q.

Table 1. Some values for the kurtosis coefficients of the STPN distribution for different values of

λ

and q.

Figure 3 shows the mean, standard deviation, asymmetry coefficient, and kurtosis coefficient for the STPN

(σ = 1, λ, q)

in terms of

λ

and q.

Figure 3. (a) Mean; (b) standard deviation; (c) asymmetry coefficient; (d) kurtosis coefficient for the STPN(

λ, σ = 1, q

) model.

3. Inference

In this Section, we discuss a classical approach for the inference for the STPN distribution. In particular, we discuss the moments estimators and maximum likelihood (ML) estimation based on the EM algorithm.

3.1. Moments Estimators

The moments estimators result from the solution of the equation

E (Y^{j}) = \bar{Y^{j}}

, for

j = 1, 2, 3

, where

\bar{Y^{j}} = n^{- 1} \sum_{i = 1}^{n} y_{i}^{j}

denotes the j-th sample moment. Solving

E (Y) = \bar{Y}

, we have that:

\begin{matrix} σ & = \frac{(q - 1) \bar{Y}}{q (λ + ξ (λ))} . \end{matrix}

(9)

Replacing this, we have the following nonlinear equations

\begin{matrix} \bar{Y^{2}} & = & \frac{{\bar{Y}}^{2} {(q - 1)}^{2} (λ^{2} + λ ξ (λ) + 1)}{q (q - 2) {(λ + ξ (λ))}^{2}}, and \\ \bar{Y^{3}} & = & \frac{{\bar{Y}}^{3} {(q - 1)}^{3} (λ^{3} + λ^{2} ξ (λ) + 3 λ + 2 ξ (λ))}{q^{2} (q - 3) {(λ + ξ (λ))}^{3}} . \end{matrix}

These equations can be solved using different software. For instance, in R [17], we can use the nleqslv function to obtain the moments estimators

{\hat{λ}}_{M}

and

{\hat{q}}_{M}

. The moments estimator

{\hat{σ}}_{M}

is obtained by substitution in Equation (9).

3.2. Maximum Likelihood Estimation

Given

y_{1}, \dots, y_{n}

, a random sample from the STPN

(σ λ, q)

distribution, the log-likelihood function for

θ = (σ, λ, q)

is given by:

ℓ (θ) = n log (q) - n log (σ) - n log (Φ (λ)) + \sum_{i = 1}^{n} log (G (y_{i})),

where:

G (y_{i}) = G (y_{i}, σ, λ, q) = \int_{0}^{1} w^{q} ϕ (\frac{y_{i} w}{σ} - λ) d w .

Deriving in relation to the components of

θ

, we obtain the following ML equations:

\sum_{i = 1}^{n} \frac{G_{1} (y_{i})}{G (y_{i})} = \frac{n}{σ}, \sum_{i = 1}^{n} \frac{G_{2} (y_{i})}{G (y_{i})} = n ξ (λ), and - \sum_{i = 1}^{n} \frac{G_{3} (y_{i})}{G (y_{i})} = \frac{n}{q},

where

G_{1} (y_{i}) = \frac{\partial G (y_{i})}{\partial σ}

,

G_{2} (y_{i}) = \frac{\partial G (y_{i})}{\partial λ}

, and

G_{3} (y_{i}) = \frac{\partial G (y_{i})}{\partial q}

. For

j > 0

, we define:

\begin{matrix} a_{i} (j) & = a_{i} (σ, λ, j) = \int_{0}^{1} w^{j} ϕ (\frac{y_{i} w}{σ} - λ) d w, and \\ b_{i} (j) & = b_{i} (σ, λ, j) = \int_{0}^{1} w^{j} ln (w) ϕ (\frac{y_{i} w}{σ} - λ) d w . \end{matrix}

With those notations, the ML equations can also be written as:

\begin{matrix} \sum_{i = 1}^{n} \frac{y_{i}^{2}}{σ^{3}} \frac{a_{i} (q + 2)}{a_{i} (q)} - \sum_{i = 1}^{n} \frac{y_{i} λ}{σ^{2}} \frac{a_{i} (q + 1)}{a_{i} (q)} & = & \frac{n}{σ}, \\ \sum_{i = 1}^{n} \frac{y_{i}}{σ} \frac{a_{i} (q + 1)}{a_{i} (q)} - n λ & = & n ξ (λ), and \\ \sum_{i = 1}^{n} \frac{b_{i} (q)}{a_{i} (q)} & = & - \frac{n}{q} \end{matrix}

Taking

x_{i} = y_{i} / σ

,

ω_{1} (x_{i}) = \frac{a_{i} (q + 2)}{a_{i} (q)}

,

ω_{2} (x_{i}) = \frac{b_{i} (q)}{a_{i} (q)}

, and

ω_{3} (x_{i}) = \frac{a_{i} (q + 1)}{a_{i} (q)}

, the equations are equivalent to:

\sum_{i = 1}^{n} x_{i}^{2} ω_{1} (x_{i}) - λ \sum_{i = 1}^{n} x_{i} ω_{3} (x_{i}) = n, \sum_{i = 1}^{n} \frac{y_{i}}{σ} ω_{3} (x_{i}) - n λ = n ξ (λ), and \sum_{i = 1}^{n} ω_{2} (x_{i}) = - \frac{n}{q} .

The ML estimators can be obtained directly using numerical procedures. However, to increase the robustness of the procedure for obtaining those estimators, we also discuss an EM-type algorithm for estimation in the model.

3.3. EM Algorithm

The EM algorithm is a well-known tool for ML estimation in the presence of nonobserved (latent) data. For this particular problem, the algorithm takes advantage of the stochastic representation of the STPN model in Equation (4). Let

W = U^{1 / q}

. The representation of the model can be seen as

Y_{i} = Z_{i} / W_{i}

, where

W_{i} \sim

Beta

(q, 1)

.

In this context, the STPN distribution can also be written using the following hierarchical representation:

\begin{matrix} Y_{i} | W_{i} = w_{i} & \overset{i n d .}{\sim} & T P N (\frac{σ}{w_{i}}, λ), \\ W_{i} & \overset{i n d .}{\sim} & B e t a (q, 1), i = 1, \dots n . \end{matrix}

In our context,

y = {[y_{1}, \dots, y_{n}]}^{⊤}

and

w = {[w_{1}, \dots, w_{n}]}^{⊤}

represent the observed and nonobserved data, respectively. The complete data are given by

y_{c} = [y^{⊤}, w^{⊤}]

. We also denote

ℓ_{c} (θ | y_{c})

as the complete log-likelihood function, which up to a constant is given by:

\begin{matrix} ℓ_{c} (θ | y_{c}) = n [log q - log (σ) - log (Φ (λ))] - \frac{n}{2} λ^{2} - \sum_{i = 1}^{n} \frac{y_{i}^{2} w_{i}^{2}}{2 σ^{2}} + \frac{λ}{σ} \sum_{i = 1}^{n} y_{i} w_{i} + q \sum_{i = 1}^{n} log (w_{i}) . \end{matrix}

Note that

Q (θ | {\hat{θ}}^{(k)}) = E (ℓ_{c} (θ | y) | y, θ = {\hat{θ}}^{(k)})

; the expected value of

ℓ_{c} (θ)

provided the observed data is given by:

\begin{matrix} Q (θ | {\hat{θ}}^{(k)}) & = n [log q - log (σ) - log (Φ (λ))] - \frac{n}{2} λ^{2} - \sum_{i = 1}^{n} \frac{y_{i}^{2} {\hat{w_{i}^{2}}}^{(k)}}{2 σ^{2}} + \frac{λ}{σ} \sum_{i = 1}^{n} y_{i} {\hat{w_{i}}}^{(k)} \\ + q \sum_{i = 1}^{n} {\hat{log w_{i}}}^{(k)}, \end{matrix}

where

{\hat{w_{i}}}^{(k)} = E (w_{i} | y_{i}, θ = {\hat{θ}}^{(k)})

,

{\hat{w_{i}^{2}}}^{(k)} = E (w_{i}^{2} | y_{i}, θ = {\hat{θ}}^{(k)})

, and

{\hat{log w_{i}}}^{(k)} = E (log w_{i} | y_{i},

θ = {\hat{θ}}^{(k)})

. In our context,

{\hat{w}}_{i}^{(k)}

,

{\hat{w}}_{i}^{2}

^{(k)}

and

{\hat{log w_{i}}}^{(k)}

do not have a closed form; they therefore need to be computed numerically. In short, the k-th step of the EM algorithm is detailed as follows:

E-step: For ${\hat{θ}}^{(k)} = {({\hat{σ}}^{(k)}, {\hat{λ}}^{(k)}, {\hat{q}}^{(k)})}^{⊤}$ , the value for the vector of parameters at the k-step, compute ${\hat{w}}_{i}^{(k)}$ , ${\hat{w}}_{i}^{2}$ $^{(k)}$ , and ${\hat{log w_{i}}}^{(k)}$ , for $i = 1, \dots, n$ ;
CM-Step I: Given ${\hat{λ}}^{(k)}$ and ${\hat{w}}_{1}^{(k)}, \dots, {\hat{w}}_{n}^{(k)}$ , update $σ$ as follows:

${\hat{σ}}^{(k + 1)} = \frac{\sum_{i = 1}^{n} y_{i} {\hat{w}}_{i}^{(k)}}{n ξ ({\hat{λ}}^{(k)}) + n {\hat{λ}}^{(k)}};$
CM-Step II: Given ${\hat{σ}}^{(k + 1)}$ and ${\hat{w}}_{1}^{(k)}, \dots, {\hat{w}}_{n}^{(k)}$ , update $λ$ since the solution is obtained from the nonlinear equation.

$- \frac{\sum_{i = 1}^{n} y_{i} {\hat{w}}_{i}^{(k)}}{n} = ξ {({\hat{λ}}^{(k)})}^{2} + 3 ξ ({\hat{λ}}^{(k)}) {\hat{λ}}^{(k)} + 2 {\hat{λ}}^{2^{(k)}};$
CM-Step III: Given ${\hat{log w_{1}}}^{(k)}, \dots, {\hat{log w_{n}}}^{(k)}$ , update q as follows:

${\hat{q}}^{(k + 1)} = - \frac{n}{\sum_{i = 1}^{n} {\hat{log w_{i}}}^{(k)}} .$

The E-, CM-I, CM-II, and CM-III steps are repeated until an ad hoc criterion is satisfied. For instance, we considered

|ℓ ({\hat{θ}}^{(k + 1)}) - ℓ ({\hat{θ}}^{(k)})| < ϵ

, for a fixed

ϵ

. In other words, the difference in the observed log-likelihood for successive steps is lower than a determined value. The initial values for the algorithm can be obtained, for instance, using the

{\hat{σ}}_{M}

,

{\hat{λ}}_{M}

, and

{\hat{q}}_{M}

, moments estimators.

3.4. Observed Fisher Information Matrix

The variance of the estimators can be estimated based on the observed Fisher information matrix, say

I (θ) = - \partial^{2} ℓ (θ) / \partial θ \partial θ^{⊤}

. In particular, we have that:

\sqrt{n} I {(θ)}^{- 1} (θ - \hat{θ}) \overset{D}{\to} N_{3} (0_{3}, I_{3}), as n \to + \infty,

where

N_{3} (0_{3}, I_{3})

denotes the standard trivariate normal distribution. The computation of

I (θ)

is not trivial, because it involves the derivation of functions that depend on integrals. Taking advantage of the complete log-likelihood function,

I (θ)

can also be approximated by Louis’ method [14] as follows:

\begin{matrix} I (θ) & = \sum_{i = 1}^{n} E (B_{i} (θ) | y, θ = \hat{θ}) - \sum_{i = 1}^{n} E (S_{i} (θ) S_{i}^{⊤} (θ) | y, θ = \hat{θ}) \\ + \underset{\begin{matrix} 1 \leq i, j \leq n \\ i \neq j \end{matrix}}{\sum \sum} E (S_{i} (θ) ∣ y, θ = \hat{θ}) E^{⊤} (S_{j} (θ) ∣ y, θ = \hat{θ}) . \end{matrix}

The details of the components of

I (θ)

are provided in the Appendix A.

3.5. Computational Aspects

The EM algorithm and Louis’ method to obtain the ML estimators and their standard errors for the STPN distribution are included in the tpn package [18] from R [17]. The following function can be used to obtain these results:

est.stpn(y, sigma0=NULL, lambda0=NULL, q0=NULL, prec = 0.001, max.iter = 1000)

where y is the response variable, sigma0, lambda0, q0 are the initial values for the algorithm (they are not defined by default), prec is the precision for the parameters, and max.iter is the maximum number of iterations to be applied for the algorithm. The tpn package also includes the functions dstpn, pstpn, and rstpn, which compute the pdf, cdf, and generation for the STPN distribution.

4. Simulation

In this section, we study the performance of the ML estimators using the EM algorithm for the STPN distribution under different scenarios. We considered two values for

σ

: 2 and 10; three values for

λ

: −1, 1, and 3; two values for q:

- 1.5

and 3; and four sample sizes: 50, 100, 200, and 500. For each combination of

σ, λ, q

and n (totaling 48 combinations), we drew 1000 replicates, and we used the tpn package to estimate the parameters based on the EM algorithm and estimated the standard deviations based on Louis’ method to estimate the observed Fisher information matrix. Table 2 summarizes the mean of the estimated bias for the 1000 replicates (bias), the mean of the standard errors (SEs), the root of the estimated mean-squared error (RMSE), and the estimated coverage probability based on the asymptotic distribution for the ML estimator using a 95% confidence level (CP). Note that the bias and the RMSE terms are reduced when the sample size is increased, suggesting that the estimators are consistent even in finite samples. The SE and RMSE terms are closer when the sample size is increased, suggesting that the standard errors are also consistently estimated. Finally, the CP terms converges to the nominal value when the sample size is increased, suggesting that the asymptotic distribution of the ML estimators also works well in finite samples.

Table 2. Recovery parameters for the STPN distribution based on 1000 replicates for different combinations of parameters and sample size.

5. Application

In this section, we present a real data application in order to illustrate the performance of the STPN model in comparison with other proposals in the literature. For this, a comparison was conducted utilizing the TPN distribution and the model proposed by Gómez et al. [19], which is a generalization of a TPN model, denominated the generalized TPN (GTPN). The density function of the GPTN model is given by:

f (y; σ, λ, α) = \frac{α}{σ^{α} Φ (λ)} y^{α - 1} ϕ ({(\frac{y}{σ})}^{α} - λ),

with

x > 0

,

σ, α > 0

, and

λ \in R

.

A real dataset of body fat was considered, which measured weight and various body circumferences (see http://lib.stat.cmu.edu/datasets/bodyfat (accessed on 8 October 2021)); for examination purposes, the weight variable (measured in pounds (lbs)) was chosen to conduct the application. When calculating basic statistics (Table 3 shows basic statistics), high kurtosis can be observed for the variable, suggesting the use of a distribution with heavy tails as the STPN.

Table 3. Descriptive statistics for the

w e i g h t

dataset.

Table 4 shows the estimated parameters for the three models considered. Based on the AIC [20] and BIC [21], the STPN model provides a better fit. In addition, Figure 4 shows the histogram for the data and the estimated pdf for all the models, where a better performance of the STPN model is shown. In order to check the better fit of the STPN model in comparison with the rest of the models, we also computed the quantile residuals (QRs). If the model is appropriate for the data, the QRs should be a sample from the standard normal model. This assumption can be validated with traditional normality tests such as the Anderson–Darling (AD), Cramér–von Mises (CVM), and Shapiro–Wilkes (SW) tests. Figure 5 suggests that the STPN model provides a better fit for this dataset.

Table 4. Estimated parameters and their standard errors (in parentheses) for the STPN, TPN, and GTPN models for the

w e i g h t

dataset. The AIC and BIC are also presented.

Figure 4. Fit of the distributions for the

w e i g h t

dataset.

Figure 5. QRs for the fitted models in the

w e i g h t

dataset. The p-values for the AD, CVM, and SW normality tests are also presented to check if the QRs came from the standard normal distribution. (a) qq-plot STPN. (b) qq-plot TPN. (c) qq-plot GTPN.

6. Conclusions

This study presents a new distribution with positive support denominated the slash truncation positive normal. This distribution serves as a more general model compared to the TPN model, pursuing the increase of kurtosis in order to improve the modeling of positive databases with high kurtosis. The basic properties of the model were analyzed, and a simulation study was conducted implementing the EM algorithm. Finally, an application with real data was performed proving that the new model performs better than competing models.

Author Contributions

Conceptualization, H.J.G. and D.I.G.; Data curation, H.J.G.; Formal analysis, D.I.G. and K.I.S.; Investigation, K.I.S.; Methodology, H.J.G., D.I.G. and K.I.S.; Software, H.J.G. and D.I.G.; Supervision, D.I.G. All authors have read and agreed to the published version of the manuscript.

Funding

The research of Hector J. Gómez was supported by Proyecto de Investigación de Facultad de Ingeniería, Universidad Católica de Temuco, UCT-FDI032020.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in Section 5 were duly referenced.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

In this Appendix, we explain the terms involved in the observed Fisher information matrix presented in Section 3.4. Let

\hat{w_{i}} = E [w_{i} | y_{i}]

,

\hat{w_{i}^{2}} = E [w_{i}^{2} | y_{i}]

,

\hat{w_{i}^{3}} = E [w_{i}^{3} | y_{i}]

,

\hat{w_{i}^{4}} = E [w_{i}^{4} | y_{i}]

,

\hat{log (w_{i})} = E [log (w_{i}) | y_{i}]

,

\hat{{log}^{2} (w_{i})} = E [{log}^{2} (w_{i}) | y_{i}]

,

\hat{log {(w_{i})}^{*}} = E [w log (w_{i}) | y_{i}]

, and

\hat{log {(w_{i})}^{(2 *)}} = E [w_{i}^{2} log (w_{i}) | y_{i}]

.

We also define

B_{i} = E [B_{i} (θ) | y, θ = \hat{θ}]

,

D_{i} = E [S_{i} (θ) {S_{i}}^{⊤} (θ) ∣ y, θ = \hat{θ}]

, and

F_{i j} = E [S_{i} (θ) ∣ y, θ = \hat{θ}] E^{⊤} [S_{j} (θ) ∣ y, θ = \hat{θ}]

.

The elements of

B_{i}

are

B_{i, 1, 1} = \frac{1}{σ^{2}} - \frac{3 y_{i}^{2} \hat{w_{i}^{2}}}{σ^{4}} + \frac{2 λ \hat{w_{i}}}{σ^{3}}, B_{i, 1, 2} = B_{i, 2, 1} = - \frac{y_{i} \hat{w_{i}}}{σ^{2}}, B_{i, 1, 3} = B_{i, 3, 1} = 0, B_{i, 2, 2} = ξ (λ) [λ + ξ (λ)] - 1, B_{i, 2, 3} = B_{i, 3, 2} = 0

, and

B_{i, 3, 3} = - \frac{1}{q^{2}}

.

The elements of

D_{i}

are:

\begin{matrix} D_{i, 1, 1} & = & \frac{1}{α} - \frac{2 y_{i}^{2}}{σ^{4}} \hat{w_{i}^{2}} + \frac{2 λ}{σ^{3}} y_{i} \hat{w_{i}} + \frac{y_{i}^{4}}{σ^{6}} \hat{w_{i}^{4}} - \frac{2 λ y_{i}^{3}}{σ^{5}} \hat{w_{i}^{3}} + \frac{λ^{2} y_{i}^{2}}{σ^{4}} \hat{w_{i}^{2}}, \\ D_{i, 1, 2} & = & D_{i, 2, 1} = \frac{1}{σ} ξ (λ) + \frac{y_{i}}{σ} \hat{w_{i}} (λ^{2} - 1 + λ ξ (λ)) - \frac{y_{i}^{2}}{σ^{3}} \hat{w_{i}^{2}} (ξ (λ) + λ) + \frac{y_{i}^{3}}{σ^{4}} \hat{w_{i}^{3}} - \frac{y_{i}^{2} λ}{σ^{3}} \hat{w_{i}^{2}} + \frac{λ}{σ}, \\ D_{i, 1, 3} & = & D_{i, 3, 1} = - \frac{1}{σ q} - \frac{1}{σ} \hat{log (w_{i})} + \frac{y_{i}^{2}}{q σ^{3}} \hat{w_{i}^{2}} + \frac{y_{i}^{2}}{σ^{3}} \hat{log {(w_{i})}^{(2 *)}} - \frac{λ y_{i}}{σ^{2} q} \hat{w_{i}} - \frac{λ y_{i}}{σ^{2}} \hat{log {(w_{i})}^{*}}, \\ D_{i, 2, 3} & = & D_{i, 3, 2} = - \frac{ξ (λ)}{q} - ξ (λ) \hat{log (w_{i})} + \frac{y_{i}}{q σ} \hat{w_{i}} + \frac{y_{i}}{σ} \hat{log {(w_{i})}^{*}} - \frac{λ}{q} - λ \hat{log (w_{i})}, \\ D_{i, 2, 2} & = & ξ^{2} (λ) + \frac{y_{i}^{2}}{σ^{2}} \hat{w_{i}^{2}} + λ^{2} - \frac{2}{σ} ξ (λ) \hat{w_{i}} + 2 λ ξ (λ) - \frac{2 λ y_{i}}{σ} \hat{w_{i}}, a n d \\ D_{i, 3, 3} & = & \frac{1}{q^{2}} + \frac{2}{q} \hat{log (w_{i})} + \hat{{log}^{2} (w_{i})} . \end{matrix}

Finally, the elements of

F_{i j}

are given by:

\begin{matrix} F_{i, j, 1, 1} & = & \frac{1}{σ^{2}} + \frac{y_{i}^{2} y_{j}^{2}}{σ^{6}} \hat{w_{i}^{2}} \hat{w_{j}^{2}} + \frac{y_{i} y_{j} λ^{2}}{σ^{4}} \hat{w_{i}} \hat{w_{j}} - \frac{y_{i}^{2}}{σ^{4}} \hat{w_{i}^{2}} - \frac{y_{j}^{2}}{σ^{4}} \hat{w_{j}^{2}} - \frac{λ y_{j}^{2} y_{i}}{σ^{5}} \hat{w_{j}^{2}} \hat{w_{i}} - \frac{λ y_{i}^{2} y_{j}}{σ^{5}} \hat{w_{i}^{2}} \hat{w_{j}} + \frac{y_{i} λ}{σ^{3}} \hat{w_{i}} + \frac{y_{j} λ}{σ^{3}} \hat{w_{j}}, \\ F_{i, j, 1, 2} & = & \frac{1}{σ} ξ (λ) - \frac{y_{j}}{σ^{2}} \hat{w_{j}} + \frac{λ}{σ} - \frac{y_{i}^{2}}{σ^{3}} ξ (λ) \hat{w_{i}^{2}} + \frac{y_{i}^{2} y_{j}}{σ^{4}} \hat{w_{i}^{2}} \hat{w_{j}} - \frac{λ y_{i}^{2}}{σ^{3}} \hat{w_{i}^{2}} + \frac{y_{i} λ}{σ^{2}} ξ (λ) \hat{w_{i}} - \frac{y_{i} y_{j} λ}{σ^{3}} \hat{w_{i}} \hat{w_{j}} + \frac{λ^{2} y_{i}}{σ^{2}} \hat{w_{i}}, \\ F_{i, j, 1, 3} & = & - \frac{1}{σ q} + \frac{y_{i}^{2}}{q σ^{3}} \hat{w_{i}^{2}} - \frac{y_{i} λ}{q σ^{2}} \hat{w_{i}} - \frac{1}{σ} \hat{log (w_{i})} + \frac{y_{i}^{2}}{σ^{3}} \hat{w_{i}^{2}} \hat{log (w_{j})} - \frac{y_{i} λ}{σ^{2}} \hat{w_{i}} \hat{log (w_{j})}, \\ F_{i, j, 2, 1} & = & \frac{1}{σ} ξ (λ) - \frac{y_{i}}{σ^{2}} \hat{w_{i}} + \frac{λ}{σ} - \frac{y_{j}^{2}}{σ^{3}} ξ (λ) \hat{w_{j}^{2}} + \frac{y_{j}^{2} y_{i}}{σ^{4}} \hat{w_{j}^{2}} \hat{w_{i}} - \frac{λ y_{j}^{2}}{σ^{3}} \hat{w_{j}^{2}} + \frac{y_{j} λ}{σ^{2}} ξ (λ) \hat{w_{j}} - \frac{y_{j} y_{i} λ}{σ^{3}} \hat{w_{j}} \hat{w_{i}} + \frac{λ^{2} y_{j}}{σ^{2}} \hat{w_{j}}, \\ F_{i, j, 2, 2} & = & ξ^{2} (λ) + \frac{y_{i} y_{j}}{σ^{2}} \hat{w_{i}} \hat{w_{j}} + λ^{2} - \frac{y_{i}}{σ} ξ (λ) \hat{w_{i}} - \frac{y_{j}}{σ} ξ (λ) \hat{w_{j}} - \frac{y_{i} λ}{σ} \hat{w_{i}} - \frac{y_{j} λ}{σ} \hat{w_{j}} + 2 λ ξ (λ), \\ F_{i, j, 2, 3} & = & - \frac{ξ (λ)}{q} - ξ (λ) \hat{log (w_{j})} + \frac{y_{i}}{q σ} \hat{w_{i}} + \frac{y_{i}}{σ} \hat{w_{i}} \hat{log (w_{j})} - \frac{λ}{q} - λ \hat{log (w_{j})}, \\ F_{i, j, 3, 1} & = & - \frac{1}{σ q} + \frac{y_{j}^{2}}{q σ^{3}} \hat{w_{j}^{2}} - \frac{y_{j} λ}{q σ^{2}} \hat{w_{j}} - \frac{1}{σ} \hat{log (w_{j})} + \frac{y_{j}^{2}}{σ^{3}} \hat{w_{j}^{2}} \hat{log (w_{i})} - \frac{y_{j} λ}{σ^{2}} \hat{w_{j}} \hat{log (w_{i})}, \\ F_{i, j, 3, 2} & = & - \frac{ξ (λ)}{q} - ξ (λ) \hat{log (w_{i})} + \frac{y_{j}}{q σ} \hat{w_{j}} + \frac{y_{j}}{σ} \hat{w_{j}} \hat{log (w_{i})} - \frac{λ}{q} - λ \hat{log (w_{i})}, a n d \\ F_{i, j, 3, 3} & = & \frac{1}{q^{2}} + \hat{log (w_{j})} \hat{log (w_{i})} + \frac{1}{q} \hat{log (w_{i})} + \frac{1}{q} \hat{log (w_{j})} . \end{matrix}

Appendix B

In this section, we present the codes in R used to estimate the parameters for the STPN model in the real data application presented in Section 5.

require(tpn)

y<-c(154.25, 173.25, 154.00, 184.75, 184.25, 210.25, 181.00, 176.00, 191.00, 198.25,

186.25, 216.00, 180.50, 205.25, 187.75, 162.75, 195.75, 209.25, 183.75, 211.75,

179.00, 200.50, 140.25, 148.75, 151.25, 159.25, 131.50, 148.00, 133.25, 160.75,

182.00, 160.25, 168.00, 218.50, 247.25, 191.75, 202.25, 196.75, 363.15, 203.00,

262.75, 205.00, 217.00, 212.00, 125.25, 164.25, 133.50, 148.50, 135.75, 127.50,

158.25, 139.25, 137.25, 152.75, 136.25, 198.00, 181.50, 201.25, 202.50, 179.75,

216.00, 178.75, 193.25, 178.00, 205.50, 183.50, 151.50, 154.75, 155.25, 156.75,

167.50, 146.75, 160.75, 125.00, 143.00, 148.25, 162.50, 177.75, 161.25, 171.25,

163.75, 150.25, 190.25, 170.75, 168.00, 167.00, 157.75, 160.00, 176.75, 176.00,

177.00, 179.75, 165.25, 192.50, 184.25, 224.50, 188.75, 162.50, 156.50, 197.00,

198.50, 173.75, 172.75, 196.75, 177.00, 165.50, 200.25, 203.25, 194.00, 168.50,

170.75, 183.25, 178.25, 163.00, 175.25, 158.00, 177.25, 179.00, 191.00, 187.50,

206.50, 185.25, 160.25, 151.50, 161.00, 167.00, 177.50, 152.25, 192.25, 165.25,

171.75, 171.25, 197.00, 157.00, 168.25, 186.00, 166.75, 187.75, 168.25, 212.75,

176.75, 173.25, 167.00, 159.75, 188.15, 156.00, 208.50, 206.50, 143.75, 223.00,

152.25, 241.75, 146.00, 156.75, 200.25, 171.50, 205.75, 182.50, 136.50, 177.25,

151.25, 196.00, 184.25, 140.00, 218.75, 217.00, 166.25, 224.75, 228.25, 172.75,

152.25, 125.75, 177.25, 176.25, 226.75, 145.25, 151.00, 241.25, 187.25, 234.75,

219.25, 118.50, 145.75, 159.25, 170.50, 167.50, 232.75, 210.50, 202.25, 185.00,

153.00, 244.25, 193.50, 224.75, 162.75, 180.00, 156.25, 168.00, 167.25, 170.75,

178.25, 150.00, 200.50, 184.00, 223.00, 208.75, 166.00, 195.00, 160.50, 159.75,

140.50, 216.25, 168.25, 194.75, 172.75, 219.00, 149.25, 154.50, 199.25, 154.50,

153.25, 230.00, 161.75, 142.25, 179.75, 126.50, 169.50, 198.50, 174.50, 167.75,

147.75, 182.25, 175.50, 161.75, 157.75, 168.75, 191.50, 219.15, 155.25, 189.75,

127.50, 224.50, 234.25, 227.75, 199.50, 155.50, 215.50, 134.25, 201.00, 186.75,

190.75, 207.50)

est.stpn(y)

References

Rafiqullah, H.M.; Saxena, A.; Vera, V.; Abdool-Ghany, F.; Gabbidon, K.; Perea, N.; Shauna-Jeanne Stewart, T.; Ramamoorthy, V. Black Hispanic and Black Non-Hispanic Breast Cancer Survival Data Analysis with Half-normal Model Application. Asian Pac. J. Cancer Prev. 2014, 15, 9453–9458. [Google Scholar]
Bosch-Badia, M.T.; Montllor-Serrats, J.; Tarrazon-Rodon, M.A. Risk Analysis through the Half-Normal Distribution. Mathematics 2020, 8, 2080. [Google Scholar] [CrossRef]
Tsizhmovska, N.L.; Martyushev, L.M. Principle of Least Effort and Sentence Length in Public Speaking. Entropy 2021, 23, 1023. [Google Scholar] [CrossRef] [PubMed]
Olmos, N.M.; Varela, H.; Gómez, H.W.; Bolfarine, H. An extension of the half-normal distribution. Stat. Pap. 2012, 53, 875–886. [Google Scholar] [CrossRef]
Cooray, K.; Ananda, M.M.A. A generalization of the half-normal distribution with applications to lifetime data. Comm. Stat. Theory Methods 2007, 36, 1323–2157. [Google Scholar] [CrossRef]
Olmos, N.M.; Varela, H.; Gómez, H.W.; Bolfarine, H. An extension of the generalized half-normal distribution. Stat. Pap. 2014, 55, 967–981. [Google Scholar] [CrossRef]
Iriarte, Y.; Gómez, H.W.; Varela, H.; Bolfarien, H. Slashed Rayleigd distribution. Rev. Colomb. Estad. 2015, 38, 31–44. [Google Scholar] [CrossRef]
Reyes, J.; Barranco-Chamorro, I.; Gallardo, D.I.; Gómez, H.W. Generalized modified slash Birnbaum-Saunders distribution. Symmetry 2018, 10, 724. [Google Scholar] [CrossRef] [Green Version]
Olmos, N.M.; Osvaldo, V.; Gómez, Y.M.; Iriarte, Y.A. Confluent hypergeometric slashed-Rayleigh distribution: Properties, estimation and applications. J. Comput. Appl. Math. 2020, 328, 112548. [Google Scholar] [CrossRef]
Segovia, F.A.; Gómez, Y.M.; Venegas, O.; Gómez, H.W. A Power Maxwell Distribution with Heavy Tails and Applications. Mathematics 2020, 8, 1116. [Google Scholar] [CrossRef]
Astorga, J.M.; Reyes, J.; Santoro, K.I.; Venegas, O.; Gómez, H.W. A Reliability Model Based on the Incomplete Generalized Integro-Exponential Function. Mathematics 2020, 8, 1537. [Google Scholar] [CrossRef]
Gómez, H.J.; Olmos, N.M.; Varela, H.; Bolfarine, H. Inference for a truncated positive normal distribution. Appl. Math. J. Chin. Univ. 2018, 33, 163–176. [Google Scholar] [CrossRef]
Jonhson, N.L.; Kotz, S.; Balakrishnan, N. Continuos Univariate Distribution, 2nd ed.; Wiley: New York, NY, USA, 1995; Volume 2. [Google Scholar]
Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum Likelihood from Incomplete Data via the EM Algorithm. J. R. Stat. Soc. Ser. B 1977, 39, 1–38. [Google Scholar]
Louis, T.A. Finding the observed information matrix when using the EM algorithm. J. R. Stat. Soc. Ser. B Methodol. 1982, 44, 226–233. [Google Scholar]
Casella, G.; Berger, R.L. Statistical Inference; Duxbury: Pacific Grove, CA, USA, 2002. [Google Scholar]
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021; Available online: https://www.R-project.org/ (accessed on 8 October 2021).
Gallardo, D.I.; Gómez, H.J. tpn: Truncated Positive Normal Model and Extensions. R Package Version 1.0. 2021. Available online: https://cran.r-project.org/web/packages/tpn/index.html (accessed on 8 October 2021).
Gómez, H.J.; Gallardo, D.I.; Osvaldo, V. Generalized truncation positive normal distribution. Symmetry 2019, 11, 1361. [Google Scholar] [CrossRef] [Green Version]
Akaike, H. A new look at the statistical model identification. IEEE Trans. Auto Contr. 1974, 19, 716–723. [Google Scholar] [CrossRef]
Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]

Figure 1. Particular cases for the STPN distribution.

Figure 2. pdf, cdf, and hazard function for the STPN

(σ = 1, λ = 2, q)

model with different combinations of q and the STPN

(σ = 1, λ, q = 2)

model with different combinations of

λ

. (a) pdf of STPN

(σ = 1, λ = 2, q)

. (b) pdf of STPN

(σ = 1, λ, q = 2)

. (c) cdf of STPN

(σ = 1, λ = 2, q)

. (d) cdf of STPN

(σ = 1, λ, q = 2)

. (e) hazard function of STPN

(σ = 1, λ = 2, q)

. (f) hazard function of STPN

(σ = 1, λ, q = 2)

.

Figure 3. (a) Mean; (b) standard deviation; (c) asymmetry coefficient; (d) kurtosis coefficient for the STPN(

λ, σ = 1, q

) model.

Figure 4. Fit of the distributions for the

w e i g h t

dataset.

Figure 5. QRs for the fitted models in the

w e i g h t

dataset. The p-values for the AD, CVM, and SW normality tests are also presented to check if the QRs came from the standard normal distribution. (a) qq-plot STPN. (b) qq-plot TPN. (c) qq-plot GTPN.

Table 1. Some values for the kurtosis coefficients of the STPN distribution for different values of

λ

and q.

Table 1. Some values for the kurtosis coefficients of the STPN distribution for different values of

λ

and q.

	q
$λ$	5	7	10	15	$+ \infty$ (TPN)
−5	19.68	10.28	8.58	8.04	7.76
−2	16.63	8.22	6.73	6.26	6.02
−1	14.88	7.02	5.64	5.22	5.00
0	13.19	5.72	4.45	4.06	3.87
1	12.70	4.82	3.54	3.18	3.00
2	15.23	4.93	3.34	2.93	2.76
5	35.37	10.07	4.84	3.44	2.99

Table 2. Recovery parameters for the STPN distribution based on 1000 replicates for different combinations of parameters and sample size.

True Value				$n = 50$				$n = 100$				$n = 200$				$n = 500$
$σ$	$λ$	$q$	est.	bias	SE	RMSE	CP	bias	SE	RMSE	CP	bias	SE	RMSE	CP	bias	SE	RMSE	CP
2	−1	1.5	$\hat{σ}$	−0.43	1.98	1.06	0.71	−0.12	1.98	1.09	0.80	0.04	1.64	1.01	0.88	0.19	1.17	0.81	0.92
			$\hat{λ}$	0.76	2.29	1.40	0.77	0.33	2.10	1.21	0.83	0.11	1.69	1.02	0.90	−0.14	1.20	0.83	0.94
			$\hat{q}$	0.09	0.57	0.54	0.94	0.08	0.42	0.43	0.93	0.06	0.29	0.32	0.96	0.03	0.18	0.16	0.97
		3	$\hat{σ}$	−0.70	1.28	0.97	0.62	−0.34	1.30	0.86	0.78	−0.09	1.13	0.78	0.86	0.07	0.78	0.66	0.94
			$\hat{λ}$	0.94	1.64	1.35	0.72	0.45	1.49	1.08	0.83	0.19	1.18	0.85	0.90	−0.05	0.81	0.68	0.94
			$\hat{q}$	−0.30	2.11	2.01	0.81	−0.11	1.50	1.03	0.88	0.08	1.23	0.93	0.93	0.09	0.72	0.67	0.93
	1	1.5	$\hat{σ}$	0.21	1.18	1.14	0.89	0.20	0.80	0.86	0.94	0.09	0.49	0.50	0.95	0.06	0.29	0.31	0.96
			$\hat{λ}$	0.06	0.76	0.72	0.94	−0.04	0.53	0.52	0.96	−0.02	0.35	0.35	0.97	−0.02	0.22	0.22	0.95
			$\hat{q}$	0.23	0.63	0.72	0.96	0.10	0.33	0.39	0.96	0.04	0.21	0.25	0.96	0.03	0.13	0.14	0.95
		3	$\hat{σ}$	−0.01	0.83	0.67	0.88	0.09	0.58	0.53	0.94	0.07	0.39	0.41	0.95	0.03	0.23	0.24	0.95
			$\hat{λ}$	0.11	0.60	0.56	0.95	0.01	0.41	0.39	0.96	−0.01	0.29	0.30	0.96	−0.01	0.18	0.18	0.96
			$\hat{q}$	1.07	4.79	5.13	0.92	0.46	1.53	1.36	0.95	0.28	0.81	0.89	0.96	0.11	0.42	0.46	0.97
	3	1.5	$\hat{σ}$	0.12	0.59	0.83	0.94	0.04	0.37	0.40	0.95	0.02	0.26	0.25	0.96	0.03	0.17	0.17	0.95
			$\hat{λ}$	0.13	0.70	0.76	0.95	0.07	0.45	0.63	0.95	0.02	0.31	0.30	0.96	−0.02	0.19	0.19	0.95
			$\hat{q}$	0.14	0.39	0.50	0.96	0.06	0.22	0.23	0.97	0.03	0.16	0.16	0.95	0.02	0.10	0.10	0.95
		3	$\hat{σ}$	0.01	0.45	0.48	0.95	0.04	0.31	0.31	0.96	0.02	0.21	0.21	0.96	0.01	0.13	0.13	0.95
			$\hat{λ}$	0.16	0.57	0.63	0.97	0.02	0.37	0.37	0.96	0.02	0.25	0.25	0.95	0.01	0.16	0.16	0.95
			$\hat{q}$	0.50	1.55	2.01	0.96	0.22	0.67	0.78	0.96	0.11	0.43	0.48	0.96	0.05	0.26	0.25	0.97
10	−1	1.5	$\hat{σ}$	−2.41	8.09	5.04	0.70	−1.37	7.73	4.63	0.76	−1.00	5.71	3.71	0.83	0.05	4.56	3.03	0.90
			$λ$	0.84	1.89	1.34	0.75	0.52	1.71	1.08	0.81	0.34	1.25	0.85	0.86	0.04	0.97	0.66	0.92
			$\hat{q}$	0.08	0.56	0.55	0.92	0.07	0.41	0.45	0.93	0.03	0.27	0.27	0.94	0.02	0.17	0.16	0.97
		3	$\hat{σ}$	−3.69	5.49	4.83	0.62	−2.29	5.30	3.86	0.76	−0.93	4.73	3.14	0.85	0.11	3.58	2.87	0.90
			$\hat{λ}$	0.99	1.45	1.33	0.71	0.58	1.27	0.98	0.82	0.28	1.03	0.74	0.88	0.04	0.75	0.60	0.92
			$\hat{q}$	−0.30	2.05	2.03	0.81	−0.19	1.33	0.95	0.87	0.07	1.18	0.92	0.92	0.15	0.76	0.73	0.94
	1	1.5	$\hat{σ}$	1.24	6.09	5.89	0.89	0.87	3.89	4.52	0.93	0.37	2.40	2.66	0.93	0.25	1.42	1.49	0.95
			$\hat{λ}$	0.02	0.77	0.72	0.95	−0.02	0.52	0.52	0.96	−0.00	0.35	0.37	0.95	−0.02	0.21	0.22	0.96
			$\hat{q}$	0.23	0.59	0.68	0.97	0.12	0.36	0.45	0.95	0.04	0.21	0.23	0.96	0.02	0.13	0.13	0.95
		3	$\hat{σ}$	−0.11	3.87	3.17	0.89	0.26	2.84	2.60	0.92	0.28	1.91	2.02	0.94	0.14	1.14	1.21	0.95
			$\hat{λ}$	0.12	0.59	0.54	0.95	0.03	0.41	0.39	0.94	0.00	0.28	0.28	0.95	−0.00	0.18	0.18	0.95
			$\hat{q}$	1.28	5.65	6.16	0.91	0.41	1.55	1.75	0.94	0.30	0.87	1.03	0.96	0.10	0.42	0.45	0.97
	3	1.5	$\hat{σ}$	0.44	2.74	3.03	0.93	0.22	1.85	1.91	0.95	0.15	1.27	1.36	0.94	0.09	0.81	0.85	0.95
			$\hat{λ}$	0.14	0.67	0.89	0.96	0.05	0.44	0.47	0.96	0.02	0.30	0.31	0.96	−0.00	0.19	0.20	0.95
			$\hat{q}$	0.14	0.35	0.42	0.98	0.06	0.23	0.24	0.96	0.03	0.15	0.16	0.96	0.01	0.10	0.10	0.95
		3	$\hat{σ}$	0.22	2.24	2.36	0.94	0.07	1.50	1.57	0.94	0.00	1.03	1.06	0.95	0.01	0.65	0.65	0.95
			$\hat{λ}$	0.11	0.55	0.59	0.97	0.07	0.37	0.40	0.97	0.04	0.25	0.26	0.95	0.01	0.16	0.16	0.95
			$\hat{q}$	0.69	1.98	3.14	0.95	0.21	0.67	0.82	0.96	0.10	0.42	0.45	0.96	0.04	0.25	0.26	0.96

Table 3. Descriptive statistics for the

w e i g h t

dataset.

Table 3. Descriptive statistics for the

w e i g h t

dataset.

Dataset	n	$\bar{X}$	$S^{2}$	$\sqrt{b_{1}}$	$b_{2}$
Weight measured	252	$178.9$	$863.72$	$1.2$	$8.14$

Table 4. Estimated parameters and their standard errors (in parentheses) for the STPN, TPN, and GTPN models for the

w e i g h t

dataset. The AIC and BIC are also presented.

Table 4. Estimated parameters and their standard errors (in parentheses) for the STPN, TPN, and GTPN models for the

w e i g h t

dataset. The AIC and BIC are also presented.

Estimated	STPN	TPN	GTPN
$σ$	20.775 (1.811)	29.331 (1.306)	0.321 (0.088)
$λ$	7.825 (0.593)	6.100 (0.279)	14.689 (0.636)
q	11.250 (2.132)	-	-
$α$	-	-	0.426 (0.015)
$A I C$	2394.456	2421.978	2405.15
$B I C$	2405.044	2429.037	2415.738

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Slash Truncation Positive Normal Distribution and Its Estimation Based on the EM Algorithm

Abstract

1. Introduction

2. The Slash Truncation Positive Normal Model

2.1. Stochastic Representation and Particular Cases

2.2. Density Function

2.3. Some Properties

2.4. Moments

3. Inference

3.1. Moments Estimators

3.2. Maximum Likelihood Estimation

3.3. EM Algorithm

3.4. Observed Fisher Information Matrix

3.5. Computational Aspects

4. Simulation

5. Application

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix B

References

Article Metrics

Citations

Article Access Statistics