A Bimodal Model Based on Truncation Positive Normal with Application to Height Data

Héctor J. Gómez; Wilson E. Caimanque; Yolanda M. Gómez; Tiago M. Magalhães; Miguel Concha; Diego I. Gallardo

doi:10.3390/sym14040665

,

and

¹

Departamento de Ciencias Matemáticas y Físicas, Facultad de Ingeniería, Universidad Católica de Temuco, Temuco 4780000, Chile

²

Departamento de Matemática, Facultad de Ingeniería, Universidad de Atacama, Copiapó 1530000, Chile

³

Department of Statistics, Institute of Exact Sciences, Federal University of Juiz de Fora, Juiz de Fora 36036-900, Brazil

^*

Author to whom correspondence should be addressed.

Symmetry2022, 14(4), 665;https://doi.org/10.3390/sym14040665

This article belongs to the Special Issue Symmetric and Asymmetric Bimodal Distributions with Applications

Version Notes

Order Reprints

Abstract

In this work, we propose a new bimodal distribution with support in the real line. We obtain some properties of the model, such as moments, quantiles, and mode, among others. The computational implementation of the model is presented in the tpn package of the software R. We perform a simulation study in order to assess the properties of the maximum likelihood estimators in finite samples. Finally, we present an application to a bimodal data set, where our proposal is compared with other models in the literature.

Keywords:

bimodal distribution; unimodal distribution; tpn package; maximum likelihood estimation; asymmetric

1. Introduction

Describing a phenomenon by a probability distribution is very useful because of the properties associated with it: expectation, shape, range, etc. However, this description can be difficult when a phenomenon (in practice, an observed dataset) is bimodal, which occurs commonly in areas like astrophysics, ecology and genetics; see [1,2,3], respectively. The first approach to fit a bimodal data is using a mixture of two unimodal distributions, for instance, a mixture of gaussian distributions; see [4]. The main disadvantage of this procedure is the non-identifiability of the proposed mixture model. The second and the most workable practical approach is to use distributions which already have bimodal properties. Because of these properties, there is an increasing interest to derive bimodal distributions in the literature: refs. [5,6] presented extensions of the skew-normal, ref. [7] proposed a generalization of the Burr type X distribution and [8] derived an extension of the sinh Cauchy distribution. In this paper, we will discuss an extension of the half normal distribution proposed by [9], the truncated positive normal (tpn) model. The probability density function (pdf) for the tpn model is given by

f (x; σ, λ) = \frac{1}{σ Φ (λ)} ϕ (\frac{x}{σ} - λ), x, σ \in ℝ_{+}, λ \in ℝ,

where

σ

and

λ

are the scale and shape parameters, respectively, and

ϕ (\cdot)

and

Φ (\cdot)

are the pdf and cumulative distribution function (cdf), respectively, of the standard normal distribution. The corresponding cdf of the tpn model is

F_{X} (x; σ, λ) = \frac{Φ (\frac{x}{σ} - λ) + Φ (λ) - 1}{Φ (λ)} .

Note that the cdf above has a closed-form expression, which is useful for generating random data besides defining quantiles. For more properties of the tpn model, see [9]. The restriction to positive values is a limitation of the tpn model. To overcome this limitation, the chief goal of this paper is to derive an extension of the tpn model which has support in the real line. We describe in detail the model, studying its main properties and related functions. Moreover, we show analytically the regions in which the model is unimodal and bimodal, and such regions depend only on one parameter.

The paper is organized as follows. In Section 2, we derive an extension of the tpn with support in the real line and study some properties of the distribution. The inference for parameter estimation in the proposed model and computational aspects are presented in Section 3. In Section 4, we perform a simulation study to evaluate the parameter estimation in finite samples. An application to real data is discussed in Section 5. Finally, conclusions are given in Section 6.

2. A Bimodal Truncation Positive Normal Distribution

In this section, we present the stochastic representation for the bimodal truncation positive normal (btpn) distribution and some properties, such as its pdf and its cdf. We also discuss some particular cases of the model.

2.1. Stochastic Representation, pdf and cdf

Let T be a discrete random variable such as

T = {\begin{matrix} - (1 + ϵ) & , with probability (1 + ϵ) / 2 \\ 1 - ϵ & , with probability (1 - ϵ) / 2 \end{matrix},

where

ϵ \in (- 1, 1)

. If

Z \sim tpn (σ, λ)

, independent from T, then we define a new random variable given by

X = Z T

. We say that X follows a btpn distribution.

Proposition 1.

The pdf for the btpn distribution is given by

f (x; σ, λ, ϵ) = {\begin{matrix} \frac{ϕ (\frac{x}{σ (1 + ϵ)} + λ)}{2 σ Φ (λ)} & , i f x < 0 \\ \frac{ϕ (\frac{x}{σ (1 - ϵ)} - λ)}{2 σ Φ (λ)} & , i f x \geq 0 \end{matrix},

where

σ > 0, λ \in R

and

ϵ \in (- 1, 1)

.

Proof.

If

x < 0

, then the cdf for X is

\begin{matrix} F_{X} (x) & = & P (X \leq x) = \frac{(1 + ϵ)}{2} P (z \geq \frac{- x}{1 + ϵ}) = \frac{(1 + ϵ)}{2} [1 - P (z \leq \frac{- x}{1 + ϵ})] \\ = & \frac{(1 + ϵ)}{2} [1 - F_{Z} (\frac{- x}{1 + ϵ})] . \end{matrix}

Deriving the last expression in relation to x, we have

f_{X} (x) = \frac{(1 + ϵ)}{2} [- f_{Z} (\frac{- x}{1 + ϵ}) \frac{- 1}{(1 + ϵ)}] = \frac{1}{2} [\frac{1}{σ Φ (λ)} ϕ (\frac{\frac{- x}{(1 + ϵ)}}{σ} - λ)] = \frac{ϕ (\frac{x}{σ (1 + ϵ)} + λ)}{2 σ Φ (λ)} .

A similar routine calculation shows that for

x \geq 0

, we have that

f_{X} (x) = \frac{ϕ (\frac{x}{σ (1 - ϵ)} - λ)}{2 σ Φ (λ)},

completing the proof. □

Figure 1 shows the pdf function for the btpn model with different combination of parameters. Note that the model can assume different shapes, including unimodal, bimodal, symmetric and asymmetric.

Figure 1. Pdf for btpn

(σ = 1, λ, ϵ)

with different fixed values for

λ

and varying

ϵ

: (a)

λ = - 2

; (b)

λ = - 0.2

; (c)

λ = 0.4

and; (d)

λ = 1.5

.

Proposition 2.

The cdf of

X \sim b t p n (σ, λ, ϵ)

is given by

F_{X} (x) = {\begin{matrix} \frac{(1 + ϵ)}{2 Φ (λ)} Φ (\frac{x}{σ (1 + ϵ)} + λ) & , i f x \leq 0 \\ \frac{(1 - ϵ)}{2 Φ (λ)} [Φ (\frac{x}{σ (1 - ϵ)} - λ) + Φ (λ) - 1] & , i f x \geq 0 \end{matrix}

Proof.

It is immediate from the last proof. □

Proposition 3.

Let there be

X \sim b t p n (σ, λ, ϵ)

. Its quantile function is given by

Q_{X} (p) = F_{X}^{- 1} (p) = {\begin{matrix} σ (1 + ϵ) [Φ^{- 1} (\frac{2 p Φ (λ)}{1 + ϵ}) - λ] & , i f 0 < p \leq \frac{1 + ϵ}{2} \\ σ (1 - ϵ) [Φ^{- 1} (\frac{2 p Φ (λ)}{1 - ϵ} - Φ (λ) + 1) + λ] & , i f \frac{1 + ϵ}{2} < p < 1 \end{matrix}

Proof.

It is immediate from inverting the cdf for the btpn distribution given in Proposition 2. □

Corollary 1.

The median for

X \sim b t p n (σ, λ, ϵ)

is given by

M e (X) = {\begin{matrix} σ (1 + ϵ) [Φ^{- 1} (\frac{Φ (λ)}{1 + ϵ}) - λ] & , i f ϵ \geq 0 \\ σ (1 - ϵ) [Φ^{- 1} (\frac{ϵ Φ (λ)}{(1 - ϵ)} + 1) + λ] & , i f ϵ < 0 . \end{matrix}

Corollary 2.

The median for

X \sim b t p n (σ, λ, ϵ)

is

< 0

,

= 0

and

> 0

if ϵ is

> 0

,

= 0

and

< 0

, respectively.

2.2. Moments and Moment-Generating Function

The following proposition presents the central moments of the btpn distribution.

Proposition 4.

Let

X \sim b t p n (σ, λ, ϵ)

. The r-th central moment of X is given by

E (X^{r}) = \frac{σ^{r}}{2 \sqrt{2 π} Φ (λ)} [{(- 1)}^{r} {(1 + ϵ)}^{r + 1} + {(1 - ϵ)}^{r + 1}] \sum_{k = 0}^{r} (\binom{r}{k}) λ^{r - k} {(2)}^{(k - 1) / 2} Γ (k + 1) / 2, λ^{2} / 2),

where

Γ (a, b) = \int_{b}^{+ \infty} t^{a - 1} e^{- t} d t

is the upper incomplete gamma function.

Proof.

Note that

E (X^{r}) = E_{1} (X^{r}) + E_{2} (X^{r})

, where

E_{1} (X^{r}) = \int_{- \infty}^{0} x^{r} f (x) d x

and

E_{2} (X^{r}) = \int_{0}^{+ \infty} x^{r} f (x) d x

. For the first term, we perform the change of variable

u = \frac{- x}{σ (1 + ϵ)} - λ

. With this,

\begin{matrix} E_{1} (X^{r}) & = & \int_{- \infty}^{0} \frac{x^{r}}{2 σ (1 + ϵ)} ϕ (\frac{- x}{σ (1 + ϵ)} - λ) d x \\ = & \frac{{(- σ)}^{r} {(1 + ϵ)}^{(r + 1)}}{2 Φ (λ)} \int_{- λ}^{\infty} {(u + λ)}^{r} ϕ (u) d u . \end{matrix}

Using the binomial theorem and the change of variable

t = u^{2} / 2

in the last expression, we obtain

\begin{matrix} E_{1} (X^{r}) & = & \frac{{(- σ)}^{r} {(1 + ϵ)}^{(r + 1)}}{2 \sqrt{2 π} Φ (λ)} \sum_{k = 0}^{r} (\binom{r}{k}) λ^{r - k} \int_{- λ}^{\infty} u^{k} e^{- u^{2} / 2} d u, \\ = & \frac{{(- σ)}^{r} {(1 + ϵ)}^{(r + 1)}}{2 \sqrt{2 π} Φ (λ)} \sum_{k = 0}^{r} (\binom{r}{k}) λ^{r - k} 2^{(k - 1) / 2} \int_{λ^{2} / 2}^{\infty} t^{(k - 1) / 2} e^{- t} d t, \end{matrix}

Note that the last integral corresponds to

Γ ((k + 1) / 2, λ^{2} / 2)

.

On the other hand, for

E_{2} (X^{r})

, we perform the change of variable

u = \frac{x}{σ (1 - ϵ)} - λ

, obtaining

\begin{matrix} E_{2} (X^{r}) & = & \int_{0}^{\infty} \frac{x^{r}}{2 σ (1 - ϵ)} ϕ (\frac{x}{σ (1 - ϵ)} - λ) d x \\ = & \frac{σ^{r} {(1 - ϵ)}^{(r + 1)}}{2 Φ (λ)} \int_{- λ}^{\infty} {(u + λ)}^{r} ϕ (u) d u . \end{matrix}

Using the same routine calculation, we obtain

E_{2} (X^{r}) = \frac{σ^{r} {(1 - ϵ)}^{(r + 1)}}{2 \sqrt{2 π} Φ (λ)} \sum_{k = 0}^{r} (\binom{r}{k}) λ^{r - k} 2^{(k - 1) / 2} \int_{λ^{2} / 2}^{\infty} t^{(k - 1) / 2} e^{- t} d t,

where again the last integral corresponds to

Γ ((k + 1) / 2, λ^{2} / 2)

. The final result is obtained by summing

E_{1} (X^{r})

and

E_{2} (X^{r})

. □

Proposition 5.

Let

X \sim b t p n (σ, λ, ϵ)

. The moment-generating function (mgf) for X is given by

\begin{matrix} M_{x} (t) = & \frac{(1 + ϵ) \exp {\frac{{(1 + ϵ)}^{2}}{2} t^{2} σ^{2} - t σ λ (1 + ϵ)} Φ (λ - t σ (1 + ϵ))}{2 Φ (λ)}, \\ + \frac{(1 - ϵ) \exp {\frac{{(1 - ϵ)}^{2}}{2} t^{2} σ^{2} + t σ λ (1 - ϵ)} Φ (λ + t σ (1 - ϵ))}{2 Φ (λ)} . \end{matrix}

Proof.

Note that

M_{X} (t) = M_{X_{1}} (t) + M_{X_{2}} (t)

, where

M_{X_{1}} (t) = \int_{- \infty}^{0} e^{t x} f (x) d x

and

M_{X_{2}} (t) = \int_{0}^{+ \infty} e^{t x} f (x) d x

. For the first integral and using the change of variable

u = x / [σ (1 + ϵ)] + λ

, we obtain

\begin{matrix} M_{X_{1}} (t) = \frac{(1 + ϵ)}{2 Φ (λ)} \int_{- \infty}^{λ} e^{t σ (1 + ϵ) (μ - λ)} ϕ (u) d u . \end{matrix}

Completing the square of a binomial in the last term of the exponential and using the change of variable

z = μ - t σ (1 + ϵ)

, we have

\begin{matrix} M_{X_{1}} (t) & = \frac{(1 + ϵ) \exp {\frac{t^{2} σ^{2} {(1 + ϵ)}^{2}}{2} - t σ λ (1 + ϵ)}}{2 Φ (λ)} \int_{- \infty}^{λ - t σ (1 + ϵ)} \frac{e^{- z^{2} / 2}}{\sqrt{2 π}} d z, \\ = \frac{(1 + ϵ) \exp {\frac{t^{2} σ^{2} {(1 + ϵ)}^{2}}{2} - t σ λ (1 + ϵ)}}{2 Φ (λ)} Φ (λ - t σ (1 + ϵ)) . \end{matrix}

For

X \geq 0

, and similarly to the previous development, we use the change of variable

u = x / [σ (1 - ϵ)] - λ

, obtaining that

\begin{matrix} M_{X_{2}} (t) = \frac{(1 - ϵ)}{2 Φ (λ)} \int_{- λ}^{\infty} e^{t σ (1 - ϵ) (μ + λ)} ϕ (u) d u . \end{matrix}

Again, completing the square and using

z = μ - t σ (1 - ϵ)

, we obtain

\begin{matrix} M_{X_{2}} (t) & = \frac{(1 - ϵ) \exp {\frac{t^{2} σ^{2} {(1 - ϵ)}^{2}}{2} + t σ λ (1 - ϵ)}}{2 Φ (λ)} \int_{- λ - t σ (1 - ϵ)}^{\infty} \frac{e^{- z^{2} / 2}}{\sqrt{2 π}} d z, \\ = \frac{(1 - ϵ) \exp {\frac{t^{2} σ^{2} {(1 - ϵ)}^{2}}{2} + t σ λ (1 - ϵ)}}{2 Φ (λ)} Φ (λ + t σ (1 - ϵ)) . \end{matrix}

Finally, the result is obtained by summing

M_{X_{1}} (t)

and

M_{X_{2}} (t)

. □

Corollary 3.

Using properties of the mgf, the first four moments of

X \sim b t p n (σ, λ, ϵ)

can be obtained from the expression

μ_{r} = (\partial^{r} M_{X} (t) / \partial t^{r}) |_{t = 0}

.

$μ_{1} = E [X^{1}] = - 2 ϵ σ [λ + Ω (λ)]$
$μ_{2} = E [X^{2}] = σ^{2} (1 + 3 ϵ^{2}) [λ^{2} + λ Ω (λ) + 1]$
$μ_{3} = E [X^{3}] = - 4 ϵ σ^{3} (1 + ϵ^{2}) [λ^{3} + λ^{2} Ω (λ) + λ + 2 Ω (λ)]$
$μ_{4} = E [X^{4}] = σ^{4} [1 + 5 ϵ^{2} (2 + ϵ)] [λ^{4} + λ^{3} Ω (λ) + 6 λ^{2} + 5 λ Ω (λ) + 3]$

where

Ω (λ) = ϕ (λ) / Φ (λ)

is the reciprocal of the Mill’s ratio for the standard normal distribution.

Corollary 4.

The variance, coefficients of skewness and kurtosis for

X \sim b t p n (σ, λ, ϵ)

are given by

\begin{matrix} V a r (X) & = σ^{2} [λ^{2} (1 - ϵ^{2}) + λ Ω (λ) (1 - 5 ϵ^{2}) + ϵ^{2} (3 - 4 Ω (λ)) + 1], \\ \sqrt{b_{1}} & = \frac{- 4 ϵ (1 + ϵ^{2}) [λ^{3} + λ^{2} Ω (λ) + 2 Ω (λ)]}{{[(1 + 3 ϵ) (λ^{2} + λ Ω (λ) + 1)]}^{3 / 2}}, a n d \\ b_{2} & = \frac{[1 + 5 ϵ^{2} (2 + ϵ)] [λ^{4} + λ^{3} Ω (λ) + 6 λ^{2} + 5 λ Ω (λ) + 3]}{{[(1 + 3 ϵ^{2}) (λ^{2} + λ Ω (λ) + 1)]}^{2}}, \end{matrix}

respectively.

Figure 2 shows the plots for asymmetry and kurtosis coefficients. Note that a more right-skewed distribution is obtained when

ϵ \to - 1

and

λ \to - \infty

, whereas a more left-skewed model is obtained when

ϵ \to 1

and

λ \to - \infty

. On the other hand, a greater kurtosis is obtained when

ϵ \to - 1

and

λ \to - \infty

, whereas a lower kurtosis is obtained when

| ϵ | \to 1

and

λ \to \infty

. Note that this pattern is consistent with the pdf for different parameters presented in Figure 1.

Figure 2. (a) Asymmetry coefficient and (b) kurtosis coefficient for btpn

(σ = 1, λ, ϵ)

distribution.

2.3. Mode and Unimodality and Bimodality Regions

The next proposition presents the unimodality and bimodality property of the btpn distribution.

Proposition 6.

Let

X \sim b t p n (σ, λ, ϵ)

. For

λ \leq 0

, the model is unimodal, and for

λ > 0

, the model is bimodal. Moreover, for the unimodal case, the mode of the model is 0, and for the bimodal case, the two modes are

- σ λ (1 + ϵ)

and

σ λ (1 - ϵ)

, respectively.

Proof.

By definition, the mode is the value that maximizes the pdf or, equivalently, the logarithm of the pdf. For

X \sim b t p n (σ, λ, ϵ)

, it is straighforward to show that

\begin{matrix} \frac{\partial \log f (x; σ, λ, ϵ)}{\partial x} & = {\begin{matrix} - \frac{1}{σ (1 + ϵ)} (\frac{x}{σ (1 + ϵ)} + λ) & , if x < 0 \\ - \frac{1}{σ (1 - ϵ)} (\frac{x}{σ (1 - ϵ)} - λ) & , if x \geq 0 \end{matrix} and \\ \frac{\partial^{2} \log f (x; σ, λ, ϵ)}{\partial x^{2}} & = {\begin{matrix} - \frac{1}{σ^{2} {(1 + ϵ)}^{2}} & , if x < 0 \\ - \frac{1}{σ^{2} {(1 - ϵ)}^{2}} & , if x \geq 0 \end{matrix} . \end{matrix}

Therefore, solving the equation

\partial \log f (x; σ, λ, ϵ) / \partial x = 0

, we obtain

x_{1} = - σ λ (1 + ϵ)

and

x_{2} = σ λ (1 - ϵ)

as the potential mode for each branch, because the second derivative is negative for each respective case. However, this is valid if and only if

x_{1} < 0

and

x_{2} > 0

, respectively. In other words, if

- σ λ (1 + ϵ) < 0

, then

x_{1}

is a mode and if

σ λ (1 - ϵ) > 0

, then

x_{2}

also is a mode. This is equivalent to

\begin{matrix} {λ > 0 \land 1 + ϵ > 0} & \lor {λ < 0 \land 1 + ϵ < 0} \Rightarrow x_{1} is a mode and \\ {λ > 0 \land 1 - ϵ > 0} & \lor {λ < 0 \land 1 - ϵ < 0} \Rightarrow x_{2} is a mode . \end{matrix}

where

1 + ϵ > 0

and

1 - ϵ > 0

,

\forall ϵ (- 1, 1)

. On the other hand,

1 + ϵ ≮ 0

and

1 - ϵ ≮ 0

. For this reason, it is immediate that for

λ > 0

, the btpn distribution have two modes, and such modes are

x_{1}

and

x_{2}

. Finally, for

λ \leq 0

, it is immediate that

\partial \log f (x; σ, λ, ϵ) / \partial x > 0

, for

x < 0

and

\partial \log f (x; σ, λ, ϵ) / \partial x \leq 0

for

x \geq 0

. In other words, the pdf for the btpn distribution is an increasing function in

(- \infty, 0)

and a decreasing function in

(0, \infty)

, where we can deduce that the model is unimodal and the respective mode is attached in zero. □

Figure 3 shows the regions of unimodality and bimodality for the btpn depending on the parameters

λ

and

ϵ

.

Figure 3. Regions of unimodality and bimodality for the btpn model in terms of

λ

and

η

.

2.4. Particular Cases

By construction, the following models are particular cases for the btpn distribution:

btpn $(σ = σ / 2, λ, ϵ = - 1) \equiv$ tpn $(σ, λ)$ ;
btpn $(σ, λ = 0, ϵ = 0) \equiv$ N $(0, σ^{2})$ , i.e., the normal distribution with mean 0 and variance $σ^{2}$ ;
btpn $(σ = 1, λ = 0, ϵ) \equiv$ esn $(σ, ϵ)$ , i.e., the epsilon skew-normal distribution (Mudholkar and Hutson [10]).

Figure 4 summarizes the relationships among the btpn and its particular cases.

Figure 4. Particular cases for the btpn distribution.

3. Inference

In this section, we discuss the maximum likelihood (ML) method for parameter estimation for the btpn model. We also provide details about the computational aspects.

3.1. Maximum Likelihood Function

Hereafter, and to simplify the estimation procedure, we consider the reparameterization

η = \frac{ϵ}{\sqrt{1 - ϵ^{2}}} \in R

. Therefore, henceforth, we denote

X \sim b t p n (σ, λ, η)

, with

σ

,

λ

and

η

scale, shape and asymmetry parameters, respectively, if its pdf is given by

f (x; σ, λ, η) = {\begin{matrix} \frac{1}{2 σ Φ (λ)} ϕ (\frac{- x \sqrt{1 + η^{2}}}{σ (\sqrt{1 + η^{2}} + η)} - λ) & , if x < 0 \\ \frac{1}{2 σ Φ (λ)} ϕ (\frac{x \sqrt{1 + η^{2}}}{σ (\sqrt{1 + η^{2}} - η)} - λ) & , if x \geq 0 \end{matrix}

Given

z_{1}, \dots, z_{n}

, a random sample from the btpn

(σ, λ, η)

distribution, the log-likelihood function for

θ = (σ, λ, η)

is given by

ℓ (θ) = - n [\log (2 σ Φ (λ)) + \frac{1}{2} \log (2 π) + \frac{λ^{2}}{2}] + ℓ_{1} (θ) + ℓ_{2} (θ),

(1)

where

\begin{matrix} ℓ_{1} (θ) & = - \frac{1}{2} \sum_{i : z_{i} \leq 0} {{(\frac{z_{i} \sqrt{1 + η^{2}}}{σ (\sqrt{1 + η^{2}} - η)})}^{2} - \frac{2 z_{i} λ \sqrt{1 + η^{2}}}{σ (\sqrt{1 + η^{2}} - η)}}, and \\ ℓ_{2} (θ) & = - \frac{1}{2} \sum_{i : z_{i} > 0} {{(\frac{z_{i} \sqrt{1 + η^{2}}}{σ (\sqrt{1 + η^{2}} + η)})}^{2} + \frac{2 z_{i} λ \sqrt{1 + η^{2}}}{σ (\sqrt{1 + η^{2}} + η)}} . \end{matrix}

To find the ML estimator of

θ

, say

\hat{θ}

, we need to maximize

ℓ (θ)

in (1) in relation to

θ

. However, no closed-form expressions for the ML estimates are possible. Therefore, we must use an iterative method for nonlinear optimization. For instance, we solve this problem using the Broyden-–Fletcher–-Goldfarb–-Shanno (BFGS) quasi-Newton method; see [11] (p. 199).

3.2. Computational Aspects

The ML estimators for the btpn model and the obtaining of their standard errors are included in the tpn package [12] from the R [13] software. The following function can be used to obtain these results:

est.btpn(y)

where y is the sample. The function returns a list with the estimates, the iterations used for the maximization algorithm, the log-likelihood function evaluated in the parameter estimations and the corresponding Akaike information criterion (AIC, see [14]) and the Bayesian information criterion (BIC, see [15]). Models with lower AIC and/or BIC are preferable. The package also includes the functions to drawn values to evaluate the pdf and the cdf for the btpn model named rbtpn, dbptn and pbtpn, respectively.

4. Simulation Study

In this section, we present a simulation study in order to evaluate the behaviour of the ML estimators in finite samples. The study was conducted using the tpn package [12]. Specifically, random samples were generated using the rbtpn function, and the estimation was performed using the est.btpn function. We considered 5000 Monte Carlo replicates for 3 sample sizes: 50, 100 and 200. We also considered 2 combinations for the scale parameter

σ

: 2 and 10; 3 values for

λ

:

- 0.75, 1

and 3; and 2 values for

η

:

- 0.5

and

0.75

. This setting provides 36 combinations of the parameters

σ, λ

and

η

and the sample size. Table 1 and Table 2 summarize the empirical bias, the standard errors of the MLE (SE), the root-mean-squared error (RMSE) and the 95% coverage probability (CP) based on the asymptotic distribution of the MLE. In general terms, the bias and RMSE terms are reduced when the sample size is increased, suggesting the consistency of the MLE. Note also that the SE and RMSE terms are closer when the sample size is increased, suggesting that the standard errors of the estimators are also well estimated. Additionally, the CP terms converge reasonably to the nominal value used to their construction (95%), suggesting that the normality is reasonable as an asymptotic distribution to the ML estimators in the btpn model, even in reasonable sample sizes.

Table 1. Empirical bias, SE, RMSE and 95% CP for the ML estimators of

σ, λ

and

η

in the btpn distribution with different combinations of parameters (case true

σ = 2

).

Table 2. Empirical bias, SE, RMSE and 95% CP for the ML estimators of

σ, λ

and

η

in the btpn distribution with different combinations of parameters (case true

σ = 10

).

5. Application

In this section, we present an application to a real data set in order to illustrate the btpn model. We consider the height data set, which consists of the height of 126 students from the University of Pennsylvania (Cruz-Medina [16]). We compare our proposal with other bimodal proposals, such as the epsilon skew inverted gamma (esig, see Abdulah et al. [17]) and the alpha skew-normal (asn, Elal-Olivero [18]). The pdf for the esig model is given by:

f (x; σ, λ, η) = \frac{λ^{σ}}{2 Γ (σ)} {\begin{matrix} {(\frac{x}{1 - η})}^{- (σ + 1)} e^{- \frac{λ (1 - η)}{x}} & x \geq 0 \\ {(\frac{- x}{1 + η})}^{- (σ + 1)} e^{- \frac{λ (1 + η)}{- x}} & x < 0 \end{matrix},

where

λ > 0

,

σ > 0

and

| η | < 1

are the scale, shape and skewness parameters, respectively.

The pdf for the asn model is given by:

f (x; η, λ, σ) = (\frac{{[1 - η (\frac{x - λ}{σ})]}^{2} + 1}{σ (2 + η^{2})}) ϕ (\frac{x - λ}{σ}),

where

η, λ \in R

and

σ > 0

are the shape, location and scale parameters, respectively.

Table 3 summarizes some descriptive statistics for the sample, where we highlight the symmetrical behaviour of the data (

\sqrt{b_{1}} = - 0.05

).

Table 3. Descriptive statistics for the

h e i g h t

data set.

Table 4 presents the estimatives, standard errors, AIC and BIC criteria for the mentioned models. Note that, based on both criteria, btpn presents a better fit than the rest of the distributions. Figure 5 shows the histogram for the height data and the pdf for the three considered distributions, where the better performance for the btpn in this data set is demonstrated. Moreover, as discussed in Proposition 6,

\hat{λ} = 0.496 > 0

implies a bimodal model, and such modes are equal to

x_{1} = - 0.402

and

x_{2} = 0.404

. In addition, the distribution of height is very close to symmetry.

Table 4. Estimated parameters and their standard errors (in parentheses) for the btpn, esig and asn models for the

h e i g h t

data set. The AIC and BIC criteria are also presented.

Figure 5. Histogram for the

h e i g h t

data set and the estimated pdf for the btpn, esig and asn models.

We also compute the randomized quantile residuals [19] for the three fitted models. If the model was correctly specified, these residuals should be a random sample from the standard normal distribution. Figure 6 shows the qqplot for such residuals, also suggesting that the btpn is a more appropriated model for this data set.

Figure 6. Quantile residuals for fitted models: (a) asn, (b) esig, and (c) btpn.

6. Conclusions

The importance of fitting an observable dataset by a probability distribution is well-known, since it will be covered with convenient properties. Difficulty arises when the data is bimodal, because there are not traditional distributions with this property. This gap is being filled by an increasing movement in the statistical literature to develop probability distributions which already have a bimodality feature. In this paper, we made our contribution with the bimodal positive truncation normal distribution. The btpn distribution has the following advantages: support in the real line, closed-form cdf and moments, and the ability to generalize the standard normal and treatable maximum likelihood estimators. The ML procedure works very reasonably, i.e, as the sample size increases, the bias and the SE decrease. Since there are models for which the estimation procedure does not work even for large samples, the btpn distribution also has this strength. We ended the advantages of our proposed distribution with an application where btpn was the best choice of fitting. As suggestions for future work, we can mention two possibilities: the first entails the improvement of the asymptotic properties of the ML estimation through bias and variance corrections (see [20,21], respectively), and the second involves the addition of a regression structure. A closed-form cdf allows even a quantile regression structure, see [22,23], for instance, as [24] did for the gamma–sinh Cauchy distribution. For the applicability and possibilities of future works, we think the bimodal positive truncation normal distribution is useful for practitioners and researchers of many different areas.

Author Contributions

Conceptualization, H.J.G., Y.M.G. and D.I.G.; Data curation, M.C.; Formal analysis, W.E.C., Y.M.G. and M.C.; Investigation, W.E.C. and T.M.M.; Methodology, H.J.G., T.M.M. and D.I.G.; Software, H.J.G., Y.M.G. and D.I.G.; Supervision, Y.M.G. and D.I.G.; Validation, M.C.; Visualization, W.E.C.; Writing—review editing, T.M.M. and D.I.G. All authors have read and agreed to the published version of the manuscript.

Funding

The research of H.J.G. was supported by the Proyecto de Investigación de Facultad de Ingeniería. Universidad Católica de Temuco. UCT-FDI032020. W.E.C., Y.M.G., M.C. and D.I.G. also acknowledge the support of Proyecto Gidi: “La estadística como respuesta a problemas de otras áreas” supported by the University of Atacama.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ashman, K.M.; Bird, C.M.; Zepf, S.E. Detecting Bimodality in Astronomical Datasets. Astron. J. 1994, 108, 2348–2361. [Google Scholar] [CrossRef] [Green Version]
Michele, C.; Accatino, F. Tree cover bimodality in savannas and forests emerging from the switching between two fire dynamics. PLoS ONE 2014, 9, e91195. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wang, J.; Wen, S.; Symmans, W.F.; Pusztai, L.; Coombes, K.R. The bimodality index: A criterion for discovering and ranking bimodal signatures from cancer gene expression profiling data. Cancer Inform. 2009, 7, 199–216. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Robertson, C.A.; Fryer, J.G. Some descriptive properties of normal mixtures. Scand. Actuar. J. 1969, 3–4, 137–146. [Google Scholar] [CrossRef]
Gómez, H.W.; Elal-Olivero, D.; Salinas, H.S.; Bolfarine, H. Bimodal extension based on the skew-normal distribution with application to pollen data. Environmetrics 2011, 22, 50–62. [Google Scholar] [CrossRef]
Venegas, O.; Salinas, H.S.; Gallardo, D.I.; Bolfarine, H.; Gómez, H.W. Bimodality based on the generalized skew-normal distribution. J. Stat. Comput. Simul. 2018, 88, 156–181. [Google Scholar] [CrossRef]
Butt, N.S.; Khalil, M.G. A New Bimodal Distribution for Modeling Asymmetric Bimodal Heavy-Tail Real Lifetime Data. Symmetry 2020, 12, 2058. [Google Scholar] [CrossRef]
Gómez, Y.M.; Gómez-Déniz, E.; Venegas, O.; Gallardo, D.I.; Gómez, H.W. An Asymmetric Bimodal Distribution with Application to Quantile Regression. Symmetry 2019, 11, 899. [Google Scholar] [CrossRef] [Green Version]
Gómez, H.J.; Olmos, N.M.; Varela, H.; Bolfarine, H. Inference for a truncated positive normal distribution. Appl. Math. J. Chin. Univ. 2018, 33, 163–176. [Google Scholar] [CrossRef]
Mudholkar, G.S.; Hutson, A.D. The epsilon–skew–normal distribution for analyzing near-normal data. J. Stat. Plan. Inference 2000, 83, 291–309. [Google Scholar] [CrossRef]
Mittelhammer, R.C.; Judge, G.G.; Miller, D.J. Econometric Foundations; Cambridge University Press: New York, NY, USA, 2000. [Google Scholar]
Gallardo, D.I.; Gómez, H.J.; Gómez, Y.M. tpn: Truncated Positive Normal Model and Extensions. R Package Version 1.1. 2021. Available online: https://cran.r-project.org/web/packages/tpn/index.html (accessed on 25 January 2022).
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022; Available online: https://www.R-project.org/ (accessed on 25 January 2022).
Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
Cruz-Medina, I.R.; Olmos, N.M. Almost Nonparametric and Nonparametric Estimation in Mixture Model. Ph.D. Thesis, Pennsylvania State University, State College, PA, USA, 2001. [Google Scholar]
Abdulah, E.K.; Elsalloukh, H. Bimodal Class based on the Inverted Symmetrized Gamma Distribution with Applications. J. Stat. Appl. Probab. 2014, 3, 1–7. [Google Scholar] [CrossRef]
Elal-Olivero, D. Alpha-skew-normal distribution. Proyecciones 2010, 29, 224–240. [Google Scholar] [CrossRef] [Green Version]
Dunn, P.K.; Smyth, G.K. Randomized quantile residuals. J. Comput. Graph. Stat. 1996, 5, 236–244. [Google Scholar]
Magalhães, T.M.; Gómez, Y.M.; Gallardo, D.I.; Venegas, O. Bias reduction for the Marshall-Olkin extended family of distributions with application to an airplane’s air conditioning system and precipitation data. Symmetry 2020, 12, 851. [Google Scholar] [CrossRef]
Magalhães, T.M.; Botter, D.A.; Sandoval, M.C. A general expression for second-order covariance matrices—An application to dispersion models. Braz. J. Probab. Stat. 2021, 35, 37–49. [Google Scholar] [CrossRef]
Cade, B.S.; Noon, B.R. A gentle introduction to quantile regression for ecologists. Front. Ecol. Environ. 2003, 1, 412–420. [Google Scholar] [CrossRef]
Alencar, A.P.; Santos, B.R. Association of pollution with quantiles and expectations of the hospitalization rate of elderly people by respiratory diseases in the city of São Paulo, Brazil. Environmetrics 2014, 25, 165–171. [Google Scholar] [CrossRef]
Gómez, Y.M.; Gallardo, D.I.; Venegas, O.; Magalhães, T.M. An asymmetric bimodal double regression model. Symmetry 2021, 13, 2279. [Google Scholar] [CrossRef]

Figure 1. Pdf for btpn

(σ = 1, λ, ϵ)

with different fixed values for

λ

and varying

ϵ

: (a)

λ = - 2

; (b)

λ = - 0.2

; (c)

λ = 0.4

and; (d)

λ = 1.5

.

Figure 2. (a) Asymmetry coefficient and (b) kurtosis coefficient for btpn

(σ = 1, λ, ϵ)

distribution.

Figure 3. Regions of unimodality and bimodality for the btpn model in terms of

λ

and

η

.

Figure 4. Particular cases for the btpn distribution.

Figure 5. Histogram for the

h e i g h t

data set and the estimated pdf for the btpn, esig and asn models.

Figure 6. Quantile residuals for fitted models: (a) asn, (b) esig, and (c) btpn.

Table 1. Empirical bias, SE, RMSE and 95% CP for the ML estimators of

σ, λ

and

η

in the btpn distribution with different combinations of parameters (case true

σ = 2

).

Table 1. Empirical bias, SE, RMSE and 95% CP for the ML estimators of

σ, λ

and

η

in the btpn distribution with different combinations of parameters (case true

σ = 2

).

True Value			$n = 50$				$n = 100$				$n = 200$
$λ$	$η$	par.	bias	SE	RMSE	CP	bias	SE	RMSE	CP	bias	SE	RMSE	CP
−0.75	−0.5	$σ$	0.894	3.271	8.403	0.814	0.170	1.089	3.165	0.856	0.035	0.514	0.584	0.898
		$λ$	−0.831	3.405	8.561	0.862	−0.132	1.237	3.050	0.891	−0.028	0.638	0.691	0.917
		$η$	−0.018	0.112	0.121	0.940	−0.008	0.078	0.081	0.945	−0.003	0.055	0.057	0.947
	0.75	$σ$	0.736	2.490	7.381	0.802	0.233	1.218	3.497	0.862	0.041	0.538	0.630	0.901
		$λ$	−0.663	2.663	7.325	0.855	−0.205	1.368	3.453	0.897	−0.029	0.660	0.728	0.924
		$η$	0.030	0.142	0.167	0.936	0.014	0.099	0.106	0.945	0.009	0.070	0.073	0.947
1	−0.5	$σ$	−0.057	0.350	0.360	0.887	−0.026	0.248	0.250	0.914	−0.012	0.175	0.176	0.936
		$λ$	0.073	0.406	0.416	0.938	0.035	0.285	0.289	0.946	0.014	0.201	0.201	0.949
		$η$	−0.014	0.086	0.095	0.932	−0.008	0.061	0.064	0.940	−0.003	0.043	0.045	0.944
	0.75	$σ$	−0.055	0.350	0.355	0.884	−0.024	0.248	0.251	0.919	−0.015	0.174	0.176	0.931
		$λ$	0.072	0.406	0.412	0.935	0.035	0.285	0.288	0.943	0.020	0.200	0.203	0.942
		$η$	0.028	0.109	0.127	0.940	0.012	0.076	0.082	0.942	0.006	0.054	0.055	0.943
3	−0.5	$σ$	−0.049	0.203	0.214	0.919	−0.022	0.145	0.152	0.930	−0.013	0.103	0.106	0.936
		$λ$	0.103	0.356	0.383	0.948	0.047	0.248	0.262	0.947	0.028	0.174	0.180	0.948
		$η$	−0.007	0.051	0.054	0.937	−0.003	0.036	0.036	0.949	−0.001	0.025	0.026	0.950
	0.75	$σ$	−0.048	0.203	0.214	0.917	−0.023	0.145	0.148	0.932	−0.012	0.103	0.103	0.946
		$λ$	0.105	0.356	0.385	0.946	0.046	0.248	0.253	0.952	0.024	0.174	0.175	0.953
		$η$	0.011	0.064	0.072	0.933	0.005	0.045	0.048	0.939	0.002	0.032	0.033	0.942

Table 2. Empirical bias, SE, RMSE and 95% CP for the ML estimators of

σ, λ

and

η

in the btpn distribution with different combinations of parameters (case true

σ = 10

).

Table 2. Empirical bias, SE, RMSE and 95% CP for the ML estimators of

σ, λ

and

η

in the btpn distribution with different combinations of parameters (case true

σ = 10

).

True Value			$n = 50$				$n = 100$				$n = 200$
$λ$	$η$	par.	bias	SE	RMSE	CP	bias	SE	RMSE	CP	bias	SE	RMSE	CP
−0.75	−0.5	$σ$	2.279	10.597	19.571	0.802	0.961	5.254	11.695	0.864	0.221	2.538	3.411	0.900
		$λ$	−0.400	2.336	4.023	0.857	−0.161	1.209	2.328	0.896	−0.034	0.632	0.780	0.924
		$η$	−0.020	0.112	0.126	0.938	−0.009	0.079	0.083	0.949	−0.004	0.055	0.056	0.950
	0.75	$σ$	1.748	9.147	17.694	0.797	0.608	5.063	7.312	0.859	0.157	2.496	3.225	0.891
		$λ$	−0.272	2.046	3.498	0.856	−0.093	1.172	1.548	0.895	−0.021	0.623	0.734	0.920
		$η$	0.038	0.143	0.177	0.933	0.013	0.099	0.103	0.951	0.007	0.069	0.072	0.945
1	−0.5	$σ$	−0.225	1.772	1.829	0.887	−0.126	1.237	1.260	0.916	−0.057	0.876	0.891	0.932
		$λ$	0.065	0.408	0.415	0.938	0.037	0.285	0.289	0.940	0.015	0.201	0.205	0.943
		$η$	−0.014	0.086	0.095	0.936	−0.005	0.060	0.063	0.943	−0.002	0.043	0.043	0.952
	0.75	$σ$	−0.284	1.744	1.756	0.886	−0.136	1.236	1.255	0.913	−0.059	0.875	0.869	0.933
		$λ$	0.073	0.405	0.405	0.941	0.037	0.285	0.288	0.943	0.016	0.201	0.201	0.946
		$η$	0.028	0.109	0.131	0.926	0.012	0.076	0.081	0.945	0.008	0.054	0.056	0.946
3	−0.5	$σ$	1.985	2.784	30.165	0.913	1.699	1.824	26.982	0.928	1.113	0.799	22.887	0.932
		$λ$	0.023	0.420	1.320	0.941	−0.032	0.287	1.178	0.947	−0.025	0.184	0.955	0.941
		$η$	−0.004	0.051	0.054	0.934	−0.002	0.036	0.037	0.944	−0.001	0.026	0.026	0.948
	0.75	$σ$	0.386	4.618	13.399	0.918	0.596	1.816	14.121	0.924	0.355	0.929	10.561	0.943
		$λ$	0.071	0.474	0.692	0.954	0.016	0.285	0.680	0.941	0.003	0.188	0.503	0.952
		$η$	0.019	0.065	0.076	0.925	0.015	0.046	0.051	0.935	0.008	0.033	0.034	0.941

Table 3. Descriptive statistics for the

h e i g h t

data set.

Table 3. Descriptive statistics for the

h e i g h t

data set.

Data Set	n	$\bar{X}$	$S^{2}$	$\sqrt{b_{1}}$	$b_{2}$
weight measured	126	0	1	$- 0.05$	$3.053$

Table 4. Estimated parameters and their standard errors (in parentheses) for the btpn, esig and asn models for the

h e i g h t

data set. The AIC and BIC criteria are also presented.

Table 4. Estimated parameters and their standard errors (in parentheses) for the btpn, esig and asn models for the

h e i g h t

data set. The AIC and BIC criteria are also presented.

Estimated	btpn	esig	asn
$σ$	0.813 (0.113)	1.304 (0.148)	0.996 (0.063)
$λ$	0.496 (0.316)	0.527 (0.073)	0.014 (3.422)
$η$	−0.002 (0.048)	0.095 (0.059)	0.014 (3.409)
AIC	360.76	415.79	362.67
BIC	369.27	424.30	375.91

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

A Bimodal Model Based on Truncation Positive Normal with Application to Height Data

Abstract

1. Introduction

2. A Bimodal Truncation Positive Normal Distribution

2.1. Stochastic Representation, pdf and cdf

2.2. Moments and Moment-Generating Function

2.3. Mode and Unimodality and Bimodality Regions

2.4. Particular Cases

3. Inference

3.1. Maximum Likelihood Function

3.2. Computational Aspects

4. Simulation Study

5. Application

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics