A Unimodal/Bimodal Skew/Symmetric Distribution Generated from Lambert’s Transformation

Yuri A. Iriarte; Mário de Castro; Héctor W. Gómez

doi:10.3390/sym13020269

,

and

¹

Departamento de Matemática, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile

²

Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo, São Carlos 13560-095, SP, Brazil

^*

Authors to whom correspondence should be addressed.

Symmetry2021, 13(2), 269;https://doi.org/10.3390/sym13020269

This article belongs to the Section Mathematics

Version Notes

Order Reprints

Abstract

The generalized bimodal distribution is especially efficient in modeling univariate data exhibiting symmetry and bimodality. However, its performance is poor when the data show important levels of skewness. This article introduces a new unimodal/bimodal distribution capable of modeling different skewness levels. The proposal arises from the recently introduced Lambert transformation when considering a generalized bimodal baseline distribution. The bimodal-normal and generalized bimodal distributions can be derived as special cases of the new distribution. The main structural properties are derived and the parameter estimation is carried out under the maximum likelihood method. The behavior of the estimators is assessed through simulation experiments. Finally, two applications are presented in order to illustrate the utility of the proposed distribution in data modeling in different real settings.

Keywords:

bimodality; generalized bimodal distribution; lambert-F generator; shape parameter; skewness

1. Introduction

Analysts often have to deal with data that exhibit bimodality; for example, when observing the size of worker ants in weaver ant colonies [1], the duration of volcanic eruptions [2], the amount of excretion of mercury in urine [3], the grain size in sintered Zirconia [4], or the amount of tropospheric water vapor in the tropics [5].

Two-component mixture distributions are usually used to model data that exhibit bimodality. These distributions have a very flexible density function (unimodal/bimodal), a highly valued feature when trying to model data in different real settings. However, one difficulty in working with them is that it is necessary to deal with the problem of the non-identifiability of their parameters, see McLachlan et al. [6] for more detail.

It is possible to find in the literature guidelines to deal with the problem of non-identifiability in a mixture distribution; for example, as suggested by Aitkin and Rubin [7], it is convenient to assume a certain restriction on the mixing parameter to evaluate the behavior of the maximum likelihood estimates. An alternative that has been considered in various studies is to impose restrictions on the components of the mixture distribution; for example, assuming that the components have the same variance. In this way, sub-families of mixture distributions with a simpler parametric structure (where the parameters are identifiable) are defined.

Taking into account certain restrictions for the components of a mixture distribution implies working with a sub-family that exhibits a less flexible density function than the unrestricted case. However, it is possible to find in the literature sub-families of two-component mixture distributions that have been shown to be useful in data modeling in various real settings. In this context, it is possible to find the generalized bimodal (GB) distribution originally proposed by Rao [8] and later studied as a special case of the bimodal symmetric distribution proposed in Sarma et al. [9].

A random variable X has a generalized bimodal distribution, denoted as

X \sim GB (γ)

, if its probability density function (pdf) is given by

g (x; γ) = \frac{γ + x^{2}}{1 + γ} ϕ (x), x \in R, γ \in [0, 2),

(1)

and its cumulative distribution function (cdf) is given by

G (x; γ) = Φ (x) - \frac{x}{1 + γ} ϕ (x),

(2)

where

γ

is a shape parameter that controls bimodality and

ϕ (\cdot)

and

Φ (\cdot)

denote the pdf and the cdf of the standard normal distribution, respectively.

It is easy to verify that Equation (1) corresponds to the pdf of a two-component mixture distribution with mixing parameter

1 / (1 + γ)

, for

γ \in [0, 2)

, one component having a standard normal distribution, and the other a standard bimodal-normal (BN) distribution. If

γ = 0

, the Lambert generalized bimodal (LGB) distribution reduces to the BN distribution. Details on the structural properties of the BN distribution can be found in Hassan and Hijazi [10] and Elal-Olivero [11]. A class of generalized bimodal distributions that extends the GB distribution can be found in [12]. This class is defined by the cdf F(x) = Φ(x) − α(x) ϕ(x), where α(x) is a linear function of x. Thus, the GB distribution (with cdf given in Equation (2)) can be derived as a special case of the class proposed by [12] when α(x) = x/(1 + γ).

The parametric space bounded to the interval [0,2) for

γ

can be explained by the fact that the pdf given in Equation (1) is bimodal for any value of

γ

in such interval. However, we observe that the range of

γ

can be extended so that

γ

assumes values in the interval

[0, \infty)

. In such case, it is possible to verify the following properties: The GB pdf is symmetric-bimodal when

γ \in [0, 2)

and symmetric-unimodal when

γ \in [2, \infty)

. The GB distribution reduces to the bimodal-normal distribution when

γ = 0

and tends to the standard normal distribution as

γ \to \infty

.

The symmetry characteristic of the GB pdf can be considered a desirable characteristic in the analysis of certain data sets, but a limitation in the analysis of others. In the literature, it is possible to find different construction methods that allow generating an asymmetric distribution from a symmetric baseline distribution, see Azzalini [13], Eugene et al. [14], Cordeiro and de Castro [15], Ferreira and Steel [16], Goerg [17], and Alzaatreh et al. [18], among others.

Recently, Iriarte et al. [19] introduce the Lambert-F distribution generator defined as

F_{X} (x; α) = 1 - [1 - F (x)] α^{F (x)}, α \in (0, e),

(3)

where

α

is an extra shape parameter and

F (\cdot)

is the cdf of an arbitrary baseline distribution.

The transformation given in Equation (3) defines a new family of distributions more flexible in terms of skewness than the baseline distribution. Iriarte et al. [19] study two special cases of Equation (3), extending the classical exponential and Rayleigh distributions and showing that the hazard rate functions induced by this transformation can be understood as modifications in the early times of the baseline hazard rate functions.

In this article, we introduce a new unimodal/bimodal distribution that generalizes the GB distribution and is capable of modeling different levels of skewness. The proposal, called the Lambert generalized bimodal (LGB) distribution, arises from Equation (3) when considering a baseline GB distribution. The result is a new distribution that generalizes to the GB and BN distributions and that can serve as an alternative to other asymmetric bimodal distributions in the literature.

The article is organized as follows. In Section 2, we define the LGB random variable and derive the density and distribution functions. In Section 3, we describe the characteristics of unimodality, bimodality, asymmetry, and kurtosis. In addition, alias distributions are discussed. In Section 4, we consider the problem of parameter estimation using the maximum likelihood (ML) method. In Section 5, the behavior of the ML estimators and the utility of the proposed distribution are evaluated through simulation experiments. Section 6 presents two application examples illustrating the usefulness of the LGB distribution in real settings. Finally, the main conclusions are reported in Section 7.

2. The LGB Distribution

In this section, we introduce the LGB distribution and study some of its main properties.

2.1. LGB Random Variable

Definition 1.

A random variable X follows a Lambert generalized bimodal distribution, with location parameter

μ \in R

, scale parameter

σ > 0

and shape parameters

γ \in [0, 2)

, and

α \in (0, e)

, denoted as

X \sim LGB (μ, σ, γ, α)

, if it can be represented as

X = {\begin{matrix} σ Q_{G B} [\frac{1}{log (α)} W_{0} (\frac{log (α) (U - 1)}{α}) + 1; γ] + μ, if α \in {(0, 1) \cup (1, e)}, \\ σ Q_{G B} (U; γ) + μ, if α = 1, \end{matrix}

(4)

where

W_{0} (\cdot)

is the principal branch of the Lambert W function,

Q_{G B} (\cdot; \cdot)

is the standard generalized bimodal quantile function, and U is a uniform(0,1) random variable.

Remark 1.

The quantile function of the GB distribution does not have a closed analytical form. However, it can be calculated numerically from the cdf given in Equation (2).

Proposition 1.

Let

X \sim LGB (μ, σ, γ, α)

. Then, the cdf of X is given by

F_{X} (x; μ, σ, γ, α) = 1 - [1 - Φ (z) + \frac{z}{γ + 1} ϕ (z)] α^{Φ (z) - \frac{z}{γ + 1} ϕ (z)}, z = \frac{x - μ}{σ}, x \in R,

(5)

Proof.

From the Equation (4), for

α \neq 1

, we have that

F_{X} (x; μ, σ, γ, α) = P (X \leq x) = P (W_{0} [\frac{log (α) (U - 1)}{α}] \leq log (α) [G_{G B} (z; γ) - 1]),

where

G_{G B} (z; γ) = Φ (z) - \frac{z}{γ + 1} ϕ (z)

is the inverse function of

Q_{G B} (\cdot; γ)

, that is, the standard GB cdf. Then, by definition of the Lambert W function, it follows that

\begin{matrix} P (X \leq x) & = P (\frac{log (α) (U - 1)}{α} \leq log (α) [Φ (z) - \frac{z}{γ + 1} ϕ (z) - 1] \\ \times exp {- log (α) [Φ (z) - \frac{z}{γ + 1} ϕ (z) - 1]}) \\ = P (U \leq 1 - [1 - Φ (z) + \frac{z}{γ + 1} ϕ (z)] α^{Φ (z) - \frac{z}{γ + 1} ϕ (z)}), \end{matrix}

and the result is obtained considering that

P (U \leq u) = u

, as

U \sim uniform (0, 1)

. Finally, note that the analytical expression obtained for the cdf of X is valid for

α = 1

once

F_{X} (x; μ, σ, η, 1) = Φ (z) - \frac{z}{γ + 1} ϕ (z)

. □

The pdf of X can be obtained in a straightforward way from the cdf given in Equation (5).

Corollary 1.

Let

X \sim LGB (μ, σ, η, α)

. Then, the pdf of X is given by

f_{X} (x; μ, σ, γ, α) = \frac{γ + z^{2}}{σ (γ + 1)} ϕ (z) α^{Φ (z) - \frac{z}{γ + 1} ϕ (z)} {1 - log (α) [1 - Φ (z) + \frac{z}{γ + 1} ϕ (z)]} .

(6)

In accordance with Definition 1, note that Equations (5) and (6) reduce, respectively, to the cdf and pdf of the GB distribution when

α = 1

. Consequently, the LGB distribution can be understood as an extension with one extra parameter of the GB distribution. An interesting property of the LGB distribution is that its pdf corresponds to a modification in a multiplicative fashion of the GB pdf. If

α \in (0, 1)

or

α \in (1, e)

, the GB pdf is modified in a multiplicative fashion by the expression

α^{Φ (z) - \frac{z}{γ + 1} ϕ (z)} {1 - log (α) [1 - Φ (z) + \frac{z}{γ + 1} ϕ (z)]}

, allowing asymmetric shapes for the LGB pdf.

Figure 1 shows some pdf curves for the LGB distribution considering different values of

γ

and

α

. Here, it can be seen that the LGB pdf is bimodal symmetric when

α = 1

. Note that the parameter

α

has an effect on the shape of the LGB pdf allowing unimodal or bimodal asymmetric shapes. This will be discussed in more detail in Section 3.

Figure 1. Pdf curves for the LGB distribution for

μ = 5

,

σ = 2

, and

α = 1

in the top left panel;

μ = - 5

,

σ = 4

, and

α = 0.5

in the top right panel;

μ = 5

,

σ = 4

, and

α = 2

in the bottom left panel; and

μ = 10

,

σ = 5

, and

γ = 1.5

in the bottom right panel.

The quantile function (qf) of the LGB distribution can be easily derived by inverting Equation (5), considering steps very similar to those of the proof of Proposition 1. The resulting analytical expression for this function is given by

Q_{X} (u; μ, σ, γ, α) = {\begin{matrix} σ Q_{G B} [\frac{1}{log (α)} W_{0} (\frac{log (α) (u - 1)}{α}) + 1; γ] + μ, if α \in {(0, 1) \cup (1, e)}, \\ σ Q_{G B} (u; γ) + μ, if α = 1 . \end{matrix}

(7)

As the Lambert W function is included in different statistical software, Equation (7) can be easily computed. A code in the R programming language [20] is provided in Appendix A.

2.2. Related Distributions

In the previous section, it was shown that the LGB and GB distributions are nested distributions, with the GB distribution being a special case of the LGB distribution. Now, taking into account that the BN distribution is a special case of the GB distribution, we observe that the LGB distribution reduces to the BN distribution when

γ = 0

and

α = 1

. Consequently, a new special case can be highlighted. If

γ = 0

, the pdf given in Equation (6) reduces to

f_{X} (x; μ, σ, α) = \frac{z^{2}}{σ} ϕ (z) α^{Φ (z) - z ϕ (z)} {1 - log (α) [1 - Φ (z) + z ϕ (z)]} .

(8)

The pdf given in Equation (8) is bimodal symmetric or bimodal asymmetric depending on the value of

α

. If

α = 1

, then Equation (8) reduces to the BN pdf. We refer to Equation (8) as the Lambert-bimodal normal (LBN) pdf. It is important to note that, unlike the LGB distribution, the pdf given in Equation (8) valued in the antimode is equal to 0 for all

μ \in R

,

σ > 0

, and

α \in (0, e)

.

We observe that the parameter

α

induced by the Lambert-F transformation plays a determining role in the skewness of the LBN distribution. We also see that the LBN pdf can be understood as a modification in a multiplicative fashion of the BN pdf.

3. Shapes and Aliases

In this section, we discuss the forms of LGB distribution and analyze the possible existence of alias distributions for members of this LGB family.

3.1. Shapes

Let

f_{X}^{(s)} (x) = \partial^{s} f_{X} (x; μ, σ, γ, α) / \partial x^{s}

be the s-th derivative of the LGB pdf with respect to x,

s = 1, 2

, we have

\begin{matrix} f_{X}^{(1)} (x) & = & α^{z_{2}} [log (α) z_{1}^{2} (1 + z_{3}) + z_{3} z_{1}^{(1)}] and \\ f_{X}^{(2)} (x) & = & α^{z_{2}} {{log}^{2} (α) z_{1}^{3} (2 + z_{3}) + 3 log (α) z_{1} z_{1}^{(1)} (1 + z_{3}) + [1 - log (α) (1 + z_{2})] z_{1}^{(2)}}, \end{matrix}

where

z_{1} = \frac{γ + z^{2}}{σ (1 + γ)} ϕ (z), z_{2} = Φ (z) - \frac{z}{1 + γ} ϕ (z), z_{3} = 1 - log (α) (1 - z_{2}), z_{4} = 1 - log (α) (1 + z_{2}),

z_{1}^{(1)} = \frac{\partial z_{1}}{\partial x} = \frac{z (2 - γ - z^{2})}{σ^{2} (1 + γ)} ϕ (z) and z_{1}^{(2)} = \frac{\partial^{2} z_{1}}{\partial x^{2}} = \frac{2 + z^{4} - (5 - γ) z^{2} - γ}{σ^{3} (1 + γ)} ϕ (z) .

Thus, if

X \sim LGB (μ, σ, γ, α)

, and if

(x^{*}, f_{X} (x^{*}; γ, α))

and

(x^{* *}, f_{X} (x^{* *}; γ, α))

are, respectively, critical points and inflection points of the pdf of X, then

x^{*}

is a root of the equation

σ log (α) (γ + z^{2}) (1 + z_{3}) z_{1} + z (2 - γ - z^{2}) z_{3} = 0,

(9)

and

x^{* *}

a root of the equation

σ^{2} {log}^{2} (α) (γ + z^{2}) (2 + z_{3}) z_{1}^{2} + 3 σ log (α) z (2 - γ - z^{2}) (1 + z_{3}) z_{1} + [2 + z^{4} - (5 - γ) z^{2} - γ] z_{4} = 0 .

(10)

In the case

α = 1

, Equations (9) and (10) lead to establish that the LGB pdf is bimodal with modes given by

μ \pm σ \sqrt{2 - γ}

, antimode given by

μ

, and abscissa of inflection points given by

μ \pm σ \sqrt{w_{j}}

, with

j = 1, 2

, where

w_{1} = [(5 - γ) + \sqrt{{(5 - γ)}^{2} - 4 (2 - γ)}] / 2

and

w_{2} = [(5 - γ) - \sqrt{{(5 - γ)}^{2} - 4 (2 - γ)}] / 2

.

In the case where

α \neq 1

, it is not possible to obtain closed expressions for the modes, antimode, and abscissa of inflection points. Therefore, these values must be obtained by solving Equations (9) and (10) by numerical procedures. Considering Equations (9) and (10) as functions of x, say

w (x)

and

v (x)

, respectively, we observe that

lim_{x \to \pm \infty} w (x) = [1 - log (α) {1 - lim_{x \to \pm \infty} Φ (z)}] (2 - γ - lim_{x \to \pm \infty} z^{2}) lim_{x \to \pm \infty} z = \mp \infty,

which means that Equation (9) has at least one root, associated with the unimodal case. Thus, the above also implies that the Equation (10) has at least two roots and, taking into account that

lim_{x \to \pm \infty} v (x) = [2 - γ + (lim_{x \to \pm \infty} z^{2} - 5 + γ) lim_{x \to \pm \infty} z^{2}] {1 - log (α) [1 + lim_{x \to \pm \infty} Φ (z)]} = \infty,

it follows that Equation (10) has at least one negative minimum value at

x = x_{0}

, where

x_{0}

satisfies the equation

\begin{matrix} 0 & = & σ^{3} {log}^{2} (α) (γ + z^{2}) z_{1}^{3} + 2 σ^{2} {log}^{2} (α) z (3 - γ - z^{2}) (2 + z_{3}) z_{1} - \frac{1}{log (α)} [2 (5 - γ) z - 4 z^{3}] z_{4} \\ + 3 (2 - γ - z^{2}) [σ^{2} log (α) z (γ + z^{2}) z_{1} + \frac{2}{1 + γ} z^{2} (1 + z_{3}) ϕ (z)] - σ [2 - γ - (5 - γ) z^{2} + z^{4}] z_{1} . \end{matrix}

Figure 2 shows a good summary of the shapes that the LGB pdf can take. In this figure, profiles are presented for the equations of critical points and inflection points, Equations (9) and (10), together with the corresponding LGB pdf. As in Figure 1 and Figure 2, it is observed that the LGB pdf can be unimodal or bimodal depending on the values of

γ

and

α

, but here it is also observed that a unimodal LGB pdf can have two or four inflection points. In this sense, Equation (9) has one or three roots (depending on whether the pdf is unimodal or bimodal) and Equation (10) has two or four roots (four roots when the pdf is bimodal and two or four roots when the pdf is unimodal). In Figure 3, the regions of unimodality and bimodality established in the plane defined by the ranges of

γ

and

α

are presented. This figure was drawn by solving Equation (9) using the uniroot.all function [21] in the R language [20].

Figure 2. Critical and inflection points and pdf curve for the distributions Lambert generalized bimodal (LGB) (5,2,1.5,0.005) (black curves), LGB (5,2,1.5,0.5) (red curves), LGB (5,2,0.5,0.5) (green curves), and LGB (5,2,0.5,1) (blue curves).

Figure 3. Regions of unimodality (white region) and bimodality (gray region) for a LGB distribution.

3.2. Skewness and Kurtosis

Next, the skewness behavior of the LGB distribution is described. The behavior of the kurtosis is also described in the case in which the LGB distribution is unimodal. In this case, first the raw moments of the LGB distribution are derived and later the Fisher’s skewness and kurtosis coefficients are analyzed.

Proposition 2.

Let

Z \sim LGB (0, 1, γ, α)

and

X \sim LGB (μ, σ, γ, α)

. Then, for

r = 1, 2, \dots

, the r-th raw moment of Z and X are given by

E (Z^{r}) = a_{r} (γ, α)

and

E (X^{r}) = \sum_{k = 0}^{r} (\binom{r}{k}) μ^{r - k} σ^{k} a_{k} (γ, α),

(11)

where

a_{r} (γ, α) = \int_{0}^{1} {[Q_{G B} (u; γ)]}^{r} α^{u} [1 - log (α) (1 - u)] d u .

(12)

Proof.

Under the change of variable

u = Φ (z) - \frac{z}{γ + 1} ϕ (z)

, the r-th moment of Z is given by

\begin{matrix} E (Z^{r}) = a_{r} (γ, α) & = \int_{- \infty}^{\infty} z^{r} \frac{γ + z^{2}}{γ + 1} ϕ (z) α^{Φ (z) - \frac{z}{γ + 1} ϕ (z)} {1 - log (α) [1 - Φ (z) + \frac{z}{γ + 1} ϕ (z)]} d z \\ = \int_{0}^{1} {[Q_{G B} (u; η)]}^{r} α^{u} [1 - log (α) (1 - u)] d u, \end{matrix}

which is the result given in Equation (12). Now, as X can be represented as

X = μ + σ Z

, the result in Equation (11) is obtained as

E (X^{r}) = E [{(μ + σ Z)}^{r}] = \sum_{k = 0}^{r} (\binom{r}{k}) μ^{r - k} σ^{k} E (Z^{k})

. □

Corollary 2.

The mean and the variance of X are given by

E (X) = μ + σ a_{1} (γ, α)

and

V a r (X) = σ^{2} [a_{2} (γ, α) - a_{1}^{2} (γ, α)]

, respectively.

Corollary 3.

The skewness (

β_{1} (γ, α)

) and kurtosis (

β_{2} (γ, α)

) coefficients of X are given by

\begin{matrix} β_{1} (γ, α) & = \frac{E {{[X - E (X)]}^{3}}}{{[V a r (X)]}^{3 / 2}} = \frac{a_{3} (γ, α) - 3 a_{1} (γ, α) a_{2} (γ, α) + 2 a_{1}^{3} (γ, α)}{{[a_{2} (γ, α) - a_{1}^{2} (γ, α)]}^{3 / 2}} a n d \\ β_{2} (γ, α) & = \frac{E {{[X - E (X)]}^{4}}}{{[V a r (X)]}^{2}} = \frac{a_{4} (γ, α) - 4 a_{1} (γ, α) a_{3} (γ, α) + 6 a_{1}^{2} (γ, α) a_{2} (γ, α) - 3 a_{1}^{4} (γ, α)}{{[a_{2} (γ, α) - a_{1}^{2} (γ, α)]}^{2}} . \end{matrix}

Note that the r-th raw moment of the LGB distribution should be computed using numerical integration. Table 1 presents some values for the first four raw moments of the LGB distribution. Figure 4 (left panel) presents some curves for the skewness coefficient of the LGB distribution considering different values of γ; and α. In the right panel of the same figure, some curves for the kurtosis coefficient are presented when the LGB pdf is unimodal.

Table 1. Some values for the first four raw moments of the LGB distribution considering different values of γ and α.

Figure 4. Plots of the skewness and kurtosis coefficients of the LGB distribution.

In Table 1, we observe that the odd raw moments are 0 when α = 1, an expected result because in this case the LGB distribution is symmetric around μ. In the Figure 4, we observe that

The LGB pdf is symmetric when α = 1, regardless of the value assumed by the parameter γ.
The LGB distribution can be skewed (positively or negatively) depending on the value assumed by α. If $α \in (0, 1)$ or $α \in (1, e)$ , then the LGB pdf is skewed and the skewness is also controlled by the parameter γ. However, the effect of γ on skewness is important when it assumes small values.
If the LGB pdf is unimodal, then it is asymmetric.
In the unimodal case, the excess kurtosis $β_{2} (γ, α) - 3$ is less than 0; that is, the LGB distribution is a platykurtic distribution.

In the case where

E (X) > Median (X)

, we observe from Equations (7) and (11) that the skewness of the LGB distribution is positive if γ and α satisfy the inequality

1 / 2 < [1 - Φ (a_{1} (γ, α)) + \frac{a_{1} (γ, α)}{1 + γ} ϕ (a_{1} (γ, α))] α^{Φ (a_{1} (γ, α)) - \frac{a_{1} (γ, α)}{1 + γ} ϕ (a_{1} (γ, α))} .

On the other hand, values of γ and α (

α \neq 1

) that do not satisfy the previous inequality are related to a negative skewness.

Based on these last results, together with the results of Section 3.1, we observe that the LGB distribution can be considered as a feasible alternative for modeling data exhibiting asymmetry and uni/bimodality. In this way, the LGB distribution can be used in the analysis of random phenomena raised in various areas of knowledge. Below, we describe some situations in which the LGB distribution could be used: (1) In entomology, volcanology, medicine, materials science, and climatology, according to the scenarios exposed at the beginning of Section 1. (2) In meteorology; for example, when analyzing wind speed data in certain geographic regions that tend to have two speed spikes per day. (3) In astronomy; for example, when analyzing data associated with certain solar wind parameters. Studies based on annual measurements have shown that the distribution of solar wind speed (as well as those of other parameters such as proton density, temperature and magnetic field) can exhibit bimodality. (4) In population dynamics. It is well known that the interactions between the characteristics of the individuals that make up a certain population can produce changes in the distribution of the size of the individuals, including changes that lead to a bimodal distribution.

3.3. Alias Distributions

Previously, it was noted that the original range for the parameter γ of the GB distribution can be extended so that γ assumes values in the interval [0,∞). In this way, the normal distribution can be obtained as a limiting case of the GB distribution. Despite this interesting property, we have considered the range bounded to the interval [0,2) for the parameter γ of the LGB distribution. The rationale for this is to avoid alias distributions for some members of the LGB distribution, that is, avoid distributions that are virtually identical in relation to minimizing the Kullback–Leibler divergence [22]. The Kullback–Leibler divergence measures the degree of divergence between the distributions of two random variables. For two LGB, random variables consider the following proposition.

Proposition 3.

Let

X_{1} \sim LGB (θ_{1})

and

X_{2} \sim LGB (θ_{2})

, where

θ_{j} = {(μ_{j}, σ_{j}, γ_{j}, α_{j})}^{'}

,

j = 1, 2

. Then, the Kullback–Leibler divergence is given by

K (X_{1}, X_{2}) = log (σ_{2} / σ_{1}) + \int_{- \infty}^{\infty} log {\frac{f (u; 0, 1, γ_{1}, α_{1})}{f (h (u); 0, 1, γ_{2}, α_{2})}} f (u; 0, 1, γ_{1}, α_{1}) d u

(13)

where

h (u) = (u σ_{1} + μ_{1} - μ_{2}) / σ_{2}

and

f (\cdot; \cdot, \cdot, \cdot, \cdot)

is as in Equation (6).

Proof.

From the definition of the Kullback–Leibler divergence, we obtain

K (X_{1}, X_{2}) = E [log {\frac{f (x; θ_{1})}{f (x; θ_{2})}}],

where the expectation is taken with respect to

X_{1}

. Thus, the result is obtained by considering the change of variable

u = z_{1} = (x - μ_{1}) / σ_{1}

, once

z_{2} = (x - μ_{2}) / σ_{2}

can be written as

z_{2} = (z_{1} σ_{1} + μ_{1} - μ_{2}) / σ_{2}

. □

Considering

θ_{1}

known, minimizing Equation (13) with respect to

θ_{2}

is equivalent to solving a nonlinear system of four equations, see Appendix B.

We observe that the existence of alias distributions in the LGB distributions family is related to its skewness behavior when considering that γ can also assume values greater than or equal to 2. In the case

γ \geq 2

, it can be verified that the LGB family is unimodal regardless of the value of α. In Figure 4 (left panel), it is observed that the skewness of the LGB distribution is positive for those values of

α \in (0, 1)

. However, we observe that the skewness can become negative if α assumes a value close to 0, once γ has assumed a value greater than 2 that is large enough. This favors the existence of alias distributions for certain unimodal members of the family, as it suggests that a negative skewness value may be associated with two different values of α. Under this scenario, it must be taken into account that this is very serious when both members of the family are unimodal, as it would be enough to make an adequate consideration of the locations and scales so that the members are virtually identical.

To exemplify the above, consider the LGB family member specified by

μ = 0

,

σ = 1

,

γ = 1000

, and

α = 1.6

, which has a unimodal shape and slightly negative skewness. Now, consider also a LGB distributions subfamily where μ and σ are unknown:

γ = 1000

and

α = 0.001

, respectively. In this scenario, the members of the subfamily have unimodal shape and the same skewness value as the fully specified distribution. In Figure 5, we graphically represent the behavior of the skewness for the fully specified distribution (in blue color) and for the LGB subfamily (in red color). In this figure (left panel), it is observed that the skewness is negative only for

α = 1.6 \in (1, e)

when

γ \in [0, 2)

, while the skewness can be negative for both values of α (0.001 and 1.6) if γ is sufficiently large (center panel). Notice that both skewness curves tend to

- 0.136472

as

γ \to \infty

. Therefore, among the members of the LGB distributions subfamily, it is possible to find an alias distribution for the fully specified distribution by considering certain values for the location and scale parameters. These values are

μ = 2.360

and

σ = 1.487

, which are obtained by minimizing the Kullback–Leibler divergence given in Equation (13), where the minimum value of

K (X_{1}, X_{2})

is 0.0008. The right panel of Figure 5 shows the pdf curves for the LGB(

0, 1, 1000, 1.6

) distribution (black color) and for some members of the LGB family with

μ = 2.360

,

σ = 1.487

,

α = 0.001

, and different values of γ. Notice that the pdfs represented by the dashed lines approach the black curve as γ grow toward the value 1000, so that the distributions LGB (

0, 1, 1000, 1.6)

and LGB (

2.360, 1.487, 1000, 0.001

) are virtually identical. Figure 6 shows the Kullback–Leibler divergence curve as functions of

μ_{2}

and

σ_{2}

for

X_{1} \sim LGB (0, 1, 1000, 1.6)

and

X_{2} \sim LGB (2.360, 1.487, 1000, 0.001)

, where it can be seen that the Kullback–Leibler divergence is minimized precisely at

μ_{2} = 2.360

and

σ_{2} = 1.487

.

Figure 5. Top and center panels: Skewness curves for two LGB distributions with

α = 1.6

in blue color and

α = 0.001

in red color. Right panel: Pdf curves for an LGB (0,1,1000,1.6) distribution (in black color) and four LGB distributions specified by

μ = 2.360

,

σ = 1.487

,

γ = 0.001

, and different values of

γ

.

Figure 6. Kullback–Leibler divergence curve for

X_{1} \sim LGB (0, 1, 1000, 1.6)

and

X_{2} \sim LGB (2.360, 1.487, 1000, 0.001)

as a function of

μ_{2}

in the left panel and as a function of

σ_{2}

in the right panel.

Returning to Figure 5 (right panel), it is clearly observed that those pdfs related to a value of

γ

less than 2 are not alias distributions of the LGB (0,1,1000,1.6) distribution. Finally, based on the minimization of Equation (13), we analyze the existence of alias distributions in scenarios where

γ

is less than 2 and in others where it is greater than 2. In each scenario,

X_{1} \sim LGB (θ_{1})

and

X_{2} \sim LGB (θ_{2})

,

θ_{j} = {(μ_{j}, σ_{j}, γ_{j}, α_{j})}^{'}

,

j = 1, 2

, where

θ_{1}

is known and

θ_{2} \neq θ_{1}

minimizes Equation (13). Specifically, the scenarios considered are the following: Scenario A:

θ_{1} = {(0, 1, 1.5, 0.5)}^{'}

,

θ_{2} = {(2.568, 1.719, 1.462, 0.002)}^{'}

, where

K (X_{1}, X_{2}) = 0.029

. Scenario B:

θ_{1} = {(0, 1, 0.5, 0.2)}^{'}

,

θ_{2} = {(2.600, 1.891, 0.999, 0.001)}^{'}

, where

K (X_{1}, X_{2}) = 0.091

. Scenario C:

θ_{1} = {(0, 1, 1, 1.5)}^{'}

,

θ_{2} = {(3.686, 1.896, 1.999, 0.001)}^{'}

, where

K (X_{1}, X_{2}) = 0.054

. Scenario D:

θ_{1} = {(0, 1, 1, 1.5)}^{'}

,

θ_{2} = {(4.302, 1.631, 1.999, 0.002)}^{'}

, where

K (X_{1}, X_{2}) = 0.039

. Scenario E:

θ_{1} = {(0, 1, 100, 2)}^{'}

,

θ_{2} = {(2.274, 1.334 . 20, 002, 0.003)}^{'}

, where

K (X_{1}, X_{2}) = 0.001

. Scenario F:

θ_{1} = {(0, 1, 50, 0.2)}^{'}

,

θ_{2} = {(0.488, 1.020, 5.010, 0.043)}^{'}

, where

K (X_{1}, X_{2}) = 0.0002

.

Table 2 shows the values of the mean, variance, skewness, number of critical points, and inflection points for the distributions of the random variables

X_{1}

and

X_{2}

in each scenario. Here, we observe that in scenarios E and F (when

γ \geq 2

) the values for the two distributions are very similar, so that the distributions of

X_{1}

and

X_{2}

are virtually identical. In contrast, in scenarios A to D (when

γ < 2

) the values differ especially in the number of critical points and inflection points, and therefore the distribution of

X_{2}

is not an alias of the distribution of

X_{1}

. Graphical comparisons of the pdf’s of

X_{1}

and

X_{2}

can be seen in Appendix C.

Table 2. Mean, variance, skewness, and amounts of critical points and inflection points of the pdf for the distributions of

X_{1}

and

X_{2}

in scenarios A to F.

4. Maximum Likelihood Estimator

In this section, we deal with the problem of parameter estimation in the LGB distribution under the maximum likelihood (ML) method.

If

X \sim LGB (μ, σ, γ, α)

, then (for

θ = (μ, σ, γ, α)

’) the log-likelihood is given by

\begin{matrix} ℓ (θ; x) & = & - log (1 + γ) - log (σ) + log [ϕ (z)] + log (γ + z^{2}) + log (α) Φ (z) \\ - log (α) \frac{z}{1 + γ} ϕ (z) + log [H (z; γ, α)], \end{matrix}

(14)

where

H (z; γ, α) = 1 - log (α) [1 - Φ (z) + \frac{z}{1 + γ} ϕ (z)] .

Thus, the elements of the score vector are

\begin{matrix} \frac{\partial ℓ (θ; x)}{\partial μ} & = & \frac{z}{σ} - \frac{2 z}{σ (γ + z^{2})} - \frac{γ log (α) ϕ (z)}{σ (1 + γ)} - \frac{log (α) z^{2} ϕ (z)}{σ (1 + γ)} - \frac{log (α) (γ + z^{2}) ϕ (z)}{σ (1 + γ) H (z; γ, α)}, \\ \frac{\partial ℓ (θ; x)}{\partial σ} & = & - \frac{1}{σ} + \frac{z^{2}}{σ} - \frac{2 z^{2}}{σ (γ + z^{2})} - \frac{γ log (α) z ϕ (z)}{σ (1 + γ)} - \frac{log (α) z^{3} ϕ (z)}{σ (1 + γ)} - \frac{log (α) z (γ + z^{2}) ϕ (z)}{σ (1 + γ) H (z; γ, α)}, \\ \frac{\partial ℓ (θ; x)}{\partial γ} & = & - \frac{1}{1 + γ} + \frac{1}{γ + z^{2}} + \frac{log (α) z ϕ (z)}{{(1 + γ)}^{2}} + \frac{log (α) z ϕ (z)}{{(1 + γ)}^{2} H (z; γ, α)}, \\ \frac{\partial ℓ (θ; x)}{\partial α} & = & - \frac{z ϕ (z)}{α (1 + γ)} + \frac{Φ (z)}{α} - \frac{1 - Φ (z) + [z / (1 + γ)] ϕ (z)}{α H (z; γ, α)} . \end{matrix}

For a random sample

X_{1}, \dots, X_{n}

from

X \sim LGB (μ, σ, γ, α)

, we observe that the ML estimator

{\hat{θ}}_{M L} = ({\hat{μ}}_{M L}, {\hat{σ}}_{M L}, {\hat{γ}}_{M L}, {\hat{α}}_{M L})

’ for

θ = (μ, σ, γ, α)

’ cannot be expressed in closed form. The solution of the likelihood equations gives rise to a system of four nonlinear equations (See Appendix D) that must be solved with the help of some computational routine in search of ML estimates.

In this case, as the ML estimators do not have a closed form, a good alternative to obtain ML estimates is to solve the following optimization problem,

max_{θ} Σ_{i = 1}^{n} ℓ (θ; x_{i}), subject to μ \in R, σ > 0, γ \in [0, 2), α \in (0, e),

(15)

where

ℓ (θ; x)

is given in Equation (14). We solved (15) using the function optim of the R language [20] and, specifically, the L-BFGS-B algorithm [23] was applied. This algorithm requires the declaration of a feasible starting point in the parametric space to start the iterative process. Considering that the bimodal-normal distribution is a special case of the LGB distribution, we verify through simulation experiments that (

\bar{x}, s_{x}, 0, 1

), where

\bar{x}

is the mean of the observations and

s_{x}

the corresponding standard deviation, is a good starting point.

Under regularity conditions, the asymptotic distribution of

({\hat{θ}}_{ML} - θ)

is

N_{4} (0, K {(θ)}^{- 1})

, where

K (θ)

is the expected information matrix. As the function

\sum_{i = 1}^{n} ℓ (θ; x_{i})

is not simple, it is not easy to obtain the analytical expression of this matrix. However, we obtain an approximation from the observed information matrix, whose elements are computed as minus the second partial derivatives of the log-likelihood function with respect to all the parameters (evaluated at the ML estimates). Thus, for a random sample

X_{1}, \dots, X_{n}

from

X \sim LGB (μ, σ, γ, α)

, the observed information matrix is given by

\begin{matrix} I_{n} (θ) = & (\begin{matrix} ε_{μ μ} & ε_{σ μ} & ε_{γ μ} & ε_{α μ} \\ ε_{σ σ} & ε_{γ σ} & ε_{α σ} \\ ε_{γ γ} & ε_{α γ} \\ ε_{α α} \end{matrix}), \\ ε_{θ_{r} θ_{p}} = & - \sum_{i = 1}^{n} \frac{\partial^{2} ℓ (θ; x_{i})}{\partial θ_{r} θ_{p}} |_{θ = {\hat{θ}}_{ML}}, r = p = 1, 2, 3, 4, \end{matrix}

(16)

with

θ_{1} = μ

,

θ_{2} = σ

,

θ_{3} = γ

and

θ_{4} = α

, where the analytical expressions of the second partial derivatives are presented in Appendix E.

5. Simulation Studies

In the analysis of data exhibiting bimodality, it is common to use a two-component mixture normal (MN) distribution. In this section, we initially carry out a simulation study to evaluate the behavior of the ML estimators of the LGB distribution parameters. Subsequently, we conducted a second simulation study in order to evaluate the usefulness of the LGB distribution in a context where the MN distribution performs well.

5.1. First Simulation Study

In this study, 1000 random samples from the LGB distribution were generated considering the sample sizes n = 100, 200, 300, 500, and 1000 in the following two scenarios:

Scenario A: $μ = 5$ , $σ = 2$ , $γ = 0.5$ and $α = 0.5$ .
Scenario B: $μ = - 5$ , $σ = 4$ , $γ = 0.75$ and $α = 1.5$ .

Simulated random samples were generated using the qf given in Equation (7). The LambertW package [24] in the R language was used to compute the principal branch of the Lambert W function. A code in the R language is provided in Appendix A.

For each simulated sample, we obtain the ML estimates by solving (15) under the considerations mentioned in Section 4. Table 3 reports the average estimate (AE), the empirical standard deviation (SD), and the root of the mean square error (RMSE) for the 1000 estimates obtained in each scenario and sample size considered. Table 4 reports the average of the asymptotic standard error (SE) for the ML estimates along with the coverage probability (CP) of the 95% asymptotic confidence intervals.

Table 3. Averages (AE), standard deviations (SD), and root of the simulated mean square errors (RMSE) for the estimates of

μ

,

σ

,

γ

, and

α

of the LGB distribution.

Table 4. Averages of asymptotic standard errors (SE) and coverage probabilities (CP) for the estimates of

μ

,

σ

,

γ

, and

α

for the LGB distribution.

Table 3 indicates that the AEs tend to be close to the true values of the parameters as the sample size increases. The SDs and RMSEs are close and decrease towards 0 as the sample size increases, as expected in the standard asymptotic theory. Table 4 indicates that the SEs are close to the SDs and RMSEs given in Table 3. As expected, the SEs decrease towards 0 and the CPs converge to the nominal values used to construct the confidence intervals as the sample size increases.

5.2. Second Simulation Study

In the first place, 1000 random samples from MN distribution were generated considering the sample sizes n = 50, 100, 200, 300 and the following four scenarios: Scenario A,

μ_{1} = - 5

,

σ_{1} = 2

,

μ_{2} = 2

,

σ_{2} = 2

, and

α = 0.3

. Scenario B,

μ_{1} = - 20

,

σ_{1} = 18

,

μ_{2} = 50

,

σ_{2} = 20

, and

α = 0.6

. Scenario C,

μ_{1} = 20

,

σ_{1} = 25

,

μ_{2} = - 20

,

σ_{2} = 20

and

α = 0.5

. Scenario D,

μ_{1} = - 20

,

σ_{1} = 10

,

μ_{2} = 20

,

σ_{2} = 30

, and

α = 0.2

.

For each simulated sample, the LGB and MN distributions are fitted via the ML method using the optim function in R language. Subsequently, based on the Akaike Information Criterion (AIC) [25], Corrected Akaike Information Criterion (CAIC) [26], and Bayesian Information Criterion (BIC) [27], the proportions where the AIC, CAIC, and BIC values are lower in the LGB distribution are calculated. We call this the hit rate for the LGB distribution. In addition, the modified Cramer–von Mises (

W^{*}

) and Anderson–Darling (

A^{*}

) statistics [28] are calculated for the LGB distribution in order to test the hypothesis

H_{0} : X_{1}, \dots, X_{n}

is a random sample from a LGB population, where the parameters have been estimated by the ML method. Thus, we calculate the rate of simulated samples where

H_{0}

is not rejected, which we call the non-rejection rate.

Finally, considering a procedure analogous to the one described above, we simulate random samples from the LGB distribution and calculate the hit and non-rejection rates for the MN distribution. The scenarios considered here are the following: Scenario A,

μ = - 2.5

,

σ = 2.8

,

γ = 0.2

, and

α = 2.5

. Scenario B,

μ = 15

,

σ = 22

,

γ = 0.2

, and

α = 0.5

. Scenario C,

μ = 14

,

σ = 22.4

,

γ = 1.0

, and

α = 0.3

. Scenario D,

μ = - 20

,

σ = 30

,

γ = 1.5

, and

α = 1.8

.

Table 5 and Table 6 report the hit and non-rejection rates for the LGB and MN distributions, respectively. In Table 5, we observe that the non-rejection rates are high, which means that a considerable proportion of samples generated from the MN distribution can be appropriately fitted with the LGB distribution. On the other hand, we observe that the hit rates are high, exceeding the value 0.5, even in moderate sample sizes,

n = 300

. Note that the non-rejection rates decrease considerably in scenario D as the sample size increases and that the hit rates are lower than the other scenarios. This is because the samples are generated from a MN population where the scales

σ_{1}

and

σ_{2}

are considerably different. This shows that the LGB distribution can perform well in settings where the MN distribution is used and where the estimates for

σ_{1}

and

σ_{2}

in this distribution are similar.

Table 5. Non-rejection rate based on modified statistics

W^{*}

and

A^{*}

and hit rates based on the AIC, CAIC, and BIC for the LGB distribution, when the data are simulated from the MN distribution.

Table 6. Non-rejection rate based on modified statistics

W^{*}

and

A^{*}

and hit rates based on the AIC, CAIC, and BIC for the MN distribution, when the data are simulated from the LGB distribution.

In Table 6, we observe that the non-rejection rates are very high, which was expected as the MN distribution has one more parameter than the LGB distribution. This means that a very considerable proportion of samples generated from a LGB population can be appropriately fitted with the MN distribution. However, due to having one more parameter, the hit rates in the different scenarios are small, as the AIC, CAIC, and BIC values depend on the number of parameters of the distribution. Therefore, it can be expected that in a possible real setting where both distributions appropriately fit a certain dataset, the information criteria AIC, CAIC, and BIC will provide favorable indications for the use of the LGB distribution due to the fact of having to estimate a smaller amount of parameters.

6. Data Analysis

In this section, two applications are presented in order to illustrate the usefulness of the LGB distribution and its special cases in data modeling in different real settings. Other symmetric/asymmetric unimodal/bimodal distributions are also considered to illustrate that the LGB distribution or some of its special cases may have a better fit than other distributions in the literature. Specifically, the odd log-logistic skew-normal (OLLSN) [29] and gamma sinh-Cauchy (GSC) [30] distributions are considered. Like the LGB distribution, these distributions have four parameters: two shape parameters (which together control skewness and bimodality), a location parameter, and a scale parameter. A mixture distribution of two normal components (MN) [6] is also included into the analysis as it is a commonly used distribution for analyzing data exhibiting bimodality.

The first dataset corresponds to 188 observations on the inflation rate (in %) registered quarterly between the years 1950 and 1996 in Canada. This dataset can be found with the name Tbrate in the R language [31].
The second dataset refers to 128 observations on the electrical resistance (in ohms) of nectarine fruits. This data can be found with the name fruitohms in the R language [32].

For the datasets described above, we test hypothesis

H_{0}

: the data have exactly one mode versus the alternative hypothesis

H_{1}

: the data have at least two modes. For this, we consider the excess mass test [33] using the modetest function in R language [34]. For the inflation rate data, the observed statistic was 0.05 with a p-value equal to 0.05. For the electrical resistance data, these values were 0.074 and 0.01, respectively. Thus, at a significance level equal to 0.05, in both datasets

H_{0}

is rejected; that is, the distributions of the inflation rate and electrical resistance data are at least bimodal.

We compared the distributions fitted by the ML method using the information criteria AIC, CAIC, and BIC. We also calculate the statistics

W^{*}

and

A^{*}

to test the hypothesis

H_{0} : X_{1}, \dots, X_{n}

is a random sample from a continuous distribution

F (x; θ)

, where

F (\cdot; \cdot)

is known but

θ

is unknown. In these tests,

H_{0}

is rejected at a significance level equal to 0.05 if

W^{*} > 0.126

and

A^{*} > 0.752

.

Table 7 reports the ML estimates with the corresponding standard errors for each distribution fitted to the inflation rate and electrical resistance data. In addition, the values associated with the statistics

A^{*}

and

W^{*}

and with the information criteria are reported.

Table 7. The ML estimates and their standard errors (in parentheses) for each distribution fitted to the inflation rate and electrical resistance data and the values of the statistics

W^{*}

and

A^{*}

and of the information criteria.

In Table 7, with respect to the inflation rate data, based on the values of the statistics

W^{*}

and

A^{*}

, it can be seen that the hypothesis that the data correspond to an observed random sample of the GSC, GB, LBN, or BN distributions is rejected at a significance level equal to 0.05. In addition, it can be seen that the LGB distribution is the one with the lowest AIC, CAIC, and BIC values among the fitted distributions, indicating that this distribution should be selected over the others for the modeling of these data.

With respect to the electrical resistance data, it is observed that the hypothesis that the data correspond to an observed random sample of the OLLSN, GB, BN, or GSC distributions is rejected at a significance level equal to 0.05. Note that the AIC, CAIC, and BIC values for the LGB, MN, and LBN distributions are close, the values associated with the LBN distribution being slightly lower. Thus, the LBN distribution (which has a smaller parametric dimension than the LGB and MN distributions) is capable of fitting the electrical resistance data as well as the LGB and MN distributions.

In the left panels of Figure 7 and Figure 8, the histograms of inflation rate and electrical resistance are displayed along with the fitted densities. In the right panels of the same figures, the empirical cdf and the fitted cdf’s are compared. In these plots, we see that the LGB distribution fits the inflation rate data appropriately, while the LBN distribution fits the electrical resistance data appropriately. Note that the LGB and LBN distributions have one and two fewer parameters, respectively, than the MN distribution, and that despite this fact they are capable of presenting good fits to the analyzed data.

Figure 7. Left panels: Histogram for inflation rate data and the fitted pdf curves via the ML method. Right panels: Empirical cdf for the inflation rate data and the fitted cdf curves.

Figure 8. Left panels: Histogram for electrical resistance data and the fitted pdf curves via the ML method. Right panels: Empirical cdf for the electrical resistance data and the fitted cdf curves.

7. Final Comments

This article introduces a new symmetric/asymmetric unimodal/bimodal distribution called the Lambert generalized bimodal (LGB) distribution. Some special cases of the LGB distribution are discussed. One of the special cases, the Lambert-bimodal normal (LBN) distribution, can be considered as an alternative to other symmetric/asymmetric bimodal distributions, including the LGB distribution. The LGB distribution arises using the Lambert-F transformation when the generalized bimodal distribution is considered as baseline distribution. We study the main structural properties of the LGB distribution, such as the pdf, cdf, qf, and raw moments that are used for a description of the skewness and kurtosis characteristics. Parameter estimation of the LGB distribution is discussed using the ML method. Through simulation experiments, we observe that the ML method provide acceptable estimates of the parameters of the LGB distribution. Furthermore, through simulation experiments, we observed that the LGB distribution can adequately fit datasets generated from the mixture normal distribution, despite having one less parameter. Finally, two applications that illustrate that the LGB distribution and the LBN especial case can present a better fit of data in real settings than other symmetric/asymmetric unimodal/bimodal distributions such as the odd log-logistic skew-normal distribution (OLLSN), gamma sinh-Cauchy (GSC), and mixture normal (MN) distributions.

As a final consideration, we leave the question open, does the LGB distribution have an intuitive stochastic representation? In the literature, it is possible to find distribution families that have an attractive intuitive generation mechanism, such as the skew-elliptical (SE) distributions [35] and the closed-skew-normal (CSN) distribution [36]. According to Loperfido et al. [37], any linear combination of the largest and smaller component of a bivariate, exchange elliptical random vector has a skew-elliptical distribution. According to Loperfido [38], any order statistic from a random vector with exchangeable normal distribution has a closed-skew-normal distribution. As far as we know, no similar property is known for the Lambert-F distributions class or for some of its special cases.

Author Contributions

Conceptualization, Y.A.I. and M.d.C.; Formal analysis, Y.A.I., M.d.C., and H.W.G.; Investigation, Y.A.I., M.d.C., and H.W.G.; Methodology, Y.A.I. and H.W.G.; Software, Y.A.I.; Supervision, M.d.C. and H.W.G.; Validation, H.W.G.; All of the authors contributed significantly to this research article. All authors have read and agreed to the published version of the manuscript.

Funding

The research of Y. A. Iriarte was funded by CONICYT PAI/INDUSTRIA 79090016, Chile. This work was partially done during M. de Castro’s visit to the Universidad de Antofagasta, supported by MINEDUC-UA Project, code ANT1856, Chile. The work of M. de Castro is partially funded by CNPq, Brazil. The research of H.W. Gómez was supported by Grant SEMILLERO UA-2021 (Chile).

Acknowledgments

The authors would like to thank the editor and the anonymous referees for their comments and suggestions, which significantly improved our manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

pdf	Probability density function
cdf	Cumulative distribution function
qf	Quantile function
AIC	Akaike information criterion
CAIC	Corrected Akaike information criterion
BIC	Bayesian informetion criterion
MN	Mixture normal
GB	Generalized bimodal
GSC	Gamma sinh-Cauchy
OLLSN	Odd log-logistic skew-normal
LGB	Lambert generalized bimodal

Appendix A. R Codes to Compute the qf of the LGB Distribution and Generate Pseudo-Random Numbers

+ qgbimodal <- function(p,gamma){

+ n = length(p)

+ f = rep(0,n)

+ for(i in 1:n){

+ f[i] = uniroot(function(x,gamma)pnorm(x)-x/(gamma+1)*dnorm(x)-p[i],

+ c(-1e+4,1e+4),gamma=gamma)$‘root‘

+ }

+ return(f)

+ }

+

+ library(LambertW)

+

+ qLGB <- function(p,mu,sigma,gamma,alpha){

+ if(alpha==1){

+ mu+sigma*qgbimodal(p,gamma)

+ }else{

+ pp = 1/log(alpha)*W(log(alpha)*(p-1)/alpha)+1

+ mu+sigma*qgbimodal(pp,gamma)

+ }

+

+ n <- 100; p <- runif(n); mu <- 5; sigma <- 3; gamma <- 0.5; alpha <- 1.5

+ x <- qLGB(p,mu,sigma,gamma,alpha)

Appendix B. System of Equations to Minimize the Kullback–Leibler Divergence with Respect to θ₂

Let

X_{1} \sim LGB (θ_{1})

and

X_{2} \sim LGB (θ_{2})

,

θ_{j} = {(μ_{j}, σ_{j}, γ_{j}, α_{j})}^{'}

,

j = 1, 2

, where

θ_{1}

is known and

θ_{2}

is unknown. Then, the Kullback–Leibler divergence,

K (X_{1}, X_{2})

, has a minimum value at

θ_{2} = θ_{2_{0}}

, with

θ_{2_{0}} \neq θ_{1}

, where

θ_{2_{0}} = {(μ_{2_{0}}, σ_{2_{0}}, γ_{2_{0}}, α_{2_{0}})}^{'}

satisfies the system of equations

\begin{matrix} S (\frac{1}{u_{1}}) & = & 0, \\ \frac{μ_{2} - μ_{1}}{σ_{2}} S (\frac{1}{u_{1}}) - \frac{σ_{1}}{σ_{2}} S (\frac{u}{u_{1}}) & = & 1, \\ S (\frac{u_{2}}{u_{3}}) + \frac{{(1 + γ_{2})}^{2}}{log (α_{2})} S (\frac{1}{u_{5}}) + S (u_{2}) & = & \frac{1 + γ_{2}}{log (α_{2})}, \\ S (\frac{1}{u_{3}}) - S (\frac{u_{4}}{u_{3}}) + \frac{1}{1 + γ_{2}} S (\frac{u_{2}}{u_{3}}) - \frac{1}{α_{2}} S (u_{4}) + S (u_{2}) & = & 0 . \end{matrix}

where

S (u^{*}) = \int_{- \infty}^{\infty} u^{*} \frac{γ_{1} + u^{2}}{1 + γ_{1}} ϕ (u) α_{1}^{Φ (u) - \frac{u}{1 + γ_{1}} ϕ (u)} {1 - log (α_{1}) [1 - Φ (u) + \frac{u}{1 + γ_{1}} ϕ (u)]} d u,

u_{1} = \frac{u_{3} u_{5}}{1 + γ_{2}} ϕ (h (u)) α_{2}^{u_{4} - \frac{u_{2}}{1 + γ_{2}}}, u_{2} = h (u) ϕ (h (u)), u_{3} = 1 - log (α_{2}) (1 - u_{4} + \frac{u_{2}}{1 + γ_{2}}),

u_{4} = Φ (h (u)), u_{5} = γ_{2} + h^{2} (u) and h (u) = \frac{u σ_{1} + μ_{1} - μ_{2}}{σ_{2}} .

Appendix C. Graphical Comparison of the pdf of X₁ and X₂ in Scenarios A to F

Figure A1. Pdf curves of

X_{1}

(black solid line) and

X_{2}

(red dashed line) in scenarios A to F.

Appendix D. System of Equations to Obtain the ML Estimates Based on a Random Sample of Size n from a LGB(μ, σ, γ, α) Population

Let

X_{1}, \dots, X_{n}

be a random sample of

X \sim LGB (μ, σ, γ, α)

, where

{(μ, σ, γ, α)}^{'}

is unknown. Then, the ML estimate of

{(μ, σ, γ, α)}^{'}

is a root of the system of equations

\begin{matrix} \sum_{i = 1}^{n} z_{i} - 2 \sum_{i = 1}^{n} \frac{z_{i}}{γ + z_{i}^{2}} - \frac{γ log (α)}{1 + γ} \sum_{i = 1}^{n} ϕ (z_{i}) - \frac{log (α)}{1 + γ} \sum_{i = 1}^{n} z_{i}^{2} ϕ (z_{i}) - \frac{log (α)}{1 + γ} \sum_{i = 1}^{n} \frac{(γ + z_{i}^{2}) ϕ (z_{i})}{H (z_{i}; γ, α)} & = & 0, \\ \sum_{i = 1}^{n} z_{i}^{2} - 2 \sum_{i = 1}^{n} \frac{z_{i}^{2}}{γ + z_{i}^{2}} - \frac{γ log (α)}{1 + γ} \sum_{i = 1}^{n} z_{i} ϕ (z_{i}) - \frac{log (α)}{1 + γ} \sum_{i = 1}^{n} z_{i}^{3} ϕ (z_{i}) - \frac{log (α)}{1 + γ} \sum_{i = 1}^{n} \frac{z_{i} (γ + z_{i}^{2}) ϕ (z_{i})}{H (z_{i}; γ, α)} & = & n, \\ (1 + γ) \sum_{i = 1}^{n} \frac{1}{γ + z_{i}^{2}} + \frac{log (α)}{1 + γ} \sum_{i = 1}^{n} z_{i} ϕ (z_{i}) + \frac{log (α)}{1 + γ} \sum_{i = 1}^{n} \frac{z_{i} ϕ (z_{i})}{H (z_{i}; γ, α)} & = & n, \\ - \frac{1}{1 + γ} \sum_{i = 1}^{n} z_{i} ϕ (z_{i}) + \sum_{i = 1}^{n} Φ (z_{i}) - \sum_{i = 1}^{n} \frac{1 - Φ (z_{i}) + [z / (1 + γ)] ϕ (z_{i})}{H (z_{i}; γ, α)} & = & 0 . \end{matrix}

Appendix E. Second Partial Derivatives of the Log-Likelihood Function for a Single Observation of the LGB Distribution

If

X \sim LGB (μ, σ, γ, α)

and

z = (x - μ) / σ

, the second partial derivatives of the log-likelihood function with respect to all the parameters are given by

\begin{matrix} \frac{\partial^{2} ℓ (θ; x)}{\partial μ^{2}} & = & - \frac{4 z^{2}}{{(γ + z^{2})}^{2} σ^{2}} - \frac{1}{σ^{2}} + \frac{2}{(γ + z^{2}) σ^{2}} - \frac{log (α) z^{3} ϕ (z)}{(1 + γ) σ^{2}} - \frac{log (α) z ϕ (z)}{σ^{2}} + \frac{3 log (α) z ϕ (z)}{(1 + γ) σ^{2}} \\ - \frac{{log}^{2} (α) {(z^{2} + γ)}^{2} ϕ^{2} (z)}{{(1 + γ)}^{2} σ^{2} H^{2} (z; γ, α)} - \frac{log (α) z (z^{2} + γ - 2) ϕ (z)}{(1 + γ) σ^{2} H (z; γ, α)}, \\ \frac{\partial^{2} ℓ (θ; x)}{\partial σ \partial μ} & = & - \frac{4 z^{3}}{{(γ + z^{2})}^{2} σ^{2}} - \frac{2 z}{σ^{2}} + \frac{4 z}{(γ + z^{2}) σ^{2}} - \frac{log (α) z^{4} ϕ (z)}{(1 + γ) σ^{2}} - \frac{log (α) z^{2} ϕ (z)}{σ^{2}} + \frac{4 log (α) z^{2} ϕ (z)}{(1 + γ) σ^{2}} \\ - \frac{log (α) ϕ (z)}{(1 + γ) σ^{2}} - \frac{{log}^{2} (α) z {(z^{2} + γ)}^{2} ϕ^{2} (z)}{{(1 + γ)}^{2} σ^{2} H^{2} (z; γ, α)} - \frac{log (α) [z^{4} + (γ - 3) z^{2} - γ] ϕ (z)}{(1 + γ) σ^{2} H (z; γ, α)} + \frac{log (α) ϕ (z)}{σ^{2}}, \\ \frac{\partial^{2} ℓ (θ; x)}{\partial γ \partial μ} & = & \frac{2 z}{{(γ + z^{2})}^{2} σ} + \frac{log (α) z^{2} ϕ (z)}{{(1 + γ)}^{2} σ} - \frac{log (α) ϕ (z)}{{(1 + γ)}^{2} σ} + \frac{{log}^{2} (α) z (z^{2} + γ) ϕ^{2} (z)}{{(1 + γ)}^{3} σ H (z; γ, α)} + \frac{log (α) (z^{2} + 1) ϕ (z)}{σ {(1 + γ)}^{2} H (z; γ, α)}, \\ \frac{\partial^{2} ℓ (θ; x)}{\partial σ^{2}} & = & - \frac{4 z^{4}}{{(γ + z^{2})}^{2} σ^{2}} - \frac{3 z^{2}}{σ^{2}} + \frac{6 z^{2}}{(γ + z^{2}) σ^{2}} + \frac{1}{σ^{2}} - \frac{log (α) z^{5} ϕ (z)}{(1 + γ) σ^{2}} - \frac{log (α) z^{3} ϕ (z)}{σ^{2}} \\ + \frac{5 log (α) z^{3} ϕ (z)}{(1 + γ) σ^{2}} + \frac{2 log (α) z ϕ (z)}{σ^{2}} - \frac{2 log (α) z ϕ (z)}{(1 + γ) σ^{2}} - \frac{{log}^{2} (α) z^{2} {(z^{2} + γ)}^{2} ϕ (z)}{{(1 + γ)}^{2} σ^{2} H^{2} (z; γ, α)} \\ - \frac{log (α) [z^{4} + (γ - 4) z^{2} - 2 γ] z ϕ (z)}{(1 + γ) σ^{2} H (z; γ, α)}, \\ \frac{\partial^{2} ℓ (θ; x)}{\partial γ \partial σ} & = & \frac{2 z^{2}}{{(γ + z^{2})}^{2} σ} + \frac{log (α) z^{3} ϕ (z)}{{(1 + γ)}^{2} σ} - \frac{log (α) z ϕ (z)}{{(1 + γ)}^{2} σ} + \frac{{log}^{2} (α) z^{2} (z^{2} + γ) ϕ^{2} (z)}{{(1 + γ)}^{3} σ H^{2} (z; γ, α)} \\ + \frac{log (α) z (z - 1) ϕ (z)}{{(1 + γ)}^{2} σ H (z; γ, α)}, \\ \frac{\partial^{2} ℓ (θ; x)}{\partial α \partial σ} & = & \frac{z^{3} ϕ (z)}{α (1 + γ) σ} - \frac{z ϕ (z)}{α σ} + \frac{z ϕ (z)}{α (1 + γ) σ} - \frac{z (z^{2} + γ) ϕ (z)}{α (1 + γ) σ H (z; γ, α)} \\ - \frac{log (α) z (z + γ) ϕ (z) [1 + z ϕ (z) / (1 + γ) - Φ (z)]}{α (1 + γ) σ H^{2} (z; γ, α)}, \\ \frac{\partial^{2} ℓ (θ; x)}{\partial γ^{2}} & = & \frac{1}{{(1 + γ)}^{2}} - \frac{1}{{(γ + z^{2})}^{2}} - \frac{2 log (α) z ϕ (z)}{{(1 + γ)}^{3}} - \frac{{log}^{2} (α) z^{2} ϕ^{2} (z)}{{(1 + γ)}^{4} H^{2} (z; γ, α)} - \frac{2 log (α) z ϕ (z)}{{(1 + γ)}^{2} H (z; γ, α)}, \\ \frac{\partial^{2} ℓ (θ; x)}{\partial α \partial γ} & = & \frac{z ϕ (z)}{α {(1 + γ)}^{2}} + \frac{log (α) z ϕ (z) [1 + z ϕ (z) / (1 + γ) - Φ (z)]}{α {(1 + γ)}^{2} H^{2} (z; γ, α)} + \frac{z ϕ (z)}{α {(1 + γ)}^{2} H (z; γ, α)}, \\ \frac{\partial^{2} ℓ (θ; x)}{\partial α^{2}} & = & \frac{z ϕ (z)}{α^{2} (1 + γ)} - \frac{Φ (z)}{α^{2}} - \frac{{[1 + z ϕ (z) / (1 + γ) - Φ (z)]}^{2}}{α^{2} H^{2} (z; γ, α)} + \frac{1 + z ϕ (z) / (1 + γ) - Φ (z)}{α^{2} H (z; γ, α)} . \end{matrix}

References

Weber, N.A. Dimorphism in the African Oecophylla worker Nd an anomaly (Hym.: Formicidae). Ann. Entomol. Soc. Am. 1946, 39, 7–10. [Google Scholar] [CrossRef]
Azzalini, A.; Bowman, A.W. A look at some data on the Old Faithful Geyser. Appl. Statist. 1990, 39, 357–365. [Google Scholar] [CrossRef]
Ely, J.T.A.; Fudenberg, H.H.; Muirhead, R.J.; LaMarche, M.G.; Krone, C.A.; Buscher, D.; Stern, E.A. Urine mercury in micromercurialism: Bimodal distribution and diagnostic implications. Bull. Environ. Contam. Toxicol. 1999, 63, 553–559. [Google Scholar] [CrossRef]
Dierickx, D.; Basu, B.; Vleugels, J.; Van der Biest, O. Statistical extreme value modeling of particle size distributions: Experimental grain size distribution type estimation and parameterization of sintered zirconia. Mater. Character 2000, 45, 61–70. [Google Scholar] [CrossRef]
Zhang, C.; Mapes, B.E.; Soden, B.J. Bimodality in tropical water vapor. Q. J. R. Meteorol. Soc. 2004, 129, 2847–2866. [Google Scholar] [CrossRef]
McLachlan, G.J.; Lee, S.X.; Rathnayake, S.I. Finite mixture models. Annu. Rev. Stat. Appl. 2019, 6, 355–378. [Google Scholar] [CrossRef]
Aitkin, M.; Rubin, D.B. Estimation and hypothesis testing in finite mixture models. J. Roy. Statist. Soc. Ser. B 1985, 47, 67–75. [Google Scholar] [CrossRef]
Rao, K.S. On a bivariate bimodal distribution. In Proceedings of the ISPS Annual Conference, Delhi, India, 1987. [Google Scholar]
Sarma, P.V.S.; Rao, S.K.S.; Rao, R.P. On a family of bimodal distributions. Sankhya Ser. B 1990, 52, 287–292. [Google Scholar]
Hassan, M.Y.; Hijazi, R.H. A bimodal exponential power distribution. Pak. J. Statist. 2010, 2, 379–396. [Google Scholar]
Elal-Olivero, D. Alpha-skew-normal distribution. Proyecciones 2010, 29, 224–240. [Google Scholar] [CrossRef]
Hassan, M.Y.; El-Bassiouni, M.Y. Bimodal Skew-Symmetric Normal Distribution. Commun. Stat.—Theory Methods 2016, 45, 1527–1541. [Google Scholar] [CrossRef]
Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
Eugene, N.; Lee, C.; Famoye, F. Beta-normal distribution and its applications. Commun. Stat. Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
Cordeiro, G.M.; de Castro, M. A new family of generalized distributions. J. Stat. Comput. Simul. 2011, 81, 883–898. [Google Scholar] [CrossRef]
Ferreira, J.T.S.; Steel, M.F.J. A constructive representation of univariate skewed distributions. J. Am. Stat. Assoc. 2006, 101, 823–829. [Google Scholar] [CrossRef]
Goerg, G.M. Lambert W random variables-a new family of generalized skewed distributions with applications to risk estimation. Ann. Appl. Stat. 2011, 5, 2197–2230. [Google Scholar] [CrossRef]
Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
Iriarte, Y.A.; de Castro, M.; Gómez, H.W. Lambert-F distributions class: An alternative family for positive data analysis. Mathematics 2020, 8, 1398. [Google Scholar] [CrossRef]
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2019. [Google Scholar]
Soetaert, K. rootSolve: Nonlinear Root Finding, Equilibrium and Steady-State Analysis of Ordinary Differential Equations. R-Package Version 1.6. 2009. Available online: https://CRAN.R-project.org/package=rootSolve (accessed on 3 February 2021).
Hutson, A.D.; Vexler, A. A cautionary note on beta families of distributions and the aliases within. Am. Stat. 2017, 72, 121–129. [Google Scholar] [CrossRef]
Byrd, R.H.; Lu, P.; Nocedal, J.; Zhu, C. A limited memory algorithm for bound constrained optimization. SIAM J. Sci. Comput. 1995, 16, 1190–1208. [Google Scholar] [CrossRef]
Goerg, G.M. LambertW: Probabilistic Models to Analyze and Gaussianize Heavy-Tailed, Skewed Data. R Package Version 0.6.4. 2016. Available online: https://CRAN.R-project.org/package=LambertW (accessed on 3 February 2021).
Akaike, H. A new look at the statistical model identification. IEEE Trans. Automat. Contr. 1974, 19, 716–723. [Google Scholar] [CrossRef]
Bozdogan, H. Model selection and Akaike’s information criterion (AIC): The general theory and its analytical extension. Psychometrika 1987, 52, 345–370. [Google Scholar] [CrossRef]
Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
Chen, G.; Balakrishnan, N. A general purpose approximate goodness-of-fit test. J. Qual. Technol. 1995, 27, 154–161. [Google Scholar] [CrossRef]
da Silva Braga, A.; Cordeiro, G.M.; Ortega, E.M.M. A new skew-bimodal distribution with applications. Commun. Stat. Theory Methods. 2018, 47, 2950–2968. [Google Scholar] [CrossRef]
Gómez, Y.M.; Gómez-Déniz, E.; Venegas, O.; Gallardo, D.I.; Gómez, H.W. A new skew-bimodal distribution with applications. Symmetry 2019, 11, 899. [Google Scholar] [CrossRef]
Croissant, Y.; Graves, S. Ecdat: Data Sets for Econometrics. R Package Version 0.3-4. 2019. Available online: https://CRAN.R-project.org/package=Ecdat (accessed on 3 February 2021).
Maindonald, J.H.; John Braun, J.W. DAAG: Data Analysis and Graphics Data and Functions. R Package Version 1.22.1. 2019. Available online: https://CRAN.R-project.org/package=DAAG (accessed on 3 February 2021).
Ameijeiras-Alonso, J.; Crujeiras, R.M.; Rodríguez-Casal, A. Mode testing, critical bandwidth and excess mass. Test 2019, 28, 900–919. [Google Scholar] [CrossRef]
Ameijeiras-Alonso, J.; Crujeiras, R.M.; Rodríguez-Casal, A. Multimode: An R Package for Mode Assessment. arXiv 2018, arXiv:1803.00472. [Google Scholar]
Branco, M.D.; Dey, D.K. A general class of multivariate skew-elliptical distributions. J. Multivar. Anal. 2001, 79, 99–113. [Google Scholar] [CrossRef]
González-Farías, G.; Domínguez-Molina, A.; Gupta, A.K. Additive properties of skew-normal random vectors. J. Stat. Plan. Inference 2004, 126, 521–534. [Google Scholar] [CrossRef]
Loperfido, N.; Navarro, J.; Ruiz, J.M.; Sandoval, C.J. Some relationships between skew-normal distributions and order statistics from exchangeable normal random vectors. Commun. Stat. Theory Methods 2007, 36, 1719–1733. [Google Scholar] [CrossRef]
Loperfido, N. A note on skew-elliptical distributions and linear functions of order statistics. Stat. Probab. Lett. 2008, 78, 3184–3186. [Google Scholar] [CrossRef][Green Version]

Figure 1. Pdf curves for the LGB distribution for

μ = 5

,

σ = 2

, and

α = 1

in the top left panel;

μ = - 5

,

σ = 4

, and

α = 0.5

in the top right panel;

μ = 5

,

σ = 4

, and

α = 2

in the bottom left panel; and

μ = 10

,

σ = 5

, and

γ = 1.5

in the bottom right panel.

Figure 2. Critical and inflection points and pdf curve for the distributions Lambert generalized bimodal (LGB) (5,2,1.5,0.005) (black curves), LGB (5,2,1.5,0.5) (red curves), LGB (5,2,0.5,0.5) (green curves), and LGB (5,2,0.5,1) (blue curves).

Figure 3. Regions of unimodality (white region) and bimodality (gray region) for a LGB distribution.

Figure 4. Plots of the skewness and kurtosis coefficients of the LGB distribution.

Figure 5. Top and center panels: Skewness curves for two LGB distributions with

α = 1.6

in blue color and

α = 0.001

in red color. Right panel: Pdf curves for an LGB (0,1,1000,1.6) distribution (in black color) and four LGB distributions specified by

μ = 2.360

,

σ = 1.487

,

γ = 0.001

, and different values of

γ

.

Figure 6. Kullback–Leibler divergence curve for

X_{1} \sim LGB (0, 1, 1000, 1.6)

and

X_{2} \sim LGB (2.360, 1.487, 1000, 0.001)

as a function of

μ_{2}

in the left panel and as a function of

σ_{2}

in the right panel.

Figure 7. Left panels: Histogram for inflation rate data and the fitted pdf curves via the ML method. Right panels: Empirical cdf for the inflation rate data and the fitted cdf curves.

Figure 8. Left panels: Histogram for electrical resistance data and the fitted pdf curves via the ML method. Right panels: Empirical cdf for the electrical resistance data and the fitted cdf curves.

Table 1. Some values for the first four raw moments of the LGB distribution considering different values of γ and α.

Parameters		Moments
$γ$	$α$	$E (Z)$	$E (Z^{2})$	$E (Z^{3})$	$E (Z^{4})$
0.2	0.5	−0.5504	2.7657	−2.6574	13.7983
0.4	1.0	0.0000	2.4285	0.0000	11.5714
0.6	2.0	0.7173	2.4001	3.1908	11.5966
0.8	0.5	−0.4906	2.2032	−2.1096	10.3165
1.0	1.0	0.0000	2.0000	0.0000	9.0000
1.2	2.0	0.6592	2.0483	2.6865	9.3859
1.4	0.5	−0.455	1.9191	−1.8137	8.5612
1.6	1.0	0.0000	1.7692	0.0000	7.6153
1.8	2.0	0.6229	1.8451	2.3840	8.1118

Table 2. Mean, variance, skewness, and amounts of critical points and inflection points of the pdf for the distributions of

X_{1}

and

X_{2}

in scenarios A to F.

Table 2. Mean, variance, skewness, and amounts of critical points and inflection points of the pdf for the distributions of

X_{1}

and

X_{2}

in scenarios A to F.

Variable	Scenario	Mean	Variance	Skewness	Number of Critical Points	Number of Inflexion Points
$X_{1}$	A	0.271	1.680	0.271	1	4
$X_{2}$	A	0.361	1.834	0.361	1	2
$X_{1}$	B	−0.988	1.749	0.685	2	4
$X_{2}$	B	−0.954	2.097	0.430	1	2
$X_{1}$	C	0.365	1.910	−0.235	2	4
$X_{2}$	C	0.415	2.107	0.230	1	2
$X_{1}$	D	0.639	1.526	−0.355	1	4
$X_{2}$	D	0.648	1.626	0.262	1	2
$X_{1}$	E	0.481	0.904	−0.172	1	2
$X_{2}$	E	0.486	0.912	−0.043	1	2
$X_{1}$	F	0.475	0.880	−0.170	1	2
$X_{2}$	F	0.476	0.895	−0.052	1	2

Table 3. Averages (AE), standard deviations (SD), and root of the simulated mean square errors (RMSE) for the estimates of

μ

,

σ

,

γ

, and

α

of the LGB distribution.

Table 3. Averages (AE), standard deviations (SD), and root of the simulated mean square errors (RMSE) for the estimates of

μ

,

σ

,

γ

, and

α

of the LGB distribution.

n	$\hat{μ}$			$\hat{σ}$			$\hat{γ}$			$\hat{α}$
n	AE	SD	RMSE	AE	SD	RMSE	AE	SD	RMSE	AE	SD	RMSE
Scenario A
100	5.005	0.329	0.329	1.992	0.126	0.126	0.497	0.260	0.260	0.517	0.168	0.169
200	5.004	0.234	0.235	1.993	0.092	0.091	0.498	0.177	0.177	0.508	0.120	0.119
300	5.003	0.192	0.192	1.995	0.076	0.076	0.498	0.139	0.139	0.504	0.097	0.097
500	5.001	0.148	0.148	1.997	0.058	0.058	0.499	0.106	0.106	0.503	0.077	0.077
1000	5.001	0.107	0.107	1.999	0.041	0.041	0.500	0.073	0.073	0.500	0.052	0.052
Scenario B
100	−4.943	0.774	0.776	3.977	0.284	0.285	0.745	0.413	0.412	1.478	0.315	0.316
200	−4.984	0.556	0.556	3.989	0.217	0.217	0.748	0.276	0.276	1.505	0.233	0.233
300	−4.996	0.464	0.464	3.996	0.167	0.167	0.749	0.227	0.227	1.504	0.193	0.193
500	−4.999	0.342	0.342	3.997	0.126	0.126	0.750	0.160	0.160	1.502	0.143	0.143
1000	−5.000	0.236	0.236	3.999	0.094	0.094	0.750	0.115	0.115	1.500	0.101	0.101

Table 4. Averages of asymptotic standard errors (SE) and coverage probabilities (CP) for the estimates of

μ

,

σ

,

γ

, and

α

for the LGB distribution.

Table 4. Averages of asymptotic standard errors (SE) and coverage probabilities (CP) for the estimates of

μ

,

σ

,

γ

, and

α

for the LGB distribution.

n	$\hat{μ}$		$\hat{σ}$		$\hat{γ}$		$\hat{α}$
n	SE	CP	SE	CP	SE	CP	SE	CP
Scenario A
100	0.329	0.947	0.129	0.942	0.247	0.903	0.169	0.939
200	0.234	0.948	0.092	0.943	0.170	0.924	0.118	0.943
300	0.188	0.948	0.074	0.944	0.135	0.929	0.096	0.947
500	0.145	0.955	0.057	0.947	0.104	0.939	0.075	0.949
1000	0.103	0.956	0.041	0.954	0.073	0.950	0.053	0.953
Scenario B
100	0.798	0.931	0.293	0.936	0.388	0.883	0.336	0.939
200	0.559	0.938	0.205	0.937	0.258	0.921	0.237	0.944
300	0.455	0.945	0.168	0.954	0.210	0.929	0.193	0.945
500	0.347	0.955	0.128	0.955	0.158	0.945	0.148	0.953
1000	0.245	0.956	0.091	0.955	0.111	0.947	0.105	0.957

Table 5. Non-rejection rate based on modified statistics

W^{*}

and

A^{*}

and hit rates based on the AIC, CAIC, and BIC for the LGB distribution, when the data are simulated from the MN distribution.

Table 5. Non-rejection rate based on modified statistics

W^{*}

and

A^{*}

and hit rates based on the AIC, CAIC, and BIC for the LGB distribution, when the data are simulated from the MN distribution.

	Non-Rejection		Hit Rate				Non-Rejection		Hit Rate
	Rate						Rate
n	$W^{*}$	$A^{*}$	AIC	CAIC	BIC	n	$W^{*}$	$A^{*}$	AIC	CAIC	BIC
Scenario A						Scenario C
50	0.993	0.994	0.730	0.798	0.903	50	0.998	0.996	0.721	0.763	0.871
100	0.996	0.994	0.686	0.736	0.936	100	0.997	0.997	0.704	0.737	0.906
200	0.992	0.988	0.584	0.608	0.941	200	0.966	0.988	0.606	0.622	0.895
300	0.993	0.990	0.531	0.549	0.936	300	0.989	0.978	0.511	0.519	0.840
Scenario B						Scenario D
50	0.995	0.993	0.732	0.793	0.874	50	0.980	0.978	0.507	0.566	0.728
100	0.998	0.996	0.692	0.719	0.911	100	0.951	0.951	0.427	0.451	0.696
200	0.992	0.990	0.583	0.608	0.902	200	0.892	0.886	0.297	0.308	0.600
300	0.991	0.984	0.505	0.525	0.875	300	0.781	0.758	0.198	0.205	0.464

Table 6. Non-rejection rate based on modified statistics

W^{*}

and

A^{*}

and hit rates based on the AIC, CAIC, and BIC for the MN distribution, when the data are simulated from the LGB distribution.

Table 6. Non-rejection rate based on modified statistics

W^{*}

and

A^{*}

and hit rates based on the AIC, CAIC, and BIC for the MN distribution, when the data are simulated from the LGB distribution.

	Non-Rejection		Hit Rate				Non-Rejection		Hit Rate
	Rate						Rate
n	$W^{*}$	$A^{*}$	AIC	CAIC	BIC	n	$W^{*}$	$A^{*}$	AIC	CAIC	BIC
Scenario A						Scenario C
50	0.995	0.996	0.302	0.229	0.122	50	0.999	0.999	0.294	0.225	0.130
100	0.995	0.996	0.244	0.214	0.065	100	0.999	0.999	0.227	0.199	0.074
200	0.995	0.994	0.229	0.221	0.044	200	0.999	0.999	0.211	0.198	0.031
300	0.996	0.995	0.207	0.203	0.031	300	0.999	0.999	0.206	0.197	0.030
Scenario B						Scenario D
50	0.999	0.999	0.241	0.189	0.098	50	0.098	0.999	0.298	0.255	0.141
100	0.998	0.988	0.250	0.232	0.044	100	0.998	0.988	0.266	0.233	0.076
200	0.998	0.993	0.222	0.208	0.042	200	0.999	0.999	0.210	0.193	0.034
300	9.997	0.993	0.172	0.171	0.026	300	0.999	0.999	0.200	0.193	0.028

Table 7. The ML estimates and their standard errors (in parentheses) for each distribution fitted to the inflation rate and electrical resistance data and the values of the statistics

W^{*}

and

A^{*}

and of the information criteria.

Table 7. The ML estimates and their standard errors (in parentheses) for each distribution fitted to the inflation rate and electrical resistance data and the values of the statistics

W^{*}

and

A^{*}

and of the information criteria.

Distribution	$\hat{μ}$	${\hat{μ}}_{2}$	$\hat{σ}$	${\hat{σ}}_{2}$	$\hat{γ}$	$\hat{α}$	$W^{*}$	$A^{*}$	AIC	CAIC	BIC
Inflation rate data
LGB	6.682	-	2.545	-	0.434	0.183	0.045	0.248	962.5	962.7	975.4
	(0.307)		(0.131)		(0.175)	(0.060)
MN	2.600	9.086	2.001	2.026	-	0.777	0.040	0.246	965.5	965.9	981.7
	(0.220)	(0.585)	(0.161)	(0.410)		(0.045)
OLLSN	0.834	-	4.129	-	3.003	0.772	0.085	0.558	973.7	973.9	986.6
	(1.048)		(1.401)		(0.880)	(0.294)
GSC	6.441	-	1.176	-	0.177	0.572	0.158	1.083	982.1	982.2	995.0
	(0.274)		(0.104)		(0.044)	(0.065)
GB	4.454	-	2.631	-	2.000	-	0.610	3.318	996.7	996.8	1006.4
	(0.379)		(0.230)		(1.321)
LBN	6.381	-	2.248	-	-	0.247	0.254	1.356	987.3	987.4	997.0
	(0.073)		(0.065)			(0.055)
BN	6.227	-	2.314	-	-	-	0.244	1.417	1046.3	1046.3	1052.8
	(0.083)		(0.073)
Electrical resistance data
LGB	5215.168	-	1163.235	-	0.075	0.379	0.089	0.523	2239.1	2239.4	2250.5
	(96.230)	-	(51.339)	-	(0.070)	(0.102)
MN	3208.166	725.681	6666.668	1158.101	-	0.672	0.048	0.355	2241.6	2242.1	2255.8
	(87.272)	(62.698)	(250.758)	(218.172)		(0.047)
OLLSN	5039.000	-	642.500	-	−0.296	0.216	0.627	3.113	2277.6	2277.6	2289.0
	(12.787)		(0.991)		(0.006)	(0.015)
GSC	5024.639	-	5218.963	-	0.083	0.788	0.211	1.168	2244.2	2244.5	2255.6
	(102.417)		(48.691)		(0.028)	(0.078)
GB	4980.008	-	1157.944	-	0.085	-	0.297	1.597	2256.2	2256.4	2264.7
	(89.904)		(47.155)		(0.061)
LBN	5182.212	-	1132.943	-	-	0.400	0.085	0.525	2239.0	2239.1	2247.5
	(60.406)		(41.002)			(0.097)
BN	5085.210	-	1148.593	-	-	-	0.209	1.191	2257.0	2257.1	2262.7
	(54.338)		(42.991)

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

A Unimodal/Bimodal Skew/Symmetric Distribution Generated from Lambert’s Transformation

Abstract

1. Introduction

2. The LGB Distribution

2.1. LGB Random Variable

2.2. Related Distributions

3. Shapes and Aliases

3.1. Shapes

3.2. Skewness and Kurtosis

3.3. Alias Distributions

4. Maximum Likelihood Estimator

5. Simulation Studies

5.1. First Simulation Study

5.2. Second Simulation Study

6. Data Analysis

7. Final Comments

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. R Codes to Compute the qf of the LGB Distribution and Generate Pseudo-Random Numbers

Appendix B. System of Equations to Minimize the Kullback–Leibler Divergence with Respect to θ₂

Appendix C. Graphical Comparison of the pdf of X₁ and X₂ in Scenarios A to F

Appendix D. System of Equations to Obtain the ML Estimates Based on a Random Sample of Size n from a LGB(μ, σ, γ, α) Population

Appendix E. Second Partial Derivatives of the Log-Likelihood Function for a Single Observation of the LGB Distribution

References

Article Metrics

Citations

Article Access Statistics

A Unimodal/Bimodal Skew/Symmetric Distribution Generated from Lambert’s Transformation

Abstract

1. Introduction

2. The LGB Distribution

2.1. LGB Random Variable

2.2. Related Distributions

3. Shapes and Aliases

3.1. Shapes

3.2. Skewness and Kurtosis

3.3. Alias Distributions

4. Maximum Likelihood Estimator

5. Simulation Studies

5.1. First Simulation Study

5.2. Second Simulation Study

6. Data Analysis

7. Final Comments

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. R Codes to Compute the qf of the LGB Distribution and Generate Pseudo-Random Numbers

Appendix B. System of Equations to Minimize the Kullback–Leibler Divergence with Respect to θ2

Appendix C. Graphical Comparison of the pdf of X1 and X2 in Scenarios A to F

Appendix D. System of Equations to Obtain the ML Estimates Based on a Random Sample of Size n from a LGB(μ, σ, γ, α) Population

Appendix E. Second Partial Derivatives of the Log-Likelihood Function for a Single Observation of the LGB Distribution

References

Article Metrics

Citations

Article Access Statistics

Appendix B. System of Equations to Minimize the Kullback–Leibler Divergence with Respect to θ₂

Appendix C. Graphical Comparison of the pdf of X₁ and X₂ in Scenarios A to F