Next Article in Journal
GMBO: Group Mean-Based Optimizer for Solving Various Optimization Problems
Next Article in Special Issue
An α-Monotone Generalized Log-Moyal Distribution with Applications to Environmental Data
Previous Article in Journal
Probabilistic Non-Negative Matrix Factorization with Binary Components
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Flexible Log-Linear Birnbaum–Saunders Model

by
Guillermo Martínez-Flórez
1,2,
Inmaculada Barranco-Chamorro
3,* and
Héctor W. Gómez
4
1
Departamento de Matemáticas y Estadística, Facultad de Ciencias Básicas, Universidad de Córdoba, Córdoba 230027, Colombia
2
Programa de Pós-Graduação em Modelagem e Métodos Quantitativos, Universidade Federal do Ceará, Fortaleza 60020-181, Brazil
3
Departamento de Estadística e I.O., Facultad de Matemáticas, Universidad de Sevilla, 41012 Sevilla, Spain
4
Departamento de Matemáticas, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile
*
Author to whom correspondence should be addressed.
Mathematics 2021, 9(11), 1188; https://doi.org/10.3390/math9111188
Submission received: 19 April 2021 / Revised: 14 May 2021 / Accepted: 17 May 2021 / Published: 24 May 2021
(This article belongs to the Special Issue Mathematical and Computational Statistics and Their Applications)

Abstract

:
Rieck and Nedelman (1991) introduced the sinh-normal distribution. This model was built as a transformation of a N(0,1) distribution. In this paper, a generalization based on a flexible skew normal distribution is introduced. In this way, a more general model is obtained that can describe a range of asymmetric, unimodal and bimodal situations. The paper is divided into two parts. First, the properties of this new model, called flexible sinh-normal distribution, are obtained. In the second part, the flexible sinh-normal distribution is related to flexible Birnbaum–Saunders, introduced by Martínez-Flórez et al. (2019), to propose a log-linear model for lifetime data. Applications to real datasets are included to illustrate our findings.

1. Introduction

In this paper, a generalization of the sinh-normal distribution introduced by Rieck and Nedelman [1] is proposed. Recall that the sinh-normal model is defined as a transformation, h ( · ) , of a standard normal distribution
Y = h ( Z ) where Z N ( 0 , 1 ) .
Rieck and Nedelman [1] showed that the probability density function (pdf) of Y is symmetric and can be unimodal or bimodal. The most relevant application of (1) is that the sinh-normal distribution can be used to propose a log-linear model for lifetime data distributed as a Birnbaum–Saunders (see [1] for details). This point is of crucial importance since the Birnbaum–Saunders distribution is used in a variety of situations, such as to model lifetime, economics and environmental data. In these applications, departures of the BS model are often found. In recent years, some improvements have been introduced to deal with this problem. In this sense, we highlight the flexible Birnbaum–Saunders model, proposed by Matínez-Flórez et al. [2], where two parameters, δ R , λ > 0 , are added to the usual Birnbaum–Saunders model, in such a way that λ controls asymmetry (skewness) and δ is a shape parameter related to unimodality/bimodality of our proposal.
Throughout this paper, the next points are addressed.
First, details about the precedents, in which our proposal is based on, are given in Section 2.
Second, the flexible sinh-normal distribution is defined as result of applying (1) to Z distributed as a flexible skew-normal distribution, Z F S N ( δ , λ ) with δ R and λ > 0 . Recall that the F S N ( δ , λ ) model was introduced by Gómez et al. [3], as a generalization of Azzalini skew-normal model [4], and it can deal with unimodal and bimodal situations. The properties of the flexible sinh-normal distribution are given in Section 3. These are explicit expressions for the pdf and cumulative distribution function (cdf), shape and modes of the distribution, location and scale changes, approximations, moments and quantiles, among others. All of them are based on the fact that the flexible sinh-normal is obtained as a given function of Z F S N ( δ , λ ) . The obtained properties are compared to those in the Rieck and Nedelman model. In addition, we discuss in which sense our proposal is more general than others previously introduced in literature. The relationship between the flexible sinh-normal distribution and the flexible Birnbaum–Saunders via the exponential (or neperian logarithm), which allows us to build a regression model for lifetime data, is also proven there. Section 4 is devoted to maximum likelihood estimation for the parameters in the flexible sinh-normal distribution.
Third, the flexible Log-linear Birnbaum–Saunders regression model is studied in Section 5. This can be used for non-negative lifetime variables following the flexible Birnbaum–Saunders model given in [2]. The model is built, particular cases of interest are cited, scores and maximum likelihood equations are given.
Applications to real datasets are included in Section 6. In Section 6.1, a positively skewed environmental dataset is considered. There, it is proven that the flexible sinh-normal model provides a better fit than other precedent skew models. In Section 6.2, a symmetric bimodal dataset is considered. Only a few models in the literature exhibit both properties. They are considered along with a mixture of normal distributions. It is shown that the flexible sinh-normal submodel with skewness parameter equal to zero provides a better fit than the other ones. Section 6.3 deals with failure time data in a fatigue test, where the values of stress are considered as known covariate. It is shown that the flexible log-linear Birnbaum–Saunders provides a superior fit to the log-Birnbaum–Saunders [1] and the log-skew-Birnbaum–Saunders model. A simulation study is presented in Section 7. A final discussion is given as Section 8.

2. Materials and Methods

For completeness, details about the most relevant precedents in which our proposal is based are introduced next.
  • Flexible skew-normal distribution.
The flexible skew-normal (FSN) model was obtained and studied in detail by [5]. They proved that the FSN model can be bimodal for certain values of δ . A random variable (rv) Z follows a FSN distribution, Z F S N ( δ , λ ) , if its pdf is given by
f ( z ; δ , λ ) = c δ ϕ ( z + δ ) Φ ( λ z ) , z R , λ , δ R ,
where ϕ and Φ are the pdf and cdf of the N ( 0 , 1 ) distribution, respectively, and c δ 1 = 1 Φ ( δ ) .
Particular cases of interest in the flexible skew-normal are the following:
-
Taking δ = 0 in (2), the skew-normal distribution proposed by Azzalini [4] is obtained.
-
If λ = 0 , then (2) reduces to the flexible normal distribution introduced by Martínez-Flórez et al. [6].
-
If δ = λ = 0 , then Z N ( 0 , 1 ) .
  • Sinh-normal or log-Birnbaum–Saunders distribution.
Rieck and Nedelman [1] developed the sinh-normal (SHN) distribution, which is given as the following transformation of a standard normal distribution
Y = a r c s i n h γ 2 Z η + ξ where Z N ( 0 , 1 ) ,
γ > 0 is a shape parameter, ξ R is a location parameter and η > 0 is a scale parameter. (3) is denoted as Y S H N ( γ , ξ , η ) .
Proposition 1.
(Properties of sinh-normal distribution). Let Y S H N ( γ , ξ , η ) . Then,
1. 
Y is symmetric about the location parameter ξ.
2. 
The pdf of Y is unimodal for γ 2 and bimodal for γ > 2 .
3. 
The mean and variance of Y are E [ Y ] = ξ and V a r [ Y ] = η 2 v ( γ ) , where v ( γ ) is the variance when the scale parameter is equal to one ( η = 1 ). There is no closed-expression for v ( γ ) , but Rieck and Nedelman [1] provided asymptotic approximations for small and large values of γ.
4. 
Let Y γ S H N ( γ , ξ , η ) . Then, U γ = 2 ( Y γ ξ ) γ η converges in law to a N ( 0 , 1 ) distribution as γ 0 + .
The next lemma considers the particular case of a sinh-normal distribution in which the scale parameter is equal to 2, η = 2 . This model is related to the Birnbaum–Saunders (BS) distribution. This result is the basis of the use of this distribution as a regression model, as shown in Section 5.
Lemma 1.
1. 
If Y S H N ( γ , ξ , 2 ) , then T = exp ( Y ) B S ( γ , exp ( ξ ) ) .
2. 
Reciprocally, if T B S ( α , β ) , then Y = log T follows a sinh-normal distribution with shape parameter α, location parameter ξ = log β and scale parameter η = 2 , that is Y S H N ( α , log β , 2 ) .
Proof. 
It can be seen in [1] (Theorem 1.1). □
Due to the result given in Lemma 1, the sinh-normal distribution is also named log-Birnbaum–Saunders distribution.

3. Results

  • Flexible sinh-normal distribution.
This model is obtained by applying the transformation introduced in (3) to Z F S N ( δ , λ ) . The new model is called the flexible sinh-normal (FSHN) distribution. It is denoted as Y F S H N ( γ , ξ , η , δ , λ ) . By applying the change of variable formula, the pdf of Y, f Y , is
f Y ( y ) = f Z ( a γ , ξ , η ( y ) ) a γ , ξ , η ( y )
where f Z denotes the pdf of Z F S N ( δ , λ ) given in (2), a γ , ξ , η ( y ) and a γ , ξ , η ( y ) = y a γ , ξ , η ( y ) are
a γ , ξ , η ( y ) = 2 γ s i n h y ξ η , a γ , ξ , η ( y ) = 2 η γ c o s h y ξ η .
Explicitly, (4) can be written as
f Y ( y ) = c δ 2 η γ c o s h y ξ η ϕ 2 γ s i n h y ξ η + δ Φ λ 2 γ s i n h y ξ η ,
with c δ = ( 1 Φ ( δ ) ) 1 .
The following are particular cases of interest in the F S H N model:
-
If δ = 0 , then the FSHN model reduces to the skew sinh-normal distribution studied by Leiva et al. [7].
-
If λ = 0 , then a symmetric model denoted by F S H N λ = 0 ( γ , ξ , η , δ ) is obtained, which allows us to model symmetric bimodal data.
-
If δ = λ = 0 , then the FSHN model reduces to the sinh-normal distribution introduced by Rieck and Nedelman [1].
Next, some results dealing with the inverse transformation to that introduced in (3) are given.
Proposition 2.
Let Y F S H N ( γ , ξ , η , δ , λ ) . Then,
a γ , ξ , η ( Y ) = 2 γ s i n h Y ξ η F S N ( δ , λ ) .
Proof. 
It is obtained by considering Z = a γ , ξ , η ( Y ) and applying the inverse transformation technique to
Y = h ( Z ) = a r c s i n h γ 2 Z η + ξ , with Z F S N ( δ , λ ) .
The following are particular cases of interest in Proposition 2:
  • If λ = 0 , then Z = a γ , ξ , η ( Y ) follows the f l e x i b l e n o r m a l ( δ ) model studied by Castillo et al. [8].
  • If δ = 0 , then Z = a γ , ξ , η ( Y ) S k e w N o r m a l ( λ ) .
  • If λ = δ = 0 , then Z = a γ , ξ , η ( Y ) N ( 0 , 1 ) .
Next, an explicit expression for the cumulative distribution function (cdf) of a FSHN distribution is given.
Theorem 1.
Let Y F S H N ( γ , ξ , η , δ , λ ) . Then, the cdf of Y is
F Y ( y ) = c δ Φ B N λ λ δ 1 + λ 2 , a ( y ) δ , i f y < ξ c δ Φ B N λ λ δ 1 + λ 2 , δ + Φ B N λ λ δ 1 + λ 2 , a ( y ) + δ Φ B N λ λ δ 1 + λ 2 , δ , i f y ξ ,
where a ( y ) = a γ , ξ , η ( y ) is defined in (5), Φ B N λ ( · , · ) denotes the cdf of a bivariate normal distribution, with mean vector μ = ( 0 , 0 ) and covariance matrix
Ω λ = 1 ρ λ ρ λ 1 where ρ λ = λ 1 + λ 2 .
Proof. 
Since Y F S H N ( γ , ξ , η , δ , λ )
Y = a r c s i n h γ 2 Z η + ξ , with Z F S N ( δ , λ ) .
Taking into account that a r c s i n h ( x ) = l o g x + x 2 + 1 is a monotonically increasing function of x, we have the following relationship between the cdf’s de Y and Z:
F Y ( y ) = F Z ( a γ , ξ , η ( y ) )
with a γ , ξ , η ( y ) = 2 γ s i n h y ξ η .
By applying the expression for the cdf of Z F S N ( δ , λ ) given in [2] we have that, for a γ , ξ , η ( y ) < 0 , or, equivalently, for y < ξ
F Z ( a γ , ξ , η ( y ) ) = c δ Φ B N λ λ δ 1 + λ 2 , a ( y ) δ ,
and for a γ , ξ , η ( y ) 0 , or, equivalently, for y ξ
F Z ( a γ , ξ , η ( y ) ) =
c δ Φ B N λ λ δ 1 + λ 2 , δ + Φ B N λ λ δ 1 + λ 2 , a ( y ) + δ Φ B N λ λ δ 1 + λ 2 , δ
where Φ B N λ ( · , · ) denotes the cdf of a bivariate normal distribution, with mean vector μ = ( 0 , 0 ) and covariance matrix Ω λ given in (8). Thus, the proposed result holds. □
Next, some particular cases of interest for the cdf in the FSHN model are discussed.
Corollary 1.
Let Y F S H N ( γ , ξ , η , δ , λ ) .
1. 
If λ = 0 , then the cdf of Y reduces to
F Y ( y ) = c δ 2 Φ ( a ( y ) δ ) , i f y < ξ c δ 2 Φ ( a ( y ) + δ ) + 1 2 Φ ( δ ) , i f y ξ .
2. 
If δ = 0 , then the cdf of Y reduces to F Y ( y ) = 2 Φ B N λ ( 0 , a ( y ) ) , for y R .
Proof. 
  • If λ = 0 , then ρ λ = 0 , where ρ λ is given in (8). Recall that, in the bivariate normal distribution, uncorrelation implies independence, therefore
    Φ B N λ = 0 ( x 1 , x 2 ) = Φ ( x 1 ) Φ ( x 2 ) , ( x 1 , x 2 ) .
    Taking into account that Φ ( 0 ) = 1 / 2 and Φ ( δ ) = 1 Φ ( δ ) , we have that (7) reduces to (10).
  • If δ = 0 , then we have the cdf of the skew-sinh-normal distribution.
The result in Corollary 1 for λ = 0 corresponds to the model studied by Olmos et al. [9], whereas the result for δ = 0 is a particular case of models studied by Vilca-Labra and Leiva-Sanchez [10].
  • Shape of f Y ( · ) .
Proposition 3.
Let Y F S H N ( γ , ξ , η , δ , λ ) . The pdf given in (16) can be bimodal. The modes would be given as the solution of the following non-linear equations
1. 
y 1 * < ξ solution of
δ + λ ϕ ( λ s ( y ) ) Φ ( λ s ( y ) ) + s ( y ) c 2 ( y ) = s ( y ) ,
2. 
y 2 * > ξ solution of
δ + λ ϕ ( λ s ( y ) ) Φ ( λ s ( y ) ) + s ( y ) c 2 ( y ) = s ( y ) ,
with s ( y ) = 2 γ s i n h y ξ η and c ( y ) = 2 γ c o s h y ξ η .
Proof. 
To obtain the maximums of f Y , we take the natural logarithm of f Y , except additive terms not depending of y. Thus, let us consider
g ( y ) = ln ( c ( y ) ) + ln ϕ | s ( y ) | + δ + ln Φ λ s ( y )
where s ( y ) = 2 γ s i n h y ξ η and c ( y ) = 2 γ c o s h y ξ η . Taking the first derivative of g ( y ) with respect to y, g ( y ) = 0 , after straightforward calculations, (11) and (12) are obtained. □
Proposition 4.
Let Y F S H N ( γ , ξ , η , δ , λ ) with λ = 0 . Then, the pdf of Y is symmetrical with respect to ξ and can be bimodal for δ < 0 .
Proof. 
Symmetry follows from the fact that c o s h ( · ) and ϕ ( | z | + δ ) are even functions. Bimodality for δ < 0 follows from Proposition 3. □
Proposition 5.
Let Y F S H N ( γ , ξ , η , δ , λ ) . Then, the pdf of Y is non-differentiable at y = ξ .
Proof. 
This result follows from the fact that, if y = ξ , then s ( ξ ) = 2 γ s i n h ( 0 ) = 0 and the non-differentiability of the absolute value function at zero. □
Some illustrative plots for the pdfs in the FSHN model are given in Figure 1, Figure 2 and Figure 3. In Figure 1 and Figure 2, we mainly focus on the study of the shape parameters, δ and γ , so in every figure a positive or negative value of δ is fixed, and a range of possible values is considered for γ (less and greater than two), since our aim is to compare with the sinh-normal distribution, which is strongly unimodal for γ 2 and bimodal for γ > 2 [1]. In this way, the effect of δ and/or γ can be appreciated. Moreover, positive and negative values of λ are considered to assess the effect of the skewness parameter. In Figure 3, λ = 0 is fixed and therefore the symmetric submodel is plotted. Additional details are given next for every figure.
In Figure 1, without loss of generality for our purposes, the location and scale parameters are taken as ξ = 0 and η = 1 . We intend to highlight the effect of δ on the bimodality of FSHN model. In Figure 1a, small values are considered for the skewness parameter, λ = 0.25 and 0.25 , and a clearly negative value of δ = 2.75 . All pdfs are bimodal. Note that the parameter γ { 0.75 , 1.5 , 3.5 , 5.5 } and even for values of γ 2 the pdf is bimodal. These plots suggest that a negative value of δ induces bimodality and the effect of γ is attenuated. It is also possible to appreciate the effect of the sign of skewness parameter λ in Figure 1a; note that black and blue plots correspond to negative λ , whereas orange and red ones correspond to positive λ . On the other hand, Figure 1b, a positive value of δ = 1.5 is considered ( λ = 1 ). We highlight that the distribution is unimodal even for values of γ > 2 .
In Figure 2, a small value of δ is fixed ( δ = 0.2 in Figure 2a and δ = 0.2 in Figure 2b); the plots suggest that, in this case, the bimodality depends on the value of γ . However, we must also be conscious that, in these plots λ = 0.4 , for large values of λ unimodal distributions are nearly always obtained for any values of γ and δ .
In Figure 3, the skewness parameter λ = 0 is fixed, therefore symmetric pdfs are always obtained. In Figure 3a, a negative value of δ = 0.75 is fixed and all plots are bimodal (even for values of the parameter γ < 2 in which the sinh-normal distribution is unimodal). In Figure 3b, δ = 0.75 is fixed and unimodal symmetric distributions are obtained.
Next, we show that the FSHN model is closed under location and scale changes.
Proposition 6.
Let Y F S H N ( γ , ξ , η , δ , λ ) . Then,
1. 
a Y F S H N ( γ , a ξ , a η , δ , λ ) , a > 0 .
2. 
Y + b F S H N ( γ , ξ + b , η , δ , λ ) , b R .
3. 
Y F S H N ( γ , ξ , η , δ , λ ) .
Proof. 
  • Let Z = a Y with a > 0 . The result proposed follows from the fact that f Z ( z ) = 1 a f Y z a .
  • Let Z = Y + b . Then, f Z ( z ) = f Y ( z b ) .
  • Let Z = Y . Then, f Z ( z ) = f Y ( z ) . Taking into account that c o s h ( · ) is an even function, c o s h ( x ) = c o s h ( x )
    2 γ c o s h z ξ η = 2 γ c o s h z + ξ η = 2 γ c o s h z ( ξ ) η ,
    and, since s i n h ( · ) is an odd function, s i n h ( x ) = s i n h ( x )
    2 γ s i n h z ξ η = 2 γ s i n h z + ξ η = 2 γ s i n h z ( ξ ) η ,
    and the result proposed is obtained.
Corollary 2.
Let Y F S H N ( γ , ξ , η , δ , λ ) . Then,
Y ξ η F S H N ( γ , 0 , 1 , δ , λ ) .
Next, the limit behavior of FSHN model when γ approaches to 0 + is studied.
Proposition 7.
Let Y F S H N ( γ , ξ , η , δ , λ ) . Then, U γ = 2 ( Y ξ ) γ η converges in distribution to the flexible skew-normal F S N ( δ , λ ) distribution as γ 0 + .
Proof. 
Let U γ = 2 ( Y ξ ) γ η . By applying Proposition 6, U γ = 2 ( Y ξ ) γ η F S H N ( γ , 0 , 2 γ , δ , λ ) . Therefore, its pdf is
f U γ ( u ) = c δ c o s h γ u 2 ϕ 2 γ s i n h γ u 2 + δ Φ λ 2 γ s i n h γ u 2 , u R .
Considering that lim γ 0 + c o s h γ u 2 = 1 , by L’Hôpital–Bernoulli rule,
lim γ 0 + 2 γ s i n h γ u 2 = u lim γ 0 + c o s h γ u 2 = u .
Since ϕ ( · ) and Φ ( · ) are continuous differentiable functions,
f U γ ( u ) c δ ϕ ( | u | + δ ) Φ ( λ u ) , u R as γ 0 + ,
which is the pdf of a flexible skew-normal ( δ , λ ) distribution. That is, the result proposed. □
Remark 1.
Given Y F S H N ( γ , ξ , η , δ , λ ) with γ close to zero, Proposition 7 allows us to approximate the distribution of Y, properly normalized, by a flexible skew-normal ( δ , λ ) distribution. In addition, note that this result is analogous to the property of the sinh-normal distribution given in Proposition 1.
Next, it is shown that the p t h quantile of Y can be obtained from the p t h quantile of Z F S N ( δ , λ ) .
Theorem 2.
Let Y F S H N ( γ , ξ , η , δ , λ ) . Then, the pth quantile of Y, y p , with 0 < p < 1 is given by
y p = ξ + η a r c s i n h γ 2 z p
where z p denotes the pth quantile of Z F S N ( δ , λ ) .
Proof. 
It follows from (3) and the fact that the arcsinh function is monotonically increasing. □
  • Submodel with λ = 0 .
For λ = 0 , y p can be obtained from the quantiles of the N ( 0 , 1 ) distribution.
Corollary 3.
If λ = 0 , then
y p = ξ + η a r c s i n h γ 2 δ + Φ 1 ( 2 p / c δ ) , i f p < 0.5 ξ + η a r c s i n h γ 2 δ + Φ 1 Φ ( δ ) + ( 2 p 1 ) c δ , i f p 0.5 .
As it can be seen in [1], recall that the log-Birnbaum–Saunders regression model is based on the relationship between the Birnbaum–Saunders and the sinh-normal distribution. In our proposal, a regression model is introduced next that is based on the relationship between the flexible Birnbaum–Saunders, studied in [2] and the FSHN distribution introduced in this paper. First, we recall the expression of the pdf for the flexible Birnbaum–Saunders.
  • Flexible Birnbaum–Saunders.
Based on the flexible skew-normal model defined in (2), Martínez-Flórez et al. [2] proposed the flexible Birnbaum–Saunders (FBS) distribution, T F B S ( α , β , δ , λ ) , whose pdf is given by
f ( t ; α , β , δ , λ ) = t 3 / 2 ( t + β ) 2 α β 1 / 2 ( 1 Φ ( δ ) ) ϕ ( | a t | + δ ) Φ ( λ a t ) , t > 0 ,
with a t
a t = a t ( α , β ) = 1 α t β β t ,
α > 0 , β > 0 , δ R , λ R and ϕ ( · ) and Φ ( · ) the pdf and cdf of a N ( 0 , 1 ) , respectively.
In (16), δ and λ are shape parameters. We highlight that λ is a parameter that controls asymmetry (or skewness) and δ is a shape parameter related to bimodality of FBS model. Specifically, it can be seen in [2] that (16) can be bimodal for some combinations of δ and λ parameters.
The following are particular cases of interest in (16):
-
If λ = 0 , then the model introduced by Olmos et al. [9] is obtained.
-
If δ = 0 , then the skew-Birnbaum–Saunders is obtained.
-
If λ = δ = 0 , then the Birnbaum–Saunders is obtained.
Theorem 3.
Let Y F S H N ( γ , ξ , 2 , δ , λ ) . Then, T = e x p ( Y ) F B S ( γ , e x p ( ξ ) , δ , λ ) .
Proof. 
This result is obtained by applying the transformation technique for random variables and taking into account (16). □
In practice, it is usual to consider non-negative rv’s following a F B S ( α , β , δ , λ ) . Then, its neperian logarithm will follow a FSHN distribution, as established in next corollary.
Corollary 4.
Let T F B S ( α , β , δ , λ ) . Then,
Y = l o g ( T ) F S H N ( α , ξ = l o g ( β ) , η = 2 , δ , λ ) .
If T depends on a known set of covariates then it is of interest to build a generalized linear model related to T [11]. This issue is studied in Section 5.

Moments

Let Y F S H N ( γ , ξ , η , δ , λ ) . Recall that
Y = ξ + η a r c s i n h γ 2 Z where Z F S N ( δ , λ ) .
Thus, the moments of Y can be expressed in terms of the moments of a r c s i n h γ 2 Z with respect to Z F S N ( δ , λ ) . To simplify the notation, let us introduce
c j ( γ , δ , λ ) = E F S H N a r c s i n h γ 2 Z j j = 1 , 2 , , V ( γ , δ , λ ) = V a r F S H N a r c s i n h γ 2 Z .
In particular, we have that
E [ Y ] = ξ + η c 1 ( γ , δ , λ ) ,
V a r [ Y ] = η 2 V ( γ , δ , λ ) .

4. Inference Based on Maximum Likelihood Estimation

Let Y F S H N ( γ , ξ , η , δ , λ ) . Let us denote the vector of unknown parameters by θ = ( γ , ξ , η , δ , λ ) T . Given Y 1 , , Y n a random sample of Y F S H N ( γ , ξ , η , δ , λ ) , let us consider the log-likelihood function for the parameter vector θ = ( γ , ξ , η , δ , λ ) , which is given
( θ ) = n log c δ n log η + i = 1 n log z i , 1 + i = 1 n log ϕ ( | z i , 2 | + δ ) + i = 1 n log Φ ( λ z i , 2 ) + c ,
where c denotes a constant independent of θ and
z i , 1 = 2 γ c o s h y i ξ η , z i , 2 = 2 γ s i n h y i ξ η .
To maximize l ( θ ) in θ , consider the score functions, denoted as U ( γ ) , U ( ξ ) , U ( η ) , U ( δ ) and U ( λ ) , and given as the first derivatives of l ( θ ) with respect to γ , ξ , η , δ and λ , respectively. Maximum likelihood estimates for the parameters are given as the solutions to U ( γ ) = 0 , U ( ξ ) = 0 , U ( η ) = 0 , U ( δ ) = 0 and U ( λ ) = 0 , which are equivalent to the following equations
i = 1 n z i , 2 2 + δ i = 1 n | z i , 2 | λ i = 1 n z i , 2 R i , 2 = n
i = 1 n z i , 1 z i , 2 z i , 2 z i , 1 + δ i = 1 n s g n ( z i , 2 ) z i , 1 λ i = 1 n z i , 1 R i , 2 = 0
i = 1 n y i ξ η z i , 1 z i , 2 z i , 2 z i , 1 + δ i = 1 n y i ξ η s g n ( z i , 2 ) z i , 1 λ i = 1 n y i ξ η z i , 1 R i , 2 = n
ϕ ( δ ) 1 Φ ( δ ) δ = i = 1 n | z i , 2 | n
i = 1 n z i , 2 R i , 2 = 0 ,
where s g n ( · ) denotes the s i g n function and
R i , 2 = ϕ ( λ z i , 2 ) Φ ( λ z i , 2 ) , i = 1 , , n .
Iterative methods must be used to solve these equations [12]. Details about the previous maximum likelihood estimators (MLEs) are postponed to Section 5, where the MLEs for the parameters in the regression model based on the FSHN distribution are studied.

5. The Flexible Log-Birnbaum–Saunders Regression Model

In this section the flexible log-Birnbaum–Saunders regression model is introduced. Let T 1 , , T n be independent positive continuous random variables such as
T i F B S ( γ i , τ i , δ i , λ i ) .
Let us assume that the distribution of T i , proposed in (27), depends on a set of known explanatory variables, X i , 1 , , X i , p 1 , satisfying for i = 1 , , n
  • τ i = e x p ( x i T β ) where x i = ( 1 , x i , 1 , , x i , p 1 ) T and β = ( β 0 , β 1 , , β p 1 ) T is a p-dimensional vector of unknown parameters.
  • The shape and skewness parameters in (27) do not involve x i , that is, γ i = γ , δ i = δ and λ i = λ .
Let us consider Y i = l o g ( T i ) . By applying Corollary 4, Y i F S H N ( γ , x i T β , 2 , δ , λ ) . Thus, Y i can be written as a linear model
Y i = x i T β + ϵ i
where the error term ϵ i F S H N ( γ , 0 , 2 , δ , λ ) (Proposition 6 is applied) and ϵ i s are independent. As for the expectation, variance of ϵ i and covariance of ϵ i and ϵ j ( i j ), taking into account (19), we have for i = 1 , , n
E [ ϵ i ] = 2 c 1 ( γ , δ , λ ) V a r [ ϵ i ] = 4 V ( γ , δ , λ ) C o v ( ϵ i , ϵ j ) = 0 , i j .
Taking
β 0 * = β 0 + 2 c 1 ( γ , δ , λ ) ,
we can write
E [ Y i ] = x i T β * with β * = ( β 0 * , β 1 T ) T
where the initial vector of unknown parameters has been partitioned into β = ( β 0 , β 1 T ) T with β 1 = ( β 1 , β p 1 ) T .
By applying the ordinary least squares approach, an unbiased linear estimator of β * is
β ^ * = ( X T X ) 1 X T y ,
where y = ( y 1 , , y n ) T and
X = 1 x 1 , 1 x 1 , 2 x 1 , p 1 1 x 2 , 1 x 2 , 2 x 2 , p 1 1 x n , 1 x n , 2 x n , p 1 .
The covariance matrix of β ^ * is given by
C o v ( β ^ * ) = 4 V ( γ , δ , λ ) ( X T X ) 1 .
The model previously introduced is named flexible log-Birnbaum–Saunders regression model and is denoted by F L B S ( γ , x i T β , 2 , δ , λ ) .
  • Particular cases of interest.
  • If δ = λ = 0 , then we have the log-Birnbaum–Saunders regression model proposed by Rieck and Nedelman [1], which is based on the sinh-normal distribution [1].
  • If δ = 0 , then (28) reduces to the skewed log-Birnbaum–Saunders regression model studied by Lemonte [13], which is based on the skewed sinh-normal distribution introduced in Leiva et al. [7].
  • If λ = 0 , then (28) reduces to the submodel flexible log-Birnbaum–Saunders, denoted by F L B S λ = 0 ( γ , x i T β , 2 , δ ) , which is based on the F S H N λ = 0 ( γ , 0 , 2 , δ ) distribution introduced in this paper.

Likelihood Equations in the Regression Model

Let us consider the log-likelihood function for the parameter vector θ = ( γ , β T , δ , λ ) T , which is given
( θ ) = n log c δ + i = 1 n log ξ i , 1 + i = 1 n log ϕ ( | ξ i , 2 | + δ ) + i = 1 n log Φ ( λ ξ i , 2 ) + c ,
where c denotes a constant independent of θ and
ξ i , 1 = 2 γ c o s h y i x i T β 2 , ξ i , 2 = 2 γ s i n h y i x i T β 2 .
To maximize l ( θ ) in θ , consider the score functions, given as the first derivatives of l ( θ ) with respect to γ , β j (with j = 0 , , p 1 ), δ and λ , respectively. They are denoted as U ( γ ) , U ( β j ) with j = 0 , 1 , , p 1 , U ( δ ) and U ( λ ) ,
U ( γ ) = n γ + 1 γ i = 1 n ξ i , 2 2 + δ γ i = 1 n s g n ( ξ i , 2 ) ξ i , 2 λ γ i = 1 n ξ i , 2 w i , 2
U ( β j ) = 1 2 i = 1 n x i , j ξ i , 1 ξ i , 2 ξ i , 2 ξ i , 1 + δ 2 i = 1 n x i , j s g n ( ξ i , 2 ) ξ i , 1 λ 2 i = 1 n x i , j ξ i , 1 , w i , 2
with j = 0 , 1 , , p 1 .
U ( δ ) = n ϕ ( δ ) 1 Φ ( δ ) i = 1 n s g n ( ξ i , 2 ) ξ i , 2 n δ ,
U ( λ ) = i = 1 n ξ i , 2 w i , 2 ,
where s g n ( · ) denotes the s i g n function and
w i , 2 = ϕ ( λ ξ i , 2 ) Φ ( λ ξ i , 2 ) , i = 1 , , n .
Maximum likelihood estimators for the parameters are given as the solutions to U ( γ ) = 0 , U ( β j ) = 0 , j = 0 , , p 1 , U ( δ ) = 0 and U ( λ ) = 0 , which requires numerical techniques.

6. Applications

6.1. Illustration 1

Dataset 1 consists of n = 116 observations of daily ozone mean concentration in the atmosphere (in ppb = p p m × 1000 ) in New York taken from May to September 1973 [14]. The average concentration of pollutants in the air is of interest in epidemiological studies due to its serious adverse effects on the human health. It is usually assumed that these data are independent, and therefore they do not require a cyclic trend analysis, see for instance the works of Gokhale and Khare [15], Nadarajah [16] and Leiva et al. [7], where this dataset is also studied.
Descriptive summaries. The sample mean, variance, skewness and kurtosis coefficients are: y ¯ = 42.1293 , s 2 = 1088.2010 , b 1 = 1.2098 and b 2 = 1.1122 . From these summaries and Figure 4, we can conclude that this dataset is positively (or right) skewed. Thus, sinh-normal, skew sinh-normal, Birnbaum–Saunders, extended Birnbaum–Saunders [7] and FSHN distributions can be considered as models for this dataset. Their performance is compared throughout the Akaike information criterion, A I C = 2 l ^ ( · ) + 2 k and Bayesian information criterion, B I C = 2 l ^ ( · ) + l o g ( n ) k , where l ^ ( · ) denotes the log-likelihood function evaluated at the MLEs of parameters and k is the number of parameters in the model. From the results in Table 1, the FSHN distribution provides the best fit to these data since its AIC and BIC are the smallest [17,18].
On the other hand, since the sinh-normal, skew sinh-normal and FSHN are nested models, they can be compared by using the likelihood ratio test, [19].
First, let us consider the test
H 0 : ( δ , λ ) = ( 0 , 0 ) vs H 1 : ( δ , λ ) ( 0 , 0 ) ,
which is equivalent to testing the sinh-normal (SHN) versus a FSHN distribution. The associated likelihood ratio statistic is
Λ 1 = L S H N ( γ ^ , ξ ^ , η ^ ) L F S H N ( γ ^ , ξ ^ , η ^ , δ ^ , λ ^ ) ,
which is asymptotically distributed as a chi-square variable with two degrees of freedom (df). After substituting the estimated values of the parameters, we obtain 2 log ( Λ 1 ) = 60.261 , which is greater than the 5% chi-square critical value with 2 df, which is 5.99 . Therefore, the FSHN model is preferred to the SHN model for this dataset.
Second, we compare the FSHN model to the skew sinh-normal (skew-SHN) model. Let us now consider the following likelihood ratio statistic
Λ 2 = L S k e w S H N ( γ ^ , ξ ^ , η ^ , λ ^ ) L F S H N ( γ ^ , ξ ^ , η ^ , δ ^ , λ ^ ) ,
which is asymptotically distributed as a chi-square variable with d f = 1 , χ 1 2 .
After substituting the estimated values, we obtain 2 log ( Λ 2 ) = 7.182 , which is greater than the 5% chi-square critical value with 1 df, which is 3.84 . Therefore, FSHN is preferred to skew-SHN model for this dataset.
Figure 4 presents the histogram and the fitted pdfs. Note that FSHN and extended BS provide a good fit to this dataset.

6.2. Illustration 2

Dataset 2 consists of n = 500 observations of fetus weight (in grams) before birth obtained by ultrasound technique. Descriptive summaries are given in Table 2. The data are available at http://www.mat.uda.cl/hsalinas/cursos/2011/R/weight.rar (accessed on 17 May 2021). The variables b.weigt (fetal weight in grams) and a.weigt (birth weight in grams) can be found there. We study the b.weight variable.
In Table 2, note that the mean and median are similar and the sample asymmetry coefficient b 1 is close to zero. Therefore, the distribution is fairly symmetrical. This fact can be seen in the histogram plotted in Figure 5, where we can also appreciate the bimodality of this dataset. Bimodality is due to the difference in weight, at the gestation stage, between males and females. Therefore, a symmetric bimodal model must be considered. In the literature, there are few models which exhibit both properties [20]. We propose the bimodal normal [21], the two-piece skew normal [22], the sinh-normal with γ > 2 and the submodel FSHN with λ = 0 . In addition, we consider a mixture of two normal distributions whose pdf is
f ( y ) = α σ 1 ϕ y μ 1 σ 1 + 1 α σ 2 ϕ y μ 2 σ 2 y R ,
with 0 < α < 1 , μ i R and σ i > 0 , i = 1 , 2 .
The estimated parameters in these models, along with their standard errors in parentheses, are given in Table 3. In this table, the following abbreviations are used: SHN (sinh-normal), TN (two-piece skew normal [22]), BN (bimodal normal [21]), MN (mixture of two normals given in (35)), F S H N λ = 0 (FSHN with λ = 0 ). From AIC and BIC in Table 3, we can conclude that the mixture of two normals and the F S H N λ = 0 model provide the best fit to this dataset. However, it is of interest to point out that the use of mixtures is a quite controversial issue in statistics, mainly due to non-identifiability problems (see, e.g., [23]).
Since SHN and F S H N λ = 0 are nested models, we can test
H 0 : δ = 0 versus H 1 : δ 0 ,
which is equivalent to compare SHN versus F S H N λ = 0 . Taking into account that the Fisher information matrix is non-singular at ( δ , λ ) = ( 0 , 0 ) , we can consider the likelihood ratio statistic
Λ 1 = L S H N ( γ ^ , ξ ^ , η ^ ) L F S H N ( γ ^ , ξ ^ , η ^ , δ ^ ) .
It is obtained that 2 log ( Λ 1 ) = 15.868 , which is greater than the χ 1 , 0.05 2 = 3.84 . Thus, H 0 is rejected, leading to the conclusion that the proposed F S H N λ = 0 model fits better than the SHN considered in this case.
In Figure 5, the histogram of bebe weights is plotted along with the proposed bimodal models. There, the good fit provided by our proposal can be checked.

6.3. Illustration 3

Dataset 3 consists of 40 independent observations, which correspond to the failure time, T, for hardened steel specimens in a rolling contact fatigue test. The observations were taken at each of four values of contact stress, which is the covariate x in our proposal. This dataset can be seen in the work of Chan et al. [24]. Let Y i = log T i and consider the regression model
Y i = β 0 + β 1 log x i + ϵ i , i = 1 , , 40 .
We fit the log-Birnbaum–Saunders, the log-skew-Birnbaum–Saunders proposed by Lemonte et al. [13] and the log-flexible Birnbaum–Saunders introduced in this paper to this dataset. The estimated parameters along with the AIC and BIC are given in Table 4. According to AIC and BIC, log-flexible Birnbaum–Saunders provides a better fit than the other ones, so this model would be preferred.

7. Simulation

Next, a simulation study is presented to illustrate the performance of our results. Two features are studied: (1) the global performance of MLEs of parameters if the sample size increases; and (2) the effect of varying every shape parameter, γ , δ and λ , on the performance of the rest of estimators under consideration. Both issues are studied in the use of the flexible sinh-normal distribution as a regression model.

7.1. Simulation I

The flexible sinh-normal distribution as a regression model with one covariate is considered, that is
Y i = β 0 + β 1 x i + ϵ i , i = 1 , , n ,
where ϵ i F S H N ( γ , 0 , 2 , δ , λ ) or, equivalently, Y i F S H N ( γ , β 0 + β 1 x i , 2 , δ , λ ) independent for i = 1 , , n . The vector of unknown parameters is ( γ , β 0 , β 1 , δ , λ ) .
As sample sizes, we consider n { 25 , 50 , 75 , 100 , 500 } , m = 5000 simulations and values of the parameters ( γ , β 0 , β 1 , δ , λ ) = ( 1 , 1.75 , 1.25 , 1.5 , 2 ) . As x, covariate values of a U ( 0 , 1 ) distribution are considered. Values of the FSHN were generated from values of the FSN distribution, which were obtained by using the stochastic representation of FSN model given in [5]. The results are listed in Table 5, Table 6 and Table 7. As for measures of performance, the mean of estimates, their standard errors (sd) in parentheses, the relative bias and the square root of the mean squared error (MSE) are given.
From the results in Table 5, Table 6 and Table 7, we can conclude that the MLEs are biased; in general, the standard error, relative bias and the squared root of MSE decrease if the sample size n increases. We highlight that, in this simulation, β ^ 1 exhibits a good behavior, whereas the greater variability corresponds to λ ^ .
Simulations were carried out by using the optim function of software R [25] by applying the Nelder–Mead method.

7.2. Simulation II

Next, a sensitivity study about the effect of varying the shape parameters (and the sample size) on the estimates of other parameters is presented.
Summaries varying the γ parameter are given in Table 8. Similarly, results varying the parameters δ and λ are given in Table 9 and Table 10, respectively.
  • Comments to Table 8 (varying γ > 0 ).
In this table, we consider β 0 = 1.75 , β 1 = 1.25 ,   δ = 1.5 , λ = 2 , γ { 0.25 , 0.75 , 1.25 } and sample sizes n { 25 , 50 , 75 , 100 , 200 } . For β 0 ^ and β 1 ^ , it can be seen that the relative biases are small. Moreover, we can appreciate that the relative bias, standard error and M S E decrease when the sample size increases, especially for β 1 ^ . δ ^ and λ ^ exhibit greater variability, but their relative bias, sd and M S E also decrease when the sample size increases. As for γ ^ , we can point out that this estimator is also well behaved when n increases. Its relative bias for small sample sizes, n = 25 , is small and the M S E is a moderate value.
  • Comments to Table 9 (varying δ R ).
In this table, we consider γ = 1 , β 0 = 1.75 , β 1 = 1.25 , λ = 2 , δ { 1 , 0.5 , 0.25 , 0.25 , 0.5 , 1 } and n { 25 , 50 , 75 , 100 , 200 } . We can see that, for β 0 ^ and β 1 ^ , the relative bias is small even for n = 25 , and the relative bias, sd and M S E decrease when n increases. Again, δ ^ and λ ^ exhibit greater variability than the other estimators, but the error measures under consideration are satisfactory. The relative bias, sd and M S E of γ ^ decrease when n increases, and their values are moderate even for small sample sizes.
  • Comments to Table 10 (varying λ R ).
In this case, we consider γ = 1 , β 0 = 1.75 , β 1 = 1.25 , δ = 1.5 , λ { 1 , 0.5 , 0.25 , 0.25 , 0.5 , 1 } and n { 25 , 50 , 75 , 100 , 200 } . Again, a similar behavior to the previously explained is obtained.
As final conclusion, we point out that these simulation studies cover a variety of situations as for the shapes of the FSHN distribution (unimodal and bimodal) and suggest that the estimators of the parameters are consistent when the sample size increases [26].

8. Discussion

The BS distribution is an asymmetric model used for survival time data and material lifetime subject to stress as it can be seen in [27]. Due to its practical and theoretical interest, a number of generalizations can be found in literature. We can cite the extensions provided in [28] to the family of elliptical distributions; in [10] based on the elliptical asymmetric distributions; and the extended Birnbaum–Saunders (EBS) distribution introduced in [7].
Other generalizations intend to solve specific deficiencies observed when this model is fitted to a dataset. In this sense, we can cite the epsilon-Birnbaum–Saunders model introduced in [8] to accommodate outliers; the extension based on the slash-elliptical family of distributions given in [3]; and the generalized modified slash Birnbaum–Saunders, which is based on the work in [29] and proposed in [30]. All these extensions are appropriate to fit data with greater or smaller asymmetry (or kurtosis) than that of the usual BS model, but they are not appropriate for fitting bimodal data. This issue is of great interest, as discussed by Olmos et al. [9], Bolfarine et al. [31], Martínez-Flórez et al. [2] and Elal-Olivero et al. [32]. The flexible BS distribution was proposed to model skewness and to fit data with and without bimodality. In this paper, this model is spread out to be used as a regression model in those situations in which regression models based on other generalizations of BS, such as those proposed in [7,13,33,34,35,36,37], do not provide a satisfactory fit.

Author Contributions

Conceptualization, G.M.-F., I.B.-C. and H.W.G.; formal analysis, G.M.-F., I.B.-C. and H.W.G.; investigation, G.M.-F., I.B.-C. and H.W.G.; methodology, G.M.-F., I.B.-C. and H.W.G.; Software, G.M.-F. and I.B.-C.; supervision, H.W.G.; validation, G.M.-F. and I.B.-C.; and visualization, H.W.G. All authors have read and agreed to the published version of the manuscript.

Funding

The research of G. Martínez-Flórez was supported by project: Distribuições de Probabilidade Mutivariadas Assimétricas e Flexíveis, Universidade Federal do Ceará, Fortaleza, Brazil. The research of H. W. Gómez was supported by Grant SEMILLERO UA-2021, Chile.

Data Availability Statement

Details about data availability are given in Section 6.1, Section 6.2 and Section 6.3.

Acknowledgments

G. Martínez-Flórez acknowledges the support given by Universidad de Córdoba, Montería, Colombia, and Universidade Federal do Ceará, Fortaleza, Brazil.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
BSBirnbaum–Saunders
cdfcumulative distribution function
FBSFlexible Birnbaum-Saunders
FSHNFlexible sinh-normal
FSNFlexible skew Normal
pdfprobability density function
SHNSinh-normal

References

  1. Rieck, J.R.; Nedelman, J.R. A log-linear model for the Birnbaum-Saunders distribution. Technometrics 1991, 33, 51–60. [Google Scholar]
  2. Martínez-Flórez, G.; Barranco-Chamorro, I.; Bolfarine, H.; Gómez, H.W. Flexible Birnbaum-Saunders Distribution. Symmetry 2019, 11, 1305. [Google Scholar] [CrossRef] [Green Version]
  3. Gómez, H.W.; Olivares-Pacheco, J.F.; Bolfarine, H. An extension of the generalized Birnbaum- Saunders distribution. Stat. Probab. Lett. 2009, 79, 331–338. [Google Scholar] [CrossRef]
  4. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
  5. Gómez, H.W.; Elal-Olivero, D.; Salinas, H.S.; Bolfarine, H. Bimodal Extension Based on the Skew-normal Distribution with Application to Pollen Data. Environmetrics 2011, 22, 50–62. [Google Scholar] [CrossRef]
  6. Martínez-Flórez, G.; Bolfarine, H.; Gómez, H.W. Censored bimodal symmetric-asymmetric families. Stat. Interface 2018, 11, 237–249. [Google Scholar] [CrossRef]
  7. Leiva, V.; Vilca-Labra, F.; Balakrishnan, N.; Sanhueza, A. A skewed sinh-normal distribution and its properties and application to air pollution. Commun. Stat. Theory Methods 2010, 39, 426–443. [Google Scholar] [CrossRef]
  8. Castillo, N.; Gomez, H.W.; Bolfarine, H. Epsilon Birnbaum-Saunders distribution family: Properties and inference. Stat. Pap. 2011, 52, 871–883. [Google Scholar] [CrossRef]
  9. Olmos, N.M.; Martínez-Flórez, G.; Bolfarine, H. Bimodal Birnbaum-Saunders Distribution with Application to Corrosion Data. Commun. Stat. Theory Methods 2017, 46, 6240–6257. [Google Scholar] [CrossRef]
  10. Vilca-Labra, F.; Leiva-Sanchez, V. A new fatigue life model based on the family of skew-elliptical distributions. Commun. Stat. Theory Methods 2006, 35, 229–244. [Google Scholar] [CrossRef]
  11. Calderín Ojeda, E.; Gómez Déniz, E.; Barranco Chamorro, I. Modelling Zero-Inflated Count Data With a Special Case of the Generalised Poisson Distribution. Astin Bull. 2019, 49, 689–707. [Google Scholar] [CrossRef]
  12. Rivera, P.A.; Barranco-Chamorro, I.; Gallardo, D.I.; Gómez, H.W. Scale mixture of Rayleigh distribution. Mathematics 2020, 8, 1842. [Google Scholar] [CrossRef]
  13. Lemonte, A. A log-Birnbaum-Saunders regression model with asymmetric errors. Improved statistical inference for the two-parameter Birnbaum-Saunders distribution. J. Stat. Comput. Stat. Simul. 2012, 82, 1775–1787. [Google Scholar] [CrossRef] [Green Version]
  14. Chambers, J.M.; Cleveland, W.S.; Kleiner, B.; Tukey, P.A. Graphical Methods for Data Analysis; Wadsworth: Belmont, CA, USA, 1983. [Google Scholar]
  15. Gokhale, S.; Khare, M. Statistical behavior of carbon monoxide from vehicular exhausts in urban environments. Environ. Model. Softw. 2007, 22, 526–535. [Google Scholar] [CrossRef]
  16. Nadarajah, S. A truncated inverted beta distribution with application to air pollution data. Stoch. Environ. Res. Risk Assess. 2008, 22, 285–289. [Google Scholar] [CrossRef]
  17. Akaike, H. A new look at statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
  18. Kullback, S.; Leibler, R.A. On information and sufficiency. Ann. Math. Stat. 1951, 22, 79–86. [Google Scholar] [CrossRef]
  19. Vuong, Q. Likelihood ratio tests for model selection and non-nested hypotheses. Econometrica 1989, 57, 307–333. [Google Scholar] [CrossRef] [Green Version]
  20. Arnold, B.C.; Gómez, H.W.; Salinas, H.S. On multiple constraint skewed models. Statistics 2009, 43, 279–293. [Google Scholar] [CrossRef]
  21. Elal-Olivero, D. Alpha-skew-normal distribution. Proyecciones J. Math. 2010, 29, 224–240. [Google Scholar] [CrossRef] [Green Version]
  22. Kim, H.J. On a class of two-piece skew-normal distribution. Statistic 2005, 39, 537–553. [Google Scholar] [CrossRef]
  23. Marin, J.-M.; Mengersen, K.; Robert, C. Bayesian modelling and inference on mixtures of distributions. In Handbook of Statistics; Rao, C., Dey, D., Eds.; Springer: New York, NY, USA, 2005; Volume 25, pp. 1–56. [Google Scholar]
  24. Chan, P.S.; Ng, H.K.T.; Balakrishnan, N.; Zhou, Q. Point and interval estimation for extreme-value regression model under Type-II censoring. Comput. Stat. Data Anal. 2008, 52, 4040–4058. [Google Scholar] [CrossRef]
  25. R Development Core Team. A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021. [Google Scholar]
  26. Barranco Chamorro, I.; Jimenez Gamero, M.D. Asymptotic Results in Partially Non-Regular Log-Exponential Distributions. J. Stat. Comput. Simul. 2012, 82, 445–461. [Google Scholar] [CrossRef]
  27. Birnbaum, Z.W.; Saunders, S.C. A New Family of Life Distributions. J. Appl. Probab. 1969, 6, 319–327. [Google Scholar] [CrossRef]
  28. Díaz-García, J.A.; Leiva-Sánchez, V. A new family of life distributions based on the elliptically contoured distributions. J. Stat. Plann. Inference 2005, 128, 445–457. [Google Scholar] [CrossRef]
  29. Reyes, J.; Barranco-Chamorro, I.; Gómez, H.W. Generalized modified slash distribution with applications. Commun. Stat. Theory Methods 2020, 49, 2025–2048. [Google Scholar] [CrossRef]
  30. Reyes, J.; Barranco-Chamorro, I.; Gallardo, D.I.; Gómez, H.W. Generalized Modified Slash Birnbaum-Saunders Distribution. Symmetry 2018, 10, 724. [Google Scholar] [CrossRef] [Green Version]
  31. Bolfarine, H.; Martínez–Flórez, G.; Salinas, H.S. Bimodal symmetric-asymmetric families. Commun. Stat. Theory Methods 2018, 47, 259–276. [Google Scholar] [CrossRef]
  32. Elal-Olivero, D.; Olivares-Pacheco, J.F.; Venegas, O.; Bolfarine, H.; Gómez, H.W. On Properties of the Bimodal Skew-Normal Distribution and an Application. Mathematics 2020, 8, 703. [Google Scholar] [CrossRef]
  33. Martínez-Flórez, G.; Bolfarine, H.; Gómez, H.W. An alpha-power extension for the Birnbaum-Saunders distribution. Statistics 2014, 48, 896–912. [Google Scholar] [CrossRef]
  34. Santana, L.; Vilca, F.; Leiva, V. Influence analysis in skew-Birnbaum-Saunders regression models and applications. J. Appl. Stat. 2011, 38, 1633–1649. [Google Scholar] [CrossRef]
  35. Barros, M.; Paula, G.A.; Leiva, V. A new class of survival regression models with heavy-tailed errors: Robustness and diagnostics. Lifetime Data Anal. 2008, 14, 316–332. [Google Scholar] [CrossRef] [PubMed]
  36. Martínez-Flórez, G.; Bolfarine, H.; Gómez, H.W. The Log-Linear Birnbaum-Saunders Power Model. Methodol. Comput. Appl. Probab. 2017, 19, 913–933. [Google Scholar] [CrossRef]
  37. Ortega, E.M.; Bolfarine, H.; Paula, G.A. Influence diagnostics in generalized log-gamma regression models. Comput. Stat. Data Anal. 2003, 42, 165–186. [Google Scholar] [CrossRef]
Figure 1. FSHN pdfs: (a) F S H N (5.5, 0, 1, −2.75, −0.25) (black), F S H N (3.5, 0, 1, −2.75, −0.25) (blue), F S H N (1.5, 0, 1, −2.75, 0.25) (orange) and F S H N (0.75, 0, 1, −2.75, 0.25) (red); and (b) F S H N (5.5, 0, 1, 1.5, 1) (black), F S H N (3.5, 0, 1, 1.5, 1) (blue), F S H N (1.5, 0, 1, 1.5, 1) (orange) and F S H N (0.75, 0, 1, 1.5, 1) (red).
Figure 1. FSHN pdfs: (a) F S H N (5.5, 0, 1, −2.75, −0.25) (black), F S H N (3.5, 0, 1, −2.75, −0.25) (blue), F S H N (1.5, 0, 1, −2.75, 0.25) (orange) and F S H N (0.75, 0, 1, −2.75, 0.25) (red); and (b) F S H N (5.5, 0, 1, 1.5, 1) (black), F S H N (3.5, 0, 1, 1.5, 1) (blue), F S H N (1.5, 0, 1, 1.5, 1) (orange) and F S H N (0.75, 0, 1, 1.5, 1) (red).
Mathematics 09 01188 g001
Figure 2. FSHN pdfs: (a) F S H N (9.5, 0, 1, 0.2, 0.4) (black), F S H N (5.5, 0, 1, 0.2, 0.4) (blue), F S H N (3, 0, 1, 0.2, 0.4) (orange) and F S H N (1.5, 0, 1, 0.2, 0.4) (red); and In (b) F S H N (9.5, 0, 1, −0.2, 0.4) (black), F S H N (5.5, 0, 1, −0.2, 0.4) (blue), F S H N (3, 0, 1, −0.2, 0.4) (orange) and F S H N (1.5, 0, 1, −0.2, 0.4) (red).
Figure 2. FSHN pdfs: (a) F S H N (9.5, 0, 1, 0.2, 0.4) (black), F S H N (5.5, 0, 1, 0.2, 0.4) (blue), F S H N (3, 0, 1, 0.2, 0.4) (orange) and F S H N (1.5, 0, 1, 0.2, 0.4) (red); and In (b) F S H N (9.5, 0, 1, −0.2, 0.4) (black), F S H N (5.5, 0, 1, −0.2, 0.4) (blue), F S H N (3, 0, 1, −0.2, 0.4) (orange) and F S H N (1.5, 0, 1, −0.2, 0.4) (red).
Mathematics 09 01188 g002
Figure 3. FSHN pdfs: (a) F S H N (4.5, 0, 2.5, −0.75, 0) (black), F S H N (2.5, 0, 2.5, −0.75, 0) (blue), F S H N (1.5, 0, 2.5, −0.75, 0) (orange) and F S H N (1.25, 0, 2.5, −0.75, 0) (red); and (b) F S H N (4.5, 0, 2.5, 0.75, 0) (black), F S H N (2.5, 0, 2.5, 0.75, 0) (blue), F S H N (1.5, 0, 2.5, 0.75, 0) (orange) and F S H N (1.25, 0, 2.5, 0.75, 0) (red).
Figure 3. FSHN pdfs: (a) F S H N (4.5, 0, 2.5, −0.75, 0) (black), F S H N (2.5, 0, 2.5, −0.75, 0) (blue), F S H N (1.5, 0, 2.5, −0.75, 0) (orange) and F S H N (1.25, 0, 2.5, −0.75, 0) (red); and (b) F S H N (4.5, 0, 2.5, 0.75, 0) (black), F S H N (2.5, 0, 2.5, 0.75, 0) (blue), F S H N (1.5, 0, 2.5, 0.75, 0) (orange) and F S H N (1.25, 0, 2.5, 0.75, 0) (red).
Mathematics 09 01188 g003
Figure 4. Estimated pdfs: (a) FSHN (blue), skew-SHN (red) and SHN (green); and (b) FSHN (blue), extended BS (red) and BS (green).
Figure 4. Estimated pdfs: (a) FSHN (blue), skew-SHN (red) and SHN (green); and (b) FSHN (blue), extended BS (red) and BS (green).
Mathematics 09 01188 g004
Figure 5. b.weight histogram and fitted bimodal models: (a) FSHN (blue), mixture normal (red) and SHN (green); and (b) FSHN (blue), two-piece skew normal (red) and bimodal normal (green).
Figure 5. b.weight histogram and fitted bimodal models: (a) FSHN (blue), mixture normal (red) and SHN (green); and (b) FSHN (blue), two-piece skew normal (red) and bimodal normal (green).
Mathematics 09 01188 g005
Table 1. Estimates of parameters and their standard errors in parentheses for sinh-Normal (SHN), skew-SHN, BS, extended BS and FSHN models.
Table 1. Estimates of parameters and their standard errors in parentheses for sinh-Normal (SHN), skew-SHN, BS, extended BS and FSHN models.
EstimatorSHNSkew-SHNBSExtended BSFSHN
γ ^ 0.4370.7290.9823.3813.021
(0.170)(0.205)(0.064)(0.766)(1.349)
ξ ^ 43.4454.199 1.1137.043
(3.218)(1.484) (0.448)(1.868)
η ^ 155.008147.88228.0212.03788.949
(56.170)(35.000)(2.264)(0.206)(15.259)
δ ^ −0.7782.845
(0.205)(1.422)
λ ^ 21.035 2.90526.549
(9.841) (0.914)(10.4072)
AIC1147.491096.411102.191091.291091.23
BIC1155.751107.431107.701105.061105.00
Table 2. Descriptive summaries for b.weight variable.
Table 2. Descriptive summaries for b.weight variable.
n y ¯ s 2 Median b 1
5003210.356695,710.631750.0712
Table 3. Estimates of parameters, (standard errors) in models fitted in Dataset 2.
Table 3. Estimates of parameters, (standard errors) in models fitted in Dataset 2.
EstimatorSHNTNBNMN FSHN λ = 0
γ ^ | μ ^ 1 2.216 2501.8590.355
(0.296) (51.012)(0.141)
ξ ^ | μ ^ 2 3237.5463207.4223201.6663880.4793218.617
(29.560)(26.083)(5.620)(59.509)(22.821)
η ^ | σ ^ 1 972.770772.688481.088436.3802976.591
(82.977)(25.760)(8.782)(33.756)(1043.259)
δ ^ | σ ^ 2 497.050−1.156
(40.223)(0.133)
α ^ 1.771 0.4860
(0.539) (0.040)
AIC8103.258109.678238.398096.998089.38
BIC8111.518122.318246.828118.068112.45
Table 4. MLEs in LBS, LSBS and LFBS models.
Table 4. MLEs in LBS, LSBS and LFBS models.
ParametersLBSLSBSLFBS
γ 1.2792.0111.988
(0.143)(0.313)(0.527)
β 0 0.097−0.961−2.053
(0.170)(0.166)(0.530)
β 0 * 0.165−0.392
β 1 −14.116−13.870−13.812
(1.571)(1.602)(1.248)
δ −1.565
(0.430)
λ −0.9322.073
(0.174)(0.910)
AIC129.23125.36121.885
BIC134.30132.11130.32
Table 5. Simulations for γ ^ .
Table 5. Simulations for γ ^ .
n γ ^ (sd)bias ( γ ^ ) MSE
250.9968 (0.2878)0.20260.3833
501.0674 (0.2338)0.14610.2966
751.0972 (0.2221)0.12220.2696
1001.1235 (0.2196)0.10120.2534
5001.0960 (0.2114)0.04320.2181
Table 6. Simulations for β ^ 0 , β ^ 1 .
Table 6. Simulations for β ^ 0 , β ^ 1 .
n β ^ 0 (sd)bias ( β ^ 0 ) MSE β ^ 1 (sd)bias ( β ^ 1 ) MSE
251.8205 (0.4539)0.04030.45931.2866 (0.5022)0.02930.5034
501.7290 (0.4054)0.01200.40591.2681 (0.3424)0.01450.3428
751.6813 (0.4027)0.03930.40851.2621 (0.2702)0.00970.2704
1001.6348 (0.4074)0.06590.42341.2617 (0.2373)0.00940.2376
5001.7141 (0.4475)0.02050.50581.2514 (0.1027)0.00110.1027
Table 7. Simulations for δ ^ and λ ^ .
Table 7. Simulations for δ ^ and λ ^ .
n δ ^ (sd)bias ( δ ^ ) MSE λ ^ (sd)bias ( λ ^ ) MSE
25−1.6190 (0.4530)0.07930.46831.8495 (2.4948)0.07532.4991
50−1.6021 (0.3325)0.06800.34782.0366 (1.0632)0.01831.0638
75−1.6048 (0.3013)0.06990.31902.1029 (1.0358)0.05151.0408
100−1.6239 (0.2934)0.08260.31852.1537 (1.0253)0.07691.0366
500−1.6025 (0.2981)0.10830.33952.1683 (1.3550)0.23421.4335
Table 8. Empirical sd, relative bias and M S E for the F S H N ( γ , 1.75 , 1.25 , 1.5 , 2 ) model.
Table 8. Empirical sd, relative bias and M S E for the F S H N ( γ , 1.75 , 1.25 , 1.5 , 2 ) model.
γ ^ β ^ 0 β ^ 1 δ ^ λ ^
γ n sd RB MSE sdRB MSE sdRB MSE sdRB MSE sdRB MSE
250.12230.08410.12410.11080.02140.11690.16000.00120.16000.73790.00400.73781.73710.03321.7382
500.03620.0480.03810.08510.01240.08780.11220.00030.11220.38400.00050.38390.90760.04630.9123
750.02850.03380.02970.07320.00910.07490.09130.00190.09140.32880.00230.32880.83270.06980.8443
0.251000.02460.02460.02540.06470.00780.06610.07660.00000.07660.30120.00930.30150.73950.06070.7494
2000.01740.01260.01770.05080.00370.05120.05480.00050.05480.24650.00280.24650.47370.03650.4793
250.18010.08190.19030.30160.05610.31710.41950.01640.41990.56910.00690.56921.08710.04871.0914
500.12850.04370.13260.24130.03270.24790.28960.00120.28950.43140.00750.43151.10600.06961.1146
750.09890.02840.10120.20550.02080.20870.23260.00770.23270.35550.00710.35571.21540.08331.2266
0.751000.08530.02210.08690.18290.01290.18430.19870.00230.19870.31470.00170.31470.81970.07530.8334
2000.05960.01000.06000.14440.00620.14480.14010.00130.14010.23870.00070.23870.57180.04780.5797
250.33040.10370.35490.43360.10150.46850.55760.02120.55820.59840.00690.59853.46110.01793.4610
500.23700.05510.24680.36060.05340.37250.38720.00460.38720.40800.00310.40791.00820.05051.0131
750.19510.03880.20100.31060.03530.31660.30650.00650.30660.32860.00300.32860.95890.07500.9705
1.251000.17030.02790.17380.28030.02550.28390.25960.00110.25960.29420.00100.29421.01680.08341.0303
2000.12780.01380.12900.22190.01100.22280.18580.00270.18580.21980.00250.21980.61090.04720.6181
Table 9. Empirical sd, relative bias and M S E for the F S H N ( 1 , 1.75 , 1.25 , δ , 2 ) model.
Table 9. Empirical sd, relative bias and M S E for the F S H N ( 1 , 1.75 , 1.25 , δ , 2 ) model.
γ ^ β ^ 0 β ^ 1 δ ^ λ ^
δ n sd RB MSE sdRB MSE sdRB MSE sdRB MSE sdRB MSE
250.71490.08350.71970.36100.04880.37090.50220.00800.50231.23110.10811.23585.12190.06035.1228
500.52210.04930.52440.28570.02130.28810.35280.00220.35280.73370.07500.73741.35910.06761.3657
750.16790.03470.17140.25150.00710.25180.29180.00730.29190.41050.05720.41440.94450.06570.9535
−11000.14500.02700.14750.23110.01010.23170.24820.00030.24820.36440.03570.36610.87530.06600.8851
2000.10090.01380.10180.18420.00410.18430.16830.00160.16830.27500.01860.27560.61440.03550.6185
250.91400.09340.91870.34780.01370.34860.49730.01490.49761.3880.52871.41286.27750.09236.2795
500.36200.05350.36590.27140.00180.27140.34260.00030.34260.74850.30180.76351.36590.09511.3790
750.24730.04030.25060.24820.00120.24820.27540.00560.27550.55980.22150.57061.07310.08161.0853
−0.51000.17350.03480.17700.22170.00410.22170.24150.00050.24150.45550.19870.46621.60110.08051.6090
2000.11950.02160.12140.18500.00250.18500.16430.00050.16430.34060.11830.34560.73190.04420.7371
251.38810.05961.38920.33640.00930.33680.48920.00940.48932.09551.08172.11267.99800.20268.0075
500.63690.04480.63840.27130.00360.27130.32980.01090.33011.14660.62451.15702.64810.13182.6609
750.27280.03720.27530.24550.00260.24550.26540.00040.26530.67090.47610.68131.41550.11591.4342
−0.251000.20670.02660.20840.22920.00740.22960.22720.00060.22720.55680.37040.56441.21910.12691.2451
2000.15180.01640.15270.19630.00500.19650.16030.00020.16030.44220.22350.44560.94830.08070.9619
252.08970.11392.09250.32160.02110.32370.44660.00090.44663.42030.45463.421811.19580.47911.2356
500.88160.02420.88180.26500.00070.26500.30360.00530.30371.68590.01111.68572.84730.22962.8838
750.58320.00870.58320.23830.00670.23860.23720.00380.23721.17110.16681.17172.14030.22252.1859
0.251000.38100.01550.38130.22110.01000.22180.20830.00080.20830.95960.06490.95971.49840.20601.5538
2000.18880.00360.18880.18470.01160.18580.14590.00020.14590.59330.17000.59470.99730.14121.0364
251.91520.12401.91910.29730.01910.29910.41970.00510.41973.41840.46883.426115.98340.579216.0237
501.02500.06671.02710.24350.00220.24350.28020.00500.28022.04450.20532.04693.60070.30983.6533
750.71430.04550.71570.22280.00760.22320.22840.00150.22841.48700.09341.48762.72620.25062.7716
0.51000.33240.01370.33260.20760.01020.20840.19040.00290.19040.90700.03850.90711.38570.19261.4381
2000.20890.01100.20910.16980.01240.17110.13590.00020.13590.67430.05300.67470.96940.14421.0113
251.97230.20521.98280.25730.01700.25900.36590.00370.36593.81930.53433.856218.17670.732418.2338
501.33440.13451.34100.21160.00250.21160.25480.00340.25482.85320.31362.87014.79510.34444.8439
750.82670.07640.83020.19060.00630.19090.20340.00130.20341.92030.12961.92452.48150.28372.5453
1.01000.73160.06900.73470.17430.01060.17530.17110.00140.17111.70360.10551.70672.04840.25452.1105
2000.43340.03740.43500.13500.01050.13620.11840.00000.11831.14080.03821.14131.13120.15081.1706
Table 10. Empirical sd, relative bias and M S E for the F S H N ( 1 , 1.75 , 1.25 , 1.5 , λ ) model.
Table 10. Empirical sd, relative bias and M S E for the F S H N ( 1 , 1.75 , 1.25 , 1.5 , λ ) model.
γ ^ β ^ 0 β ^ 1 δ ^ λ ^
λ n sd RB MSE sdRB MSE sdRB MSE sdRB M S E sdRB MSE
250.24040.10880.26390.38830.04070.39480.50860.00040.50850.53200.14240.57320.58070.01510.5809
500.17630.05080.18350.30730.01100.30790.35210.00680.35220.37090.08380.39160.44880.02820.4496
750.15100.03250.15440.26480.00110.26480.28380.00190.28380.29990.06060.31340.32660.01930.3272
−11000.13430.02840.13720.23010.00530.23030.24450.00290.24450.26040.04680.26960.29260.00960.2927
2000.09790.00990.09830.16280.00130.16280.16930.00200.16930.18280.01940.18510.18930.00660.1894
250.22060.09350.23960.37420.01020.37460.53690.00400.53690.47740.18420.55150.32200.00990.3220
500.17290.03630.17660.26900.00030.26900.37780.00450.37780.34210.08190.36350.21210.01340.2122
750.14460.02580.14680.21600.00080.21600.29560.00600.29570.27850.05650.29110.16560.00090.1655
−0.51000.12350.02120.12530.18170.00160.18170.25320.00140.25320.23900.04380.24790.14130.00040.1413
2000.08700.01020.08760.12770.00080.12770.17540.00280.17540.17020.02240.17340.09840.00080.0984
250.20860.08310.22450.36420.00520.36430.57400.00080.57390.48120.17130.54540.22170.02120.2217
500.15270.03890.15760.24360.00030.24360.38690.00110.38690.34090.07850.36060.14420.00080.1442
750.12450.02470.12690.19790.00080.19790.30830.00390.30830.28410.05100.29420.11420.00400.1142
−0.251000.10680.01940.10860.17140.00020.17140.26750.00160.26750.24100.03980.24820.09620.00640.0962
2000.07740.00970.07800.11680.00080.11680.18130.00020.18130.17290.01830.17500.06880.00510.0688
250.20990.08590.22670.36460.00750.36480.56330.00330.56330.47990.17910.54990.20340.03350.2035
500.15370.04070.15900.25170.00030.25170.38560.00640.38570.34230.08300.36420.14730.00400.1473
750.12630.02880.12960.19790.00110.19790.30790.00180.30790.28130.05660.29380.11420.00060.1142
0.251000.10940.01940.11110.16960.00320.16970.26440.00310.26440.24490.03990.25210.09840.01270.0984
2000.07570.00920.07620.11850.00010.11850.18580.00110.18580.16890.01940.17140.06830.00080.0683
250.22580.09860.24630.39320.01660.39420.55490.00220.55490.47830.18980.55660.30890.02630.3091
500.17610.04470.18170.27530.00320.27540.3730.00400.37300.33430.08930.36010.21240.00420.2124
750.14200.03170.14550.21310.00310.21310.29500.00210.29500.27790.06300.29350.16950.00160.1694
0.51000.12360.02160.12550.18850.00370.18860.25710.00600.25720.23790.04340.24660.14160.00170.1416
2000.0880.00870.08840.12940.00140.12940.17650.00020.17650.16950.01750.17150.09690.00070.0969
250.24530.10650.26740.39310.04530.4010.51650.00090.51650.50630.13830.54710.57950.00120.5795
500.18090.06130.19100.30070.02300.30340.35070.00210.35070.35030.08030.37040.46790.01080.4680
750.15500.03800.15950.26020.01170.26090.28780.00180.28770.28770.05410.29890.35880.01760.3592
1.01000.13450.03080.13800.23100.00870.23150.24680.00000.24670.25340.04210.26110.28350.00600.2835
2000.09820.01400.09920.16620.00370.16630.17250.00300.17260.18140.02230.18450.19150.00280.1915
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Martínez-Flórez, G.; Barranco-Chamorro, I.; Gómez, H.W. Flexible Log-Linear Birnbaum–Saunders Model. Mathematics 2021, 9, 1188. https://doi.org/10.3390/math9111188

AMA Style

Martínez-Flórez G, Barranco-Chamorro I, Gómez HW. Flexible Log-Linear Birnbaum–Saunders Model. Mathematics. 2021; 9(11):1188. https://doi.org/10.3390/math9111188

Chicago/Turabian Style

Martínez-Flórez, Guillermo, Inmaculada Barranco-Chamorro, and Héctor W. Gómez. 2021. "Flexible Log-Linear Birnbaum–Saunders Model" Mathematics 9, no. 11: 1188. https://doi.org/10.3390/math9111188

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop