Next Article in Journal
Applications of the q-Srivastava-Attiya Operator Involving a Certain Family of Bi-Univalent Functions Associated with the Horadam Polynomials
Next Article in Special Issue
The Uniform Poisson–Ailamujia Distribution: Actuarial Measures and Applications in Biological Science
Previous Article in Journal
Scientometric Analysis for Mechanical Performance of Broken-Line Long-Span Steel Structure in Construction Considering Geometric Nonlinearity
Previous Article in Special Issue
Optimal Sample Size for the Birnbaum–Saunders Distribution under Decision Theory with Symmetric and Asymmetric Loss Functions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Generalized Rayleigh Family of Distributions Based on the Modified Slash Model

by
Inmaculada Barranco-Chamorro
1,*,†,
Yuri A. Iriarte
2,†,
Yolanda M. Gómez
3,†,
Juan M. Astorga
4,† and
Héctor W. Gómez
2,*,†
1
Departamento de Estadística e Investigación Operativa, Facultad de Matemáticas, Universidad de Sevilla, 41012 Sevilla, Spain
2
Departamento de Matemáticas, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile
3
Departamento de Matemáticas, Facultad de Ingeniería, Universidad de Atacama, Copiapó 1530000, Chile
4
Departamento de Tecnologías de la Energía, Facultad Tecnológica, Universidad de Atacama, Copiapó 1530000, Chile
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Symmetry 2021, 13(7), 1226; https://doi.org/10.3390/sym13071226
Submission received: 3 June 2021 / Revised: 2 July 2021 / Accepted: 4 July 2021 / Published: 8 July 2021

Abstract

:
Specifying a proper statistical model to represent asymmetric lifetime data with high kurtosis is an open problem. In this paper, the three-parameter, modified, slashed, generalized Rayleigh family of distributions is proposed. Its structural properties are studied: stochastic representation, probability density function, hazard rate function, moments and estimation of parameters via maximum likelihood methods. As merits of our proposal, we highlight as particular cases a plethora of lifetime models, such as Rayleigh, Maxwell, half-normal and chi-square, among others, which are able to accommodate heavy tails. A simulation study and applications to real data sets are included to illustrate the use of our results.

1. Introduction

Vodua [1] proposed a two-parameter distribution for the analysis of positive data, the well-known generalized Rayleigh distribution. This model overcomes the problems found in practice, which are related to a Rayleigh distribution and pointed out by authors, such as Siddiqui [2] and Hirano et al. [3]. Recall that a random variable (rv) X follows a generalized Rayleigh distribution, denoted as X GR ( θ , α ) , if its probability density function (pdf) is given by the following:
f ( x ; θ , α ) = 2 θ α + 1 Γ ( α + 1 ) x 2 α + 1 e θ x 2 , x > 0 ,
where θ > 0 is a scale parameter, α > 1 is a shape parameter and Γ ( α ) = 0 u α 1 e u d u is the gamma function. The cumulative distribution function (cdf) of the GR distribution can be expressed as follows:
F ( x ; θ , α ) = γ ( α + 1 , θ x 2 ) Γ ( α + 1 ) = P ( α + 1 , θ x 2 ) , x > 0 ,
where γ ( α , x ) = 0 x u α 1 e u d u is the incomplete gamma function and P ( α + 1 , θ x 2 ) denotes the regularized gamma function [4]. The moments of X are easily derived by using the results in [5] (Section 3.478). In fact, the rth distributional moment is as follows:
μ r = E ( X r ) = Γ r 2 + α + 1 Γ ( α + 1 ) θ r / 2 , r = 1 , 2 , 3 , . . .
Following the approach proposed in [6], lifetime models to accommodate outliers were proposed in [7]. We focus on the approach proposed in [8], where an extension of the generalized Rayleigh distribution called the slashed generalized Rayleigh distribution was introduced. This model has a kurtosis coefficient, which exhibits a wider range of values than the kurtosis coefficient in the GR distribution; therefore, it is appropriate to fit the data sets with outliers. Recall that a rv T follows a slashed generalized Rayleigh distribution, denoted as T SGR ( θ , α , q ) , if its stochastic representation is given by the following:
T = X U 1 / q , q > 0 ,
where X GR ( θ , α ) and U U ( 0 , 1 ) are independent. The associated pdf is given by the following:
f T ( t ; θ , α , q ) = q t ( q + 1 ) Γ ( α + 1 ) θ q / 2 Γ 2 α + q + 2 2 F θ t 2 , 2 α + q + 2 2 , 1 , t > 0 ,
where θ > 0 is a scale parameter, α > 1 is a shape parameter, q > 0 is a kurtosis parameter and F ( x , α , β ) = 0 x β α Γ ( α ) t α 1 e β t d t is the cdf of a gamma distribution with shape parameter α > 0 and rate parameter β > 0 . If α = 0 and θ = ( 2 σ ) 1 , the GR distribution reduces to the slashed-Rayleigh distribution. If α = 1 / 2 and θ = ( 2 σ 2 ) 1 , the GR distribution reduces to slashed-half normal distribution [9].
In this work, we propose a modified version of the SGR distribution, given in (4), which can be used as an alternative model to the GR and SGR distributions.
We follow the idea proposed by [10], who introduced an extension of the normal distribution called modified slash (MS) distribution. A rv X follows a modified slash distribution, denoted as X MS ( q ) , if it can be written as follows:
X = Y W 1 / q , q > 0 ,
where Y N ( 0 , 1 ) and W E x p ( 2 ) are independent. If q , then the MS distribution converges in law to a normal distribution.
The paper is organized as follows. In Section 2 the stochastic representation, the pdf and ordinary moments of the proposed model are given. Section 3 is devoted to the family of distributions belonging to the generalized Rayleigh family of distributions based on the modified slash model. Among others, we can find here the modified slashed Rayleigh, half-normal, Maxwell and chi-square distributions. Recall that the unified treatment of a family of distributions is of interest in statistics; see, for instance, [11,12,13]. In Section 4, the estimation of parameters is discussed by considering the moments and maximum likelihood approaches. Similar to [14], their asymptotic normality is established. In addition, we carry out a simulation study to illustrate the performance of the estimators. In Section 5, two applications to real data sets are presented, where we illustrate that the proposed distribution may provide a better fit than the generalized Rayleigh and slashed generalized Rayleigh distributions. Finally, the main conclusions of this paper are given.

2. The New Model

In this section, some structural properties of the new model are given. These are the stochastic representation, pdf, hazard rate function and distributional moments.

2.1. Stochastic Representation

Definition 1.
A random variable T follows a modified slashed generalized Rayleigh distribution, denoted as T MSGR ( θ , α , q ) , if it can be expressed as follows:
T = X 1 2 log ( Z ) 1 / q , q > 0 ,
where X GR ( θ , α ) and Z U ( 0 , 1 ) are independent.
Remark 1.
  • Let T MSGR ( θ , α , q ) . Then, T can be expressed as T = X Y 1 / q , where X GR ( θ , α ) and Y E x p ( 2 ) are independent.
  • If q = 1 then T follows a canonic modified slashed generalized Rayleigh (CMSGR) distribution, and we can write T = 2 X log 1 ( Z ) , where X GR ( θ , α ) and Z U ( 0 , 1 ) are independent. This is denoted as T CMSGR ( θ , α ) .
The stochastic representation given in (6) can be used to generate values of T MSGR ( θ , α , q ) by using the following algorithm:
  • Generate X G R ( θ , α ) .
  • Generate Z U ( 0 , 1 ) .
  • Compute T = X 1 2 log ( Z ) 1 / q .

2.2. Probability Density Function

To deal with the pdfs of generalized slash distributions, it is common to use hypergeometric functions as can be seen in [7,15,16]. In this subsection, we first introduce the special function, which is used in the expression of the pdf for the MSGR distribution, which is given in Proposition 1.
Definition 2.
For γ > 0 , a > 0 , r > 0 and s > 0 , let us consider the following:
I ( γ , a , r , s ) = 0 x γ 1 e a x r s x d x
where I ( · , · , · , · ) is defined as in [17] (Equation 2.3.1.13).
Proposition 1.
Let T MSGR ( θ , α , q ) . Then, the pdf of T is given by the following:
f T ( t ; θ , α , q ) = 2 q θ q 2 Γ ( α + 1 ) t q + 1 I α + q 2 + 1 , 2 θ q 2 t q , q 2 , 1 , t > 0 ,
where θ > 0 is a scale parameter, α > 1 is a shape parameter and q > 0 is a kurtosis parameter.
Proof. 
From (6), and by applying the change of variable method, the pdf of T is given by the following:
f T ( t ; θ , α , q ) = 4 q θ α + 1 Γ ( α + 1 ) t 2 α + 1 0 w 2 α + q + 1 e θ ( t w ) 2 2 w q d w = 2 q θ q 2 Γ ( α + 1 ) t q + 1 0 u α + q 2 e u 2 u θ t 2 q 2 d u .
By considering I ( · , · , · , · ) , the integral introduced in Definition 2, (8) is obtained.  □
Corollary 1.
Let T MSGR ( θ , α , q ) . Then, the cdf of T is as follows:
F T ( t ; θ , α , q ) = 2 q Γ ( α + 1 ) θ q 2 0 t u ( q + 1 ) I α + q 2 + 1 , 2 θ u q , q 2 , 1 d u
where Γ ( · ) is the gamma function and I ( · , · , · , · ) was defined in (7).
As for the shape of the pdf in the MSGR model, we highlight that the pdf can be unimodal or monotonically decreasing. Plots of the pdf for certain values of the parameters are given in Figure 1. In Figure 1a,b, we can see that the parameter q is a kurtosis parameter since it mainly affects the right tail of the distribution. In Figure 1c, we can appreciate that the parameter θ is a scale-type parameter; the smaller the value of θ , the more spread out the distribution [18]. Finally, in Figure 1d, we can see that the parameter α is a shape parameter.

2.3. Reliability Analysis

For non-negative rvs, it is of interest to know the reliability (or survival) function and the hazard rate function [19,20]. The reliability function, R T ( t ) = 1 F T ( t ) , of a MSGR distribution is given by the following:
R T ( t ; θ , α , q ) = 1 2 q Γ ( α + 1 ) θ q 2 0 t u ( q + 1 ) I ( u ) d u ,
where I ( u ) = I θ , α , q ( u ) = I α + q 2 + 1 , 2 θ q 2 u q , q 2 , 1 is defined as in (7).
The hazard rate function, defined by h T ( t ) = f T ( t ) R T ( t ) , is given by the following:
h T ( t ; θ , α , q ) = t ( q + 1 ) I ( t ) Γ ( α + 1 ) θ q 2 2 q 0 t u ( q + 1 ) I ( u ) d u ,
where I ( u ) is as in (9) and I ( t ) = I α + q 2 + 1 , 2 θ q 2 t q , q 2 , 1 , defined in (7).
Figure 2 displays some plots of the hazard rate function of the MSGR distribution for different values of its parameters.
Applications of interest in engineering of the Rayleigh models can be seen in [21,22,23,24] among others. In these applications, it can be of interest to apply our proposal.

2.4. Moments

Next, we proceed to obtain the moments of the MGSR distribution. From these features, important characteristics of the model, such as the mean or expected value, variance, skewness and kurtosis coefficients are obtained. These results will also be the basis to apply the method of moments in Section 4.
Proposition 2.
Let T MSGR ( θ , α , q ) . Then, for r = 1 , 2 , . . . and q > r the rth distributional moment is given by the following:
μ r = E ( T r ) = 2 r q Γ q r q Γ r 2 + α + 1 Γ ( α + 1 ) θ r 2 .
Proof. 
Using the stochastic representation given in Part 1 of Remark 1 and the independence of X and Y, it follows that
μ r = E T r = E X Y 1 q r = E X r Y r q = E X r E Y r q ,
where E Y r q = 2 r q Γ q r q , q > r and E X r = Γ r 2 + α + 1 Γ ( α + 1 ) θ r 2 are distributional moments of the E x p ( 2 ) and GR ( θ , α ) distributions, respectively. □
Corollary 2.
Let T MSGR ( θ , α , q ) . Then, the mean and variance are, respectively, the following:
E ( T ) = 2 1 / q Γ q 1 q Γ α + 3 / 2 Γ ( α + 1 ) θ 1 / 2 , q > 1 ,
and
V a r ( T ) = 2 2 / q Γ 2 ( α + 1 ) θ Γ ( α + 1 ) Γ ( α + 2 ) Γ q 2 q Γ 2 α + 3 / 2 Γ 2 q 1 q , q > 2 .
Corollary 3.
Let T MSGR ( σ , q ) . Then, the asymmetry ( β 1 ) and kurtosis ( β 2 ) coefficients are the following:
β 1 = 1 A 3 / 2 ( α , q ) Γ 2 ( α + 1 ) Γ α + 5 / 2 Γ q 3 q 3 Γ α + 3 / 2 Γ ( α + 2 ) Γ ( α + 1 ) × Γ q 1 q Γ q 2 q + 2 Γ 3 α + 3 / 2 Γ 3 q 1 q , q > 3 ,
and
β 2 = 1 A 2 ( α , q ) Γ 3 ( α + 1 ) Γ ( α + 3 ) Γ q 4 q 4 Γ α + 3 / 2 Γ α + 5 / 2 × Γ 2 ( α + 1 ) Γ q 1 q Γ q 3 q + 6 Γ 2 α + 3 / 2 Γ ( α + 1 ) Γ ( α + 2 ) × Γ 2 q 1 q Γ q 2 q 3 Γ 4 α + 3 / 2 Γ 4 q 1 q , q > 4 ,
where A ( α , q ) = Γ ( α + 1 ) Γ ( α + 2 ) Γ ( q 2 q ) Γ 2 ( α + 3 2 ) Γ 2 ( q 1 q ) .
Remark 2.
Notice that β 1 and β 2 do not depend on θ because it is a scale parameter. Furthermore, as q the β 1 and β 2 coefficients tend, respectively, to the following:
β 1 G R = 1 A 3 / 2 ( α ) Γ 2 ( α + 1 ) Γ ( α + 5 / 2 ) 3 Γ ( α + 3 / 2 ) Γ ( α + 2 ) Γ ( α + 1 ) + 2 Γ 3 ( α + 3 / 2 )
and
β 2 G R = 1 A 2 ( α ) Γ 3 ( α + 1 ) Γ ( α + 3 ) 4 Γ ( α + 3 / 2 ) Γ ( α + 5 / 2 ) Γ 2 ( α + 1 ) + 6 Γ 2 ( α + 3 / 2 ) Γ ( α + 1 ) Γ ( α + 2 ) 3 Γ 4 ( α + 3 / 2 ) ,
where A ( α ) = Γ ( α + 1 ) Γ ( α + 2 ) Γ 2 ( α + 3 / 2 ) , which are the asymmetry and kurtosis coefficients of the GR distribution. Figure 3 depicts plots for the asymmetry and kurtosis coefficients of the MSGR model. In the figure, we observe that the highest values of asymmetry and kurtosis are associated with small values of parameter q and large values of parameter α. In addition, when comparing the asymmetry and kurtosis coefficients of the GR and MSGR distributions, we observe that q has a great impact on the increase in kurtosis, which is why we refer to q as a kurtosis parameter. This point is clearly observed in Figure 4.

3. The MSGR Family of Distributions

In this section, we see that well-known models are associated with the MSGR family of distributions. Some of these results follow directly from (8) under certain reparameterizations, while other ones follow taking the limit when q . Similar to the techniques used in [25], the limit distribution of (1) when q is obtained.
Lemma 1.
Let T q M S G R ( θ , α , q ) . Then, T q converges in distribution to X G R ( θ , α ) when q .
Proof. 
Let us consider the stochastic representation for T q given in Remark 1:
T q = X Y 1 / q
where X G R ( θ , α ) and Y E x p ( 2 ) .
First, we prove the convergence in probability of Y 1 / q to a constant as q . Since Y E x p ( 2 ) and q > 0 , we have the following:
E [ Y 1 / q ] = Γ 1 + 1 q 2 1 / q , V a r [ Y 1 / q ] = Γ 1 + 2 q Γ 2 1 + 2 q 2 2 / q .
By applying the Tchebychev inequality, we have the following:
P Y 1 / q E [ Y 1 / q ] > ϵ V a r [ Y 1 / q ] ϵ 2 ϵ > 0 ,
and since V a r [ Y 1 / q ] 0 as q then
{ Y 1 / q E [ Y 1 / q ] } P 0 .
Similarly, the sequence of real numbers behaves as follows:
E [ Y 1 / q ] = Γ 1 + 1 q 2 1 / q 1 , q .
Therefore,
Y 1 / q = Y 1 / q E [ Y 1 / q ] + E [ Y 1 / q ] P 1 as q .
By applying Slutsky’s theorem ([26] Corollary 2.3.2), we have the following:
T q = X Y 1 / q d X as q
that is, T q converges in distribution to X G R ( θ , α ) distribution. □
Next, we see that well-known models are particular cases of the M S G R ( θ , α , q ) family of distributions. Specifically, for α = 0 and θ = ( 2 σ ) 1 , the M S G R model is reduced to the modified slashed Rayleigh distribution [27]. If α = 1 / 2 and θ = ( 2 σ 2 ) 1 , then the M S G R reduces to the modified slashed half-normal distribution [28]. For α = 1 / 2 and θ = σ / 2 , we get the modified slashed Maxwell distribution. If α = σ / 2 1 and θ = 1 / 2 , we obtain the modified slashed chi-square distribution. Taking the limit as q in previous models, the Rayleigh, half-normal [29], Maxwell [30] and chi-square distribution are obtained [31]. Details about these related models are given in the next corollaries. The relationships among these models are summarized in Figure 5.
Corollary 4
(Modified slashed Rayleigh model). Let T MSGR ( 1 / ( 2 σ ) , 0 , q ) . Then, T follows a modified slashed Rayleigh distribution, MSR ( σ , q ) , whose density function is given by the following:
f T ( t , σ , q ) = 2 q + 2 2 q σ q 2 t q + 1 I q 2 + 1 , 2 q + 2 2 σ q 2 t q , q 2 , 1 , t > 0 ,
σ > 0 is a scale parameter, q > 0 is a kurtosis parameter [27].
As q then T converges in distribution to a Rayleigh distribution whose pdf is the following:
f T ( t ) = t σ e t 2 2 σ , t > 0 .
Corollary 5
(Modified slashed half-normal model). Let T MSGR ( 1 / 2 σ 2 , 1 / 2 , q ) . Then, T follows a modified slashed half-normal distribution, MSHN ( σ , q ) , whose pdf is given by the following:
f T ( t ; σ , q ) = 2 q + 2 2 q σ q π t q + 1 I q + 1 2 , 2 q + 2 2 σ q t q , q 2 , 1 , t > 0 ,
where σ > 0 is a scale parameter and q > 0 is a kurtosis parameter [28].
As q , T converges in distribution to a half-normal distribution whose pdf is the following:
f T ( t ) = 2 σ ϕ t σ , t > 0 ,
where ϕ ( · ) denotes the pdf of a N ( 0 , 1 ) distribution.
Corollary 6
(Modified slashed Maxwell model). Let T MSGR ( σ / 2 , 1 / 2 , q ) . Then, T follows a modified slashed Maxwell distribution, MSM ( σ , q ) , whose pdf is given by the following:
f T ( t ; σ , q ) = 2 σ q 2 4 q π t q + 1 I q + 3 2 , 2 q 2 + 1 σ q 2 t q , q 2 , 1 , t > 0 ,
where σ > 0 is a scale parameter and q > 0 is a kurtosis parameter.
As q , T converges in distribution to a Maxwell distribution whose pdf is the following:
f T ( t ) = 2 π σ 3 / 2 t 2 e σ 2 t 2 , t > 0 .
Corollary 7
(Modified slashed chi-square model). Let T MSGR ( 1 / 2 , σ / 2 1 , q ) . Then, T follows a modified slashed chi-square distribution, MS χ σ , q 2 , whose pdf is given by the following:
f T ( t ; σ , q ) = 2 q 2 + 1 q Γ σ 2 t q + 1 I σ + q 2 , 2 q 2 + 1 t q , q 2 , 1 , t > 0 ,
where σ > 0 is a scale parameter and q > 0 is a kurtosis parameter.
As q , T converges in distribution to a gamma distribution with shape parameter σ / 2 and rate parameter 1 / 2 as follows:
f T ( t ) = 1 2 σ / 2 Γ ( σ / 2 ) t σ / 2 1 e t 2 , t > 0 .
If σ is a positive integer, then we have a chi-square distribution.

4. Parameter Estimation

In this section, we discuss moments and the maximum likelihood estimation for parameters θ , α and q when sampling from the MSGR distribution. First, the moment method is introduced.

4.1. Method of Moments Estimators

Proposition 3.
Let T 1 , , T n be a random sample of size n from the T MSGR ( θ , α , q ) distribution. Then, the method of moments estimates of ( θ , α , q ) for q > 3 are given by the following:
θ ^ = 2 2 / q ^ Γ 2 q ^ 1 q ^ Γ 2 α ^ + 3 2 Γ 2 ( α ^ + 1 ) T ¯ 2 ,
α ^ = 3 2 d 2 d 1 d 1 d 2 ,
T ¯ 2 T 2 ¯ = Γ 2 q ^ 1 q ^ Γ 2 3 2 d 2 d 1 d 1 d 2 + 3 2 Γ q ^ 2 q ^ Γ 3 2 d 2 d 1 d 1 d 2 + 2 Γ 3 2 d 2 d 1 d 1 d 2 + 1 ,
with d 1 = T ¯ 2 T 2 ¯ Γ ( q 1 q ) Γ ( q 2 q ) , d 2 = T ¯ 3 T 3 ¯ Γ ( q 3 q ) and T ¯ k = ( 1 / n ) i = 1 n T i k , k = 1 , 2 , 3 .
Proof. 
First, from Proposition 2 and taking into account the first three equations in the moments method, we have the following:
T ¯ = 2 1 / q Γ ( q 1 q ) Γ ( α + 3 / 2 ) Γ ( α + 1 ) θ 1 / 2 , T 2 ¯ = 2 2 / q Γ ( q 2 q ) Γ ( α + 2 ) Γ ( α + 1 ) θ , T 3 ¯ = 2 3 / q Γ ( q 3 q ) Γ ( α + 5 / 2 ) Γ ( α + 1 ) θ 3 / 2 .
Now, consider the above equations for the following:
  • Solving for θ , the second of the above equations, the result in (12) is obtained.
  • Solving for θ , the first two of the above equations, and matching the resulting expressions, we obtain the following:
    Γ 2 ( q 1 q ) Γ 2 ( α + 3 / 2 ) Γ ( α + 1 ) = T ¯ 2 T 2 ¯ Γ ( q 2 q ) Γ ( α + 2 ) ,
    and solving the first and third of the equations in (15), we have the following:
    Γ 2 ( q 1 q ) Γ 2 ( α + 3 / 2 ) Γ ( α + 1 ) = T ¯ 3 T 3 ¯ Γ 1 ( q 1 q ) Γ ( q 3 q ) ( α + 3 / 2 ) Γ ( α + 1 )
    Equating (16) and (17), and considering an appropriate algebraic manipulation, the result in (13) is obtained.
  • Finally, substituting (13) in (16), by a straightforward calculation, the result in (14) is obtained.

4.2. Maximum Likelihood Estimation

Now, the maximum likelihood (ML) estimates of parameters in the MSGR distribution, based on a random sample of size n, are obtained. Given an observed sample t 1 , , t n from the MSGR ( θ , α , q ) distribution, the log-likelihood function for the parameters θ , α and q, given t = ( t 1 , , t n ) , can be written as the following:
log L ( ϑ ; t i ) = c ( θ , α , q ) ( q + 1 ) i = 1 n log ( t i ) + i = 1 n log I α + q 2 + 1 , 2 θ q 2 t i q , q 2 , 1 ,
where c ( θ , α , q ) = n log ( 2 ) + n log ( q ) n q 2 log ( θ ) n log Γ ( α + 1 ) and ϑ = ( θ , α , q ) . The maximum likelihood equations are given by the following:
n q 2 θ + i = 1 n I 1 ( t i ) I ( t i ) = 0 ,
n Ψ ( α + 1 ) + i = 1 n I 2 ( t i ) I ( t i ) = 0 ,
n q n 2 log ( θ ) i = 1 n log ( t i ) + i = 1 n I 3 ( t i ) I ( t i ) = 0 ,
where Ψ ( · ) is the digamma function,
I ( t i ) = I α + q 2 + 1 , 2 θ q 2 t i q , q 2 , 1 , I 1 ( t i ) = θ I α + q 2 + 1 , 2 θ q 2 t i q , q 2 , 1 , I 2 ( t i ) = α I α + q 2 + 1 , 2 θ q 2 t i q , q 2 , 1 and I 3 ( t i ) = q I α + q 2 + 1 , 2 θ q 2 t i q , q 2 , 1 .
Solutions for Equations (19)–(21) can be obtained by using numerical procedures, such as the Newton–Raphson procedure. The observed information matrix for the parameters θ , α and q is given by the following:
I n ( θ , α , q ) = n q 2 θ 2 + G θ , 1 ( t i ) G α , 1 ( t i ) n 2 θ + G q , 1 ( t i ) G θ , 2 ( t i ) n Ψ 1 ( α + 1 ) + G α , 2 ( t i ) G q , 2 ( t i ) n 2 θ + G θ , 3 ( t i ) G α , 3 ( t i ) n q 2 + G q , 3 ( t i )
where G ν , k ( t i ) = i = 1 n d d ν I k ( t i ) I ( t i ) , ν = θ , α , q , and Ψ 1 is the trigamma function, with I k ( t i ) , k = 1 , 2 , 3 , given in Equations (19)–(21), respectively.
Asymptotic confidence intervals and hypothesis tests for a given parameter in the model can be obtained by using the observed information matrix I n ( γ ) , with γ = ( θ , α , q ) . They can be deduced by assuming that the maximum likelihood estimator has approximately a N 3 ( γ , K γ 1 ) distribution, where K γ = lim n n 1 I n ( γ ) . In practice, the matrix I n ( γ ) , evaluated at γ ^ is used since there is no closed-form expression for the expected information matrix K γ for γ .

4.3. Maximum Likelihood Estimation via the EM Algorithm

The EM algorithm (Dempster et al. [32]) is a useful method for ML estimation in the presence of latent variables. To carry out the estimation process, we introduce latent variables W 1 , . . . , W n through the following hierarchical representation of the MSGR model:
T i | W i = w i G R ( w i 2 θ , α )
W i W e i ( q , 1 / 2 )
where W e i ( q , 1 / 2 ) denotes a Weibull distribution. The joint pdf of ( T i , W i ) is as follows:
f ( t i , w i ) = 2 ( w i 2 θ ) α + 1 Γ ( α + 1 ) t i 2 α + 1 exp { w i 2 θ t i 2 } 2 q w i q 1 exp { 2 w i q } .
The full log-likelihood function is the following:
l c ( ϑ ) = 2 ( α + 1 ) i = 1 n log w i + n ( α + 1 ) log θ + ( 2 α + 1 ) i = 1 n log t i n log Γ ( α + 1 ) θ i = 1 n w i 2 t i 2 + n log ( q ) + ( q 1 ) i = 1 n log w i 2 i = 1 n w i q + c o n s t a n t
with ϑ = ( θ , α , q ) . To apply the EM algorithm, it is necessary to obtain log ( w i ) ^ = E ( log ( w i ) | T i = t i , ϑ = ϑ ^ ) and w i 2 ^ = E ( w i 2 | T i = t i , ϑ = ϑ ^ ) .
Finally, the k-th step of the algorithm is as follows:
Step E: Using θ ^ ( k 1 ) , α ^ ( k 1 ) and q ^ ( k 1 ) , calculate log w i ( k ) ^ and w i 2 ( k ) ^ , i = 1 , , n .
Step ECME I: Using α ^ ( k 1 ) , update θ as follows:
θ ^ ( k ) = n ( α ^ ( k 1 ) + 1 ) i = 1 n w i ^ 2 ( k ) t i 2 .
Step ECME II: Update α as follows:
α ^ ( k ) = Ψ 1 2 i = 1 n log w i ( k ) ^ + log t i + n log θ ^ ( k ) n 1 ,
where Ψ 1 ( · ) denotes the inverse of the digamma function.
Step ECME III: Fix α = α ( k ) and θ = θ ( k ) , update q using the log-likelihood observed as follows:
q ( k ) = a r g m a x q > 0 l ( α ( k ) , θ ( k ) , q ) .
This procedure is repeated until a suitable convergence rule is satisfied. Useful starting values are required to start the recursion.

4.4. Simulation Study

To check the good behavior of ML estimators in the MSGR model, a simulation study is carried out by using R software [33]. In this study, 1000 samples of sample size n = 100 , 200 and 300 were generated from the MSGR model. The aim of this simulation is to study the performance of the ML estimators for the parameters using the proposed procedure.
For each generated sample, the ML estimates for each parameter were computed numerically. The mean value and standard deviation (SD) of the estimates for the parameters ( θ , α , q ) are given as summaries in Table 1. It can be seen there that the ML estimates are quite stable and close to the real values for the sample sizes under consideration. As expected, the bias and standard deviation of our estimates decrease as the sample size n increases.

5. Illustrations

In this section, two applications based on real data sets are given. The aim is to compare the fit provided by the MSGR, SGR and GR models. As one of the referees points out, a validation of the virtues of our proposal would require a large number of successful applications. So, the reader must be cautious. It can be said that our aim is to illustrate the use of these models.

5.1. Active Repair Time Data

The first illustration consists of n = 46 lifetime data (in hours) for an airborne communication transceiver. This data set was previously studied by [34]. Table 2 presents the statistics summaries for these data. These are the sample mean, sample variance ( s 2 ), b 1 and b 2 , which denote the sample asymmetry and kurtosis coefficients, respectively. We highlight the fact that the sample kurtosis coefficient b 2 is higher than expected for a GR distribution. Therefore, it makes sense to fit a MSGR model.
First, the moment estimators under the MSGR model were computed. We obtained the following estimates: θ ^ M = 0.011 , α ^ M = 0.803 and q ^ M = 4.638 . Using the method of moment estimates as the initial values, the maximum likelihood estimates were computed and are presented in Table 3; their standard errors (s.e) are given in parenthesis. For every model, we also give the estimated log-likelihood value.
In order to compare distributions with different numbers of parameters, the Akaike information criterion (AIC) [35] and Bayesian information criterion (BIC) [36] were computed. Recall that A I C = 2 k 2 log lik and B I C = k log n 2 log lik , where k is the number of parameters in the model, n is the sample size and log lik is the maximized value of the log-likelihood function. The Kolmogorov–Smirnov statistic (KS) is considered. In Table 3, the corresponding AIC, BIC and KS statistics are reported for each fitted model. Note that the AIC, BIC and KS statistics values show that the better fit is provided by the MSGR model. Figure 6 depicts the histogram of this data set, including the estimated densities under the MSGR, SGR, and GR models. Finally, the Q–Q plots are given in Figure 7, where we can again see that the MSGR model fits better than the GR and SGR models.

5.2. Remission Time Data

The second data set corresponds to the remission times (in months) of a random sample of n = 128 bladder cancer patients. This data set can be found in [37]. Table 4 presents the statistics summaries for this data set. Notice that sample kurtosis b 2 is quite high.
Computing initially the moment estimators under the MSGR model we have the following estimates: θ ^ M = 0.003 , α ^ M = 0.681 and q ^ M = 4.071 . Using the moment estimators as initial values, the maximum likelihood estimates are computed and presented in Table 5 with the standard errors in parenthesis. Table 5 also reports the log-likelihood estimated value and the corresponding AIC, BIC and KS statistic for each fitted model. Figure 8 depicts the histogram of this data set, including the estimated densities under the MSGR, SGR, GR models. Finally, from the Q–Q plots in Figure 9 we can observe that the MSGR model fits better than the GR and SGR models.

6. Concluding Remarks

In this article, a new generalization of the GR distribution, which is able to model positive data with high kurtosis, is introduced. The proposed distribution can be written as the ratio of two independent random variables. These are a GR distribution and a power of the exponential distribution. We derived some special cases that can be seen as two-parameter extensions of the Rayleigh, half normal, Maxwell and chi square distributions. The parameters were estimated by using moments and maximum likelihood methods. The method of moments estimates were used to start the maximum likelihood estimation through the Newton–Raphson procedure. The asymmetry and kurtosis coefficients were also obtained. The study of these coefficients illustrates the fact that the MSGR model can be used to describe positive right-skewed data with high kurtosis. Finally, the MSGR model was fitted to two real data sets. In both applications, the MSGR distribution presented a better fit than the GR and SGR models, which shows the potential applicability of our proposal. To conclude, we point out that these models can be used not only in reliability studies, but also in the fields of economics, for instance, to describe financial data, where models with heavy tails are very interesting.

Author Contributions

Conceptualization, I.B.-C., Y.A.I. and Y.M.G.; Formal analysis, I.B.-C., Y.A.I., Y.M.G. and J.M.A.; Investigation, I.B.-C., Y.A.I., Y.M.G., J.M.A. and H.W.G.; Methodology, I.B.-C., Y.A.I., Y.M.G. and H.W.G.; Software, Y.A.I. and Y.M.G.; Supervision, I.B.-C., Y.M.G. and H.W.G.; Validation, I.B.-C., Y.A.I. and H.W.G. All of the authors contributed significantly to this research article. All authors have read and agreed to the published version of the manuscript.

Funding

The research of H.W. Gómez was supported by SEMILLERO UA-2021 project, Chile.

Data Availability Statement

The datasets are available in the references given in Section 5.

Acknowledgments

The authors wants to thank the reviewers for their thoughtful comments, which contribute to improve the present paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Vodă, V.G. Inferential procedures on a generalized Rayleigh variate I. Apl. Mat. 1976, 21, 395–412. [Google Scholar]
  2. Siddiqui, M.M. Some problems connected with Rayleigh distributions. J. Res. Natl. Bur. Stand. Sect. D Radio Propag. 1962, 66, 167–174. [Google Scholar] [CrossRef]
  3. Hirano, K. Rayleigh distribution. In Encyclopedia of Statistical Sciences; Kotz, S., Johnson, N.L., Read, C.B., Eds.; Wiley: New York, NY, USA, 1986; pp. 647–649. [Google Scholar]
  4. Abramowitz, M.; Stegun, I.E. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables, 9th ed.; National Bureau of Standars: Washington, DC, USA, 1970.
  5. Gradshteyn, I.S.; Ryzhik, I.M. Table of Integrals, Series, and Products; Academic Press: San Diego, CA, USA, 2000. [Google Scholar]
  6. Gómez, H.W.; Quintana, F.A.; Torres, F.J. A new family of slash-distributions with elliptical contours. Stat. Probab. Lett. 2007, 77, 717–725. [Google Scholar] [CrossRef]
  7. Reyes, J.; Barranco-Chamorro, I.; Gómez, H.W. Generalized modified slash distribution with applications. Commun. Stat. Theory Methods 2020, 49, 2025–2048. [Google Scholar] [CrossRef]
  8. Iriarte, Y.A.; Vilca, F.; Varela, H.; Gómez, H.W. Slashed generalized Rayleigh distribution. Commun. Stat. Theory Methods 2017, 46, 4686–4699. [Google Scholar] [CrossRef]
  9. Olmos, N.M.; Varela, H.; Gómez, H.W.; Bolfarine, H. An extension of the half-normal distribution. Stat. Pap. 2012, 53, 875–886. [Google Scholar] [CrossRef]
  10. Reyes, J.; Gómez, H.W.; Bolfarine, H. Modified slash distribution. Statistics 2013, 47, 929–941. [Google Scholar] [CrossRef]
  11. Stacy, E. A generalization of the gamma distribution. Ann. Math. Stat. 1962, 33, 1187–1192. [Google Scholar] [CrossRef]
  12. Rahman, M.S.; Gupta, R.P. Family of transformed chi-square distributions. Commun. Stat. Theory Methods 1992, 22, 135–146. [Google Scholar] [CrossRef]
  13. Hwang, L.C. Asymptotic Non-Deficiency of Some Procedures for a Particular Exponential Family Under Relative LINEX Loss. Seq. Anal. 2014, 33, 345–359. [Google Scholar] [CrossRef]
  14. Kudryavtsev, A.; Shestakov, O. Asymptotically normal estimators for the parameters of the gamma-exponential distribution. Mathematics 2021, 9, 273. [Google Scholar] [CrossRef]
  15. Reyes, J.; Barranco-Chamorro, I.; Gallardo, D.I.; Gómez, H.W. Generalized Modified Slash Birnbaum—Saunders Distribution. Symmetry 2018, 10, 724. [Google Scholar] [CrossRef] [Green Version]
  16. Zörnig, P. On Generalized Slash Distributions: Representation by Hypergeometric Functions. Stats 2019, 2, 26. [Google Scholar] [CrossRef] [Green Version]
  17. Prudnikov, A.; Brychkov, Y.A.; Marichev, O.I. Integrals and Series, Vols. 1, 2 and 3; Gordon and Breach Science Publishers: Amsterdam, The Netherlands, 1986. [Google Scholar]
  18. Casella, G.; Berger, R.L. Statistical Inference, 2nd ed.; Duxbury Advanced Series; Duxbury-Thomson Learning: Pacific Grove, CA, USA, 2002. [Google Scholar]
  19. Polovko, A. Fundamentals of Reliability Theory; Academic Press: San Diego, CA, USA, 1968. [Google Scholar]
  20. Lawless, J.F. Statistical Models and Methods for Lifetime Data, 2nd ed.; John Wiley & Sons: Hoboken, NJ, USA, 2003. [Google Scholar]
  21. Sarti, A.; Corsi, C.; Mazzini, E.; Lamberti, C. Maximum likelihood segmentation of ultrasound images with Rayleigh distribution. IEEE Trans. Ultrason. Ferroelectr. Freq. Control 2005, 52, 947–960. [Google Scholar] [CrossRef] [Green Version]
  22. Kalaiselvi, S.; Loganathan, A.; Vijayaraghavan, R. Bayesian Reliability Sampling Plans under the Conditions of Rayleigh-Inverse-Rayleigh Distribution. Econ. Qual. Control 2014, 29, 29–38. [Google Scholar] [CrossRef]
  23. Dhaundiyal, A.; Singh, S. Approximations to the Non-Isothermal Distributed Activation Energy Model for Biomass Pyrolysis Using the Rayleigh Distribution. Cta Technol. Agric. 2017, 20, 78–84. [Google Scholar] [CrossRef] [Green Version]
  24. Kuruoglu, E.; Zerubia, J. Modeling SAR Images with a Generalization of the Rayleigh Distribution. IEEE Trans. Image Process. 2004, 13, 527–533. [Google Scholar] [CrossRef]
  25. Barranco-Chamorro, I.; Moreno-Rebollo, J.L.; Pascual-Acosta, A.; Enguix-González, A. An Overview of Asymptotic Properties of Estimators in Truncated Distributions. Commun. Stat. Theory Methods 2007, 36, 2351–2366. [Google Scholar] [CrossRef]
  26. Lehmann, E. Elements of Large-Sample Theory; Springer: New York, NY, USA, 1999. [Google Scholar]
  27. Iriarte, Y.A.; Castillo, N.O.; Bolfarine, H.; Gómez, H.W. Modified slashed-Rayleigh distribution. Commun. Stat. Theory Methods 2018, 47, 3220–3233. [Google Scholar] [CrossRef]
  28. Olmos, N.M.; Venegas, O.; Gómez, Y.M.; Iriarte, Y.A. An Asymmetric Distribution with Heavy Tails and Its Expectation-Maximization (EM) Algorithm Implementation. Symmetry 2019, 11, 1150. [Google Scholar] [CrossRef] [Green Version]
  29. Hogg, R.; Tanis, E. Probability and Statistical Inference; MacMillan Publishing: New York, NY, USA, 1993. [Google Scholar]
  30. Bekker, A.; Roux, J. Reliability characteristics of the Maxwell distribution: A Bayes estimation study. Commun. Stat. Theory Methods 2005, 34, 2169–2178. [Google Scholar] [CrossRef]
  31. Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; John Wiley & Sons Inc.: New York, NY, USA, 1995; Volume 2. [Google Scholar]
  32. Dempster, A.; Laird, N.; Rubim, D. Maximum likelihood from incomplete data via the EM algorithm (with discussion). J. R. Stat. Soc. Ser. B 1977, 39, 1–38. [Google Scholar]
  33. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021. [Google Scholar]
  34. Chhikara, R.; Folks, J. The inverse Gaussian distribution as a lifetime model. Technometrics 1977, 19, 461–468. [Google Scholar] [CrossRef]
  35. Akaike, H. A new look at the statistical model identification. IEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
  36. Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  37. Lee, E.T.; Wang, J. Statistical Methods for Survival Data Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2003. [Google Scholar]
Figure 1. Plots of the MSGR pdf for certain values of its parameters.
Figure 1. Plots of the MSGR pdf for certain values of its parameters.
Symmetry 13 01226 g001
Figure 2. Plot of the hazard rate function for the MSGR distribution for different values of its parameters.
Figure 2. Plot of the hazard rate function for the MSGR distribution for different values of its parameters.
Symmetry 13 01226 g002
Figure 3. Plots of the asymmetry and kurtosis coefficients for the MSGR model.
Figure 3. Plots of the asymmetry and kurtosis coefficients for the MSGR model.
Symmetry 13 01226 g003
Figure 4. Plots of the asymmetry and kurtosis coefficients for the GR and MSGR models.
Figure 4. Plots of the asymmetry and kurtosis coefficients for the GR and MSGR models.
Symmetry 13 01226 g004
Figure 5. Relationships among distributions in the MSGR family.
Figure 5. Relationships among distributions in the MSGR family.
Symmetry 13 01226 g005
Figure 6. Fitted models by ML method for active repair time data set: MSGR (solid line), SGR (dashed line), GR (dotted line).
Figure 6. Fitted models by ML method for active repair time data set: MSGR (solid line), SGR (dashed line), GR (dotted line).
Symmetry 13 01226 g006
Figure 7. QQ-plots: (a) MSGR model, (b) SGR model, (c) GR model.
Figure 7. QQ-plots: (a) MSGR model, (b) SGR model, (c) GR model.
Symmetry 13 01226 g007
Figure 8. Fitted models by ML method for remission time data set: MSGR (solid line), SGR (dashed line), GR (dotted line).
Figure 8. Fitted models by ML method for remission time data set: MSGR (solid line), SGR (dashed line), GR (dotted line).
Symmetry 13 01226 g008
Figure 9. QQ-plots: (a) MSGR model, (b) SGR model, (c) GR model.
Figure 9. QQ-plots: (a) MSGR model, (b) SGR model, (c) GR model.
Symmetry 13 01226 g009
Table 1. Empirical means and SD for the ML estimators of θ , α and q.
Table 1. Empirical means and SD for the ML estimators of θ , α and q.
θ α q θ ^ (SD) α ^ (SD) q ^ (SD)
n = 50
1−0.521.249 (0.897)−0.459 (0.233)2.586 (1.101)
1−0.531.183 (0.674)−0.435 (0.171)3.442 (1.615)
1−0.541.117 (0.543)−0.468 (0.132)4.275 (1.568)
1−0.531.172 (0.830)0.597 (0.517)3.137 (1.471)
10.031.182 (0.842)0.099 (0.274)3.116 (0.977)
13.031.245 (0.897)3.208 (1.811)3.258 (0.793)
12.031.296 (1.098)2.269 (1.607)3.369 (1.816)
22.032.433 (1.659)2.240 (1.356)3.197 (0.918)
32.033.212 (2.365)2.084 (1.206)3.247 (0.861)
n = 100
1−0.521.223 (0.888)−0.472 (0.105)2.151 (0.662)
1−0.531.137 (0.551)−0.478 (0.082)3.225 (1.021)
1−0.541.093 (0.497)−0.481 (0.074)4.095 (1.261)
1−0.531.096 (0.498)0.591 (0.401)3.135 (0.666)
10.031.131 (0.539)0.057 (0.252)3.064 (0.672)
13.031.239 (0.821)2.828 (0.998)3.138 (0.555)
12.031.276 (0.821)2.250 (1.115)3.117 (0.566)
22.032.217 (0.957)2.237 (0.990)3.107 (0.598)
32.032.848 (0.923)2.068 (0.841)3.135 (0.617)
n = 200
1−0.521.127 (0.562)−0.475 (0.096)2.075 (0.451)
1−0.531.072 (0.467)−0.497 (0.062)3.018 (0.736)
1−0.541.016 (0.387)−0.501 (0.055)4.051 (0.742)
1−0.531.044 (0.388)0.504 (0.301)3.070 (0.598)
10.031.010 (0.376)0.010 (0.168)3.046 (0.585)
13.030.994 (0.443)2.944 (0.890)3.080 (0.481)
12.031.144 (0.643)2.156 (0.514)3.078 (0.462)
22.032.036 (0.792)2.015 (0.764)3.081 (0.498)
32.033.087 (0.746)2.061 (0.680)3.092 (0.511)
Table 2. Descriptive statistics for the active repair time data.
Table 2. Descriptive statistics for the active repair time data.
n X ¯ s 2 b 1 b 2
463.60624.4452.79411.294
Table 3. Maximum likelihood estimates, log-likelihood estimated value and respective AIC, BIC, KS statistics for GR, SGR and MSGR models.
Table 3. Maximum likelihood estimates, log-likelihood estimated value and respective AIC, BIC, KS statistics for GR, SGR and MSGR models.
ParmetersGR (s.e)SGR (s.e)MSGR (s.e)
θ ^ 0.008 (0.002)2.121 (2.316)2.772 (0.132)
α ^ −0.696 (0.049)0.241 (0.640)0.731 (0.091)
q ^ -0.859 (0.202)1.182 (0.080)
Log-likelihood111.041101.888100.292
AIC226.083209.776206.585
BIC229.740215.262212.071
KS0.1730.1520.108
Table 4. Descriptive statistics of the remission time data.
Table 4. Descriptive statistics of the remission time data.
n X ¯ s 2 b 1 b 2
1289.365110.4253.28618.483
Table 5. Maximum likelihood estimates, log-likelihood estimated value and respective AIC, BIC, KS statistics for GR, SGR and MSGR models.
Table 5. Maximum likelihood estimates, log-likelihood estimated value and respective AIC, BIC, KS statistics for GR, SGR and MSGR models.
ParametersGR (s.e)SGR (s.e)MSGR (s.e)
θ ^ 0.002 (0.0002)0.019 (0.007)0.023 (0.009)
α ^ −0.610 (0.035)−0.390 (0.086)−0.355 (0.100)
q ^ -1.839 (0.381)2.213 (0.402)
Log-likelihood−426.824−410.183−409.695
AIC859.648826.366825.391
BIC868.204834.922833.947
KS0.1560.0700.062
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Barranco-Chamorro, I.; Iriarte, Y.A.; Gómez, Y.M.; Astorga, J.M.; Gómez, H.W. A Generalized Rayleigh Family of Distributions Based on the Modified Slash Model. Symmetry 2021, 13, 1226. https://doi.org/10.3390/sym13071226

AMA Style

Barranco-Chamorro I, Iriarte YA, Gómez YM, Astorga JM, Gómez HW. A Generalized Rayleigh Family of Distributions Based on the Modified Slash Model. Symmetry. 2021; 13(7):1226. https://doi.org/10.3390/sym13071226

Chicago/Turabian Style

Barranco-Chamorro, Inmaculada, Yuri A. Iriarte, Yolanda M. Gómez, Juan M. Astorga, and Héctor W. Gómez. 2021. "A Generalized Rayleigh Family of Distributions Based on the Modified Slash Model" Symmetry 13, no. 7: 1226. https://doi.org/10.3390/sym13071226

APA Style

Barranco-Chamorro, I., Iriarte, Y. A., Gómez, Y. M., Astorga, J. M., & Gómez, H. W. (2021). A Generalized Rayleigh Family of Distributions Based on the Modified Slash Model. Symmetry, 13(7), 1226. https://doi.org/10.3390/sym13071226

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop