Abstract
This paper focuses on studying a truncated positive version of the power-normal (PN) model considered in Durrans (1992). The truncation point is considered to be zero so that the resulting model is an extension of the half normal distribution. Some probabilistic properties are studied for the proposed model along with maximum likelihood and moments estimation. The model is fitted to two real datasets and compared with alternative models for positive data. Results indicate good performance of the proposed model.
1. Introduction
Lehmann [1] proposed a class of asymmetric distributions. The cumulative distribution function (cdf) for such class is given by:
where F is in itself a cumulative distribution function and , with the set of rational numbers. In the special case where is an integer number, the above cdf corresponds to the distribution of the maximum in a sample of size .
Durrans [2] gives an interpretation for (1) in the more general case based on fractional order statistics. Assume F is an absolutely continuous function and f denotes its respective probability density function (pdf), i.e., . The pdf related to (1) is:
Henceforth, we refer to a random variable with pdf as in (2) as the power distribution (), and we use the notation . The particular case where , the cdf of the standard normal model, was approached in [2]. In such a case, the respective pdf of the model is reduced to:
where is the standard normal pdf. The authors used the term generalized Gaussian distribution to refer the model in Equation (3). This model also was studied with more detail by [3]. Pewsey et al. [4] call Model (3) the power-normal (PN) model, denoting , and show that its Fisher information matrix (FIM) for the location-scale extension is nonsingular for (i.e., the symmetric case).
The generalization of the normal distribution in (3) also is a particular case of the Beta-normal model discussed in [5].
On the other hand, the random variable X follows a half-normal distribution with scale parameter if its pdf is given by:
for . We denote . Cooray and Ananda [6] extended the half-normal (HN) model by introducing the generalized half-normal (GHN) model, that is X is a random variable with the GHN distribution with scale parameter and shape parameter , if its pdf is given by:
We use the notation . Observe that , that is one obtains the half-normal model with scale parameter .
Some properties of the GHN distribution are:
- , for
where is the cdf of X and is the gamma function. The proofs of those properties are presented in [6]. Recent extensions of the HN model are considered in [7,8], among others.
The recent literature has experienced a growth in the theory and applications of the continuous truncated models. Among others, we refer the reader to [9,10,11,12,13,14,15,16].
The main focus of this paper is to study the positive truncation for the model considered in (3), where the normalizing constant for the pdf (3) is to be determined, and the resulting model is an extension of the half-normal distribution. That is, we generate a more flexible extension of the half-normal distribution that we call the truncated positive power-normal (TPN) distribution, where the asymmetry parameter is a shape parameter. Given its flexibility, the model is quite useful for fitting positive data related to survival analysis and reliability.
The paper is organized as follows. In Section 2, we present the TPN distribution. Some basic properties such as the quantile function, the risk function and some moments are considered, and Shannon entropy is studied. In Section 3, we discuss some inferential aspects such as the log-likelihood function and its maximization, the corresponding Fisher information matrix (FIM) and the method of moments estimation. Section 4 deals with an extension of the TPN model and presents results for a small-scale simulation study, indicating good parameter recovery. Results of using the proposed model in two real applications are reported in Section 5. The main conclusion is that the TPN model can be a viable alternative for adjusting positive data.
2. The Truncated Positive PN Distribution
In this section, we present the pdf of the TPN model, some of its basic properties, moments and asymmetry and kurtosis coefficients.
2.1. The Probability Density Function
Proposition 1.
A random variable Z has a TPN distribution and is denoted as with parameters σ and α, if its pdf is given by:
Proof.
Under the assumption that , the pdf for the model TPN follows after computing the conditional distribution of , concluding the proof. ☐
Remark 1.
For , the TPN model admits the following stochastic representation. If are independent and identically distributed (iid) random variables with common distribution , then:
Its distribution function is given by:
For and varying , Figure 1 depicts examples of the pdf for model TPN.
Figure 1.
Probability density function of for different values of .
2.2. Properties
2.2.1. Quantile Function
Simple algebraic manipulations yield:
for a probability . The quartiles are, consequently:
- First quartile =
- Median(Z) =
- Third quartile =
2.2.2. Hazard Rate Function
The hazard rate function for the random variable is given by:
Remark 2.
- (i)
- If , then is the hazard function for the half-normal model .
- (ii)
- , is monotonically increasing with .
- (iii)
- , , as .
For and varying , Figure 2 depicts examples of the hazard rate function for model TPN.
Figure 2.
Hazard rate function of the model and different values for .
2.3. Moments
Proposition 2.
If , then the r-th moment of Z is:
where has to be computed numerically.
Proof.
Making the variable change, , we obtain:
☐
Corollary 1.
Therefore, the first four moments are given by:
- (a)
- (b)
- (c)
- (d)
Corollary 2.
Asymmetry and kurtosis coefficients are given, respectively, by:
and:
Remark 3.
If , the asymmetry and kurtosis coefficients take the values 0.99527 and 3.86918, respectively, which correspond to those for the classical HN distribution. Figure 3 depict plots for the asymmetry and kurtosis coefficients, respectively, of the HN and TPN distribution.
Figure 3.
Asymmetry (left) and kurtosis (right) coefficients for (solid line) and half normal (HN) (, dotted line).
2.4. Shannon Entropy
Shannon entropy (see [17]) measures the amount of uncertainty for a random variable Z. It is defined as:
Therefore, it can be verified that the Shannon entropy for the TPN model is:
Figure 4 shows the Shannon entropy for the TPN model fixing . Note that for a fixed , is maximized at .
Figure 4.
Shannon entropy of for different values of . The dashed line corresponds to the Shannon entropy for the standard normal model.
Remark 4.
- (i)
- From Figure 4 and for a fixed σ, we conclude that , , where denotes the Shannon entropy for the distribution.
- (ii)
- For , it follows that , and agrees with the entropy for the half-normal distribution (see [18]), which is given by:
2.5. Rényi Entropy
A generalization of the Shannon entropy is the Rényi entropy, which is defined as:
Routine calculations show that for the TPN model:
Remark 5.
For , the is reduced to:
where is the normalization constant in the Balakrishnan skew-normal distribution ([19,20]). In the last two references, the following facts are shown:
- (a)
- .
- (b)
- .
- (c)
- .
- (d)
- , , for .
- (e)
- For , there is no closed form expression for . However, approximated values are provided in Table 1 from [21].
2.6. Kullback–Leibler Divergence for HN and TPN Models
The Kullback–Leibler divergence () is a measure of how one pdf (say ) diverges from a second (say ) pdf. For this reason, it can be used as a measure to decide between two alternative models for a particular dataset. As the HN model is a particular case of the TPN model (for ), we compute the Kullback–Leibler from the and models, which can be shown to be given by:
Remark 6.
As expected, , if and .
3. Inference
In this section, we discuss moments and maximum likelihood estimation (MLE) and FIM and present a simulation study to investigate parameter recovery.
3.1. Moments Estimation
Solving for in Equation (a) from Corollary 1 and replacing for , it follows that:
Thus, replacing , given in Equation (8), and the second sample moment in Equation (b) from Corollary 1, it follows that:
3.2. The Log-Likelihood Function
For a random sample from the distribution , the log likelihood function can be written as:
so that the likelihood equations are given by:
The solution for Equations (10)–(11) can be obtained by using the function optim available in [22], and the specific method is the L-BFGS-B developed by [23], which allows constrained optimization, which uses a limited-memory modification of the quasi-Newton method.
Fisher Information Matrix
Let random variable . For a single observation z, the log-likelihood function for is:
The first derivatives of are:
The second derivatives of are:
It can be shown that the FIM for the TPN distribution is given by:
with the following elements:
where , for , must be computed numerically.
3.3. Truncation at c
As the following result indicates, the truncation point for the distribution TPN can be located at any . We denote this extension by .
Proposition 3.
A random variable , if its pdf is given by:
where and denote the pdf and cdf of the standard normal distribution, respectively. We use the notation .
Proof.
Under the assumption that , the pdf for the model TPN arises after computing the conditional distribution of , concluding the proof. ☐
4. Simulation Study
In this section, we present a brief simulation study in order to assess the performance of the MLEs of the TPN model in finite samples. To simulate from the distribution, it is sufficient to simulate from the PN distribution, accepting only those values greater than c. The simulation algorithm is then:
- Simulate , and compute .
- If , make . Otherwise, go to the previous step.
The acceptance ratio is then . Hereafter, c is considered known and taking values of 0, 0.5 and 1.0. Likewise, for and were chosen three values, and the generated samples were of sizes , , and . For each combination of sample size and parameter values, 1000 samples were generated and MLEs were computed. Table 1 and Table 2 summarize the mean of the estimated parameters (mean), the mean of the estimated standard deviations (s.d.) and the root of the mean squared error (). Note that a small sample size (say and ) presents a moderate bias for both parameters, which are decreasing for n increasing. Additionally, the s.d.’s are closer to , especially when n is increased, suggesting that the s.d. are well estimated even in small sample sizes.
Table 1.
Mean of the estimated parameters (mean), mean of the estimated standard deviations (s.d.) and root of the mean squared error () for MLEs of the model (cases and ).
Table 2.
Mean of the estimated parameters (mean), mean of the estimated standard deviations (s.d.) and root of the mean squared error () for MLEs of the model (cases and ).
Examples
Figure 5 depicts the model fitting for some simulated samples of size and truncations at and at .
Figure 5.
Examples of the estimation of the with their corresponding estimates. Left panel: , and . Right panel: , and .
5. Real Data Illustration
In this section, we present two applications to illustrate the performance of the TPN model compared with other usual distributions in the literature, such as the Weibull, gamma, GHN, Birnbaum–Saunders (BS, [24,25]), -Birnbaum–Saunders (-BS, [26]), epsilon half-normal (EHN, [7]), power half-normal (PHN, [27]) and truncated positive normal (TN, [28]) models. Model comparison is implemented by using the AIC ([29]).
5.1. Australian Athletes
This dataset consists of several variables recorded on 202 Australian athletes and reported in [30]. Concretely, we analyze here measurements of the body mass index (BMI). Table 3 presents basic descriptive statistics for the dataset. We use the notation and to represent sample asymmetry and kurtosis coefficients, respectively.
Table 3.
Descriptive statistics.
Using results from Section 3.1, moment estimators were computed leading to the following values: and , which were used as initial estimates for the maximum likelihood (ML) approach. In this case, we fixed two values for the TPN model, namely and (a value close to the sample minimum).
Table 4 depicts parameters’ estimates by maximum likelihood using the bbmle function in [22]. The standard errors of the MLE are calculated using the information matrix of each model. For each, we report the estimated log-likelihood function and the corresponding AIC. It can be noted that the AIC scores indicate better fit of the TPN model. On the other hand, results for and are similar. Therefore, we chose the standard model with . In Figure 6, the estimated densities of the models using the ML estimates are shown with the data histogram. This also indicates good fit for the TPM model. Finally, Figure 7 shows the q-q plots for the TPN model and the other considered models. Note that TPN is a more appropriate model than Weibull, gamma, GHN and TN for this dataset because the sample quantiles are closer to the respective theoretical quantiles. Excepting the TPN distribution, all the other models present serious difficulties in accommodating the right tail of the data. Finally, the estimated skewness and kurtosis coefficients for the TPN model consider that the MLEs are 0.694 and 3.731. The 95% confidence intervals (CI) for those coefficients estimated via bootstrap (based on 10,000 bootstrap samples) are given by (0.167; 1.420) and (2.411; 6.841), respectively. Note that the sample versions of both coefficients are contained in the estimated CI.
Table 4.
Parameter estimates (with their respective standard deviations in parenthesis) and AIC values for Weibull, Gamma, generalized half-normal (GHN) and TPN models.
Figure 6.
Histogram for the BMI dataset, with lines representing adjusted distributions using MLE for different models.
Figure 7.
q-q plots: TPN model (left), gamma model (center) and TN model (right).
5.2. Breaking Stress of Carbon Fibers
This dataset is considered in [31] and corresponds to breaking stress of carbon fiber (BSFC) measures in Gba. Cordeiro and Lemonte [26] already analyzed these data comparing the BS and -BS models. Additionally, we also compared those models with the EHN and PHN distributions. Table 5 presents basic descriptive statistics for the dataset. Note that for this dataset, the sample minimum is close to zero. Therefore, in this case, it seems reasonable to consider .
Table 5.
Descriptive statistics. BSFC, breaking stress of carbon fiber.
We also computed the moment estimators, resulting in and , which were used as initial estimates for the maximum likelihood approach.
Table 6 shows the MLEs. It can be noted that AIC shows a better fit of the TPN model. In Figure 8, the ML setting of models is shown with the probability histogram. Finally, from the q-q plots in Figure 9, we have that the TPN model fits the data better than the other models considered.
Table 6.
Parameter estimates (with their respective standard deviations in parenthesis) and AIC values for the Birnbaum–Saunders (BS), -BS, epsilon half-normal (EHN) and TPN models.
Figure 8.
Histogram for the BSFC dataset, with lines representing adjusted distributions using MLE using different models.
Figure 9.
q-q plots: TPN model (left), PHN model (center) and -BS model (right).
6. Discussion
The main focus of this paper is studying a truncated positive version of the PN model, obtaining a new extension of the HN model. This model involves two parameters and is an alternative to other positive models. Maximum likelihood estimation is conducted for parameter estimation, and results of a simulation study indicate that it has good properties for small and moderate sample sizes, as well as applications to real data, indicating that it can outperform competing distributions. A simulation study was implemented using the acceptance rejection method for some truncation values, and the results were satisfactory.
Author Contributions
Nabor Castillo and Héctor W. Gómez conceived the model and study some properties; Diego I. Gallardo performed computational programs developed in simulation studies and applications; Heleno Bolfarine included some properties and translated the paper to english. All authors have read and approved the final manuscript.
Acknowledgments
We thank the four anonymous reviewers for their comments and suggestions helping us to improve the article. The research of N.O. Castillo was supported by DIULS REGULAR PR12151 (Chile). The research of H. Bolfarine was supported by CNPq-Brasil. The research of H.W. Gómez was supported by SEMILLERO UA-2015 (Chile).
Conflicts of Interest
The authors declare no conflict of interest.
References
- Lehmann, E.L. The power of rank tests. Ann. Math. Stat. 1953, 24, 23–43. [Google Scholar] [CrossRef]
- Durrans, S.R. Distributions of fractional order statistics in hydrology. Water Resour. Res. 1992, 28, 1649–1655. [Google Scholar] [CrossRef]
- Gupta, D.; Gupta, R.C. Analyzing skewed data by power normal model. Test 2008, 17, 197–210. [Google Scholar] [CrossRef]
- Pewsey, A.; Gómez, H.W.; Bolfarine, H. Likelihood-based inference for power distributions. Test 2012, 21, 775–789. [Google Scholar] [CrossRef]
- Eugene, N.; Lee, C.; Famoye, F. Beta-normal distribution and its applications. Commun. Stat. Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
- Cooray, K.; Ananda, M.M.A. A Generalization of the Half-Normal Distribution with Applications to Lifetime Data. Commun. Stat. Theory Methods 2008, 37, 1323–1337. [Google Scholar] [CrossRef]
- Castro, L.M.; Gómez, H.W.; Valenzuela, M. Epsilon half-normal model: Properties and inference. Comput. Stat. Data Anal. 2012, 56, 4338–4347. [Google Scholar] [CrossRef]
- Olmos, N.M.; Varela, H.; Gómez, H.W.; Bolfarine, H. An extension of the half-normal distribution. Stat. Pap. 2012, 53, 875–886. [Google Scholar] [CrossRef]
- Arnold, B.C. Flexible univariate and multivariate models based on hidden truncation. J. Stat. Plan. Inference 2009, 139, 3741–3749. [Google Scholar] [CrossRef]
- Barranco-Chamorro, I.; Moreno-Rebollo, J.L.; Pascual-Acosta, A.; Enguix-Gonzalez, A. An overview of asymptotic properties of estimator in truncated distributions. Commun. Stat. Theory Methods 2007, 36, 2351–2366. [Google Scholar] [CrossRef]
- Barr, D.R.; Sherrill, E.T. Mean and variance of truncated normal distributions. Am. Stat. 1999, 53, 357–361. [Google Scholar]
- Bebu, I.; Mathew, T. Confidence intervals for limited moments and truncated moments in normal and lognormal models. Stat. Probab. Lett. 2009, 79, 375–380. [Google Scholar] [CrossRef]
- Chopin, N. Fast simulation of truncated Gaussian distributions. Stat. Comput. 2011, 21, 275–288. [Google Scholar] [CrossRef]
- Damien, P.; Walker, S.G. Sampling truncated normal, beta, and gamma densities. J. Comput. Graph. Stat. 2001, 10, 206–215. [Google Scholar] [CrossRef]
- Jonhson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; Wiley: New York, NY, USA, 1995; Volume 1. [Google Scholar]
- Robert, C.P. Simulation of truncated normal variables. Stat. Comput. 1995, 5, 121–125. [Google Scholar] [CrossRef]
- Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
- Ahsanullah, M.; Golam Kibria, B.M.; Shakil, M. Normal and Student’s t Distributions and Their Applications; Atlantis Press: Paris, France, 2014. [Google Scholar]
- Arnold, B.C.; Beaver, R.J. Skewed multivariate models related to hidden truncation and/or selective reporting (with discussion). Test 2002, 11, 7–54. [Google Scholar] [CrossRef]
- Sharafi, M.; Behboodian, J. The Balakrishnan skew-normal pdf. Stat. Pap. 2008, 49, 769–778. [Google Scholar] [CrossRef]
- Steck, G.P. Orthant probability for the equicorrelated multivariate normal distribution. Biometrika 1962, 49, 433–445. [Google Scholar] [CrossRef]
- R Development Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2017; ISBN 3-900051-07-0. [Google Scholar]
- Byrd, R.H.; Lu, P.; Nocedal, J.; Zhu, C. A limited memory algorithm for bound constrained optimization. SIAM J. Sci. Comput. 1995, 16, 1190–1208. [Google Scholar] [CrossRef]
- Birnbaum, Z.W.; Saunders, S.C. A new family of life distributions. J. Appl. Probab. 1969, 6, 319–327. [Google Scholar] [CrossRef]
- Birnbaum, Z.W.; Saunders, S.C. Estimation for a family of life distributions with applications to fatigue. J. Appl. Probab. 1969, 6, 328–377. [Google Scholar] [CrossRef]
- Cordeiro, G.M.; Lemonte, A.J. The β-Birnbaum-Saunders distribution: An improved distribution for fatigue life modeling. Comput. Stat. Data Anal. 2011, 55, 1445–1461. [Google Scholar] [CrossRef]
- Gómez, Y.M.; Bolfarine, H. Likelihood-based inference for the power half-normal distribution. J. Stat. Theory Appl. 2015, 14, 383–398. [Google Scholar] [CrossRef]
- Gómez, H.J.; Olmos, N.M.; Varela, H.; Bolfarine, H. Truncated positive normal distribution. Appl. Math. J. Chin. Univ. 2018, 33, 163–176. [Google Scholar]
- Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
- Cook, R.D.; Weisberg, S. An Introduction to Regression Graphics; JohnWiley & Sons Inc.: New York, NY, USA, 1994. [Google Scholar]
- Nichols, M.D.; Padgett, W.J. A bootstrap control chart for Weibull percentiles. Qual. Reliab. Eng. Int. 2006, 22, 141–151. [Google Scholar] [CrossRef]
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).