Next Article in Journal
New Hybrid EC-Promethee Method with Multiple Iterations of Random Weight Ranges: Applied to the Choice of Policing Strategies
Previous Article in Journal
The Metrization Problem in [0,1]-Topology
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Family of Truncated Positive Distributions

by
Héctor J. Gómez
1,
Karol I. Santoro
2,
Inmaculada Barranco-Chamorro
3,
Osvaldo Venegas
1,*,
Diego I. Gallardo
4 and
Héctor W. Gómez
2
1
Departamento de Ciencias Matemáticas y Físicas, Facultad de Ingeniería, Universidad Católica de Temuco, Temuco 4780000, Chile
2
Departamento de Estadística y Ciencias de Datos, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile
3
Departamento de Estadística e Investigación Operativa, Facultad de Matemáticas, Universidad de Sevilla, 41012 Sevilla, Spain
4
Departamento de Estadística, Facultad de Ciencias, Universidad del Bío-Bío, Concepción 4081112, Chile
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(21), 4431; https://doi.org/10.3390/math11214431
Submission received: 21 September 2023 / Revised: 19 October 2023 / Accepted: 20 October 2023 / Published: 26 October 2023
(This article belongs to the Section D1: Probability and Statistics)

Abstract

:
In this paper, a new family of continuous distributions with positive support is introduced. This family is generated by a truncation of the family of univariate symmetrical distributions. In this new family of distributions, general properties, such as moments, asymmetry and kurtosis coefficients, are derived. Particular cases of interest based on the normal, logistic, Laplace and Cauchy models are discussed in depth. The estimation of parameters is carried out by applying moments and maximum likelihood methods. Also, a simulation study was conducted to illustrate the good performance of estimators. An application to the Survival Times (in days) of Guinea Pigs dataset is included, where the special cases of distributions in this family are fitted. The option which provides the best fit is ultimately chosen. An R package, called “tpn”, has been implemented, which includes the relevant cases of interest in this family.

1. Introduction

In modeling continuous real data, we frequently encounter real data that assume values only in ( 0 , ) ; models with positive support must therefore be used with these data. In this context, the traditional distributions used in the literature are the gamma, Weibull, log-normal and half-normal models, among others. Models of these types are readily applicable in areas associated with medicine, climatology, the environment, and marine sciences, to name just a few. In the more general context of distributions, we can cite the family of spherical distributions, which is a particular case of the family of elliptic distributions. The univariate case falls into a class of symmetric distributions. If the probability density function (pdf) of a random variable (r.v) X satisfies the condition of being symmetric about zero, then X itself is also symmetric about zero. In other words, its pdf—say, g 0 ( · ) —complies with
g 0 ( x ) = g 0 ( x ) , x R .
Let G 0 ( · ) be the cumulative distribution function (cdf) associated with the r.v. X, which satisfies G 0 ( 0 ) = 1 2 , G 0 ( x ) = 1 G 0 ( x ) and G 0 ( x ) = g 0 ( x ) . For the multivariate case (considering also location and scale), the elliptic family of distributions is obtained; see Kelker [1], Cambanis et al. [2], Fang et al. [3] and Gupta and Varga [4], among others. A family of models with positive support that is generated based on the family of symmetric distributions is the family of half-symmetric (HS) distributions. If the r.v. X is symmetric about zero, then the r.v. Y = | X | has an HS distribution. The pdf of the r.v. Y is given by
h X ( x ) = 2 g 0 ( x ) , x R + .
Incorporating a scale parameter σ > 0 , we obtain
h X ( x ; σ ) = 2 σ g 0 x σ , x R + .
We will denote this as Y H S ( σ ) . The HS family of distributions is a positive truncation of symmetrical distributions. The half-normal (HN) model is a member of this family, which was studied in detail by Pewsey [5,6]. Some extensions of the HN distribution have been introduced by Chou and Liu [7], Barranco-Chamorro et al. [8], Cooray and Ananda [9] and Olmos et al. [10], among others. Other authors who have developed extensions of distributions contained in the HS family are Balakrishnan and Puthenpura [11] for the half-logistic (HL) distribution, Wiper et al. [12] for the HN and half-t (Ht) distributions, Polson and Scott [13] for the half-Cauchy (HC) distribution and Gui [14] for the half-power exponential (HPE) distribution, among others. The aim of the present paper is to use univariate symmetric distributions to generate a new family of distributions with positive support that contains the HS family as a particular case and offers an alternative to distributions with two parameters for modeling data on reliability, survival or any area with continuous positive observations. Also, an R package is available, which can be used by the readers for modeling positive continuous data. The truncation of distributions or families of distributions has grown in recent years; see Almarashi et al. [15], Bantan et al. [16], Morán et al. [17] and Alotaibi et al. [18].
In this context, a key tool is presented, which significantly improves the analysis of truncated distributions: the R package “tpn”. The “tpn” package allows the application of four particular cases of truncated distributions: Truncated Positive Laplace (TPLa), Truncated Positive Logistic (TPL), Truncated Positive Cauchy (TPC) and, finally, Truncated Positive Normal (TPN); this latter was previously studied by Gómez et al. [19]. What sets “tpn” apart is its ability to assist in the selection of the optimal model among these four particular cases by using fitting criteria, providing a valuable tool for researchers and practitioners working in data analysis and statistical modeling.
The paper is organized as follows. In Section 2, we present the new family with positive support and its properties. The methods for estimation and computational implementation of the model in an R package [20] are discussed in Section 3. Section 4 presents a simulation study that evaluates the performance of the maximum likelihood estimators in finite samples. In Section 5, we present an application to a set of real data, showing the fits of four sub-models of the family of distributions. Finally, Section 6 provides the main conclusions drawn from this research.

2. Truncated Positive Symmetrical Distributions

In this section, we present a novel family of distributions with positive support. Its principal associated functions and properties are discussed in detail.

2.1. pdf, cdf and Hazard Function

The following definition presents a new family of distributions with positive support based on a positive truncation of the symmetric distributions, which we call truncated positive symmetrical (TPS).
Definition 1. 
Let X be an r.v. with pdf symmetric around zero g 0 ( x ) and respective cdf G 0 ( x ) . By using the location-scale transformation, a new r.v. Y = σ X + μ is defined, with σ > 0 and μ R . If Z = Y Y > 0 (in other words, Z is defined as the r.v. Y, but only in its positive part), then Z T P S ( σ , λ ) , where λ = μ / σ R is a shape parameter.
The pdf related to the TPS ( σ , λ ) distribution is presented in the following proposition.
Proposition 1. 
For Z T P S ( σ , λ ) , its pdf is given by
f Z ( z ; σ , λ ) = 1 σ G 0 ( λ ) g 0 z σ λ , z > 0 ,
where σ > 0 and λ R are scale and shape parameters, respectively.
Proof. 
From the definition itself, it is clear that the pdf of Z has the form
f Z ( z ; σ , λ ) = K σ g 0 z μ σ , z > 0 ,
where K is the normalization constant. Taking K = 1 / G 0 ( λ ) and making λ = μ / σ , it can directly be seen that
1 σ G 0 ( λ ) 0 g 0 z σ λ d z = 1 σ G 0 ( λ ) λ g 0 ( u ) σ d u = 1 G 0 ( λ ) ( 1 G 0 ( λ ) ) = 1 .
  □
Proposition 2. 
Let Z T P S ( σ , λ ) . Then, the cdf of Z is given by
F Z ( z ; σ , λ ) = 1 G 0 ( λ ) G 0 z σ λ + G 0 ( λ ) 1 , z > 0 .
Proposition 3. 
Let Z T P S ( σ , λ ) . Then, the hazard function of Z is given by
h ( z ) = g 0 z σ λ σ [ 2 G 0 ( λ ) G 0 ( z σ λ ) 1 ] , z > 0 .
The proofs of Propositions 2 and 3 are directly obtained by using the corresponding definitions and algebraic manipulations.

2.2. Particular Cases

The TPS distribution encompasses various models, which are detailed as follows:
  • TPS ( σ , λ = 0 ) HS ( σ ) ;
  • TPS g 0 = n o r m a l TPN ( σ , λ ) ;
  • TPS g 0 = l a p l a c e TPLa ( σ , λ ) ;
  • TPS g 0 = c a u c h y TPC ( σ , λ ) ;
  • TPS g 0 = l o g i s t i c TPL ( σ , λ ) .
The relationships among the TPS and its specific cases are summarized in Figure 1.
The pdf, cdf and hazard function for some particular cases of the TPS model are presented in Table 1. Illustrative plots for the pdf and hazard function are given in Figure 2 and Figure 3, respectively.

2.3. Moments

In this subsection, we discuss the moment-generating function (mgf) and the non-central moments for the TPS model.
Proposition 4. 
Let Z T P S ( σ , λ ) . Then, the mgf of Z is given by
M Z ( t ) = e t σ λ G 0 ( λ ) λ e t σ u g 0 u d u ,
provided that the previous integral exists for t in some neighborhood of zero.
Proof. 
By definition,
M Z ( t ) = 0 e t z 1 σ G 0 ( λ ) g 0 z σ λ d z .
Using the change u = z / σ λ , we obtain the result.    □
Proposition 5. 
Let Z T P S ( σ , λ ) and assume that the moments associated with the pdf g 0 ( · ) exist. Then, for  r = 1 , 2 , , the r-th non-central moment of Z is given by
μ r = E ( Z r ) = σ r G 0 ( λ ) ρ r ( λ ) ,
where ρ r ( λ ) : = k = 0 r r k λ r k λ u k g 0 ( u ) d u .
Proof. 
Using the definition, the non-central moment of the TPS model is
μ r = 0 z r σ G 0 ( λ ) g 0 z σ λ d z .
Using the substitution u = z σ λ , we obtain
μ r = σ r G 0 ( λ ) λ ( u + λ ) r g 0 u d u .
Since r N , the binomial theorem can be applied for the term ( u + λ ) r , obtaining
μ r = σ r G 0 ( λ ) λ k = 0 r r k u k λ r k g 0 u d u .
As the sum converges uniformly in any closer interval [ a , b ] ( λ , ) , we can then exchange the order of the integral and the sum
μ r = σ r G 0 ( λ ) k = 0 r r k λ r k λ u k g 0 u d u .
  □
Remark 1. 
Note that the two first cases for ρ r ( λ ) are given by
ρ 1 ( λ ) = λ G 0 ( λ ) + λ u g 0 ( u ) d u , a n d ρ 2 ( λ ) = λ 2 G 0 ( λ ) + 2 λ λ u g 0 ( u ) d u + λ u 2 g 0 ( u ) d u .
Corollary 1. 
Let Z T P S ( σ , λ ) . Then, the skewness ( β 1 ) and kurtosis ( β 2 ) coefficients of Z are given by
β 1 = G 0 ( λ ) ρ 3 ( λ ) 3 G 0 ( λ ) ρ 1 ( λ ) ρ 2 ( λ ) + 2 ρ 1 3 ( λ ) [ G 0 ( λ ) ρ 2 ( λ ) ρ 1 2 ( λ ) ] 3 / 2 , a n d β 2 = G 0 3 ( λ ) ρ 4 ( λ ) 4 G 0 2 ( λ ) ρ 1 ( λ ) ρ 3 ( λ ) + 6 G 0 ( λ ) ρ 1 2 ( λ ) ρ 2 ( λ ) 3 ρ 1 4 ( λ ) [ G 0 ( λ ) ρ 2 ( λ ) ρ 1 2 ( λ ) ] 2 .
Remark 2. 
Corollary 1 shows that the coefficients of asymmetry and kurtosis depend only on the parameter λ. Table 2 presents the values of the asymmetry and kurtosis coefficients of the TP-normal, TP-logistic and TP-Laplace models for some values of λ. Plots are given in Figure 4.

2.4. Shannon Entropy

The Shannon entropy [21] quantifies the level of uncertainty associated with a r.v. Z. It is defined as
S ( Z ) = E ( log f ( Z ) ) .
Proposition 6. 
For the TPS model, the Shannon entropy is given by
S ( Z ) = log ( σ G 0 ( λ ) ) 1 G 0 ( λ ) λ log ( g 0 ( u ) ) g 0 ( u ) d u .
Proof. 
Using the definition,
S ( Z ) = E log 1 σ G 0 ( λ ) g 0 Z σ λ = E ( log ( σ G 0 ( λ ) ) ) E log g 0 Z σ λ = log ( σ G 0 ( λ ) ) 1 σ G 0 ( λ ) 0 log g 0 z σ λ g 0 z σ λ d z .
With the change u = z / σ λ , we obtain
S ( Z ) = log ( σ G 0 ( λ ) ) 1 G 0 ( λ ) λ log g 0 u g 0 u d z .
  □

2.5. Quantiles

In this subsection, we present some quantiles of the TPS distribution.
Proposition 7. 
Let Z T P S ( σ , λ ) . Then, the quantile function for Z is given by
Q ( p ) = σ [ G 0 1 ( 1 + ( p 1 ) G 0 ( λ ) ) + λ ] , 0 < p < 1 ,
where G 0 1 ( · ) denotes the inverse function of G 0 ( · ) .
Corollary 2. 
The quartiles of the TPS distribution are given by the following:
1. 
First quartile: σ G 0 1 1 3 4 G 0 ( λ ) + λ ;
2. 
Median: σ G 0 1 1 1 2 G 0 ( λ ) + λ ;
3. 
Third quartile: σ G 0 1 1 1 4 G 0 ( λ ) + λ .

2.6. Mode

We now present the mode of the TPS distribution.
Proposition 8. 
The mode of the T P S ( σ , λ ) model is as follows:
1.
z = σ λ when λ 0 ;
2.
z = 0 when λ < 0 ,
Proof. 
Since the mode of g 0 is attained at 0, the corresponding mode of the TPS satisfies z / σ λ = 0 . If λ 0 , then the mode will be attained at z = σ λ . If λ < 0 , the pdf of the TPS model will be strictly decreasing in ( 0 , ) ; then, the mode of the model is 0.    □
Remark 3. 
Since λ = μ σ , Proposition 8 can be rewritten in terms of μ. That is the mode of the TPS model is reached at μ if μ 0 and at 0 if μ < 0 .

2.7. Order Statistics

One of the models that naturally arises when the observations are placed in increasing order is the order statistics (see Balakrishnan and Clifford [22]). When the r.v.s are sampled from a continuous distribution with common pdf and cdf f ( · ) and F ( · ) , respectively, then the pdf of the k-th order statistic is given by
f k : n ( z ) = n ! ( k 1 ) ! ( n k ) ! f ( z ) [ G ( z ) ] k 1 [ 1 G ( z ) ] n k , z Z , k = 1 , , n ,
where Z is the support related to the distribution. The following proposition particularizes this result for the TPS distribution.
Proposition 9. 
Suppose the r.v.s Z 1 , Z 2 , , , Z n are independent and identically distributed TPS ( σ , λ ) . Then, the pdf of the k-th order statistic is given by
f k : n ( z ) = n ! σ ( k 1 ) ! ( n k ) ! G 0 n ( λ ) g 0 z σ λ G 0 z σ λ + G 0 ( λ ) 1 k 1 1 G 0 z σ λ n k .
Proof. 
This is a direct consequence of Equation (8) taking f ( z ) as the pdf of the TPS model.    □
Corollary 3. 
For the minimum and maximum, the pdf is reduced to the following cases:
1.
Minimum ( k = 1 ).
f 1 : n ( z ) = n σ G 0 n ( λ ) g 0 z σ λ 1 G 0 z σ λ n 1 ;
2.
Maximum ( k = n ).
f n : n ( z ) = n σ G 0 n ( λ ) g 0 z σ λ G 0 z σ λ + G 0 ( λ ) 1 n 1 .

3. Inference

In this section, we explore a traditional method for making inferences about the TPS distribution. Specifically, we delve into the estimation of moments and maximum likelihood (ML) techniques.

3.1. Moment Estimators

The moment estimators arise from solving the equations E ( Z r ) = Z r ¯ for r = 1 , 2 , where Z r ¯ = n 1 i = 1 n z i r refers to the r-th sample moment. Solving E ( Z ) = Z ¯ , one obtains
σ = Z ¯ G 0 ( λ ) ρ 1 ( λ ) .
Replacing the first and second sample moments in the expressions given in Proposition 5, it follows that
Z ¯ 2 G 0 ( λ ) ρ 2 ( λ ) Z 2 ¯ ρ 1 2 ( λ ) = 0 .
Various software options are available to solve Equation (11). For instance, in R, the uniroot function can be utilized to calculate the moment estimator λ ^ M . The moment estimator σ ^ M is obtained by replacing λ ^ M in Equation (10).
Remark 4. 
The application of the method of moments assumes the existence of at least the first two moments of the distribution. For instance, for g 0 ( · ) , the pdf of the standard Cauchy is not fulfilled.

3.2. Maximum Likelihood Estimation

Given z 1 , , z n , i.e., a random sample from the TPS ( σ , λ ) distribution based on the logarithm of the pdf given in Equation (4), the log-likelihood function for θ = ( σ , λ ) is given by
( θ ) = n log ( σ ) n log ( G 0 ( λ ) ) i = 1 n log g 0 z i σ λ .
To compute the maximum likelihood estimator (MLE) for θ , (12) must be maximized. An equivalent approach is to solve the following system of equations: ( θ ) σ = 0 and ( θ ) λ = 0 , namely,
i = 1 n g 0 ( σ ) ( v i ) g 0 ( v i ) = n σ ,
i = 1 n g 0 ( λ ) ( v i ) g 0 ( v i ) = n g 0 ( λ ) G 0 ( λ ) ,
where v i = z i σ λ , g 0 ( σ ) ( v i ) = g 0 ( v i ) σ and g 0 ( λ ) ( v i ) = g 0 ( v i ) λ . Equations (13) and (14) can be numerically solved by using the R-4.3.1 software [20].

3.3. Observed Fisher Information Matrix

The asymptotic variance of MLEs, θ ^ = ( σ ^ , λ ^ ) , can be estimated from the Fisher information matrix, given by I ( θ ) = E 2 ( θ ) / θ θ , with ( θ ) given in (12). Recall that, under regularity conditions,
I ( θ ) 1 / 2 θ ^ θ D N 2 ( 0 2 , I 2 ) , a s   n + ,
where D stands for convergence in distribution and N 2 ( 0 2 , I 2 ) denotes the standard bivariate normal distribution. Moreover, I ( θ ) can be obtained from the matrix 2 ( θ ) / θ θ , whose elements are given by I σ σ = 2 ( θ ) / σ 2 , I σ λ = 2 ( θ ) / σ λ and so on. Explicitly, we have
I σ σ = n σ 2 i = 1 n g 0 ( σ σ ) ( v i ) g 0 ( v i ) ( g 0 ( σ ) ( v i ) ) 2 g 0 2 ( v i ) , I σ λ = i = 1 n g 0 ( σ ) ( v i ) g 0 ( λ ) ( v i ) g 0 ( σ λ ) ( v i ) g 0 ( v i ) g 0 2 ( v i ) , I λ λ = n ( g 0 2 ( λ ) g 0 ( λ ) ( v i ) G ( λ ) ) G ( λ ) + i = 1 n g 0 ( λ λ ) ( v i ) g 0 ( v i ) ( g 0 ( λ ) ( v i ) ) 2 g 0 2 ( v i ) ,
where g 0 ( σ σ ) ( v i ) = 2 g 0 ( v i ) σ 2 , g 0 ( σ λ ) ( v i ) = 2 g 0 ( v i ) σ λ and g 0 ( λ λ ) ( v i ) = 2 g 0 ( v i ) λ 2 .

3.4. Residuals

The quantile residuals (QRs) introduced by Dunn and Smyth [23] are an ad hoc method to assess the validity of a model. For the TPS distribution, such residuals are given by
r ^ i = Φ 1 1 1 G 0 z i σ ^ λ ^ G 0 ( λ ^ ) , i = 1 , , n ,
where σ ^ and λ ^ denote the ML estimators of σ and λ , respectively. If the model is correctly specified, then r ^ 1 , r ^ 2 , , r ^ n will be a random sample from the standard normal distribution, which can be checked using traditional normality tests such as Anderson–Darling (AD), Cramér–von Mises (CVM) and Shapiro–Wilks (SW).

3.5. Software

The ML estimation method for the TPS model was implemented in the tpn package [24] from R [20], version 1.7. To achieve these outcomes, the subsequent function can be utilized:
    est.fts(y, dist="norm")
where y is the response variable and dist should be any of the following options: norm (default), logis, cauchy and laplace for the normal, logistic, Cauchy and Laplace distributions, respectively. The TPN package provides additional functions such as dfts, pfts, qfts and rfts. These functions allow for the calculation of the pdf, cdf, quantile function and generated data for the TPS model using any of the available distributions.

4. Simulation

In this section, we assess the performance of the ML estimators for the TPS distribution in finite samples. Since the normal distribution was studied in Gómez et al. [19], we focus here on the logistic and Cauchy models. We examined two values for σ , namely, 1 and 10, as well as four values for λ , which were 0.5, 1, 2 and 3. Additionally, we considered three different sample sizes, namely, 50, 100 and 200. In total, we had 48 combinations of σ , λ and n as models. For each combination, we performed 1000 replicates and utilized the est.tps function from Section 3.5 to obtain the maximum likelihood estimators for the corresponding model. Table 3 provides a summary of the average bias, standard errors, root mean squared errors and coverage probabilities of the ML estimators at a 95% confidence level based on the asymptotic distribution. It is worth noting that increasing the sample size leads to a reduction in both bias and RMSE, indicating that the estimators are consistent. Moreover, as the sample size increases, the standard errors and RMSE become more similar, suggesting that the standard errors are accurately estimated. Finally, the coverage probabilities approach the nominal values used to construct the confidence intervals, indicating that the normal model-based approximation for the distribution of the ML estimators performs well, even for finite samples.

5. Application to Real Data

In this section, we illustrate our family of models using a database associated with survival times of Guinea pigs injected with different doses of tubercle bacilli, specifically, regimen 66; these data were studied by Bjerkedal [25]. The study of the survival time of Guinea pigs under different diets is crucial for identifying patterns and ratios, quantifying uncertainty in predictions and comparing the impact of the different diets on the animals’ longevity. This approach contributes to progress in investigation into animal nutrition and welfare. Table 4 shows a descriptive analysis of the data.
Table 5 shows the estimations for the TPN, TPLa, TPC and TPL models in the Guinea pigs data and the respective AIC and BIC criteria. Note that the TPC model provides the lowest AIC; this is corroborated by Figure 5, which shows that the TPC model provides better performance for this dataset than competing models and a better fit in the accumulated distribution (green line cdf TPC) than the empirical fit (black line). The Kolmogorov–Smirnov (KS) distance and the corresponding p-values are also included. These summaries further support our conclusions.
In order to assess the suitability of each model, we conducted an analysis using the QRs introduced in Section 3.4. Based on the findings presented in Figure 6 (TPN, TPLa, TPC and TPL), it can be concluded that, among all the fitted models, only the TPC model is suitable for this particular dataset.

6. Conclusions

This paper presents the TPS family of distributions based on univariate symmetric distributions, which contains, as a special case, the HS family. The TPS family has two parameters: scale and shape. This fact makes it an attractive competitor to several two-parameter models used in different areas of knowledge. The TPS family is a viable alternative to the HS family of distributions for fitting continuous positive data. Relevant characteristics of the TPS family studied in this paper are as follows:
  • The TPS family of distributions has turned out to be more flexible than the HS family, since it has a shape parameter;
  • The TPS family of distributions is based on the univariate symmetric family of distributions. It has been proven that its properties are manageable and depend to a large extent on the properties of the pdf and cdf of the basal symmetric distribution;
  • pdf, cdf and risk functions are represented by very well-known functions;
  • A simulation study was carried out for some members of the TPS family. The ML estimators showed good performance for moderate sample sizes;
  • A real application is given, in which several members of the TPS family were considered and fitted. The AIC and BIC model selection criteria showed which member had the best fit to this dataset. The AD, CVM and SW tests were carried out to validate the normality of the QR;
  • As for the applicability of our results, we highlight that our proposal allowed us to incorporate a variety of shapes for modeling positive data, as has been established in Section 2. Therefore, these models can be considered as competitors to those commonly used for modeling lifetime data and other disciplines with positive data. On the other hand, an important advantage is that the new models inherit theoretical properties from the base models on which they are built, which increases their interest from an applied point of view;
  • Practitioners may use the R package tpn for modeling positive continuous data.

Author Contributions

Conceptualization, H.J.G., K.I.S., I.B.-C. and D.I.G.; software, H.J.G. and D.I.G.; formal analysis, K.I.S., D.I.G., I.B.-C. and O.V.; investigation, H.J.G. and K.I.S.; writing—original draft preparation, H.J.G., K.I.S. and D.I.G.; writing—review and editing, D.I.G., I.B.-C. and O.V.; supervision, O.V. and H.W.G.; funding acquisition, O.V. and H.W.G. All authors have read and agreed to the published version of the manuscript.

Funding

The research of H.W. Gómez was supported by Semillero UA-2023. The research of I. Barranco-Chamorro was supported by IOAP of University of Seville, Spain.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The dataset is available in Bjerkedal [25].

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kelker, D. Distribution theory of special distributions and location-scale parameter. Sankhya A 1970, 32, 419–430. [Google Scholar]
  2. Cambanis, S.; Huang, S.; Simons, G. On the theory of elliptically contoured distributions. J. Multivar. Anal. 1981, 11, 368–385. [Google Scholar] [CrossRef]
  3. Fang, K.T.; Kotz, S.; Ng, K.W. Symmetric Multivariate and Related Distributions; Chapman & Hall: London, UK; New York, NY, USA, 1990. [Google Scholar]
  4. Gupta, A.K.; Varga, T. Elliptically Contoured Models in Statistics; Kluwer Academics Publishers: Boston, MA, USA, 1993. [Google Scholar]
  5. Pewsey, A. Large-sample inference for the general half-normal distribution. Commun. Stat.-Theory Methods 2002, 31, 1045–1054. [Google Scholar] [CrossRef]
  6. Pewsey, A. Improved likelihood based inference for the general half-normal distribution. Commun. Stat.-Theory Methods 2004, 33, 197–204. [Google Scholar] [CrossRef]
  7. Chou, C.Y.; Liu, H.R. Properties of the half-normal distribution and its application to quality control. J. Ind. Technol. 1998, 14, 4–7. [Google Scholar]
  8. Barranco-Chamorro, I.; Moreno-Rebollo, J.L.; Pascual-Acosta, A.; Enguix-Gonzalez, A. An overview of asymptotic properties of estimator in truncated distributions. Commun. Stat.-Theory Methods 2007, 36, 2351–2366. [Google Scholar] [CrossRef]
  9. Cooray, K.; Ananda, M.M.A. A generalization of the half-normal distribution with applications to lifetime data. Commun. Stat.-Theory Methods 2008, 37, 1323–1337. [Google Scholar] [CrossRef]
  10. Olmos, N.M.; Varela, H.; Gómez, H.W.; Bolfarine, H. An extension of the half-normal distribution. Stat. Pap. 2012, 53, 875–886. [Google Scholar] [CrossRef]
  11. Balakrishnan, N.; Puthenpura, S. Best Linear Unbiased Estimators of Location and Scale Parameters of the Half Logistic Distribution. J. Stat. Comput. Simul. 1986, 25, 193–204. [Google Scholar] [CrossRef]
  12. Wiper, M.P.; Giron, F.J.; Pewsey, A. Objective Bayesian inference for the half-normal and half-t distributions. Commun. Stat.-Theory Methods 2008, 37, 3165–3185. [Google Scholar] [CrossRef]
  13. Polson, N.G.; Scott, J.G. On the half-Cauchy prior for a global scale parameter. Bayesian Anal. 2012, 7, 887–902. [Google Scholar] [CrossRef]
  14. Gui, W. Statistical Inferences and Applications of the Half Exponential Power Distribution. J. Qual. Reliab. Eng. 2013, 2013, 219473. [Google Scholar] [CrossRef]
  15. Almarashi, A.M.; Elgarhy, M.; Jamal, F.; Chesneau, C. The Exponentiated Truncated Inverse Weibull-Generated Family of Distributions with Applications. Symmetry 2020, 12, 650. [Google Scholar] [CrossRef]
  16. Bantan, R.A.R.; Chesneau, C.; Jamal, F.; Elbatal, I.; Elgarhy, M. The Truncated Burr X-G Family of Distributions: Properties and Applications to Actuarial and Financial Data. Entropy 2021, 23, 1088. [Google Scholar] [CrossRef] [PubMed]
  17. Morán-Vásquez, R.A.; Cataño Salazar, D.H.; Nagar, D.K. Some Results on the Truncated Multivariate Skew-Normal Distribution. Symmetry 2022, 14, 970. [Google Scholar] [CrossRef]
  18. Alotaibi, N.; Elbatal, I.; Almetwally, E.M.; Alyami, S.A.; Al-Moisheer, A.S.; Elgarhy, M. Truncated Cauchy Power Weibull-G Class of Distributions: Bayesian and Non-Bayesian Inference Modelling for COVID-19 and Carbon Fiber Data. Mathematics 2022, 10, 1565. [Google Scholar] [CrossRef]
  19. Gómez, H.J.; Olmos, N.M.; Varela, H.; Bolfarine, H. Inference for a truncated positive normal distribution. Appl. Math. J. Chin. Univ. 2018, 33, 163–176. [Google Scholar] [CrossRef]
  20. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023; Available online: https://www.R-project.org/ (accessed on 18 October 2023).
  21. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  22. Balakrishnan, N.; Clifford Cohen, A. Order Statistics and Inference: Estimation Methods; Statistical Modeling and Decision Science Series; Elsevier Science: Amsterdam, The Netherlands, 1991. [Google Scholar]
  23. Dunn, P.K.; Smyth, G.K. Randomized Quantile Residuals. J. Comput. Graph. Stat. 1996, 5, 236–244. [Google Scholar]
  24. Gallardo, D.I.; Gómez, H.J.; Gómez, Y.M. tpn: Truncated Positive Normal Model and Extensions. R Package Version 1.7. 2023. Available online: https://CRAN.R-project.org/package=tpn (accessed on 18 October 2023).
  25. Bjerkedal, T. Acquisition of resistance in guinea pigs infected with different doses of virulent tubercle bacilli. Amer. J. Hyg. 1960, 72, 130–148. [Google Scholar] [PubMed]
Figure 1. Some particular cases for the TPS distribution.
Figure 1. Some particular cases for the TPS distribution.
Mathematics 11 04431 g001
Figure 2. pdf for some particular cases of T P S g 0 ( σ = 1 , λ ) , with λ = 0 (black line), λ = 1 (red line), λ = 2 (green line) and λ = 3 (blue line).
Figure 2. pdf for some particular cases of T P S g 0 ( σ = 1 , λ ) , with λ = 0 (black line), λ = 1 (red line), λ = 2 (green line) and λ = 3 (blue line).
Mathematics 11 04431 g002aMathematics 11 04431 g002b
Figure 3. Hazard function for some particular cases of T P S g 0 ( σ = 1 , λ ) , with λ = 0 (black line), λ = 1 (red line), λ = 2 (green line) and λ = 3 (blue line).
Figure 3. Hazard function for some particular cases of T P S g 0 ( σ = 1 , λ ) , with λ = 0 (black line), λ = 1 (red line), λ = 2 (green line) and λ = 3 (blue line).
Mathematics 11 04431 g003
Figure 4. Skewness and kurtosis coefficients for some particular cases of the T P S g 0 ( σ = 1 , λ ) model.
Figure 4. Skewness and kurtosis coefficients for some particular cases of the T P S g 0 ( σ = 1 , λ ) model.
Mathematics 11 04431 g004
Figure 5. Family models adjusted by the maximum likelihood method for the Survival Times (in days) of Guinea Pigs dataset and the fit of the cumulative distribution of the TPC model with the empirical one.
Figure 5. Family models adjusted by the maximum likelihood method for the Survival Times (in days) of Guinea Pigs dataset and the fit of the cumulative distribution of the TPC model with the empirical one.
Mathematics 11 04431 g005
Figure 6. QRs for the fitted models of the Survival Times (in days) of Guinea Pigs dataset. The p-values for the AD, CVM and SW normality tests are also presented.
Figure 6. QRs for the fitted models of the Survival Times (in days) of Guinea Pigs dataset. The p-values for the AD, CVM and SW normality tests are also presented.
Mathematics 11 04431 g006
Table 1. pdf, cdf and hazard function for some particular cases of the TPS model.
Table 1. pdf, cdf and hazard function for some particular cases of the TPS model.
g 0 pdfcdfHazard Function
Normal 1 σ Φ ( λ ) ϕ z σ λ Φ z σ λ + Φ ( λ ) 1 Φ ( λ ) ϕ z σ λ σ 1 Φ z σ λ
Logistic ( 1 + e λ ) e ( z σ λ ) σ ( 1 + e ( z σ λ ) ) 2 1 ( 1 + e λ ) e z σ λ 1 + e z σ λ 1 σ 1 + e z σ λ
Laplace e | z σ λ | σ ( 1 + s g n ( λ ) [ 1 + e | λ | ] ) s g n z σ λ 1 e z σ λ + s g n ( λ ) ( 1 e | λ | ) 1 + s g n ( λ ) ( 1 e | λ | ) e | z σ λ | 2 σ 1 1 2 + 1 2 s g n z σ λ 1 e | z σ λ | )
Cauchy ( 1 + ( z σ λ ) 2 ) 1 σ ( arctan ( λ ) + π 2 ) 1 π arctan z σ λ + arctan ( λ ) 1 2 + 1 π arctan ( λ ) 1 2 1 π arctan z σ λ 1 σ π 1 + z σ λ 2
ϕ ( · ) and Φ ( · ) denote the pdf and cdf of the standard normal distribution, respectively, and s g n ( · ) denotes the sign function.
Table 2. Asymmetry and kurtosis coefficients for different values of λ .
Table 2. Asymmetry and kurtosis coefficients for different values of λ .
TPNTPLTPLa
λ β 1 β 2 β 1 β 2 β 1 β 2
−51.837.761.998.972.009.00
−21.546.021.918.462.009.00
−11.324.991.777.742.009.00
00.993.871.546.582.009.00
10.593.001.215.281.697.76
20.222.760.884.321.156.13
50.002.990.233.730.265.29
Table 3. Empirical bias, SE, RMSE and 95% CP for the ML estimators of σ and λ in the TPS distribution using the logisticlogistic and Cauchy as basal models.
Table 3. Empirical bias, SE, RMSE and 95% CP for the ML estimators of σ and λ in the TPS distribution using the logisticlogistic and Cauchy as basal models.
True Values n = 50 n = 100 n = 200
Model σ λ Estimator Bias SE RMSE CP Bias SE RMSE CP Bias SE RMSE CP
logistic10.5 σ ^ −0.0100.2100.2130.897−0.0030.1500.1510.9220.0000.1070.1090.940
λ ^ −0.0591.3381.1290.9380.0040.6220.6230.945−0.0050.3950.3960.961
1 σ ^ −0.0050.1970.1960.919−0.0040.1380.1380.934−0.0020.0980.0970.943
λ ^ 0.0170.7600.7510.9580.0220.4750.4800.9590.0110.3300.3290.955
2 σ ^ −0.0060.1670.1680.9250.0010.1180.1180.941−0.0020.0830.0840.943
λ ^ 0.0680.5770.5940.9560.0210.4030.4030.9580.0180.2830.2880.951
3 σ ^ −0.0090.1450.1490.925−0.0050.1020.1040.933−0.0020.0730.0740.944
λ ^ 0.0910.5770.5950.9500.0500.4040.4090.9490.0240.2840.2880.949
100.5 σ ^ −0.0752.1102.1050.905−0.0241.5041.5050.928−0.0081.0641.0780.934
λ ^ −0.0491.1861.0580.9440.0020.6080.6110.952−0.0010.3940.4050.953
1 σ ^ −0.0151.9782.0600.9120.0031.3921.4160.934−0.0290.9770.9930.936
λ ^ 0.0290.7460.7640.9500.0130.4760.4820.9590.0070.3300.3330.955
2 σ ^ −0.0401.6701.6950.923−0.0271.1751.1710.942−0.0100.8300.8420.944
λ ^ 0.0600.5770.5850.9610.0300.4030.4040.9560.0170.2830.2860.954
3 σ ^ −0.0531.4571.4660.929−0.0241.0291.0110.946−0.0100.7270.7200.944
λ ^ 0.0840.5770.6010.9560.0390.4040.4000.9530.0180.2840.2810.954
Cauchy10.5 σ ^ −0.0320.2190.2130.924−0.0140.1500.1490.940−0.0140.1050.1080.933
λ ^ 0.0280.5060.5230.9770.0070.3220.5510.9740.0290.1760.1760.956
1 σ ^ −0.0100.2200.2250.918−0.0040.1560.1550.936−0.0060.1100.1110.941
λ ^ 0.0720.3900.4040.9690.0330.2640.2670.9600.0240.1840.1850.956
2 σ ^ 0.0000.2190.2270.929−0.0020.1540.1550.938−0.0020.1090.1090.942
λ ^ 0.1040.5270.5520.9540.0520.3640.3750.9520.0290.2540.2570.952
3 σ ^ −0.0020.2140.2210.9290.0000.1510.1530.940−0.0030.1060.1050.945
λ ^ 0.1530.7160.7500.9500.0700.4920.5030.9500.0430.3440.3400.957
100.5 σ ^ −0.3452.2032.1670.914−0.0881.5021.4980.940−0.1491.0521.0470.943
λ ^ 0.0320.4990.5180.9750.0020.3660.5490.9780.0310.1750.1750.965
1 σ ^ −0.1342.1842.2070.923−0.0711.5601.5760.930−0.0831.1001.0950.944
λ ^ 0.0710.3840.3920.9750.0300.3260.5320.9590.0270.1840.1840.959
2 σ ^ −0.0482.1762.1720.925−0.0031.5441.5560.940−0.0151.0871.0790.944
λ ^ 0.1120.5280.5480.9590.0500.3640.3710.9530.0270.2540.2540.958
3 σ ^ −0.0092.1492.2180.928−0.0041.5071.5400.939−0.0181.0621.0790.941
λ ^ 0.1490.7170.7510.9530.0740.4920.5020.9520.0410.3440.3530.944
Table 4. Descriptive statistics of the Survival Times (in days) of Guinea Pigs dataset.
Table 4. Descriptive statistics of the Survival Times (in days) of Guinea Pigs dataset.
Datasetn X ¯ S 2 b 1 b 2
Survival Times of Guinea Pigs in Days72 99.82 6580.12 1.80 5.61
Table 5. Estimated parameters and their standard errors (in parentheses) for the TPN, TPLa, TPC and TPL models for the Survival Times (in days) of Guinea Pigs dataset. The AIC and BIC criteria are also presented.
Table 5. Estimated parameters and their standard errors (in parentheses) for the TPN, TPLa, TPC and TPL models for the Survival Times (in days) of Guinea Pigs dataset. The AIC and BIC criteria are also presented.
ParametersTPNTPLaTPCTPL
σ 154.944 (47.298)67.772 (0.035)29.926 (5.895)57.513 (10.091)
λ −0.488 (0.762)0.885 (0.004)2.133 (0.451)0.749 (0.556)
log-likelihood−401.46−392.27−391.16−399.38
AIC806.92788.54786.33802.75
BIC811.47793.09790.88807.31
KS (p-value)0.152 (0.073)0.105 (0.400)0.085 (0.675)0.126 (0.200)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gómez, H.J.; Santoro, K.I.; Barranco-Chamorro, I.; Venegas, O.; Gallardo, D.I.; Gómez, H.W. A Family of Truncated Positive Distributions. Mathematics 2023, 11, 4431. https://doi.org/10.3390/math11214431

AMA Style

Gómez HJ, Santoro KI, Barranco-Chamorro I, Venegas O, Gallardo DI, Gómez HW. A Family of Truncated Positive Distributions. Mathematics. 2023; 11(21):4431. https://doi.org/10.3390/math11214431

Chicago/Turabian Style

Gómez, Héctor J., Karol I. Santoro, Inmaculada Barranco-Chamorro, Osvaldo Venegas, Diego I. Gallardo, and Héctor W. Gómez. 2023. "A Family of Truncated Positive Distributions" Mathematics 11, no. 21: 4431. https://doi.org/10.3390/math11214431

APA Style

Gómez, H. J., Santoro, K. I., Barranco-Chamorro, I., Venegas, O., Gallardo, D. I., & Gómez, H. W. (2023). A Family of Truncated Positive Distributions. Mathematics, 11(21), 4431. https://doi.org/10.3390/math11214431

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop