Abstract
The study of hydrological characteristics has a vital role in designing, planning, and managing water resources. The selection of appropriate probability distributions and methods of estimations are basic elements in hydrology analyses. In this article, a new family named the ‘exponentiated power alpha index generalized’ (EPAIG)-G is proposed to develop several new distributions. Using this proposed family, we developed a new model, called the EPAIG-exponential (EPAIG-E). A few structural properties of the EPAIG-G were obtained. The EPAIG-E parameters were estimated through the method of maximum likelihood (MML). The study of the Monte Carlo simulation (MCS) was produced for the EPAIG-E. The model performance is illustrated using real data.
Keywords:
quantiles; moments; entropies; likelihood; exponentiated; asymptotic; exponential; simulation MSC:
60E05; 62E15
1. Introduction
In a variety of fields, the exponential distribution (ED) is commonly used for the examination of data. ED, at times, performs well with different datasets; however, in several practical instances, it is not flexible enough to examine the complicated behaviors shown by the data. Therefore, the consideration involves more flexible distributions, which can fit any type of data with any level of intricacy. To put a new parameter to the baseline distribution (BLD), Gupta et al. (1998) [1] proposed the exponentiated method (EM) with the parameter being the power of the cumulative distribution function (cdf), which provides adaptable and more flexible distributions. Let the cdf be the continuous BLD with a random variable (r.v.) X, then the exponentiated derived distribution cdf is
where is an extra shape/power parameter and is the BLD parameter.
Recently, to obtain a more flexible family of distributions (FoDs), Mahdavi and Kundu (2017) [2] proposed the alpha power method (APM). The cdf of the APM family with continuous r.v. X is defined as
Many researchers have applied the EM and APM on different traditional distributions. Gupta and Kundu (2001) [3] derived the generalized exponential (GE) (also called the exponentiated exponential (EE)) distribution with two parameters—the shape () and scale (). The cdf of the EE distribution (EED) is defined as
The probability density function (pdf) of EED varies significantly based on . Moreover, if , the hazard function (HF) of EED is a non-increasing function, and if , it is a non-decreasing function. The HF is constant when . Clearly, the ED is a special case of EED for . Naturally, EE and gamma distributions have several properties that are quite similar, but EED has explicit expressions and the reliability functions (RFs) are identical to the Weibull distribution. In different manners, the EE, Weibull, and gamma distributions extend the ED. For more details on EMs and the exponentiated family of distributions, we refer to Tahir and Nadarajah (2015) [4], ElSherpieny and Almetwally (2022) [5], and Hussain et al. (2022) [6].
Developing new FoDs is another forceful method to derive more flexible distributions. For any continuous distribution (CD), generators of various types can be practiced to obtain new FoDs. Alzaatreh et al. (2013) [7] derived the transformed-transformer method (also called the T-X family) for generating new FoDs as any CD considered a generator. Let X be a r.v. for any CD, then the family T-X cdf is
where under the control of , is the cdf of any BLD, at that is a monotone increasing–differentiable function of , and is the pdf of r.v. as . A new class of FoDs will be obtained using different transformers .
In view of the EM Equation (1), the APM Equation (2), and the T-X family Equation (4), we developed a new class by the name of the exponentiated power alpha index generalized-G (EPAIG-G) family. Let cdf
of r.v. T. The pdf conforming to Equation (5) is
Here, we use
as the transformer, and is the cumulative hazard rate function (CHRF) of BLD. Setting Equations (7) and (6) in Equation (4), then the cdf of the EPAIG-G family is defined as
The pdf of the EPAIG-G family corresponds to Equation (8), and is given as
where is the hazard rate function (HRF) of BLD.
The reliability measures of X follow EPAIG-G () with pdf Equation (9) defined below.
- i.
- The RF of the EPAIG-G family, using Equation (8), is
- ii.
- The HRF, CHRF, and reversed HRF of the EPAIG-G family are defined below:
The article layout is as follows. In Section 1, a new family called EPAIG-G is introduced with its reliability measures. We obtained useful expansion and derived structural properties of the EPAIG-G, respectively, in Section 2 and Section 3. The estimations by MML of the parameters of the EPAIG-G family are presented in Section 4. Section 5 explains the proposed EPAIG-E model and its pdf and HRF plots. Inside this section, the MCS study was performed to assess the performance of the EPAIG-E model. The potentiality of the EPAIG-E is also illustrated through real data. Finally, Section 6 presents our conclusions.
2. Useful Expansions of the EPAIG-G Family
The expansion of binomial (EoB) is
The Maclaurin series for is
The EoB series for the non-integer real positive and is
For the arbitrary parameters and , it can be proven that
where are Stirling polynomials. For details of Equation (16), see Flajonet and Odlyzko (1990); Flajonet and Sedgewick (2009) [8,9]. Applying Equation (16), we obtain using , which is equal to
The power series is
Equation (20) was extracted as
where presents the exponentiated-G (exp-G ()) cdf.
The pdf of X corresponding to Equation (20), using Equations (10), (12), (14), (16) and (18) in Equation (9), respectively, can be defined as
where
From Equation (23), another extracted form of the pdf with the infinite linear combination is given as
where , is the density of exp-G, with as the power parameter.
3. Structural Properties of the EPAIG-G Family
The properties of the EPAIG-G structure from Equation (24) and of exp-G properties can be derived as follows.
3.1. Quantile Function (QF)
Equation (25) reveals the QF of the EPAIG-G, which is based on the BLD QF. By substituting suitable u values in Equation (25), quantiles of interest can be obtained. In particular, when , the median of X is expressed as
Equation (25) can also be used for simulating EPAIG-G r.v.’s using the uniform r.v. .
3.2. Moments
Let a r.v. follow exp-G () with pdf . The nth ordinary moments (OMs) of X follow Equation (24), and can be obtained as
Another formula for from Equation (24) can be acquired in the BLD QF reference. We have
where , in connection with F QF, is
In addition, for X, the mean moments and cumulants for Equation (24) can be obtained as
and
respectively, where .
By the nth incomplete moment (IM) of X, using Equation (24), we can write
where can be computed for the F BLD.
Setting in Equation (30) gives the first IM. The first IM of X is important to obtain the mean deviations, which can be applied to measure the dispersion amount in a population, and the curves of Lorenz and Bonferroni, which have appropriate empirical applications in areas of reliability, economics, demography, and many others.
The nth factorial descending moment (FDM) of X is evaluated as:
where
is the Stirling number of the first kind that permutes the n-item list into cycles .
3.3. Moment Generating Function (MGF)
Here, we provide the MGF of X, i.e., . From Equation (24):
where is the MGF of . Because of that, can come by the MGF of exp-G().
3.4. Mean Deviations (MDs)
The formulae are given below:
and
describe the MDs of X through the mean and over the median M, respectively, is the pdf from Equation (24). Using Equation (30) by setting , the formulae in Equations (34) and (35) can be expressed as
and
where is the cdf from Equation (8) evaluated at and the first IM from Equation (24) is given below
where
3.5. Lorenz and Bonferroni Curves
3.6. Probability Weighted Moments (PWMs)
For r.v. X, the th PWMs are given as
Applying Equations (10), (12), (14), (16) and (18) in Equation (43), then Equation (43) becomes
where
The PWM quantity in Equation (45) can be achieved on condition of BLD QF by taking ; we have
3.7. Entropies
In areas of engineering, science, probability theory, finance, economics, physics, and electronics, entropy has been widely applied to measure the variation of uncertainty. Rényi (1961) [10] and Shannon (1951) [11] are two well-known entropy measures.
3.7.1. Rényi Entropy (RE)
For and , the RE of X with pdf in Equation (9) is defined as
where X is a EPAIG-G r.v., we derive the expression for the RE. First, using Equation (10), we compute as
where
Using Equation (16), we can write
Thus,
where and come from the BLD as . Hence, the RE of X is
3.7.2. Shannon Entropy (SE)
If a r.v. , the EPAIG-G FoD is defined in Equation (9),
then the SE of X, , using Equation (55), is defined as
From Equation (8), the r.v. has the pdf in Equation (6), Alzaatreh et al. (2013) [7] and defines the QF, , , for the T-X family of the distribution of the formula as
Equation (57) implies
The log series expansion is
For any and , using Equation (16), we can write
For a given and , the expectations in Equation (63) can be easily evaluated numerically.
3.8. Order Statistics (OS)
Here, , the random sample is from the EPAIG-G family. The pdf of ‘rth’ order statistics, , using in Equation (20) and in Equation (24) is conveyed as
A power series set up to n as the positive integer by Gradshteyn and Ryzhik, 2007 [12], is given as
Here, and ; are obtained by recurrence relations
From the exp-G properties and using Equation (71), we can obtain several mathematical properties of the EPAIG-G order statistics. Clearly, the cdf of can be extracted as
4. Maximum Likelihood Estimation of the EPAIG-G Family
Here, we consider the estimation by MML. Let us sample from the family EPAIG-G in Equation (9). The log-likelihood function (LLF) for unknown parameters is
The analytical score functions (SFs) for , , and are
and
Setting , , and , and solving numerically nonlinear likelihood equations simultaneously lead to the ML estimates (MLEs) .
5. The EPAIG-Exponential (EPAIG-E) Distribution
The EPAIG-E distribution (EPAIG-ED) is defined from Equations (8) and (9) by taking , , and to be the pdf, cdf, HRF, and CHRF of the ED with the positive parameter
The pdf and cdf of the EPAIG-ED are defined as
and
where parameter is the scale and parameters are the shapes.
Using Equation (80), the RF of the EPAIG-ED is
The HRF, CHRF, and reversed HRF of the EPAIG-ED, using Equations (79), (80) and (81), respectively, are given as
and
Figure 1.
Plots of the pdf for the EPAIG-ED.
Figure 2.
Plots of HRF for the EPAIG-ED.
5.1. Quantile Function and Simulation Study of the EPAIG-ED
Let EPAIG-ED with the cdf in Equation (80). Then, the QF of X, say where can be computed by inverting Equation (80), is as follows
The QF of the EPAIG-ED in Equation (82) has a closed-form expression. By fixing Equation (82) , and , the quartile’s first, second (median), and third are obtained, respectively.
Measures for skewness (Sk) and kurtosis (Ku), QF-dependent, are sometimes more appropriate for the T-X family of distributions. The long tail degrees toward the left and right sides describe the skewness while the tail-heaviness degree describes kurtosis. When Sk (or ), the distribution is left- (or right)-skewed, and when Sk , the distribution is symmetric. The tail becomes heavier as Ku increases. Using QFs, Galton (1883) [13] and Moors (1988) [14] defined the skewness measure (Sk) and kurtosis measure (Ku), respectively, which are defined as follows
and
Figure 3 demonstrates the skewness and kurtosis plots of the EPAIG-ED.
Figure 3.
Plots of Galton’s skewness (Sk) and Moor’s kurtosis (Ku) for the EPAIG-ED.
Simulation
For the Monte Carlo (MC) simulation of the EPAIG-ED, from Equation (82) we consider that p is a uniform r.v. in (0,1). We simulate EPAIG-ED (for Set I: and for Set II: ) for times. For every sample size, we compute MLE for , , and . We consider 1000 MC replicates of this process and compute the average estimates (AEs), biases, absolute biases (ABs), and mean squares errors (MSEs). The results in Table 1 and displayed graphically in Figure 4 and Figure 5.
Table 1.
The AEs, ABs, and MSEs based on 1000 MCSs of the EPAIG-ED.
Figure 4.
Plots of ABs based on the simulation of the EPAIG-ED for sets I and II.
Figure 5.
Plots of MSEs based on the simulation of the EPAIG-ED for sets I and II.
Based on the empirical results of the simulation in Table 1, Figure 4, and Figure 5, we detect that the ABs and MSEs of MLEs , and decay toward zero as n increases. As n increases, the AEs are quite stable and tend to be close to actual parameter values. This reality assists the asymptotic normal distribution (ND) that gives an appropriate approximation to the finite sample distribution of estimates.
5.2. Estimation of the EPAIG-Exponential Model
The LLF for the sample size n with values of the EPAIG-E model for unknown parameters is
Using the MML, the elements of the SFs are given as
and
5.3. Asymptotic Confidence Bounds of the EPAIG-ED
The large sample theory of ML estimators of the EPAIG-ED gives
Here, represents convergence in the distribution, is a matrix of variability measures of the estimated parameters and is obtained from the inverse of the observed Fisher information matrix (OFIM) as approximated below
The large sample 100(1 − )% confidence intervals of , , and of the EPAIG-ED are obtained, respectively, as
where is the upper th percentile of the standard ND. The derivatives in the OFIM w.r.t , and are obtained below
5.4. Applications
This section explains the illustration of the proposed model, the EPAIG-E model, by fitting it to a real dataset. The dataset, as reported by [15], describes one of the hydrological characteristics. The storm events with observations of water runoff (mm) from a watershed in Korea are as follows: 0.90, 0.60, 16.80, 59.30, 2.0, 78.20, 30.70, 146.80, 1.80, 3.40, 1.10, 0.80, 2.50, 6.10, 17.0, 5.10, 216.20, 8.10, 1.60, 2.0, 2.0, 0.80, 0.80, 2.90, 7.30, 13.30, 181.70, 20.50, 24.10, 33.50, 89.10, 7.20, 6.0, 75.90.
In the hydrological data analysis, the probability distribution models are as follows:
- 1.
- Alpha power exponential (APE) [2]
- 2.
- Exponentiated alpha power exponential (EAPE)
- 3.
- Exponentiated exponential (EE) [3]
- 4.
- Kumaraswamy exponential (KwE) [16]
- 5.
- Exponential (Exp.)
The EPAIG-E and all compared models were fitted to runoff data and the parameters using MML were estimated. The parameter estimates along with standard errors are given in Table 2. Further, the goodness of fit (GoF) test statistics criteria, such as negative log-likelihood-L, Bayesian information criterion (IC) (BIC), Akaike IC (AIC), Kolmogorov–Smirnov (KS), p-value, Anderson–Darling (AD), and Cramer-von Mises (CM) were used to check the performances of the fitting models for runoff data. The statistics were obtained for each model and are listed in Table 3. From Table 3, we observe that the EPAIG-E model has minimum values of GoF test statistics with large p-values. Further, plots in Figure 6 favor the EPAIG-E model. Accordingly, we can conclude that the EPAIG-E model is the best compared to other models for runoff data.
Table 2.
Estimates and standard errors of parameters of the EPAIG-E model for runoff data.
Table 3.
The measures -L, AIC, BIC, KS, p-Value, CM, and AD for runoff data.
Figure 6.
Histogram and estimated pdfs (left); empirical and estimated cdfs (right) of the EPAIG-E, APE, EAPE, EE, KwE, and Exp. models for the runoff data.
6. Conclusions
This article proposes a new family called the exponentiated power alpha index generalized-G. We used the new family to develop a new model of the three parameters—called the EPAIG-E model—to extend the exponential distribution. The EPAIG-E model is motivated by the wide application of the ED in analyzing lifetime datasets. We obtained structural properties of the EPAIG-G family. The parameter estimation of the EPAIG-G family and its model is obtained by MML. Simulation results are provided to assess the method’s performance. Real data applications of the proposed model fit relatively better than APE, EAPE, EE, KwE, and Exp.
Author Contributions
Conceptualization, S.H., M.U.H. and R.A.; Data curation, M.S.R.; Investigation, S.H., M.U.H. and R.A.; Methodology, S.H., M.S.R., M.U.H. and R.A.; Software, M.U.H.; Validation, M.S.R.; Visualization, S.H.; Writing—original draft, S.H., M.S.R. and M.U.H.; Writing—review & editing, R.A. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
The data are fully available in the article and the mentioned reference.
Acknowledgments
The authors are thankful to the reviewers for their valuable corrections and suggestions that improved the article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Gupta, R.C.; Gupta, P.L.; Gupta, R.D. Modeling failure time data by Lehman alternatives. Commun. Stat.-Theory Methods 1998, 27, 887–904. [Google Scholar] [CrossRef]
- Mahdavi, A.; Kundu, D. A new method for generating distributions with an application to exponential distribution. Commun. Statist. Theory Methods 2017, 46, 6543–6557. [Google Scholar] [CrossRef]
- Gupta, R.D.; Kundu, D. Exponentiated exponential family: An alternative to gamma and Weibull distributions. Biomed. J. 2001, 43, 117–130. [Google Scholar] [CrossRef]
- Tahir, M.H.; Nadarajah, S. Parameter induction in continuous univariate distributions: Well-established G families. Acad. Bras. Cienc. 2015, 87, 539–568. [Google Scholar] [CrossRef] [PubMed]
- ElSherpieny, E.A.; Almetwally, E.M. The Exponentiated Generalized Alpha Power Exponential Distribution: Properties and Applications. Pak. J. Stat. Oper. Res. 2022, 18, 349–367. [Google Scholar] [CrossRef]
- Hussain, S.; Sajid Rashid, M.; Ul Hassan, M.; Ahmed, R. The Generalized Exponential Extended Exponentiated Family of Distributions: Theory, Properties, and Applications. Mathematics 2022, 10, 3419. [Google Scholar] [CrossRef]
- Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distribution. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
- Flajonet, P.; Odlyzko, A. Singularity analysis of generating function. SIAM J. Discr. Math. 1990, 3, 216–240. [Google Scholar] [CrossRef]
- Flajonet, P.; Sedgewich, R. Analytic Combinatorics; Cambridge University Press: Cambridge, UK, 2009; ISBN 978-0-521-89806-5. [Google Scholar]
- Rényi, A. On measures of entropy and information. Hung. Acad. Sci. 1961, 1, 547–561. [Google Scholar]
- Shannon, C.E. Prediction and entropy of printed Engish. Bell Syst. Tech. J. 1951, 30, 50–64. [Google Scholar] [CrossRef]
- Gradshteyn, I.S.; Ryzhik, I.M. Table of Integrals, Series, and Products, 7th ed.; Academic Press: San Diego, CA, USA, 2007. [Google Scholar]
- Galton, F. Enquiries into Human Faculty and its Development; Macmillan & Company: London, UK, 1883. [Google Scholar]
- Moors, J.J. A quantile alterrnative for kurtosis. Statistician 1988, 37, 25–32. [Google Scholar] [CrossRef]
- Hussain, S.; Rashid, M.S.; Ul Hassan, M.; Ahmed, R. The Generalized Alpha Exponent Power Family of Distributions: Properties and Applications. Mathematics 2022, 10, 1421. [Google Scholar] [CrossRef]
- Adepoju, K.A.; Chukwu, O.I. Maximum Likelihood Estimation of the Kumaraswamy Exponential Distribution with Applications. J. Mod. Appl. Stat. Methods 2015, 14, 208–214. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).