Abstract
Given the increasing number of phenomena that demand interpretation and investigation, developing new distributions and families of distributions has become increasingly essential. This article introduces a novel family of distributions based on the exponentiated reciprocal of the hazard rate function named the new Lomax-G family of distributions. We demonstrate the family’s flexibility to predict a wide range of lifetime events by deriving its cumulative and probability density functions. The new Lomax–Weibull distribution (NLW) is studied as a sub-model, with analytical and graphical evidence indicating its efficiency for reliability analysis and complex data modeling. The NLW density encompasses a variety of shapes, such as symmetrical, semi-symmetrical, right-skewed, left-skewed, and inverted J shapes. Furthermore, its hazard function exhibits a broad range of asymmetric forms. Five estimation techniques for determining the parameters of the proposed NLW distribution include the maximum likelihood, percentile, least squares, weighted least squares, and Cramér–von Mises methods. The performance of the estimators of the studied inferential methods is investigated through a comparative Monte Carlo simulation study and numerical demonstration. Additionally, the effectiveness of the NLW is validated by means of four real-world datasets. The results indicate that the NLW distribution provides a more accurate fit than several competing models.
1. Introduction
Statistical distributions effectively describe, analyze, and predict many global phenomena in applied fields such as medicine, engineering, economics, and insurance. Accurately analyzing and interpreting the data requires identifying the appropriate statistical distribution, as the validity of inferences and conclusions depends on the optimal distribution choice.
Classical and traditional distributions often fail to describe complex real-world data clearly and accurately. Therefore, it is essential to develop new techniques that generate families of statistical distributions that are more flexible and accurate in modeling diverse data types. Many statisticians are focused on devising methods to generate such families of distributions. One of the most notable of these families is the transformed-transformer (T-X) family, introduced by [1], a fully generalized family that is not restricted to any specific probability distribution. This family consists of two components used to create a new generalized family: the first component is the function representing the upper limit of integration, , and the second component is the function to be integrated, where any probability distribution can be applied in either part randomly to form new families. Numerous studies have used this approach to develop new distributions and families of distributions. Examples include the Weibull–Pareto distribution introduced by [2], the exponentiated T-X distribution proposed by [3], and the Weibull–Normal distribution proposed by [4]. Furthermore, the generalized odd log-logistic family by [5], the Nadarajah Topp Leone-G family by [6], the odd Lomax-G family by [7], the modified T-X family by [8], and the exponential T-X family by [9]. More recently, [10] combined the Marshall–Olkin transformation with the T-X family to introduce the Marshall–Olkin Weibull family, and [11] proposed the generalized logarithmic–X family.
In a very recent study by [12], a new generalized family of distributions (NGF) was introduced using , where is known as the Mills ratio and is equivalent to the inverse of the hazard rate function [13]. The cumulative distribution function (CDF) and probability distribution function (PDF) of the NGF are as follows:
Various applied disciplines can benefit from the increased flexibility of this method when modeling real-world data. Consequently, researchers have used it to create novel distributions. For example, [14] and colleagues introduced the survival-weighted Pareto distribution, while [15] proposed the reflected Pareto distribution.
The primary goal of this study is to propose a new family of distributions using the exponentiated reciprocal of the hazard rate function. It is important to note that the upper limit of the integral for creating the NGF family, given in (1), is . Different upper limits of the integral can be defined to generate various families of distributions [2]. In this study, we define a new upper limit of the integral by introducing a shape parameter, , such that the upper limit is set to . This will lead to new families of distributions with CDF and PDF defined as follows:
where and represent the PDF and CDF of any probability distribution and is a monotonically increasing function representing the exponentiated reciprocal of the hazard rate function for a baseline distribution.
This article introduces the new generalized Lomax (NL-G) family based on the exponentiated reciprocal of the hazard rate function. We explore a specific member of the NL-G family, the new Lomax–Weibull distribution (NLW). The NLW distribution is highly effective for modeling real-world data across a wide range of applied disciplines, as it integrates features from both the Lomax family and the Weibull distribution. This combination offers wider applicability than the parent distributions, along with superior goodness of fit, improved estimation adaptability, and enhanced mathematical flexibility. Its versatility allows it to analyze diverse datasets in fields such as biology, computer science, marketing, medicine, economics, behavioral sciences, engineering, actuarial studies, environmental sciences, and lifetime analysis. This adaptability is evident in its ability to accommodate various shapes for both density and hazard functions. The remainder of this article is presented in Figure 1.
Figure 1.
The organizational structure for the remaining sections of this article.
2. New Lomax-G Family of Distribution
Let be a random variable following the Lomax distribution with a shape parameter and a scale parameter . Then, the CDF is
The PDF corresponding to Equation (5) is
Thus, for the Lomax distribution can be defined as follows:
Using the of the Lomax distribution and Equations (3) and (4), we obtain the CDF and PDF of the NL-G family as follows:
where is a location parameter.
3. The New Lomax–Weibull Distribution
In this section, we study a new special member of the NL-G family, the NLW, using the Weibull distribution as a baseline. The CDF of the NLW distribution can be obtained as
Therefore, the PDF of the NLW model is obtained by differentiating Equation (10) as follows:
The survival function, S(x), for the NLW is given by
and the hazard rate function, H(x), for the NLW is expressed as
3.1. Graphical Presentations of the NLW
The plots of the density function and for the NLW, based on some selected parameter values, are shown in Figure 2 and Figure 3, respectively.
Figure 2.
The NLW density plots for some values of , , , and c.
Figure 3.
The NLW H(x) plots for some values of , , , and c.
Figure 2 illustrates that the density of the NLW can take on multiple forms, such as symmetrical, right-skewed, left-skewed, and inverted J-shapes. Additionally, the H(x) function of the NLW exhibits a diverse range of asymmetric shapes, including increasing, decreasing, J-shaped, and reversed J-shaped forms. These observations highlight the high flexibility of the NLW.
3.2. Special Cases of the NLW Distribution
3.3. Useful Form of the NLW Density
A simple expansion of the PDF for the NLW distribution in (11) is provided below, using its series representation:
Consider the following binomial series representation:
We have
Therefore,
4. Some Statistical Properties of the NLW
4.1. Quantile Function
The pth quantile function, say, , of the NLW can be obtained as
As a result, by setting in Equation (17), we can derive the median of the NLW as follows:
The measures of deviation and skewness are calculated from the quantities using the following equations:
and
where is the quantile function.
4.2. Moments
If , then the moment of X can be obtained as
Substituting , we will have
Then, by employing the following series expansion
Therefore, the moment of the NLW can be written as
The mean and variance of NLW are expressed as
4.3. Moment Generating Function
The moment generating function (MGF) of the random variable x is defined as
Thus, we obtain the MGF as follows:
4.4. Characteristic Function
The characteristic function is defined by
Hence, the characteristic function of the NLW is given by
4.5. Probability Weighted Moment
4.6. Order Statistics
4.7. R’enyi Entropy
R’enyi entropy of order u is given by
Therefore, the Rényi entropy of the NLW distribution is given by
where .
4.8. Shannon Entropy
5. Estimation Methods
The parameters of the NLW distribution are estimated using five estimation methods: the maximum likelihood method (MLE), the percentile method (PE), the least squares method (LS), the weighted least squares method (WLS), and the Cramér–von Mises minimum distance method (CVM).
5.1. Maximum Likelihood Method
Let represent a random sample of size n drawn from a NLW distribution. The log-likelihood function (l) for the parameter vector can be expressed as follows:
The MLE estimates of the parameters can then be derived by differentiating (36) with respect to each parameter and setting the resulting equations to zero, yielding the following:
5.2. Percentiles Method
Consider a random sample of size n drawn from an NLW. The percentile estimation (PE) method is formulated as
where
To estimate the unknown parameters , the PE method minimizes the squared deviations between the observed sample order statistics and the expected values derived from the specified percentiles.
5.3. Ordinary and Weighted Least Squares Estimators
Let be a random sample of size n drawn from NLW. The ordinary least squares estimator (LSE) method can be expressed as follows:
by minimizing the sum of squares errors for the unknown parameters. Thus, the ordinary least squares (LSE) estimator for the unknown parameters is obtained by minimizing the following quantity:
and the weighted least squares estimators (WLSs) of can be obtained by minimizing the following expression:
5.4. Ke Cramer–von Mises Minimum Distance Method
The Ke Cramer–von Mises minimum distance estimator (CVM) method relies on the discrepancy between the estimated CDF and the empirical CDF. CVM estimators are derived by minimizing this discrepancy.
By substituting Equation (10) into (46), the CVM estimator for , , , , and c can be obtained by minimizing the following expression:
6. Simulation Study
This section presents numerical results from a Monte Carlo simulation to compare estimation methods, specifically MLE, PE, LSE, WLS, and CVM, in assessing the performance of these methods with respect to the NLW parameters.
- Data are generated from the NLW distribution as described in Equation (17), with .
- We examine multiple sample sizes (20, 50, 100, 200, 300, and 500) from the NLW, each under 1000 repetitions.
- Four distinct sets of parameter values are defined as follows:
- -
- Set I: (
- -
- Set II:
- -
- Set III:
- -
- Set IV:
Using the “optim” function in the R statistical software (v4.3.2; R Core Team 2023) [16], we calculated parameter estimates () as well as the mean bias (Bias) and the mean squared error (MSE) defined as follows:
where is the true value of the parameter, is the estimate of the parameter, and n is the sample size.
The simulation results are presented in Table 1, Table 2, Table 3 and Table 4.
Table 1.
Simulation study at = 1.65, = 0.055, = 0.59, = 1.98 and c = 2.5.
Table 2.
Simulation study at = 1.9, = 0.07, = 2.5, = 1.2 and c = 1.9.
Table 3.
Simulation study at and .
Table 4.
Simulation study at and .
It can be observed from Table 1, Table 2, Table 3 and Table 4 that the mean estimates of the NLW parameters approach the true values of the parameters () as the sample size increases, reflecting a clear improvement in the accuracy of the estimates with increasing data. In addition, Figure 4, Figure 5, Figure 6 and Figure 7 show a comparison of estimation methods based on MSEs at different sample sizes. Consequently, the ML and WLS methods are more accurate than other methods in terms of accuracy and consistency when predicting unknown parameters of the NLW distribution. All analyses were performed using R Statistical Software (v4.3.2; R Core Team 2023) [16].
Figure 4.
Comparison of estimation methods based on MSE across different sample sizes for Table 1.
Figure 5.
Comparison of estimation methods based on MSE Across different sample sizes for Table 2.
Figure 6.
Comparison of estimation methods based on MSE Across different sample sizes for Table 3.
Figure 7.
Comparison of estimation methods based on MSE across different sample sizes for Table 4.
7. Applications
The NLW model has been applied to four real datasets to demonstrate its effectiveness in modeling real data compared to other distributions.
- Failure Time Data:
- The first dataset, obtained from [17], represents 84 recorded failure times of aircraft windshields, described as follows: 0.040, 1.866, 2.385, 3.443, 0.301, 1.876, 2.481, 3.467, 0.309, 1.899, 2.610, 3.478, 0.557, 1.911, 2.625, 3.578, 0.943, 1.912, 2.632, 3.595, 1.070, 1.914, 2.646, 3.699, 1.124, 1.981, 2.661, 3.779, 1.248, 2.010, 2.688, 3.924, 1.281, 2.038, 2.823, 4.035, 1.281, 2.085, 2.890, 4.121, 1.303, 2.089, 2.902, 4.167, 1.432, 2.097, 2.934, 4.240, 1.480, 2.135, 2.962, 4.255, 1.505, 2.154, 2.964, 4.278, 1.506, 2.190, 3.000, 4.305, 1.568, 2.194, 3.103, 4.376, 1.615, 2.223, 3.114, 4.449, 1.619, 2.224, 3.117, 4.485, 1.652, 2.229, 3.166, 4.570, 1.652, 2.300, 3.344, 4.602, 1.757, 2.324, 3.376, 4.663.
- Gauge Lengths of 10 mm Data:
- The second dataset was obtained from [18] and consists of 63 observations: 1.901, 2.132, 2.203, 2.228, 2.257, 2.350, 2.361, 2.396, 2.397, 2.445, 2.454, 2.474, 2.518, 2.522, 2.525, 2.532, 2.575, 2.614, 2.616, 2.618, 2.624, 2.659, 2.675, 2.738, 2.740, 2.856, 2.917, 2.928, 2.937, 2.937, 2.977, 2.996, 3.030, 3.125, 3.139, 3.145, 3.220, 3.223, 3.235, 3.243, 3.264, 3.272, 3.294, 3.332, 3.346, 3.377, 3.408, 3.435, 3.493, 3.501, 3.537, 3.554, 3.562, 3.628, 3.852, 3.871, 3.886, 3.971, 4.024, 4.027, 4.225, 4.395, 5.020.
- Strength Data:
- The third dataset, sourced from [19], includes 63 observations: 0.55, 0.74, 0.77, 0.81, 0.84, 1.24, 0.93, 1.04, 1.11, 1.13, 1.30, 1.25, 1.27, 1.28, 1.29, 1.48, 1.36, 1.39, 1.42, 1.48, 1.51, 1.49, 1.49, 1.50, 1.50, 1.55, 1.52, 1.53, 1.54, 1.55, 1.61, 1.58, 1.59, 1.60, 1.61, 1.63, 1.61, 1.61, 1.62, 1.62, 1.67, 1.64, 1.66, 1.66, 1.66, 1.70, 1.68, 1.68, 1.69, 1.70, 1.78, 1.73, 1.76, 1.76, 1.77, 1.89, 1.81, 1.82, 1.84, 1.84, 2.00, 2.01, 2.24.
- Student Grades in Mathematics Data:
- The fourth dataset represents the mathematical scores of 48 students in the slow pace program in the year 2013, sourced from [20]. The data are as follows: 29, 25, 50, 15, 13, 27, 15, 18, 7, 7, 8, 19, 12, 18, 5, 21, 15, 86, 21, 15, 14, 39, 15, 14, 70, 44, 6, 23, 58, 19, 50, 23, 11, 6, 34, 18, 28, 34, 12, 37, 4, 60, 20, 23, 40, 65, 19, and 31.
The suitability of the NLW model for the four datasets is evaluated by comparing its fit with the following distributions:
- Lomax distribution with the CDF function given in (5).
- Truncated Weibull power Lomax distribution (TWPL) by [21]
- Odd Lomax inverse Weibull distribution (OILW) by [22]
- The exponentiated generalized modified Weibull distribution (EGMW) by [23]
ML estimates, standard error (SE), and the length of confidence intervals of the NLW model for the four datasets are presented in Table 5, offering a deeper statistical understanding of the data.
Table 5.
Parameter estimates, SE in (), and confidence intervals’ length.
Additionally, the results in Table 6, Table 7, Table 8 and Table 9 show that the NLW model consistently achieves the lowest goodness-of-fit (GoF) criteria, including the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), Consistent Akaike Information Criterion (CAIC), the Hannan–Quinn Information Criterion (HQIC), the Kolmogorov–Smirnov (K-S) test, and the Anderson-Darling (A-D) test, indicating it provides the best fit to the data compared to other competing distributions. Furthermore, the distributions were compared using the p-values obtained from the K-S test, which revealed that the NLW model achieved the highest p-value, further supporting its superior fit. These results underscore the robustness of the NLW model in accurately capturing the characteristics of the data.
Table 6.
GOF criteria for the failure time data.
Table 7.
GOF criteria for the gauge data.
Table 8.
GOF criteria for strength data.
Table 9.
GOF criteria for student grades.
Figure 8, Figure 9, Figure 10 and Figure 11 illustrate the PDFs and CDFs for all datasets, demonstrating that the NLW model fits the data exceptionally well, effectively capturing the skewness of the four data compared to other distributions.
Figure 8.
Estimated PDF and CDF for failure time data.
Figure 9.
Estimated PDF and CDF for gauge data.
Figure 10.
Estimated PDF and CDF for the strength data.
Figure 11.
Estimated PDF and CDF for the student grades data.
8. Conclusions
This paper introduces a novel family based on the exponentiated reciprocal of the hazard rate function called the new Lomax generalized family (NL-G). A notable submodel within this family is the new Lomax–Weibull (NLW) distribution, which exhibits remarkable adaptability and is capable of fitting diverse datasets. The NLW’s density function is highly versatile, taking on various forms, including symmetric, semi-symmetric, inverted J-shape, right-skewed, and left-skewed shapes. Additionally, its hazard function exhibits a broad range of asymmetric shapes, such as increasing, decreasing, J-shaped, and inverted J-shaped forms, demonstrating a great deal of flexibility for real-world applications. The article outlines several key statistical properties of the NLW, such as quantiles, medians, moments, characteristic function, order statistics, and Rényi and Shannon entropies. We perform parameter estimation using the MLE, PE, LSE, WLS, and CVM methods, and their performance is evaluated through Monte Carlo simulation. The MLE and WLS methods demonstrated superior performance and consistency in parameter prediction. The practicality of the NLW model is showcased through its application to four real-world datasets, where it outperforms other established distributions, highlighting its exceptional ability to model and interpret complex data patterns across various domains.
Author Contributions
Conceptualization, H.B. and L.B.; Data curation, H.B.; Formal analysis, H.B.; Methodology, L.B. and H.B.; Software, H.A. and L.B.; Validation, H.A. and L.B.; Writing—original draft, H.A.; Writing—review and editing, L.B. and H.B.; visualization, H.A.; Supervision, L.B. and H.B. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
Data is contained within the article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
- Alzaatreh, A.; Famoye, F.; Lee, C. Weibull–Pareto distribution and its applications. Commun. Stat. Theory Methods 2013, 42, 1673–1691. [Google Scholar] [CrossRef]
- Alzaghal, A.; Famoye, F.; Lee, C. Exponentiated TX family of distributions with some applications. Int. J. Stat. Probab. 2013, 2, 31. [Google Scholar] [CrossRef]
- Famoye, F.; Akarawak, E.; Ekum, M. Weibull–Normal distribution and its applications. J. Stat. Theory Appl. 2018, 17, 719–727. [Google Scholar] [CrossRef]
- Cordeiro, G.M.; Alizadeh, M.; Ozel, G.; Hosseini, B.; Ortega, E.M.M.; Altun, E. The generalized odd log-logistic family of distributions: Properties, regression models and applications. J. Stat. Comput. Simul. 2017, 87, 908–932. [Google Scholar] [CrossRef]
- Reyad, H.; Selim, M.A.; Othman, S. The Nadarajah Haghighi Topp Leone-G Family of distributions with mathematical properties and applications. Pak. J. Stat. Oper. Res. 2019, 15, 849–866. [Google Scholar] [CrossRef]
- Cordeiro, G.; Afify, A.; Ortega, E.; Suzuki, A.; Mead, M. The odd Lomax generator of distributions: Properties, estimation and applications. J. Comput. Appl. Math. 2019, 347, 222–237. [Google Scholar] [CrossRef]
- Aslam, M.; Asghar, Z.; Hussain, Z.; Shah, S.F. A modified TX family of distributions: Classical and Bayesian analysis. J. Taibah Univ. Sci. 2020, 14, 254–264. [Google Scholar] [CrossRef]
- Ahmad, Z.; Mahmoudi, E.; Alizadeh, M.; Roozegar, R.; Afify, A. The exponential TX family of distributions: Properties and an application to insurance data. J. Math. 2021, 2017, 1–18. [Google Scholar]
- Klakattawi, H.; Alsulami, D.; Elaal, M.; Dey, S.; Baharith, L. A new generalized family of distributions based on combining Marshal-Olkin transformation with TX family. PLoS ONE 2022, 17, e0263673. [Google Scholar] [CrossRef] [PubMed]
- Shah, Z.; Khan, D.; Khan, Z.; Faiz, N.; Hussain, S.; Anwar, A.; Ahmad, T.; Kim, K. A New Generalized Logarithmic–X Family of Distributions with Biomedical Data Analysis. Appl. Sci. 2023, 13, 3668. [Google Scholar] [CrossRef]
- Ahmad, M.; Jabeen, R.; Zaka, A.; Hamdi, W.; Alnssyan, B. A unified generalized family of distributions: Properties, inference, and real-life applications. AIP Adv. 2024, 14, 015043. [Google Scholar] [CrossRef]
- Mills, J.P. Table of the ratio: Area to bounding ordinate, for any portion of normal curve. Biometrika 1926, 18, 395–400. [Google Scholar] [CrossRef]
- Nagy, M.; Ahmad, M.; Jabeen, R.; Zaka, A.; Alrasheedi, A.; Mansi, A.H. Evaluation of survival weighted Pareto distribution: Analytical properties and applications to industrial and aeronautics data. AIP Adv. 2024, 14, 045140. [Google Scholar] [CrossRef]
- Jabeen, R.; Ahmad, M.; Zaka, A.; Mansour, M.M.; Aljadani, A.; Abd Elrazik, E.M. A New Statistical Approach Based on the Access of Electricity Application with Some Modified Control Charts. J. Math. 2024, 1, 6584791. [Google Scholar] [CrossRef]
- R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023; Available online: https://www.R-project.org/ (accessed on 25 January 2024).
- Tahir, M.H.; Cordeiro, G.M.; Mansoor, M.; Zubair, M. The Weibull-Lomax distribution: Properties and applications. Hacet. J. Math. Stat. 2015, 44, 455–474. [Google Scholar] [CrossRef]
- Kundu, D.; Raqab, M.Z. Estimation of R = P (Y< X) for three-parameter Weibull distribution. Stat. Probab. Lett. 2009, 79, 1839–1846. [Google Scholar]
- Smith, R.L.; Naylor, J. A comparison of maximum likelihood and Bayesian estimators for the three-parameter Weibull distribution. J. R. Stat. Soc. Ser. Appl. Stat. 1987, 36, 358–369. [Google Scholar] [CrossRef]
- ZeinEldin, R.A.; Ahsan ul Haq, M.; Hashmi, S.; Elsehety, M. Alpha power transformed inverse Lomax distribution with different methods of estimation and applications. Complexity 2020, 2020, 1860813. [Google Scholar] [CrossRef]
- Al-Marzouki, S. Truncated Weibull power Lomax distribution: Statistical properties and applications. J. Nonlinear Sci. Appl. 2019, 12, 543–551. [Google Scholar] [CrossRef]
- Almetwally, E.M. Application of COVID-19 pandemic by using odd lomax-G inverse Weibull distribution. Math. Sci. Lett. 2021, 10, 47–57. [Google Scholar]
- Aryal, G.; Elbatal, I. On the exponentiated generalized modified Weibull distribution. Commun. Stat. Appl. Methods 2015, 22, 333–348. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).