Abstract
The unit–power Burr X distribution (UPBXD), a bounded version of the power Burr X distribution, is presented. The UPBXD is obtained through the inverse exponential transformation of the power Burr X distribution and is useful for modelling data on the unit interval. A comprehensive analysis of its key characteristics is performed, including shape analysis of the primary functions, analytical expressions for the moments, the quantile function, incomplete moments, stochastic ordering, and stress–strength reliability. The Rényi, Havrda and Charvat, and d-generalized entropies, which are measures of uncertainty, are also obtained. The model’s parameters are estimated using a Bayesian approach under symmetric and asymmetric loss functions. Bayesian credible intervals are constructed based on the marginal posterior distribution. Because the Bayesian estimators have complex forms, a Monte Carlo simulation study is carried out to assess the accuracy of the various estimators under several measures. Finally, applications to COVID-19 data from Saudi Arabia and the United Kingdom show that the new distribution fits better than certain competing models.
Keywords:
power Burr X distribution; entropy; Bayesian estimation; Metropolis–Hastings; COVID-19 data
MSC:
60E05; 60E10; 62F15; 62F25
1. Introduction
Using differential equations, Burr [1] introduced twelve distributions. Of these, the Burr type XII and the single-parameter Burr type X distributions have drawn much interest in the literature. Surles and Padgett [2] proposed the two-parameter Burr X distribution (BXD), often known as the generalized Rayleigh distribution. For data modelling, the BXD can be used as an alternative to the Weibull and Rayleigh distributions. The model has a considerable impact on the prediction of failure rates and has generated a lot of interest in modelling across a wide range of disciplines, including hydrology, medicine and reliability analysis. The cumulative distribution function (CDF) of the BXD is given by:
where b > 0 and a > 0 are the shape and scale parameters, respectively. The probability density function (PDF) associated with (1) is given by:
According to Raqab and Kundu [3], the shape parameter (b) determines whether the hazard rate function (HF) of the BXD is a bathtub or an increasing function. The HF is bathtub for and is an increasing function for In the literature, numerous studies have been undertaken in recent years to create modified or generalized forms of the BXD in order to increase the viability of BXDs, see, for example, [4,5,6,7,8,9]. Our focus is on the recently established power BXD (PBXD) by Usman and Ilyas [10], with an additional shape parameter that depends on the transformation The CDF and PDF of the PBXD are, respectively, given by:
For the CDF (3) reduces to BXD. Usman and Ilyas [10] mentioned that, subject to certain restrictions, their model can handle both symmetrical and heavy-tailed skewed data sets.
A significant challenge in data modelling is the selection of an adequate lifetime probability model. Over time, a variety of probability models have been proposed for the analysis of data sets in fields including the medical sciences, actuarial sciences, engineering, finance and insurance, demography, biological sciences, and economics. In many practical scenarios, we are required to deal with the uncertainty of bounded situations. We commonly encounter variables that fall within the range (0, 1), such as the percentage of a particular trademark, the results of some capacity tests, different lists, and rates. To model these variables effectively, continuous unit distributions, that is, probability distributions with support on (0, 1), are crucial. For this reason, some authors have recently concentrated on creating distributions defined on a bounded interval by modifying a parent distribution. Among distributions defined on the (0, 1) interval, the beta distribution is the best known. The beta distribution is helpful for modelling data on the unit interval, but other distributions have also been proposed and studied over time, for example the Topp–Leone distribution (see [11]) and the Kumaraswamy distribution (see [12]). More recently, the idea of constructing a unit-interval counterpart of any continuous distribution has attracted the interest of statisticians. The following are a few of the most practical unit–interval distributions: the log–Lindley (Gómez-Déniz et al. [13]), unit–Birnbaum–Saunders (Mazucheli et al. [14]), unit–inverse Gaussian (Ghitany et al. [15]), unit–Lindley (Mazucheli et al. [16]), unit–BurrIII (Modi and Gill [17]), unit–Weibull (Mazucheli et al. [18]), unit–Burr XII (Korkmaz and Chesneau [19]), unit–odd Fréchet power function (Haq et al. [20]), unit–Teissier (Krishna et al. [21]), unit–exponentiated exponential (Jha et al. [22]) and unit–exponentiated half-logistic (Hassan et al. [23]), among others.
In this study, we propose a new unit probability distribution, based on the PBXD, that has three parameters. A new unit-PBXD (UPBXD) is provided based on the transformation where Y represents the PBXD. The UPBXD has the following desirable characteristics:
- The UPBXD is a flexible model and can be used to describe a variety of data sets with values between zero and one.
- The density function of the UPBXD takes several shapes, including unimodal, reversed J-shaped, U-shaped, left-skewed, and symmetric (see Section 2).
- The HF of the UPBXD can be increasing, J-shaped, or bathtub-shaped (see Section 2).
- We derive some of the most important statistical characteristics of the UPBXD, such as the analytical expression for moments, the quantile function, incomplete moments, stochastic ordering, some uncertainty measures, and stress–strength reliability.
- The parameters of the UPBXD are estimated using a Bayesian technique, and Bayesian credible intervals are constructed.
- An extensive simulation study is conducted to examine the effectiveness of the estimators based on accuracy criteria.
- Applications to COVID-19 data sets from Saudi Arabia and the United Kingdom are used to show the superiority of the proposed model over other well-known models.
The rest of the paper is organized as follows. Section 2 defines the proposed distribution. The distributional characteristics of the UPBXD are covered in Section 3. The maximum likelihood (ML) and Bayesian estimators under various loss functions are covered in Section 4. The performance of the proposed point and interval estimators is assessed using a Monte Carlo simulation in Section 5. Section 6 shows that the UPBXD outperforms the other unit distributions when applied to COVID-19 data. Section 7 concludes the paper.
2. Unit Power Burr X Distribution
In this section, we present the UPBXD, a new bounded distribution with support on (0, 1), which results from applying the inverse exponential transformation to Y, where Y follows the PBXD. The CDF of the UPBXD can be obtained as follows:
which gives
Based on (5), the CDF equals 0 for w ≤ 0 and equals 1 for w ≥ 1. The PDF of the UPBXD related to (5) is obtained as follows:
A random variable with PDF (6) is represented by UPBXD For b = 1, the PDF (6) gives UBXD as a new sub-model. The following is the HF of the UPBXD:
Figure 1 and Figure 2 show plots of the PDF (6) and the HF (7) for various parameter choices, to give a general overview of their shapes.
Figure 1.
Plots of various PDF shapes of the UPBXD for different parameter values.
Figure 2.
Plots of various HF shapes of the UPBXD for different parameter values.
In Figure 1, the PDF graphs for various parameter combinations display a variety of shapes, such as (a = 2, b = 2) symmetric normal, (a = 0.5, b = 0.3) U-shaped, (a = 0.5, b = 2) right-skewed, (a = 2, b = 0.3) J-shaped, and (a = 2, b = 2) normal tapered. In Figure 2, the UPBXD’s HF shapes in (a = 0.5, b = 2), (a = 2, b = 0.3), and (a = 2, b = 2) have increasing and J shapes, while (a = 0.5, b = 0.3) has a bathtub shape.
The parameter is responsible for the bathtub shapes given that the other two parameters (a and b) are less than one. The parameter is responsible for the J shapes where a > 1 and b < 1.
By inverting (5), we can get the quantile function (QF) of the UPBXD, which looks like this:
where q is a uniform random variable on (0, 1). The first quartile, the median, and the third quartile are obtained by setting q = 0.25, 0.5, and 0.75 in (8), respectively. Random values from the UPBXD are easily simulated from (8).
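The inversion and simulation steps described above can be sketched in Python. Since the displayed equations are not reproduced in this text, the sketch assumes one plausible parameterization: Y has CDF G(y) = [1 − exp(−(scale·y^power)^2)]^shape and W = exp(−Y), so that the CDF of W is 1 − G(−ln w). The argument names `scale`, `shape` and `power` are hypothetical labels rather than the paper's notation, and the formulas should be checked against (5) and (8) before any real use.

```python
import math
import random

def upbxd_cdf(w, scale, shape, power):
    """CDF of W = exp(-Y) under the ASSUMED PBXD form
    G(y) = [1 - exp(-(scale * y**power)**2)]**shape."""
    if w <= 0.0:
        return 0.0
    if w >= 1.0:
        return 1.0
    y = -math.log(w)
    z = (scale * y ** power) ** 2
    return 1.0 - (-math.expm1(-z)) ** shape

def upbxd_quantile(q, scale, shape, power):
    """Invert the assumed CDF: return w with F(w) = q, for 0 < q < 1."""
    # (1 - exp(-z))**shape = 1 - q, solved for z in a numerically stable way
    z = -math.log(-math.expm1(math.log1p(-q) / shape))
    y = (math.sqrt(z) / scale) ** (1.0 / power)
    return math.exp(-y)

def upbxd_sample(n, scale, shape, power, rng=random):
    """Inverse-transform sampling: push uniforms through the quantile function."""
    out = []
    while len(out) < n:
        q = rng.random()
        if q > 0.0:  # the quantile function needs 0 < q < 1
            out.append(upbxd_quantile(q, scale, shape, power))
    return out
```

Setting q = 0.25, 0.5 and 0.75 in `upbxd_quantile` gives the three quartiles under the assumed form.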
3. The UPBXD’s Properties
In this section, we examine some structural characteristics of the UPBXD, namely moment measures, information measures, stochastic ordering (SO), and stress–strength (SS) reliability.
3.1. Some Moment Measures
The mth moment for W~UPBXD is determined as follows:
Using the binomial expansion in (9) provides
Let then the mth moment of W, is given by
Use the exponential expansion then , obtains the following form:
where, is a gamma function. Furthermore, the mth central moment of W, is defined by
Some moments measures including, first four moments, variance (), coefficient of skewness () and coefficient of kurtosis () for the UPBXD are calculated for specific parameter values. Table 1 provides these measures considering parameter values as: (i) (ii) (iii) (iv) (v) (vi) and (vii)
Table 1.
Several UPBXD moment values.
Table 1 displays that the UPBXD is right- and left- skewed in accordance with the values of Additionally, the distribution is leptokurtic and platykurtic according to the values of Figure 3 shows the 3-dimensional plots for coefficient of skewness and kurtosis for UPBXD with different values of parameters. Looking at Figure 3, we can see that the coefficient of skewness and kurtosis increases when b and increases, while a increases then the coefficient of skewness decreases and coefficient of kurtosis increases.
Figure 3.
Coefficient of skewness and kurtosis for UPBXD.
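As a numerical cross-check on moment values such as those in Table 1, the raw moments can be approximated from any quantile function Q through the identity E[W^m] = ∫ Q(q)^m dq over (0, 1). The following is a minimal sketch with hypothetical function names; it uses a midpoint rule instead of the series expansion derived in the text, and it is not the authors' code.

```python
def raw_moment(quantile, m, n_grid=20000):
    """Approximate E[W**m] = integral_0^1 Q(q)**m dq with the midpoint rule."""
    h = 1.0 / n_grid
    return h * sum(quantile((i + 0.5) * h) ** m for i in range(n_grid))

def moment_summary(quantile, n_grid=20000):
    """Mean, variance, skewness and kurtosis from the first four raw moments."""
    m1, m2, m3, m4 = (raw_moment(quantile, m, n_grid) for m in (1, 2, 3, 4))
    var = m2 - m1 ** 2
    mu3 = m3 - 3 * m1 * m2 + 2 * m1 ** 3                      # third central moment
    mu4 = m4 - 4 * m1 * m3 + 6 * m1 ** 2 * m2 - 3 * m1 ** 4   # fourth central moment
    return {"mean": m1, "variance": var,
            "skewness": mu3 / var ** 1.5, "kurtosis": mu4 / var ** 2}
```

Passing the QF in (8), with the parameters fixed, as `quantile` would give approximations to compare with the tabulated values.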
Furthermore, the mth lower incomplete moment, say of the UPBXD is given by:
Let and using the binomial expansion, then the mth incomplete moment of W is
Using exponential expansion and after simplification, the mth moment is as below:
where is an upper incomplete gamma function. The Lorenz and Bonferroni curves are well-known applications of the first incomplete moment. In the fields of economics, demographics, insurance, engineering, and medicine, these curves are especially helpful.
3.2. Information Measures
In this sub-section, we examine the entropies of Rényi, Havrda and Charvat, as well as d-generalized entropy as information metrics. These measures collectively provide information about the system’s overall amounts of data. The Rényi entropy presented by Rényi [24], is conceptually the quantity of information contained in a random process, it is defined by:
Inserting (6) in (10), and using binomial expansion, then is as follows:
Let and using exponential expansion in (11), we obtain
where
Reference [25] proposed another uncertainty measure, the Havrda and Charvat. Here we assume and this is represented mathematically by:
Using the same procedure above, we obtain as follows:
In reference [26], a further generalized Shannon entropy form known as d-generalized entropy was developed. It is represented mathematically as below:
Using a similar way as above, we obtain the d-generalized entropy as follows,
where
We use the following sets of parameters to provide entropy numerical values for the measurements under consideration: (i) (ii) (iii) (iv) (v) (vi) and (vii) Table 2 provides some numerical values for the provided three entropy measures.
Table 2.
Numerical values for the UPBXD’s entropy measures.
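Entropy values of this kind can be checked numerically. The sketch below covers only the Rényi entropy and uses its standard definition, (1/(1 − ρ)) log ∫ f(w)^ρ dw, with a midpoint rule on (0, 1). The function name and the argument `rho` are hypothetical, the notation may differ from (10), and the rule is not suitable for densities that are unbounded at the endpoints (for example, U-shaped cases).

```python
import math

def renyi_entropy(pdf, rho, n_grid=20000):
    """Renyi entropy of order rho (rho > 0, rho != 1) for a density on (0, 1):
    (1 / (1 - rho)) * log(integral_0^1 pdf(w)**rho dw), via the midpoint rule."""
    if rho <= 0 or rho == 1:
        raise ValueError("rho must be positive and different from 1")
    h = 1.0 / n_grid
    integral = h * sum(pdf((i + 0.5) * h) ** rho for i in range(n_grid))
    return math.log(integral) / (1.0 - rho)
```

For the uniform density the result is zero, which gives a quick sanity check before the UPBXD density in (6) is plugged in.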
3.3. Stochastic Ordering
The statistical literature places great emphasis on the ordering of distributions, especially among lifetime distributions. A significant part of the ranking of various lifetime distributions is found in Johnson et al. [27]. Here, we consider four distinct stochastic orders for two independent UPBXD random variables with a restricted parameter space: the usual stochastic order, the hazard rate order, the mean residual life order, and the likelihood ratio order. Recall that a family has the monotone likelihood ratio property if it has a likelihood ratio ordering. This implies that, when the other parameters are known, a uniformly most powerful test exists for any one-sided hypothesis. Let F1 and F2, h1 and h2, m1 and m2, and f1 and f2 denote the CDFs, HFs, mean residual life functions and PDFs of two independent random variables W1 and W2, respectively. According to Shaked and Shanthikumar [28], W1 is said to be smaller than W2 in the
- Stochastic order (W1 ≤st W2) if F1(w) ≥ F2(w) ∀w
- Hazard rate order (W1 ≤hr W2) if h1(w) ≥ h2(w) ∀w
- Mean residual life order (W1 ≤mrl W2) if m1(w) ≤ m2(w) ∀w
- Likelihood ratio order (W1 ≤lr W2) if f1(w)/f2(w) decreases in w.
Assume that Wi, i = 1, 2 have the UPBXD with parameters Further, assume that and indicate, respectively, Wi’s CDF and PDF.
If is a decreasing function ∀ w, then, in terms of likelihood ratio order; W1 is said to be stochastically less than W2 (W1 ≤ lrW2)
Let W1~UPBXD and W2~UPBXD then the likelihood ratio ordering is as follows:
For we get for all hence is decreasing in w and hence W1 ≤lr W2. Moreover, W1 is said to be smaller than W2 in other orderings such as SO (W1 ≤ stW2), HF(W1 ≤ hrW2), and mean residual order (W1 ≤ mrlW2).
3.4. Stress–Strength Reliability
In statistical literature, the term “SS reliability” is used to characterize the reliability of a system subjected to random stress W2 and having random strength W1, with the system failing if W2 is greater than W1, that is; R = P(W2 < W1). Let us assume that W1∼UPBXD and W2∼UPBXD are two independent random variables. The SS reliability of the UPBXD is then calculated as follows:
Using the binomial expansions in (13), we get
As seen in (14) the SS reliability dependent on the parameters and
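A Monte Carlo estimate of R = P(W2 < W1) offers a simple check on a series expression such as (14). The sketch below again assumes the hypothetical parameterization W = exp(−Y) with G(y) = [1 − exp(−(scale·y^power)^2)]^shape; the function names and the (scale, shape, power) tuples are illustrative assumptions, not the paper's notation.

```python
import math
import random

def draw_w(scale, shape, power, rng):
    """One draw of W = exp(-Y) under the ASSUMED PBXD form (see lead-in)."""
    q = rng.random()
    while q == 0.0:              # the inversion needs 0 < q < 1
        q = rng.random()
    z = -math.log(-math.expm1(math.log1p(-q) / shape))
    return math.exp(-((math.sqrt(z) / scale) ** (1.0 / power)))

def stress_strength_mc(strength, stress, n_rep=100000, seed=0):
    """Monte Carlo estimate of R = P(W2 < W1).
    strength and stress are (scale, shape, power) tuples for W1 and W2."""
    rng = random.Random(seed)
    hits = sum(draw_w(*stress, rng) < draw_w(*strength, rng) for _ in range(n_rep))
    return hits / n_rep
```

Under the assumed form, when W1 and W2 share the same scale and power, R reduces to shape2/(shape1 + shape2), which is a convenient case for validating the simulation.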
4. Parameter Estimation
This section develops estimators of the UPBXD parameters using non-Bayesian and Bayesian approaches. The classical method is ML. Bayesian estimation is carried out under several loss functions, including the squared error loss function (SELF), the linear exponential (LINEX) loss function and the entropy loss function (ELF).
4.1. Maximum Likelihood Method
Consider a population that has a UPBXD described by PDF (6) with an unknown parameter vector and that a random sample of size n is taken from that population. Following that, the likelihood of UPBXD for say will be
The log likelihood function for say will be
The nonlinear equations created by differentiating (16) with respect to and are solved to obtain the ML estimator for the unknown parameters. The score vector components, say are given by
The ML estimators of the unknown parameters are obtained by solving the nonlinear system (17)–(19). These equations have no closed-form solution, but they can be solved numerically with iterative methods such as the Newton–Raphson algorithm.
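To make the numerical step concrete, the objective that an iterative routine would maximize can be written out. This is a sketch under the same hypothetical parameterization as before, W = exp(−Y) with G(y) = [1 − exp(−(scale·y^power)^2)]^shape; the resulting log-density is an assumption that stands in for (6) and (16), and it should be verified against them.

```python
import math

def upbxd_logpdf(w, scale, shape, power):
    """Log-density of W = exp(-Y) for 0 < w < 1 under the ASSUMED PBXD form
    G(y) = [1 - exp(-(scale * y**power)**2)]**shape."""
    y = -math.log(w)
    z = (scale * y ** power) ** 2
    return (math.log(2.0 * shape * power * scale ** 2)
            + (2.0 * power - 1.0) * math.log(y)
            + y - z                                   # +y is the Jacobian term -log(w)
            + (shape - 1.0) * math.log(-math.expm1(-z)))

def upbxd_loglik(data, scale, shape, power):
    """Log-likelihood of an i.i.d. sample with values strictly inside (0, 1)."""
    return sum(upbxd_logpdf(w, scale, shape, power) for w in data)
```

The code only evaluates the log-likelihood; the maximization itself (Newton–Raphson or any general-purpose optimizer) is left to the reader's preferred tool.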
4.2. Bayesian Estimation
In this section, the Bayesian estimators based on different loss functions and associated highest posterior density (HPD) intervals of the UPBXD parameters are developed. The posterior distribution of is described in the following if we assume that the prior PDF of is unknown.
The posterior density of is defined in Equation (20) as , where on the right hand side is the likelihood function of UPBXD and is the prior density of
4.2.1. Prior Information
For the purpose of discussing Bayesian estimate, we assume that the parameters and are independently distributed using the gamma distribution. Let and where j =1, 2, 3, be the scale and shape parameters for the gamma priors of and The following is a proportionate representation of the joint density of and
The hyper-parameters will be elicited using the informative priors. When j = 1, …, L and k are the number of samples available from the UPBXD simulation, the mean and variance obtained using the ML estimates of the UPBXD and will be equal to the mean and variance of the considered priors (Gamma priors) and By equating and with the mean and variance of gamma priors, we may determine their respective means and variances. Thus, we obtain
Solving the above two equations, the estimated hyper-parameters can be written as described in the following subsections.
4.2.2. Posterior Distribution
Here, the symmetric loss function (SELF), and asymmetric loss function (LINEX and ELF) are used to develop the Bayesian estimators for the same unknown parameters by utilizing independent gamma priors.
The likelihood function (15) and the joint prior function (21) are combined to form the joint posterior distribution. Hence, the joint posterior density function is
The SELF, is defined as follows:
The Bayesian estimator of under SELF is as follows:
The LINEX loss, an asymmetric loss function, is defined as follows:
The Bayesian estimator of under LINEX loss function is as follows:
The ELF was first suggested by James and Stein [29] to estimate the variance–covariance (i.e., dispersion) matrix of the multivariate normal distribution. According to Calabria and Pulcini [30], the ELF is an excellent asymmetric loss function. The ELF takes the form
The Bayesian estimator of under ELF is as follows:
The Bayes estimators under the different loss functions cannot be expressed in closed form, as is evident from Equations (23)–(25). We therefore generate samples from the conditional posterior distributions using Markov chain Monte Carlo (MCMC) techniques in order to compute the Bayes estimates and construct the associated HPD intervals.
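Once posterior draws of a parameter are available, the three estimates reduce to simple averages over the draws. The sketch below uses the textbook forms of the estimators: the posterior mean for SELF, −(1/ν) log E[exp(−νθ)] for LINEX, and (E[θ^(−ν)])^(−1/ν) for the general entropy loss of Calabria and Pulcini. The function names and the symbol `nu` are hypothetical, and the forms should be compared with (23)–(25).

```python
import math

def bayes_self(draws):
    """Posterior mean: the Bayes estimate under squared error loss."""
    return sum(draws) / len(draws)

def bayes_linex(draws, nu):
    """LINEX estimate: -(1/nu) * log E[exp(-nu * theta)], nu != 0."""
    return -math.log(sum(math.exp(-nu * t) for t in draws) / len(draws)) / nu

def bayes_entropy(draws, nu):
    """General entropy estimate: (E[theta**(-nu)])**(-1/nu), nu != 0, theta > 0."""
    return (sum(t ** (-nu) for t in draws) / len(draws)) ** (-1.0 / nu)
```

With a positive `nu` both asymmetric estimates fall below the posterior mean, and with a negative `nu` they fall above it, which is the sense in which the loss weight controls the direction of shrinkage.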
4.2.3. Markov Chain Monte Carlo
Since it is challenging to solve these integrals analytically, the MCMC method will be used. The most important sub-classes of MCMC algorithms are Gibbs sampling and the Metropolis–Hastings (MH) samplers. The MH method is comparable to acceptance–rejection sampling: at each iteration it draws a candidate value from a proposal distribution, here taken to be normal. From Equation (22), the full conditional densities of the three parameters, which are needed to run the MCMC sampler, are given, respectively, as follows:
and
The MH algorithm can be used to sample from these conditional densities (for details, see Alrumayh et al. [31] and Almetwally et al. [32]). The MH algorithm’s sampling procedure is carried out as follows:
Step 1: Set the initial values of the parameters.
Step 2: Set I = 1.
Step 3: Generate and from and respectively.
Step 4: Obtain
Step 5: Generate samples Uj j =1,2,3 from the uniform U(0, 1) distribution.
Step 6: If and then set ; otherwise and
Step 7: Set I = I+ 1.
Step 8: Repeat steps 3–7 B times and obtain and for I = 1, 2,..., B.
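The steps above can be illustrated with a minimal random-walk MH sampler for a single parameter. This is a toy sketch and not the sampler used in the paper: the target is an unnormalized Gamma(3, 2) density chosen only because its mean (1.5) is known, and the function names, the step size and the seed are assumptions. The UPBXD version would replace `log_target` with the log of each full conditional density and cycle over the three parameters.

```python
import math
import random

def log_target(theta):
    """Toy target: unnormalized log-density of Gamma(shape=3, rate=2)."""
    return 2.0 * math.log(theta) - 2.0 * theta if theta > 0 else -math.inf

def metropolis_hastings(log_post, start, n_iter, step=1.0, seed=0):
    """Random-walk MH with a normal proposal centred at the current value."""
    rng = random.Random(seed)
    current, current_lp = start, log_post(start)
    chain = []
    for _ in range(n_iter):
        candidate = rng.gauss(current, step)            # propose a candidate
        cand_lp = log_post(candidate)
        log_ratio = cand_lp - current_lp                # log acceptance ratio
        if math.log(1.0 - rng.random()) <= log_ratio:   # compare with log of a uniform
            current, current_lp = candidate, cand_lp    # accept
        chain.append(current)                           # otherwise keep the old value
    return chain
```

Working on the log scale avoids overflow in the acceptance ratio, and candidates outside the support are rejected automatically because their log-density is minus infinity.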
4.2.4. Highest Posterior Density Interval
Using the technique suggested by Chen and Shao [33], HPD interval estimates of and are created. The MCMC samples of for j = 1, …, B are first ordered. Therefore, the two-sided HPD interval of is given by
where and
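The ordering-based search for the shortest interval can be sketched as follows. The function name and the argument `cred` are hypothetical; the sketch follows the general idea attributed to Chen and Shao [33] (sort the draws, scan all intervals that span a fixed number of order statistics, keep the shortest) and may differ in indexing details from the formula above.

```python
import math

def hpd_interval(draws, cred=0.95):
    """Shortest interval containing about a fraction cred of the sorted MCMC draws."""
    x = sorted(draws)
    n = len(x)
    m = math.floor(cred * n)     # number of order-statistic steps each candidate spans
    if m < 1 or m >= n:
        raise ValueError("need more draws for this credibility level")
    best = min(range(n - m), key=lambda j: x[j + m] - x[j])
    return x[best], x[best + m]
```

For a skewed posterior this interval is shorter than the equal-tailed interval built from the 2.5% and 97.5% sample quantiles.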
5. Simulation
A Monte Carlo simulation was run to evaluate the performance of the proposed point and interval estimators introduced in the previous sections. Samples of size n = 40, 80, and 160 were drawn from the UPBXD, with a total of 5000 samples generated. To compare the Bayesian estimates under the various loss functions, the bias and mean squared error (MSE) were calculated. The data were generated from the UPBXD under the parameter settings listed in Table 3, Table 4, Table 5 and Table 6.
Table 3.
Bayesian inference with different loss functions when .
Table 4.
Bayesian inference with different loss functions when .
Table 5.
Bayesian inference with different loss functions when .
Table 6.
Bayesian inference with different loss functions when .
The hybrid MCMC algorithm described in Section 4.2.3 was adopted to generate 12,000 MCMC samples, and we discarded the first 2000 values as ‘burn-in’. Accordingly, the 10,000 MCMC samples were used to produce the average Bayes MCMC estimates and 95% two-sided Bayesian credible intervals.
- Algorithm for simulation: The simulation proceeds through the following steps, in order:
- Choose values for the UPBXD parameter vector and the sample size.
- Generate random samples from the UPBXD by applying the QF in Equation (8) to uniform random numbers.
- Compute the Bayes estimates of the UPBXD parameters using the MH algorithm, and calculate the accuracy measures for each estimate.
- Repeat the experiment (L-1) times.
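The accuracy measures in this loop can be sketched generically. The names `sampler`, `estimator` and `run_simulation` are hypothetical; in the study itself the sampler would be the UPBXD generator based on (8) and the estimator would be a Bayes estimate computed with the MH algorithm.

```python
import random

def bias_and_mse(estimates, true_value):
    """Bias and mean squared error of replicated estimates of one parameter."""
    n = len(estimates)
    bias = sum(estimates) / n - true_value
    mse = sum((e - true_value) ** 2 for e in estimates) / n
    return bias, mse

def run_simulation(sampler, estimator, true_value, n, n_rep, seed=0):
    """Repeat: draw a sample of size n, estimate, then summarize over n_rep replications."""
    rng = random.Random(seed)
    estimates = [estimator([sampler(rng) for _ in range(n)]) for _ in range(n_rep)]
    return bias_and_mse(estimates, true_value)
```

As a quick illustration, `run_simulation(lambda rng: rng.random(), lambda xs: sum(xs) / len(xs), 0.5, 40, 500)` estimates the bias and MSE of the sample mean of 40 uniform draws.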
5.1. Simulation Results
Table 3, Table 4, Table 5 and Table 6 show the results of the proposed techniques for calculating the point and interval parameter estimates. The following observations can be made:
- The estimates are asymptotically unbiased since they are more accurate as the sample size increases.
- The parameter estimates come from the best unbiased estimator when the MSE value is near zero.
- As the sample size grows, the MSE declines for each estimate, demonstrating consistency between the various estimates.
- When the true value of increases, the bias, MSE, and length of the credible confidence interval (LCCI) of all estimates decrease.
- The MSE and LCCI for the Bayesian estimates with positive weight for the asymmetric loss function are smaller than the Bayesian estimates with negative weight for asymmetric loss function.
- The LCCI for estimates obtains its largest value, based on the suggested method, as the true values of the parameters increase.
- An entropy loss function with positive weight is better than the other loss functions.
5.2. Representation of Results
Figure 4, Figure 5, Figure 6 and Figure 7 show heatmap descriptions for the MSE results, where the bold color represents the highest values of MSE and the white color represents the lowest values of MSE.
Figure 4.
Heatmap for MSE when .
Figure 5.
Heatmap for MSE when .
Figure 6.
Heatmap for MSE when .
Figure 7.
Heatmap for MSE when .
The X-label belongs to SELFj, (j = 1, 2, 3) which are the MSE of Bayes estimates based on SELF with different parameters;
LINEXaj, (j =1, 2, 3) are the MSE of Bayes estimates based on LINEX with different parameters;
LINEXbj, (j =1, 2, 3) are the MSE of Bayes estimates based on LINEX with different parameters;
ELFaj, (j =1, 2, 3) are the MSE of Bayes estimates based on ELF with different parameters;
ELFbj, (j =1, 2, 3) are the MSE of Bayes estimates based on ELF with different parameters.
The Y-label belongs to cases and sample sizes, where C1n1 denotes case 1 with n = 40; C1n2 denotes case 1 with n = 80; and C1n3 denotes case 1 with n = 160.
6. Application of Real Data
This section analyses two real-world data sets, COVID-19 mortality rates from Saudi Arabia and the United Kingdom, to show the adaptability and practical application of the UPBXD and to evaluate its goodness of fit. The UPBXD is compared with the following models: unit-exponentiated half-logistic (UEHL) [23], Type II power Topp–Leone exponential (TIIPTLE) [34], Topp–Leone generalized exponential (TLGE) [35], Kumaraswamy (K), Beta, unit Weibull (UW), and Marshall–Olkin–Kumaraswamy (MOK). The unknown parameters of the specified models were estimated from the two data sets using the maximum likelihood and Bayesian approaches. The Kolmogorov–Smirnov statistic (KSS) with its p-value, the Cramér–von Mises statistic (WS), and the Anderson–Darling statistic (AD) were used to compare all of the models.
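The three goodness of fit statistics can be computed from a fitted CDF with the standard order-statistic formulas. The sketch below is a generic illustration with a hypothetical function name; it returns the statistics only, without p-values, and treats the fitted CDF as fully specified.

```python
import math

def gof_statistics(data, cdf):
    """Kolmogorov-Smirnov, Cramer-von Mises and Anderson-Darling statistics
    for a fully specified continuous CDF (no p-values are computed)."""
    u = sorted(cdf(x) for x in data)   # fitted CDF at the ordered observations
    n = len(u)
    ks = max(max((i + 1) / n - ui, ui - i / n) for i, ui in enumerate(u))
    cvm = 1.0 / (12 * n) + sum((ui - (2 * i + 1) / (2 * n)) ** 2
                               for i, ui in enumerate(u))
    ad = -n - sum((2 * i + 1) * (math.log(u[i]) + math.log(1.0 - u[n - 1 - i]))
                  for i in range(n)) / n
    return ks, cvm, ad
```

Smaller values of all three statistics indicate a closer agreement between the fitted and empirical CDFs, which is the basis for ranking the competing models.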
6.1. Analysis for First Data
Data set I: The first set of data shows Saudi Arabia’s COVID-19 mortality rates over a 36-day period (22 July 2021 to 26 August 2021). The data are as follows: 0.1310, 0.1319, 0.1497, 0.1504, 0.1686, 0.1689, 0.1706, 0.1716, 0.1879, 0.1890, 0.1924, 0.1951, 0.2063, 0.2077, 0.2091, 0.2113, 0.2126, 0.2140, 0.2167, 0.2249, 0.2259, 0.2271, 0.2278, 0.2314, 0.2329, 0.2347, 0.2353, 0.2375, 0.2452, 0.2487, 0.2666, 0.2674, 0.2683, 0.2711, 0.2752, 0.2962. Table 7 shows the ML estimates of the parameters with their standard errors (SEs) for each distribution, together with the goodness of fit measures KSS, WS, and AD. The results in Table 7 show that the UPBXD fits the COVID-19 mortality rates in the Saudi Arabia data set better than the other distributions, namely TIIPTLE, TLGE, K, Beta, UW, UEHL, and MOK.
Table 7.
ML estimates with SE and goodness of fit statistics: Saudi Arabia data set.
As can be seen, the TIIPTLE, TLGE, K, Beta, UW, UEHL, and MOK distributions work well for modelling the COVID-19 mortality rates in the Saudi Arabia data set, but the UPBXD is the best. This is based on a significance level of 0.05. Figure 8 shows the estimated CDF (red line) together with the empirical CDF (black line). It also shows the probability–probability (PP) plot of the UPBXD, also known as a “parametric plot”, for the Saudi Arabia data set. Both plots support the empirical findings reported in Table 7.
Figure 8.
The CDF plot with empirical line and PP plot for Saudi Arabia data set.
Figure 9 shows three plots of the COVID-19 mortality rates for the Saudi Arabia data set: the left panel is a boxplot showing that the data have no outliers, the center panel is a total time on test (TTT) plot indicating an increasing hazard rate, and the right panel is the estimated hazard curve, which confirms that the HF is increasing.
Figure 9.
Boxplot, TTT plots and hazard line of UPBXD plot for Saudi Arabia data set.
6.2. Analysis for Second Data
Data set II: The second set of data shows the United Kingdom COVID-19 mortality rates over a 28-day period (1 January 2022 to 28 January 2022). The information is as follows: 0.1484, 0.1174, 0.0522, 0.0296, 0.0339, 0.2274, 0.1555, 0.1530, 0.2079, 0.0640, 0.1407, 0.2463, 0.2569, 0.2150, 0.1723, 0.1823, 0.1807, 0.1823, 0.2736, 0.2228, 0.2036, 0.1767, 0.1814, 0.1361, 0.1620, 0.2639, 0.2067, 0.2008.
Table 8 shows the ML estimates of the parameters for each distribution, together with the goodness of fit measures KSS, WS, and AD. The results in Table 8 show that the UPBXD fits the COVID-19 mortality rates in the United Kingdom data set better than the other distributions, namely TIIPTLE, TLGE, K, Beta, UW, UEHL, and MOK. The TIIPTLE, TLGE, K, Beta, UW, UEHL, and MOK distributions also work well for modelling these data, though the UPBXD is the best. This is based on a significance level of 0.05.
Table 8.
ML estimates with SE and goodness of fit statistics: the United Kingdom data set.
Figure 10 shows the empirical CDF (black line) with the estimated CDF (red line) and the PP plot for the COVID-19 mortality rates of the United Kingdom data set, which support the empirical findings reported in Table 8. Figure 11 shows three plots for the United Kingdom data set: the left panel is a boxplot showing that the data have no outliers, the center panel is a TTT plot indicating an increasing hazard rate, and the right panel is the estimated hazard curve, which confirms that the hazard is increasing.
Figure 10.
The CDF plot with empirical line and PP plot for the United Kingdom data set.
Figure 11.
Boxplot, TTT plots and hazard line of UPBXD plot for the United Kingdom data set.
6.3. Data Analysis via Bayesian Method
Here, we analyze data sets presented in previous sub-sections using the proposed Bayesian estimation method.
The Bayesian estimates of the UPBXD parameters for each of the two data sets are given in Table 9. The estimates are computed under SELF, together with the corresponding SEs and the lower and upper limits of the HPD intervals.
Table 9.
Bayesian estimation based on SELF for parameters of UPBXD.
Figure 12 and Figure 13 display the trace plots of the MCMC draws for the UPBXD parameters.
Figure 12.
Trace plots of MCMC results with interval limit line for Saudi Arabia data set.
Figure 13.
Trace plots of MCMC results with interval limit line for the United Kingdom data set.
The autocorrelation function (ACF) of the MCMC draws is shown in Figure 14 and Figure 15. Figure 16 and Figure 17 show that the posterior densities of the UPBXD parameters are approximately symmetric and bell-shaped.
Figure 14.
The ACF plot of MCMC results for Saudi Arabia data set.
Figure 15.
ACF plot of MCMC results for the United Kingdom data set.
Figure 16.
Histogram plots of MCMC results for Saudi Arabia data set.
Figure 17.
Histogram plots of MCMC results for the United Kingdom data set.
Figure 18 and Figure 19 display the parameter convergence charts for UPBXD draws as well as the parameter random draw plot, respectively.
Figure 18.
Convergence lines of MCMC results for Saudi Arabia data set.
Figure 19.
Convergence lines of MCMC results for the United Kingdom data set.
7. Conclusions
This article focuses on a three-parameter unit distribution based on the power Burr X distribution, called the UPBXD. The statistical properties of the UPBXD have been derived and expressed in closed forms. The presented unit distribution can be used as a statistical tool to model different types of HFs, including those that are bathtub, increasing and unimodally shaped. Its important features are carefully studied, including the analytical expression of moments, quantile function, incomplete moments, stochastic ordering, and stress–strength reliability. Moreover, the Rényi, Havrda and Charvat, and d-generalized entropies were obtained as measures of uncertainty. The UPBXD parameters have been estimated using the ML approach as well as the Bayesian approach with different loss functions. Additionally, Bayesian credible intervals were constructed based on the marginal posterior distribution. The Markov chain Monte Carlo method was used for calculations that are not available in closed form. To assess how the various estimates perform, simulation studies based on various sample sizes have been carried out. The simulation results show that the Bayesian estimates under the symmetric loss function and the LINEX loss function work quite effectively for estimating the UPBXD parameters. Bayesian estimates under an entropy loss function with positive weight are superior to those under other loss functions. The MSE and length of the credible confidence interval for Bayesian estimates with positive weight are smaller than the corresponding values with negative weight. Finally, two actual COVID-19 mortality rate data sets from Saudi Arabia and the United Kingdom have been analyzed and discussed to illustrate the applicability of the UPBXD. The real data application shows that the UPBXD gives superior fits over several other competing models.
Future work could extend the Bayesian estimation to stress–strength reliability for the UPBXD based on some sampling techniques [36,37,38]. Furthermore, the proposed methodology can be extended to the bivariate and multivariate cases, as in [39,40,41].
Author Contributions
Conceptualization, A.S.H. and E.M.A.; Methodology, A.F., A.S.H., H.B. and E.M.A.; software, A.S.H. and E.M.A.; validation, A.S.H. and H.B.; formal analysis, A.F. and H.B.; investigation, A.F. and H.B.; data curation, A.S.H. and E.M.A.; writing—original draft, A.F., A.S.H., H.B. and E.M.A.; writing—review and editing, A.S.H., H.B. and E.M.A. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
The data sets have been provided in Section 6.
Acknowledgments
The authors would like to thank the editor and the anonymous referees for their efforts, wise observations, and constructive criticism, which greatly enhanced the manuscript’s contents.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Burr, I.W. Cumulative frequency functions. Ann. Math. Stat. 1942, 13, 215–232.
- Surles, J.; Padgett, W. Some properties of a scaled Burr type X distribution. J. Stat. Plan Inference 2005, 128, 271–280.
- Raqab, M.Z.; Kundu, D. Burr type X distribution: Revisited. J. Probab. Stat. Sci. 2006, 4, 179–193.
- Merovci, F.; Khaleel, M.A.; Ibrahim, N.A.; Shitan, M. The beta Burr type X distribution properties with application. SpringerPlus 2016, 5, 697.
- Yousof, H.M.; Afify, A.Z.; Hamedani, G.; Aryal, G. The Burr X generator of distributions for lifetime data. J. Stat. Theory Appl. 2017, 16, 288–305.
- Ibrahim, N.A.; Khaleel, M.A.; Merovci, F.; Shitan, M. Weibull Burr Type X distribution properties with application. Pak. J. Stat. 2017, 33, 315–336.
- Nasiru, S.; Mwita, P.N.; Ngesa, O. Exponentiated generalized half logistic Burr X distribution. Adv. Appl. Statist. 2018, 52, 145–169.
- Shrahili, M.; Elbatal, I.; Muhammad, M. The type I half-logistic Burr X distribution: Theory and practice. J. Nonlinear Sci. Appl. 2019, 12, 262–277.
- Khan, M.S.; King, R.; Hudson, I.L. Transmuted Burr Type X distribution with covariates regression modeling to analyze reliability data. Am. J. Math. Manag. 2019, 39, 99–121.
- Usman, R.M.; Ilyas, M. The power Burr Type X distribution: Properties, regression modeling and applications. Punjab Univ. J. Math. 2020, 52, 27–44.
- Topp, C.W.; Leone, F.C. A family of J-shaped frequency functions. J. Am. Stat. Assoc. 1955, 50, 209–219.
- Kumaraswamy, P. A generalized probability density function for double-bounded random processes. J. Hydrol. 1980, 46, 79–88.
- Gómez-Déniz, E.; Sordo, M.A.; Calderín-Ojeda, E. The log-Lindley distribution as an alternative to the beta regression model with applications in insurance. Insur. Math. Econ. 2014, 54, 49–57.
- Mazucheli, J.; Menezes, A.F.B.; Dey, S. The unit-Birnbaum–Saunders distribution with applications. Chil. J. Stat. 2018, 9, 47–57.
- Ghitany, M.E.; Mazucheli, J.; Menezes, A.F.B.; Alqallaf, F. The unit-inverse Gaussian distribution: A new alternative to two-parameter distributions on the unit interval. Commun. Stat. Theory Methods 2019, 48, 3423–3438.
- Mazucheli, J.; Menezes, A.F.B.; Chakraborty, S. On the one parameter unit-Lindley distribution and its associated regression model for proportion data. J. Appl. Stat. 2019, 46, 700–714.
- Modi, K.; Gill, V. Unit Burr-III distribution with application. J. Stat. Manag. Syst. 2020, 23, 579–592.
- Mazucheli, J.; Menezes, A.F.B.; Fernandes, L.B.; de Oliveira, R.P.; Ghitany, M.E. The unit-Weibull distribution as an alternative to the Kumaraswamy distribution for the modeling of quantiles conditional on covariates. J. Appl. Stat. 2020, 47, 954–974.
- Korkmaz, M.C.; Chesneau, C. On the unit Burr-XII distribution with the quantile regression modeling and applications. Comput. Appl. Math. 2021, 40, 29.
- Haq, M.A.U.; Albassam, M.; Aslam, M.; Hashmi, S. Statistical inferences on odd Fréchet power function distribution. J. Reliab. Stat. Stud. 2021, 14, 141–172.
- Krishna, A.; Maya, R.; Chesneau, C.; Irshad, M.R. The unit Teissier distribution and its applications. Math. Comput. Appl. 2022, 27, 12.
- Jha, M.K.; Dey, S.; Alotaibi, R.; Alomani, G.; Tripath, Y.M. Multicomponent stress-strength reliability estimation based on unit generalized exponential distribution. Ain Shams Eng. J. 2022, 13, 101627.
- Hassan, A.S.; Fayomi, A.; Algarni, A.; Almetwally, E.M. Bayesian and non-Bayesian inference for unit-exponentiated half-logistic distribution with data analysis. Appl. Sci. 2022, 12, 11253.
- Rényi, A. On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics; University of California Press: Berkeley, CA, USA, 1960; Volume 1, pp. 547–561.
- Havrda, J.; Charvát, F. Quantification method of classification processes: Concept of structural a-entropy. Kybernetika 1967, 3, 30–35.
- Mathai, A.M.; Haubold, H.J. On a generalized entropy measure leading to the pathway model with a preliminary application to solar neutrino data. Entropy 2013, 15, 4011–4025.
- Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions; Wiley: New York, NY, USA, 1995; Volume 2.
- Shaked, M.; Shanthikumar, J.G. Stochastic Orders; Wiley: New York, NY, USA, 2007.
- James, W.; Stein, C. Estimation with quadratic loss. In Breakthroughs in Statistics; Springer: New York, NY, USA, 1992; pp. 443–460.
- Calabria, R.; Pulcini, G. An engineering approach to Bayes estimation for the Weibull distribution. Microelectron. Reliab. 1994, 34, 789–802.
- Alrumayh, A.; Weera, W.; Khogeer, H.A.; Almetwally, E.M. Optimal analysis of adaptive type-II progressive censored for new unit-lindley model. J. King Saud Univ. Sci. 2023, 35, 102462.
- Almetwally, E.M.; Jawa, T.M.; Sayed-Ahmed, N.; Park, C.; Zakarya, M.; Dey, S. Analysis of unit-Weibull based on progressive type-II censored with optimal scheme. Alex. Eng. J. 2023, 63, 321–338.
- Chen, M.H.; Shao, Q.M. Monte Carlo estimation of Bayesian credible and HPD intervals. J. Comput. Graph. Stat. 1999, 8, 69–92.
- Bantan, R.A.; Jamal, F.; Chesneau, C.; Elgarhy, M. Type II Power Topp-Leone generated family of distributions with statistical inference and applications. Symmetry 2020, 12, 75.
- Sangsanit, Y.; Bodhisuwan, W. The Topp-Leone generator of distributions: Properties and inferences. Songklanakarin J. Sci. Technol. 2016, 38, 537–548.
- Yousef, M.M.; Hassan, A.S.; Alshanbari, H.M.; El-Bagoury, A.-A.H.; Almetwally, E.M. Bayesian and non-Bayesian analysis of exponentiated exponential stress–strength model based on generalized progressive hybrid censoring process. Axioms 2022, 11, 455.
- Hassan, A.S.; Nagy, H.F. Reliability estimation in multicomponent stress strength for generalized inverted exponential distribution based on ranked set sampling. Gazi Univ. J. Sci. 2022, 35, 314–331.
- Hassan, A.S.; Almanjahie, I.M.; Al-Omari, A.I.; Alzoubi, L.; Nagy, H.F. Stress–strength modeling using median-ranked set sampling: Estimation, simulation, and application. Mathematics 2023, 11, 318.
- Chesneau, C. On new three- and two-dimensional ratio-power copulas. Comput. J. Math. Stat. Sci. 2023, 2, 106–122.
- El-Sherpieny, E.S.A.; Muhammed, H.Z.; Almetwally, E.M. Bivariate Chen distribution based on copula function: Properties and application of diabetic nephropathy. J. Stat. Theory Pract. 2022, 16, 54.
- El-Sherpieny, E.S.A.; Almetwally, E.M.; Muhammed, H.Z. Bivariate Weibull-G family based on copula function: Properties, Bayesian and non-Bayesian estimation and applications. Stat. Optim. Inf. Comput. 2022, 10, 678–709.
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).