Abstract
The univariate alpha power exponential (APE) distribution has several appealing characteristics. It behaves similarly to Weibull, Gamma, and generalized exponential distributions with two parameters. In this paper, we consider different bivariate mixture models starting with two independent univariate APE models, and, in the latter case, starting from two dependent APE models. Several useful structural properties of such a mixture model (under the assumption of two independent APE distribution) are discussed. Bivariate APE (BAPE), in short, modelled under the dependent set up are also discussed in the context of a copula-based construction. Inferential aspects under the classical and under the Bayesian paradigm are considered to estimate the model parameters, and a simulation study is conducted for this purpose. For illustrative purposes, a well-known motor data is re-analyzed to exhibit the flexibility of the proposed bivariate mixture model.
Keywords:
alpha power exponential mixture model; Bayesian approach; Gaussian copula; bivariate model; EM algorithm; simulation study; motor data MSC:
65C20; 60E05; 62P30; 62L15
1. Introduction
Univariate continuous alpha power exponential distribution with two parameters has received a considerable amount of attention among researchers working in the area of distribution theory. Since the introduction of the model, an appreciable amount of work has taken place on the development of bivariate APE distribution and its different generalizations; for more information, see Ref. [] and the references cited therein. In this paper, we discuss several different bivariate APE models with various mixing mechanisms. Noticeably, mixture distribution plays an important role in various data-analysis purposes; see []. Precisely, if the data are coming from different subpopulations, and their identifications are unknown, then the mixture distribution can be used quite effectively to analyze the data set. Unlike any other symmetric distribution (for example, normal) APE distribution is a skewed distribution. Consequently, if the subpopulations do not have symmetric distribution, the mixture of APE resulting in a bivariate APE distribution may be used to analyze these data. Furthermore, a multi-model distribution can be well approximated by a mixture distribution. Not much work has been carried out on a bivariate mixture of APE distribution.
This is an effort towards that direction. We consider two different mixing strategies to construct the bivariate mixture APE (BMAPE type I). In both cases, we start with the following: let and , and let them be independent. In the first, we consider a finite mixing strategy as described above and call it a BMAPE distribution. While, in the second scenario, we consider a copula-based construction approach. We consider a bivariate Gaussian copula to construct a copula-based bivariate distribution which we call (henceforth) BMAPE (type I). One of the major objectives for a copula-based construction is that we can better discuss the dependence structure between two components. For a copula-based bivariate and multivariate distribution, see []. In the next, we provide a non-exhaustive list of pertinent references that serves as motivation to carry out the present work.
Statisticians and practitioners have been particularly interested in studying groups that demonstrate similar behavior in response to pre-determined criteria. Refs. [,] developed and utilized a method known as finite mixture distributions, and provided the majority of the early evidence for the study of heterogeneous populations as a possible application of finite mixture models. Studies focusing on heterogeneous populations became more popular in the modern era as data-related computation facilities improved. For some useful references, see [,,,,] and the references cited therein. Starting with one parameter exponential baseline distribution, Ref. [] developed the alpha power exponential (APE) distribution, discussed its several useful properties, and studied the associated statistical inference. Ref. [] introduced the alpha power Weibull distribution and demonstrated that it provides superior modelling to several other generalizations of the Weibull distribution, in particular, when modeling heterogenous population. To introduce the alpha power generalized exponential (APGE) distribution, Ref. [] employed the generalized exponential baseline distribution and the alpha power transformation (APT) technique. Ref. [] developed closed-form expressions for the alpha power generalized exponential (APGE) distribution’s moment properties. Ref. [] presented a new family of APT-based distributions by inverting the quantile function of any distribution as a function of the APT cumulative distribution function (CDF). In the next, we provide a non-exhaustive list of references related to APT-based mixture distributions that are developed in the univariate domain. The alpha power exponentiated Weibull distribution was suggested by Ref. [], who investigated the general mathematical features of the APT family. Ref. [] introduced the Marshall Olkin alpha power family as a new generalization of the APT and a novel approach to studying the mathematical properties of the APT model was introduced by Ref. []. The probability density function (PDF) and the hazard functions of the APE distribution are similar to those of the Weibull, Gamma, and generalized exponential (GE) distributions under certain specific choices of the model parameters. As a result, it can be used as a variable alternative for the widely used Weibull, Gamma, and GE distributions. Specially, when, component-wise, each one of them cannot provide best fit, such a mixture mode will be a good fit. As the CDF of the APE distribution has a closed form which is analytically tractable, it can be used to analyze censored data sets as well.
Mixture distributions are made up of a finite or an infinite number of components, which may be comprised of several different distributions, and can be used to represent various aspects of data for example, two of the most notable studies that explore different sorts of combinations of distributions can be found in Refs. [,]. Discussion on a finite three-component mixture of exponential distribution can be found in Ref. []. In the literature, there are not many references of finite and/or infinite mixtures of probability models. A non-exhaustive list of references can be cited as follows. A mixture of bivariate exponential distributions (MBE), in which the authors predicted the elements of MBE are given in Ref. []. Discussion on a new bivariate distribution whose both marginals are finite mixtures of gamma distribution are nicely explored in Ref. []. For a review of known bivariate distributions, we refer the readers to Refs. [,,]. The univariate exponential distribution is one of the most celebrated distributions because of its importance in many lifetime applications and its structural properties. Consequently, if we consider a simple bivariate expansion, there might be limitations due to properties, especially for modeling heterogenous data. This motivates us to consider a mixture form that will have many desirable properties. In this article, we propose and study a finite mixture of a bivariate continuous probability model arising from two independent univariate APE distributions, which we call BMAPE, in short. In this paper, we consider various mechanisms of mixing univariate APE distributions to construct a bivariate mixture of APE models, which are listed below:
- (i)
- In the first case, we start with two independent and univariate APE with parameters and with . Further, we assume that , are random variables. Then, using the methodology of compounding, we develop the bivariate APE distribution, and, for the sake of notational simplicity, call it the BMAPE (type-I) distribution.
- (ii)
- In the second scenario, we consider a copula-based construction approach. We consider a bivariate Gaussian copula in which the margins are two univariate and independent APE distributions with the coupling parameter . For notational simplicity, we call this the BMAPE (type-II) distribution, for more details of copula based on construction and associated properties, see [].
The remainder of the paper is laid out as follows. In Section 2, we describe the BMAPE (type-I) distribution and discuss its properties. In Section 3, we provide some useful properties of the BMAPE (type-I) distribution, including the survival function, hazard rate function and conditional moments. In Section 4, we look at how the expectation-maximization (EM) technique can be used to estimate model parameters for BMAPE (type-I) distribution. The estimation approach for the BMAPE (type-II) distribution formed using the bivariate Gaussian copula is discussed in Section 5. In Section 6, we investigate how to estimate model parameters under the Bayesian paradigm. In Section 7, the simulation results are reported. A well-known Motor data set was re-analyzed in Section 8 to demonstrate the efficacy of the suggested BMAPE type models. Finally, in Section 9, we wrap up the paper with some concluding remarks.
2. The APE Model with a Bivariate Mixture
We will start with two independent univariate APE distributions with parameters of and , respectively. We employ here the strategy of compounding by assuming that and are random variables; then, the marginal joint distribution of and may be derived from the joint distribution of and μ respectively, which is given as follows
where and are independent APE distributions. Next, we consider two different scenarios in the context of the nature of the distributions of and . From (1), one may obtain the desired BMAPE (type-I) distribution.
We generate a bivariate mixture of the APE distribution in two scenarios. In the first scenario, and are independent APE distributions with a generalized bivariate Bernoulli distribution for the shape parameters as
A random variable with an APE distribution has a CDF and a PDF for , given by, respectively,
where and are the shape and scale parameters.
Next, in order to construct a finite mixture of bivariate alpha power exponential model, we proceed as follows:
- (i)
- We consider and are random and is fixed, and each are independent for fixed choices of and . Likewise, where and are random and is fixed.
- (ii)
- Next, we further assume that are independent but are correlated through their bivariate generalized Bernoulli distribution with the shape parameter , assuming the following probability matrix.where P is the mixing component with the condition .
Let be the joint PDF created from the marginal densities of
Then, the joint density of can be written as
Then,
For this model, it appears that, for a valid PDF, we must have this constraint and .
To make things easier in terms of writing and visualization, let , , and . Depending on a variety of choices for the mixing parameters and for , several shapes of the joint PDF in (5) are provided in Figure 1. Moreover, several contour plots for various values of , , and when are displayed in Figure 2. As one might expect, not all four components may be required in applications to real-world data sets. Consequently, some constraints can be applied, such as , or , as appropriate. These constraints result in correlation values of +1, −1, and 0 among the shape parameter, respectively.
Figure 1.
Joint density plot of the BMAPE (type-I) distribution for the selected parameter choices.
Figure 2.
Contour plot of the BMAPE (type-I) distribution for the selected parameter choices.
The marginal densities of X and Y, respectively, from the joint PDF in (5) are not available in closed form; one may consider some numerical methods to obtain an approximate expression. The CDF for the joint model will be
For (6), to be a valid CDF, we must have
- (i)
- and .
- (ii)
- .
- (iii)
- .
Next, one may obtain the marginal survival functions for the smallest order statistics for a random sample of size two. For example, let then, while
From (7), one can obtain the associated PDF for We discuss some useful structural properties in the next section for the BMAPE (type-I) distribution.
3. Structural Properties
Theorem: The BMAPE (type I) distribution is log-concave. Taking the negative logarithm of (5), for (or for ), we obtain
Next, taking partial derivatives w.r.t. and of (8), respectively, we obtain
It is obvious from (9) that, provided the expression in (9) is greater than zero with the assumption that i.e., the joint density in (5) is log-concave.
The BMAPE (type-I) distribution’s survival function will be
The marginal distribution functions of X and Y are and , respectively. The following is the hazard rate function (hrf):
For each fixed Y = y, the conditional PDF of X given Y as well as Y given X are given, respectively, by
Proposition 1.
Let. Then, (X,Y) has a positive association with a total positivity of order 2 (), provided (. Observe that an absolute continuous bivariate random vector, such as, has theproperty if and only if for each,wheneverandit satisfies, whereis the joint PDF. Take distinct ordered,andsuch that<and, and our result follows instantly. Consequently, the positive quadrant dependence property (also known as theproperty) would reveal a number of non-increasing features relating to conditional survival, the conditional CDF of X given Y, and Y given X, as well as the following finding:and,.
Proposition 2.
Shape of the distribution
A critical point in a two-variable function is where the first-order partial derivatives are equal to zero. The following are the two most compelling reasons to investigate the critical points of a bivariate distribution:
- (1)
- A real-life data set (or sets) can assume a variety of shapes. A study such as this can help determine the adaptability of this bivariate distribution.
- (2)
- When working with bivariate distributions, it is essential to examine the joint PDF’s tails as well as the point of inflection. This study is useful in the context of data fitting, as data sets having heavier tails and/or which are small can be adequately modeled by a distribution exhibiting one of these features, as appropriate. Critical-point(s) knowledge will aid in a better understanding of these features, as will be illustrated in Appendix A.
4. Inference
In this section, we consider the estimation of the model parameters of the BMAPE (type-I) distribution using the EM algorithm. The EM algorithm, originally proposed by [], is an iterative approach for determining the maximum likelihood estimator of a parameter in a parametric probability distribution. To use the EM technique, we combine the data ( (to be obtained from a random sample of size obtained from the associated joint density) with the group membership variables , , where is an indicator variable that is one if the observation is in and zero otherwise. Similarly, we have four groups for , with the following densities of
where , , and are the equivalent mixing proportions.
The associated log likelihood function for the complete sample is symbolized by:
The EM method has two steps: Step-E (expectation) and Step-M (maximization) in each iteration. As the group membership variables represented as are linear, we replace the related expected values in (11) with the current estimates ( of the parameters determined and, subsequently, an update occurs. For more information on this topic, see [] as:
Similarly, for and , apply the same technique, as follows:
Following that, in the M-step, we maximize (11) across for fixed values of . The conditional independence of X and Y given the group membership achieves maximization by differentiating (11), which are:
and
The M-step is then completed by setting
The model parameters must have a starting value to initiate the process, which is denoted by the symbol . To ensure that the proposed EM algorithm’s rate of convergence does not become significantly sluggish, careful selection of these initial values is required. Another factor to consider is that the maximum likelihood equation may have several solutions corresponding to local maxima, necessitating careful selection of starting values. In the literature, some discussion on the identification of starting value(s) have been discussed and the references cited therein. The “coda” and “maxLik” of R package were used to quantitatively solve these equations. After obtaining the maximum likelihood estimators (MLEs) for and , we replace these estimates in ( ). Then, the M-step is completed by setting , and the process continues. To produce initial values for the mixing proportions individually, the matching moment’s method for the marginal univariate APE parameter is employed. The BMAPE (type-I) parameter estimates are used as the starting points for the EM technique. Then, assuming that the two variables X and Y are independent, we combine the marginal mixing parameter moment estimators to obtain initial values for the bivariate mixing parameters. Specific details are provided in Section 7, and, more specifically, in Tables 2–4. By assuming that the two variables X and Y are independent, as in (3) and (4), we combine the marginal mixing parameter of moment estimators to obtain initial values for the bivariate mixing parameters. We equate (12)–(17) to zero to obtain the estimates of the BMAPE (type-I) distribution’s parameter. Approximate confidence intervals (ACIs) of the unknown parameters are also obtained. This part is not investigated here but its results are presented numerically in Section 7 and Section 8.
5. BMAPE (Type-II) Model Gaussian Copula Distribution
In this scenario, the proposed BMAPE model uses APE marginals and provides a reasonable flexibility in terms of marginals and the associated correlation structure. A copula is a useful tool for constructing bivariate and multivariate distributions, see [,] and the references cited therein. Copulas are extensively employed and play a vital part in the study of dependency. Several bivariate and multivariate lifetime distributions have been proposed by Refs. [,] on using a wide variety of couples. In this paper, we discuss the construction of a BMAPE (type-II) distribution via bivariate Gaussian copula with two APE marginals.
5.1. BMAPE (Type-II) Distribution Based on a Gaussian Copula
Any multivariate distribution may be decomposed into a copula and its continuous marginal, according to Ref. []. Copulas are useful tools in a bivariate scenario to combine two marginal distributions such that, for every bivariate distribution function F(x,y), with continuous marginal F(x) and F(y), one can write
The density function associated with the bivariate distribution will be
where is the density function of a copula; see [].
As explained, there are a variety of options for building BMAPE (type-II) distributions using copulas and APE marginals (1). A bivariate Gaussian copula is defined as
where is the joint CDF of a bivariate normal vector with zero means, unit variates, and correlation coefficient .
Here, we denote the joint distribution of the two variables as and where symbolizes the phrase dependence. The joint PDF of and based on Gaussian copula becomes
where and are latent random variable as well as being the density functions of APE distributions provided in (3) and (4), and the dependence parameter. The BAPE (type-II) distribution PDF is generated if the marginals are from an APE distribution.
. Assuming that Xd and Yd are dependent APE distributions with a generalized bivariate Bernoulli distribution for the shape parameters and, hence, the PDF of the BMAPE (type-II) distribution is defined as
where the mixing proportions are , and it must satisfy and , all of which are unknown. The first component of APE’s density is provided by (3). The PDF of the first component of APE is given by
where is a fixed-scale parameter, and is a random-shape parameter, , which takes two distinct values of and . Let have an APE mixing density and the PDF of the second component APE be determined by for a fixed-scale parameter
with the values and being a random-shape parameter. We suppose that Xd and Yd are dependent for given values and that and are associated by their generalized bivariate distribution with the following probability matrix provided by
If is the combined PDF of , then
The joint PDF in (25) can take on a variety of shapes, similar to the joint PDF in (4). For simplicity, let , and ; as a result, the BMAPE (type-II) distribution PDF will be
5.2. Gaussian Copula and the EM Algorithm
Here again, the EM algorithm is introduced. To use the EM technique, we supplement the data ( ); k = 1,…, n, with the group membership variables ( ), k = 1,…, n, where is one if the observation is in , and zero otherwise. Similarly, we have four groups , i, j = 1, 2 for , with densities as
where denotes the dependence parameter. Let , , and be the mixing proportions. We define = and then the EM algorithm as a technique of estimate by finding the complete log likelihood, say , is given as follows:
As the group membership variables ( ) are linear, we enter their predicted values into (28) in the E-step, given the current estimates of the parameter, computed as:
Similarly, we use the same approach for . It is worth noting that the above algebraic simplification may be required to obtain numerical solution. For fixed values of (), we maximize (28) over . The univariates and the Gaussian copula parameter can be dealt with separately.
Then, differentiating (28) results in
and
The method uses a two-step strategy to estimate the marginal of X and Y as well as the copula function separately, yielding the MLEs of . The MLEs of are obtained by solving the nonlinear equation in (28)–(37). The density of copulas is then calculated as follows:
where and are the maximum likelihood estimates of the PDF from the first step. The MLEs of and are obtained by solving the nonlinear (39). The M-step is completed by setting To solve these equations numerically, we use the copula R package. Next, we replace these estimates in ( ) after obtaining MLEs for . We finish the M-step by setting , and so on.
The method of matching moments, which is generated from the marginal univariate APE and the Gaussian copula parameter separately, are used to obtain the initial values of the parameters for the mixing proportions. The BAPE parameter estimators obtained are used as starting values for the EM method. Then, assuming that the dependency between two variables X and Y is zero, we combine the moment estimators of the marginal mixing parameters to generate initial values for the bivariate mixing parameters. Ref. [] for more information. Following that, we present the process for estimating the unknown parameters for the density in (28). We use two approaches in copula-based estimation: parametric and semiparametric.
5.3. Maximum Likelihood Estimation
We look at how to estimate the unknown parameters of BMAPE (type-II) distributions using the maximum likelihood technique and two-step estimation. This entails a two-step process in which the marginal and copula functions are individually estimated; see []. The log-likelihood function is defined as follows:
The log-likelihood function in (40) may be rewritten as
The MLEs are separately used to estimate the parameters of the marginal distribution and , as follows:
Then, by maximizing the copula density, the copula parameters may be estimated.
The maximum likelihood estimation will estimate the parameters of each marginal distribution by evaluating the first step with APE distributions. If is a random sample from and is a random sample from the log-likelihood functions are provided by
and
As a result, the maximum likelihood equations are as follows:
and
The MLEs of and are obtained by solving the nonlinear (46)–(49) system. The density of copulas is then calculated as follows:
where and are the maximum likelihood estimates of the first-step parameters. Therefore, the maximum likelihood estimate of can be obtained by solving the nonlinear (50).
5.4. Estimation via Semiparametric Methods
The inversion Kendall’s and inversion of Spearman’s semiparametric methods for estimating the copula parameter in copula models are compared to the two methods of moments approaches, inversion Kendall’s and inversion of Spearman’s , respectively.
5.5. Moments Method
We present a brief description of moment’s method of inversion of Kendall’s and the inversion of Spearman’s as indicated in Ref. []. Let c be a bivariate random sample from a CDF, , where F1 and F2 are continuous CDF and is an absolutely continuous copula such that Moreover, unless otherwise specified, let be the vectors of ranks associated with . All vectors in the following are row vectors. The inversion of a consistent estimator of a moment of the copula is used in moments methods. The two most well-known moments, Spearman’s and Kendall’s , are given by Spearman and Kendall, respectively, as
and
These two moments’ consistent estimators can be stated as
and
Consistent estimators of ρ and τ will be if and are one-to-one , respectively.
Inversion of Kendall’s and inversion of Spearman’s are two terms for the same thing. Ref. [] and the references referenced therein for more information. As previously stated, the moment’s method of and estimation for copula can be classified as semiparametric approach estimation.
5.6. Copula Fit Tests
We want to compare the empirical copula with the parametric estimator generated under the null hypothesis; for more information, see [,]. According to the theory, a test to see if C is well-represented by a given copula can be represented as
There are several well-known ways in the literature, such as Ref. [] or the rapid multiplier approach, as described by Refs. [,]. The empirically based goodness of fit test is written as
where is the empirical copula of the X and Y data.
the pseudo-observations from C, are calculated as:
are, respectively, the ranks of
is an estimator obtained using the pseudo observations and is a consistent estimator. The relevant test statistic, is the Cramer-von Miss, which is defined as
see [] as well as the references within.
6. Bayesian Estimation
The Bayes estimates of the joint posterior density (54) model parameters are computed in this part under the assumption that the random variable have independent gamma prior distributions with hyperparameter , provided by
and theses have a non-informative prior. The joint posterior density for the vector given the data is obtained by multiplying (9) or (28) as
Integrating out the (nuisance) hyperparameters yields marginal distributions of As a result, under the square error loss function, the Bayesian estimators of the parameters can be derived as follows:
6.1. Bayes MCMC Estimates
As the integrals in (56) are not in a closed form, the Markov-chain Monte-Carlo (MCMC) method is used. The posterior distribution and intractable integrals are derived using simulated samples from the posterior distribution in MCMC algorithms. Gibbs sampling and the Metropolis-Hastings (M-H) algorithm are also employed as MCMC techniques, see [,].
The M-H algorithm assumes that a candidate value can be generated from a proposed distribution for each iteration of the process. As a result, the applicant value is allowed if the approval chance is high enough. This method ensures that the Markov chain for the target density will converge. Finally, we can see that the advantage of the MCMC technique over the MLE method is that by creating probability intervals based on empirical posterior distribution, we can always obtain an acceptable interval estimate of the parameters. In MLE, this is frequently unavailable.
6.2. Bayes Credible Intervals
The following expression can be used to obtain a symmetric percent two-sided Bayes probability interval estimate of , denoted by .
We need appropriate numerical approaches to solve this non-linear equation because finding the interval and analytically is difficult.
7. Monte-Carlo Simulation
To assess the adaptability and flexibility of the proposed MBAPE model for different combinations of the true parameter value and sample size, an extensive Monte-Carlo simulation is conducted based on both independent and dependent cases. The point (MLE and MCMC) estimates and interval (ACI and BCI) estimates are evaluated based on 2,000 random samples of sizes . Extensive computations were performed using R statistical software via two useful statistical packages, namely, (i) the “coda” package proposed by Ref. [], and (ii) the “maxLik” package, which uses the Newton–Raphson method of maximization in the likelihood computations, proposed by Ref. [].
Comparison between point estimators is made in terms of their mean absolute bias (MAB) and root mean squared-error (RMSE) values. In addition, the behavior of 95% asymptotic/credible intervals are compared by their average confidence lengths (ACLs). In Table 1, different actual values of the model parameters and the associated hyperparameters are reported. Clearly, the values of hyperparameters are specified in such a way that the prior mean becomes the expected value of the target parameter. In this study, for both sets III and IV, we take as (0.2,0.15,0.05,0.1) and (0.5,0.4,0.2,0.3) as well as also assuming they have improper gamma priors. The MCMC estimates under squared-error loss are computed by running the M-H algorithm 12,000 times and ignoring the first 2,000 MCMC iterations. The calculated MLE of each unknown parameter is taken as an initial guess to obtain the corresponding Bayes estimate; for more information in this topic see [,,]. The simulation outcomes are provided in Table 2, Table 3, Table 4, Table 5 and Table 6. In these tables, the Bayes MCMC results under Prior 1 (as an example) are denoted as “MCMC.1”; similarly, the BCI estimates under Prior 1 are also denoted as “BCI.1”, for brevity. From Table 2, Table 3, Table 4, Table 5 and Table 6, it can be seen that the proposed estimates of the unknown MBAPE parameters are sufficiently efficient with respect to their minimum simulated values of RMSEs, MABs and ACLs. For all true parametric combinations, when increases under independent and dependent assumptions, the RMSEs, MABs and ACLs of the classical and Bayesian estimates decrease, as expected. Therefore, one can maximize the efficiency associated with all estimates by increasing the overall sample size. As , , , and increase, it is observed that their RMSEs, MABs and ACLs increase. A similar pattern is observed in the case of , , , and for increases, a dependent case. Since the Bayesian estimates include the flexible gamma information, the Bayesian estimates of all unknown parameters have overall lower RMSE and MAB values, so they performed better compared to the classical estimates in both independent and dependent cases. Consequently, the credible intervals estimates are also better than the asymptotic confidence intervals in terms of shortest ACLs. In particular, as the calculated variance in Prior 2 is less than Prior 1, it is also seen that the Bayes (point and interval) estimates have performed better based on Prior 2 than those obtained based on Prior 1. All accuracy criteria developed in this work support this statement.
Table 1.
Different parameter values under independent/dependent cases.
Table 2.
Simulated point and interval estimates of the BMAPE parameters for Sets I and II.
Table 3.
Simulated point and interval estimates of the BMAPE parameters using Set III when .
Table 4.
Simulated point and interval estimates of the BMAPE parameters using Set III when .
Table 5.
Simulated point and interval estimates of the BMAPE parameters using Set IV when .
Table 6.
Simulated point and interval estimates of the BMAPE parameters using Set IV when .
8. Motor Data Analysis
To demonstrate the practical utility of the proposed methodologies in an engineering phenomenon, one real data set reported by Ref. [] is analyzed. This data represents the failure times (in days) of a parallel system constituted by two identical motors, namely, (X) and (Y), see Table 7. We first check the suitability of the APE model to the complete motor data sets. For this purpose, for each X and Y data point, the Kolmogorov–Smirnov (KS) statistics and its -value, as well as the MLEs with their standard errors (SEs), are obtained and also provided in Table 7.
Table 7.
Failure times of two motors.
It shows that the APE distribution fits the motor data sets quite well. Moreover, some graphical methods for goodness of fit for the motor data sets are developed. Plots of fitted density, empirical cumulative distribution, probability-probability (PP) and quantile-quantile (QQ) for X and Y data sets are displayed in Figure 3 and Figure 4, respectively. It is clear that the two variables X and Y are fitted for the marginal APE distribution.
Figure 3.
The fitted density, empirical CDF, PP and QQ plots under X data.
Figure 4.
The fitted density, empirical CDF, PP and QQ plots under Y data.
Using the complete X and Y data sets, the MLEs and 95% ACIs of , , , and are calculated. Furthermore, since no prior information is available about the MBAPE parameters, the Bayesian (point/credible) estimates of , , , and are developed using the non-informative priors. Taking the maximum likelihood estimates of the same unknown quantities as initial values, the MCMC sampler is replicated 30,000 times and then the first 5,000 iterations removed as a burn-in. The point (with their SEs) estimates and interval (with their lengths) estimates of the unknown parameters based on X and Y data sets are computed and listed in Table 8 and Table 9, respectively. This indicates that the estimates of the unknown parameters obtained by maximum likelihood and Bayesian methods are very similar. Moreover, trace plots (in Figure 5) for MCMC draws of the posterior generated samples of , , , and showed that the MCMC simulated variates converge very well. It represents the sample mean and two bounds of 95% BCIs with solid (—) and dashed (- - -) lines, respectively. Moreover, the MCMC frequency estimates of , , , and using the Gaussian kernel are represented in Figure 6. It represents the sample mean with a vertical dash-dotted (:) line. It is evident that the generated posteriors of all unknown parameters are quite symmetrical. To sum up, the numerical findings of the suggested estimation methods based on motor data provide a good demonstration of the proposed model.
Table 8.
The point estimates of , , , and under motor data.
Table 9.
The 95% interval estimates of , , , and under motor data.
Figure 5.
MCMC trace plots of , , , and under motor data set.
Figure 6.
MCMC frequency estimates plots of , , , and under motor data set.
9. Conclusions
In this paper, we propose and study a new class of bivariate alpha power distributions via finite mixtures involving APE distributions as marginals, and also consider the copula-based construction of bivariate alpha power distributions. The developed class of distribution is made up of two different types of mixture, namely, type I, which starts with two independent APE distributions, and type II, which starts with a bivariate Gaussian copula. The model parameters are estimated using both frequentist (for example, method of moments and the method of maximum likelihood) and under the Bayesian (using independent gamma priors) paradigm. This distribution can be employed in practice for non-negative and positively correlated random variables because the joint distribution function and joint density function are in closed forms; in addition, it can be applied to model censored data as well. Under the Bayesian paradigm, we adopted the EM approach, which works quite well and may be efficiently employed to compute the MLEs, because the MLEs of the unknown parameters cannot be computed in closed form. To compare the behavior/efficiency of the proposed methods and to see the applicability of the different estimators, a small simulation study is considered, and one real data set is investigated to examine the applicability of the proposed model. From the simulation study, it appears that the point and interval estimates of all unknown parameters using the MCMC algorithm are performing better. Therefore, the Bayes M-H method used to estimate the unknown MBAPE parameters is recommended. However, whether this is a general trend or not is subject to a separate study with a wide range of possible values of the model parameters.
Author Contributions
Formal analysis, A.E.; Funding acquisition, R.A.; Investigation, H.R.; Methodology, M.N. and I.G.; Project administration, R.A.; Resources, R.A.; Software, A.E.; Supervision, M.N. and H.R.; Writing—original draft, I.G. and A.E.; Writing—review & editing, H.R. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2022R50), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data used to support the findings of this study are included within the article.
Acknowledgments
Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2022R50), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia).
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
Let us have a look at the BMAPE (type-I) distribution’s shape. From the joint density in (5), it is clear that there may be more than one critical point for the BMAPE (type-I) distribution. A numerical examination must be adopted for varying model parameter choices to obtain an idea of the critical points for this distribution
Also,
From (A1) and (A2), it is evident that there will be more than one point for the BMAPE (type-I) distribution.
References
- Mahdavi, A.; Kundu, D. A new method for generating distributions with an application to exponential distribution. Commun. Stat.-Theory Methods 2017, 46, 6543–6557. [Google Scholar] [CrossRef]
- McLachlan, G.J.; Krishnan, T. The EM Algorithm and Extensions; John Wiley & Sons: New York, NY, USA, 1997. [Google Scholar]
- Mead, M.E.; Cordeiro, G.M.; Afify, A.Z.; Al Mofleh, H. The alpha power transformation family: Properties and applications. Pak. J. Stat. Oper. Res. 2019, 15, 525–545. [Google Scholar] [CrossRef]
- Newcomb, S. A generalized theory of the combination of observations so as to obtain the best result. Am. J. Math. 1886, 8, 343–366. [Google Scholar] [CrossRef]
- Pearson, K. Contributions to the Mathematical Theory of Evolution. Philos. Trans. A 1894, 185, 71–110. [Google Scholar]
- Titterington, D.M.; Smith, A.F.M.; Makov, U.E. Statistical Analysis of Finite Mixture Distributions; Wiley: New York, NY, USA, 1985. [Google Scholar]
- Everitt, B.S.; Hand, D.J. Finite Mixture Distributions; John Wiley & Sons, Ltd.: The Netherlands, 1981. [Google Scholar]
- McLachlan, G.J.; Basford, K.E. Mixture Models Inferences and Applications to Clustering; Marcel Dekker: New York, NY, USA, 1988. [Google Scholar]
- Al-Hussaini, E.K.; Sultan, K.S. Reliability and hazard based on finite mixture Models. In Handbook of Statistics; Balakrishnan, N., Rao, C.R., Eds.; Handbook of Statistics: Amsterdam, The Netherlands, 2001; Volume 20, pp. 139–183. [Google Scholar]
- AL-Moisheer, A.S.; Alotaibi, R.M.; Alomani, G.A.; Rezk, H.R. Bivariate Mixture of Inverse Weibull Distribution: Properties and Estimation. Math. Probl. Eng. 2020, 2020, 5234601. [Google Scholar] [CrossRef]
- Nassar, M.; Alzaatreh, A.; Mead, M.; Abo-Kasem, O. Alpha power Weibull distribution: Properties and applications. Commun. Stat.-Theory Methods 2017, 46, 10236–10252. [Google Scholar] [CrossRef]
- Dey, A.; Alzaatreh, A.; Zhang, C.; Kumar, D. A new extension of generalized exponential distribution with application to ozone data. Ozone Sci. Eng. 2017, 39, 273–285. [Google Scholar] [CrossRef]
- Nadarajah, S.; Okorie, I.E. On the moments of the alpha power transformed generalized exponential distribution. Ozone Sci. Eng. 2018, 40, 330–335. [Google Scholar] [CrossRef]
- Nassar, M.; Afify, A.Z.; Dey, S.; Kumar, D. A new extension of Weibull distribution: Properties and different methods of estimation. J. Comput. Appl. Math. 2018, 336, 439–457. [Google Scholar] [CrossRef]
- Nassar, M.; Kumar, D.; Dey, S.; Cordeiro, G.M.; Afify, A.Z. The Marshall–Olkin alpha power family of distributions with applications. J. Comput. Appl. Math. 2019, 351, 41–53. [Google Scholar] [CrossRef]
- Nassar, M.; Afify, A.; Shakhatreh, M. Estimation Methods of Alpha Power Exponential Distribution with Applications to Engineering and Medical Data. Pak. J. Stat. Oper. Res. 2020, 16, 149–166. [Google Scholar] [CrossRef]
- Jaheen, Z. On record statistics from a mixture of two exponential distributions. J. Stat. Comput. Simul. 2005, 75, 1–11. [Google Scholar] [CrossRef]
- Al-Hussaini, E.K.; Hussein, M. Estimation under a Finite Mixture of Exponentiated Exponential Components Model and Balanced Square Error Loss. Open J. Stat. 2012, 2, 28–38. [Google Scholar] [CrossRef]
- Tahir, M.; Aslam, M.; Hussain, Z.; Khan, A. On finite 3-component mixture of exponential distributions: Properties and estimation. Cogent Math. 2016, 3, 1275414. [Google Scholar] [CrossRef]
- Diawara, N.; Carpenter, M. Mixture of bivariate exponential distributions. Commun. Stat.-Theory Methods 2010, 39, 2711–2720. [Google Scholar] [CrossRef]
- Rafiei, M.; Iranmanesh, A.; Nagar, D. A bivariate gamma distribution whose marginals are finite mixtures of gamma distributions. Stat. Optim. Inf. Comput. 2020, 8, 950–971. [Google Scholar] [CrossRef]
- Mardia, K.V. Measures of multivariate skewness and kurtosis with applications. Biometrika 1970, 57, 519–530. [Google Scholar] [CrossRef]
- Kotz, S.; Balakrishnan, N.; Johnson, N. Continuous Multivariate Distributions: Models and Applications, 2nd ed.; Wiley: New York, NY, USA, 2000; Volume 1. [Google Scholar]
- Balakrishnan, N.; Lai, C. Cotinuous Bivariate Distributions, 2nd ed.; Springer: New York, NY, USA, 2009. [Google Scholar]
- Nelson, R.B. An Introduction to Copulas; Springer: New York, NY, USA, 2006. [Google Scholar]
- Dellaert, F. The Expectation Maximization Algorithm; CiteSeerX 10.1.1.9.9735. gives an easier explanation of EM algorithm as to lowerbound maximization 2002; Georgia Institute of Technology: Atlanta, GA, USA, 2002. [Google Scholar]
- Kundu, D. Bivariate sinh-normal distribution and a related model. Braz. J. Probab. Stat. 2015, 29, 590–607. [Google Scholar] [CrossRef]
- El-Morshedy, M.; Alhussain, Z.A.; Atta, D.; Almetwally, E.M.; Eliwa, M.S. Bivariate Burr X Generator of Distributions: Properties and Estimation Methods with Applications to Complete and Type-II Censored Samples. Mathematics 2020, 8, 264. [Google Scholar] [CrossRef]
- Alotaibi, R.; Rezk, H.; Ghosh, I.; Dey, S. Bivariate exponentiated half logistic distribution: Properties and application. Commun. Stat.-Theory Methods 2020, 50, 6099–6121. [Google Scholar] [CrossRef]
- Sklar, A. Fonctions de Répartition à n Dimensions et Leurs Marges; Publications de l’Institut Statistique de l’Université de Paris: Paris, France, 1959; Volume 8, pp. 229–231. [Google Scholar]
- Kosmidis, I.; Karlis, D. Model-based clustering using copulas with applications. Stat. Comput. 2016, 26, 1079–1099. [Google Scholar] [CrossRef]
- Joe, H.; Xu, J. The Estimation Method of Inference Functions for Margins for Multivariate Models; Technical Report No. 166; Department of Statistics, University of British Columbia: Vancouver, BC, Canada, 1996. [Google Scholar]
- Kojadinovic, I.; Yan, J. Comparison of three semiparametric methods for estimating dependence parameters in copula models. Insur. Math. Econ. 2010, 47, 52–63. [Google Scholar] [CrossRef]
- Fermanian, J.D. Goodness-of-fit tests for copulas. J. Multivar. Anal. 2005, 95, 119–152. [Google Scholar] [CrossRef]
- Dobric, J.; Schmid, F. A goodness of fit test for copulas based on Rosenblatt’s transformation. Comput. Stat. Data Anal. 2007, 51, 4633–4642. [Google Scholar] [CrossRef]
- Genest, C.; Remillard, B. Validity of the parametric bootstrap for goodness-of-fit testing in semiparametric models. Ann. L’institut Henri Poincare (B) Probab. Stat. 2008, 44, 1096–1127. [Google Scholar] [CrossRef]
- Genest, C.; Rémillard, B.; Beaudoin, D. Goodness-of-fit tests for copulas: A review and a power study. Insur. Math. Econ. 2009, 44, 199–213. [Google Scholar] [CrossRef]
- Kojadinovic, I.; Yan, J.; Holmes, M. Fast large-sample goodness-of-fit tests for copulas. Stat. Sin. 2011, 21, 841–871. [Google Scholar] [CrossRef]
- Meintanis, S.G. Test of fit for Marshall–Olkin distributions with applications. J. Stat. Plan. Inference 2007, 137, 3954–3963. [Google Scholar] [CrossRef]
- Metropolis, N.; Rosenbluth, A.W.; Rosenbluth, M.N.; Teller, A.H. Equation of state calculations by fast computing machines. J. Chem. Phys. 1953, 21, 1087–1091. [Google Scholar] [CrossRef]
- Hastings, W.K. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 1970, 57, 97–109. [Google Scholar] [CrossRef]
- Plummer, M.; Best, N.; Cowles, K.; Vines, K. Convergence diagnosis and output analysis for MCMC. R News 2006, 6, 7–11. [Google Scholar]
- Henningsen, A.; Toomet, O. maxLik: A package for maximum likelihood estimation in R. Comput. Stat. 2011, 26, 443–458. [Google Scholar] [CrossRef]
- Gilks, W.R.; Best, N.G.; Tan, K.K.C. Adaptive Rejection Metropolis Sampling within Gibbs Sampling. Appl. Stat. 1995, 44, 455–472. [Google Scholar] [CrossRef]
- Martino, L.; Yang, H.; Luengo, D.; Kanniainen, J.; Corander, J. A Fast Universal Self-Tuned Sampler Within Gibbs Sampling. Digit. Signal Process. 2015, 47, 68–83. [Google Scholar] [CrossRef]
- Fitzgerald, W.J. Markov chain Monte Carlo methods with applications to signal processing. Signal Process. 2001, 81, 3–18. [Google Scholar] [CrossRef]
- Alotaibi, R.; Khalifa, M.; Almetwally, E.; Ghosh, I.; Rezk, H. Classical and Bayesian Inference of a Mixture of Bivariate Exponentiated Exponential Model. J. Math. 2021, 2021, 5200979. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).