Abstract
As an alternative to available bivariate Birnbaum–Saunders (BS) models, a conditionally specified distribution with BS conditionals is considered. The behavior of conditional or pseudo-likelihood parameter estimates of the model parameters is investigated via simulation. A comparison using a mineralogy data set suggests that the conditionally specified model outperforms competing models (with BS marginals). An analogous comparison using a well-known data set of Australian athletes also suggests the superiority of the conditionally specified model. Further investigation of its possible general superiority is suggested.
1. Introduction
The BS distribution, which is asymmetric, was initially introduced as a suitable model for lifetimes in fatigue settings (see Leiva [1] and Balakrishnan and Kundu [2]). Other recent extensions of the BS model are delivered in the works of Reyes et al. [3], Martínez-Flórez et al. [4], Gómez-Déniz and Gómez [5] and Mazucheli et al. [6], among others. However, it has also been shown to function as a suitable model for other non-negative variables. Recently, certain bivariate BS models have been proposed in the literature. It can and has been argued that the shape of appropriate conditional densities, which are suggested by cross sections of bivariate histograms, are easier to visualize and that these conditional features are more informative about appropriate two dimensional models than any modeling suggestion based on marginal insights alone. With this in mind, in many situations, it may be worthwhile to consider bivariate models with BS conditionals, as we will, rather than models with BS marginals. For further discussion of the advantages of conditional specification when compared with marginal specification, see Arnold et al. [7].
The available bivariate Birnbaum–Saunders models, while providing adequate marginal fits, are not always capable of adapting to conditional features of the data. The goal of the current paper is to introduce alternative models that are endowed with a dependence structure that is driven by modeling conditional features of data sets.
In Section 2, we review the definition of the BS distribution and identify the general bivariate model with BS conditionals. Algorithms for simulating draws from this conditionally specified model are then described. Parameter estimation is most easily implemented via conditional (or pseudo) likelihood. In Section 4, the performance of this estimation strategy is investigated via simulation, and in Section 5, the conditionally specified model is compared with two recently proposed models with BS marginals when applied to two data sets, one of a mineralogical nature and the other a subset of the Australian athletes data as reported in Cook and Weisberg [8]. The performance of the conditionally specified BS distribution is encouraging.
2. The Proposed Conditionally Specified Model
The BS density with unit scale parameter is typically expressed in the following form
where . We will reparametrize the model, defining , so the density becomes
where and . Note that the parametrization in Equation (1) belongs to the natural exponential family of distributions. This guarantees that the parameter space for is convex. If a random variable X has (1) as its density, we write ; here the 1 refers to the fact that the scale parameter is set equal to 1. If this random variable is multiplied by a positive scale parameter , we denote the resulting distribution by . Clearly, with , this is a one parameter exponential family. The corresponding parameter space is . Using a result in Arnold and Strauss [9], the most general family of bivariate densities with conditionals in this one parameter family (1) (i.e., belonging to the natural exponential family of distributions) is of the form
where and The corresponding marginal density of X is
which is clearly not of the BS form unless , which corresponds to the case in which X and Y are independent. The conditional densities are indeed of the BS form. Thus, specifically
and
For added flexibility, we introduce scale parameters and in this bivariate density. Thus, if has density (2) and if we define and , then we can say that has a general bivariate BS conditionals (BVBSC) distribution and we will write Figure 1 shows the scatterplot for the distribution with different values for the vector . Note that the model can assume different shapes for the bivariate scatterplot depending on the values for the vector .
Figure 1.
Scatterplot for distribution with different values for .
2.1. Drawing Values from the Model
One advantage of the model is that both conditional distributions are specified in relatively simple forms. There is a well-known formula expressing a variable denoted by U as a function of a standard normal variable Z, thus
This can used whenever a BS draw is to be simulated.
There are two approaches that can be used to simulate draws from a general BS conditionals distribution. One approach involves simulation of a draw from the X marginal followed by use of the conditional density of Y given X to find the corresponding second coordinate of . For this approach, we wish to avoid drawing values directly from the marginal distribution because of possible problems in the computation of the normalizing constant. For this reason, to draw values from the marginal distribution of X, we propose the following steps based on the Metropolis–Hastings algorithm (Algorithm 1):
| Algorithm 1 Metropolis–Hastings algorithm. |
|
To eliminate the dependence among the drawn values, it is possible to consider a burn-in period and to thin the resulting series. Having drawn a final sample for X, say , we can simulate corresponding values for Y (using (5)) such that . Finally, the scale parameters can be introduced by setting and for . The pairs then constitute a simulated random sample from the model.
An alternative approach is one involving Gibbs sampler simulation utilizing the relative simplicity of the two corresponding conditional distributions (and the ease of drawing from univariate BS distributions). For it, we begin with an arbitrary value for X, say . Then, we use the conditional distribution of Y given ( as in (5)) to simulate a corresponding . Next, we generate a simulated value of X, say , using the conditional distribution of X given ( as in (4)), and continue in this fashion using (5) and (4) alternately. Use of burn-in and thinning is also recommended for this approach.
3. Estimation
Up to an additive constant, the log-likelihood function for can be written as
where
However, the maximization of (7) can be challenging because of the need to repeatedly compute the quantity . For this reason, we recommend the implementation of the (conditional or) pseudo-likelihood approach. Besag [10] (see also Arnold and Strauss [11]) defined the pseudo-maximum likelihood estimator as that parameter vector that maximizes the pseudo-likelihood function, which for the bivariate case is given by
For our proposed bivariate density, up to an additive constant, the log-pseudo-likelihood function is given by
The pseudo-likelihood estimates of the parameters are then obtained by numerically maximizing this expression.
4. Simulation Study
In this section, we present a brief simulation study in order to assess the recovery of parameter values based on the pseudo-maximum likelihood estimation discussed in Section 3. The data were drawn based on the first method described in Section 2.1, considering a burn-in period and a thinning of 1000 and 10, respectively. We consider ten different combinations for and . We also consider three sample sizes: 50, 100 and 200. For each combination of parameters and sample sizes, we consider 10,000 replicates. We present the average bias (AB), the mean of the standard errors (SE) based on the hessian matrix and the root of the mean squared error () for those replicates. Results are presented in Table 1. It can be observed that accurate estimation of the dependence parameter appears to require a quite large sample size. This is not unexpected since dependence parameters are routinely more elusive in conditionally specified models. Simulation and applications codes were written in the R programming language R [12].
Table 1.
Simulation study to assess the recovery parameters for the BVBSC model under different scenarios and based on the pseudo-maximum likelihood estimation method.
5. Applications
In this section, we present two applications to compare our proposal with two other bivariate BS distributions that have been considered in the literature. The parameter estimation was performed based on the pseudo-maximum likelihood (ML) estimation method for the BVBSC model and the traditional ML estimation method for the other models. Both data sets (Section 5.1 and Section 5.2) have been used in many papers, particularly in univariate distributional settings.
Regarding the first application, the variable Neodymium was used in Reyes et al. [3] and Bourguignon and Gallardo [13] in a univariate context. For the second application, Arellano-Valle et al. [14] used the variable weight, also in a univariate context.
5.1. Minerals Data Set
This data set corresponds to lanthanum and neodymium measurements collected by the Department of Mines of the University of Atacama, Chile, representing 86 samples of both minerals (see Supplementary Materials). Some descriptive statistics for this data set are presented in Table 2.
Table 2.
Descriptive measure for minerals data set.
We fitted the bivariate BS (BVBS) discussed in Kundu et al. [15] and the bivariate BS (BVBS2) proposed by Lemonte et al. [16]. The formulas for the densities corresponding to these models are displayed in the Appendix A. Results are presented in Table 3. In order to compare the models, the AIC (see Akaike [17]) and BIC (see Schwarz [18]) criteria are considered. The conditionally specified model appears to be markedly superior to the other two bivariate BS models. It should be remarked, however, that, since the conditionally specified model fails to have BS marginals, it might be difficult to convince a scientist to use it if they feel that there are valid scientific reasons to believe that BS marginals are appropriate. Figure 2 shows the better fit for the mineral data set of the BVBSC when compared with the BVBS and BVBS2 models.
Table 3.
Estimates for different bivariate BS-type models in the minerals data set based on the pseudo-ML estimation (BVBSC) model and the ML estimation (BVBS and BVBS2 models) and standard errors in brackets.
Figure 2.
Scatterplot of minerals data and contour level of the fitted models. BVBS (left panel), BVBS2 (center panel) and BVBSC (right panel).
5.2. Australian Athletes Data Set
These data are related to measurements of 202 Australian athletes found in Cook and Weisberg [8]. For each athlete various characteristics of the blood were determined together with data regarding the sport, body size and gender of the athlete. In particular, we consider two measurements, the Body fat percentage (Bfat) and the weight in kg. Some descriptive statistics for this data set are presented in Table 4.
Table 4.
Descriptive measure for athletes data set
We fitted the same three bivariate BS models to these data as were fitted in Section 5.1. Results are presented in Table 5. Again, the conditionally specified model appears to be superior to the other two bivariate BS models. Figure 3 shows the better fit for the athlete data set of the BVBSC when compared with the BVBS and BVBS2 models.
Table 5.
Estimates for different bivariate BS-type models in the athlete data set based on the pseudo-ML estimation (BVBSC) model and the ML estimation (BVBS and BVBS2 models) and standard errors in brackets.
Figure 3.
Scatterplot of athlete data and contour level of the fitted models. BVBS (left panel), BVBS2 (center panel) and BVBSC (right panel).
6. Discussion
The BS density has been utilized in a wide range of scientific and economic settings as a suitable model for non-negative data. As was remarked in the Introduction, its genesis was in the context of fatigue life distributions, but its relative simplicity has led to successful application in many other fields. There are, of course, many ways in which we might seek to develop suitable bivariate BS models. One approach involves the use of copulas to link BS marginals. An example of this approach is the Kundu et al. [15] model, which can be viewed as utilizing a classical bivariate normal copula. The model proposed by Lemonte et al. [16] involves marginal transformations of a well-known bivariate skew normal model. The proposal, in the current paper, of using a conditionally specified model does have the drawback of not having BS marginals, but, as has been demonstrated, it fares well, when compared to the two models described above, in terms of fitting the data sets discussed here.
The field is, of course, open for the development of alternative bivariate BS distributions distinct from those discussed here. It must be kept in mind that we cannot expect to find an overall best bivariate BS model. Some will be good for certain kinds of data, while others will be better for different data configurations.
Supplementary Materials
This manuscript contains the mineral data set presented in Section 5.1 as Supplementary Materials, available at https://www.mdpi.com/article/10.3390/sym13050762/s1.
Author Contributions
Conceptualization, B.C.A., D.I.G. and H.W.G.; Formal analysis, B.C.A., D.I.G. and H.W.G.; Investigation, B.C.A., D.I.G. and H.W.G.; Methodology, B.C.A. and H.W.G.; Software, D.I.G. and H.W.G.; Supervision, B.C.A. and H.W.G.; Validation, B.C.A., D.I.G. and H.W.G. All of the authors contributed significantly to this research article. All authors have read and agreed to the published version of the manuscript.
Funding
The research of H.W. Gómez was supported by SEMILLERO UA-2021 project, Chile.
Data Availability Statement
The data of the first application can be found in the Supplementary Materials and of the second application in Cook and Weisberg [8].
Acknowledgments
The authors would like to thank the anonymous referees for their comments and suggestions, which significantly improved our manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
The BVBS density presented in Kundu et al. [15] is
where and are defined as
and denotes the joint probability density of the bivariate standard normal distribution with correlation parameter equal to , i.e.,
On the other hand, the BVBS2 density studied by Lemonte et al. [16] is
where and are presented in (A1) and and denote the density and cumulative distribution functions for the univariate standard normal distribution.
References
- Leiva, V. The Birnbaum-Saunders Distribution; Academic Press: New York, NY, USA, 2016. [Google Scholar]
- Balakrishnan, N.; Kundu, D. Birnbaum-Saunders distribution: A review of models, analysis, and applications. Appl. Stoch. Model. Bus. Ind. 2019, 35, 4–49. [Google Scholar] [CrossRef]
- Reyes, J.; Barranco-Chamorro, I.; Gallardo, D.I.; Gómez, H.W. Generalized Modified Slash Birnbaum-Saunders Distribution. Symmetry 2018, 10, 724. [Google Scholar] [CrossRef]
- Martínez-Flérez, G.; Barranco-Chamorro, I.; Bolfarine, H.; Gómez, H.W. Flexible Birnbaum–Saunders Distribution. Symmetry 2018, 11, 1305. [Google Scholar] [CrossRef]
- Gómez-Déniz, E.; Gómez, L. The Rayleigh Birnbaum Saunders Distribution: A General Fading Model. Symmetry 2020, 12, 389. [Google Scholar] [CrossRef]
- Mazucheli, J.; Leiva, V.; Alves, B.; Menezes, A.F.B. A New Quantile Regression for Modeling Bounded Data under a Unit Birnbaum–Saunders Distribution with Applications in Medicine and Politics. Symmetry 2021, 13, 682. [Google Scholar] [CrossRef]
- Arnold, B.C.; Castillo, E.; Sarabia, J.M. Conditional Specification of Statistical Models; Springer Series in Statistics; Springer: Berlin/Heidelberg, Germany, 1999. [Google Scholar]
- Cook, R.D.; Weisberg, S. An Introduction to Regression Graphics; John Wiley& Sons Inc.: New York, NY, USA, 1994. [Google Scholar]
- Arnold, B.C.; Strauss, D.J. Bivariate distributions with conditionals in prescribed exponential families. J. R. Stat. Soc. Ser. B 1991, 53, 365–375. [Google Scholar]
- Besag, J. Statistical Analysis of Non-Lattice Data. J. R. Stat. Soc. Ser. D 1975, 24, 179–195. [Google Scholar] [CrossRef]
- Arnold, B.C.; Strauss, D.J. Pseudolikelihood estimation: Some examples. Sankhya Ser. B 1991, 53, 233–243. [Google Scholar]
- R Development Core Team. A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2019. [Google Scholar]
- Arellano-Valle, R.B.; Castro, L.M.; Genton, M.G.; Gómez, H.W. Bayesian Inference for Shape Mixtures of Skewed Distributions, with Application to Regression Analysis. Bayesian Anal. 2008, 3, 513–540. [Google Scholar] [CrossRef]
- Bourguignon, M.; Gallardo, D.I. Reparameterized inverse gamma regression models with varying precision. Statistica Neerlandica 2020, 74, 611–627. [Google Scholar] [CrossRef]
- Kundu, D.; Balakrishnan, N.; Jamalizadeh, A. Bivariate Birnbaum-Saunders distribution and associated inference. J. Multivar. Anal. 2010, 101, 113–125. [Google Scholar] [CrossRef]
- Lemonte, A.J.; Martínez-Florez, G.; Moreno-Arenas, G. Multivariate Birnbaum-Saunders distribution: Properties and associated inference. J. Stat. Comput. Simul. 2015, 85, 374–392. [Google Scholar] [CrossRef]
- Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control. 1974, 19, 716–723. [Google Scholar] [CrossRef]
- Schwarz, G. Estimating the dimension of a model. Ann Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).