Next Article in Journal
More Effective Criteria for Testing the Oscillation of Solutions of Third-Order Differential Equations
Next Article in Special Issue
Statistical Inferences about Parameters of the Pseudo Lindley Distribution with Acceptance Sampling Plans
Previous Article in Journal
Solving Particle–Antiparticle and Cosmological Constant Problems
Previous Article in Special Issue
Bayesian Sensitivity Analysis for VaR and CVaR Employing Distorted Band Priors
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bayesian Inference for a Hidden Truncated Bivariate Exponential Distribution with Applications

1
Department of Mathematics and Statistics, University of North Carolina, Wilmington, NC 28403, USA
2
Department of Mathematical Sciences, Bentley University, Waltham, MA 02452, USA
3
Department of Mathematical Data Sciences, Hanyang University, Ansan 15588, Republic of Korea
*
Authors to whom correspondence should be addressed.
Axioms 2024, 13(3), 140; https://doi.org/10.3390/axioms13030140
Submission received: 13 January 2024 / Revised: 4 February 2024 / Accepted: 16 February 2024 / Published: 22 February 2024
(This article belongs to the Special Issue Computational Statistics and Its Applications)

Abstract

:
In many real-life scenarios, one variable is observed only if the other concomitant variable or the set of concomitant variables (in the multivariate scenario) is truncated from below, above, or from a two-sided approach. Hidden truncation models have been applied to analyze data when bivariate or multivariate observations are subject to some form of truncation. While the statistical inference for hidden truncation models (truncation from above) under the frequentist and the Bayesian paradigms has been adequately discussed in the literature, the estimation of a two-sided hidden truncation model under the Bayesian framework has not yet been discussed. In this paper, we consider the Bayesian inference for a general two-sided hidden truncation model based on the Arnold–Strauss bivariate exponential distribution. In addition, a Bayesian model selection approach based on the Bayes factor to select between models without truncation, with truncation from below, from above, and two-sided truncation is also explored. An extensive simulation study is carried out for varying parameter choices under the conjugate prior set-up. For illustrative purposes, a real-life dataset is re-analyzed to demonstrate the applicability of the proposed methodology.

1. Introduction

A hidden truncation model, also known as a selective reporting model or latent variable model, is a mathematical model that describes observed variables truncated with respect to some hidden covariable(s). An example of hidden truncation provided by Arnold and Beaver [1] is that the observed variable is the waist size for uniforms of elite troops, and the hidden covariable is the height of the troops. The elite troops are selected only if they meet a specific minimum height requirement. Therefore, the variable waist size will not be observed unless the troop meets the minimum height requirement. Hidden truncation models have been used in studying personal income data when an individual’s income is either not always correctly specified or may be unreported. Different hidden truncation models and their inference are developed mostly using the classical approach. If the data are subject to hidden truncation, then it is natural to expect that the behavior of the random phenomenon differs from a non-truncated model. Note that the notion of the mixture model is related to the hidden truncated model. Arnold and Gomez [2] showed that the two-parameter skew–normal distribution can be viewed as having arisen from a standard normal density by hidden truncation or an additive component construction, and they conjectured that this apparent relation between hidden truncation and mixture models only occurs in the normal case. However, the hidden truncation and mixture models are slightly different, as in a mixture model, there is no such hidden covariable subject to truncation. Sometimes, through a natural mechanism or otherwise, a dataset has already been subject to one of the types of hidden truncation, such as the Quasar data as originally and independently studied in Efron and Petrosian [3].
A non-exhaustive list of the references on hidden truncation models is as follows. One may cite the seminal paper by Azzalini [4] to propel research endeavors in this area. It is conjectured and supported by appropriate real-life scenarios, including but not limited to income modeling, astronomical data, survival analysis, etc., such that hidden truncation models provide flexible alternatives to the somewhat constrained and elliptically contoured distributions for modeling bivariate/multivariate or matrix-variate data in higher dimensions. Arnold [5] developed and studied several univariate, bivariate, and multivariate parametric families of distributions based on hidden truncation. Arnold and Beaver [1] suggested that a hidden truncation model involves a plethora of model parameters. Subsequently, it almost always involves a ubiquitous normalizing constant, making inferences under a frequentist framework virtually impossible. This is mainly because parameter estimates in conjunction with such models will not be available in closed forms or at least analytically intractable, and quite often, they have constraints. Furthermore, using a frequentist approach, non-normal models involving a two-sided truncation would require special attention when exploring inferential aspects. However, efforts have been made to estimate the model parameters for hidden truncation models under the frequentist approach with some limitations. For example, Ghosh and Nadarajah [6] discussed the inference under the classical approach of a hidden truncated Pareto (type-II) model starting from a bivariate Pareto (type-II) model. Hidden truncation in bivariate and multivariate Pareto data has been developed and applied to income modeling (see the references in [7]). Zaninetti [8] found that a left-truncated beta distribution better fits the initial mass function for stars compared to the lognormal distribution, which has commonly been used in astrophysics. Kotz et al. [9] (Chapter 44) also discussed the details of the formation of distributions through truncation.
Under the Bayesian paradigm, one-sided hidden truncated Pareto (type-II and type-IV) has been discussed by Ghosh [10,11], but not much discussion has been made on the applicability as well as the efficacy of the proposed Bayesian methodology in terms of prior choices, including the choice of hyperparameters, among others. Noticeably, nonparametric methods for estimation corresponding to two-sided truncated data have been developed by Efron and Petrosian [3]. This serves as a major motivation for the current research work.
In this paper, we explore the estimation of model parameters for a two-sided hidden truncated model, assuming that the data come from a bivariate exponential distribution defined by Arnold and Strauss [12]. Since the exponential distribution is an important probability model for studies involving non-negative random variables, such as the frailty models, this probability model is selected to utilize the proposed Bayesian estimation strategy with the hope that the proposed technique can be well adopted for several other non-normal models. For a detailed study on the genesis of the construction of hidden truncated non-normal models, readers may refer to [7] and the references therein. Next, we delve into the hidden truncation models and discuss the reason to start with a bivariate exponential-type model. Hidden truncation involves ubiquitous normalizing constants followed by the truncation parameter(s) masking within other parameters embedded in such a way that there is an estimate for the composite function, say ψ ( θ ) , in which θ involves not only the truncation parameter but also location and scale parameters of the conditioning variable. Moving away from the normal distribution under the hidden truncation model will increase the difficulty in model fitting and related statistical inference. In search for a simpler non-normal model involving a two-sided hidden truncation, the bivariate exponential distribution proposed by Arnold and Strauss [12] is selected for illustrative purposes. As the statistical parameter estimation of a hidden truncation model is challenging, especially when there is a lack of information on the truncation limits in the model, we aim to provide a feasible approach based on the Bayesian paradigm that can be used in practical situations.
The rest of the paper is organized as follows. In Section 2, the mathematical derivation for a basic two-component model of hidden truncated distributions and the mathematical derivation of a two-sided hidden truncated bivariate exponential (HTBEXP, in short) model are provided. Section 3 provides the details of the Bayesian inference adopted in this paper for the HTBEXP model developed in Section 2 under both an informative and non-informative and improper priors set-up. Section 4 explores the Bayes factor to deal with model selection procedures out of all possible candidate models. In Section 5, a Monte Carlo simulation study under the Bayesian paradigm is used to evaluate the performance of the proposed Bayesian estimation method. For illustrative purposes, a real-life dataset is re-analyzed in Section 6 to exhibit the efficacy of the proposed estimation strategy. Finally, some concluding remarks are presented in Section 7.

2. Hidden Truncation in Arnold–Strauss Bivariate Exponential Model

Let ( X , Y ) be a two-dimensional absolutely continuous random vector. Consider the conditional distribution of X given Y M , where M is a Borel set in R . In a hidden truncation model, variable X is the variable of interest, and variable Y is a related unobservable variable that may be related to X. Let f X , Y ( x , y ) be the joint probability density function (PDF) of the random vector ( X , Y ) , and let f X ( x ) and f Y ( y ) be the marginal PDFs of random variables X and Y, respectively. The conditional PDF of X given Y M can be expressed as
f X | Y M ( x ) = f X ( x ) Pr ( Y M | X = x ) Pr ( Y M ) .
The following three forms of hidden truncation models can be considered:
(i)
Truncation from below (also known as lower truncation): M = ( d 1 , ) , where < d 1 < is the lower truncation point;
(ii)
Truncation from above (also known as upper truncation): M = ( , d 2 ) , where < d 2 < is the upper truncation point;
(iii)
Two-sided truncation: M = ( d 1 , d 2 ] , where d 1 and d 2 are the lower and upper truncation points, respectively, with < d 1 < d 2 < .
For a two-sided hidden truncation, observations are only available for X’s whose corresponding concomitant variable Y is in ( d 1 , d 2 ] , for < d 1 < d 2 < . Hence, Equation (1) reduces to
f X | Y ( d 1 , d 2 ] ( x | d 1 < Y d 2 ) = f X ( x ) Pr d 1 < Y d 2 | X = x Pr d 1 < Y d 2 .
This type of hidden truncation model is characterized as follows:
  • The underlying marginal distribution of X can be specified by the PDF f X ( x ) ;
  • The conditional distribution of Y given X = x can be specified by the conditional PDF f Y | X y | x ;
  • The specified values of truncation points are denoted by d 1 and d 2 ;
  • There could be some other model parameters in addition to d 1 and d 2 .
For the conditional PDF in Equation (2), if we consider d 1 and d 2 + , the PDF reduces to the unconditional marginal PDF of X. The shape of the conditional PDF will be more sensitive for smaller values of the lower truncation point d 1 compared to small or large values of the upper truncation point d 2 . From the outset, the hidden truncation from below will not augment the original model since the resulting density will again be a member of the same family of distributions with only a reparametrization of the parent model. Consequently, we focus primarily on the hidden truncation from both sides in the more general framework, as the hidden truncation from below and/or from above will be a particular case of two-sided hidden truncation.
In this paper, for the applications in situations with random variables having non-negative supports (e.g., the survival and reliability analyses), a two-dimensional random vector ( X , Y ) with non-negative coordinates that follows the Arnold–Strauss bivariate exponential (ASBE) distribution is considered. The joint PDF of the random vector ( X , Y ) of ASBE distribution is
f X , Y ( x , y ) = K exp a x + b y + c x y , for x > 0 , y > 0 , a > 0 , b > 0 , c > 0 ,
where K is the normalizing constant defined as
K = c exp a b c E i a b c 1 .
Here, E i ( x ) is the exponential integral function defined by E i ( x ) = x t 1 exp ( t ) d t . The associated marginal PDF of X will be
f X ( x ) = K exp a x b + c x , x > 0 ,
and the marginal density of Y will be
f Y ( y ) = K exp b y a + c y , y > 0 .
Therefore, the normalizing constant K can also be defined as
K = 0 exp ( a x ) b + c x d x 1 = 0 exp ( b y ) a + c y d y 1 .
Since P r Y > y | X = x for each fixed X = x is monotonically decreasing in x for all possible values of y, the joint density in Equation (3) is negative quadrant dependent, and hence it has a negative correlation coefficient. In the context of hidden truncation, we assume that the random variable of interest X and the hidden covariable Y are dependent non-negative random variables that follow the ASBE distribution.
Note that the conditional PDF of Y given X = x is
f Y | X = x y | X = x = b + c x exp ( b + c x ) y , y > 0 .
Next, a two-sided hidden truncation for the random variable Y, say d 1 < Y d 2 , where 0 d 1 < d 2 < is considered. We can obtain
Pr d 1 < Y d 2 | X = x = d 1 d 2 ( b + c x ) exp ( b + c x ) y d y = exp ( b + c x ) d 1 exp ( b + c x ) d 2 ,
and
Pr d 1 < Y d 2 = d 1 d 2 K exp b y a + c y d y .
Therefore, the conditional PDF of X given d 1 < Y d 2 , can be expressed as
f x | d 1 < Y d 2 = f X ( x ) f Y | X = x y | X = x Pr d 1 < Y d 2 = exp a x exp ( b + c x ) d 1 exp ( b + c x ) d 2 b + c x d 1 d 2 exp b y a + c y d y = exp a x exp ( b + c x ) d 1 exp ( b + c x ) d 2 b + c x M ( a , b , c , d 1 , d 2 ) , x > 0 ,
where
M ( a , b , c , d 1 , d 2 ) = d 1 d 2 exp b y a + c y d y
is a constant depending only on the parameters and not the random variable X. Figure 1 presents the hidden truncated PDFs of X given d 1 < Y d 2 for different values of the parameters a, b, c, d 1 , and d 2 based on the ASBE distribution along with the non-truncated exponential distribution.
From Figure 1, the hidden truncated PDFs are always right-skewed with different intensities for given choices of a, b, and c along with the truncated parameters 0 d 1 < d 2 < . In Section 3, we discuss in detail the Bayesian inference of the density given in Equation (7).

3. Bayesian Inference

Let X = ( X 1 , X 2 , , X n ) be a random sample from the hidden-truncated distribution with the PDF in Equation (7), and x = ( x 1 , x 2 , , x n ) be the observed values of X. Based on x = ( x 1 , x 2 , , x n ) , the associated likelihood function can be expressed as
L ( a , b , c , d 1 , d 2 | x ) = 1 [ M ( a , b , c , d 1 , d 2 ) ] n i = 1 n exp a x i exp ( b + c x i ) d 1 exp ( b + c x i ) d 2 b + c x i .
This section discusses the choices for the priors of the model parameters. Due to the condition that d 1 < d 2 , we consider dependent priors for parameters d 1 and d 2 . For parameters a , b , and c, the following two scenarios are considered:
  • Independent priors for parameters a ,   b , and c;
  • Dependent priors for parameters a ,   b , and c.

3.1. Prior Specifications

3.1.1. Independent Priors for the Parameters a, b, and c

When the underlying distribution follows an exponential distribution, it is common that a gamma-type distribution is used as a prior distribution in a subjective Bayesian framework. Since the ASBE distribution is a member of the exponential family, the conjugacy of priors is expected. Consider a random variable that follows a gamma distribution with shape parameter α and rate parameter β , denoted as G a m m a ( α , β ) , with the following PDF
g ( w ; α , β ) = β α Γ ( α ) w α 1 exp ( β w ) , w > 0 , α > 0 , β > 0 .
The following independent priors for parameters a, b, c, and dependent priors for d 1 , and d 2 are considered:
  • The prior for parameter a
    π 1 ( a ) G a m m a ( α 1 , β 1 ) .
  • The prior for parameter b
    π 2 ( b ) G a m m a ( α 2 , β 2 ) .
  • The prior for parameter c
    π 3 ( c ) G a m m a ( α 3 , β 3 ) .
  • For the upper and lower truncation points d 1 and d 2 , since d 1 < d 2 , the following dependent priors are considered:
    π 4 ( d 1 ) 1 ( 1 + d 1 ) 2 I ( 0 < d 1 < d 2 ) , π 5 ( d 2 | d 1 ) = 1 δ d 1 I ( d 1 < d 2 < δ ) .
Remark 1.
Regarding the prior specifications on d 1 and d 2 , the marginal prior distribution for d 1 is assumed to be a truncated distribution with support ( 0 , d 2 ] . A uniform distribution for d 2 | d 1 on the interval [ d 1 , δ ] is used, where δ is a hyperparameter that should be chosen properly for the underlying Bayesian analysis. For instance, the informed expert can provide a judicious choice of hyperparameters. These conditional priors for the two truncation points may not be the optimal choice of priors, but among all the other different priors that we have considered, this one seems to perform satisfactorily.
Based on the likelihood function in Equation (9), along with the joint prior distribution under independent a priori, and with the dependent prior choices for d 1 and d 2 , the joint posterior distribution of the five model parameters can be expressed as
h ( a , b , c , d 1 , d 2 | x ) L ( a , b , c , d 1 , d 2 | x ) × π 1 ( a ) × π 2 ( b ) × π 4 ( c ) × π 4 ( d 1 ) × π 5 ( d 2 | d 1 ) 1 [ M ( a , b , c , d 1 , d 2 ) ] n i = 1 n exp a x i exp ( b + c x i ) d 1 exp ( b + c x i ) d 2 b + c x i × g ( a ; α 1 , β 1 ) g ( b ; α 2 , β 2 ) g ( c ; α 3 , β 3 ) 1 ( 1 + d 1 ) 2 1 δ d 1 .
Note that the marginal posterior distribution of each of the model parameters can be derived from the joint posterior in Equation (10). For instance, the posterior distribution of a can be obtained as
p 1 ( a | x ) = 1 D 0 d 2 d 1 δ 0 0 h ( a , b , c , d 1 , d 2 | x ) d b d c d d 2 d d 1 ,
where D is the normalizing constant defined as
D = 0 d 2 d 1 δ 0 0 0 h ( a , b , c , d 1 , d 2 | x ) d a d b d c d d 2 d d 1 .
Similarly, the posterior distributions for the remaining other parameters can be obtained. It is worth pointing out that each of the marginal distributions involves the quantity M ( a , b , c , d 1 , d 2 ) defined in Equation (8), which is a constant. Due to the presence of the term D, an accurate approximation of the expectation in calculating the posterior mean is needed, and a five-dimensional numerical integration will be involved here. In this work, Gibbs sampling is used to find the Bayes estimates (posterior means) of the model parameters. Detailed procedures associated with the Metropolis–Hastings (MH) algorithms will be presented in Section 3.2.

3.1.2. Dependent Priors for Parameters a, b and c

In reality, the model parameters may be dependent. Hence, it is more reasonable to consider dependent priors. Here, some general prior choices for the model parameters are considered. Suppose we have specific information (in the form of a prior belief) about any of the parameters. Then, one can build informative priors for the rest of the parameter(s). For instance, if we know that parameter a can take values between 2 and 4, then a reasonable prior distribution for the parameter a would be any priors with support on ( 2 , 4 ) , e.g., a uniform distribution on the interval ( 2 , 4 ) . Although the use of dependent priors will increase the complexity of our analysis, one may still want to consider such priors. For analytically intractable models as given in Equation (7), the corresponding posterior analysis can be efficiently performed by Markov Chain Monte Carlo (MCMC) algorithms. However, in a real-life scenario, it is common that one might not have explicit knowledge of the priors for each parameter and the dependency between these parameters. Specifically, this occurs when we consider a situation in which the prior belief on either of the parameters a, b, or c, or some of the parameters a, b, and c depend on the two unknown truncation parameters d 1 and d 2 .
Instead of independent priors, dependent priors for parameters a, b, and c can be considered. In this scenario, we assume that the prior distributions of a, b, and c follow conditional gamma distributions. For example,
π 1 * ( a | b , c ) G a m m a ( ξ 1 ( b , c ) , λ 1 ( b , c ) ) , π 2 * ( b | a , c ) G a m m a ( ξ 2 ( a , c ) , λ 2 ( a , c ) ) , π 3 * ( c | a , b ) G a m m a ( ξ 3 ( a , b ) , λ 3 ( a , b ) ) .
On the other hand, the same prior distributions for d 1 and d 2 that were defined in Section 3.1.1 are used.
Next, the joint prior PDF of parameters a, b, and c can be expressed as (see [13] for details)
f a , b , c = exp { m 000 m 100 a + m 200 ln a m 010 b + m 110 a b m 210 b ln a + m 020 ln b m 120 a ln b + m 220 ln a ln b m 001 c + m 101 c a m 201 c ln a + m 011 c b m 111 a b c m 021 c ln b + m 121 c a ln b + m 211 c b ln a m 221 ln b + m 002 ln c m 102 a ln c + m 202 ln a ln c m 012 b ln c + m 112 a b ln c m 212 b ln a ln c m 022 ln b ln c m 122 a ln b ln c + m 222 ln a ln b ln c } I ( a > 0 ) I ( b > 0 ) I ( c > 0 ) .
The family of PDFs in Equation (11) involves 27 hyperparametrs. A simplified sub-model, which still retains considerable flexibility, can be obtained by setting m i j k = 0 , whenever i + j + k > 2 . Then, Equation (11) reduces to
f * ( a , b , c ) = exp m 000 m 100 a + m 200 ln a m 010 b + m 110 a b + m 020 ln b m 001 c   + m 101 c a + m 011 c b + m 002 ln c I ( a > 0 ) I ( b > 0 ) I ( c > 0 ) .
When the simplified 10-parameter model in Equation (12) is used, the joint posterior distribution from Equation (9) reduces to
h * ( a , b , c , d 1 , d 2 | x ) L ( a , b , c , d 1 , d 2 | x ) × f * ( a , b , c ) × π 4 ( d 1 ) × π 5 ( d 2 | d 1 ) .

3.2. Computation Algorithms

In this subsection, we present the detailed procedures for the MCMC algorithms in which the MH algorithms within Gibbs chains are utilized. Furthermore, different types of MH algorithms are adopted depending on the types of full conditional distributions. Since the procedures regarding the joint posterior distribution for dependent priors are similar, we present the posterior analysis based on only independent priors for brevity. From the joint posterior distribution in Equation (10), we can construct the full conditional distributions to proceed with utilizing the Gibbs sampling. The full conditional distributions are given as follows:
p ( a | b , c , d 1 , d 2 , x ) 1 [ M ( a , b , c , d 1 , d 2 ) ] n a α 1 1 exp i = 1 n x i + β 1 a I ( a > 0 ) ; p ( b | a , c , d 1 , d 2 , x ) 1 [ M ( a , b , c , d 1 , d 2 ) ] n i = 1 n [ exp { ( b + c x i ) d 1 } exp { ( b + c x i ) d 2 } b + c x i ] × b α 2 1 exp { β 2 b } I ( b > 0 ) ; p ( c | a , b , d 1 , d 2 , x ) 1 [ M ( a , b , c , d 1 , d 2 ) ] n i = 1 n [ exp { ( b + c x i ) d 1 } exp { ( b + c x i ) d 2 } b + c x i ] × c α 3 1 exp { β 3 c } I ( c > 0 ) ; p ( d 1 | a , b , c , d 2 , x ) 1 [ M ( a , b , c , d 1 , d 2 ) ] n i = 1 n [ exp { ( b + c x i ) d 1 } exp { ( b + c x i ) d 2 } b + c x i ] × 1 ( 1 + d 1 ) 2 I ( 0 < d 1 < d 2 ) ; p ( d 2 | a , b , c , d 1 , x ) 1 [ M ( a , b , c , d 1 , d 2 ) ] n i = 1 n [ exp { ( b + c x i ) d 1 } exp { ( b + c x i ) d 2 } b + c x i ] × 1 δ d 1 I ( d 1 < d 2 < δ ) .
Note that none of the full conditional distributions has a closed-form expression. Hence, we propose using MH algorithms within each Gibbs chain. For parameters a, b, and c, a truncated normal distribution is used as a proposal density with a left-truncation point of zero. In particular, an adaptive MH algorithm that updates the variance of proposal distributions throughout the entire sampling process is used. In general, the proposal density of parameter θ ( θ = a , b , c , d 1 , or d 2 ) is used as a (truncated) normal distribution, with the current value being its mean and the variance depending on forthcoming iterations. More specifically, suppose we want to update parameter θ in the ( h + 1 ) t h step based on an initial value of θ ( 0 ) and observed values of the first h iterations are denoted by θ ( 1 ) , θ ( 2 ) , , θ ( h ) . The candidate value θ * can be generated from a normal distribution with mean θ ( h ) and variance V ( h ) , where
V ( h ) = V ( 0 ) , i f h = 0 , γ [ Var θ ( 0 ) , , θ ( h 1 ) + ζ ] , i f h > 0 .
Here, V ( 0 ) is an initial (could be arbitrary) variance of the proposal distribution of parameter θ , γ is the adjusting coefficient, and ζ is a small constant to prevent the variance in Equation (15) from shrinking to zero. One of the important aspects of the adaptive MH algorithm is that it updates its variance so that the parameter being sampled reaches a desired acceptance rate. It is known that the optimal acceptance rate for one parameter is 0.44, so we set γ = 2.4 / k , where k is the dimension of the parameter space. This ensures that the desired acceptance rate stays around 0.44; see [14] for pertinent details.
On the other hand, an independent MH algorithm is used to generate random variates from the full conditional distribution of d 1 . The algorithm for generating a random variate from p ( d 1 | others ) is described below. When d 1 ( j ) is the current value of the Markov chain, updating d 1 ( j + 1 ) is required. The steps of the algorithm are described as follows.
A1.
Let q be the candidate density with a form:
q ( d 1 ) = 1 + d 2 d 2 1 ( 1 + d 1 ) 2 for 0 < d 1 < d 2 ,
where d 2 d 2 ( j ) is the value of the current state.
A2.
Generate d 1 * from the PDF in Equation (16). Following Devroye [15], and the inverse cumulative distribution function (CDF) technique, d 1 * can be generated from the following equation:
d 1 * = 1 + d 2 1 + d 2 ( 1 u ) 1 ,
where u is a random variate from U n i f ( 0 , 1 ) .
A3.
Next, the acceptance probability ω is calculated:
ω ( d 1 ( j ) , d 1 * ) = p ( d 1 * ) q ( d 1 ( j ) ) p ( d 1 ( j ) ) q ( d 1 * ) ,
where p ( · ) is the full conditional distribution of d 1 in Equation (14). Now, set d 1 ( j + 1 ) = d 1 * if u ω ( d 1 ( j ) , d 1 * ) , and d 1 ( j + 1 ) = d 1 ( j ) otherwise.
Finally, we employ the random-walk MH for updating d 2 . When d 2 ( j ) is given, we wish to update d 2 ( j + 1 ) . Generate d 2 * from Unif ( d 2 ( j ) ϵ , d 2 ( j ) + ϵ ) for ϵ > 0 . Next, we calculate the acceptance probability:
ω ( d 2 ( j ) , d 2 * ) = p ( d 2 * ) I ( d 1 < d 2 * < δ ) p ( d 2 ( j ) ) I ( d 1 < d 2 ( j ) < δ ) ,
where d 1 d 1 ( j ) is the value of the current state. If u ω ( d 2 ( j ) , d 2 * ) and I ( d 1 < d 2 * < δ ) , then set d 2 ( j + 1 ) = d 2 * . Otherwise, d 2 ( j + 1 ) = d 2 ( j ) , and u Unif ( 0 , 1 ) .
Remark 2.
The efficacy of the MCMC algorithm involving analytically intractable models, such as the case here with the constant M ( a , b , c , d 1 , d 2 ) , can be a concern. For instance, the constant M ( a , b , c , d 1 , d 2 ) is in an integral form, and the exact value may not be able to be obtained analytically. However, we can approximate M ( a , b , c , d 1 , d 2 ) accurately for given values of a, b, c, d 1 , and d 2 and perform the Bayesian computation with the MCMC strategy without any difficulty. Noticeably, the results based on the simulation study are supportive of this assertion. One may need to consider non-standard MCMC techniques for doubly intractable posteriors; see, for example, [16]. In the Section 4, we discuss the selection strategy to select among all possible candidate hidden truncation models using the Bayes factor, the most appropriate and plausible model.

4. Selection between Different Hidden Truncation Models

In this section, we consider the following four models for the hidden truncated bivariate exponential distribution:
  • M 1 : No hidden truncated model, i.e., d 1 = 0 and d 2 = .
  • M 2 : A truncated from below model, i.e., 0 < d 1 < d 2 = .
  • M 3 : A truncate from above model, i.e., 0 = d 1 < d 2 < .
  • M 4 : A truncate from both sides model, i.e., d 1 > 0 and d 2 < .
Let Θ j denote the parameter space for model M j ( j = 1 , 2 , 3 , 4 ). Hence, we have Θ 1 = ( a , b , c ) , Θ 2 = ( a , b , c , d 1 ) , Θ 3 = ( a , b , c , d 2 ) , and Θ 4 = ( a , b , c , d 1 , d 2 ) . Based on the observed values x = ( x 1 , x 2 , x n ) from each of models M 1 , M 2 , and M 3 , the likelihood functions under these three models are given respectively by
L 1 ( a , b , c | x ) = 1 [ I 1 ( a , b , c ) ] n i = 1 n exp { a x i } b + c x i , L 2 ( a , b , c , d 1 | x ) = 1 [ I 2 ( a , b , c , d 1 ) ] n i = 1 n exp { a x i } exp { ( b + c x i ) d 1 } b + c x i , L 3 ( a , b , c , d 2 | x ) = 1 [ I 3 ( a , b , c , d 2 ) ] n i = 1 n exp { a x i } [ 1 exp { ( b + c x i ) d 2 } ] b + c x i ,
where I 1 ( a , b , c ) is the same as 1 / K in Equation (4) of the current text, and
I 2 ( a , b , c , d 1 ) = d 1 exp { b y } a + c y d y and I 3 ( a , b , c , d 2 ) = 0 d 2 exp { b y } a + c y d y .
For model M 4 , the likelihood function L ( a , b , c , d 1 , d 2 | x ) is presented in Equation (9).
Remark 3.
Regarding prior specifications for models M 1 , M 2 , and M 3 , we use the same prior distributions utilized in the full model M 4 by deleting irrelevant parameters to estimate the corresponding parameters. Subsequently, we could obtain MCMC outputs in the three models that can also approximate marginal distributions.
To choose the most plausible model based on the observed data, we consider using the Bayesian approach based on the Bayes factor. Suppose that there are q different models, namely, M 1 , , M q , any of which could be meaningful, and they contend with each other for model selection. If model M i holds true, the data x = ( x 1 , , x n ) follow a parametric distribution with PDF f i ( x | θ i ) , where θ i is an unknown parameter vector for M i . Let Θ i be the parameter space for the parameter vector θ i , where Θ i may or may not be nested. Bayesian model selection proceeds by choosing a prior PDF π i ( θ i ) for θ i under M i , and a prior model probability p ( M i ) of M i being true for i = 1 , , q . The posterior model of the probability that M i is true can be obtained as (for pertinent details, see, [17])
P ( M i | x ) = [ j = 1 q p ( M j ) p ( M i ) B j i ( x ) ] 1 ,
where the Bayes factor B j i ( x ) of model M j to M i is defined by
B j i ( x ) = m j ( x ) m i ( x ) = Θ j f j ( x | θ j ) π j ( θ j ) d θ j Θ i f i ( x | θ i ) π i ( θ i ) d θ i ,
where m i ( x ) is called the marginal or predictive density of x under M i . Subsequently, the model with the largest posterior probability in Equation (19) becomes the most plausible model. It is customary to use vague model probabilities, i.e., p ( M i ) = 1 / q for i = 1 , , q . Then, Equation (19) becomes P ( M i | x ) = [ j = 1 q B j i ] 1 . Clearly,
B i i = 1 and B i j = B j i 1 , for i , j = 1 , , q .
Meanwhile, we use the method proposed by Newton and Raftery [18] to evaluate the marginals of the four models. The procedure is described as follows. Observe that the posterior distribution can be represented as p ( θ | x ) L j ( θ | x ) × π j ( θ ) under model M j for j = 1 , 2 , 3 , 4 . For notational simplicity, let θ = ( a , b , c , d 1 , d 2 ) under the full model M 4 . Now, let { θ ( g ) } { θ ( 1 ) , , θ ( G ) } be G, drawing from the posterior density obtained by conventional MCMC methods such as the Gibbs sampler. Newton and Raftery [18] suggested that the marginal can be estimated as
m ^ 4 ( x ) = { 1 G g = 1 G ( 1 L 4 ( x | θ ( g ) ) ) } 1 .
Note that the estimate in Equation (22) is the harmonic mean of the likelihood evaluated at MCMC samples.

5. Monte Carlo Simulation Studies

In this section, Monte Carlo simulation studies are used to evaluate the Bayesian parameter estimation and model selection procedures proposed in this paper for the hidden truncation models. First, a Monte Carlo simulation study is used to evaluate the performance of Bayes estimates for the model parameters of the probability distribution given in Equation (7). For illustrative purposes, we consider the true values of the model parameters to be ( a , b , c , d 1 , d 2 ) = ( 2 , 3 , 4 , 1 , 10 ) , and three different sample sizes n = 100, 200, and 300. For each simulated dataset, the posterior means, posterior medians, and the associated 95% highest posterior density (HPD) credible intervals of the model parameters are computed. After collecting the posterior estimates based on a total of 200 replications, the performance of the point estimates is evaluated based on the average posterior mean, coverage probability (CP), and average width (AW) of the 95%. The total and burn-in MCMC iterations of the Bayesian estimation procedure are set to be 20,000 and 5000, respectively. Regarding the hyperparameters of the prior distributions on a, b, and c, we set α 1 , β 1 = ( 4 , 2 ) , α 2 , β 2 = 9 , 3 , and α 3 , β 3 = 16 , 4 , in order to obtain the mean of each prior distribution to be the same as the true value and the variances as one for the three prior gamma distributions. For the hyperparameter(s) of the distribution on d 2 | d 1 , we set δ = 20 . Moreover, regarding the initial variances in Equation (15) of the zero-truncated normal distributions, we set a standard deviation of two for each of a, b, and c. Table 1 provides the average posterior means, the mean square errors (MSEs) of the posterior means, the average posterior variances, the CPs, and AWs for the model parameters based on 200 simulations.
From Table 1, we observe that all the CPs are 100%, which indicates that the standard errors of the estimates are relatively large. The posterior means of b, c, d 1 , and d 2 are close to the true values for decently large sample sizes. However, the posterior means for a are overestimated with large average widths for the HPD intervals. We also observe that there are no dramatic improvements in parameter estimation accuracy when the sample size increases. Since we are dealing with subjective prior choices, there are infinitely many combinations on the hyperparameters, and the performance of the Bayesian estimation procedure depends on the choice of the hyperparameters. For instance, based on our preliminary study (results are not shown here), the performance can be poor when prior distributions with large prior variances are used. Although the choices of hyperparameter values used in this simulation study may not be optimal, these choices of hyperparameter values provide satisfactory results among the choices we examined.
In the second Monte Carlo simulation study, the performance of the model selection procedure is evaluated when the four models M 1 M 4 are considered candidate models. In this simulation study, data are generated from the four models M 1 M 4 , and the model selection method based on Bayes factors presented in Section 4 is applied to each simulated dataset. Then, we obtain the proportions to determine which model has the largest posterior model probability. Recall that the posterior probabilities can be computed by Equation (19) in conjunction with Equation (22). For comparative purposes, we compute the average posterior probabilities in two ways by assuming the equal prior model probability of 1 / 4 for each model M 1 M 4 :
I.
The average of posterior probabilities, no matter which model is selected out of 500 replications (denoted as Post. Prob. I in Table 2);
II.
The average of posterior probabilities is based only on the samples possessing the highest posterior probability in each replication (denoted as Post. Prob. I in Table 2). For example, the “Post. Prob. II” when the true model is M 1 in Table 2, 0.265 for contending model M 1 is computed based on 125 samples that model M 1 is selected.
Based on 500 simulated samples from each model, the above two types of posterior probabilities, the frequency and proportion of selecting a particular model, and the average posterior means (medians) based only on selected models for each model parameter are presented in Table 2. The numerical summaries in Table 2 with the same prior specifications and iterations are in accordance with the adopted MCMC procedures described in Section 3.2.
From Table 2, we observe that the parameter estimates based on either the posterior mean or the posterior median of the four models yield decently reasonable results for all cases considered. To select the most plausible model, although the procedure based on the Bayes factor (posterior probability) does not capture the true model in all the cases, the procedure based on the Bayes factor tends to select model M 4 , which is the most general hidden truncation model among the four models with the largest number of parameters.
Since the number of parameters models M 1 M 4 are different, instead of evaluating the performance based on the parameter estimates, we consider the estimation of the conditional mean E X | w 1 < Y < w 2 , where w 1 and w 2 are pre-specified values based on the model and the values of d 1 and d 2 (e.g., for model M 1 , w 1 = 0 and w 2 = ), which is a function of the model parameters. Specifically, for given values of w 1 and w 2 , the conditional expectation is
τ ( a , b , c ) = 0 x f X | y ( x | w 1 < Y < w 2 ) d x = 0 x e a x [ exp { ( b + c x ) w 1 } exp { ( b + c x ) w 2 } ] ( b + c x ) M * ( a , b , c , w 1 , w 2 ) ,
where
M * ( a , b , c , w 1 , w 2 ) = w 1 w 2 e b y a + c y d y .
Let R be the total number of simulations (i.e., R = 500 in this study) and r i be the number of samples that model M i is selected with the largest posterior model probability ( i = 1 , 2 , 3 , 4 ), where R = r 1 + r 2 + r 3 + r 4 . Suppose τ ^ i j ( a , b , c ) is the estimate of τ ( a , b , c ) based on the j-th sample, where model M i is selected as the most plausible model for i = 1 , 2 , 3 , 4 , j = 1 , 2 , , r i . The MSEs based on the simulated data corresponding to selected model M i , denoted as M S E i , can be calculated as
M S E i = 1 r i j = 1 r i [ τ ^ i j ( a , b , c ) τ ( a , b , c ) ] 2 .
To illustrate the effect on the performance of estimating the conditional expectation with the model selection procedure, the MSE with the model selection procedure is calculated as
M S E M S = 1 R i = 1 4 j = 1 r 1 [ τ ^ i j ( a , b , c ) τ ( a , b , c ) ] 2 .
The simulated values of M S E i ( i = 1 , 2 , 3 , 4 ) and M S E M S for estimating the conditional expectation associated with the model selection perspectives are presented in Table 3. Note that the proportions of selecting the correct models with the highest posterior probability are all the same as in Table 2. As expected, the value of M S E i is the smallest if M i is the true model in most cases. The simulation results in Table 3 show that the model selection procedure can reduce the risk of mis-specifying the underlying model. On the other hand, we observe that the values of M S E 1 and M S E 3 are similar, while the values of M S E 2 and M S E 4 are similar. Note that models M 1 and M 3 assume d 1 = 0 , while M 2 and M 4 assume d 1 > 0 with an unknown value. This observation suggests that determining whether d = 0 may provide further information to improve the estimation of the conditional expectation.

6. Illustrative Example

In this section, the proposed methodologies in this paper are illustrated using a real dataset comprised of independently collected quadruplets of the redshift and the apparent magnitude of a quasar object previously analyzed by Efron and Petrosian [3]. The dataset is available in the R package DTDA version 3.0.1 [19] named Quasars. The original data consist of n = 210 observations with the following variables ( z i , m i , d 1 i , d 2 i ) for i = 1 , , n , where z i denotes the redshift of the i-th quasar, m i denotes its apparent magnitude, and the two values d 1 i and d 2 i indicate lower and upper truncation bounds corresponding to the apparent magnitude, respectively. The variable of interest is the transformed logarithm of luminosity values provided in the first column of the dataset Quasars, i.e., V i = t ( z i , m i ) , where t is a transformation that depends on the cosmological model assumed (see [20,21] for a detailed description). The anti-logarithm transformation X i = exp ( V i ) is considered such that the support of the random variable X i is [ 0 , ) . We conjecture that the dataset has been subjected to a two-sided truncation and the PDF in Equation (7) would be appropriate to model this dataset, where the observation X i is subjected to a two-sided truncation by other covariates, and the same truncation points apply for X i , i = 1 , 2 , , n , i.e., d 1 i d 1 and d 2 i d 2 .
Based on the model in Equation (7), and using the Bayesian methodologies proposed in this paper, the results presented in Table 4 are obtained. Regarding the hyperparameters of the prior distributions on a, b, and c, we set α j , β j = ( 4 , 1 ) for j = 1 , 2 , 3 because we have no information on these parameters. The same value of δ = 20 as in the Monte Carlo simulation studies is used. In Table 4, we report the posterior probabilities for each of the four models M 1 M 4 , and the posterior means and medians (presented in the parenthesis) along with the corresponding 95% HPD intervals. From a model selection perspective, model M 2 is selected with the largest posterior probability of 0.305 , assuming equal prior model probability for all four models. We notice that M 4 has the second largest posterior probability of 0.283 , which agrees with our observation of the similarity of models M 2 and M 4 in the simulation results. Regarding the posterior inference, we observe that the estimates of a, b, and c have some patterns. The estimates under models M 1 and M 3 are close to each other, while those values are close to each other when models M 2 and M 4 are applied. Thus, the HPD intervals are changed accordingly. Figure 2 shows the trace plots for all parameters when the four models are adapted, respectively. Overall, the plots for a, b, and c are decently well behaved, while there are some fluctuations in the plots of d 1 and d 2 .
We want to highlight the fact that the parameter estimates/values of parameter d 1 and d 2 (i.e., the last two columns) in Table 4 and for the Model M 2 having the maximum posterior probability confirm the fact that the data (alias the main study variable, X i ) have been subject to a one-sided upper truncation, as the value of d 1 0 . In this data application, we have explored all possible scenarios of hidden truncation (including zero truncation) and then adopted strategy searches for the best-case scenario under the Bayesian paradigm.

7. Concluding Remarks

In this article, we explore the features of a two-sided hidden truncated model in terms of estimation under the Bayesian paradigm and, most importantly, detecting with a desired level of accuracy whether or not the data have been subject to hidden truncation starting with a simple model in two dimensions, namely, the bivariate exponential distribution proposed by Arnold and Strauss [12]. Indeed, there are several other versions of the classical bivariate exponential distribution, but they may have a singular part (i.e., if two random variables Y 1 and Y 2 follow a bivariate distribution, there is a positive probability that Y 1 = Y 2 ) [22] (see, for example, [23]), which makes it more difficult to consider from real-world and mathematical perspectives. For this reason, we resort to a simple model that is an absolutely continuous statistical distribution, develop the corresponding hidden-truncation model, and provide some feasible model-fitting methodologies that can be used in practice. Consequently, inferential aspects under the Bayesian paradigm for the parameters of a doubly (two-sided) hidden truncation model are still in their infancy stage and have not been explored in detail to the best of our knowledge. Statistical inference of the hidden truncation models with unknown truncation point(s) is a challenging problem, especially when there is very little information about the truncation point(s) in the observed sample. One can expect to mimic the inferential results obtained in this paper to other types of hidden truncated bivariate exponential models, albeit with computational complexity and possibly with a different set of informative and non-informative priors that might be challenging. Needless to say, the associated computational complexity and judicious selection of prior choices, especially for the truncation points, are the natural hindrance to developing the methodology and theory. Inference under the Bayesian framework, as discussed in the simulation study as well as in the real-data application, is encouraging in the sense that the Bayesian estimates of the parameters are reasonably good under both settings. Efficient estimation for multiple constraints models, i.e., the estimation of model parameters for a hidden truncated model under a multi-component set-up, where, for example, the main study variable Y is observed only when the associated concomitant variables, say, X j are truncated from below, such as X j < c j (or j < X j < d j ), will be of great interest in the context of several real-life applications. However, the real-life applicability of such models and their analytical traceability are the two key factors that should be addressed first. Research in this direction is in progress, and we hope to report the results in a future paper.

Author Contributions

Conceptualization, Methodology, and Writing—Original Draft: I.G. and H.K.T.N.; Investigation, Methodology, and Writing—review and editing: K.K. and S.W.K. All authors have read and agreed to the published version of the manuscript.

Funding

S. Kim’s research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2021R1A2C1005271).

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Arnold, B.C.; Beaver, R.J. Skewed multivariate models related to hidden truncation and/or selective reporting (with discussion). Test 2002, 11, 7–54. [Google Scholar] [CrossRef]
  2. Arnold, B.C.; Gomez, H. Hidden truncation and additive components: Two alternative skewing paradigms. Calcutta Stat. Assoc. Bull. 2008, 60, 239–240. [Google Scholar] [CrossRef]
  3. Efron, B.; Petrosian, V. Nonparametric methods for doubly truncated data. J. Am. Stat. Assoc. 1999, 94, 824–834. [Google Scholar] [CrossRef]
  4. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
  5. Arnold, B.C. Flexible univariate and multivariate models based on hidden truncation. J. Stat. Plan. Inference 2009, 139, 3741–3749. [Google Scholar] [CrossRef]
  6. Ghosh, I.; Nadarajah, S. Inference for a hidden truncated (both-sided) bivariate Pareto (II) distribution. Commun. Stat.—Theory Methods 2015, 44, 2136–2150. [Google Scholar] [CrossRef]
  7. Ghosh, I.; Ng, H.K.T. Hidden Truncation in Non-Normal Models: A Brief Survey. In Advances in Statistics-Theory and Applications: Honoring the Contributions of Barry C. Arnold in Statistical Science; Ghosh, I., Balakrishnan, N., Ng, H.K.T., Eds.; Springer: Cham, Switzerland, 2021; pp. 133–160. [Google Scholar]
  8. Zaninetti, L. The initial mass function modeled by a left truncated beta distribution. Astrophys. J. 2013, 765, 1–7. [Google Scholar] [CrossRef]
  9. Kotz, S.; Balakrishnan, N.; Johnson, N.L. Continuous Multivariate Distributions, 2nd ed.; John Wiley & Sons: New York, NY, USA, 2000; Volume 1. [Google Scholar]
  10. Ghosh, I. Bayesian inference for hidden truncation Pareto (II) models. J. Appl. Stat. Sci. 2014, 20, 257–281. [Google Scholar] [CrossRef]
  11. Ghosh, I. Bayesian inference for hidden truncation Pareto (IV) models. J. Stat. Comput. Simul. 2020, 90, 2136–2155. [Google Scholar] [CrossRef]
  12. Arnold, B.C.; Strauss, D. Bivariate distributions with exponential conditionals. J. Am. Stat. Assoc. 1988, 83, 522–527. [Google Scholar] [CrossRef]
  13. Arnold, B.C.; Castillo, E.; Sarabia, J.M. Conditional Specification of Statistical Models; Springer: New York, NY, USA, 1999. [Google Scholar]
  14. Gelman, A.; Hwang, J.; Vehtari, A. Understanding predictive information criteria for Bayesian models. Stat. Comput. 2014, 24, 997–1016. [Google Scholar] [CrossRef]
  15. Devroye, L. Non-Uniform Random Variate Generation; Springer: New York, NY, USA, 1986. [Google Scholar]
  16. Murray, I.; Ghahramani, Z.; MacKay, D. MCMC for doubly-intractable distributions. In Proceedings of the 22nd Annual Conference on Uncertainty in Artificial Intelligence (UAI-06), Cambridge, MA, USA, 13–16 July 2006; AUAI Press: Arlington, VA, USA, 2013; pp. 359–366. [Google Scholar]
  17. Kass, R.E.; Raftery, A.E. Bayes factors. J. Am. Stat. Assoc. 1995, 90, 773–795. [Google Scholar] [CrossRef]
  18. Newton, M.A.; Raftery, A.E. Approximate Bayesian inference with the weighted likelihood bootstrap. J. R. Stat. Soc. Ser. B (Methodol.) 1994, 56, 3–48. [Google Scholar] [CrossRef]
  19. Moreira, C.; de Uña-Álvarez, J.; Crujeiras, R. DTDA: Doubly Truncated Data Analysis, R package version 3.0.1.; The Comprehensive R Archive Network: Vienna, Austria, 2022.
  20. Weinberg, S. Gravitation and Cosmology: Principles and Applications of the General Theory of Relativity; John Wiley & Sons: New York, NY, USA, 1972. [Google Scholar]
  21. Ying, Z.; Yu, W.; Zhao, Z.; Zheng, M. Regression analysis of doubly truncated data. Astrophys. J. 2020, 115, 810–821. [Google Scholar] [CrossRef] [PubMed]
  22. Kundu, D.; Gupta, R.D. Bivariate generalized exponential distribution. J. Multivar. Anal. 2009, 100, 581–593. [Google Scholar] [CrossRef]
  23. Marshall, A.W.; Olkin, I. A Multivariate Exponential Distribution. J. Am. Stat. Assoc. 1967, 62, 30–44. [Google Scholar] [CrossRef]
Figure 1. Hidden truncated PDFs of X given d 1 < Y d 2 for different values of d 1 and d 2 based on the ASBE distribution along with the non-truncated exponential distribution.
Figure 1. Hidden truncated PDFs of X given d 1 < Y d 2 for different values of d 1 and d 2 based on the ASBE distribution along with the non-truncated exponential distribution.
Axioms 13 00140 g001
Figure 2. Traceplots of the estimates under different models.
Figure 2. Traceplots of the estimates under different models.
Axioms 13 00140 g002
Table 1. Simulation results when the data are generated from M 4 with parameter values ( a , b , c , d 1 , d 2 ) = ( 2 , 3 , 4 , 1 , 10 ) .
Table 1. Simulation results when the data are generated from M 4 with parameter values ( a , b , c , d 1 , d 2 ) = ( 2 , 3 , 4 , 1 , 10 ) .
Sample SizePosterior EstimateParameters
a b c d 1 d 2
100Average posterior mean 2.443 2.952 4.102 0.90310.504
MSE of posterior mean 0.203 0.011 0.017 0.054 0.279
Average posterior variance 1.411 0.974 0.938 0.192 30.125
Coverage probability (CP)100%100%100%100%100%
Average width (AW) 4.348 3.715 3.695 1.621 17.968
200Average posterior mean 2.458 2.951 4.092 0.898 10.495
MSE of posterior mean 0.217 0.021 0.023 0.031 0.264
Average posterior variance 1.442 0.962 0.921 0.175 30.159
Coverage probability100%100%100%100%100%
Average width 4.411 3.682 3.665 1.585 19.955
300Average posterior mean 2.433 2.960 4.098 0.901 10.495
MSE of posterior mean 0.210 0.028 0.031 0.027 0.269
Average posterior variance 1.407 0.957 0.901 0.164 30.125
Coverage probability (CP)100%100%100%100%100%
Average width (AW) 4.371 3.674 3.610 1.552 17.944
Table 2. Posterior probabilities and Bayes estimates for different models.
Table 2. Posterior probabilities and Bayes estimates for different models.
True ModelPosterior Probability & Parameter EstimateContending Models
M 1 M 2 M 3 M 4
M 1 Post. Prob. I0.2510.2490.2510.249
Freq. (Prop.) of selection125 (0.250)142 (0.284)96 (0.192)137 (0.274)
Post. Prob. II0.2650.2610.2650.261
Post. mean (median) for a1.912 (1.910)1.339 (1.350)1.971 (1.964)1.372 (1.379)
Post. mean (median) for b2.777 (2.660)3.320 (3.210)2.771 (2.657)3.344 (3.227)
Post. mean (median) for c4.158 (4.088)3.471 (3.392)4.179 (4.101)3.480 (3.408)
Post. mean (median) for d 1 d 1 = 0 0.250 (0.224) d 1 = 0 0.251 (0.220)
Post. mean (median) for d 2 d 2 = d 2 = 10.000 (10.000)10.000 (10.000)
M 2 Post. Prob. I0.2390.2610.2390.261
Freq. (Prop.) of selection46 (0.092)186 (0.372)44 (0.088)224 (0.448)
Post. Prob. II0.2710.2700.2780.270
Post. mean (median) for a5.301 (5.315)2.033 (1.889)5.299 (5.308)2.043 (1.899)
Post. mean (median) for b2.377 (2.250)3.060 (2.951)2.338 (2.207)3.062 (2.939)
Post. mean (median) for c4.460 (4.376)3.789 (3.691)4.483 (4.402)3.786 (3.716)
Post. mean (median) for d 1 d 1 = 0 1.127 (1.113) d 1 = 0 1.131 (1.103)
Post. mean (median) for d 2 d 2 = d 2 = 10.000 (9.999)10.000 (10.000)
M 3 Post. Prob. I0.2510.2500.2500.249
Freq. (Prop.) of selection105 (0.210)140 (0.280)109 (0.218)146 (0.292)
Post. Prob. II0.2640.2600.2650.261
Post. mean (median) for a1.941 (1.934)1.358 (1.368)1.934 (1.922)1.360 (1.368)
Post. mean (median) for b2.759 (2.645)3.321 (3.206)2.763 (2.643)3.339 (3.227)
Post. mean (median) for c4.161 (4.087)3.488 (3.416)4.186 (4.109)3.482 (3.406)
Post. mean (median) for d 1 d 1 = 0 0.250 (0.222) d 1 = 0 0.247 (0.217)
Post. mean (median) for d 2 d 2 = d 2 = 10.000 (9.999)10.000 (10.000)
M 4 Post. Prob. I0.2390.2610.2390.261
Freq. (Prop.) of selection45 (0.090)205 (0.410)42 (0.084)208 (0.416)
Post. Prob. II0.2740.2700.2760.270
Post. mean (median) for a5.224 (5.230)2.033 (1.889)5.287 (5.229)2.042 (1.897)
Post. mean (median) for b2.348 (2.220)3.055 (2.945)2.342 (2.207)3.059 (2.949)
Post. mean (median) for c4.482 (4.393)3.787 (3.692)4.481 (4.399)3.790 (3.695)
Post. mean (median) for d 1 d 1 = 0 1.139 (1.127) d 1 = 0 1.122 (1.101)
Post. mean (median) for d 2 d 2 = d 2 = 10.000 (10.001)10.000 (10.000)
Table 3. Mean square errors for estimating the conditional expectation.
Table 3. Mean square errors for estimating the conditional expectation.
Contending Models
True Model M 1 M 2 M 3 M 4 Model Selection MSE MS
MSE 1 MSE 2 MSE 3 MSE 4
M 1 0.000690.000850.000680.000860.00081
M 2 0.000160.000100.000160.000110.00012
M 3 0.000690.000850.000680.000850.00081
M 4 0.000160.000110.000150.000100.00012
Table 4. Posterior probabilities, posterior means (medians), and 95% HPD intervals for quasar data.
Table 4. Posterior probabilities, posterior means (medians), and 95% HPD intervals for quasar data.
ModelPosterior ProbabilityParameters
a b c d 1 d 2
M 1 0.203 0.535 (0.536)7.044 (6.741)0.994 (0.893) d 1 = 0 d 2 =
( 0.433 ,   0.644 ) ( 2.645 ,   11.889 ) ( 0.166 ,   2.067 ) d 1 = 0 d 2 =
M 2 0.3050.435 (0.450)6.261 (6.000)0.666 (0.588)0.238 (0.170) d 2 =
( 0.195 ,   0.614 ) ( 1.969 ,   10.649 ) ( 0.060 ,   1.388 ) ( 0 ,   0.686 ) d 2 =
M 3 0.209 0.537 (0.536)7.073 (6.741)0.992 (0.889) d 1 = 0 9.499   ( 9.499 )
( 0.436 ,   0.646 ) ( 2.588 ,   11.989 ) ( 0.158 ,   2.100 ) d 1 = 0 ( 9.020 ,   9.971 )
M 4 0.283 0.439 (0.452)6.500 (6.232)0.722 (0.634)0.210 (0.148)9.500 (9.501)
( 0.226 ,   0.628 ) ( 2.392 ,   11.198 ) ( 0.117 ,   1.527 ) ( 0 ,   0.661 ) ( 9.006 ,   9.956 )
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ghosh, I.; Ng, H.K.T.; Kim, K.; Kim, S.W. Bayesian Inference for a Hidden Truncated Bivariate Exponential Distribution with Applications. Axioms 2024, 13, 140. https://doi.org/10.3390/axioms13030140

AMA Style

Ghosh I, Ng HKT, Kim K, Kim SW. Bayesian Inference for a Hidden Truncated Bivariate Exponential Distribution with Applications. Axioms. 2024; 13(3):140. https://doi.org/10.3390/axioms13030140

Chicago/Turabian Style

Ghosh, Indranil, Hon Keung Tony Ng, Kipum Kim, and Seong W. Kim. 2024. "Bayesian Inference for a Hidden Truncated Bivariate Exponential Distribution with Applications" Axioms 13, no. 3: 140. https://doi.org/10.3390/axioms13030140

APA Style

Ghosh, I., Ng, H. K. T., Kim, K., & Kim, S. W. (2024). Bayesian Inference for a Hidden Truncated Bivariate Exponential Distribution with Applications. Axioms, 13(3), 140. https://doi.org/10.3390/axioms13030140

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop