Abstract
In the development of simplex mixed-effects models, random effects in these mixed-effects models are generally distributed in normal distribution. The normality assumption may be violated in an analysis of skewed and multimodal longitudinal data. In this paper, we adopt the centered Dirichlet process mixture model (CDPMM) to specify the random effects in the simplex mixed-effects models. Combining the block Gibbs sampler and the Metropolis–Hastings algorithm, we extend a Bayesian Lasso (BLasso) to simultaneously estimate unknown parameters of interest and select important covariates with nonzero effects in semiparametric simplex mixed-effects models. Several simulation studies and a real example are employed to illustrate the proposed methodologies.
1. Introduction
Various mixed-effects models based on simplex distribution have increasingly become popular tools in the analysis of longitudinal continuous proportional data over time in many biological, medical and clinical studies. Under the framework of generalized linear mixed models, see Qiu et al. [1] for information on developing a simplex generalized linear mixed model on the basis of the penalized quasi-likelihood (PQL) and restricted maximum likelihood (REML) inference; see Zhang and Wei [2] for information on using the maximum likelihood estimation combining the stochastic approximation (SA) algorithm and the MCMC method to infer on simplex distribution nonlinear mixed models; see Zhao et al. [3] for information on implementing the MCMC algorithm to obtain the joint Bayesian estimate of simplex distribution nonlinear mixed models from the Bayesian perspective; see Bonat et al. [4] for information on investigating the likelihood analysis for a class of simplex mixed models with logit, probit, complement log–log and Cauchy link functions; see Quintero [5] for information on presenting the sensitivity analysis for variance parameters of random effects in Bayesian simplex mixed models. The random effects in the abovementioned mixed-effects models are assumed to have a multivariate normal distribution. However, in some practical applications, it is questionable for the normal assumption for random effects to analyze the skewed, bimodal and heavy-tailed longitudinal data. Therefore, it is essential to incorporate a semiparametric hierarchical structure via a Dirichlet process prior distribution for the random effects into the simplex mixed-effects models to accommodate longitudinal proportional data.
The nonparametric Bayesian approach based on Dirichlet process (DP) prior for random effects in mixed-effects models has been receiving a lot of attention in recent years. For example, Kleinman and Ibrahim [6] used a Dirichlet process prior for the general distribution of the random effects in generalized linear mixed model. As a variant of Dirichlet process prior, the truncation approximation Dirichlet process with stick-breaking priors is widely incorporated into various mixed-effects models to specify the general distribution of random effects. For example, Tang and Duan [7] used this approach for a semiparametric Bayesian approach to generalized partial linear mixed model; Tang and zhao [8] used this approach for nonlinear reproductive dispersion mixed models; Zhao et al. [9] used this approach for a semiparametric Bayesian approach to binomial distribution logistic mixed-effects model. In particular, Duan et al. [10] used a truncated and centered Dirichlet process prior to specify random effects in semiparametric reproductive dispersion mixed model. However, the abovementioned DP with stick-breaking prior for random effects is inappropriate when the underlying density of random effects is continuous. In addition, this type of variant for Dirichlet process prior is rather time-consuming in the calculation process for complicated models. Therefore, to address the above issues, the goal of this paper is to propose a new semiparametric simplex mixed-effects models with the random effects distribution specified by the centered Dirichlet process mixture model (CDPMM).
Although various methodologies have been developed to make statistical inference on the aforementioned simplex mixed-effects models, little work has been performed for the variable selection of simplex mixed-effects models. Classical model-selection methods, such as the step-wise selection method [11], the model comparison via Bayes factor [12], the Akaike information criterion [13] and Deviance information criterion [14], are often used to identify the important covariates in regression analysis; however, these approaches are generally computationally intensive and unstable for complicated mixed models with many covariates. On the other hand, the regularization (penalization) method has increasingly become a popular tool for conducting variable selection in regression analysis. Commonly used regularization methods in the context of linear regression include least absolute shrinkage and selection operator (Lasso) [15], elastic net [16] and adaptive lasso [17]. In addition, Park and Casella [18] proposed the Bayesian version of the Lasso (BLasso) by assigning the conditional Laplace prior of regression coefficients and the gamma distribution of shrinkage parameter under the Bayesian framework. The BLasso procedure has been extended to various complex models including semiparametric structural equation models [19] and semiparametric joint models of multivariate longitudinal and survival data [20]. In particular, Erd et al. [21] pointed out that Bayesian penalization methods perform similarly or sometimes even better than frequentist penalization methods, since Bayesian penalization methods can easily provide credible intervals (CIs) for parameters of interest and obtain the estimate of the penalty parameter by assigning an appropriate prior distribution. Therefore, the other main purpose of this paper is to extend the BLasso procedure to the considered semiparametric simplex mixed-effects models.
The paper is organized as follows: In Section 2, we propose a new semiparametric simplex mixed-effects models with random effects following the centered Dirichlet process mixture model (CDPMM) and incorporate a BLasso procedure into the proposed simplex mixed-effects models. The required conditional distributions are derived in Section 3. Two simulation studies and a real example are used to illustrate the proposed methodologies in Section 4. Some concluding remarks are given in Section 5.
2. Model and Notation
The simplex distribution was firstly proposed by Barndorff-Nielsen and Jørgensen [22], whose probability density function is specified as
where denotes the mean parameter; represents the dispersion parameter; and . For simplicity of notation, we denote if a random variable, y, is distributed as a simplex distribution with mean parameter, , and dispersion parameter, , in the rest of this paper.
In the context of longitudinal data analysis, let denote the longitudinal percentage outcome for the ith individual at the jth follow-up time , and , . We assume that, given a random effects corresponding to the ith individual, the responses are conditionally independent and each is distributed as a simplex distribution with conditional means, , and constant dispersion parameter, : that is, . Under the framework of GLMM, the conditional mean is linked to explanatory variables and random effects as follows:
where an unknown and monotone link function is chosen as the logit link; is a vector of covariates which consist of the constant 1 and time-dependent covariates observed at time point ; is a vector of unknown regression parameters; is a vector of time-dependent variables which may include some elements of corresponding to random effects . In classical random-effects models, the random effects in (2) are generally assumed to be a multivariate normal distribution, which may give rise to biased estimates of parameters or even misleading conclusions. Thus, inspired by Ohlssen and Spiegelhalter [23], we used the DP mixture of normals to specify the random effects: that is, with , where is an unknown random probability. Clearly, it is rather difficult and inefficient to make Bayesian estimates for regression parameter and dispersion parameter in Equation (2) since an unknown form of is involved. To address the difficulty, the Dirichlet process (DP) prior is usually introduced to approximate , i.e., , in which is a given base distribution such as multivariate normal distribution that serves as a starting point for constructing the nonparametric distribution, and is a weight that indicates the researcher’s certainty of as the distribution of . In particular, Sethuraman [24] showed that the DP prior has the stick-breaking prior representation; however, this approach causes a nonzero mean of random effects [25] and a discrete probability distribution of random effects [23]. Generally, the variants of Dirichlet Process proposed by Ishwaran and Zarepour [26] and Yang et al. [25] were regarded as discrete Dirichlet processes (discrete DPs). A discrete DP with stick-breaking prior for random effects is inappropriate when the underlying density of random effects is continuous. Furthermore, violation of zero mean assumption on the random effects may lead to non-identifiability in the aforementioned random effects model. In addition, the discrete DP methods with stick-breaking prior for random effects are generally computationally intensive for the complicated models.
To overcome the above issues, inspired by Ohlssen and Spiegelhalter [23] and Yang et al. [25], we incorporated the following variant of Dirichlet process into the above model in (2) to specify random effects. That is,
where is a random probability weight satisfying and . In addition, is assumed to be be independent of . This variant of Dirichlet process is referred to as the centered Dirichlet process mixture model (CDPMM). As in Ishwaran and Zarepour [26], we adopt the following mixture model of the truncated approximation DP for :
where G is a limited integer satisfying . As for the selection of G, Ishwaran and Zarepour [26] pointed out that a moderate value of G such as 25 may be enough to capture a good approximation in practical application. Thus, the value of G is chosen to be 25 in the rest of this paper. Furthermore, the random probability weight, , is specified by the following stick-breaking procedure:
where for , and so that . The prior distribution for the unknown parameter is chosen as , such that the posterior distribution for is conjugated. Here, we set the hyperparameters and to be 25 and 5, respectively, such that large value of is generated, which results in more unique values.
It is rather difficult and inefficient to generate observations from posterior distributions of with the above DP prior via MCMC algorithm. Furthermore, a latent variable is introduced to solve sample issue since this latent variable can record each ’s cluster membership and convey its parametric value to the distribution of . Let , , and , in which for . As in Ishwaran and Zarepour [26], the hierarchical structure defined in (4) can be written as
where denotes a discrete probability measure concentrated at g, is defined in Equation (5), the prior for associated with is defined by
and the prior for related to is defined by
where , denotes the Gamma distribution with parameters and , and and are pre-specified hyperparameters: that is, , , , , , and . Thus, given the values of , and , the prior for random effect is assumed to be with .
To estimate the unknown parameters and in Equation (2) from the Bayesian perspective, it is necessary to specify priors for and . In order to alleviate the computational burden, the conjugate prior distribution for dispersion parameter is taken to be
where the values of hyperparameters and are taken to be 1 and , respectively. In this paper, the main goal is to incorporate the Bayesian version of lasso into our proposed model (2) to conduct parameter estimation and model selection simultaneously. Similar to Park and Casella [18] and Tang et al. [20], the following Laplace prior on is given by
where is the regularization parameter. Because the mass of the above presented Laplace prior is quite highly concentrated around zero with a distinct peak at zero, posterior means or modes of ’s are shrunk towards zero, which is the key principle in using BLasso method to select the important covariates. Following Robert [15], the Laplace distribution with the form can be represented as a scale mixture of normal distributions with independent exponentially distributed variance: that is,
Therefore, the aforementioned prior for can be reformulated as the following hierarchical structure:
where the hyperparameters and are selected as 1 and , respectively, which imply diffuse prior. Similar to Park and Casella [18], the posterior distribution for and in the hierarchical structure (10) have closed expressions, such that this hierarchical representation greatly simplifies the computation. Therefore, it follows from Equation (10) that the posterior distribution of is distributed as the following Gamma distribution
In addition, the posterior distributions for are derived as
where denotes the inverse Gaussian distribution with parameter a and the shape parameter b. As for sampling from the inverse Gaussian distribution, Tang et al. [20] gave a detailed procedure.
3. Bayesian Analysis of Model
Let , , , and random effects . To obtain joint Bayesian estimates of unknown parameters and and the random effects, as well as to select important covariates in our considered models, a hybrid algorithm combining the block Gibbs sampler and the Metropolis–Hastings algorithm is employed to draw a sequence of random observations from the joint posterior distribution , as follows. In this hybrid algorithm, observations are iteratively drawn from the following conditional distributions: , and .
Block Gibbs Sampler (A): Conditional distribution related to
It follows from Equations (2) and (10) that the conditional distribution is proportional to
which is an unfamiliar distribution. Therefore, we used the well-known Metropolis–Hastings (MH) algorithm to generate observations from the aforementioned conditional distribution as follows. Given the current value , new candidate is generated from the proposal distribution and is accepted with probability
where
with and , and the variance coefficient can be chosen, such that the average acceptance rates are approximately 0.25 or more.
Block Gibbs Sampler (B): Conditional distribution related to
The conditional distribution can be derived as
which can be simplified as
Clearly, it is straightforward and efficient to draw observations for from the Gamma distribution via any statistical software.
Block Gibbs Sampler (C): Conditional distribution related to
Let denote all unknown parameters associated with distribution of random effects , . can be iteratively sampled by using the following nine steps:
Step (a). Conditional distribution of given is given
where and .
Step (b). For , the diagonal elements of is conditionally distributed as
where is the jth element of and is the jth element of .
Step (c). For , is conditionally distributed as
where is the jth diagonal element of .
Step (d). Following Ishwaran and Zarepour [26], the conditional distribution of can be expressed as
where is a random weight sampled from the beta distribution and it is sampled with step (e).
Step (e). It is easily obtained that the conditional distribution of is distributed as the following generalized Dirichlet distribution:
where for , and is the number of (and thus individuals) whose values equal to g. Simulating observation from the conditional distribution can be conducted as follows. First, is independently generated from a Beta distribution . Then, are obtained from the following formulae:
Step (f). Conditional distribution of .
Let be the d unique values of (i.e., unique number of “clusters”), for ; is conditionally distributed as follows:
where and for . Given , , and .
Step (g). Conditional distribution of .
Similar to the notation of step (f), given g, for , the jth diagonal element of is conditionally distributed as
where is the jth element of and is the jth element of . Given , and .
Step (h). The conditional distribution of is given by
where is proportional to with , and are sampled from step (e). Given , and , the prior of is distributed as , with and being the elements of sets and , respectively.
Step (i). The conditional distribution for
The conditional distribution is non-standard and cannot be derived directly via Gibbs sampling for . Specifically,
where , with specified by Equation (1) and by Equation (2). The Metropolis–Hastings algorithm used to sample observation is implemented as follows. At the ℓth iteration with a current value , a new candidate is drawn from the normal distribution , where and . The new is accepted with probability
The variance, , can be chosen such that the average acceptance rate is approximately 0.25 or more.
Then, we can obtain a series of sample observations——via the above iterative process. Then, Bayesian estimates of and for given i can be obtained by sample mean as follows:
Similarly, the consistent estimates of the posterior covariance matrices of and can be obtained via the sample covariance matrices.
4. Numerical Examples
To investigate the behavior of our proposed model and the BLasso method under the Bayesian framework, we conduct four simulation studies and a real example related to a prospective ophthalmology study.
4.1. Simulation Study
In the first simulation study, we assume that, given the random effects , the longitudinal percentage responses, , are conditionally independent and each (, ) follows the simplex distribution—that is, . The conditional mean parameter is specified as follows:
where randomly takes 1 or -1 with equal probability—, for . Moreover, the true values of the parameters are specified as follows: . This implies that a covariate corresponding to 0 is unimportant, and that . The true distribution of random effect, , is assumed to be
where the random effects cover the symmetric and skewed features with mean 0. A total of 500 Monte Carlo replications were conducted on the basis of the above-simulated setup.
In the second simulation study, 500 simulated datasets were generated by using the same setup as specified in the first simulation study except for the distribution of random effects. That is, random effects are distributed as
where random effects have bimodal features with 0.
Fore each dataset generated from the abovementioned two simulation studies, the hybrid algorithm combining the block Gibbs sampler and the Metropolis–Hastings algorithm in conjunction with the BLasso method and the stick-break prior of CDPMM was used to produce Bayesian estimates of parameters and random effects as well as simultaneously select the important covariates. To investigate the convergence for these Bayesian algorithms, we computed the estimated potential scale reduction (EPSR, proposed by Gelman et al. [27]) of parameters via three parallel sequences of observations based on three different initial values. It can be seen from Figure 1 that the EPSR values were less than 1.2 after about 7000 iterations in both simulations for all the test runs. Therefore, observations collected after 7000 iterations were used to compute the simulation results for all replications. Results obtained under simulations 1 and 2 are reported in Table 1, where ‘Bias’ is the difference between the true value and the mean of the estimates based on 500 replications; ‘RMS’ is the root mean square of differences between the true values and their corresponding estimates based on 500 replications. Compared with the Lasso from the frequentist view, the BLasso would not shrink the non-significant elements of exactly toward 0 since the sampling-based method is involved. Thus, as suggested by Tang et al. [20], the criterion for variable selection is that a coefficient is viewed as 0 if its 95% confidence interval includes zeros. In Table 1, ‘F0’ denotes the proportion that the number of 95% confidence interval for regression parameter including zero in 500 replications is divided by 500. The larger the values of F0 corresponding to non-significant regression parameters, and the smaller the values of F0 corresponding to significant parameters, the better the performance of the posited model.
Figure 1.
EPSR values of all parameters against iteration numbers for a randomly selected replication in the first simulation (left panel) and second simulation (right panel).
Table 1.
Bayesian estimates of parameters in the first and second simulation studies.
Examination of Table 1 indicated that (i) the Bayesian estimates of the unknown parameters and were reasonably accurate under the two abovementioned simulation studies since their absolute biases were less than 0.10 and their RMS values were less than 0.16; (ii) BLasso could correctly identify the zero and nonzero coefficients in most cases because the F0 values corresponding to important covariates were less than 10%, whilst the F0 values corresponding to unimportant covariates were near to 90%. On the other hand, to investigate the effectiveness of using the CDPMM prior for the random effects, we introduced the following RMSE (root of mean squared error) criterion in term of random effects,
where and denote, respectively, the true density function for random effect and kernel density estimation for the estimated values of random effect ; is chosen to be the th quantile of the dataset . The sample quantiles for the estimated values of RMSE are reported in Table 2. Furthermore, we chose a typical replication whose RMSE value is equal to the median in the 500 replications. Therefore, on the basis of the selected replication, the estimated densities of and based on the CDPMM prior against their corresponding true densities are plotted in Figure 2 and Figure 3, which indicated that the finite mixture of normal distributions can flexibly capture the symmetric, skewed and bimodal shapes of random effect . From Table 2, based on the results of 500 replication in both simulations, the estimated means and standard deviation (SD) of random effects and is approximate to their corresponding true values, the 25%, 50% and 75% quantiles of values are small and close enough, which indicated that it is robust to apply CDPMM method to estimate random effects. All these findings indicated that (i) our proposed Bayesian procedure could capture the true information of well, regardless of their true distributions and forms, and (ii) BLasso could identify the true model with a high probability.
Table 2.
Estimated means, standard deviations and RMSE quantile of random effects in the first and second simulation studies.
Figure 2.
Estimated densities versus true densities for random effects and in the first simulation.
Figure 3.
Estimated densities versus true densities for random effects and in the second simulation.
To compare the performance of the CDPMM prior for the random effects with the discrete DP given by Ishwaran and Zarepour [26] and Yang et al. [25], we conducted the following third simulation study. In this simulation study, 500 simulated datasets were generated by using the same setup as specified in the first simulation study, and fitted by the model with the discrete DP for the random effects. In the fourth simulation, we reanalyzed the aforementioned 500 datasets generated in the second simulation by using a parametric Bayesian approach with the random effects distribution specified by a multivariate normal distribution. The aim of this simulation was to compare the semiparametric approach based on the CDPMM prior with the parametric approach based on the Gaussian prior from the Bayesian perspective. Results obtained under the third and fourth simulations are reported in Table 3. Our programs were written in Matlab. It roughly took 119.3 s and 186.9 s in an Intel(R) Xeon(R) Silver 4216 CPU @ 2.10GHz (Intel, Santa Clara, CA, USA) serve to run 12,000 iterations for our proposed CDPMM and discrete DP, respectively; this indicates that the CDPMM method is much more efficient than the discrete DP in our considered simulations. It can be seen from Table 2 and Table 3 that (i) our proposed CDPMM and discrete DP methods have the same performance in term of the ’Bias’ and ’RMS’ values, but the F0 values corresponding to the non-significant parameters under the proposed CDPMM prior are higher than those under the discrete DP prior; (ii) the RMS values and the correct rates of variable selection based on the F0 values under the semiparametric CDPMM prior are better than those under the parametric Gaussian prior.
Table 3.
Bayesian estimates of parameters in the third and fourth simulation studies.
4.2. Real Example
In this section, the application of our proposed approach to the skewed longitudinal proportional data is illustrated by the analysis of a prospective ophthalmology study [28] from the Bayesian perspective. The prospective ophthalmology data used in the study were obtained from the Supplementary Materials of a paper by Song and Tan [29] and were available from https://biometrics.biometricsociety.org/home/archive/supplementary-materials, accessed on 5 september 2022. This prospective ophthalmology study described that the eyes of 31 patients before surgery were injected by three gas concentration levels of , and all patients after surgery were followed up three–eight times over a three-month period. The outcome variable was the percentage of remaining gas volume relative to the initial volume of gas injected. These longitudinal proportional data from a prospective ophthalmology study were analyzed by Qiu et al. [1], Song and Tan [29] and Song et al. [30], respectively. However, these authors did not consider to conduct variable selection in the analysis of this dataset. Our scientific interest in this study is to investigate the effect of three initial gas concentration levels of and time on the percentage of remaining gas volume, while accounting for selecting the important covariates based on the BLasso method. Let the response denote the percentage of gas left in the eye for patient i at the jth follow-up day, , and . Thus, the conditional mean in our proposed semiparametric mixed-effects model is given by
where is the time covariate for days after the gas injection; is the covariate of gas concentration levels equal to −1, 0 and 1, corresponding to the concentration levels of 15%, 20% and 25%, respectively; and random effects and are specified by CDPMM in (4), which characterize the effect fluctuations of interception and logarithmic time among patients.
The abovementioned MCMC algorithm was used to produce the joint Bayesian estimates of parameters and random effects in this real example. In the implementation of MCMC process, the hyperparameter values were taken to be the same as those given in simulation. Similarly, we used the EPSR method given in simulation to investigate the convergence for these algorithms. The EPSR values of all parameters against the iteration numbers was plotted in Figure 4, which indicated that the MCMC algorithm converged after 4000 iterations since their EPSR values were less than 1.2 after about 4000 iterations. Hence, observations collected after 4000 iterations were used to calculate Bayesian estimates of parameters and random effects. Results are reported in Table 4 and Figure 5, which indicated that (i) the estimated densities of random effects and were bimodal and skewed, which indicated that traditional normality assumption for random effects is inappropriate in this real example; (ii) the square of logarithmic time () was detected to be an important covariate with a significantly negative effect on the percentage of gas left in the eye, since its corresponding 95% confidence interval did not include zero. The gas concentration levels () and the logarithmic time () were insignificant at significance level because their 95% confidence interval included zero.
Figure 4.
EPSR values of all parameters against iteration numbers in the ophthalmology study.
Table 4.
Bayesian estimates (BEs) and standard deviations (SDs) and credible intervals (CIs) for parameters in the ophthalmology study.
Figure 5.
Estimated densities for random effects and in the ophthalmology study.
5. Conclusions
In this paper, we introduced a new semiparametric simplex mixed-effects models with the random effects following the centered Dirichlet process mixture model (CDPMM). The advantages of the proposed model based on CDPMM are the following: (i) it can capture the features of skewed and bimodal longitudinal proportional data; (ii) it can characterize absolutely continuous distributions for random effects. The novelty of our approach is that we adopted the BLasso procedure to simultaneously estimate parameters of interest, provide credible intervals (CIs) for parameters and conduct both shrinkage and variable selection for our considered models. A hybrid algorithm combining the Gibbs sampler and the MH algorithm was used to simultaneously obtain Bayesian estimates of unknown parameters, random effects and their standard errors and credible intervals. Empirical results show that (i) the proposed semiparametric Bayesian method provides quite accurate estimates of parameters (see Table 1); (ii) the average frequencies of correctly identifying unimportant predictors were near to 90%; (iii) CDPMM can effectively capture the potential features of normal, gamma and mixture normal distributions (see Table 2 and Figure 2, Figure 3 and Figure 5).
Author Contributions
Conceptualization, A.T. and X.D.; methodology, X.D. and A.T.; software, A.T. and Y.Z.; validation, A.T., X.D. and Y.Z.; formal analysis, A.T. and X.D.; investigation, A.T., X.D. and Y.Z.; Preparation of the original work draft, X.D. and A.T.; visualization, A.T. and Y.Z.; supervision, funding acquisition, A.T., X.D. and Y.Z. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by the National Natural Science Foundation of China (No. 11961079, No. 12161014, No. 11761016), the Guizhou Provincial Science and Technology Project ([2020]1Y009), the Natural Science Research Project of Education Department of Guizhou Province (KY[2021]134), the Project of High Level Creative Talents in Guizhou Province of China, and Guiyang University Multidisciplinary Team Construction Projects in 2021[2021-xk04].
Data Availability Statement
The research data are available on the website: https://biometrics.biometricsociety.org/home/archive/supplementary-materials, accessed on 5 september 2022.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Qiu, Z.G.; Song, P.X.K.; Tan, M. Simplex mixed-effects models for longitudinal proportional data. Scand. J. Stat. 2008, 35, 577–596. [Google Scholar] [CrossRef]
- Zhang, W.; Wei, H. Maximum Likelihood Estimation for Simplex Distribution Nonlinear Mixed Models via the Stochastic Approximation Algorithm. Rocky Mt. J. Math. 2008, 38, 1863–1875. [Google Scholar] [CrossRef]
- Zhao, Y.Y.; Xu, D.K.; Duan, X.D.; Dai, L. Bayesian estimation of simplex distribution nonlinear mixed models for longitudinal data. Int. J. Appl. Math. Stat. 2014, 52, 1–10. [Google Scholar]
- Bonat, W.H.; Lopes, J.E.; Shimakura, S.E.; Ribeiro, P.J., Jr. Likelihood analysis for a class of simplex mixed models. Chil. J. Stat. 2018, 8, 3–7. [Google Scholar]
- Quintero, F.O.L. Sensitivity analysis for variance parameters in Bayesian simplex mixed models for proportional data. Commun. Stat. Simul. Comput. 2017, 46, 5212–5228. [Google Scholar] [CrossRef]
- Kleinman, K.P.; Ibrahim, J.G. A semiparametric Bayesian approach to generalized linear mixed models. Stat. Med. 1998, 17, 2579–2596. [Google Scholar] [CrossRef]
- Tang, N.S.; Duan, X.D. A semiparametric Bayesian approach to generalized partial linear mixed models for longitudinal data. Comput. Stat. Data Anal. 2012, 56, 4348–4365. [Google Scholar] [CrossRef]
- Tang, N.S.; Zhao, Y.Y. Semi-parametric Bayesian analysis of nonlinear reproductive dispersion mixed models for longitudinal data. J. Multivar. Anal. 2013, 115, 68–83. [Google Scholar] [CrossRef]
- Zhao, Y.Y.; Xu, D.K.; Duan, X.D.; Du, J. A semiparametric Bayesian approach to binomial distribution logistic mixed-effects models for longitudinal data. J. Stat. Comput. Simul. 2022, 92, 1438–1456. [Google Scholar]
- Duan, X.D.; Fung, W.K.; Tang, N.S. Bayesian semiparametric reproductive dispersion mixed models for non-normal longitudinal data: Estimation and case influence analysis. J. Stat. Comput. Simul. 2017, 87, 1925–1939. [Google Scholar] [CrossRef]
- Hocking, R.R. The analysis and selection of variables in linear regression. Biometrics 1976, 32, 1–51. [Google Scholar] [CrossRef]
- Kass, R.E.; Raftery, A.E. Bayes factors. J. Am. Stat. Assoc. 1995, 90, 773–795. [Google Scholar] [CrossRef]
- Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control. 1974, 19, 716–723. [Google Scholar] [CrossRef]
- Spiegelhalter, D.J.; Best, N.; Carlin, B.P.; Linde, A. Bayesian measures of model complexity and fit (with discussion). J. R. Stat. Soc. Ser. B 2002, 64, 583–639. [Google Scholar] [CrossRef]
- Robert, T. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B 1996, 58, 267–288. [Google Scholar]
- Zou, H.; Hastie, T. Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B 2005, 61, 301–320. [Google Scholar] [CrossRef]
- Zou, H. The Adaptive Lasso and Its Oracle Properties. J. Am. Stat. Assoc. 2006, 101, 1418–1429. [Google Scholar] [CrossRef]
- Park, T.; Casella, G. The Bayesian Lasso. J. Am. Statal Assoc. 2008, 103, 681–686. [Google Scholar] [CrossRef]
- Guo, R.; Zhu, H.; Chow, S.M.; Ibrahim, J.G. Bayesian lasso for semiparametric structural equation models. Biometrics 2012, 68, 567–577. [Google Scholar] [CrossRef]
- Tang, A.M.; Zhao, X.; Tang, N.S. Bayesian variable selection and estimation in semiparametric joint models of multivariate longitudinal and survival data. Biom. J. 2017, 59, 57–78. [Google Scholar] [CrossRef]
- Erp, S.V.; Oberski, D.L.; Mulder, J. Shrinkage priors for Bayesian penalized regression. J. Math. Psychol. 2019, 89, 31–50. [Google Scholar] [CrossRef]
- Barndorff-Nielsen, O.E.; Jørgensen, B. Some parametric models on the simplex. J. Multivar. Anal. 1991, 39, 106–116. [Google Scholar] [CrossRef]
- Ohlssen, D.I.; Sharples, L.D.; Spiegelhalter, D.J. Flexible random-effects models using Bayesian semiparametric models: Applications to institutional comparisons. Stat. Med. 2007, 26, 2088–2112. [Google Scholar] [CrossRef] [PubMed]
- Sethuraman, J. A constructive definition of Dirichlet priors. Stat. Sin. 1994, 4, 639–650. [Google Scholar]
- Yang, M.G.; Dunson, D.B.; Baird, D. Semiparametric Bayes hierarchical models with mean and variance constraints. Comput. Stat. Data Anal. 2010, 54, 2172–2186. [Google Scholar] [CrossRef]
- Ishwaran, H.; Zarepour, M. Markov chain Monte Carlo in approximate Dirichlet and beta two-parameter process hierarchical models. J. Am. Stat. Assoc. 2000, 87, 371–390. [Google Scholar] [CrossRef]
- Gelman, A. Inference and monitoring convergence. In Markov Chain Monte Carlo in Practice; Gilks, W.R., Richardson, S., Spiegelhalter, D.J., Eds.; Chapman and Hall: London, UK, 1996. [Google Scholar]
- Meyers, S.M.; Ambler, J.S.; Tan, M.; Werner, J.C.; Huang, S.S. Variation of perfluorpropane disapperance after vitrectomy. Retina 1992, 4, 359–363. [Google Scholar] [CrossRef]
- Song, P.X.K.; Tan, M. Marginal models for longitudinal continuous proportional data. Biometrics 2000, 56, 496–502. [Google Scholar] [CrossRef]
- Song, P.X.K.; Qiu, Z.G.; Tan, M. Modelling heterogeneous dispersion in marginal models for longitudinal continuous proportional data. Biom. J. 2004, 46, 540–553. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).