Bayesian Semiparametric Regression Analysis of Multivariate Panel Count Data

Chunling Wang; Xiaoyan Lin

doi:10.3390/stats5020028

and

¹

Wells Fargo, San Francisco, CA 94105, USA

²

Department of Statistics, University of South Carolina, Columbia, SC 29208, USA

^*

Author to whom correspondence should be addressed.

Stats2022, 5(2), 477-493;https://doi.org/10.3390/stats5020028

This article belongs to the Special Issue Statistics, Analytics, and Inferences for Discrete Data

Version Notes

Order Reprints

Abstract

Panel count data often occur in a long-term recurrent event study, where the exact occurrence time of the recurrent events is unknown, but only the occurrence count between any two adjacent observation time points is recorded. Most traditional methods only handle panel count data for a single type of event. In this paper, we propose a Bayesian semiparameteric approach to analyze panel count data for multiple types of events. For each type of recurrent event, the proportional mean model is adopted to model the mean count of the event, where its baseline mean function is approximated by monotone I-splines. The correlation between multiple types of events is modeled by common frailty terms and scale parameters. Unlike many frequentist estimating equation methods, our approach is based on the observed likelihood and makes no assumption on the relationship between the recurrent process and the observation process. Under the Poisson counting process assumption, we develop an efficient Gibbs sampler based on novel data augmentation for the Markov chain Monte Carlo sampling. Simulation studies show good estimation performance of the baseline mean functions and the regression coefficients; meanwhile, the importance of including the scale parameter to flexibly accommodate the correlation between events is also demonstrated. Finally, a skin cancer data example is fully analyzed to illustrate the proposed methods.

Keywords:

data augmentation; Gibbs sampler; I-splines; pairwise correlation

1. Introduction

Panel count data often arise in epidemiological and medical studies, in which the events of interest have a property of recurring and subjects are monitored periodically. Since the subjects are not under continuous monitoring, the exact time of each recurrent event is not observed but the count of such events between adjacent observation times is known. Many times in a study, several types of related recurrent events are of interest to be collected. For example, the recurrent events can be different types of infections, tumors, and social behaviors such as drinking and drug use. The motivating example is bivariate panel count data on skin cancers from the literature [1]. The data arose from a skin cancer chemoprevention trial conducted by the University of Wisconsin Comprehensive Cancer Center. This was a double-blinded and placebo-controlled randomized III clinical trial. The main objective of the study was to evaluate the effectiveness of 0.5/m

^{2}

/day PO difluoromethylornithine (DFMO) in reducing the recurrence rate of skin cancers in a population of patients with a history of non-melanoma skin cancers: basal cell carcinoma and squamous cell carcinoma. Two hundred and ninety one patients were randomized into either a placebo group or a DFMO group. During the study, the patients were scheduled to regularly take 6-month reviews to check the development of both skin cancers. At each visit, the numbers of occurrences of both basal cell carcinoma and squamous cell carcinoma since the previous visit were recorded. For these bivariate panel count data, it is possible to analyze these two skin cancers separately to evaluate the effectiveness of DFMO. However, conducting a joint analysis is a better practice to investigate the correlation between two cancers and to improve the estimation efficiency.

Extensive methods have been established to analyze panel count data for a single type of recurrent event, for example Sun and Kalbfleisch [2], Sun and Zhao [1], Wellner and Zhang [3,4], Lu et al. [5,6], Wang and Lin [7], among others. For multivariate panel count data, He et al. [8] considered regression analysis for multivariate panel count data and first proposed a class of marginal mean models, which leave the dependence structures for related recurrent events completely unspecified. Zhang et al. [9] then improved their model and provided a robust joint modeling approach for the regression analysis of multivariate panel count data with an informative observation process. Li et al. [10] proposed semiparametric transformation models that allow the dependence of the recurrent event processes on the observation process. Along the same line, Zhao et al. [11] proposed a semiparametric additive model to analyze the multivariate panel count data with dependent observation processes and a terminal event. This branch of works emphasize the dependence of the recurrent event processes on the observation process and require the model of the observation process being explicitly specified. They usually derive estimating equation methods to estimate regression coefficient without estimating baseline mean functions.

Another line of research for panel count data is to consider varying or nonlinear covariate effects. He et al. [12] proposed a class of partially linear models with varying coefficients for the mean function of the counting processes to explore the nonlinear interactions between covariates. Zhao et al. [13] adopted the B-splines to model the regression function. Wang and Yu [14] proposed the time varying coefficient model by the local linear expansion method. For multivariate panel count data, Li et al. [15] presented a joint modeling for the recurrent event processes, observation process, and censoring time by using time-dependent random effects. Wang and Yu [16] proposed a varying coefficient mean model for multivariate panel count data to describe the possible nonlinear interact effects between covariates. In this paper, instead of investigating varying covariate effects, we only focus on traditional constant covariate effects.

Frequentist approaches dominate the analysis of panel count data. There is only a limited number of papers describing Bayesian approaches for analyzing panel count data. Chib et al. [17] investigated estimation and model comparison for panel count data models with multiple random effects. Sinha and Maiti (2004) proposed a Bayesian approach for the analysis of panel count data with dependent termination. Wang and Lin (2020) developed a Bayesian semiparametric approach under the proportional mean model with the baseline mean function approximated by monotone I-splines [18]. Dimitrakopoulos [19] discussed how Bayesian techniques can be used to estimate the panel count data model. Liang et al. [20] proposed a bivariate Gaussian Cox process model to jointly model the recurrent event process and the observation process. To the best of our knowledge, Ref. [20] is the only Bayesian paper for analyzing multivariate panel count data.

In this paper, we contribute a new Bayesian semiparameteric approach to analyze panel count data for multiple types of recurrent events. For each type of recurrent event, we adopt the modeling in Wang and Lin (2020) by using the proportional mean model to model the mean count of the recurrent event with its baseline mean function approximated by monotone I-splines [18]. The proposed approach is based on the observed likelihood using only the observed counts and observation times, which does not require us to specify a model for the observation process. Instead of building up the correlation among multiple types of events through their dependence on the observation process, we tackle the correlations between different types of events directly by introducing common subject-specific gamma frailty terms and additional scale parameters. The resulting pairwise correlations can be calculated in a close form and flexibly accommodate different correlation situations including positive, negative, strong, and weak correlations. Based on Poisson latent variable augmentation, our developed MCMC algorithm can estimate both the regression coefficients and the baseline mean functions simultaneously.

The remainder of this paper is organized as follows. In Section 2, primary models and the correlation derivation and interpretation are introduced. In Section 3, a detailed description of monotone I-splines, augmented likelihood function construction, and prior specification and posterior computation are presented. Section 4 evaluates our proposed methods via extensive simulation studies. The skin cancer data are used to demonstrate the performance of the proposed methods in Section 5. Lastly, Section 6 summarizes our findings and discusses some future work directions.

2. Model and Notation

2.1. Model Construction

Consider that n subjects participate a long-term study involving K types of related recurrent events. Each subject is not under continuous monitoring and instead observed at discrete time points. Specifically, for each subject i,

J_{i}^{(k)}

denotes the total number of observation times for event k, and the corresponding observation times are

0 = t_{i 0}^{(k)} < t_{i 1}^{(k)} < t_{i 2}^{(k)} < \dots < t_{i J_{i}^{(k)}}^{(k)}

. Let

N_{i}^{(k)} (t)

denote the cumulative count of the occurrence of event k prior to time t for subject i. Let

X_{i}^{(k)} = {(x_{i 1}^{(k)}, \dots, x_{i p}^{(k)})}^{'}

denote the

p \times 1

covariate vector associated with subject i for event k. For simplicity, in this paper we assume the covariates for subject i are identical for all K events and denoted by

X_{i}

. The whole set of observed panel count data is denoted by

D

= {t_{i j}^{(k)}, N_{i}^{(k)} (t_{i j}^{(k)}), X_{i}, f o r k = 1, \dots, K; i = 1, \dots, n; j = 1, \dots, J_{i}^{(k)}}

. Finally, we assume there is a latent subject-specific positive frailty term

w_{i}

for each subject i, which affects the occurrence rates and connects the multiple events. The frailties

w_{i}

’s are assumed to follow a gamma distribution

G (η, η)

with mean 1 and variance

1 / η

. The model identifiability is satisfied with the mean of frailties equal to 1 [21,22]. Given the covariates

X_{i}

and frailty

w_{i}

, we assume the counting process {

N_{i}^{(k)} (t), t > 0

} for event k has a proportional mean function in the following form:

E (N_{i}^{(k)} (t) | X_{i}, w_{i}) = w_{i}^{α_{k}} U_{0}^{(k)} (t) exp (X_{i}^{'} β^{(k)}),

(1)

where

α_{k}

is a scale parameter introduced to more flexibly accommodate the correlation between events,

U_{0}^{(k)} (t)

is the baseline mean function for event k, and

β^{(k)}

is a

p \times 1

vector of regression coefficients for event k. By default, the model considers different covariate effects for different recurrent events, but it is easy to extend to the situation where the covariate effects are identical across all the K events. More details about

α_{k}

related to the correlation are discussed in Section 2.2.

Let

Z_{i j}^{(k)}

denote the number of occurrence of event k in the jth time interval

(t_{i, j - 1}^{(k)}, t_{i j}^{(k)}]

for subject i, i.e.,

Z_{i j}^{(k)} = N_{i}^{(k)} (t_{i j}^{(k)}) - N_{i}^{(k)} (t_{i, j - 1}^{(k)})

, where we assume

N_{i}^{(k)} (t_{i 0}) = 0

for

i = 1, \dots, n .

Through this transformation, the whole set of data can be expressed as

D

= {t_{i j}^{(k)}, Z_{i j}^{(k)}, X_{i}, f o r k = 1, \dots, K; i = 1, \dots, n; j = 1, \dots, J_{i}^{(k)}}

. Under the Poisson process assumption, given the covariates

X_{i}

and latent frailty terms

w_{i}

,

Z_{i j}^{(k)}

independently follows Poisson distributions, which can be written as:

Z_{i j}^{(k)} | X_{i}, w_{i} \sim P (w_{i}^{α_{k}} {U_{0}^{(k)} (t_{i j}^{(k)}) - U_{0}^{(k)} (t_{i, j - 1}^{(k)})} exp (X_{i}^{'} β^{(k)})) .

(2)

This form is particularly useful for constructing the likelihood function in Section 3.2. Note that when integrating out the frailty effect

w_{i}

,

Z_{i j}^{(k)}

marginally follows mixed Poisson distributions [1].

2.2. Correlation Expression

An advantage of our proposed model is its straightforwardness in deriving the correlation formula between two events. Among the K events, we predesignate one event that is of main interest, for instance event j, as a reference event, and let

α_{j} = 1

. Then, given the covariates

X_{i}

and the unobservable frailty

w_{i}

, the cumulative counts of event j and event k (

k \neq j

) for subject i at any time point t are conditionally independent and have

N_{i}^{(j)} (t) | X_{i}, w_{i} \sim P (w_{i} U_{0}^{(j)} (t) exp (X_{i}^{'} β^{(j)})),

N_{i}^{(k)} (t) | X_{i}, w_{i} \sim P (w_{i}^{α_{k}} U_{0}^{(k)} (t) exp (X_{i}^{'} β^{(k)})) .

Using the law of total variance, we explicitly derive

C o v (N_{i}^{(j)} (t), N_{i}^{(k)} (t))

,

V a r (N_{i}^{(j)} (t))

and

V a r (N_{i}^{(k)} (t))

and hence the correlation formula as below:

\begin{matrix} C o r r (N_{i}^{(j)} (t), N_{i}^{(k)} (t)) & = \frac{C o v (N_{i}^{(j)} (t), N_{i}^{(k)} (t))}{\sqrt{V a r (N_{i}^{(j)} (t)) V a r (N_{i}^{(k)} (t))}} \\ = \frac{α_{k}}{\sqrt{η + \frac{η^{2}}{[U_{0}^{(j)} (t) exp (X_{i}^{'} β^{(j)})]}} \sqrt{\frac{Γ (η + 2 α_{k}) Γ (η)}{{(Γ (η + α_{k}))}^{2}} - 1 + \frac{Γ (η) η^{α_{k}}}{Γ (η + α_{k}) [U_{0}^{(k)} (t) exp (X_{i}^{'} β^{(k)})]}}} \end{matrix}

(3)

where

η > 0

and

α_{k} > - η / 2

. For details on derivation of Equation (3), see Appendix A. Clearly,

α_{k} = 0

implies that event j and event k are independent;

α_{k} > 0

implies that event j and event k are positively related;

α_{k} < 0

implies that event j and event k are negatively related. When getting rid of the covariate effects, the baseline correlation between the two events is

C o r r_{0} (N^{(j)} (t), N^{(k)} (t)) = \frac{α_{k}}{\sqrt{η + \frac{η^{2}}{U_{0}^{(j)} (t)}} \sqrt{\frac{Γ (η + 2 α_{k}) Γ (η)}{{(Γ (η + α_{k}))}^{2}} - 1 + \frac{Γ (η) η^{α_{k}}}{Γ (η + α_{k}) U_{0}^{(k)} (t)}}} .

The benefit of this form is that it gets away with the individual level and provides a broader view of the correlation between two events. Letting

U_{0}^{(j)} (t) = U_{0}^{(k)} (t) = 1

, we can explore the pure effect of

α_{k}

and

η

on the baseline correlation

C o r r_{0} (N^{(j)} (t), N^{(k)} (t))

. Note that the correlation between any two events is controlled by the parameters

η

and

α_{k}

. Figure A1 shows the baseline correlation performance related to

η

and

α_{k}

when

α_{k} \in [- 1, 1]

. It shows that with the variance of frailty increasing (

η

decreasing), the correlation increases; with the magnitude of

α_{k}

increasing, the correlation also increases. When

α_{k} > 0

, any

η > 0

is a legitimate choice; while for

α_{k} < 0

, the condition

η > - 2 α_{k}

must be satisfied to make

Γ (\cdot)

valid. In fact, the interpretation of the pairwise correlation is not limited to the pairs of events involving the predesignated event. A similar derivation can derive the correlation between any two types of events. Theoretically,

α_{k}

can be any value greater than

- η / 2

. We only choose

α_{k}

between −1 and 1 because we think in practice we can always choose the event with the largest frailty variability as the reference event. Then, for the other events, we only need

α_{k}

between

- 1

and 1.

3. The Proposed Bayesian Semiparametric Approach

3.1. Modeling $U_{0}^{(k)} (t)$ with Monotone I-Splines

To accommodate the nondecreasing nature of the baseline mean functions in the proposed model, we choose monotone I-splines to model them. I-splines were first bought up by Ramsay [18] and then widely applied in many semiparametric models. To put it briefly, I-splines are actually integrated M-splines, a set of non-negative spline functions [18]. In our proposed model, each baseline function of event k is modeled as a linear combination of I-splines:

U_{0}^{(k)} (t) = \sum_{l = 1}^{L} r_{l}^{(k)} I_{l} (t | d),

(4)

In the formula,

I_{l} (\cdot | d)

is the I-spline basis function with degree d; L is the number of I-spline basis functions, which equals the number of interior knots plus the degree d;

r_{l}^{(k)}

s are the nonnegative spline coefficients. For more detailed information on the formula of I-splines, refer to Ramsay [18] and Lin et al. [23]. The degree d and the placement of knots are two chief components that determine the basis functions. The former controls the smoothness and the latter controls the shape of the spline function. In general, 2 or 3 degrees is enough to provide adequate smoothness, and 10 to 30 knots can provide enough flexibility for a regression incorporating thousands of observations according to Cai et al. [24] and Wang and Dunson [25]. Equally spaced knots and quantile-based knots are two commonly used methods to select knots. Ref. [7] showed that the deviance information criterion (DIC) can be used to facilitate choosing the setup for the I-spline functions. In addition, reversible jump MCMC technique [26] may also help adaptively choosing the number and location of knots. In this paper, to avoid additional computation burden, we uniformly use degree 3 and 20 equally spaced knots (18 interior knots), which provides sufficient flexibility for modeling the unknown baseline mean functions.

3.2. Likelihood Augmentation with Poisson Latent Variables

Consider that n subjects participate in a long-term study involving K types of related recurrent events. The data structure and model are defined in Section 2.1. Under the Poisson process assumption, following Equation (2), the observed likelihood function can be written in the following form:

L_{o b s} = \prod_{k = 1}^{K} \prod_{i = 1}^{n} \prod_{j = 1}^{J_{i}^{(k)}} exp [- w_{i}^{α_{k}} {U_{0}^{(k)} (t_{i, j}^{(k)}) - U_{0}^{(k)} (t_{i, j - 1}^{(k)})} exp (X_{i}^{'} β^{(k)})]

\times [w_{i}^{α_{k}} {U_{0}^{(k)} (t_{i j}^{(k)}) - U_{0}^{(k)} (t_{i, j - 1}^{(k)})} exp (X_{i}^{'} β^{(k)})]^{Z_{i j}^{(k)}} / Z_{i j}^{(k)}!

Taking the baseline function

U_{0}^{(k)} (t)

in the form of (4), the likelihood can further be written as

L (θ | D) = \prod_{k = 1}^{K} \prod_{i = 1}^{n} \prod_{j = 1}^{J i^{(k)}} exp [- w_{i}^{α_{k}} \sum_{l = 1}^{L} r_{l}^{(k)} {I_{l} (t_{i, j}^{(k)}) - I_{l} (t_{i, j - 1}^{(k)})} exp (X_{i}^{'} β^{(k)})]

\times [w_{i}^{α_{k}} \sum_{l = 1}^{L} r_{l}^{(k)} {I_{l} (t_{i, j}^{(k)}) - I_{l} (t_{i, j - 1}^{(k)})} exp (X_{i}^{'} β^{(k)})]^{Z_{i j}^{(k)}} / Z_{i j}^{(k)}!

where

θ

represents the vector of all unknown parameters including

β^{(k)} = {(β_{1}^{(k)}, \dots, β_{p}^{(k)})}^{'}

and

r^{(k)} = {(r_{1}^{(k)}, \dots, r_{L}^{(k)})}^{'}

f o r k = 1, \dots, K

, and

α = {(α_{1}, \dots, α_{k})}^{'}

. Note that for the simplicity of notation, the d is omitted from the I-spline basis functions. With this likelihood format, the sampling for

r_{l}^{(k)}

is especially difficult. To solve this problem, we further decompose

Z_{i j}^{(k)}

as

\sum_{l = 1}^{L} Z_{i j l}^{(k)}

, where the augmented Poisson latent variables

Z_{i j l}^{(k)}

s independently follow Poisson distributions as follows:

Z_{i j l}^{(k)} | X_{i}, w_{i} \sim P (w_{i}^{α_{k}} {r_{l}^{(k)} [I_{l} (t_{i, j}^{(k)}) - I_{l} (t_{i, j - 1}^{(k)})]} exp (X_{i}^{'} β^{(k)})) .

The convolution property of Poisson distribution aids the transformation smoothly. Then, with the augmented Poisson latent variables, the likelihood function can be expressed as

L_{a u g} (θ | D) = \prod_{k = 1}^{K} \prod_{i = 1}^{n} \prod_{j = 1}^{J_{i}^{(k)}} \prod_{l = 1}^{L} exp [- w_{i}^{α_{k}} r_{l}^{(k)} {I_{l} (t_{i, j}^{(k)}) - I_{l} (t_{i, j - 1}^{(k)})} exp (X_{i}^{'} β^{(k)})]

\times [w_{i}^{α_{k}} r_{l}^{(k)} {I_{l} (t_{i, j}^{(k)}) - I_{l} (t_{i, j - 1}^{(k)})} exp (X_{i}^{'} β^{(k)})]^{Z_{i j l}^{(k)}} / Z_{i j l}^{(k)}! .

(5)

Based on this augmented likelihood Equation (5), we develop the Bayesian computation algorithm in Section 3.3.

3.3. Prior Specification and Posterior Computation

For Bayesian computation, we need to first specify prior distributions for unknown parameters. When we do not have much prior information about parameters, we usually assign vague priors for them. For

β_{m}^{(k)}

,

m = 1, \dots, p

;

k = 1, \dots, K

, we assign

N (0, σ^{2})

priors, where

σ^{2}

takes a large value such as 100. For nonnegative

r_{l}^{(k)}

,

l = 1, \dots, L

;

k = 1, \dots, K

, we assign exponential priors with rate parameter

λ_{k}

, where

λ_{k}

itself follows a gamma prior

G (a_{λ}, b_{λ})

. This prior specification is appealing from the computational perspective because it leads to conjugate forms for each of the conditional posterior distributions of

r_{l}^{(k)}

and

λ_{k}

. Theoretically, such a prior specification is closely related to Bayesian Lasso [27] and is equivalent to the penalized likelihood approach with an L1 penalty on those spline coefficients, in which

λ_{k}

serve as tuning parameters. Our simulation studies show that our approach is robust to the choice of hyperparameters, so we simply choose

a_{λ} = 1

and

b_{λ} = 1

. We adopt Gibbs sampling algorithm for posterior computation. Basically, we derive the full conditional distribution of each parameter component-wisely from the joint distribution of the likelihood function in (5) and the specified prior distributions. If the full conditional distribution of a parameter has a closed form, the sampling is straightforward. When the closed form is intractable, an adaptive rejection sample (ARS) [28] is adopted if the full conditional posterior distribution preserves the log-concavity. Even if the log concavity is not satisfied, we can still use adaptive rejection metropolis sampling (ARMS) [29] to draw samples. Specifically, we use function arms() in the R package HI [30] to realize the ARMS. The full conditional distributions of the Gibbs sampler are summarized as below.

Sample ( $Z_{i j 1}^{(k)}$ , …, $Z_{i j L}^{(k)}$ ) from a multinomial distribution $M (Z_{i j}^{(k)}, P {(k)}_{i j})$ , for $i = 1, \dots, n;$ $j = 1, \dots,$ $J_{i}^{(k)}$ ; $k = 1, \dots, K$ , where $P_{i j}^{(k)} = (p_{i j 1}^{(k)}, \dots, p_{i j L}^{(k)})$ with $\sum_{l = 1}^{L} p_{i j l}^{(k)} = 1$ , and

$p_{i j L}^{(k)} = \frac{r_{l}^{(k)} {I_{l} (t_{i j}^{(k)}) - I_{l} (t_{i, (j - 1)}^{(k)})}}{\sum_{l = 1}^{L} r_{l}^{(k)} {I_{l} (t_{i j}^{(k)}) - I_{l} (t_{i, (j - 1)}^{(k)})}} .$
Sample $r_{l}^{(k)}$ from a Gamma distribution $G (A_{l}^{(k)}, B_{l}^{(k)})$ , for $l = 1, \dots, L; k = 1, \dots, K$ , with

$A_{l}^{(k)} = \sum_{i = 1}^{n} \sum_{j = 1}^{J_{i}^{(k)}} Z_{i j l}^{(k)} + 1,$

and

$B_{l}^{(k)} = \sum_{i = 1}^{n} w_{i}^{α_{k}} {I_{l} (t_{i J_{i}^{(k)}}^{(k)}) - I_{l} (t_{i 0}^{(k)})} exp (X_{i}^{'} β^{(k)}) + λ_{k} .$
Sample $λ_{k}$ from a Gamma distribution $G (a_{λ} + L, b_{λ} + \sum_{l = 1}^{L} r_{l}^{(k)})$ , for $k = 1, \dots, K$ .
Sample $β_{m}^{(k)}$ by using the adaptive rejection sampling (ARS) [28] method, for $m = 1,$ $\dots, p; k = 1, \dots, K$ . The log full conditional distribution of each $β_{m}$ is proportional to

$exp [- \sum_{i = 1}^{n} \sum_{l = 1}^{L} w_{i}^{α_{k}} r_{l}^{(k)} {I_{l} (t_{i J_{i}^{(k)}}^{(k)}) - I_{l} (t_{i 0}^{(k)})} exp (X_{i}^{'} β^{(k)}) + \sum_{i = 1}^{n} \sum_{j = 1}^{J_{i}^{(k)}} X_{i}^{'} β^{(k)} Z_{i j}^{(k)} - {(β_{m}^{(k)})}^{2} / (2 σ^{2})] .$
Sample $w_{i}$ for $i = 1, \dots, n$ , by using the ARS. The log full conditional distribution of each $w_{i}$ is proportional to

$- \sum_{k = 1}^{K} w_{i}^{α_{k}} r_{l}^{(k)} {I_{l} (t_{i J_{i}^{(k)}}^{(k)}) - I_{l} (t_{i 0}^{(k)})} exp (X_{i}^{'} β^{(k)}) + \sum_{k = 1}^{K} \sum_{j = 1}^{J_{i}^{(k)}} \sum_{l = 1}^{L} Z_{i j l}^{(k)} log (w_{i}^{α_{k}}) - η w_{i} + (η - 1) log (w_{i}) .$
Sample $η$ by using the ARMS, the log full conditional distribution of which is proportional to

$(η - 1) \sum_{i = 1}^{n} log (w_{i}) - η (\sum_{i = i}^{n} w_{i} + b_{η}) + (n η + a_{η} - 1) log (η) - n log (τ (η)) .$
Sample $α_{k}$ , for $k = 1, \dots, K$ , by using the ARMS, the log full conditional distribution of which is proportional to

$- w_{i}^{α_{k}} r_{l}^{(k)} {I_{l} (t_{i J_{i}^{(k)}}^{(k)}) - I_{l} (t_{i 0}^{(k)})} exp (X_{i}^{'} β^{(k)}) + \sum_{j = 1}^{J_{i}} \sum_{l = 1}^{L} Z_{i j l}^{(k)} log (w_{i}^{α_{k}}) .$

In the R function arms(), we set the low bound of the support of $α_{k}$ as $- η / 2$ .

4. Simulation Studies

Simulation studies are conducted to evaluate the proposed methods. We only consider

2 (K)

types of events for the demonstration purpose. By default,

α_{1} = 1

. For notation simplicity, we denote

α_{2}

as

α

. It is straightforward to extend to three or more types of events. We particularly assess the performance of estimating covariate coefficients

β^{(k)}

and baseline functions

U_{0}^{(k)}

for different

η

and

α

values. We also compare the estimation results between models with and without the scale parameter

α

.

4.1. Data Generation

Consider 2 types of events and 100 subjects. Data are simulated according to model (1) under the Poisson process assumption. Specifically, we consider four different values for

α

:

- 0.3, 0, 0.5,

and 1, and two different values for

η

: 1 and 5. The baseline mean function for the first type of event is

U_{0}^{(1)} (t) = t + log (1 + t)

, which is approximately linear, and for the second type of event,

U_{0}^{(2)} (t) = t^{0.5} + log (1 + t)

, which is curvilinear. Two covariates are involved for each subject, where

X_{1}

is from a Bernoulli distribution with the probability of success 0.5 and

X_{2}

is from a standard normal distribution. Each subject has the same observation times for the two types of events. The number of observation times for each subject is generated from a Poisson distribution with mean 7, and the time length between every two adjacent observation times follows an exponential distribution with mean 0.5. Given the observation times and the generated frailty

w_{i}

from

G (η, η)

, for each subject, the event count for each time interval for each type of event is generated from a Poisson distribution as Equation (2). Each set of simulations consists of 500 data replicates.

4.2. Simulation Results

We implemented the Gibbs sampler in Section 3.3 for each simulated dataset. Good mixing and fast convergence in the chains of the key parameters were observed. The convergence assessment was carried out using various convergence criteria in the R package coda [31]. The following results are summarized based on 5000 iterations of the Gibbs samples after discarding the first 2000 iterations as a burn-in.

The first set of simulations aims at assessing the performance of estimating regression coefficients

β^{(k)}

. For this set of simulations, data are generated with different true

β

values but

α

fixed at 1. Table A1 shows the estimation bias (Bias) defined as the average of the posterior means minus the true value, the mean of the posterior standard deviations (SD), standard deviation of the posterior means (SE), and 95% coverage probability (CP95) of four combinations of

β

values. Clearly, the estimation of regression coefficients is good with small biases, SDs close to SEs, and 95% coverage probabilities around 95%. When

η

goes from 1 to 5 (i.e., the variance of frailty goes from 1 to 0.2), the Bias, SD and SE uniformly become smaller. It is noted that the estimation of

β_{2}

is consistently better than that of

β_{1}

in terms of giving smaller biases and SDs and SEs. This is because that

X_{2}

is a normal covariate providing more information than a binary covariate

X_{1}

, and that the variance of

X_{2}

is four times the variance of

X_{1}

. Our simulation study shows that when we add one more possible value to

X_{1}

such that

X_{1}

follows a uniform discrete distribution with

X_{1} \in {0, 1, - 1}

, the estimation of

β_{1}

’s is improved and close to the estimation of

β_{2}

.

The second set of simulations focuses on assessing the estimation performance of our proposed methods when data are generated with different

α

values. The true

β

values are fixed as

β_{1}^{(1)} = 1

,

β_{2}^{(1)} = - 1

,

β_{1}^{(2)} = 1

, and

β_{2}^{(2)} = 1

. Table A2 summarizes the estimation results. The estimation results of regression coefficients are similar as those in Table A1. Overall, the estimation is good with small biases, SDs close to SEs, and the CP95s close to 95%. The estimation is more precise for the larger

η

value, and the estimation of

β_{2}

is overall better than that of

β_{1}

. A new fact is observed that with the magnitude of

α

approaching 0, the estimation of the regression coefficients for the second type of event becomes more precise with smaller biases and SDs and SEs. This observation is reasonable, as the smaller value of

α

reflects the occurrence of the second type of events less affected by frailties, and thus the more precise estimation of the regression coefficients is expected. Furthermore, the estimated

α

and

η

have remarkably caught the truth. Different from the estimation of regression coefficients, a small value of

η

leads to the better estimation of

α

. A glance at the table may find the bias of

η

is a little too high when the true

η

is 5. We need to point out that the difference between

G (5, 5)

and

G (6, 6)

is trivial, so the estimation can still accurately catch the shape of the distribution. Figure A2 shows the estimation of the baseline mean functions corresponding to different setups of

α

and

η

values. The solid lines represent the real baseline mean function and the broken lines represent the estimated baseline mean function. The two lines on the top are for the first type of event

U_{0}^{(1)} (t) = t + log (1 + t)

, and the curvilinear lines at the bottom are for the baseline mean function

U_{0}^{(2)} (t) = t^{0.5} + log (1 + t)

of the second type of event. We can clearly see that all the estimated lines catch up to the true lines well and the plots on the right side when

η = 5

show an even better convergence to the truth than their counterparts on the left side when

η = 1

. Figure A3 shows the estimation of the baseline correlation functions corresponding to different setups of

α

and

η

values. It is clear that for each

α

and

η

setup, the estimated baseline correlation function overlays the true baseline correlation function.

Finally, we evaluate the effect of the scale parameter

α

on the estimation. Frailty models are well accepted for univariate panel count due to its flexibility and robustness. A naive extension to multivariate panel count data is to treat the frailty exactly the same for all types of events, i.e., each subject shares the common frailty to its full extent for all the events. Our simulation shows that this practice leads to wrong estimation results when true

α

is not 1. Table A3 presents the estimation results for the same simulated datasets as in Table A2, but are fitted with the naive model, i.e., model (1) with

α

omitted. The last block of rows of the table shows that the naive model does a comparable job in estimating the regression coefficients and

η

, which is reasonable because the naive model is the true model when the true value of

α

is 1. However, for other data generated with

α

not equal to 1, the misspecified naive model provides bad estimation results, which is especially clear when true

η = 1

. Compared with the results in Table A2, all Biases for the regression coefficients slightly increase. For the regression coefficients for the first type of event,

β_{1}^{(1)}

and

β_{2}^{(1)}

, the SEs are quite close to those in Table A2, but the SDs become smaller (the further away the true

α

is from 1, the smaller the SDs become), leading to CP95 being much smaller than 95%. For the regression coefficients for the second type of event,

β_{1}^{(2)}

and

β_{2}^{(2)}

, compared with those in Table A2, both SDs and SEs increase. When data are generated with

α = 0

or

0.5

, the naive method inflates SDs more than SEs, leading to higher coverage probabilities than 95%. When

α = - 0.3

, the inflated SEs are actually larger than the inflated SDs, leading to lower coverage probabilities. The bias, SD and SE of

η

are increased when the true

α

gets away from the value 1. The estimation results for

η = 5

have a similar pattern to those for

η = 1

but much better, which is reasonable because the large

η

value implies a small variance of frailties and thus less correlation between two types of events. This set of simulations shows that including the scale parameter of

α

is important to correctly estimate the regression coefficients.

5. Real Data Analysis

In this section, we apply our proposed methods to analyze the motivating dataset, the skin cancer dataset, which was introduced in Section 1, and compare the results with those in the literature. Two hundred and ninety patients are analyzed, with one removed from the original group because of no observation. The observation times were recorded in days. The covariate of major interest is denoted as

X_{1}

, which is equal to 1 if the patient was assigned in the DFMO group and 0 in the placebo group. The other three covariates of interest are

X_{2}

, the number of cancers prior to the trial;

X_{3}

, the patient’s age; and

X_{4}

, gender with male 1 and female 0. Table A4 displays the results of the covariate coefficients of the two types of skin cancer from the proposed model. Clearly, gender (

X_{4}

) has no significant effect, but the number of cancers prior to trial (

X_{2}

) and the patient’s age (

X_{3}

) are significant for both skin cancers. The positive value of

β_{2}

implies that the number of prior cancers has a positive relationship with the rate of new cancers. The negative value of

β_{3}

suggests that older patients tend to have a lower recurrence rate of new cancers, which makes biological sense as older people have a slower metabolism. However, DFMO has different effects on the recurrence rate of the two skin cancers. For basal cell carcinoma, the recurrence rate is decreased by a factor of 1.209, but for squamous cell carcinoma, the recurrence rate is increased by a factor of 1.111. However, neither of these two effects are significant due to the large posterior standard deviations.

The skin cancer data have also been analyzed by He et al. [8] and Zhang et al. [9], assuming the same covariate effects for both skin cancers. For comparison, we also modify step 4 in the Gibbs sampler in Section 3.3 and reanalyze the data with the same covariate coefficients for both cancers. The results are presented in Table A5. Overall, all the estimation results of the covariate coefficients for the three methods assuming common covariate effects are quite similar, with the Bayesian method producing smaller posterior standard deviations of the regression coefficients than the standard errors from the other two frequentist methods. The different effects of DFMO on the two skin cancers are completely hidden compared to the method where different covariate effects are assumed. The effect of the number of cancers prior to trial (

X_{2}

) is significant for all three methods. Our Bayesian method shows the significant effect of patient’s age (

X_{3}

), while the other two frequentist methods do not. Our method and Zhang et al. [9]’s method show no significant effect of gender (

X_{4}

), but He et al. [8]’s method does.

Our methods can estimate the baseline mean functions simultaneously with the regression coefficients. Figure A4 presents the estimated baseline mean functions for basal cell carcinoma and squamous cell carcinoma, respectively. The solid line on the top and the broken line at the bottom represent the baseline mean functions of basal cell carcinoma and squamous cell carcinoma from the model, assuming different covariate effects. The two broken lines in the middle represent the baseline mean functions of the two cancers when assuming the covariate effects are the same. Clearly, under the common covariate effects assumption, the baseline mean functions are both pulled towards the middle, which blurs the difference of the mean functions between these two skin cancers.

Finally, Figure A5 displays the estimated baseline correlation function when covariate effects are assumed different for the two skin cancers. It shows that the correlation between the two recurrent skin cancers is positive and strengthened across time.

6. Discussion

In this paper, we have proposed a Bayesian estimation approach for the semiparametric regression analysis of multivariate panel count data. For each type of event, the proportional mean model is adopted to model the cumulative mean count of the event, where its baseline mean function is approximated by monotone I-splines. The correlation between multiple types of events is modeled by common frailty terms and scale parameters. Based on the novel Poisson data augmentation, an efficient and easy-to-implement Gibbs sampler is developed for the MCMC computation. Through the MCMC samples, the regression coefficients, the baseline mean functions, and the baseline correlation function can be simultaneously estimated. Simulation studies have shown that the proposed approach provides accurate estimation of the regression coefficients and the baseline mean functions. Simulation studies have also demonstrated the importance of including the scale parameter in the model. The scale parameter provides substantive flexibility for modeling the correlation and meanwhile improving the estimation performance.

Instead of using a single common frailty

w_{i}

and scale parameters to model the correlation between counts of multiple events, another possible way is to have a frailty vector

(w_{i 1}, w_{i 2}, \dots, w_{i K})

for each subject. Then, the correlation between counts of multiple events is modeled through the dependence structure of these

w_{i j}

s. Bedair et al. [32] used copula-frailty models to analyze correlated recurrent events of different types. We may also borrow the copula-frailty idea to more flexibly model the correlation between counts of multiple events. In this paper, we assume frailties follow a gamma distribution

G (η, η)

. The gamma frailty is the simplest and most popular frailty distribution in survival analysis. Under the mean equal to 1 restriction, there is only one parameter

η

to represent the variance in the frailty, which is easy to interpret when connecting to the correlation. Some other frailties may also be used, such as log normal frailty and inverse Gaussian frailty. Then, the formula for the correlation and the sampling for the frailty would be different. Other possible choices of frailties are referred to by Balan and Putter [33], who provide an excellent tutorial on frailty models.

Regarding choosing which event as the reference event, in this paper, we choose the event with the largest frailty variability as the reference event. One possible way to choose the reference event is to first conduct univariate panel count data analysis for each type of event, and then to choose the one with the largest frailty variability as the reference event. For real data analysis, gamma frailty assumption may not hold for the reference event. For this, nonparametric frailties without assuming any specific distribution may be a more flexible choice.

Our approach can be slightly modified to accommodate common covariate effects for different types of events. However, the real data analysis shows that doing this would hide significant covariate effects for individual types of events. Therefore, we suggest assuming different covariate effects for different events first for real data analysis and then using the common covariate effect model later if similar effects are observed for different events.

Unlike the existing frequentist methods, our approach does not require model assumptions for the observation or censoring processes. Our approach is solely based on the observed likelihood and only needs the observed counts and observation times for the analysis, which makes our proposed approach generic to deal with panel count data arising from different observation schemes, including dependent or independent censoring and/or observation processes. On the other hand, the proposed approach may lose a certain amount of efficiency due to not incorporating the information of the observation or censoring processes when they are actually available.

Author Contributions

Conceptualization, X.L.; methodology, X.L. and C.W.; validation, X.L. and C.W.; formal analysis, C.W.; investigation, C.W.; writing—original draft preparation, C.W.; writing—review and editing, X.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would thank Lianming Wang for the initial conceptualization of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Derivation of Cov(N_i^(j), N_i^(k) ), Var(N_i^(j) ), and Var(N_i^(k) ) for α_j = 1

\begin{matrix} \begin{matrix} C o v (N_{i}^{(j)}, N_{i}^{(k)}) & = E (N_{i}^{(j)} N_{i}^{(k)}) - E (N_{i}^{(j)}) E (N_{i}^{(k)}) \\ = E [E (N_{i}^{(j)} | w_{i}) E (N_{i}^{(k)} | w_{i})] - E [E (N_{i}^{(j)} | w_{i})] E [E (N_{i}^{(k)} | w_{i})] \\ = U_{0}^{(j)} exp (X^{'} β^{(j)}) U_{0}^{(k)} exp (X^{'} β^{(k)}) [E (w_{i}^{1 + α_{k}}) - E (w_{i}) E (w_{i}^{α_{k}})] \\ = U_{0}^{(j)} exp (X^{'} β^{(j)}) U_{0}^{(k)} exp (X^{'} β^{(k)}) \times \frac{α_{k} η^{- (α_{k} + 1)} Γ (η + α_{k})}{Γ (η)} \\ V a r (N_{i}^{(j)}) & = V a r (E (N_{i}^{(j)} | w_{i})) + E (V a r (N_{i}^{(j)} | w_{i})) \\ = V a r (w_{i} U_{0}^{(j)} exp (X^{'} β^{(j)})) + E (w_{i} U_{0}^{(j)} exp (X^{'} β^{(j)})) \\ = \frac{{(U_{0}^{(j)} exp (X^{'} β^{(j)}))}^{2}}{η} + U_{0}^{(j)} exp (X^{'} β^{(j)}) \\ V a r (N_{i}^{(k)}) & = V a r (E (N_{i}^{(k)} | w_{i})) + E (V a r (N_{i}^{(k)} | w_{i})) \\ = V a r (w_{i}^{α_{k}} U_{0}^{(k)} exp (X^{'} β^{(k)})) + E (w_{i}^{α_{k}} U_{0}^{(k)} exp (X^{'} β^{(k)})) \\ = {(U_{0}^{(k)} exp (X^{'} β^{(k)}))}^{2} [\frac{Γ (η + 2 α_{k})}{Γ (η) η^{2 α_{k}}} - {(\frac{Γ (α_{k} + η)}{Γ (η)})}^{2} \frac{1}{η^{2 α_{k}}}] + U_{0}^{(k)} exp (X^{'} β^{(k)}) \frac{Γ (α_{k} + η)}{Γ (η) η^{α_{k}}} \end{matrix} \end{matrix}

Table A1. Simulation results: estimation of regression coefficients from our proposed method when true

α = 1

. Bias refers to the difference between the average of the 500 posterior means and the true value; SD refers to the mean of the 500 posterior standard deviations; SE refers to the standard deviation of the 500 posterior means, and CP95 refers to the 95% coverage probability.

Table A1. Simulation results: estimation of regression coefficients from our proposed method when true

α = 1

. Bias refers to the difference between the average of the 500 posterior means and the true value; SD refers to the mean of the 500 posterior standard deviations; SE refers to the standard deviation of the 500 posterior means, and CP95 refers to the 95% coverage probability.

		$η = 1$				$η = 5$
Parameter	Truth	Bias	SD	SE	CP95	Bias	SD	SE	CP95
$β_{1}^{(1)}$	0	0.0583	0.2238	0.1988	0.9520	0.0214	0.1287	0.1319	0.9400
$β_{2}^{(1)}$	1	0.0004	0.1215	0.1223	0.9520	0.0015	0.0696	0.0722	0.9300
$β_{1}^{(2)}$	1	0.0561	0.2225	0.1975	0.9700	0.0203	0.1291	0.1236	0.9600
$β_{2}^{(2)}$	1	−0.0002	0.1203	0.1220	0.9480	0.0004	0.0683	0.0689	0.9460
$β_{1}^{(1)}$	0	0.0392	0.2294	0.2179	0.9580	0.0178	0.1303	0.1311	0.9520
$β_{2}^{(1)}$	1	−0.0053	0.1245	0.1228	0.9520	−0.0008	0.0714	0.0709	0.9520
$β_{1}^{(2)}$	−1	0.0647	0.2523	0.2479	0.9560	0.0364	0.1651	0.1607	0.9500
$β_{2}^{(2)}$	1	−0.0109	0.1344	0.1297	0.9620	−0.0001	0.0858	0.0824	0.9620
$β_{1}^{(1)}$	1	0.0490	0.2198	0.2133	0.9500	0.0119	0.1203	0.1191	0.9540
$β_{2}^{(1)}$	1	−0.0078	0.1182	0.1202	0.9280	−0.0017	0.0646	0.0653	0.9380
$β_{1}^{(2)}$	−1	0.0771	0.2497	0.2373	0.9440	0.0400	0.1629	0.1593	0.9500
$β_{2}^{(2)}$	1	−0.0127	0.1323	0.1335	0.9440	−0.0026	0.0840	0.0825	0.9600
$β_{1}^{(1)}$	1	−0.0639	0.2152	0.2173	0.9320	−0.0178	0.1192	0.1194	0.9392
$β_{2}^{(1)}$	−1	0.0025	0.1138	0.1121	0.9500	0.0051	0.0633	0.0678	0.9392
$β_{1}^{(2)}$	1	−0.0663	0.2202	0.2247	0.9300	−0.0219	0.1280	0.1329	0.9196
$β_{2}^{(2)}$	1	0.0031	0.1177	0.1156	0.9440	0.0046	0.0681	0.0786	0.9412

Table A2. Simulation results: estimation of regression coefficients

α

and

η

from our proposed method for data generated with different

α

values and true

β_{1}^{(1)} = 1

,

β_{2}^{(1)} = - 1

,

β_{1}^{(2)} = 1

, and

β_{2}^{(2)} = 1

. Bias refers to the difference between the average of the 500 posterior means and the true value; SD refers to the mean of the 500 posterior standard deviations; SE refers to the standard deviation of the 500 posterior means, and CP95 refers to the 95% coverage probability.

Table A2. Simulation results: estimation of regression coefficients

α

and

η

from our proposed method for data generated with different

α

values and true

β_{1}^{(1)} = 1

,

β_{2}^{(1)} = - 1

,

β_{1}^{(2)} = 1

, and

β_{2}^{(2)} = 1

. Bias refers to the difference between the average of the 500 posterior means and the true value; SD refers to the mean of the 500 posterior standard deviations; SE refers to the standard deviation of the 500 posterior means, and CP95 refers to the 95% coverage probability.

		$η = 1$				$η = 5$
Parameter		Bias	SD	SE	CP95	Bias	SD	SE	CP95
$α = - 0.3$	$β_{1}^{(1)}$	−0.0269	0.2190	0.2266	0.9300	−0.0124	0.1216	0.1252	0.9360
	$β_{2}^{(1)}$	−0.0054	0.1168	0.1202	0.9460	0.0006	0.0652	0.0685	0.9460
	$β_{1}^{(2)}$	−0.0086	0.1048	0.1049	0.9440	−0.0022	0.0829	0.0812	0.9540
	$β_{2}^{(2)}$	0.0021	0.0548	0.0580	0.9400	0.0006	0.0421	0.0421	0.9600
	$α$	−0.0083	0.0423	0.0462	0.9200	−0.0033	0.1010	0.1010	0.9400
	$η$	0.0412	0.1699	0.1754	0.9440	0.2494	1.0231	0.8814	0.9740
$α = 0$	$β_{1}^{(1)}$	−0.0051	0.2229	0.2256	0.9400	−0.0017	0.1208	0.1175	0.9588
	$β_{2}^{(1)}$	−0.0043	0.1195	0.1195	0.9460	−0.0004	0.0654	0.0682	0.9294
	$β_{1}^{(2)}$	−0.0034	0.0758	0.0767	0.9460	−0.0048	0.0770	0.0749	0.9608
	$β_{2}^{(2)}$	−0.0010	0.0370	0.0378	0.9580	0.0007	0.0377	0.0365	0.9627
	$α$	0.0028	0.0344	0.0349	0.9420	0.0017	0.1152	0.1078	0.9784
	$η$	0.0328	0.1790	0.1781	0.9560	0.6264	1.3301	1.3349	0.9588
$α = 0.5$	$β_{1}^{(1)}$	−0.0190	0.2194	0.2227	0.9360	−0.0076	0.1213	0.1179	0.9588
	$β_{2}^{(1)}$	−0.0136	0.1162	0.1190	0.9340	0.0001	0.0647	0.0652	0.9490
	$β_{1}^{(2)}$	−0.0263	0.1372	0.1338	0.9400	−0.0143	0.0958	0.0946	0.9549
	$β_{2}^{(2)}$	−0.0047	0.0719	0.0697	0.9600	0.0010	0.0497	0.0475	0.9588
	$α$	0.0066	0.0589	0.0578	0.9480	0.0163	0.1213	0.1246	0.9471
	$η$	0.0496	0.1729	0.1841	0.9500	0.4241	1.2655	1.2633	0.9569
$α = 1$	$β_{1}^{(1)}$	−0.0639	0.2152	0.2173	0.9320	−0.0178	0.1192	0.1194	0.9392
	$β_{2}^{(1)}$	0.0025	0.1138	0.1121	0.9500	0.0051	0.0633	0.0678	0.9392
	$β_{1}^{(2)}$	−0.0663	0.2202	0.2247	0.9300	−0.0219	0.1280	0.1329	0.9196
	$β_{2}^{(2)}$	0.0031	0.1177	0.1156	0.9440	0.0046	0.0681	0.0786	0.9412
	$α$	0.0089	0.0804	0.0745	0.9760	0.0294	0.1529	0.1577	0.9392
	$η$	0.0469	0.1684	0.1734	0.9540	0.3745	1.2136	1.2777	0.9490

Figure A1. Baseline correlation performance related to

η

and

α_{k}

when assuming

U_{0}^{(j)} (t) = U_{0}^{(k)} (t) = 1

.

Table A3. Simulation results: estimation of regression coefficients and

η

from the naive method for data generated with different

α

values and true

β_{1}^{(1)} = 1

,

β_{2}^{(1)} = - 1

,

β_{1}^{(2)} = 1

, and

β_{2}^{(2)} = 1

. Bias refers to the difference between the average of the 500 posterior means and the true value; SD refers to the mean of the 500 posterior standard deviations; SE refers to the standard deviation of the 500 posterior means, and CP95 refers to the 95% coverage probability.

Table A3. Simulation results: estimation of regression coefficients and

η

from the naive method for data generated with different

α

values and true

β_{1}^{(1)} = 1

,

β_{2}^{(1)} = - 1

,

β_{1}^{(2)} = 1

, and

β_{2}^{(2)} = 1

. Bias refers to the difference between the average of the 500 posterior means and the true value; SD refers to the mean of the 500 posterior standard deviations; SE refers to the standard deviation of the 500 posterior means, and CP95 refers to the 95% coverage probability.

		$η = 1$				$η = 5$
Parameter		Bias	SD	SE	CP95	Bias	SD	SE	CP95
$α = - 0.3$	$β_{1}^{(1)}$	−0.0438	0.1344	0.2443	0.7220	−0.0130	0.1016	0.1288	0.8640
	$β_{2}^{(1)}$	0.0061	0.0710	0.1388	0.7080	0.0009	0.0536	0.0722	0.8500
	$β_{1}^{(2)}$	−0.0206	0.1353	0.1628	0.8880	−0.0103	0.1085	0.0938	0.9800
	$β_{2}^{(2)}$	0.0062	0.0715	0.0967	0.8540	0.0027	0.0574	0.0525	0.9720
	$η$	2.6937	0.6275	0.9058	0.0000	3.9252	0.8115	0.3937	0.0060
$α = 0$	$β_{1}^{(1)}$	−0.0114	0.1459	0.2308	0.7820	−0.0046	0.1021	0.1205	0.9080
	$β_{2}^{(1)}$	−0.0028	0.0770	0.1294	0.7700	0.0010	0.0544	0.0690	0.8760
	$β_{1}^{(2)}$	−0.0302	0.1513	0.1166	0.9800	−0.0095	0.1106	0.0843	0.9900
	$β_{2}^{(2)}$	0.0080	0.0802	0.0709	0.9820	0.0031	0.0587	0.0474	0.9780
	$η$	1.9095	0.4903	0.6390	0.0000	3.6761	0.9184	0.5119	0.0320
$α = 0.5$	$β_{1}^{(1)}$	−0.0282	0.1826	0.2277	0.8840	−0.0059	0.1073	0.1185	0.9160
	$β_{2}^{(1)}$	−0.0092	0.0962	0.1194	0.8780	−0.0002	0.0568	0.0646	0.9220
	$β_{1}^{(2)}$	−0.0536	0.1893	0.1383	0.9760	−0.0193	0.1162	0.0962	0.9780
	$β_{2}^{(2)}$	−0.0016	0.0997	0.0777	0.9880	0.0037	0.0617	0.0496	0.9920
	$η$	0.5948	0.2502	0.2712	0.2160	2.3688	1.1662	0.9781	0.4600
$α = 1$	$β_{1}^{(1)}$	−0.0679	0.2161	0.2179	0.9320	−0.0181	0.1196	0.1194	0.9420
	$β_{2}^{(1)}$	0.0029	0.1136	0.1118	0.9500	0.0027	0.0631	0.0614	0.9500
	$β_{1}^{(2)}$	−0.0690	0.2202	0.2261	0.9380	−0.0208	0.1276	0.1315	0.9280
	$β_{2}^{(2)}$	0.0022	0.1170	0.1155	0.9520	0.0019	0.0673	0.0702	0.9440
	$η$	0.0401	0.1552	0.1614	0.9400	0.1853	0.9692	0.9736	0.9600

Table A4. Estimation results (posterior mean, posterior standard deviation and 95% credible interval) of the covariate effects for the two types of skin cancers. * indicates statistically significant.

	$β_{1}$	$β_{2}$	$β_{3}$	$β_{4}$
basal	−0.1902	0.1028	−0.0376	0.0238
	0.1500	0.0145	0.0055	0.1506
	(−0.4894, 0.0987)	(0.0762, 0.1330) *	(−0.0487, −0.0273) *	(−0.2717, 0.3185)
squamous	0.1055	0.1451	−0.0196	0.3053
	0.2174	0.0213	0.0070	0.2142
	(−0.3153, 0.5251)	(0.1051, 0.0.1894) *	( −0.0333, −0.0060) *	(−0.1165, 0.7255)

Table A5. Estimation results (posterior mean, posterior standard deviation and 95% credible interval) of the covariate effects when common covariate effects are assumed for the two types of skin cancers from the proposed method, He et al. (2008)’s method, and Zhang et al. (2013)’s method. * indicates statistically significant.

	$β_{1}$	$β_{2}$	$β_{3}$	$β_{4}$
Proposed	−0.1509	0.0810	−0.0264	0.0636
	0.1381	0.0122	0.0052	0.1409
	(−0.4203, 0.1197)	(0.0599, 0.1087) *	(−0.0367, −0.0169) *	(−0.2098, 0.3369)
He et al.	−0.0239	0.1440	−0.0116	0.3807
	0.1809	0.0212	0.0084	0.1778
	(−0.3785, 0.3307)	(0.1024, 0.1856) *	(−0.0281, 0.0049)	(0.0322, 0.7292) *
Zhang et al.	−0.2253	0.0784	0.0016	0.2534
	0.1831	0.0090	0.0087	0.1942
	(−0.5842, 0.1336)	(0.0608, 0.0960) *	(−0.0155, 0.0187)	(−0.1272, 0.06340)

Figure A2. Simulation results: estimation of baseline mean function for models with different

α

and

η

values. The solid lines represent the true baseline mean function and the broken lines represent the estimated baseline mean function.

Figure A3. Simulation results: estimation of baseline correlation function for models with different

α

and

η

values. The solid lines represent the true baseline mean function and the broken lines represent the estimated baseline mean function.

Figure A4. Estimated cumulative baseline mean function of models with and without the same covariate effects for basal cell carcinoma and squamous cell carcinoma.

Figure A5. Estimated baseline correlation between basal cell carcinoma and squamous cell carcinoma across time.

References

Sun, J.; Zhao, X. Statistical Analysis of Panel Count Data; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
Sun, J.; Kalbfleisch, J. Estimation of the mean function of point processes based on panel count data. Stat. Sin. 1995, 5, 279–289. [Google Scholar]
Wellner, J.A.; Zhang, Y. Two estimators of the mean of a counting process with panel count data. Ann. Stat. 2000, 28, 779–814. [Google Scholar] [CrossRef]
Wellner, J.A.; Zhang, Y. Two likelihood-based semiparametric estimation methods for panel count data with covariates. Ann. Stat. 2007, 35, 2106–2142. [Google Scholar] [CrossRef] [Green Version]
Lu, M.; Zhang, Y.; Huang, J. Estimation of the mean function with panel count data using monotone polynomial splines. Biometrika 2007, 94, 705–718. [Google Scholar] [CrossRef] [Green Version]
Lu, M.; Zhang, Y.; Huang, J. Semiparametric estimation methods for panel count data using monotone B-splines. J. Am. Stat. Assoc. 2009, 104, 1060–1070. [Google Scholar] [CrossRef]
Wang, J.; Lin, X. A Bayesian approach for semiparametric regression analysis of panel count data. Lifetime Data Anal. 2020, 26, 402–420. [Google Scholar] [CrossRef]
He, X.; Tong, X.; Sun, J.; Cook, R.J. Regression analysis of multivariate panel count data. Biostatistics 2008, 9, 234–248. [Google Scholar] [CrossRef]
Zhang, H.; Zhao, H.; Sun, J.; Wang, D.; Kim, K. Regression analysis of multivariate panel count data with an informative observation process. J. Multivar. Anal. 2013, 119, 71–80. [Google Scholar] [CrossRef]
Li, N.; Park, D.H.; Sun, J.; Kim, K. Semiparametric transformation models for multivariate panel count data with dependent observation process. Can. J. Stat. 2010, 39, 458–474. [Google Scholar] [CrossRef] [Green Version]
Zhao, H.; Li, Y.; Sun, J. Semiparametric analysis of multivariate panel count data with dependent observation processes and a terminal event. J. Nonparametr. Stat. 2013, 25, 379–394. [Google Scholar] [CrossRef]
He, X.; Feng, X.; Tong, X.; Xingqiu, Z. Semiparametric partially linear varying coefficient models with panel count data. Lifetime Data Anal. 2017, 23, 439–466. [Google Scholar] [CrossRef] [PubMed]
Zhao, H.; Zhang, Y.; Zhao, X.; Yu, Z. A nonparametric regression model for panel count data analysis. Stat. Sin. 2019, 29, 809–826. [Google Scholar] [CrossRef] [Green Version]
Wang, Y.; Yu, Z. A kernel regression model for panel count data with time-varying coefficients. arXiv 2019, arXiv:1903.10233. [Google Scholar] [CrossRef]
Li, Y.; He, X.; Wang, H.; Zhang, B.; Sun, J. Semiparametric regression of multivariate panel count data with informative observation times. J. Multivar. Anal. 2015, 140, 209–219. [Google Scholar] [CrossRef]
Wang, Y.; Yu, Z. A kernel regression model for panel count data with nonparametric covariate functions. Biometrics 2021. [Google Scholar] [CrossRef]
Chib, S.; Greenberg, E.; Winkelmann, R. Posterior simulation and Bayes factors in panel count data models. J. Econom. 1998, 86, 33–54. [Google Scholar] [CrossRef] [Green Version]
Ramsay, J. Monotone regression splines in action. Stat. Sci. 1988, 3, 425–441. [Google Scholar] [CrossRef]
Dimitrakopoulos, S. Bayesian Estimation of Panel Count Data Models: Dynamics, Latent Heterogeneity, Serial Error Correlation, and Nonparametric Structures. Panel Data Econom. 2019, 6, 147–173. [Google Scholar]
Liang, Y.; Li, Y.; Zhang, B. Bayesian nonparametric inference for panel count data with an informative observation process. Biom. J. 2018, 60, 583–596. [Google Scholar] [CrossRef] [Green Version]
Sun, J.; Tong, X.; He, X. Regression analysis of panel count data with dependent observation times. Biometrics 2007, 63, 1053–1059. [Google Scholar] [CrossRef]
Sinha, D.; Maiti, T. A Bayesian approach for the analysis of panel-count data with dependent termination. Biometrics 2004, 60, 34–40. [Google Scholar] [CrossRef] [PubMed]
Lin, X.; Cai, B.; Wang, L.; Zhang, Z. A Bayesian proportional hazards model for general interval-censored data. Lifetime Data Anal. 2015, 21, 470–490. [Google Scholar] [CrossRef] [PubMed]
Cai, B.; Lin, X.; Wang, L. Bayesian proportional hazards model for current status data with monotone splines. Comput. Stat. Data Anal. 2011, 55, 2644–2651. [Google Scholar] [CrossRef]
Wang, L.; Dunson, D.B. Semiparametric Bayes’ Proportional Odds Models for Current Status Data with Underreporting. Biometrics 2011, 67, 1111–1118. [Google Scholar] [CrossRef] [Green Version]
Green, P.J. Reversible Jump Markov Chain Monte Carlo Computation and Bayesian Model Determination. Biometrika 1995, 82, 711–732. [Google Scholar] [CrossRef]
Park, T.; Casella, G. The Bayeisan Lasso. J. Am. Stat. Assoc. 2008, 103, 681–686. [Google Scholar] [CrossRef]
Gilks, W.R.; Wild, P. Adaptive rejection sampling for Gibbs sampling. Appl. Stat. 1992, 41, 337–348. [Google Scholar] [CrossRef]
Gilks, W.R.; Best, N.; Tan, K. Adaptive rejection Metropolis sampling within Gibbs sampling. Appl. Stat. 1995, 44, 455–472. [Google Scholar] [CrossRef] [Green Version]
Petris, G.; Tardella, L. Original C code for ARMS by Wally Gilks. HI: Simulation from Distributions Supported by Nested Hyperplanes. R Package Version 0.4. 2013. Available online: https://rdrr.io/cran/HI/ (accessed on 10 January 2022).
Plummer, M.; Best, N.; Cowles, K.; Vines, K. CODA: Convergence Diagnosis and Output Analysis for MCMC. R News 2006, 6, 7–11. [Google Scholar]
Bedair, K.F.; Hong, Y.; Al-Khalidi, H.R. Copula-frailty models for recurrent event data based on Monte Carlo EM algorithm. J. Stat. Comput. Simul. 2021, 91, 3530–3548. [Google Scholar] [CrossRef]
Balan, T.; Putter, H. A tutorial on frailty models. Stat. Methods Med. Res. 2020, 29, 3424–3452. [Google Scholar] [CrossRef] [PubMed]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Bayesian Semiparametric Regression Analysis of Multivariate Panel Count Data

Abstract

1. Introduction

2. Model and Notation

2.1. Model Construction

2.2. Correlation Expression

3. The Proposed Bayesian Semiparametric Approach

3.1. Modeling $U_{0}^{(k)} (t)$ with Monotone I-Splines

3.2. Likelihood Augmentation with Poisson Latent Variables

3.3. Prior Specification and Posterior Computation

4. Simulation Studies

4.1. Data Generation

4.2. Simulation Results

5. Real Data Analysis

6. Discussion

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Derivation of Cov(N_i^(j), N_i^(k) ), Var(N_i^(j) ), and Var(N_i^(k) ) for α_j = 1

References

Article Metrics

Citations

Article Access Statistics

Bayesian Semiparametric Regression Analysis of Multivariate Panel Count Data

Abstract

1. Introduction

2. Model and Notation

2.1. Model Construction

2.2. Correlation Expression

3. The Proposed Bayesian Semiparametric Approach

3.1. Modeling U 0 ( k ) ( t ) with Monotone I-Splines

3.2. Likelihood Augmentation with Poisson Latent Variables

3.3. Prior Specification and Posterior Computation

4. Simulation Studies

4.1. Data Generation

4.2. Simulation Results

5. Real Data Analysis

6. Discussion

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Derivation of Cov(Ni(j), Ni(k) ), Var(Ni(j) ), and Var(Ni(k) ) for αj = 1

References

Article Metrics

Citations

Article Access Statistics

3.1. Modeling $U_{0}^{(k)} (t)$ with Monotone I-Splines

Appendix A. Derivation of Cov(N_i^(j), N_i^(k) ), Var(N_i^(j) ), and Var(N_i^(k) ) for α_j = 1