Extensions to Multivariate Space Time Mixture Modeling of Small Area Cancer Data

Oral cavity and pharynx cancer, even when considered together, is a fairly rare disease. Implementation of multivariate modeling with lung and bronchus cancer, as well as melanoma cancer of the skin, could lead to better inference for oral cavity and pharynx cancer. The multivariate structure of these models is accomplished via the use of shared random effects, as well as other multivariate prior distributions. The results in this paper indicate that care should be taken when executing these types of models, and that multivariate mixture models may not always be the ideal option, depending on the data of interest.


Introduction
Many cancers share risk factors but differ in incidence. Previous studies have demonstrated benefits for rarer diseases via the implementation of multivariate modeling with a more prevalent disease that presents common risk factors [1,2]. Oral cavity and pharynx cancer (hereafter referred to as oral/pharynx) (ICD-9-CM codes: 145.9, 146.2, 146.3, 146.5-146.9, 147.0-147.3, 147.8, 147.9, 148.9, 149.0, 149.8, 149.9) and lung and bronchus cancer (henceforth referred to as lung) (ICD-9-CM codes: 162.2-162.5, 162.8, and 162.9) share tobacco smoke as an exposure, while lip cancer, which is included with oral/pharynx cancer, and melanoma cancer of the skin (referred to as melanoma from here) (ICD-9-CM codes: 172.2, 172.3, 172.5-172.7, and 172.9), have the common risk factor of ultraviolet light exposure [3][4][5][6]. Additionally, oral/pharynx is quite rare in comparison to lung and melanoma, which are among the most common types of cancer in the United States [7]. Thus, it is of interest to explore the benefits of employing multivariate modeling for these relationships by including all three diseases in a single model.
Oral/pharynx, melanoma, and lung all display spatio-temporal (ST) patterns in their incidences [8][9][10][11][12], and a Bayesian multivariate ST mixture model proposed by Lawson et al. [2] can offer an ideal, flexible model for risk that offers improvements over a more standard multivariate model introduced by Knorr-Held [13][14][15]. The real data example in that paper also involved the three diseases of interest and illustrated that care should be taken in implementing types of models. In that example, there was evidence that modeling was improved for oral/pharynx, but only up to a certain point in time, year 10, in the multivariate setting. To deal with this, they simply restricted to the first 10 years of the study time to create a working example. However, in this paper we wish to explore extensions to the model that could offer greater temporal flexibility for situations of this nature so that all of the data can be utilized and no information is lost. These extensions will involve imposing more flexibility in the mixture model, as well as bivariate fits of only oral/pharynx with lung because that relationship is more grounded in the literature.
Often, some important risk factors may be unavailable because they are difficult to measure or obtain. A benefit of employing an ST model involves the models' ability to account for such unmeasured exposures that may be common among the diseases considered. Examples of these exposures include: health service availability, environmental influences, or demographic factors. In this study, we do not have the ability to measure many of these exposures since data are aggregated at the county level.
This paper is developed as follows: First, we describe the case study and the statistical methods. Following that, we present the results. Finally, we discuss the findings and draw conclusions.

Materials and Methods
The outcome data of interest is in the form of the annual South Carolina county level incidences for oral/pharynx, lung, and melanoma over the years 1996-2009. A conditionally-independent Poisson distribution is a commonly-used model for small area counts in disease mapping [16][17][18], and is described as follows: with y ijk , the incidence of cancer k in county i at time j, and µ ijk , the mean of the Poisson distribution defined as the product of the known expected rate of disease, e ijk , and the relative risk, θ ijk . In this framework, we model the logarithm of the relative risk; these modeling options are described in Section 3.

Case Study
The outcome data for this analysis was gathered from the South Carolina Assessment Network [19]. In this data, the managers placed thresholds on small counts of disease such that an observed count between 1 and 4 for a particular county is given the value 5 and an observed count between 5 and 9 is given the value 10. To make the distribution of counts more appropriate, we imputed values based on the appropriate interval as indicated by the threshold value. To do this, we calculate an expected count of diseases for each county and year by multiplying the overall rate of disease (based on the threshold values) by the county population. Next, we sample from a Poisson distribution with the expected rate as the mean restricted such that the imputed value falls within the appropriate threshold. The code for accomplishing this is included in the supplemental files of Lawson et al. [2]. The numbers and percentages of zeros and imputed data for oral/pharynx, melanoma, and lung are displayed in Table 1. Following the imputation, the expected rate of disease (e ijk ) is calculated. The subsequent statistics and figures are based on the imputed dataset. Figure 1 displays a histogram of incidences per study year segmented by disease. From this, it is easy to see that the incidences of melanoma and lung are increasing over time while oral/pharynx remains steady. This could be part of why the previous multivariate model did not perform well in the later study years. For the 7718 diagnosed cases of oral/pharynx in South Carolina for these study years, there are approximately 168 (range: 19-651) cases per county per year leading to a rate of disease equal to 0.0001. Similarly, there were 20,156 and 44,595 diagnosed cases of melanoma and lung with an average of 438 (range: 25-2726) and 969 (range: 115-3885) cases per county per year leading to rates of disease equal to 0.0003 and 0.0008, respectively. These rates of disease are calculated as the total incidence over the entire study time divided by the total population at risk over the entire study time.   and 969 (range: 115-3885) cases per county per year leading to rates of disease equal to 0.0003 and 0.0008, respectively. These rates of disease are calculated as the total incidence over the entire study time divided by the total population at risk over the entire study time. Standardized incidence rate (SIR) calculations are often a first step in data analysis. The SIRs per county over time for each of the diseases are displayed in Figures 2-4. The SIR was calculated as a ratio of the observed cancer incidences to the expected rates of disease per county [20]. For interpretation purposes, a SIR of one indicates that the observed incidence is equal to that of the expected count. The SIRs for oral/pharynx do present some spatial structure, but are somewhat scattered as well. melanoma has a clear presence in the eastern coastal counties and the rates appear to increase over time within the same spatial pattern. Like oral/pharynx, lung appears to have a spatial structure in its distribution, albeit weak, and, like melanoma, the rates appear to increase over time. Standardized incidence rate (SIR) calculations are often a first step in data analysis. The SIRs per county over time for each of the diseases are displayed in Figures 2-4. The SIR was calculated as a ratio of the observed cancer incidences to the expected rates of disease per county [20]. For interpretation purposes, a SIR of one indicates that the observed incidence is equal to that of the expected count. The SIRs for oral/pharynx do present some spatial structure, but are somewhat scattered as well. melanoma has a clear presence in the eastern coastal counties and the rates appear to increase over time within the same spatial pattern. Like oral/pharynx, lung appears to have a spatial structure in its distribution, albeit weak, and, like melanoma, the rates appear to increase over time.     The predictors for this analysis are demographic and environmental measures obtained from a variety of sources with known oral/pharynx, melanoma, and lung, as well as general oncological associations [4][5][6][21][22][23][24][25][26][27]. The demographic predictors come from the Area Health Resources Files dataset [28] while the environmental predictors come from the National Oceanic and Atmospheric Administration [29], South Carolina Department of Health and Environmental Control (SCDHEC) [30], and the North America Land Data Assimilation System [31]. The predictors for this analysis are demographic and environmental measures obtained from a variety of sources with known oral/pharynx, melanoma, and lung, as well as general oncological associations [4][5][6][21][22][23][24][25][26][27]. The demographic predictors come from the Area Health Resources Files dataset [28] while the environmental predictors come from the National Oceanic and Atmospheric Administration [29], South Carolina Department of Health and Environmental Control (SCDHEC) [30], and the North America Land Data Assimilation System [31].
For these methods, it is important to note which of these variables are spatial, temporal, or, S.T.; and predictors were designated as such based on either their availability or apparent amount of spatial and/or temporal variation. The proportion of persons with health insurance (pHI), median household radon level (radon), and proportion of African American population (pAA) are spatial where the two socioeconomic covariates are census data from the year 2000 and the measure of radon is a county-level average of in-home test kit results analyzed by the SCDHEC laboratory. The single temporal predictor is statewide average annual rainfall. Finally, the ST predictors are average daily sunlight (sun), unemployment rate of those 16 years or older (UER), and the proportion of persons in poverty (pppov). Note that the predictors are standardized for model-fitting purposes. Additionally, some of these variables are correlated, but each do still offer their own attributes for our risk model. For example, rainfall and statewide average sunlight are correlated, but rainfall could also offer information about humidity.

Statistical Methods
Here, we describe the methodology associated with the bivariate and multivariate mixture models utilized in this exploration. These methods are implemented via the R package R2WinBUGS which calls WinBUGS from R [32][33][34][35]. All models reached convergence with two chains for the Markov chain Monte Carlo sampler running for a total burn-in of 45,000 iterations and a sample of 5000. Trace plots of the deviance and some random effect standard deviations, as well as model BUGS code, are included in the supplemental materials.
We are interested in exploring the relationships between the two or three cancers in this bivariate or multivariate setting via a mixture model that was first introduced in Carroll et al. [36] as a form of model selection. These methods were then further improved and tested via a simulation study in Lawson et al. [2]. The general structure of this model for disease in county at time is as follows: For these methods, it is important to note which of these variables are spatial, temporal, or, S.T.; and predictors were designated as such based on either their availability or apparent amount of spatial and/or temporal variation. The proportion of persons with health insurance (pHI), median household radon level (radon), and proportion of African American population (pAA) are spatial where the two socioeconomic covariates are census data from the year 2000 and the measure of radon is a county-level average of in-home test kit results analyzed by the SCDHEC laboratory. The single temporal predictor is statewide average annual rainfall. Finally, the ST predictors are average daily sunlight (sun), unemployment rate of those 16 years or older (UER), and the proportion of persons in poverty (pppov). Note that the predictors are standardized for model-fitting purposes. Additionally, some of these variables are correlated, but each do still offer their own attributes for our risk model. For example, rainfall and statewide average sunlight are correlated, but rainfall could also offer information about humidity.

Statistical Methods
Here, we describe the methodology associated with the bivariate and multivariate mixture models utilized in this exploration. These methods are implemented via the R package R2WinBUGS which calls WinBUGS from R [32][33][34][35]. All models reached convergence with two chains for the Markov chain Monte Carlo sampler running for a total burn-in of 45,000 iterations and a sample of 5000. Trace plots of the deviance and some random effect standard deviations, as well as model BUGS code, are included in the supplemental materials.
We are interested in exploring the relationships between the two or three cancers in this bivariate or multivariate setting via a mixture model that was first introduced in Carroll et al. [36] as a form of model selection. These methods were then further improved and tested via a simulation study in Lawson et al. [2]. The general structure of this model for disease k in county i at time j is as follows: is an intercept, p h ik is a mixture parameter, and M h ijk is a mixture component. In the previous study, the set of mixture parameters and components was given by h = {S, ST}, thus containing a spatial and spatio-temporal parts. However, the temporal effect appeared to be causing model fit issues with oral/pharynx in the multivariate setting as this parameter was shared between diseases. Thus, in this paper we wish to explore the benefit of either keeping the same mixture component structure and allowing the temporal effect to vary between diseases or introducing a third mixture component for temporal effects. Hence, the three alternative mixture definitions for both the univariate and bivariate/multivariate cases are displayed in Table 2. The parameters displayed are such that X i , X j , and X ij are the i th , j th , and ij th values of the spatial, temporal, and ST covariates, respectively, β ∼ N 0, τ −1 β for the general case are the fixed effect parameter estimates, is a correlated spatial random effect following an intrinsic conditional autoregressive (CAR) model, where n i is the number of neighbors for county i, and i ∼ l indicates that the two counties i and l are neighbors is the temporal random effect modeled by a temporal random walk is the uncorrelated ST interaction term, and all precisions are defined such that The random effects included here are typical of ST models such as the commonly applied Knorr-Held model [13,14]. Finally, the mixture parameter is made up of two parameters, z i and a i (univariate) or z ik and a ik (multivariate), that are correlated and uncorrelated in space, respectively. However, when three mixture components and parameters are needed, as in Alternative 2 (Alt2), constraints must be imposed such that the sum of the three mixture parameters is equaled to one. This is accomplished with the following structure: logit q h The two correlated random effects, v i and γ j , are defined such that they are common between diseases when there is no k subscript written. This is how information is shared between diseases of interest in these mixture models. Thus, Alternative 1 (Alt1) allows the temporally-correlated random effect to vary across diseases while the spatially-correlated random effect is shared amongst diseases. In the univariate setting, Alt1 is the same as F2PRED from the previous study [2], and the multivariate version of F2PRED offered the best fit for oral/pharynx. On the other hand, Alt2 has the interpretation that the temporally correlated effect is common between diseases, but the relationship between disease and temporal effect is different depending on the cancer of interest through the addition of p T ik . Finally, Alternative 3 (Alt3) is like Alt1, but it allows for a shared temporal random effect that is scaled by a function which is related to the disease of interest. Alt3 was only applied in the bivariate and multivariate settings as an attempt to further improve model fits by offering even more temporal flexibility. The two functions, leading to Alt3a and Alt3b, are such that: Thus, Alt3a offers a scaling coefficient that differs per disease to relate to the temporal random effect while Alt3b offers a power term that differs per disease for relating to the temporal random effect. These alternatives offer different ways of incorporating more model flexibility relating to the temporal random effect. For notation, a "U," "B," or "M" following Alt1, Alt2, Alt3a, or Alt3b indicates that these models are fit in the univariate (each disease fitted separately), bivariate (only considers oral/pharynx with lung), or multivariate setting.
Note that bivariate and multivariate expressions can be grouped together because the only difference in their structure is that k = 1, 2 for the former and k = 1, 2, 3 for the latter. Table 3 presents the goodness of fit results for oral/pharynx with the models described in Section 2. Supplemental Tables S1-S3 present results for melanoma and lung as well as the overall model fit. The measurements here include Watanabe-Akaike information criterion (WAIC) and mean squared predictive error (MSPE). Both of these estimates indicate a superior model with a lower value. WAIC was proposed by Watanabe et al. as an alternative to the deviance information criterion and proves to be a better option of model assessment for mixture models [39,40]. These results indicate that Alt2B offers the best fit for oral/pharynx in terms of WAIC for all year classifications as well as MSPE for most of the year classifications. When examining WAIC for the multivariate models, there is evidence of improvement over the univariate models when comparing the results associated with Alt2. Further, the MSPE values are very large for the multivariate setting while the WAIC estimates remain fairly stable. This is largely driven by oral/pharynx indicating that prediction is poor, particularly for the final four years of the study period under these modeling scenarios. The MSPE estimates that only consider the first 10 years show that the multivariate models do perform best up to that point. Finally, these estimates demonstrate that Alt1, Alt2, and Alt3 show improvements over F2PRED for oral/pharynx with respect to multivariate model fit and model prediction, respectively.

Results
Supplemental Figure S1 displays the temporal random effect (γ j or γ jk ) estimates from univariate and multivariate fits of Alt1 and Alt2 and the f γ j , ρ k estimates for Alt3aM and Alt3bM. Note that this parameter is shared between all diseases in bivariate and multivariate versions of Alt2, Alt3a, and Alt3b and the figure only displays the estimate associated with oral/pharynx for Alt1B and Alt1M. The estimates from Alt2U, Alt1B, Alt3bB, and Alt3bM appear to be essentially zero for the entire study period. All of the multivariate models except Alt3bM show the estimates increasing over time with a dramatic increase at year 10 (2005) for Alt1M and Alt3aM. This is much alike the jumps that were present in the example from Lawson et al., thus allowing the temporal random effect to be disease specific does not appear to fix that issue. However, it is worth noting that the other diseases' temporal random effect estimates are much larger in magnitude than that of oral/pharynx for Alt1M. Additionally, while this parameter is shared between all diseases in Alt2M and Alt3bM, as well as all of the bivariate fits, the large jump does not appear for those situations.
Supplemental Figures S2 and S3 display the uncorrelated and correlated heterogeneity terms that relate to oral cavity and pharynx cancer. Note here that the correlated random effect is shared between diseases for all bivariate and multivariate models. The pattern for the uncorrelated random effect appears to be similar across all model specifications, however, the variances associated with Alt2U and Alt3bM are much larger than for any of the other models. Alternatively, for Alt2U, the variance for the correlated random effect is smaller and appears to follow a different pattern compared to the other models while this is not the case for Alt3bM. In general, the variance of the spatially-correlated random effect is greater than that of the uncorrelated random effect; however, when only considering Alt2U, the ranges for u i and v i are nearly identical.
Supplemental Figures S4 and S5 display the oral/pharynx related mixture parameter estimates for Alt1, Alt2, and Alt3 in the univariate, bivariate, and multivariate settings as labeled. The results here vary widely for all models. The univariate setting of Alt1 strongly selects the spatial mixture component with mixture parameter estimate very close to 1. This selection indicates that a better fit may arise from only employing the spatial component of Alt1U, and this is the case as the WAIC estimate associated with this refit is 3222.13 with a pD of 28.25. On the other hand, the MSPE is not improved by only using the spatial component. The bivariate model results are all alike and around the 0.5 range indicating that an equal mixture between the spatial and ST components is best. The estimates associated with Alt1M, Alt3aM, and Alt3bM agree and show much more variation, indicating that the central part of the state is slightly better explained by the spatial mixture component while the counties along western and eastern borders are better explained by the ST mixture component. For the Alt2 parameterization, the univariate and bivariate setting appears to prefer an equal mixture of the different components as a large majority of the counties are in the range of 0.3 to 0.4. The multivariate setting of Alt2 appears to indicate some selection relating to each of the mixture components for different counties. Both multivariate settings appear to agree somewhat in that central counties indicate better explanation from the spatial component while the counties on the eastern and western borders indicate a better explanation from the ST component. It could be the case that this relationship is an influence from the more common melanoma and lung that is not necessarily true for oral/pharynx.  1 Values in bold indicate the model with superior fit for that measure and time period. If multiple measures are indicated as superior for the same measure and time period combination, they are statistically equivalent. 2 The bivariate and multivariate results are the goodness of fit estimates related specifically to oral/pharynx. Even though multiple diseases were fit, these measures can be calculated in a way such that they only pertain to the fit of the individual diseases.
Since our goal is, ultimately, to improve the modeling of risk related to oral/pharynx cancer, Figure 5 displays the overall risk (θ ij ) of lung and oral/pharynx associated with the best model, Alt2B. The fixed effect parameter estimates used for this calculation are included in Supplemental Table S4. Only 1996Only , 2002Only , and 2009 are included for brevity. From this figure, it can be deduced that lung has high risk in the northern portion of the state, as well as a lone county of elevated risk in the southern tip. For oral/pharynx cancer, the overall risk is higher in the eastern counties, particularly for some along the borders. This could be some edge effect which occurs in models of this nature. Temporally, the risk for both diseases appears to be increasing slightly over the years represented. In general, the trend in risk appears to be consistent across both space and time.
Since our goal is, ultimately, to improve the modeling of risk related to oral/pharynx cancer, Figure 5 displays the overall risk ( ) of lung and oral/pharynx associated with the best model, Alt2B. The fixed effect parameter estimates used for this calculation are included in Supplemental Table S4. Only 1996Only , 2002Only , and 2009 are included for brevity. From this figure, it can be deduced that lung has high risk in the northern portion of the state, as well as a lone county of elevated risk in the southern tip. For oral/pharynx cancer, the overall risk is higher in the eastern counties, particularly for some along the borders. This could be some edge effect which occurs in models of this nature. Temporally, the risk for both diseases appears to be increasing slightly over the years represented. In general, the trend in risk appears to be consistent across both space and time.

Discussion
The multivariate modeling of diseases offers many benefits; however, issues can arise when models fail to offer the appropriate amount of flexibility or inappropriate diseases are combined. The results above illustrate the importance of choosing the best model, as well as the best set of diseases, to incorporate to gain the desired improvement in fit for the less common disease. In this case, the bivariate modeling combining oral/pharynx with lung furnishes the best fit.
The example from the paper in which these methods are introduced simply removes the data for the years where the multivariate model does not perform well with respect to oral/pharynx. In this exploration, we have successfully produced a slight variation on their methods that allow the entire data to be used. This is important because removing data results in a loss of information, and that is not an ideal solution. Ultimately, the results indicate that bivariate rather than multivariate methods are ideal for producing the best fit associated with oral/pharynx across the entire study time.
As far as interpretation is concerned, these models do offer some difficulties. For the best model, Alt2B, the ideal interpretation of the random component comes from the information provided in the term: ( + ) + + . This estimate represents the total of the confounding adjusted for in the fitted model and is displayed in Supplemental Figure 4. From this estimate, it is clear that the overall risk unexplained by the fixed effects becomes more variable over the study period. Increased risk appears in the northwestern and northeastern corners of the state; these areas are in the mountains and along the coast respectively. Interpretation of the fixed effects is similar in that the measure of interest would be for the particular county and disease of interest. Beyond that, an interpretation of the mixture parameter can be ascertained wherein, for example, if a particular county in Alt2B has = 0.1, = 0.5, and = 0.4, then 10% of the overall variability in the risk

Discussion
The multivariate modeling of diseases offers many benefits; however, issues can arise when models fail to offer the appropriate amount of flexibility or inappropriate diseases are combined. The results above illustrate the importance of choosing the best model, as well as the best set of diseases, to incorporate to gain the desired improvement in fit for the less common disease. In this case, the bivariate modeling combining oral/pharynx with lung furnishes the best fit.
The example from the paper in which these methods are introduced simply removes the data for the years where the multivariate model does not perform well with respect to oral/pharynx. In this exploration, we have successfully produced a slight variation on their methods that allow the entire data to be used. This is important because removing data results in a loss of information, and that is not an ideal solution. Ultimately, the results indicate that bivariate rather than multivariate methods are ideal for producing the best fit associated with oral/pharynx across the entire study time.
As far as interpretation is concerned, these models do offer some difficulties. For the best model, Alt2B, the ideal interpretation of the random component comes from the information provided in the term: p S ik (u ik + v i ) + p T ik γ j + p ST ik φ ijk . This estimate represents the total of the confounding adjusted for in the fitted model and is displayed in . From this estimate, it is clear that the overall risk unexplained by the fixed effects becomes more variable over the study period. Increased risk appears in the northwestern and northeastern corners of the state; these areas are in the mountains and along the coast respectively. Interpretation of the fixed effects is similar in that the measure of interest would be p T ik β T k for the particular county and disease of interest. Beyond that, an interpretation of the mixture parameter can be ascertained wherein, for example, if a particular county in Alt2B has p S ik = 0.1, p T ik = 0.5, and p ST ik = 0.4, then 10% of the overall variability in the risk is explained by spatial effects, 50% is explained by temporal effects, and 40% is explained by ST effects. If the entire state and all diseases have a fairly low estimate for one of the mixture parameters, as with Alt1U for oral/pharynx, it might be of interest to fit the model without that component to determine if it is necessary for explaining the overall variability in the risk of disease.
There are several explanations regarding the poor fit of the multivariate model. The first is that, while melanoma is also a more prevalent disease than oral/pharynx with common risk factors, they may be too different as far as spatial and temporal patterns are concerned to aid in the modeling of oral/pharynx in these data. Additionally, the multivariate models add even more random effects of the same type, and this could be leading to identification issues and, thus, poor fitting models. This is an issue common to all mixture model structures; however, the hierarchical structure and prior dependence, even when uninformative choices are made, that is characteristic of these Bayesian methods minimizes this issue as much as possible. We have assumed non-informative priors in this example, but even stronger prior assumptions could continue to minimize the issue of identifiability. Finally, there could be collinearity between the fixed and random effects. This issue could also lead to poorly-fitting models, as well as inaccurate estimates of the effects. Some, or all, of these explanations could impact the individual estimates presented in this paper, but when considering the overall risk of disease, which is our main goal, the random effects take on the role of representing the confounding variables to potentially produce the optimal fit.

Conclusions
The best model for explaining the overall risk of oral/pharynx is Alt2B. This model includes separate spatial, temporal, and ST mixture components, meaning that it is important to separate the temporal and ST fixed and random effects. Further, this model only combines oral/pharynx with lung, and that suggests that melanoma is not the best disease choice for modeling with oral/pharynx. Alternatively, multivariate models Alt3b and Alt3a offer improved fits for lung and melanoma, respectively.

Supplementary Materials:
The following are available online at www.mdpi.com/1660-4601/14/5/503/s1, Figure S1: γ jk , γ j , and f γ j , ρ k posterior mean estimates for univariate and multivariate fits of Alt1, Alt2, and Alt3 for oral cavity and pharynx cancer, Figure S2: Posterior mean u i estimates for Alt1 and Alt2 in the univariate and multivariate framework, Figure S3: Posterior mean v i estimates for Alt1 and Alt2 as well as univariate and multivariate, Figure S4: Alt1 oral cavity and pharynx cancer posterior mean mixture parameter estimates in the univariate, bivariate, and multivariate settings, Figure S5: Alt2 oral/pharynx posterior mean mixture parameter estimates (p S i , p T i , p ST i , p S ik , p T ik , and p ST ik ) in the univariate, bivariate, and multivariate settings, Figure S6: Sum of the posterior mean random effects times posterior mean mixture parameters (p S ik (u ik + v i ) + p T ik γ j + p ST ik φ ijk ) associated with oral cavity and pharynx cancer for Alt2B, Table S1: Overall goodness of fit results for all cancers, Table S2: Goodness of fit results for melanoma cancer of the skin, Table S3: Goodness of fit results for lung and bronchus cancer, Table S4: Posterior mean fixed effect coefficient estimates for the best fitting model, Alt2B.