1. Introduction
Survival analysis constitutes a fundamental statistical methodology in medical, epidemiological, and biostatistical research, playing a crucial role in analyzing time-to-event data, such as disease recurrence, clinical progression, hospitalization, and mortality. Its utility in identifying prognostic factors, evaluating therapeutic efficacy, and providing empirical evidence on patient outcomes makes it indispensable for evidence-based clinical decision-making [
1,
2,
3]. Among the various survival analysis techniques, the Cox proportional hazards model has become the predominant method, thanks to its interpretability, flexibility, and capability to accommodate covariate effects without specifying the baseline hazard function [
4,
5]. However, a significant limitation of the conventional Cox model is its implicit assumption of independence between event times, an assumption that frequently does not hold true in clinical practice [
6,
7].
In numerous clinical scenarios, such as chronic illnesses, cardiovascular conditions, and cancer, patients often experience sequences of related events that exhibit inherent interdependencies. For example, a hospitalization event can significantly alter the subsequent risk of mortality through factors such as clinical deterioration, treatment complications, or common underlying health determinants [
7,
8,
9]. Neglecting these dependencies in the analysis of multi-state events can substantially bias risk estimates, distort covariate effect interpretations, and lead to erroneous conclusions, ultimately impacting patient management, resource allocation, and prognostic accuracy [
9,
10].
Multi-state models have emerged as powerful statistical tools to explicitly describe and analyze transitions between distinct health states over time. These models facilitate the investigation of complex event histories, allowing researchers to quantify covariate effects on the timing and sequence of clinical outcomes comprehensively [
10,
11]. Despite their versatility, conventional multi-state models often continue to assume independence across transitions, a simplification frequently violated due to latent patient heterogeneity or unmeasured common risk factors influencing multiple events [
12,
13,
14].
Recent methodological developments have aimed to address these limitations by introducing sophisticated approaches that explicitly account for dependencies between multistate transitions. Techniques such as landmarking, joint frailty–multistate modeling, hierarchical models integrating longitudinal biomarkers, and hidden semi-Markov frameworks exemplify these advancements [
12,
14,
15,
16]. Collectively, these approaches highlight the increasing recognition of the necessity to model dependencies robustly and flexibly within multistate contexts.
An especially promising and increasingly employed approach to capturing complex dependence structures in survival analysis is the use of copula functions. Copulas offer considerable flexibility by modeling dependencies separately from marginal distributions, enabling researchers to accurately describe complex association structures among event times [
17,
18,
19]. Originally developed in fields such as finance and hydrology, copulas have gained substantial attention in biostatistics and medical research, where they have demonstrated notable improvements in modeling correlated event times and enhancing predictive accuracy [
20,
21,
22,
23].
Empirical studies in medical research have emphasized the value of copula-based multistate models. For instance, in oncology research, copula-driven approaches have more precisely quantified risks of recurrence and metastasis, outperforming traditional models [
22,
24]. Similarly, in cardiovascular epidemiology, copula-based multistate models have substantially improved predictions of rehospitalization and mortality after myocardial infarction [
25]. These studies underscore the practical importance of incorporating copula models to effectively represent real-world clinical dependencies.
Motivated by these critical methodological advancements and their substantial clinical implications, this article proposes and rigorously evaluates a bivariate copula-based multi-state model for jointly analyzing clinically significant events, hospitalization, and death using data from a large observational registry of COVID-19 patients in Colombia (2021–2022). Utilizing flexible Archimedean copulas, our model explicitly accounts for residual dependencies between events, improving inferential accuracy, risk stratification, and predictive performance.
The article is structured as follows:
Section 2 presents foundational concepts of multistate modeling, including Cox proportional hazards models and copula functions, particularly Archimedean copulas.
Section 3 introduces our proposed joint semi-parametric multistate–copula model, detailing its statistical formulation and inference methodologies.
Section 4 presents a comprehensive simulation study, discussing scenario configurations, data-generating mechanisms, estimation strategies, and extensive results with an interpretative discussion.
Section 5 demonstrates a practical application of the proposed model to COVID-19 patient data, illustrating the clinical and practical utility.
Section 6 summarizes the main findings and conclusions of the study, while
Section 7 discusses limitations and outlines directions for future research.
4. Simulation Study
To validate our two-step IFM estimator for the joint semi-parametric multi-state–copula model introduced in
Section 3, we conducted a Monte Carlo study in which all data were generated under a Clayton copula and then analyzed under four estimation strategies: marginal (independence), correctly specified Clayton, and mis-specified Gumbel and Frank. Two covariates were included to assess inference on their effects.
4.1. Scenario Configuration
We considered a factorial design comprising three factors:
For each of the 18 scenarios defined by the cross-product of these conditions, we generated 1000 independent replicates. The joint survival probability was evaluated at times corresponding to the quantiles , , and
4.2. Data-Generating Mechanism
For each individual, we simulated two covariates:
, representing age in years.
, representing a binary treatment indicator (e.g., treatment vs. control).
This simple design with one continuous covariate drawn from a uniform distribution and one binary Bernoulli covariate is common in methodological simulation studies (e.g., [
32,
33,
34,
35]). It provides a straightforward yet effective way to evaluate the performance of statistical estimators under controlled conditions. Although real datasets often include a larger number of covariates and more complex distributions, our framework naturally extends to higher-dimensional covariate structures. Exploring such scenarios constitutes a natural avenue for future work.
Given these covariates, we simulated the latent event times,
(admission to complication) and
(complication to death), with exponential marginal distributions:
and the baseline hazards and coefficients were fixed as follows:
The dependence between and was induced by generating copula-based pairs, , where denotes the Clayton copula with the specified . Right-censoring times, C, were drawn from a uniform distribution and applied independently.
The observed event times were constructed as follows: individuals were first observed for the transition from state 1 to 2 (or directly to state 3 if earlier), and then, if a complication occurred, for the transition to death or censoring.
4.3. Estimation Procedures
On each simulated dataset, we fitted the following.
Marginal cox (independence): Three separate Cox models for transitions and , each including and . The joint survival was estimated as
Clayton copula model: a two-step IFM approach was applied (
Section 3.4)
Step 1: We fitted the same Cox models to estimate .
Step 2: pseudo-observations were derived from the marginal estimates, and the copula parameter was estimated via maximum likelihood using the uncensored bivariate observations.
Gumbel and Frank copula models: the same two-step IFM procedure was applied, assuming mis-specified dependence structures using Gumbel and Frank copulas, respectively.
The true joint survival function at each time point,
, was calculated as follows:
reflecting the generating Clayton copula.
For each method and scenario we computed, over Monte Carlo replicates, the following performance measures at time t. Let denote the estimate from replicate b and the truth.
Mean squared error (MSE): Coverage: for each replicate,
b, we formed a 95% confidence interval for
using the replicate–specific standard error
(model–based within replicate): Greenwood’s formula for KM estimates and a delta–method variance (via the Breslow baseline) for the Cox–based estimator. To respect the
range, we built intervals on the complementary log–log scale
and back–transformed the following:
where
is the standard error of
(obtained through the delta method from
). The empirical coverage is, then,
4.4. Results
In this subsection, we present and discuss the findings from our Monte Carlo experiments, which were designed to assess the performance of joint survival quantile estimators under varying dependence levels, sample sizes, and censoring proportions. We focus on three key metrics: average bias, mean squared error (MSE), and the empirical coverage of
confidence intervals. To highlight the main patterns, we focus on
Figure 3, which displays the empirical coverage, and
Figure 4, which reports the mean squared error. Both figures compare, under a representative scenario with moderate dependence, the independence estimator against the copula-based estimators (Clayton, Frank, and Gumbel) for the quantiles
,
, and
.
We then examine how these metrics change as dependence increases
, the sample size grows
, and the censoring levels vary (
and
). Full numerical results, including detailed tables of bias, MSE, and coverage for every combination of parameters, are provided in
Appendix A (
Table A1,
Table A2 and
Table A3). This structure allows us to concentrate the main discussion on the most salient findings while ensuring the full transparency and reproducibility of the simulation study.
Across all scenarios, the copula-based estimators consistently outperform the product-limit estimator that assumes marginal independence in terms of empirical coverage, mean squared error (MSE), and bias, and their advantage grows as dependence and censoring increase. The key patterns are as follows:
Empirical coverage.
Results clearly demonstrate that assuming marginal independence (product-limit estimator) significantly compromises empirical coverage, falling substantially below the nominal level in all tested scenarios. Even under weak dependence (), the coverage ranged between and , which is notably poor. Conversely, copula-based estimators (Clayton, Frank, Gumbel) substantially improved coverage, achieving 73–83%, though still below the nominal value. As dependence increased ( and ), coverage with independence dropped drastically to as low as , whereas copula-based methods, particularly Clayton, achieved near-nominal coverage (92–97%) under strong dependence.
Mean squared error (MSE).
Copula-based estimators consistently yielded lower MSE values compared to the independence estimator, especially under moderate and strong dependence scenarios. For instance, at and censoring, the MSE for the quantile , was reduced by approximately 40–50% using copula methods relative to independence. These improvements became more pronounced with increased sample size and reduced censoring. Notably, the Clayton copula consistently provided the lowest MSE values across all tested scenarios.
Bias.
The average bias across all evaluated estimators was consistently negative, indicating a slight but systematic underestimation of the true joint survival quantiles. The absolute bias never exceeded , and it decreased as the sample size increased and censoring decreased.
4.5. Discussion
Our findings align with the existing literature on bivariate survival and copula models, highlighting the severe consequences of ignoring dependence structures, leading to biased estimations and incorrect inference [
19,
36]. The severe under-coverage observed when assuming independence corroborates [
6] assertion regarding the importance of explicitly modeling dependence to ensure accurate joint survival inferences. Similar bias phenomena under dependent censoring have also been reported in the copula literature [
32], who proposed a copula-based approach to survival data with dependent censoring. Additionally, the superior performance of the Clayton copula aligns with prior studies emphasizing its effectiveness in modeling clinical events exhibiting pronounced positive dependence, such as recurrence times in oncology or paired organ failures [
37,
38].
Our results also suggest important practical implications. While assuming independence between marginal distributions severely underestimates joint survival probabilities, thus potentially driving inappropriate clinical decisions and resource misallocation, incorrectly specifying the copula family also introduces estimation bias, though typically less severe. Nonetheless, even when miss-electing the copula family, copula-based estimators consistently outperform independence assumptions, highlighting their robustness and clinical relevance.
Therefore, we strongly recommend employing copula-based inference, particularly the Clayton family, in clinical contexts characterized by considerable positive dependence between event times.
5. Real-Data Application
To illustrate the applicability of the copula-driven multi-state model, we analyzed a large cohort of patients diagnosed with COVID-19 in four Colombian cities between 2021 and 2022. The dataset included sociodemographic, clinical, and vaccination information, as well as records of hospitalization and death. In this section, we first present the baseline characteristics of the study population and the marginal risk estimates obtained using Cox proportional hazards models for each transition within the illness–death framework. We then assess the dependence between hospitalization and death times using several Archimedean copula families, comparing their fit and joint survival estimates. Finally, we discuss the findings in light of the simulation results and the existing literature, emphasizing their clinical and epidemiological implications for the management of COVID-19 in the Colombian context.
The analysis of this multi-city cohort reinforces the necessity of accounting for residual dependence in multi-state survival analyses.
Baseline characteristics (
Table 1) revealed a predominantly young population (57.8% aged 18–44) with high vaccination coverage (80.5%). Nevertheless, the hospitalization rate reached 50%, and the overall mortality rate was 2.4%, reflecting the substantial clinical burden of the pandemic even in a vaccinated population.
Cox regression models (
Table 2) confirmed the strong protective effect of vaccination across all transitions, with particularly marked reductions in the risk of direct mortality (HR = 0.10) and mortality following hospitalization (HR = 0.04). These findings are consistent with evidence from other large-scale studies highlighting the effectiveness of vaccination in reducing severe outcomes [
39,
40,
41]. Older patients (≥65 years) displayed dramatically higher hazards, up to 47-fold for direct death and 42-fold for death after hospitalization, underscoring their vulnerability. Male sex and comorbidities further amplified risk, in line with international literature on COVID-19 risk factors [
42].
For each transition, we assessed the proportional-hazards (PH) assumption using Schoenfeld residual plots with LOESS smoothing and global PH tests (
Figure A1,
Figure A2 and
Figure A3). To check that the marginal Cox models reproduce the observed survival, we contrasted the population-averaged survival implied by the Cox fits with the nonparametric Kaplan–Meier estimator and its 95% Greenwood band (
Figure A4). We further compared transition-specific hazard rate functions (HRFs) on the log scale: the Cox hazard
was obtained by differentiating the Breslow cumulative baseline hazard (and averaging over strata), and it was contrasted with a kernel-based nonparametric estimator (muhaz) (
Figure A5). Finally, total time on test (TTT) curves were used to diagnose the qualitative shape of the hazard over follow-up (
Figure A6): curves below the
line indicate a decreasing hazard, curves above indicate an increasing hazard, and proximity to the diagonal suggests an approximately constant hazard. We report
, the proportion failed at the point of maximum vertical deviation from the diagonal, as a simple summary of departure from constancy.
For the joint model, candidate copulas were compared using log-likelihood, AIC, BIC, and CAIC, and were subjected to multiplier-bootstrap goodness-of-fit tests based on Kolmogorov–Smirnov (KS), Cramér–von Mises (CmV), and Anderson–Darling (AD) statistics, computed on rank pseudo-observations; margins were estimated semiparametrically (
Table 3 and
Table 4).
Proportional hazards (PHs) diagnostics indicated no substantial deviations from the proportional hazards assumption (global tests: Diagnosis → Hospitalization
p = 0.370; Hospitalization → Death
p = 0.178; and Diagnosis → Death
p = 0.097;
Figure A1,
Figure A2 and
Figure A3). Kaplan–Meier curves and the population-averaged Cox survival were nearly indistinguishable, and the Cox curves lay within the KM
band across transitions (
Appendix A,
Figure A4). Hazard-rate comparisons were consistent (
Figure A5): Diagnosis → Hospitalization displayed an early peak, followed by a monotone decline; Hospitalization → Death decreased steadily; Diagnosis → Death remained low with a slight downward trend. TTT plots corroborated these patterns (
Figure A6), indicating predominantly decreasing hazards with the largest deviation from constancy for Diagnosis → Hospitalization (
) and small deviations for Diagnosis → Death (
) and Hospitalization → Death (
). These checks support the use of Cox PH models for the marginal transition intensities in this application.
The copula selection analysis (
Table 3) indicated that the Gumbel copula provided the best overall fit. Selection was based on multiple criteria log-likelihood, Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and Consistent AIC (CAIC), with Gumbel achieving the largest log-likelihood and the smallest information criteria. The implied Kendall’s
was
, indicating strong upper-tail dependence between hospitalization and death. These findings are consistent with prior copula-based survival analyses in medical settings where Gumbel effectively captures upper-tail dependence [
24,
25].
We complemented model selection with rank-based goodness-of-fit tests for the empirical copula using the multiplier bootstrap (
).
Table 4 reports
p-values for the Kolmogorov–Smirnov (KS), Cramér–von Mises (CvM), and Anderson–Darling (AD). Only the Gumbel copula is not rejected (e.g.,
), whereas Clayton and Frank are rejected
).
Beyond the covariate effects, the diagnostic suite (PH checks, KM–Cox agreement, HRF, and TTT) indicates that our Cox components provide a reliable marginal description of each transition. This justifies using them as the margins in the copula framework.
The Kaplan–Meier survival curves (
Figure 5) illustrated distinct survival patterns for each transition, while the joint survival estimates (
Figure 6) revealed a critical finding. Relative to the copula-based estimate, the independence curve lies uniformly lower, indicating systematic underestimation of joint survival when dependence is ignored. The shaded region highlights the pointwise gap between the two curves, quantifying how much joint survival would be understated under the independence assumption across follow-up. This underestimation occurs because neglecting dependence fails to capture the compounding risk when hospitalization and death are correlated. As demonstrated in our simulations, the independence assumption yielded empirical coverage rates as low as 40% under strong dependence, compared with 92–97% when copula models were applied. These results align with previous methodological studies that highlight the risks of disregarding dependence in multi-state data [
6,
19].
From a clinical perspective, this misestimation is particularly concerning. Underestimating joint survival implies that clinicians and policymakers may underestimate the likelihood of patients experiencing the combined burden of hospitalization and death, potentially leading to inadequate risk stratification, delayed interventions, or misallocation of healthcare resources. As emphasized by [
17,
43], such underestimation can substantially distort clinical decision-making and ultimately compromise patient outcomes. In our cohort, the copula-based estimates, particularly those from the Gumbel copula, provided a more accurate representation of survival by capturing the strong positive dependence between transitions and yielding more reliable evidence to guide clinical and epidemiological planning.
In summary, these findings confirm the simulation results and reinforce a critical message: assuming independence in the presence of dependence systematically underestimates joint survival, which can have serious consequences for patient management and health policy. Copula-based multi-state models offer a robust framework to overcome this limitation and should be considered a methodological standard in contexts where sequential clinical events are strongly correlated.
6. Conclusions
This work introduced a bivariate copula–driven multi-state model that extends the conventional illness–death framework by explicitly modeling the dependence between sequential event times. Methodologically, we formulate a joint semiparametric likelihood using Inference Functions for Margins (IFM): Cox proportional hazards models provide covariate-adjusted marginal transition intensities, while an Archimedean copula encodes the dependence structure. The construction is flexible yet tractable, allowing estimation of both marginal and joint survival functions under right-censoring.
Extensive simulation experiments showed that ignoring dependence, as assumed under independence, systematically underestimates joint survival and can lead to severe coverage losses when dependence is moderate to strong. In contrast, copula-based estimators, particularly those from the Gumbel and Clayton families, achieved near-nominal coverage and lower mean squared error, confirming the theoretical robustness of the proposed framework even under partial copula misspecification.
The large Colombian COVID-19 cohort further validated the approach. First, model-fit diagnostics supported the adequacy of the Cox margins: global PH tests showed no material violations; population-averaged Cox survival closely tracked Kaplan–Meier with Greenwood bands; hazard-rate functions (Cox vs. kernel) displayed the expected shapes; and TTT plots indicated predominantly decreasing hazards with the largest deviation from constancy for Diagnosis → Hospitalization. Second, copula selection favored Gumbel by pseudo-log-likelihood and information criteria (AIC/BIC/CAIC), and rank-based bootstrap GOF tests (AD/KS/CvM) failed to reject Gumbel while rejecting Clayton and Frank. Substantively, the estimated upper-tail dependence between hospitalization and death was strong (Kendall’s ), consistent with clinical intuition about severe disease progression.
These results have practical implications. Assuming independence when transitions are correlated can underestimate the joint burden of hospitalization and death, potentially distorting risk stratification, timing of interventions, and resource planning. By combining covariate-adjusted Cox margins with a well-supported copula, our framework yields more reliable joint survival estimates, offering a principled alternative to independence-based approaches in clinical and epidemiological studies.