Next Article in Journal
Stochastic Complexity of Rayleigh and Rician Data with Normalized Maximum Likelihood
Previous Article in Journal
Robust Kibria Estimators for Mitigating Multicollinearity and Outliers in a Linear Regression Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Proportional Hazards Mixture Cure Model for Subgroup Analysis: Inferential Method and an Application to Colon Cancer Data

by
Kai Liu
1,2,*,
Yingwei Peng
3,4 and
Narayanaswamy Balakrishnan
5,6,*
1
School of Statistics and Mathematics, Shanghai Lixin University of Accounting and Finance, Shanghai 201209, China
2
Interdisciplinary Research Institute of Data Science, Shanghai Lixin University of Accounting and Finance, Shanghai 201209, China
3
Department of Public Health Sciences, Queen’s University, Kingston, ON K7L 3N6, Canada
4
Department of Mathematics and Statistics, Queen’s University, Kingston, ON K7L 3N6, Canada
5
Department of Mathematics and Statistics, McMaster University, Hamilton, ON L8S 4K1, Canada
6
Department of Mathematics, Atilim University, Ankara 06830, Turkey
*
Authors to whom correspondence should be addressed.
Submission received: 6 November 2025 / Revised: 19 December 2025 / Accepted: 21 December 2025 / Published: 24 December 2025

Abstract

When determining subgroups with heterogeneous treatment effects in cancer clinical trials, the threshold of a variable that defines subgroups is often pre-determined by physicians based on their experience, and the optimality of the threshold is not well studied, particularly when the mixture cure rate model is considered. We propose a mixture cure model that allows optimal subgroups to be estimated for both the time to event for uncured subjects and the cure status. We develop a smoothed maximum likelihood method for the estimation of model parameters. An extensive simulation study shows that the proposed smoothed maximum likelihood method provides accurate estimates. Finally, the proposed mixture cure model is applied to a colon cancer study to evaluate the potential differences in the treatment effect of levamisole plus fluorouracil therapy versus levamisole alone therapy between younger and older patients. The model suggests that the difference in the treatment effect on the time to cancer recurrence for uncured patients is significant between patients younger than 67 and patients older than 67, and the younger patient group benefits more from the combined therapy than the older patient group.

1. Introduction

With the advancement of modern medical technology, precision medicine has attracted considerable attention in recent years. Subgroup analysis serves as a crucial component in precision medicine. Rothwell [1] summarized the importance of subgroup analysis in randomized controlled trials for new drug development for the following three reasons: (1) The trial results meet the primary objectives, but the drug efficacy may vary significantly across different subgroups. (2) The trial results fail to meet the primary objectives, but the drug demonstrates significant efficacy in certain subgroups. (3) There are clinical concerns regarding the treatment of specific populations, such as the elderly. In a clinical control trial subgroup analysis, the threshold of a variable that defines subgroups is often pre-fixed. For example, the MONARCH plus study, a randomized clinical trial with women with hormone receptor-positive and human epidermal growth factor receptor 2-negative advanced breast cancer [2], investigated heterogeneous treatment effects of abemaciclib plus endocrine therapy on progression-free survival in patients with two subgroups using the Cox proportional hazards (PH) model. The two subgroups are formed by age older or younger than 65 years. The age threshold 65 is a pre-fixed threshold, and its optimality remains unknown. Several studies have investigated methods for detecting the optimal threshold in the context of proportional hazards models [3,4,5,6]. Luo et al. [3] provided a theoretical proof of the asymptotic consistency of the maximum partial likelihood estimator for the unknown threshold. Pons [4] proposed a maximum partial likelihood estimator based on the profile method and established its consistency and asymptotic distribution. Chen et al. [5] developed a novel hierarchical Bayesian method to make statistical inference simultaneously on the threshold and the covariate coefficients. Deng et al. [6] proposed a change-point proportional hazards model for clustered event data, developing a maximum marginal pseudo-partial likelihood estimator for the regression coefficients and the unknown change point, as well as a supremum test based on robust score statistics to test the existence of the change point. Lee et al. [7] also proposed three statistical tests, namely, the maximal score, maximal normalized score, and maximal Wald tests and established their asymptotic properties. Further studies extend the single threshold proportional hazards model to mulitple thresholds [8,9], continous thresholds [10], random thresholds [11], single-index threshold model [12] and change-plane model [13]. More literature on subgroup analysis in hetergenous Cox model can be found in [14,15].
Many diseases and cancers are curable nowadays, and cured patients will never experience the event of interest. For example, the cure rate for patients with resected BRAF V600–mutant stage III melanoma treated by adjuvant dabrafenib plus trametinib is 54% [16], and the cure rate of acute lymphoblastic leukemia in children is 90% [17]. When the cured fraction is present in data, the classical survival models, such as the Cox PH model, are not appropriate, and cure rate models, particularly the mixture cure models, have been used extensively for analyzing survival data with a fraction of cured subjects for decade. Parametric, semiparametric, and nonparametric statistical inferential methods for the mixture cure rate model have been studied extensively in the literature [18,19,20,21].
However, the study on determining the optimal threshold or cutoff value to form subgroups with heterogeneous treatment effects in mixture cure rate models is very limited. Othus et al. [22] studied a subgroup analysis under a parametric mixture cure rate model for the heterogeneous prognostic effect of a variable. That is, they assumed different cure rates and constant hazard rates for two subgroups formed by a covariate with an unknown change-point or threshold. A smoothed likelihood approach based on the observed likelihood function was developed to estimate the parameters in the model. The model and the method were used to analyze the heterogeneous prognostic effect of age among prostate cancer patients. Shamsi et al. [23] further extended this method by considering Bayesian inference. Zhao et al. [24] proposed a semiparametric Cox PH mixture cure rate model that allows a threshold in the regression coefficient and then developed Bayesian inference for the proposed model, and then applied the model to an oropharynx cancer data to illustrate the gender effect changes under different age groups at diagnosis. Wang et al. [25] studied the Cox PH mixture cure model when a covariate effect on the failure time of uncured subjects may change when the value of the covariate exceeds a change-point or threshold, and proposed a nonparametric maximum likelihood estimation method to estimate the model. Lou et al. [26] studied a class of partly linear transformation models within the mixture cure model framework for interval-censored failure time data with change-points.
However, some key limitations remain. Although [22,23] considered heterogeneous effects in different subgroups on the cure rate and the hazard rate, they only focus on the prognostic effect of the subgroups, and they do not consider the heterogeneous treatment effect in different subgroups, which is often called the predictive effect of the subgroups. It is worth noting that both works in [24,25] only considered heterogeneous subgroup effects in the uncured population, while the heterogeneous subgroup effects in the cure probability is not considered. To fill this gap, we propose a mixture cure model that can determine thresholds for a variable to form subgroups with heterogeneous treatment effects on the time to event of interest for uncured subjects and/or on the probability of being cured. The interaction terms given in (2) and (3) model the predictive effect of the subgroups for both the uncured subjects and the cure probability.
The rest of this paper is organized as follows. Section 2 presents the mixture cure model for subgroup analysis and the proposed estimation method. In Section 3.1, an extensive simulation study is conducted to investigate the performance of the proposed estimation method. The proposed model is illustrated with a colon cancer data set in Section 3.2. Finally, discussion and conclusions are presented in Section 4 and Section 5.

2. Materials and Methods

Let T * = Z + ( 1 Z ) T ˜ be the lifetime of a patient, where Z is the cure indicator with Z = 1 if the patient is cured and Z = 0 otherwise, and T ˜ is the lifetime for uncured patients. Let C be the censoring time. Then, the observed lifetime or censoring time is denoted by T = min ( T * , C ) , with a censoring indicator δ = I ( T * C ) where I ( A ) is an indicator function with value 1 if A is true and 0 otherwise. Let W be the treatment variable with W = 1 for a new treatment and W = 0 for a standard treatment, U 1 and U 2 be the two covariates or biomarkers that define subgroups with different treatment effects on T ˜ and Z, respectively, via I ( U 1 > c 1 ) and I ( U 2 > c 2 ) , where c 1 and c 2 are unknown thresholds or cutoff values. Let X 1 and X 2 be covariate vectors to be adjusted in the models for T ˜ and Z, respectively, and define V 1 = ( X 1 , W , U 1 ) and V 2 = ( 1 , X 2 , W , U 2 ) . Then, the mixture cure rate model is given in terms of survival function as
S p o p ( t | V 1 , V 2 ; ϑ 1 , ϑ 2 ) = P ( T * > t | V 1 , V 2 ; ϑ 1 , ϑ 2 ) =                                                                                                           p ( V 2 ; ϑ 2 ) + ( 1 p ( V 2 ; ϑ 2 ) ) S ( t | V 1 ; ϑ 1 ) ,
where
S ( t | V 1 , ϑ 1 ) = P ( T ˜ > t | V 1 ; ϑ 1 ) = S 0 ( t ; λ ) exp ( α 1 X 1 + γ 1 W + θ 1 I ( U 1 > c 1 ) + η 1 W I ( U 1 > c 1 ) ) ;
p ( V 2 , ϑ 2 ) = P ( Z = 1 | V 2 ; ϑ 2 ) = e β 0 + α 2 X 2 + γ 2 W + θ 2 I ( U 2 > c 2 ) + η 2 W I ( U 2 > c 2 ) 1 + e β 0 + α 2 X 2 + γ 2 W + θ 2 I ( U 2 > c 2 ) + η 2 W I ( U 2 > c 2 ) ;
ϑ 1 = ( λ , α 1 , γ 1 , θ 1 , η 1 , c 1 ) and ϑ 2 = ( β 0 , α 2 , γ 2 , θ 2 , η 2 , c 2 ) are the unknown parameter vectors, and S 0 ( t ; λ ) is the baseline survival function with unknown parameter vector λ . We consider the Weibull distribution as the baseline distribution with
S 0 ( t ; λ ) = exp ( λ 2 t λ 1 ) ,
where λ = ( λ 1 , λ 2 ) . To allow a more flexible baseline distribution in the model if the Weibull distribution is not desirable in practice, we also consider a piecewise constant hazard distribution [27] with S 0 ( t ; λ ) = exp 0 t h 0 ( u ; λ ) d u , where
h 0 ( t ; λ ) = λ k for t ( k 1 ) t < t ( k ) , k = 1 , , J ,
λ = ( λ 1 , λ 2 , , λ J ) , and 0 = t ( 0 ) < t ( 1 ) < t ( 2 ) < t ( J ) < max ( t i ) is a partition of J time intervals. The part of the mixture cure model in (2) is often referred to as the latency part, and the part of the model in (3) is referred to as the incidence part.
Consider a sample of n subjects. The observed values of ( T , δ , V 1 , V 2 ) from the sample are denoted as ( t i , δ i , V i 1 , V i 2 ) , i = 1 , 2 , , n . Then, the likelihood function for the unknown parameters ϑ = ( ϑ 1 , ϑ 2 ) is given by
L ( ϑ ) = i = 1 n 1 p ( V i 2 , ϑ 2 ) f ( t i | V i 1 , ϑ 1 ) δ i                                                                                                                       p ( V i 2 , ϑ 2 ) + 1 p ( V i 2 , ϑ 2 ) S ( t i | V i 1 , ϑ 1 ) 1 δ i ,
where f ( · ) is the density function corresponding to S ( · ) . The log-likelihood function for the mixture cure rate model specified in (2) and (3) can be simplified as
l ( ϑ ) = i = 1 n δ i ln h 0 ( t i ; λ ) + r 1 ( ϑ 1 ) Λ 0 ( t i ; λ ) e r 1 ( ϑ 1 )                                                                                                   + i = 1 n ( 1 δ i ) ln [ e r 2 ( ϑ 2 ) + e Λ 0 ( t i ; λ ) e r 1 ( ϑ 1 ) ] i = 1 n ln [ 1 + e r 2 ( ϑ 2 ) ] ,
where r 1 ( ϑ 1 ) = α 1 X i 1 + γ 1 W i + θ 1 I ( U i 1 > c 1 ) + η 1 W i I ( U i 1 > c 1 ) and r 2 ( ϑ 2 ) = α 2 X i 2 + γ 2 W i + θ 2 I ( U i 2 > c 2 ) + η 2 W i I ( U i 2 > c 2 ) .
Since the indicator functions I ( U i 1 > c 1 ) and I ( U i 2 > c 2 ) are discontinuous at U i 1 and U i 2 when treated as a function of c 1 and c 2 , respectively, the log likelihood function is discontinuous, which makes the maximization of the log likelihood function challenging. One approach is to find the profile log-likelihood function p l ( c 1 , c 2 ) = l ( ϑ ^ c 1 , c 2 ) , where ϑ ^ c 1 , c 2 = argmax ϑ { c 1 , c 2 } l ( ϑ ) . Then estimates of c 1 and c 2 , denoted as c ^ 1 and c ^ 2 , can be obtained as argmax c 1 , c 2 p l ( c 1 , c 2 ) , which can also be challenging because of discontinuity in the function p l ( c 1 , c 2 ) , and a grid search method is often used. With c ^ 1 and c ^ 2 , an estimate of ϑ { c 1 , c 2 } can be obtained as ϑ ^ c ^ 1 , c ^ 2 . This estimation method will be referred to as the PMLE method. Due to the non-smoothness of the log-likelihood function, the standard errors in this method cannot be obtained under the standard likelihood theory, and the bootstrap method may be used instead.
In this work, we propose to use kernel smoothing functions to approximate the indicator function [12,28] so that the resulting log likelihood function is a smooth function. In particular, we consider using either the normal kernel function g ( x / h ) = Φ x / h or the logistic kernel function g ( x / h ) = exp ( x / h ) / [ 1 + exp ( x / h ) ] to approximate I ( x ) , where Φ is the cumulative distribution function of the standard normal distribution and h is a bandwidth. It is obvious that as h 0 , if x > 0 , g ( x / h ) 1 , otherwise g ( x / h ) 0 . Thus, we have I ( U i 1 > c 1 ) g U i 1 > c 1 h 1 and I ( U i 2 > c 2 ) g U i 2 > c 2 h 2 as h 1 0 and h 2 0 . Using these approximations, we can obtain a smoothed log-likelihood function as follows:
s l ( ϑ ) = i = 1 n δ i ln h 0 ( t i ; λ ) + r ˜ 1 ( ϑ 1 ) Λ 0 ( t i ; λ ) e r ˜ 1 ( ϑ 1 )                                                                                                   + i = 1 n ( 1 δ i ) ln [ e r ˜ 2 ( ϑ 2 ) + e Λ 0 ( t i ; λ ) e r ˜ 1 ( ϑ 1 ) i = 1 n ln [ 1 + e r ˜ 2 ( ϑ 2 ) ] ,
where r ˜ 1 ( ϑ 1 ) = α 1 X i 1 + γ 1 W i + θ 1 g ( U i 1 c 1 h 1 ) + η 1 W i g ( U i 1 c 1 h 1 ) and r ˜ 2 ( ϑ 2 ) = α 2 X i 2 + γ 2 W i + θ 2 g ( U i 2 c 2 h 2 ) + η 2 W i g ( U i 2 c 2 h 2 ) . A derivative-based method, such as the Newton-Raphson method, can be used to maximize the smoothed log-likelihood function to obtain the estimate of ϑ . The Newton-Raphson method requires
U ( ϑ ) = s l ( ϑ ) ϑ , I ( ϑ ) = 2 s l ( ϑ ) ϑ ϑ ,
and the iterative process
ϑ ( k + 1 ) = ϑ ( k ) + I 1 ( ϑ ( k ) ) U ( ϑ ( k ) ) ,
where ϑ ( k ) is an estimate of ϑ at the kth iteration. When the iterative process converges, ϑ ( k + 1 ) is the maximum smoothed likelihood estimate of ϑ and is denoted as ϑ ^ . This estimation method will be referred to as the SMLE method. The standard error of the estimates can be obtained either from I 1 ( ϑ ^ ) or using the bootstrap method. We find that the latter is numerically more stable than the former.
We can also use the profile method to estimate the parameters with the smoothed likelihood function. The profile log-likelihood function is defined as p s l ( c 1 , c 2 ) = s l ( ϑ ^ c 1 , c 2 ) , where ϑ ^ c 1 , c 2 = argmax ϑ { c 1 , c 2 } s l ( ϑ ) . Then estimates of c 1 and c 2 , denoted as c ^ 1 and c ^ 2 , can be obtained as argmax c 1 , c 2 p s l ( c 1 , c 2 ) using a grid search method, which can make the computation considerably more intensive than the SMLE method. With c ^ 1 and c ^ 2 , an estimate of ϑ { c 1 , c 2 } can be obtained as ϑ ^ c ^ 1 , c ^ 2 . This estimation method will be referred to as the PSMLE method. The bootstrap method may be used to estimate the standard errors of the estimates.
Both SMLE and PSMLE methods rely on prespecified bandwidths h 1 and h 2 . Although a smaller value of the bandwidths implies a better approximation of the likelihood function (5) by (6), too small values of the bandwidths can cause numerical errors. We suggest that h j be set to σ ^ j n 1 / 3 , where σ ^ j denotes the sample standard deviation of U i j ’s, j = 1 , 2. Our numerical study shows that this bandwidth selection method works well.

3. Results

3.1. Empirical Study

We conduct a simulation study to investigate the performance of the proposed method. Simulated data are generated from the mixture cure rate model (1) with
S ( t | V 1 , ϑ 1 ) = e t · exp ( α 11 X + γ 1 W + θ 1 I ( U > c 1 ) + η 1 W I ( U > c 1 ) ) , p ( V 2 , ϑ 2 ) = exp ( β 0 + α 21 X + γ 2 W + θ 2 I ( U > c 2 ) + η 2 W I ( U > c 2 ) ) 1 + exp ( β 0 + α 21 X + γ 2 W + θ 2 I ( U > c 2 ) + η 2 W I ( U > c 2 ) ) ,
where X is simulated from the uniform distribution over [ 0 , 10 ] , W is simulated from the Bernoulli distribution with probability 0.5, U 1 = U 2 = U is simulated from the uniform distribution over [ 1 , 1 ] or standard normal distribution, α 11 = 0.1 , γ 1 = 0.69 , θ 1 = 0.4 , η 1 = 0.4 , β 0 = 0.5 , α 21 = 0.1 , γ 2 = 1.1 , θ 2 = 0.5 , η 2 = 0.5 , c 1 = 0 , 0.5, c 2 = 0.5 , 0, 0.5. The censoring time is simulated from the uniform distribution over [ 0 , 20 ] . The average cure rate and the censoring rates are 0.31 and 0.43, respectively. The sample size is set to n = 200 , 500, 1000, 2000.
Under each of the settings, 1000 samples were simulated and fitted with the proposed SMLE, PSMLE and PMLE methods. The baseline survival function S 0 ( t ) is assumed to be from the Weibull distribution, and the normal kernel function g ( x / h ) = Φ x / h is used in the methods. The bandwidths in the SMLE and PSMLE methods are determined using the method discussed in Section 2 if not specified. All the computing work was carried out on the clusters of the Digital Research Alliance of Canada.
Table 1, Table 2, Table 3 and Table 4 present biases, standard deviations (SD), mean squared errors (MSE), and coverage probabilities (CP) obtained from the three estimation methods for simulated data with ( c 1 , c 2 ) = ( 0 , 0 ) and U generated from the uniform distribution. The biases, SD, MSE and CP are calculated according to (7) and (8).
b i a s = i = 1 n ϑ ^ i n ϑ , S D = 1 n 1 i = 1 n ϑ ^ i i = 1 n ϑ ^ i n 2 , M S E = b i a s 2 + S D 2 ,
C P = i = 1 n I ( ϑ ( ϑ ^ i 1.96 s d ( ϑ ^ i ) ) , ϑ ^ i + 1.96 s d ( ϑ ^ i ) ) n .
The estimates of most parameters have small biases and MSEs under all considered sample sizes and estimation methods except for θ 2 and η 2 under the small sample size of 200. When the sample size increases, the biases, SDs, and MSEs of all the estimates decrease. The SMLE method outperforms the PSMLE method as the SMLE method possesses smaller SDs and MSEs, especially in the cases with smaller sample sizes. The bias, SD and MSE of SMLE are slightly smaller in some parameters and larger in some other parameters than those of PMLE, indicating that the two estimation methods are comparable in terms of accuracy and precision. But the computational time of PMLE is much longer. The CPs of SMLE are closer to the 95% nominal level than the other three methods. The CPs of η 2 and c 2 are away from the nominal level for small sample sizes, but they are closer to the nominal level as the sample size increases.
Table 4 presents the simulation results under a large sample size ( n = 2000 ), demonstrating the asymptotic properties of the proposed estimation method. The estimates of SMLE for the subgroup thresholds c 1 and c 2 show minimal bias and standard deviation, confirming that the method can accurately and precisely recover the true subgroup boundaries when sufficient data are available. Even with n = 2000 , the coverage probability for c 2 remains slightly below the nominal 95% level. This suggests that inference on the subgroup-defining thresholds, particularly when the two subgroups are not well-separated, remains the most challenging aspect of the inference. The coverage for the interaction effect is slightly conservative. This is likely due to the combined uncertainty from estimating both the threshold and the interaction coefficient.
Table 5 presents the biases, SDs, MSEs, and CPs of the estimates from the SMLE method using both the normal and logistic kernel functions for simulated data with ( c 1 , c 2 ) = ( 0.5 ,   0 ) and U generated from the standard normal distribution. The results indicate that the choice of kernel has a meaningful impact on the estimation of η 2 , the treatment-by-subgroup interaction effect on the cure probability. The normal kernel yields a CP of 0.91 and an MSE of 1.05, while the logistic kernel yields a CP of 0.88 and an MSE of 0.88. This indicates that the normal kernel provides better-calibrated uncertainty intervals for this key parameter. The incidence part of the model involves estimating cure status, which may be affacted by the choice of kernel function. The normal kernel’s faster tail decay provides a more localized approximation while the heavier tails of the logistic kernel may over-smooth the boundary, which appears to be crucial for accurately capturing the discontinuity in the cure probability across the subgroup boundary. For all other parameters, performance is nearly identical between the two kernels. Consequently, we recommend the normal kernel as the default specification for the proposed method based on its robust performance for the most sensitive parameter.
The results for other values ( c 1 , c 2 ) and sample sizes are similar and therefore not presented here.
Table 6 presents the biases, SDs, MSEs, and CPs of the estimates from the SMLE method with c 1 = 0.5 and c 2 = 0.5 , 0, 0.5, and U generated from the standard normal distribution. It investigates a fundamental practical concern that how the method’s performance depends on the location of the true subgroup threshold c 2 . This is crucial for applications where a treatment-sensitive subgroup may be a large majority ( c 2 low, e.g., −0.5) or a small minority ( c 2 high, e.g., 0.5) of the population. The biases, SDs, and MSEs of the estimates of θ 2 from the SMLE method increase and the CPs move away from the 95% nominal level as c 2 goes from 0.5 to 0.5 because of fewer data in U > c 2 . This demonstrates the method’s reliance on having an adequate number of subjects within the identified subgroup for reliable inference on probablity of being cured. The biases, SDs, and MSEs of η 2 from the SMLE method do not strictly increase as c 2 increases, which may be because η 2 also depends on W. The incidence model’s logistic link function may make the cure probability estimate more robust to moderate changes in subgroup size when other predictive information is present. However, the CP for η 2 remains below the nominal level across all c 2 values, underscoring that inference for the cure probability interaction is challenging and requires careful interpretation, regardless of subgroup size.
We also investigated the dependence of the performance of the SMLE method on the choice of the bandwidth h. Table 7 and Table 8 present the biases, SDs, MSEs, and CPs of the estimates from the SMLE method with the bandwidth set to 0.5, 0.2, 0.1, 0.01, 0.001, σ ^ n 1 / 3 0.126 , n = 500 , the normal kernel function, and U generated from the standard normal distribution. The selection of the bandwidth h governs a critical bias-variance trade-off in the smoothed likelihood estimation. For large bandwidth ( h = 0.5 ), we see substantial bias in c 2 (0.19) and θ 2 , and the MSEs are relatively high. The coverage probabilities are below nominal for c 2 and θ 2 . As the bandwidth decreases, the biases for c 2 and θ 2 decrease, and the MSEs also decrease. However, the coverage for c 2 remains below nominal. For very small bandwidths, the biases become very small and the SDs decrease, but the coverage probabilities for the threshold c 2 drop dramatically to around 0.8. This indicates that the confidence intervals are too narrow, likely because the bootstrap variance estimation fails when the likelihood is too sharp. The data-driven bandwidth ( h = h a t s i g m a n 1 / 3 ) strikes a balance. It gives low bias for c 1 and moderate bias for c 2 , and the coverage probabilities are much better for c 1 and c 2 . For θ 2 and η 2 , the biases are moderate and coverage is around 0.91 and 0.88 respectively. For all the other parameters, the SDs and MSEs of the estimates from the SMLE method decrease as the value of h decreases, and the results from h = σ ^ n 1 / 3 perform well. Therefore, the data-driven bandwidth is recommended as the default, robust specification to ensure both accuracy and valid inference.

3.2. Analysis of Colon Cancer Data

We analyzed a colon cancer data set with the proposed Weibull PH mixture cure model and the piecewise PH mixture cure model for subgroup analysis. We also fitted the data set with the regular Weibull PH mixture cure model for comparison. The data set comes from a clinical trial study designed to evaluate the benefit of the combination of fluorouracil plus levamisole as an adjuvant therapy after resection of stage III colon carcinoma compared to levamisole alone therapy [29], which is available in survival package in R. The data set contains 614 patients, among them 310 patients were treated with levamisole (Lev) alone, and 304 patients were treated with the combination of levamisole plus fluorouracil (Lev+5FU). The maximum follow-up time is 3329 days, and the outcome of interest is the days to recurrence. There are 119 and 172 recurrence events in the Lev+5FU therapy and Lev therapy, respectively, and the respective censoring rates are 60.86% and 44.52%. The Kaplan–Meier survival curves for the two treatments are presented in Figure 1. The curves level off at survival probabilities of 59.94% and 43.29%, respectively, after 2500 days of follow-up, indicating the presence of cured subjects in the sample.
We consider subgroups defined by I ( age > c 1 ) in the latency part and I ( age > c 2 ) in the incidence part, and fit the data set with the proposed Weibull mixture cure rate models to determine the optimal values of c 1 and c 2 and treatment differences in the subgroups. The results are presented in Table 9.
Under the Weibull mixture cure model with age thresholds, the estimated optimal threshold for age is 67 for the time to recurrence among uncured patients, and 66 for the cure probability from the Weibull PH mixture cure model. However, the interaction effect between treatment and subgroup is significant only for latency. It implies that the treatment effects are significantly different in the time to recurrence for uncured patients between age younger than 67 and older than 67. For uncured patients younger than 67, the hazard ratio between the combined therapy and levamisole alone therapy is 0.23, while for patients older than 67, the hazard ratio increases to 0.7 and is statistically significant, indicating that although the combined treatment improves the survival of uncured patients, patients younger than 67 benefit more than patients older than 67. More specific, the hazard ratio of 0.23 indicates 77% reduction in risk of recurrence if the uncured patient younger than 67 takes combined treatment instead of single treatment, and the hazard ratio of 0.7 indicates 30% reduction in risk of recurrence for the uncured patient older than 67 if the patient takes the combined treatment.
The effects in the incidence part are not statistically significant, implying that none of the treatment, age, or their interaction has any significant effects on the probability of being cured.
The data set is also fitted with the regular Weibull proportional hazard mixture cure model, where age is not dichotomized using a threshold in the main and the interaction effects. The results are presented in Table 9, and none of the effects in the model are statistically significant.

4. Discussion

In this paper, we proposed a PH mixture cure model for subgroup analysis, which can determine the optimal cutoff values or thresholds for identifying treatment-sensitive subgroups in the latency and incidence parts. A smoothed maximum likelihood estimation method is developed for estimating the unknown parameters and the unknown cutoff values or thresholds by using kernel functions to approximate the indicator functions in the likelihood function. An extensive simulation study is performed to evaluate the performance of the proposed model and the estimation method, and the results demonstrate that the proposed model and estimation method can identify subgroups and estimate heterogeneous treatment effects accurately. Finally, we applied the proposed model and method to a data set from a clinical trial study of colon cancer patients treated either by a combination of levamisole plus fluorouracil or by levamisole alone and identified age subgroups and significant heterogeneous treatment effects. In comparison, the regular mixture cure model does not show evidence to support the heterogeneous treatment effects.
This work directly addresses specific methodological gaps identified in the literature on mixture cure models for subgroup analysis. As noted in the introduction, existing approaches have either focused on prognostic subgroup effects without modeling predictive subgroup effects [22,23], or have considered heterogeneous treatment effects in different subgroups only in the latency part for the uncured population, omitting the cure probability [24,25]. The methodology proposed here advances beyond these prior works by simultaneously modeling predictive treatment-by-subgroup interactions as specified in (2) and (3), and allowing these heterogeneous effects to be identified in both the incidence and latency components of the model. Therefore, our primary contribution is a flexible, likelihood-based tool that identifies subgroups with differential treatment responses across the entire cure model structure, thereby enabling more precise inference in settings where a treatment may affect whether a subject is cured, how long uncured subjects survive, or both. Consequently, our work fills a gap by offering a principled, likelihood-based tool to investigate a broader and more nuanced range of precision medicine hypotheses than previously possible with standard or partially extended cure models.
Despite the promising results, several limitations should be considered. First, the current model is designed for a single covariate or biomarker to define subgroups. In practice, it is very likely that subgroups may be defined by multiple covariates or biomarkers. Second, our reliance on kernel smoothing to approximate the indicator function introduces a dependence on two key smoothing parameters the bandwidth and the kernel smoothing function. Although smaller bandwidth value provide better estimates, too small bandwidth values may lead to numerical instability. Our simulation study was limited to comparing the logistic and normal kernels. However, the broader class of kernel functions (e.g., Epanechnikov, uniform, triangular) remains unexplored in this context, and their differing properties could potentially affect the smoothness of the likelihood and the finite-sample precision of the threshold estimates. Third, while obtaining point estimates is feasible, the computational burden becomes substantial when performing bootstrap resampling to estimate standard errors and construct confidence intervals for the threshold and regression coefficients withohut access to high-performance computing. Furthermore, the model’s performance is contingent on the specification of the baseline hazard function for the uncured population. While we considered parametric (Weibull) and semi-parametric (piecewise constant) forms, each carries limitations. The Weibull assumption may be violated if the true baseline hazard does not follow its specific monotonic form, leading to biased estimates. Conversely, the more flexible piecewise constant baseline introduces challenge of selecting the number and locations of knots, which can be subjective and influence the fitted survival structure. Finally, the model inherits the standard proportional hazards assumption for the latency part, and violations could affect the accuracy of subgroup-specific hazard ratios.

5. Conclusions

We have developed a novel proportional hazards mixture cure model tailored for subgroup identification within survival data with a cure fraction. The primary contribution of this work is a unified framework that simultaneously identifies optimal thresholds for defining subgroups and estimates differential treatment effects on both the cure probability and the survival of uncured patients, providing a valuable statistical tool for advancing precision medicine research.
This work naturally leads to several promising directions for future research. It will be of interest to generalize the work developed here to multiple covariates. We are currently working on this problem and will report the findings in a future paper. Beyond this, further extension could be explored, such as developing a nonparametric mixture cure model for subgroup analysis that relaxes the parametric or semiparametric assumptions on the baseline hazard, or considering alternative latency formulations like the accelerated failure time, proportional odds, or accelerated hazards models. Extending the methodology to non-mixture cure models for subgroup identification represents another valuable avenue. A comparison with a Bayesian estimation framework, which would facilitate the incorporation of prior information, is another important direction. Furthermore, the performance of treatment-sensitive subgroup identification, especially under varying degrees of separation between subgroups, merits systematic evaluation. In addition, the current evaluation focuses on sample sizes appropriate for the model’s complexity. Future work could systematically explore the minimal sample size required for stable performance.

Author Contributions

Conceptualization, Y.P.; methodology, K.L. and Y.P.; software, K.L.; formal analysis, K.L. and Y.P.; data curation, K.L.; writing—original draft preparation, K.L.; writing—review and editing, Y.P. and N.B.; supervision, Y.P. and N.B.; project administration, K.L.; funding acquisition, K.L., Y.P., and N.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China—Young Scientists Fund grant number 12201410 for the first author and Natural Sciences and Engineering Research Council of Canada for the second and third authors.

Institutional Review Board Statement

The original clinical trial from which the dataset was derived was conducted in accordance with the Declaration of Helsinki and was approved by the Institutional Review Boards of all participating institutions within the North Central Cancer Treatment Group, the Southwest Oncology Group, or the Eastern Cooperative Oncology Group [29].

Informed Consent Statement

Informed consent for participation was obtained from all subjects involved in the study as reported in the primary publication [29].

Data Availability Statement

The data that support the findings of this study are available in R survival 3.3-1 package.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
PHProportional hazards
SMLESmoothed maximum likelihood estimation
PSMLEProfile smoothed maximum likelihood estimation
SDStandard deviation
MSEMean square error
CPCoverage probability
LevLevamisole
Lev+5FULevamisole plus fluorouracil

References

  1. Rothwell, P.M. Subgroup analysis in randomised controlled trials: Importance, indications, and interpretation. Lancet 2005, 365, 176–186. [Google Scholar] [CrossRef]
  2. Zhang, Q.Y.; Sun, T.; Yin, Y.M.; Li, H.P.; Yan, M.; Tong, Z.S.; Oppermann, C.P.; Liu, Y.P.; Costa, R.; Li, M.; et al. MONARCH plus: Abemaciclib plus endocrine therapy in women with HR+/HER2– advanced breast cancer: The multinational randomized phase III study. Ther. Adv. Med. Oncol. 2020, 22, 1758835920963925. [Google Scholar] [CrossRef]
  3. Luo, X.; Boyett, J.M. Estimations of a threshold parameter in cox regression. Commun.-Stat.-Theory Methods 1997, 26, 2329–2346. [Google Scholar]
  4. Pons, O. Estimation in a Cox regression model with a change-point to a threshold in a covariate. Ann. Stat. 2003, 31, 442–463. [Google Scholar] [CrossRef]
  5. Chen, B.E.; Jiang, W.; Tu, D. A hierarchical Bayes model for biomarker subset effects in clinical trials. Comput. Stat. Data Anal. 2014, 71, 324–334. [Google Scholar] [CrossRef]
  6. Deng, Y.; Zeng, D.; Zhao, J.; Cai, J. Proportional hazards model with a change point for clustered event data. Biometrics 2017, 73, 835–845. [Google Scholar] [CrossRef]
  7. Lee, C.Y.; Chen, X.; Lam, K.F. Testing for change-point in the covariate effects based on the Cox regression model. Stat. Med. 2020, 39, 1473–1488. [Google Scholar] [CrossRef] [PubMed]
  8. Jensen, U.; Lütkebohmert, C. A Cox-type regression model with change-points in the covariates. Lifetime Data Anal. 2008, 14, 267–285. [Google Scholar] [CrossRef]
  9. Lee, C.Y.; Lam, K.F. Survival analysis with change-points in covariate effects. Stat. Methods Med. Res. 2020, 29, 3235–3248. [Google Scholar] [CrossRef]
  10. Liu, S.S.; Chen, B.E. Continuous threshold models with two-way interactions in survival analysis. Can. J. Stat. 2020, 48, 751–772. [Google Scholar]
  11. Chen, X.; Ping, Y.; Sun, J. Efficient estimation of Cox model with random change point. Stat. Med. 2024, 43, 1213–1226. [Google Scholar] [CrossRef] [PubMed]
  12. He, Y.; Lin, H.; Tu, D. A single-index threshold Cox proportional hazard model for identifying a treatment-sensitive subset based on multiple biomarkers. Stat. Med. 2018, 37, 3267–3279. [Google Scholar] [CrossRef] [PubMed]
  13. Wei, S.; Kosorok, M.R. The change-plane Cox model. Biometrika 2018, 105, 891–903. [Google Scholar] [CrossRef]
  14. Hu, X.; Huang, J.; Liu, L.; Sun, D.; Zhao, X. Subgroup analysis in the heterogeneous Cox model. Stat. Med. 2021, 40, 739–757. [Google Scholar] [CrossRef]
  15. León, L.F.; Jemielita, T.; Guo, Z.; Marceau West, R.; Anderson, K.M. Exploratory subgroup identification in the heterogeneous Cox model: A relatively simple procedure. Stat. Med. 2024, 43, 3921–3942. [Google Scholar] [CrossRef]
  16. Hauschild, A.; Dummer, R.; Schadendorf, D.; Santinami, M.; Atkinson, V.; Mandalà, M.; Chiarion-Sileni, V.; Larkin, J.; Nyakas, M.; Dutriaux, C.; et al. Longer follow-up confirms relapse-free survival benefit with adjuvant dabrafenib plus trametinib in patients with resected BRAF V600–mutant stage III melanoma. J. Clin. Oncol. 2018, 36, 3441–3449. [Google Scholar] [CrossRef]
  17. Iacobucci, I.; Mullighan, C.G. Genetic basis of acute lymphoblastic leukemia. J. Clin. Oncol. 2017, 35, 975–983. [Google Scholar] [CrossRef]
  18. Peng, Y.; Yu, B. Cure Models: Methods, Applications, and Implementation; Chapman and Hall/CRC: Boca Raton, FL, USA, 2021. [Google Scholar]
  19. Peng, Y.; Dear, K.B. A nonparametric mixture model for cure rate estimation. Biometrics 2000, 56, 237–243. [Google Scholar] [CrossRef]
  20. Tawiah, R.; McLachlan, G.J.; Ng, S.K. A bivariate joint frailty model with mixture framework for survival analysis of recurrent events with dependent censoring and cure fraction. Biometrics 2020, 76, 753–766. [Google Scholar] [CrossRef]
  21. Müller, U.U.; Van Keilegom, I. Goodness-of-fit tests for the cure rate in a mixture cure model. Biometrika 2019, 106, 211–227. [Google Scholar] [CrossRef]
  22. Othus, M.; Li, Y.; Tiwari, R. Change point-cure models with application to estimating the change-point effect of age of diagnosis among prostate cancer patients. J. Appl. Stat. 2012, 39, 901–911. [Google Scholar] [CrossRef] [PubMed]
  23. Shamsi, F.; Baghestani, A.R.; Binesh, F. Bayesian approach for cure models with a change-point based on covariate threshold: Application to breast cancer data. J. Biopharm. Stat. 2020, 30, 219–230. [Google Scholar] [CrossRef]
  24. Zhao, L.; Feng, D.; Bellile, E.L.; Taylor, J.M. Bayesian random threshold estimation in a Cox proportional hazards cure model. Stat. Med. 2014, 33, 650–661. [Google Scholar] [CrossRef]
  25. Wang, B.; Li, J.; Wang, X. Change point detection in Cox proportional hazards mixture cure model. Stat. Methods Med. Res. 2021, 30, 440–457. [Google Scholar] [CrossRef] [PubMed]
  26. Lou, Y.; Du, M.; Song, X. Regression analysis of interval-censored failure time data with change points and a cured subgroup. Biometrics 2025, 81, ujaf100. [Google Scholar] [CrossRef] [PubMed]
  27. Balakrishnan, N.; Liu, K. Semi-parametric likelihood inference for Birnbaum-Saunders frailty model. REVSTAT-Stat. J. 2018, 16, 232–256. [Google Scholar]
  28. Brown, B.M.; Wang, Y.G. Standard errors and covariance matrices for smoothed rank estimators. Biometrika 2005, 92, 149–158. [Google Scholar] [CrossRef]
  29. Moertel, C.G.; Fleming, T.R.; Macdonald, J.S.; Haller, D.G.; Laurie, J.A.; Tangen, C.M.; Ungerleider, J.S.; Emerson, W.A.; Tormey, D.C.; Glick, J.H.; et al. Fluorouracil plus levamisole as effective adjuvant therapy after resection of stage III colon carcinoma: A final report. Ann. Intern. Med. 1995, 122, 321–326. [Google Scholar] [CrossRef]
Figure 1. Kaplan–Meier curves by treatment groups for the colon cancer data.
Figure 1. Kaplan–Meier curves by treatment groups for the colon cancer data.
Stats 09 00001 g001
Table 1. Results from simulation studies with the normal kernel function in the SMLE and PSMLE method, ( c 1 , c 2 ) = ( 0 , 0 ) , and n = 200 .
Table 1. Results from simulation studies with the normal kernel function in the SMLE and PSMLE method, ( c 1 , c 2 ) = ( 0 , 0 ) , and n = 200 .
SMLEPSMLEPMLE
ϑ Bias SD MSE CP Bias SD MSE CP Bias SD MSE CP
λ 1 0.040.090.010.960.040.090.010.960.040.090.010.96
λ 2 0.030.320.100.930.030.400.170.930.020.320.110.93
γ 1 0.000.380.140.96−0.000.400.160.97−0.000.370.130.97
γ 2 −0.040.750.560.98−0.030.870.760.99−0.030.750.560.99
θ 1 −0.060.420.180.97−0.040.460.210.97−0.050.430.190.97
θ 2 −0.170.680.490.99−0.151.011.050.99−0.280.840.780.99
η 1 −0.020.600.360.97−0.040.600.370.98−0.030.630.400.98
η 2 −0.191.311.751.00−0.281.532.411.00−0.311.492.321.00
c 1 0.010.220.050.900.020.330.110.850.020.330.110.85
c 2 0.030.320.100.860.020.480.230.78−0.020.480.230.80
α 11 −0.000.050.000.96−0.000.050.000.96−0.000.050.000.96
α 21 0.000.080.010.970.000.080.010.970.000.080.010.98
β 0 0.020.550.300.990.020.770.600.990.080.510.270.99
Table 2. Results from simulation studies with the normal kernel function in the SMLE and PSMLE method, ( c 1 , c 2 ) = ( 0 , 0 ) , and n = 500 .
Table 2. Results from simulation studies with the normal kernel function in the SMLE and PSMLE method, ( c 1 , c 2 ) = ( 0 , 0 ) , and n = 500 .
SMLEPSMLEPMLE
ϑ Bias SD MSE CP Bias SD MSE CP Bias SD MSE CP
λ 1 0.020.050.000.950.010.050.000.950.020.050.000.96
λ 2 0.010.180.030.950.010.190.030.940.010.180.030.94
γ 1 0.030.220.050.950.020.210.050.960.010.210.050.95
γ 2 −0.010.360.130.97−0.020.420.170.98−0.020.350.120.98
θ 1 −0.020.230.050.95−0.020.220.050.96−0.010.210.040.96
θ 2 −0.070.330.120.97−0.080.390.160.97−0.090.290.090.97
η 1 −0.040.330.110.95−0.020.310.100.96−0.010.290.090.96
η 2 −0.110.870.760.99−0.110.960.931.00−0.110.910.840.99
c 1 −0.000.120.010.93−0.000.150.020.94−0.010.160.030.94
c 2 0.030.180.030.880.030.320.100.890.020.320.100.89
α 11 −0.000.030.000.97−0.000.030.000.96−0.000.030.000.96
α 21 0.000.050.000.950.000.050.000.960.000.050.000.96
β 0 0.020.310.100.960.030.330.110.970.030.300.090.96
Table 3. Results from simulation studies with the normal kernel function in the SMLE and PSMLE method, ( c 1 , c 2 ) = ( 0 , 0 ) , and n = 1000 .
Table 3. Results from simulation studies with the normal kernel function in the SMLE and PSMLE method, ( c 1 , c 2 ) = ( 0 , 0 ) , and n = 1000 .
SMLEPSMLEPMLE
ϑ Bias SD MSE CP Bias SD MSE CP Bias SD MSE CP
λ 1 0.010.040.000.950.010.040.000.950.010.040.000.95
λ 2 0.010.130.020.950.010.130.020.960.010.130.020.95
γ 1 0.010.140.020.950.010.140.020.950.000.140.020.95
γ 2 −0.000.250.060.95−0.000.240.060.97−0.010.240.060.97
θ 1 −0.000.150.020.96−0.010.150.020.960.000.140.020.95
θ 2 −0.030.210.050.96−0.030.210.050.98−0.020.190.040.98
η 1 −0.010.210.040.96−0.000.200.040.960.010.200.040.96
η 2 −0.070.670.460.99−0.040.590.350.98−0.030.590.340.98
c 1 0.000.070.000.94−0.000.090.010.95−0.000.090.010.97
c 2 0.010.130.020.900.020.180.030.950.010.180.030.95
α 11 −0.000.020.000.95−0.000.020.000.95−0.000.020.000.96
α 21 −0.000.030.000.96−0.000.030.000.96−0.000.030.000.96
β 0 0.010.210.040.960.010.200.040.960.010.200.040.96
Table 4. Results from simulation studies with the normal kernel function in the SMLE and PSMLE method, ( c 1 , c 2 ) = ( 0 , 0 ) , and n = 2000 .
Table 4. Results from simulation studies with the normal kernel function in the SMLE and PSMLE method, ( c 1 , c 2 ) = ( 0 , 0 ) , and n = 2000 .
SMLEPSMLEPMLE
ϑ Bias SD MSE CP Bias SD MSE CP Bias SD MSE CP
λ 1 0.000.030.000.940.000.030.000.940.010.030.000.94
λ 2 0.010.090.010.940.010.080.010.950.010.080.010.95
γ 1 0.010.100.010.950.010.100.010.950.000.100.010.95
γ 2 0.000.170.030.950.000.170.030.95−0.000.170.030.95
θ 1 −0.000.100.010.95−0.000.100.010.950.000.100.010.94
θ 2 −0.020.150.020.94−0.020.150.020.95−0.010.150.020.95
η 1 −0.010.150.020.94−0.000.150.020.950.010.150.020.95
η 2 −0.040.470.220.97−0.030.450.200.97−0.010.450.200.96
c 1 0.000.040.000.95−0.000.040.000.97−0.000.030.000.98
c 2 0.020.080.010.920.020.100.010.970.010.090.010.98
α 11 −0.000.010.000.95−0.000.010.000.95−0.000.010.000.94
α 21 −0.000.020.000.96−0.000.020.000.95−0.000.020.000.96
β 0 0.020.140.020.950.010.140.020.950.010.140.020.95
Table 5. Simulation results from the SMLE method with the normal and logistic kernel functions, ( c 1 , c 2 ) = ( 0.5 , 0 ) , and n = 500 .
Table 5. Simulation results from the SMLE method with the normal and logistic kernel functions, ( c 1 , c 2 ) = ( 0.5 , 0 ) , and n = 500 .
NormalLogistic
ϑ Bias SD MSE CP Bias SD MSE CP
λ 1 0.010.050.000.930.010.050.000.94
λ 2 −0.000.160.030.96−0.010.170.030.96
γ 1 0.010.160.030.970.020.180.030.97
γ 2 0.010.340.120.950.020.370.130.96
θ 1 −0.030.250.070.93−0.050.280.080.93
θ 2 −0.080.330.120.92−0.110.350.140.92
η 1 −0.060.360.130.93−0.070.380.150.93
η 2 −0.241.001.050.91−0.200.920.880.88
c 1 0.010.130.020.950.030.170.030.96
c 2 0.030.230.050.900.030.300.090.91
α 11 −0.000.030.000.95−0.000.030.000.96
α 21 0.000.050.000.960.000.050.000.96
β 0 0.020.300.090.940.030.300.090.95
Table 6. Simulation results from the SMLE method for c 1 = 0.5 and different choices of c 2 when the kernel smoothing function is normal, the bandwith is optimal and n = 500 .
Table 6. Simulation results from the SMLE method for c 1 = 0.5 and different choices of c 2 when the kernel smoothing function is normal, the bandwith is optimal and n = 500 .
c 2 = 0.5 c 2 = 0 c 2 = 0.5
ϑ Bias SD MSE CP Bias SD MSE CP Bias SD MSE CP
λ 1 0.010.050.000.930.010.050.000.930.010.050.000.93
λ 2 0.000.160.020.96−0.000.160.030.96−0.000.170.030.96
γ 1 0.010.160.030.960.010.160.030.970.000.180.030.96
γ 2 −0.000.430.180.950.010.340.120.950.010.310.100.94
θ 1 −0.020.250.060.94−0.030.250.070.93−0.030.270.070.93
θ 2 −0.070.320.110.94−0.080.330.120.92−0.130.400.180.91
η 1 −0.070.340.120.94−0.060.360.130.93−0.030.370.140.94
η 2 −0.130.760.600.92−0.241.001.050.91−0.181.081.200.88
c 1 0.010.140.020.950.010.130.020.950.000.170.030.94
c 2 0.020.250.060.900.030.230.050.900.050.280.080.87
α 11 −0.000.030.000.96−0.000.030.000.95−0.000.030.000.96
α 21 0.000.040.000.960.000.050.000.960.000.050.000.95
β 0 0.040.330.110.960.020.300.090.940.010.270.070.95
Table 7. Simulation results from the SMLE method with h = 0.5 , 0.2, and 0.1, the normal kernel function, ( c 1 , c 2 ) = ( 0.5 , 0.5 ) and n = 500 .
Table 7. Simulation results from the SMLE method with h = 0.5 , 0.2, and 0.1, the normal kernel function, ( c 1 , c 2 ) = ( 0.5 , 0.5 ) and n = 500 .
h = 0.5 h = 0.2 h = 0.1
ϑ Bias SD MSE CP Bias SD MSE CP Bias SD MSE CP
λ 1 0.010.050.000.930.010.050.000.920.010.050.000.93
λ 2 −0.020.190.040.96−0.010.180.030.96−0.000.170.030.96
γ 1 0.020.220.050.970.010.190.040.970.010.170.030.97
γ 2 0.020.360.130.950.010.310.100.940.000.300.090.94
θ 1 −0.120.390.170.91−0.040.280.080.92−0.020.260.070.93
θ 2 −0.320.660.550.88−0.160.440.220.91−0.120.370.150.92
η 1 −0.070.520.270.93−0.040.400.160.94−0.030.370.140.93
η 2 −0.131.231.530.92−0.130.990.990.91−0.151.021.060.91
c 1 0.050.350.130.950.010.230.050.950.010.150.020.92
c 2 0.190.600.390.880.070.360.130.890.040.240.060.86
α 11 −0.000.030.000.96−0.000.030.000.96−0.000.030.000.95
α 21 0.000.040.000.950.000.050.000.950.000.050.000.95
β 0 0.040.300.090.950.020.280.080.960.010.270.070.95
Table 8. Simulation results from the SMLE method with h = 0.01 , 0.001, σ ^ n 1 / 3 , the normal kernel function, ( c 1 , c 2 ) = ( 0.5 , 0.5 ) and n = 500 .
Table 8. Simulation results from the SMLE method with h = 0.01 , 0.001, σ ^ n 1 / 3 , the normal kernel function, ( c 1 , c 2 ) = ( 0.5 , 0.5 ) and n = 500 .
h = 0.01 h = 0.001 h = σ ^ n 1 / 3
ϑ Bias SD MSE CP Bias SD MSE CP Bias SD MSE CP
λ 1 0.020.050.000.930.010.050.000.920.010.050.000.93
λ 2 0.000.150.020.940.010.130.020.93−0.000.170.030.96
γ 1 0.010.160.030.950.010.140.020.950.000.180.030.96
γ 2 0.010.270.070.930.010.240.060.930.010.310.100.94
θ 1 −0.010.230.050.92−0.010.200.040.92−0.030.270.070.93
θ 2 −0.080.330.120.90−0.070.270.080.90−0.130.400.180.91
η 1 −0.030.310.100.92−0.030.260.070.91−0.030.370.140.94
η 2 −0.140.810.670.87−0.030.490.240.91−0.181.081.200.88
c 1 0.010.070.010.810.010.060.000.770.000.170.030.94
c 2 0.010.090.010.820.010.070.000.800.050.280.080.87
α 11 −0.000.030.000.95−0.000.020.000.95−0.000.030.000.96
α 21 −0.000.040.000.93−0.000.040.000.920.000.050.000.95
β 0 0.020.250.060.950.010.210.040.940.010.270.070.95
Table 9. The estimates from the mixture cure models for the colon cancer data with and without thresholds in age.
Table 9. The estimates from the mixture cure models for the colon cancer data with and without thresholds in age.
Without Thresholds in AgeWith Thresholds in Age
Weibull
Parameter Estimate (SE) p-Value Estimate (SE) p-Value
Latency part
Lev+5FU−0.343 (0.517)0.51−1.453 (0.147)<0.01
age−0.001 (0.006)0.88−1.513 (0.260)<0.01
(Lev+5FU) × age0.008 (0.009)0.391.098 (0.382)<0.01
c 1 --67 (0.395)<0.01
Incidence part
Intercept−0.520 (0.722)0.47−0.743 (0.225)<0.01
Lev+5FU−0.414 (0.892)0.64−0.421 (0.285) 0.14
age0.004 (0.012)0.72−0.144 (0.440)0.74
(Lev+5FU) × age0.018 (0.015)0.210.247 (0.508)0.63
c 2 --66 (1.077)<0.01
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, K.; Peng, Y.; Balakrishnan, N. A Proportional Hazards Mixture Cure Model for Subgroup Analysis: Inferential Method and an Application to Colon Cancer Data. Stats 2026, 9, 1. https://doi.org/10.3390/stats9010001

AMA Style

Liu K, Peng Y, Balakrishnan N. A Proportional Hazards Mixture Cure Model for Subgroup Analysis: Inferential Method and an Application to Colon Cancer Data. Stats. 2026; 9(1):1. https://doi.org/10.3390/stats9010001

Chicago/Turabian Style

Liu, Kai, Yingwei Peng, and Narayanaswamy Balakrishnan. 2026. "A Proportional Hazards Mixture Cure Model for Subgroup Analysis: Inferential Method and an Application to Colon Cancer Data" Stats 9, no. 1: 1. https://doi.org/10.3390/stats9010001

APA Style

Liu, K., Peng, Y., & Balakrishnan, N. (2026). A Proportional Hazards Mixture Cure Model for Subgroup Analysis: Inferential Method and an Application to Colon Cancer Data. Stats, 9(1), 1. https://doi.org/10.3390/stats9010001

Article Metrics

Back to TopTop