Abstract
Let the response variable Y be the time until an event such as death. Assume that there are p predictors and that the response variable is right censored. Several survival regression models, including accelerated failure time models, have the form . This paper gives a simple method for estimating the covariances for some of these models. Plots are given for checking the goodness of fit of accelerated failure time models. Plots for checking the proportional hazards regression model are often harder to use.
MSC:
62N01; 62N02
1. Introduction
This section reviews some survival regression models, including plots for checking the models. The response variable is the time until an event such as death. Let the vector of predictor variables . Let the sufficient predictor , and let the estimated sufficient predictor where it is possible that . The ESP is sometimes called the estimated risk score.
Assume that the response variable is right censored so Y is not observed. Instead, the right censored survival time where is independent of the censoring time . Also if is censored and if is uncensored. Hence the data is for
For an accelerated failure time model, the log transformation of the response variable results in a multiple linear regression model. Hence multiple linear regression models will be useful. Now let the response variable Y be for multiple linear regression, so Y need not be a nonnegative time until event. A useful multiple linear regression model is or
for Assume that the are independent and identically distributed (iid) with expected value and variance . In matrix form, this model is
where Y is an vector of dependent variables, X is an matrix with ith row , is a vector, and e is an vector of unknown errors. Also and Cov( where is the identity matrix.
For a multiple linear regression model with heterogeneity, assume model (1) holds with and Cov is an positive definite matrix. When the are known, weighted least squares is often used. Under regularity conditions, the ordinary least squares (OLS) estimator can be shown to be a consistent estimator of .
For estimation with ordinary least squares, let the covariance matrix of x be and let the vector Let
Then the OLS estimators for model (1) are , , and
For a multiple linear regression model with iid cases, is a consistent estimator of under mild regularity conditions, while is a consistent estimator of .
For a parametric accelerated failure time (AFT) model,
where the are iid from a location scale family. The parameters are estimated by maximum likelihood.
The Weibull proportional hazards regression model or Weibull regression model is
where Y has a Weibull distribution if the probability density function of Y is
for and This regression model can also be fit using the Cox (1972) [1] proportional hazards regression model. Let the sufficient predictor . If satisfies a Weibull regression model, then satisfies a Weibull AFT with and . Exponential regression is the special case where .
Two other important AFTs are (i) the lognormal AFT where , where the are lognormal and the are normal, and (ii) the loglogistic AFT where , where the are loglogistic and the are logistic. For the loglogistic AFT, Y follows a proportional odds model. Y does not follow a proportional hazards regression model for the loglogistic and lognormal AFTs.
The Buckley and James (1979) [2] estimator is a nonparametric survival regression method for models of the form (3), and is a competitor for the parametric AFTs. When there is no censoring, this estimator is equivalent to the ordinary least squares estimator for multiple linear regression.
Often the log transformation results in a linear model with heterogeneity:
where the are independent with expected value and variance For the AFT and the Buckley James estimator, the variance is constant: does not depend on i.
The Cox (1972) [1] proportional hazards regression model is a semiparametric model with and hazard function
where the baseline hazard function is left unspecified. The survival function is
where is the baseline survival function. If is within the range of the predictors, then the baseline survival and hazard functions correspond to the survival and hazard functions of The cumulative hazard function for , and the hazard function for High hazard implies low survival times while low hazard implies long survival times.
The literature for checking the goodness of fit of the proportional hazards model is fairly large. See, for example, Arjas (1988) [3], Feng et al. (2017) [4], Gill and Schumaker (1987) [5], Kay (1984) [6], Lin and Wei (1991) [7], Marzec and Marzec (1997) [8], Ng’andu (1997) [9], Quantin et al. (1996) [10], and Yu et al. (2008) [11].
Grambsch and Therneau (1994) [12] give a useful graphical check. Suppose the ith case had an uncensored survival time . Let the scaled Schoenfeld residual for the ith observation and jth variable be . For each variable, plot the versus the and add the loess curve. If the loess curve is approximately horizontal for each of the p plots, then the proportional hazards assumption is reasonable. Alternatively, fit a line to each plot and test that each of the p slopes is equal to 0. The R function cox.zph makes both the plots and tests.
2. Materials and Methods
2.1. Estimating for Some Censored Survival Regression Models
This subsection derives an estimator for where the right censored are not observed. Let the ordinary least squares (OLS) estimator be . Assume that the cases are iid. Since model (4) is a multiple linear regression model, under mild regularity conditions, the population parameter vectors are equal: . Thus When the response is censored, several survival regression models give consistent estimators of . Hence
The Buckley James estimator is equivalent to the OLS estimator when there is no censoring, and censoring does not change the population parameter. Although the population parameters are the same, the estimators are very different. The OLS estimator is not a consistent estimator of when censoring occurs. Hence is not a consistent estimator of if the response variable is censored.
If an accelerated failure time model is used, then two estimators are and . These two estimators require consistent estimators of , which occurs, for example, if the cases are iid from some population with covariance matrix and covariance vector . The survival times can be right censored, but the predictor variables are not censored. Note that the predictor variables that have the highest absolute correlation with Z have the highest values of .
Although this technique is a simple plug in estimator, to our knowledge, the technique has not been suggested previously for survival regression models. In the literature, there are several estimators for the correlation , where X and Y are survival times. These estimators usually use maximum likelihood estimation or multiple imputation assuming that are iid from a bivariate normal distribution. See, for example, Barchard and Russell (2024) [13], Li et al. (2018) [14], and Lyles, Fan, and Chuachoowong (2001) [15]. For covariance estimation, see references in Pesonen, Pesonen, and Nevalainen (2015) [16].
2.2. The EE Plot
It is important to check that a parametric AFT model is reasonable with the Buckley James estimator before using Equation (6). Make an EE plot of versus . For the Weibull AFT, also plot versus the above two ESPs, where PH stands for the Cox proportional hazards estimator. The plotted points in the EE plot should scatter tightly about the identity line with zero intercept and unit slope. Lack of fit is suggested if the plotted points do not cluster tightly about the identity line. The identity line is included in the EE plots as a visual aid. Suppose the two estimators of are consistent. Then as , the correlation of the plotted points goes to 1 in probability for any finite interval, e.g., from the 1st percentile to the 99th percentile of .
For the Exponential regression model, and . Since the Exponential regression model with is a special case of the Weibull regression model with , plots can use , , or , and where is the Weibull regression that estimates .
The EE plots are much easier to use than other plots in the literature, since tight clustering about the identity line suggests goodness of fit.
3. Some Other Plots
Let where the are residuals. Collett (2003, p. 231) [17] defines a standardized residual For accelerated failure time models, a log censored response (LCR) plot is a plot of versus log( on the vertical axis with plotting symbol 0 for censored cases and + for uncensored cases. The identity line with unit slope and zero intercept is added to the plot, and the vertical deviations from the identity line = . This plot is useful to check for cases with unusual survival times. The plot is also useful for checking for linearity (which is often rather weak due to the censored response and skewed data).
Ritov (1990) [18] and Tsiatis (1990) [19] consider the censored survival regression model where t is a known monotone function. Then a plot of versus is useful. A power transformation has the form for and for where
The modified power transformation family
for and .
A graphical method for response transformations makes the transformation plot of versus using the Buckley James estimator with as the “response” for . Then a candidate response transformation has the transformation plot that suggests a linear model is appropriate.
Smith (2002, p. 191) [20] notes that the Buckley James estimator replaces the censored by their estimated conditional expected values, resulting in a renovated response variable . Using the instead of the may increase the linearity of the plot.
The slice survival plot divides the ESP into J groups of roughly the same size. For each group j with cases, the model estimated survival function is computed using the x corresponding to the “median ESP” of the group (the kth order statistic of the ESP in group j, where ). Let be the Kaplan Meier estimator computed from the survival times in the jth group. For each group, is plotted and is plotted as circles at the uncensored event times . The survival regression model is reasonable if the circles “track well” in each of the J plots.
If the slice widths go to zero, but the number of cases per slice increases to ∞ as then the Kaplan Meier estimator and the model estimator converge to if the model holds. Simulations suggest that the two survival functions are “close” for moderate n and nine slices. For small n and skewed predictors, some slices may be too wide in that the model is correct but is not a good approximation of where corresponds to the x used to compute
For the Cox model, if pointwise confidence interval (CI) bands are added to the plot, then “tracks well” if most of the plotted circles do not fall very far outside the pointwise CI bands since these pointwise bands are not as wide as simultaneous bands. Collett (2003, pp. 241–243) [17] places several observed Kaplan Meier curves with fitted curves on the same plot.
Survival regression is the study of the conditional survival , and the slice survival plot is a useful tool for visualizing in the background of the data. Suppose the jth slice is narrow so that . If the model is reasonable, , and the number of uncensored cases in the jth slice is not too small, then (These quantities approximate for the Cox model.) Thus the nonparametric Kaplan Meier estimator is used to check the model estimator in each slice.
The slice survival plot tailored to the Cox model is closely related to the May and Hosmer (1998) [21] test. Also, van Houwelingen et al. (2006) [22] use similar ideas but place the J Kaplan Meier curves on one plot and the J Cox survival curves on another plot. The ESP is a scalar while x is a vector. Using the ESP instead of x in plots is an important dimension reduction technique (and is similar to using a scalar valued minimal sufficient statistic instead of the p-dimensional sufficient statistic x.) Plots have been suggested by several authors with x divided into J groups instead of the ESP. For example, see Miller (1981, p. 168) [23]. Hosmer and Lemeshow (1999, pp. 141–145) [24] suggest making plots based on the quartiles of the ith predictor , and note that a problem with Cox survival curves, , is that they may use inappropriate extrapolation. Using the ESP results in narrow slices with many cases, and adding Kaplan Meier curves shows if there is extrapolation.
3.1. Examples and Simulations
Example 1.
The ovarian cancer data is from Collett (2003, pp. 187–190) [17] and Edmunson et al. (1979) [25]. The response variable is the survival time of ovarian cancer patients in days, with predictors age in years and treat (1 for cyclophosphamide alone and 2 for cyclophosphamide combined with adriamycin). See Figure 1 for the three EE plots for the ovarian cancer data, where ESPW=ESPA. The Weibull AFT appears to be appropriate for this data set. Then , , , and Hence Figure 2 shows the EE plots when an Exponential AFT is used instead of the Weibull AFT. Now the plotted points do not cluster about the identity line. Hence the Exponential AFT should not be used. Figure 3 shows the LCR plot using the Weibull AFT.
Figure 1.
Three EE plots for the ovarian cancer data.
Figure 2.
Three EE plots with the exponential AFT.
Figure 3.
LCR plot for ovarian cancer data.
Example 2.
R contains a data set nwtco where the response variable Y is the time until relapse with . The model used predictors histol = tumor histology from central lab, instit = tumor histology from local institution, age in months, and stage of disease from 1 to 4 (treated as a continuous variable). In Figure 4, the Grambsch and Therneau (1994) [12] plots suggest that the Cox model is not valid since not all of the loess curves are flat, and the global test has p-value The slice survival plot in Figure 5 shows that the Cox survival estimators and Kaplan Meier estimators are nearly identical in the six slices, suggesting that the Cox model is a reasonable approximation to the data. The greatest contributors to lack of fit seem to be the predictors age and stage corresponding to the bottom two plots of Figure 4, and survival for small ESP corresponding to the upper left plot in Figure 5.
Figure 4.
Grambsch and Therneau plots for NWTCO data.
Figure 5.
Slice survival plot for NWTCO data: dashes are pointwise CI bands.
3.2. Simulation
An R code similar to that of Zhou (2001) [26] was used to generate a Weibull regression data set with parameter vector . Then the Weibull AFT parameter vector Hence . The simulation used with zeroes and with k ones and zeroes. The population was computed.
In the simulations, for , we generated where the p elements of the vector are iid N(0,1). Let the matrix with and where for . Then the vector so that where the diagonal entries and the off diagonal entries . Hence the correlations are for . If , then as where . As gets close to 1, the predictor vectors cluster about the line in the direction of
Then 5000 runs are used to obtain the estimators. The means and standard deviations of the estimators are given. In the simulation, the uncensored values of Z are known. Hence the first estimator is the usual sample covariance vector . For real data, this estimator cannot be computed since only censored values of Z are known. The second estimator is from the Weibull AFT. The third estimator is using the Buckley James estimator. Let . Table 1 gives 2 lines per simulation scenario. The first line gives the means while the second line gives the standard deviations. A value of means the absolute value was less than 0.0005. Table 2 gives similar results with . Table 3 used correlated predictors. The simulation needed roughly to avoid convergence problems in the 5000 runs.
Table 1.
.
Table 2.
.
Table 3.
.
All three estimators worked well. It is not surprising that a correctly specified AFT would slightly outperform the Buckley James estimator (have the smallest standard deviations).
4. Discussion
The Harrell (2015) [27] rms library is useful for the Buckley James estimator. For more on estimators for model (4), see, for example, Heller and Simonoff (1990) [28], James and Smith (1984) [29], Lai and Ying (1991) [30], Lin and Wei (1992) [31], and Zeng and Lin (2007) [32]. The Kaplan Meier estimator is due to Kaplan and Meier (1958) [33].
Next, we provide some directions for further research.
(a) Under iid cases, even if heterogeneity is present. Hence where, for example, is one of the estimators studied by Yu, Liu, and Chen (2024) [34].
(b) Similar ideas can be used for other censored regression models, such as a Tobit regression, provided that the cases are iid, and the population parameter vector .
(c) The Buckley James estimator can also be used for the survival regression model where t is a known monotone function and Z is right censored.
(d) Better large sample theory would be useful so confidence intervals can be made for .
(e) More resistance to outliers would be useful, although the LCR plot is somewhat useful for detecting outliers.
(f) When the Grambsch and Therneau test and the slice survival plot give contradictory evidence on the proportional hazards assumption, how should that evidence be used?
Software
The R software 4.4.1 was used in the simulations. See R Core Team (2024) [35]. Programs are in the collection of R functions survpack.txt, available from (http://parker.ad.siu.edu/Olive/survpack.txt, accessed on 29 October 2025). The function BJcovxz generates a Weibull regression data set with right censored survival times. The function BJcovxzsim was used for Table 1. The function vnwtco was used to produce plots for Example 2. The function vovar is useful for Example 1. The function bphgfit makes a slice survival plot if there is a single covariate for group (treatment) 1 and , otherwise. The function bphsim3 simulates the above function. The function phgfit2 makes a slice survival plot when x is a vector. Both of these functions are for the Cox model. The function phsim5 simulates the above function. The function wphsim simulates the slice survival plot for Weibull regression. The function wregsim2 simulates EE plots for Weibull regression.
Author Contributions
Validation, S.J.L.; Writing—original draft, D.J.O. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
One data set is available from (http://parker.ad.siu.edu/Olive/survdata.txt, accessed on 29 October 2025). The other data set comes with R.
Acknowledgments
The authors thank the editors and referees for their work.
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| AFT | accelerated failure time |
| ESP | estimated sufficient predictor |
| iid | independent and identically distributed |
| LCR | log censored response |
| OLS | ordinary least squares |
References
- Cox, D.R. Regression models and life-tables. J. R. Stat. Soc. B 1972, 34, 187–220. [Google Scholar] [CrossRef]
- Buckley, J.; James, I. Linear regression with censored data. Biometrika 1979, 66, 429–436. [Google Scholar] [CrossRef]
- Arjas, E. A graphical method for assessing goodness of fit in Cox’s proportional hazards model. J. Am. Stat. Assoc. 1988, 83, 204–212. [Google Scholar] [CrossRef]
- Feng, C.; Wang, H.; Zhang, Y.; Han, Y.; Liang, Y.; Tu, X.M. On testing proportionality in the Cox regression model by Andersen’s plot. Commun. Stat.-Theory Methods 2017, 46, 3489–3500. [Google Scholar] [CrossRef]
- Gill, R.D.; Schumaker, M. A simple test of the proportional hazards assumption. Biometrika 1987, 74, 289–300. [Google Scholar] [CrossRef]
- Kay, R. Goodness-of-fit methods for the proportional hazards model: A review. Rev. Epidemiol. Santé Publ. 1984, 32, 185–198. [Google Scholar]
- Lin, D.Y.; Wei, L.J. Goodness-of-fit tests for the general Cox regression model. Stat. Sin. 1991, 1, 1–17. [Google Scholar]
- Marzec, L.; Marzec, P. Generalized martingale-residual processes for goodness-of-fit inference in Cox’s type regression models. Ann. Stat. 1997, 25, 683–714. [Google Scholar] [CrossRef]
- Ng’andu, N.H. An empirical comparison of statistical tests for assessing the proportional hazards assumption of Cox’s model. Stat. Med. 1997, 16, 611–626. [Google Scholar] [CrossRef]
- Quantin, C.; Moreau, T.; Asselain, B.; Maccario, J.; Lellouch, J. A regression survival model for testing the proportional hazards hypothesis. Biometrics 1996, 52, 874–885. [Google Scholar] [CrossRef]
- Yu, Q.; Chappell, R.; Wong, G.Y.C.; Hsu, Y.; Mazur, M. Relationship between Cox, Lehmann, Weibull, and accelerated lifetime models. Commun. Stat.-Theory Methods 2008, 37, 1458–1470. [Google Scholar] [CrossRef]
- Grambsch, P.M.; Therneau, T.M. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika 1994, 81, 515–526. [Google Scholar] [CrossRef]
- Barchard, K.A.; Russell, J.A. Distorted correlations among censored data: Causes, effects, and correction. Behav. Res. Methods 2024, 56, 1207–1228. [Google Scholar] [CrossRef] [PubMed]
- Li, Y.; Gillespie, B.W.; Shedden, K.; Gillespie, J.A. Profile likelihood estimation of the correlation coefficient in the presence of left, right or interval censoring and missing data. R J. 2018, 10, 159–179. [Google Scholar] [CrossRef]
- Lyles, R.H.; Fan, D.; Chuachoowong, R. Correlation coefficient estimation involving a left censored laboratory assay variable. Stat. Med. 2001, 20, 2921–2933. [Google Scholar] [CrossRef]
- Pesonen, M.; Pesonen, H.; Nevalainen, J. Covariance estimation for left-censored data. Comput. Stat. Data Anal. 2015, 92, 13–25. [Google Scholar] [CrossRef]
- Collett, D. Modelling Survival Data in Medical Research, 2nd ed.; Chapman & Hall/CRC: Boca Raton, FL, USA, 2003. [Google Scholar]
- Ritov, Y. Estimation in a linear regression with censored data. Ann. Stat. 1990, 18, 303–328. [Google Scholar] [CrossRef]
- Tsiatis, A.A. Estimating regression parameters using linear rank tests for censored data. Ann. Stat. 1990, 18, 354–372. [Google Scholar] [CrossRef]
- Smith, P.J. Analysis of Failure and Survival Data; Chapman and Hall/CRC: Boca Raton, FL, USA, 2002. [Google Scholar]
- May, S.; Hosmer, D.W. A simple method for calculating a goodness-of-fit test for the proportional hazards model. Lifetime Data Anal. 1998, 4, 109–120. [Google Scholar] [CrossRef]
- van Houwelingen, H.C.; Bruinsma, T.; Hart, A.A.M.; Veer, L.J.; Wessels, L.F.A. Cross-validated Cox regression on microarray gene expression data. Stat. Med. 2006, 25, 3201–3216. [Google Scholar] [CrossRef]
- Miller, R. Survival Analysis; Wiley: New York, NY, USA, 1981. [Google Scholar]
- Hosmer, D.W.; Lemeshow, S. Applied Survival Analysis: Regression Modeling of Time to Event Data; Wiley: New York, NY, USA, 1999. [Google Scholar]
- Edmunson, J.H.; Fleming, T.R.; Decker, D.G.; Malkasian, G.D.; Jorgenson, E.O.; Jeffries, J.A.; Webb, M.J.; Kvols, L.K. Different chemotherapeutic sensitivities and host factors affecting prognosis in advanced ovarian carcinoma versus minimal residual disease. Cancer Treat. Rep. 1979, 63, 241–247. [Google Scholar]
- Zhou, M. Understanding the Cox regression models with time–change covariates. Am. Stat. 2001, 55, 153–155. [Google Scholar] [CrossRef]
- Harrell, F.E. Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis, 2nd ed.; Springer: New York, NY, USA, 2015. [Google Scholar]
- Heller, G.; Simonoff, J.S. A comparison of estimators for regression with a censored response variable. Biometrika 1990, 77, 515–520. [Google Scholar] [CrossRef]
- James, I.R.; Smith, P.J. Consistency results for linear regression with censored data. Ann. Stat. 1984, 12, 590–600. [Google Scholar] [CrossRef]
- Lai, T.L.; Ying, Z. Large-sample theory of a modified Buckley-James estimator for regression analysis with censored data. Ann. Stat. 1991, 19, 1370–1402. [Google Scholar] [CrossRef]
- Lin, J.S.; Wei, L.J. Linear regression analysis based on Buckley-James estimating equation. Biometrics 1992, 48, 679–681. [Google Scholar] [CrossRef]
- Zeng, D.; Lin, D.Y. Efficient estimation for the accelerated failure time model. J. Am. Stat. Assoc. 2007, 102, 1387–1396. [Google Scholar] [CrossRef]
- Kaplan, E.L.; Meier, P. Nonparametric estimation from incomplete observations. J. Am. Stat. Assoc. 1958, 53, 457–481. [Google Scholar] [CrossRef]
- Yu, L.; Liu, L.; Chen, D.G. Extending Buckley-James method for heteroscedastic survival data. J. Stat. Comput. Sim. 2024, 94, 1776–1792. [Google Scholar] [CrossRef]
- R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2024; Available online: www.R-project.org (accessed on 29 October 2025).
- Johana Lemonge, S. OLS Testing with Predictors Scaled to Have Unit Sample Variance. Ph.D. Thesis, Southern Illinois University, Carbondale, IL, USA, 2025. Available online: http://parker.ad.siu.edu/Olive/sSanjuka.pdf (accessed on 29 October 2025).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.