Next Article in Journal
A-Spline Regression for Fitting a Nonparametric Regression Function with Censored Data
Previous Article in Journal
Depth Induced Regression Medians and Uniqueness
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Brief Overview of Restricted Mean Survival Time Estimators and Associated Variances

1
BioPharma Early Biometrics and Statistical Innovation, Data Science & AI, BioPharmaceuticals R&D, AstraZeneca, 43183 Gothenburg, Sweden
2
Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, 40530 Gothenburg, Sweden
3
The Swedish Hip Arthroplasty Register, Registercentrum Västra Götaland, 41390 Gothenburg, Sweden
*
Author to whom correspondence should be addressed.
Stats 2020, 3(2), 107-119; https://doi.org/10.3390/stats3020010
Submission received: 1 April 2020 / Revised: 14 May 2020 / Accepted: 22 May 2020 / Published: 26 May 2020
(This article belongs to the Section Statistical Methods)

Abstract

:
Restricted Mean Survival Time ( R M S T ) experiences a renaissance and is advocated as a model-free, easy to interpret alternative to proportional hazards regression and hazard rates with implication in causal inference. Estimation of R M S T and associated variance is mainly done by numerical integration of Kaplan–Meier curves. In this paper we briefly review the two main alternatives to the Kaplan–Meier method; analysis based on pseudo-observations, and the flexible parametric survival method. Using computer simulations, we assess the efficacy of the three methods compared to a fully parametric approach where the distribution of survival times is known. Thereafter, the three methods are directly compared without any distributional assumption for the survival data. Generally, flexible parametric survival methods outperform both competitors, however the differences are small.

1. Introduction

In 1958, Kaplan and Meier [1] published their paper on the product limit estimator for survival probabilities, an estimator that can handle censored and/or truncated survival data. The easy to use and interpretable nature of the method, made it an instant success. The Kaplan–Meier estimator have since then become the dominant method to summarize survival data and parametric methods have lost relevance. This prompted Miller [2] to criticize this over-reliance of applied scientist on the Kaplan–Meier curve. Miller established, both theoretically and numerically, the loss of efficacy of the Kaplan–Meier curve compared to parametric methods. Jullum and Hjort [3] further explored this aspect and corroborated Miller’s findings. Meier et al. [4] have addressed some of Miller’s critique, by highlighting the difficulties in choosing the right parametric form. In addition to the Kaplan–Meier curve, Meier et al. [4] have addressed the efficacy of a functional of the Kaplan–Meier curve: the Restricted Mean Survival Time ( R M S T ), the average survival time up to a given time point. Theoretical aspects of R M S T (or Kaplan–Meier integrals) are well studied [5,6,7] and currently R M S T is experiencing a renaissance [8,9,10] being hailed as a model-free, easy to interpret statistic with implications in causal inference [11]. Not needing to establish the proper parametric model has certain advantages. However, generally non-parametric methods have lower efficacy than parametric ones. Generally, practical applications do not consider parametric estimators for R M S T and use either numerical integration of the Kaplan–Meier curve [12], pseudo-observations [13,14,15] or flexible parametric survival methods [16]. In this paper, we briefly summarize parametric and non-parametric estimators for R M S T and associated variance estimators. We will use Monte Carlo simulations to examine their efficacy.

2. Restricted Survival Times

We assume that survival times to an event of interest X 1 , , X n are independently and identically distributed ( i i d ) according to the cumulative distribution function F ( x ) and survival function S ( x ) . Similarly we assume C 1 , , C n to be i i d censoring times according to the distribution function G ( c ) . Additionally, we assume that follow-up times are restricted, by an upper threshold τ set by the researcher. Thus the actual observed time for subject j is T j = min X j , C j , τ . Additionally, we have δ j = I X j min ( C j , τ ) as event indicator that takes a value of 1 if the event of interest is recorded at the given time, 0 otherwise. We assume independence between failure and censoring time. We let t ( 1 ) . . . t ( n ) denote the ordered observed survival times and δ ( 1 ) , . . . , δ ( n ) their associated indicator values. Survival times are summarized with the help of the survival function S ( t ) = P ( T > t ) and the τ -restricted mean survival times ar given by 0 τ S ( t ) d t [12].
From Andersen et al. [14] we know that n 0 τ S ^ ( t ) d t 0 τ S ( t ) d t D N ( 0 , σ 2 ) for some variance σ 2 > 0 . In the next sections we review, then evaluate, different estimators for 0 τ S ( t ) d t and σ 2 .

3. Estimators of RMST and Associated Variance

The statistical literature describes several parametric and non-parametric ways of estimating 0 τ S ( t ) d t and its associated variance.

3.1. Parametric Methods

For parametric estimation, we need to make assumptions about the distribution of the unobserved survival times, X. Once the distribution function is established, the Restricted Mean Survival Time ( μ τ ) is estimated as
μ τ = 0 τ S ( θ , t ) d t .
where θ is the parameter vector for the assumed distribution function.
In the following, we outline three possibilities for parametric estimation
  • Likelihood and δ -method based approach.
    Estimation of 0 τ S ( θ , t ) d t requires knowledge of θ . Estimates for θ under censoring can be obtained with maximum likelihood. Given the censoring indicator I { X j C j } and Y j = min ( X j , C j ) we can define the likelihood function for θ as
    L ( θ | Y , δ ) = j = 1 n f ( Y j , θ ) I { X j C j } 1 F ( Y j , θ ) 1 I { X j C j } .
    For most cases there are no closed-form solutions for θ ^ and V a r ( θ ^ ) , however, numerical estimates are enough. Once estimates for θ are available, 0 τ S ( θ , t ) d t can be estimated either by using θ ^ in closed form equations, or as plug-ins in numerical integration. The variance for 0 τ S ( θ , t ) d t by the δ -method is given by
    V a r ( μ τ ) = i δ μ τ δ θ i 2 V a r ( θ i ) + i i j δ μ τ δ θ i δ μ τ δ θ j C o v ( θ i , θ j )
    where θ i is the i t h element of the population parameter.
  • M-estimation (or estimating equation)
    M-estimation seeks solution to the vector equation i = 1 n ψ ( T i , θ ^ ) = 0 . Here, T i are independently and identically distributed restricted survival times, θ ^ is a p-dimensional parameter and ψ is a known ( p × 1 ) -function that does not depend on i or n [17]. M-estimates are asymptotically normal with variance n 1 V ( θ 0 ) . V ( θ 0 ) is a sandwich variance estimator given by
    V ( θ 0 ) = A ( θ 0 ) 1 B ( θ 0 ) { A ( θ 0 ) 1 } T .
    where,
    A ( θ 0 ) = E θ T ψ ( T i , θ )
    and
    B ( θ 0 ) = E ψ ( T i , θ ^ ) ψ ( T i , θ ) T .
    Wang [18] discusses alternative formulations for ψ with regards to the strictly non-negative and often skewed nature of survival data. For an in-depth discussion on the M-estimators the reader is referred to [17,19].
  • The second cumulant
    The variance equivalent to the second cumulant of a probability distribution of T j . If there is no censoring present V a r ( T ) = E τ [ T 2 ] E τ [ T ] 2 and can be estimated as
    V a r ( T ) = 2 0 τ t S ( t ) d t 0 τ t S ( t ) d t 2
    leading to V a r ( μ t ) = n 1 V a r ( T ) .
    This estimator is not practical due to censoring. When we have censoring in the data Rosyton & Parmar [20] suggested multiplying n 1 V a r ( T ) with a positive scaling factor ϕ , so that ϕ = 1 if no censoring and ϕ > 1 otherwise. The scaling factor ϕ can be estimated with help of Monte Carlo simulation.
    Alternatively, the variance can be estimated with the Stute estimator [6]. Adopting the notation of Stute [21], (i.e., x d F ( x ) = x f ( x ) d x ) and ( 1 H ) = ( 1 F ) ( 1 G ) and φ a transformation of X so that n 1 i φ ( X i ) φ d F S φ = μ τ . Here, V a r [ φ d F ^ n ] = n 1 σ 1 2 where
    σ 1 2 = 0 τ φ 2 1 G d F 0 τ φ d F 2 0 τ x τ φ d F 2 1 F ( x ) [ 1 H ( x ) ] 2 G ( d x ) .
    Both approaches require assumptions about the distribution of the survival times and censoring times. These two estimators of variance will not be further evaluated, however they do represent an important aspect of R M S T .

3.2. Flexible Parametric Survival Methods

In 2002, Royston and Parmar [22] introduced flexible parametric survival models as a simple way to deal with non-proportional hazards, and as a way of easy visualization of the hazard function. In a later paper the authors exemplified the utility of the model in estimation of R M S T [16]. The model is defined through the log cumulative hazard function as
ln H ( t , z ) = ln H 0 ( t ) + z T ( β ) ,
where the baseline ( H 0 ) is modeled as a restricted cubic spline in log time, ln H 0 ( t ) = s ( ln t ) , and where z is a covariate vector. The splines s ( ln t ) are linear combination of basis functions and regression parameters. The model requires a number of knots and a probability distribution function. This later is often assumed to be the Weibull distribution, which we used in this article, with 2 knots. R M S T is estimated by predicting the log cumulative hazard function using Equation (4) at a suitable grid of time points equally spaced between 0 and τ . Thereafter the cumulative hazard is transformed into the survival function S ^ R P ( t ) , and μ τ = 0 τ S ^ R P ( t ) d t . The variance is estimated with the δ -method or by bootstrapping.
While this approach is fully parametric and the survival function is completely specified, the model is ought to be robust against distribution misspecification. Integration proceeds with the formula of the survival function of the assumed distribution, but with numerical integration for the area under S ^ R P ( t ) between 0 and τ . Additionally, the splines and the piece-wise modeling between the knots offer great flexibility to the model.

3.3. Non-Parametric Methods

3.3.1. The Kaplan–Meier-Method

The Kaplan–Meier method is perhaps the most well-known and used method for estimation of R M S T . In 1975, Meier [12] proved that the substitution of the Kaplan–Meier estimator in 0 τ S ( t ) d t provides an (approximately) unbiased estimate that is asymptotically normal.
The Kaplan–Meier estimator is given by
S ^ K M t = T i t 1 δ i R i
where T i is the ordered follow up time, R i is the number of subjects at risk prior to T i and δ i the number of events that happened at time T i . The R M S T is estimated as
μ τ = 0 τ S ^ K M ( t ) d t .
As far as we know, Irwin in 1949 [23], was the first to provide an estimator for the variance based on life tables. Later, Meier [12] in 1975 proved that reducing the data to the life table format was unnecessary and established the following variance estimator
V ( μ τ ) 2 0 τ u τ S ( u ) S ( v ) C ( u ) d v d u ,
where C ( u ) = i ( 1 S ( u i ) ) / n S ( u i ) is the sum of the squared coefficient of variation for S ( u ) . Klein & Moeschberger [24] provided a variant with the Greenwood variance estimator of survival probabilities as plug-ins and popularized Meier’s variance estimator in the following form:
V ^ ( μ τ ) = i = 1 D t i τ S ^ K M ( t ) d t 2 δ i Y i ( Y i δ i ) ,
where D = i δ i . The Aalen - Johansen variance estimator of survival probabilities could also serve as plug-in in Meier’s estimator, or re-sampling methods can be used as well. However, as Eaton et al. [25] noted based on Monte Carlo simulations, the estimator with the Greenwood plug-in give variance estimates closest to empirical and asymptotic variances.

3.3.2. Pseudo-Observations

Pseudo-observations is a method that is based on the pseudo-values from a jackknife statistic constructed from simple summary statistic estimates which are then used in a generalized estimating equation to obtain estimates of the model parameters [13,15].
The parameter of interest is μ τ . If there is no censoring, then μ τ = n 1 i T i . Under censorship, an approximately unbiased estimator for μ τ is 0 τ S ^ ( t ) d t . The pseudo-observation T j * associated with the j t h observed time is estimated as
T j * = n 0 τ S ^ ( t ) d t ( n 1 ) 0 τ S ^ j ( t ) d t
where S j is the survival curve estimated withouth the j t h observation. The assumption is that μ ( T * ) μ ( T ) and σ 2 ( T * ) σ 2 ( T ) . The most straightforward estimators are μ ^ ( T * ) = n 1 j T j * and σ 2 ( T * ) = ( n 1 ) 1 j ( T j * T ¯ * ) 2 . The approach preferred by Andersen et al. [13] is based on M-estimators. The usual assumption is that the underlying distribution is symmetric about the location parameter and the corresponding function ψ is an odd function for the location parameter (mean in our case, θ 1 ) and an even function for the targeted scale parameter (variance, θ 2 ) defined as
i = 1 n ψ ( T j * , θ ^ 1 , θ ^ 2 ) = ( T j * θ 1 ) ( T j * θ 1 ) 2 θ 2 = 0 0
Solving for V ( θ 0 ) (Equation (1)) starts with solving A ( θ 0 ) (Equation (2)) and B ( θ 0 ) (Equation (3)). In this case A ( θ 0 ) = I ( θ 0 ) , the identity matrix. Solving for B ( θ 0 ) gives
B ( θ 0 ) = σ 2 μ 3 μ 3 μ 4 σ 4
where, μ k is the k t h central moment. The variance estimator V ( θ 0 ) = I ( θ 0 ) 1 B ( θ 0 ) { I ( θ 0 ) 1 } T = B ( θ 0 ) . The variability estimates by V ( θ 0 ) are robust to model misspecification. For symmetric distributions (i.e., f ( μ + x ) = f ( μ x ) ) the off-diagonal elements of V ( θ 0 ) are zero. For asymmetric distributions (i.e., μ 3 > 0 ) there is an asymptotic variance inflation due to estimating variance in addition to the mean [19].

4. Simulation Studies

We used Monte Carlo simulations to gain insight of the small sample proprieties of the estimators. We used the R computing environment (R 3.6.0) [26] for calculations and illustrations. Codes are available from the corresponding author.

4.1. Relative Efficacy under Parametric Assumption

A first set of simulation studies was used to assess the relative efficacy of non-parametric methods and flexible parametric survival methods compared to the parametric estimates of variance. The relative efficiency (eff) of the estimators was assessed as the ratio between the estimated variances of the different estimators and the estimator with the minimum variance
eff = min σ M 2 σ M 2 ,
where M is the model considered for estimation. We were interested in seeing how relative efficacy varies as a function of (i) sample size, (ii) censoring percentage and (iii) restriction time. To facilitate calculation and possibility of obtaining closed form solutions for the parametric variance estimates, we assumed that survival times were exponentially distributed with rate, λ = 1 / 365 . Similarly, we assumed that censoring times were exponentially distributed with rate γ .
In the first set of simulations we set the sample size to 50 and 100 with censoring percentages at 50% and 75%. Given the assumed censoring percentages ( c p ) we estimated γ , the rate parameter for the censoring distribution, as the solution for the equation
c p = 1 λ ( λ + γ ) 1
where λ ( λ + γ ) 1 is an estimate for P X < C = 0 t C t f t d t [27]. Every setting was simulated 1000 times and we estimated μ τ and V a r ( μ τ ) with the δ -method, M-estimation, the Kaplan–Meier method, flexible parametric survival and pseudo-observations with τ = 365 .
Table 1 presents the results of the simulation study. The expected μ 365 = 230 days. Generally, all methods provided unbiased estimates of μ 365 , with small deviations.
Of the 5 tested variance estimators, M-estimation proved to have the smallest estimates. The δ -method variance differed only marginally from the M-estimation. Of the three practically usable methods, the Kaplan–Meier method, pseudo-observations and flexible parametric survival, the later had the highest relative efficacy, irrespective of the setting. To assess if the lower variance estimate of the flexible parametric survival, compared with the Kaplan–Meier method and pseudo-observation affects negatively statistical inference, we assessed the coverage portability of the approximate 95% Wald-type confidence intervals. There is no indication that the lower variance estimate of the flexible parametric survival method negatively affects statistical inference compared to Kaplan–Meier method and pseudo-observation. Although this simulation is not comprehensive we can conclude that the used Wald-type statistics in statistical inference for R M S T should be used with caution when sample sizes are small and censoring percentage is high.
To gain further insights of the relative efficacy of the variance estimator a second set of simulation studies was run. First we simulated survival times with λ = 1 / 365 and γ = 1 / 365 with a sample size of 100 and a censoring percentage of 50%. We varied τ between 50 and 650 days and estimated μ τ and V a r ( μ τ ) with M-estimation, the Kaplan–Meier method, flexible parametric survival and pseudo-observations.
All estimators returned unbiased μ τ estimates. As Figure 1a shows, the Kaplan–Meier method, flexible parametric survival and pseudo-observations have lower efficacy compared to parametric estimate of the variance. The estimated increases with longer restriction times and declines after a maximum is reached approximately at 600 days, when around 80% of the subjects should have experienced the event of interest.
Similarly, we have assessed how the censoring percentage affects efficacy when everything else is kept constant. We simulated survival times with λ = 1 / 365 , sample size of 100 and set the restriction time at 365 days.
We estimated γ , the censoring rate with Equation (5). The pattern was similar to the previous setting. The efficacy increased up to around 65% censoring rate, thereafter declined again.

4.2. Estimation with Unknown Distribution Function

Practitioners rarely, if ever, know the distribution of survival times, which makes parametric estimation difficult, if not impossible. Here, we aim to assess the precision, efficacy and confidence internal coverage, for the Kaplan–Meier method, pseudo-observations and flexible parametric survival methods. For the former two, distributional assumptions are less important. For flexible parametric survival methods with an assumed Weibull survival function, deviations from the assumptions might affect the outcome. In a first set of simulations, we simulated survival times with the log-logistic distribution, a distribution used in accelerated failure models. The hazard of the log-logistic distribution can take non-monotonic forms depending on the shape parameter. We assumed shape of 1.5 and a scale of 151, giving a mean survival time of approximately 365 days. Thereafter, we simulated survival times with a mixture distribution with f ( t ) = 0.5 exp ( 1 / 265 ) + 0.5 exp ( 1 / 465 ) . In both cases we assumed exponential censoring with a rate of γ = 1 / 365 . Table 2 summarizes the results of the simulation studies with 1000 runs. Generally, flexible parametric survival methods outperform both the Kaplan–Meier method and pseudo-observations without compromising the coverage probability of the 95% confidence intervals.

4.3. Parametric Estimation under Model Misspecification

As we have seen in the previous subsection, parametric estimation outperforms non-parametric estimation, at least when the correct model is specified. However, this is rarely possible. In this subsection, we assess the parametric estimators and associated variances under model misspecification. In a first set of simulations, we simulated survival times with the log-logistic distribution as described above (Section 4.2). When the distribution of survival times is unknown, it is common to assume an Exponential distribution. While the Kaplan - Meier method was practically unbiased, the parametric estimator overestimated μ τ . Of the two variance estimator the δ -method had the smallest estimates. This is expected as sandwich variance estimates are robust against model misspecification at the expanse of efficacy (Table 3).
See Table 3. In a second set of simulation, we simulated survival times according to a Weibull distribution with shape (k) and scale ( λ ) parameters. We assumed exponential censoring with rate 1 / 365 . We varied the scale parameter between 0.5 and 3. The Shape parameter was set so that the mean of Weibull survival times should equal 365, the mean of exponential survival time with rate 1 / 365 . We estimated λ with numerical root finding, by solving the equation λ Γ ( 1 + k 1 ) = 365 . At k = 1 , λ = 365 and the Weibull distribution simplifies to an Exponential distribution with rate 1/365. At these values parametric R M T S estimation assuming Exponential survival distribution returned, as expected, nearly unbiased estimates. However, as k deviated from 1, the bias increased. The sign and magnitude of bias is dependent on the direction and magnitude of deviation from k = 1 (Figure 2).

5. Discussion

In this paper we presented the main estimators for R M S T and associated variances. Additionally, with the help of simulation studies, we examined their small sample properties. As expected, the results of our simulations corroborated that parametric methods of estimation are more effective than non-parametric estimation. This is in line with previous findings about the Kaplan–Meier curve [2] or semi-parametric Cox-regression [3]. Naturally, just as Meier et al. [4] concluded, establishing the proper parametric form is difficult. Since the work of Meier et al., important advancements have been made in the field. The Focused Information Criterion ( F I C ) have been proposed for comparing general non-nested parametric models with a non-parametric alternative [28]. Application of F I C requires advanced statistical/technical knowledge and it is possible that survival times do not follow a known and well-established distribution. Thus, it is likely that in practical applications, parametric estimators for R M S T will be rare.
The numerical results and the patterns of efficacy as a function of censoring percentage are similar to the ones reported by Meier et al. [4] and by Jullum and Hjort [3]. Independently of the employed method, we could see that a small sample size and a high percentage of censoring, result in genuine departure from nominal coverage values of the approximate Wald-type confidence interval. This departure from the nominal coverage of the 95% confidence intervals is not a fallacy of the R M S T and associated variance estimators, more it is a sign of the inappropriateness of Wald-type intervals and statistics. Lawrence et al. [29] described this deviation from the normal distribution in small samples and proposed an approximation to the correct null distribution using the Cornish–Fisher expansion. In medium to large samples the normal approximation is expected to hold. Our simulation suggests that in samples 100 statistical inference will be correct. This value of 100 should not be taken as a proposed threshold. This is data dependent. Schall [30] proposed a bootstrap based framework to assess if the coverage probability of confidence intervals approximates the intended nominal coverage. Alternatively, one could either correct null distribution as Lawrence et al. [29] proposed or employ computer intensive methods. Jackknife estimate, that is behind numerous important theoretical results about R M S T [21,31,32] could be a natural option or bootstrapping that has a straightforward application in survival analysis [33,34]. Nevertheless, bootstrapping R M S T requires extra care. The parameter space for R M S T is ( 0 , τ ] and R M S T can be close to τ . Bootstrapping parameters on the boundary of the parameter space can return inconsistent results [35,36]. This is not a surprising result. While we assume normal distribution for μ τ , its parameter space is restricted to ( 0 , τ ] . A Wald-type 95% confidence interval would require that 2.5% of the distribution would be at either side of the interval. As μ τ τ the right tail of the assumed normal distribution will include more and more values that are larger than τ . Thus the confidence interval might include values outside the parameter space. If μ τ N ( μ τ ^ , V a r ^ ( μ τ ) ) , then P ( μ τ τ ) determines the values of the μ τ under the assumed distribution that falls outside of ( 0 , τ ] . P ( μ τ τ ) is directly influenced by V a r ^ ( μ τ ) ) and decreases with rate n as the sample size increases. In either case, further studies are needed to establish routines of statistical inference. Extra care is also needed in inferring from R M S T based studies with Wald-type confidence intervals or Z-scores. Applied studies, while they do present R M S T and associated confidence intervals, are interested in comparing different treatment regimens. This is done by calculating difference between two R M S T estimates D , or the ratio between them R . Neither D or R is affected by the boundary value problems in realistic situations. The parameter space for D is ( τ , τ ) and the D approaches the boundaries if and only if R M S T for one arm is close to zero and the R M S T for the competing treatment should be close to τ . Under similar circumstances R approaches to its boundary values of ( 0 , ) .
One aspect that practitioners face when planing a clinical trial with R M S T as their method of choice is deciding upon τ . The choice of the time window ( 0 , t ] may be prespecified at the design stage of the study based on clinical considerations or the choice of time point τ could be data-dependent after the data is collected [37]. The choice of τ has direct implications on statistical power [38] and as we highlighted statistical power can be maximized by considering the intricate relationship between the choice of τ and the censoring percentage.
Although we observed differences in the magnitude of the variance between the Kaplan - Meier method, pseudo-observations and flexible parametric survival methods the practical implications are likely limited. The differences between the variance estimators might be numerically large however when compared to the magnitude of the R M S T estimate, these differences will have limited practical implications. Thus, we refrain from suggesting one or the other method as the preferred one. All three methods have arguments that spoke for them. The Kaplan- Meier method is probably the most used and best known. Its application is simple and straightforward and require limited programming knowledge to implement on the top of Kaplan–Meier curve estimator, if not readily available in the preferred software. The flexible parametric survival method takes the idea behind the Kaplan–Meier method one step further by assuming a parametric form and fully define a smoothed version of the survival curve. Estimation assumes definition of the survival distribution. Additionally, to increase flexibility and model fit, the survival function can be modeled as a natural cubic spline function of log time.
Standard statistical software provide readily available routines for estimation [39,40]. An approach similar in philosophy was recently proposed by Luo et al. [41] based on piecewise exponential survival and censoring times. This approach does not require smoothing of the survival curve, however the exponential assumption facilitates design and monitoring of survival trials based on R M S T . Both estimation with the Kaplan–Meier method and flexible parametric survival method allows adjustment of the survival curves for covariates. However, interpretation for continuous variables is difficult. The third approach, pseudo-observations, is also readily available in most statistical software [42,43,44]. With pseudo-observation covariates can be regressed on restricted survival times with linear regression. While the estimates of the Cox-regression or flexible parametric survival regression are hazard rates, regression on pseudo-observation returns regression coefficients on the time scale, i.e. they directly quantify the change in restricted survival times as a function of change in the covariates.
In conclusion, we suggest researchers interested in using R M S T to analyse survival data to adopt the method that best fit their purpose. If ease of implementation is paramount, then the Kaplan–Meier method is preferred. To increase statistical power, flexible parametric survival models could be the proper option as they show superior efficacy compared to the other practical applicable methods. Clinical studies that aim to assess the effect of treatments/exposures adjusted for relevant covariates could take benefit from considering pseudo-observations as their tool of preference. Naturally, parametric method of estimation and inference can be considered. However researchers should be extra cautious about violation of assumptions.

Author Contributions

Conceptualization, S.N., E.B. and A.G.; methodology, S.N.; writing—original draft preparation, S.N., E.B. and A.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding

Acknowledgments

The authors would like to thank the anonymous referees for their constructive comments and suggestions, that led to improvements in the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kaplan, E.L.; Meier, P. Nonparametric estimation from incomplete observations. J. Am. Stat. Assoc. 1958, 53, 457–481. [Google Scholar] [CrossRef]
  2. Miller, R.G., Jr. What price kaplan-meier? Biometrics 1983, 1077–1081. [Google Scholar] [CrossRef] [Green Version]
  3. Jullum, M.; Hjort, N.L. What price semiparametric Cox regression? Lifetime Data Anal. 2019, 25, 406–438. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Meier, P.; Karrison, T.; Chappell, R.; Xie, H. The price of Kaplan–Meier. J. Am. Stat. Assoc. 2004, 99, 890–896. [Google Scholar] [CrossRef]
  5. Stute, W. The central limit theorem under random censorship. Ann. Stat. 1995, 23, 422–439. [Google Scholar] [CrossRef]
  6. Stute, W. Kaplan–meier integrals. Handbook Stat. 2003, 23, 87–104. [Google Scholar]
  7. Akritas, M.G. The central limit theorem under censoring. Bernoulli 2000, 6, 1109–1120. [Google Scholar] [CrossRef] [Green Version]
  8. Uno, H.; Wittes, J.; Fu, H.; Solomon, S.D.; Claggett, B.; Tian, L.; Cai, T.; Pfeffer, M.A.; Evans, S.R.; Wei, L.J. Alternatives to hazard ratios for comparing efficacy or safety of therapies in noninferiority studies. Ann. Intern. Med. 2015, 163, 127. [Google Scholar] [CrossRef] [Green Version]
  9. Uno, H.; Claggett, B.; Tian, L.; Inoue, E.; Gallo, P.; Miyata, T.; Schrag, D.; Takeuchi, M.; Uyama, Y.; Zhao, L.; et al. Moving beyond the hazard ratio in quantifying the between-group difference in survival analysis. J. Clin. Oncol. 2014, 32, 2380. [Google Scholar] [CrossRef]
  10. Hasegawa, T.; Misawa, S.; Nakagawa, S.; Tanaka, S.; Tanase, T.; Ugai, H.; Wakana, A.; Yodo, Y.; Tsuchiya, S.; Suganami, H.; et al. Restricted mean survival time as a summary measure of time-to-event outcome. Pharm. Stat. 2020. [Google Scholar] [CrossRef]
  11. Stensrud, M.J.; Aalen, J.M.; Aalen, O.O.; Valberg, M. Limitations of hazard ratios in clinical trials. Eur. Heart J. 2019. [Google Scholar] [CrossRef] [PubMed]
  12. Meier, P. Estimation of a distribution function from incomplete observations. J. Appl. Probab. 1975, 12, 67–87. [Google Scholar] [CrossRef]
  13. Andersen, P.K.; Klein, J.P.; Rosthøj, S. Generalised linear models for correlated pseudo-observations, with applications to multi-state models. Biometrika 2003, 90, 15–27. [Google Scholar] [CrossRef]
  14. Andersen, P.K.; Borgan, O.; Gill, R.D.; Keiding, N. Statistical Models Based on Counting Processes; Springer Science & Business Media: Berlin, Germany, 2012. [Google Scholar]
  15. Andersen, P.K.; Pohar Perme, M. Pseudo-observations in survival analysis. Stat. Methods Med. Res. 2010, 19, 71–99. [Google Scholar] [CrossRef]
  16. Royston, P.; Parmar, M.K. The use of restricted mean survival time to estimate the treatment effect in randomized clinical trials when the proportional hazards assumption is in doubt. Stat. Med. 2011, 30, 2409–2421. [Google Scholar] [CrossRef]
  17. Stefanski, L.A.; Boos, D.D. The calculus of M-estimation. Am. Stat. 2002, 56, 29–38. [Google Scholar] [CrossRef]
  18. Wang, J.L. Asymptotic Properties of M-estimators Based on Estimating Equations and Censored Data. Scand. Stat. Theory Appl. 1999, 26, 297–318. [Google Scholar] [CrossRef]
  19. Boos, D.D.; Stefanski, L. Likelihood Construction and Estimation. In Essential Statistical Inference; Springer: Berlin, Germany, 2013; pp. 27–124. [Google Scholar]
  20. Royston, P.; Parmar, M.K. Restricted mean survival time: An alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome. BMC Med. Res. Methodol. 2013, 13, 152. [Google Scholar] [CrossRef] [Green Version]
  21. Stute, W. The jackknife estimate of variance of a Kaplan-Meier integral. Ann. Stat. 1996, 24, 2679–2704. [Google Scholar] [CrossRef]
  22. Royston, P.; Parmar, M.K. Flexible parametric proportional-hazards and proportional-odds models for censored survival data, with application to prognostic modelling and estimation of treatment effects. Stat. Med. 2002, 21, 2175–2197. [Google Scholar] [CrossRef]
  23. Irwin, J. The standard error of an estimate of expectation of life, with special reference to expectation of tumourless life in experiments with mice. Epidemiol. Infect. 1949, 47, 188–189. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Klein, J.P.; Moeschberger, M.L. Survival Analysis: Techniques for Censored and Truncated Data; Springer Science & Business Media: Berlin, Germany, 2006. [Google Scholar]
  25. Eaton, A.; Therneau, T.; Le-Rademacher, J. Designing clinical trials with (restricted) mean survival time endpoint: Practical considerations. Clin. Trials. 2020, 1740774520905563. [Google Scholar] [CrossRef] [PubMed]
  26. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2019. [Google Scholar]
  27. Nadarajah, S. Reliability for lifetime distributions. Math. Comput. Model. 2003, 37, 683–688. [Google Scholar] [CrossRef]
  28. Jullum, M.; Hjort, N.L. Parametric or nonparametric: The FIC approach. Stat. Sin. 2017, 951–981. [Google Scholar] [CrossRef] [Green Version]
  29. Lawrence, J.; Qiu, J.; Bai, S.; Hung, H.J. Difference in Restricted Mean Survival Time: Small Sample Distribution and Asymptotic Relative Efficiency. Stat. Biopharm. Res. 2019, 11, 61–66. [Google Scholar] [CrossRef]
  30. Schall, R. The empirical coverage of confidence intervals: Point estimates and confidence intervals for confidence levels. Biom J. 2012, 54, 537–551. [Google Scholar] [CrossRef] [PubMed]
  31. Stute, W.; Wang, J.L. The jackknife estimate of a Kaplan—Meier integral. Biometrika 1994, 81, 602–606. [Google Scholar]
  32. Azarang, L.; de Uña-Álvarez, J.; Stute, W. The Jackknife estimate of covariance of two Kaplan–Meier integrals with covariables. Stats 2015, 49, 1005–1025. [Google Scholar] [CrossRef]
  33. Akritas, M.G. Bootstrapping the Kaplan—Meier Estimator. J. Am. Stat. Assoc. 1986, 81, 1032–1038. [Google Scholar]
  34. Burr, D. A comparison of certain bootstrap confidence intervals in the Cox model. J. Am. Stat. Assoc. 1994, 89, 1290–1302. [Google Scholar] [CrossRef]
  35. Deheuvels, P.; Mason, D.M.; Shorack, G.R. Some results on the influence of extremes on the bootstrap. Ann. l’I.H.P. Prob. Stat. 1993, 29, 83–103. [Google Scholar]
  36. Andrews, D.W. Inconsistency of the bootstrap when a parameter is on the boundary of the parameter space. Econometrica 2000, 68, 399–405. [Google Scholar] [CrossRef]
  37. Tian, L.; Jin, H.; Uno, H.; Lu, Y.; Huang, B.; Anderson, K.M.; Wei, L. On the empirical choice of the time window for restricted mean survival time. Biometrics 2020. [Google Scholar] [CrossRef] [PubMed]
  38. Huang, B.; Kuan, P.F. Comparison of the restricted mean survival time with the hazard ratio in superiority trials with a time-to-event end point. Pharm. Stat. 2018, 17, 202–213. [Google Scholar] [CrossRef]
  39. Jackson, C. flexsurv: A Platform for Parametric Survival Modeling in R. J. Stat. Softw. 2016, 70, 1–33. [Google Scholar] [CrossRef] [Green Version]
  40. Lambert, P.C.; Royston, P. Further development of flexible parametric models for survival analysis. Stata J. 2009, 9, 265–290. [Google Scholar] [CrossRef] [Green Version]
  41. Luo, X.; Huang, B.; Quan, H. Design and monitoring of survival trials based on restricted mean survival times. Clin. Trials 2019, 16, 616–625. [Google Scholar] [CrossRef]
  42. Parner, E.T.; Andersen, P.K. Regression analysis of censored data using pseudo-observations. Stata J. 2010, 10, 408–422. [Google Scholar] [CrossRef] [Green Version]
  43. Overgaard, M.; Andersen, P.K.; Parner, E.T. Regression analysis of censored data using pseudo-observations: An update. Stata J. 2015, 15, 809–821. [Google Scholar] [CrossRef] [Green Version]
  44. Klein, J.P.; Gerster, M.; Andersen, P.K.; Tarima, S.; Perme, M.P. SAS and R functions to compute pseudo-values for censored data regression. Comput. Methods Programs Biomed. 2008, 89, 289–300. [Google Scholar] [CrossRef] [Green Version]
Figure 1. The effect of restriction time (a) and censoring percentage (b) on the efficacy of the three commonly used R M S T estimation methods.
Figure 1. The effect of restriction time (a) and censoring percentage (b) on the efficacy of the three commonly used R M S T estimation methods.
Stats 03 00010 g001
Figure 2. Bias under model misspecification of the parametric estimator as a function of restriction time (a) and censoring percentage (b).
Figure 2. Bias under model misspecification of the parametric estimator as a function of restriction time (a) and censoring percentage (b).
Stats 03 00010 g002
Table 1. Efficacy ( eff ) and 95 % confidence interval coverage probabilities of the different Restricted Mean Survival Time ( R M S T ) estimators as a function of sample size (n) and censoring percentage ( c p ) at restriction time of 365 days, with the expected μ τ of 230 days.
Table 1. Efficacy ( eff ) and 95 % confidence interval coverage probabilities of the different Restricted Mean Survival Time ( R M S T ) estimators as a function of sample size (n) and censoring percentage ( c p ) at restriction time of 365 days, with the expected μ τ of 230 days.
δ -MethodM-EstimatorKaplan–MeierFlexible ParametricPseudo-Obs
n = 50, CP = 50 %
μ τ ^ 230.1230.1230.7231.2230.7
V a r ( μ τ ) ^ 368.0362.1437.5429.3446.9
eff0.98-0.820.840.81
Coverage0.940.940.930.930.94
n = 50, CP = 75 %
μ τ ^ 230.2230.2230.2228.0228.4
V a r ( μ τ ) ^ 724.6712.9792.1794.61068.7
eff0.98-0.900.900.66
Coverage0.930.920.890.890.90
n = 100, CP = 50 %
μ τ ^ 230.9230.9231.4231.6231.4
V a r ( μ τ ) ^ 185.2182.2221.2217.0223.4
eff0.98-0.830.840.81
Coverage0.960.950.950.950.95
n = 100, CP = 75 %
μ τ ^ 229.9229.9229.4229.1229.1
V a r ( μ τ ) ^ 367.0365.1421.8406.3485.6
eff0.99-0.860.890.75
Coverage0.950.940.920.940.94
Table 2. Efficacy and 95% confidence interval probabilities of the different R M S T estimators at the mean survival time of 365 days, with the expected μ τ of 182 days for the Log-logistic distribution and 225 days for the exponential distribution.
Table 2. Efficacy and 95% confidence interval probabilities of the different R M S T estimators at the mean survival time of 365 days, with the expected μ τ of 182 days for the Log-logistic distribution and 225 days for the exponential distribution.
Kaplan–MeierFlexible ParametricPseudo-Obs
Log-logistic, n = 50
μ τ ^ 183.4183.9183.4
V a r ( μ τ ) ^ 402.6397.6413.9
Coverage0.930.940.93
Log-logistic, n = 100
μ τ ^ 182.2182.7182.2
V a r ( μ τ ) ^ 201.7199.4204.2
Coverage0.940.940.94
Exp-mix, n = 50
μ τ ^ 224.5224.6224.5
V a r ( μ τ ) ^ 444.4435.4454.3
Coverage0.950.950.95
Exp-mix, n = 100
μ τ ^ 225.5225.7225.5
V a r ( μ τ ) ^ 224.8220.5227.1
Coverage0.960.950.96
Table 3. Bias and accuracy of the parametric estimation with variance estimated by the δ -method and M-estimation, together with comparative numbers for the Kaplan - Meier method under model misspecification.
Table 3. Bias and accuracy of the parametric estimation with variance estimated by the δ -method and M-estimation, together with comparative numbers for the Kaplan - Meier method under model misspecification.
μ τ ^ V a r ( μ τ ) ^ BiasMSE
τ = 50
Parametric( δ -met)45.110.35−1.482.54
Parametric (M-est)45.110.39−1.482.58
Kaplan–Meier46.630.910.040.91
τ = 365
Parametric ( δ -met)188.24191.426.04227.96
Parametric (M-est)188.24213.106.04249.63
Kaplan–Meier182.54203.060.35203.18

Share and Cite

MDPI and ACS Style

Nemes, S.; Bülow, E.; Gustavsson, A. A Brief Overview of Restricted Mean Survival Time Estimators and Associated Variances. Stats 2020, 3, 107-119. https://doi.org/10.3390/stats3020010

AMA Style

Nemes S, Bülow E, Gustavsson A. A Brief Overview of Restricted Mean Survival Time Estimators and Associated Variances. Stats. 2020; 3(2):107-119. https://doi.org/10.3390/stats3020010

Chicago/Turabian Style

Nemes, Szilárd, Erik Bülow, and Andreas Gustavsson. 2020. "A Brief Overview of Restricted Mean Survival Time Estimators and Associated Variances" Stats 3, no. 2: 107-119. https://doi.org/10.3390/stats3020010

APA Style

Nemes, S., Bülow, E., & Gustavsson, A. (2020). A Brief Overview of Restricted Mean Survival Time Estimators and Associated Variances. Stats, 3(2), 107-119. https://doi.org/10.3390/stats3020010

Article Metrics

Back to TopTop