Next Article in Journal
A New Extended Weibull Distribution with Application to Influenza and Hepatitis Data
Previous Article in Journal
Climate Change: Linear and Nonlinear Causality Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Interval-Censored Regression with Non-Proportional Hazards with Applications

by
Fábio Prataviera
1,†,
Elizabeth M. Hashimoto
2,†,
Edwin M. M. Ortega
1,*,†,
Taciana V. Savian
1,† and
Gauss M. Cordeiro
3,†
1
Department of Exact Sciences, “Luiz de Queiroz” School of Agriculture, University of São Paulo—ESALQ/USP, Piracicaba 13418-900, Brazil
2
Academic Department of Mathematics, Federal University of Technology, Londrina 86036-370, Brazil
3
Department of Statistics, Federal University of Pernambuco, Recife 50670-901, Brazil
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Stats 2023, 6(2), 643-656; https://doi.org/10.3390/stats6020041
Submission received: 18 April 2023 / Revised: 7 May 2023 / Accepted: 10 May 2023 / Published: 17 May 2023
(This article belongs to the Section Survival Analysis)

Abstract

:
Proportional hazards models and, in some situations, accelerated failure time models, are not suitable for analyzing data when the failure ratio between two individuals is not constant. We present a Weibull accelerated failure time model with covariables on the location and scale parameters. By considering the effects of covariables not only on the location parameter, but also on the scale, a regression should be able to adequately describe the difference between treatments. In addition, the deviance residuals adapted for data with the interval censored and the exact time of failure proved to be satisfactory to verify the fit of the model. This information favors the Weibull regression as an alternative to the proportional hazards models without masking the effect of the explanatory variables.

1. Introduction

Data that represent the time to event are characterized by the presence of censorship. In other words, they are partial observations of the response variable caused by the non-occurrence of the event of interest during the period under study. Interval-censored data arise when the occurrence time of the event is only known to have occurred in an interval. Interval censoring is a type of censorship that occurs naturally in studies in which the sample unit is evaluated periodically [1]. Such data appear, for example, in clinical or longitudinal experiments in which patients can be followed up only through periodic examinations. In this case, we do not know the exact occurrence time, but only that it occurs within an interval. Recently, some works on interval-censored data have been reported. In Betensky et al. [2], the authors introduced proportional hazards regressions, Calle and Gómez [3] used a Bayesian framework for analyzing regressions in which one of the covariables is interval-censored, Komárek and Lesaffre [4] proposed an accelerated failure time model, and Rodrigues et al. [5] discussed an alternative to the Kaplan–Meier method.
However, these models [2,6,7,8,9] are defined under the proportional hazards assumptions, i.e., the survival curves exhibit parallel behavior. However, these assumptions do not always hold.
In fact, what can be said is that most studies adopt the assumption of proportional risks, i.e., the survival curves do not intersect. For example, Cox’s model considers this assumption. In this research, the risks may be non-proportional.
On the other hand, in a study of dairy cow herds developed by the Department of Veterinary Medicine of the Federal University of Lavras, MG, in Brazil, to evaluate the effect of animal supplementation on the ovulation of dairy cows (interval-censored survival data), the survival curve of the control supplementation was crossed with the survival curve of the new supplementation (see Figure 1a). Another example refers to breast cancer data, where the goal is to compare the effects of radiotherapy alone versus radiotherapy and adjuvant chemotherapy on women. In Figure 1b, we present the graph of the survival curve with the two effects.
In both cases, proportional hazards models are not suitable for analyzing ovulation times and compare radiotherapy and radiotherapy + chemotherapy treatments, and as an alternative, we can consider accelerated failure time models. However, in some situations, the accelerated failure time models may not be able to capture the difference between treatments, possibly due to interference from the crossing of survival curves, as shown in Hashimoto et al. [1]. A solution to solve the effect of crossed survival curves (non-proportional hazards assumption) is to consider the scale parameter as a systematic component of the model.
In this context, we include the effects of covariables in the location and scale parameters in the Weibull accelerated model with two systematic components to analyze interval-censored data with non-proportional hazards. We use standard likelihood theory for inferential purposes. Another objective is to propose an extension of the deviance residuals [1] for interval-censored survival data without considering the proportional hazards assumption.
The paper is organized as follows. In Section 2, we propose the log-Weibull regression with two systematic components for the location and scale parameters for interval-censored data. The estimation of the parameters by maximum likelihood is addressed in Section 3. We provide a simulation study to check the accuracy of the estimates and residuals in Section 4. In Section 5, we empirically prove the utility of the new regression. We offer some concluding remarks in Section 6.

2. The Proposed Model

The Weibull and log-Weibull distributions are frequently adopted for analyzing censored data [10,11,12,13] and phenomena with monotone failure rates [14]. Let T W ( α , λ ) be the Weibull random variable, where α > 0 is the shape parameter and λ > 0 is the scale parameter. The probability density function (pdf) of the log-Weibull defined by Y = log ( T ) under the re-parametrization σ = α 1 > 0 (scale) and μ = log ( λ ) R (location), say Y LW ( μ , σ ) , has the form
f ( y ; μ , σ ) = 1 σ exp y μ σ exp y μ σ , y R .
The log-exponential distribution comes with σ = 1 .
The survival function corresponding to (1) is
S ( y ; μ , σ ) = exp exp y μ σ .
Clearly, the random variable Z = ( Y μ ) / σ has density
f ( z ) = exp [ z exp ( z ) ] , z R .
The accelerated failure time parametric model, also known as the location-scale regression [14], has been used for studying the effects of covariables on the response variable. Let Y 1 , , Y n be independent random variables such that Y i LW ( μ i , σ i ) . In addition, we consider the vectors of explanatory variables x i = ( 1 , x i 1 , , x i p ) and w i = ( 1 , w i 1 , , w i q ) . The linear location–scale regression for interval-censored data is defined as by (for i = 1 , , n )
Y i = μ i + σ i Z i ,
where μ i = x i β 1 and log ( σ i ) = w i β 2 , the random error Z i has density function (3), and β 1 = ( β 10 , , β 1 p ) and β 2 = ( β 20 , , β 2 q ) are functionally independent.
Thus, we associate a systematic component with the parameter σ to allow us to model data with non-proportional risks and the presence of heteroscedasticity. Equation (4) can be useful for presenting such data.
For interval-censored data, the logarithms of the failure times Y i (for i = 1 , , n ) are not observed exactly, only that the event occurred at some moment in a interval time of the type ( log ( U i ) , log ( V i ) ] , where log ( U i ) < Y i < log ( V i ) . An exact failure time is observed if log ( U i ) = log ( V i ) . In practice, the values of log ( U i ) and log ( V i ) refer to evaluation times, and hence for some individuals the event can occur after the last visit to the evaluator, thus characterizing observations subject to right censoring, where the failure time Y i occurs in the interval [ log ( U i ) , ) . A summary of the structure of the data is given below:
  • Exact failure time Y i = y i .
  • Right-censored Y i [ log ( U i ) , ) .
  • Interval-censored Y i ( log ( U i ) , log ( V i ) ] .

3. Estimation

We are only interested in models with an exact failure time and interval censoring. Let ( log ( u 1 ) , log ( v 1 ) , x 1 , w 1 ) , ( log ( u n ) , log ( v n ) , x n , w n ) be a set of interval-censored observations. The log-likelihood function of the log-Weibull regression for an exact failure time and interval-censored data has the form
l ( θ ) = i E w i β 2 + log ( u i ) x i β 1 exp ( w i β 2 ) exp log ( u i ) x i β 1 exp ( w i β 2 ) i R exp log ( u i ) x i β 1 exp ( w i β 2 ) + i I log exp exp log ( u i ) x i β 1 exp ( w i β 2 ) exp exp log ( v i ) x i β 1 exp ( w i β 2 ) ,
where θ = ( β 1 , β 2 ) is the vector of parameters, and E, R, and I denote sets of elements with exact failure time, right censoring and interval censoring, respectively.
The maximum likelihood estimate (MLE) θ ^ of θ can be found by maximizing (5) in numerical platforms such as Ox 8.00 (MaxBFGS function) or R Version 3.6.3.
Inference can be based on the ( p + q + 2 ) × ( p + q + 2 ) observed information matrix J = J ( θ ) . The asymptotic distribution of ( θ ^ θ ) can   be   considered N p + q + 2 ( 0 , J ( θ ) 1 ) under first-order asymptotes. We construct confidence intervals for the parameters based on the approximate multivariate normal N p + q + 2 ( 0 , J ( θ ^ ) 1 ) distribution of θ ^ .

4. Modified Deviance Residuals

Various types of residuals have been proposed in the regression literature [14]. For some applications of censored data, see, for example, the log-Birnbaum–Saunders regression [15] and the generalized log-gamma regression [16]. Furthermore, [1] presented martingale residuals for interval-censored data. Following these ideas, we extend these residuals with no proportional risk assumption.
The observed sample { [ log ( u i ) , log ( v i ) ] , i = 1 , , n } is taken from Section 3. The martingale residuals [1,17,18] for interval-censored data have the form
r M i = 1 + log [ S ^ ( log ( u i ) ) ] if i E , log [ S ^ ( log ( u i ) ) ] if i R , S ^ ( log ( u i ) ) log [ S ^ ( log ( u i ) ) ] S ^ ( log ( v i ) ) log [ S ^ ( log ( v i ) ) ] S ^ ( log ( u i ) ) S ^ ( log ( v i ) ) if i I ,
where S ^ ( · ) is the estimated survival function. So, replacing S ^ ( · ) in Equation (2), we obtain the martingale residuals for the log-Weibull regression for interval-censored data and non-proportional hazards:
r M i = 1 exp ( z u i ) if i E , exp ( z u i ) if i R , exp [ z v i exp ( z v i ) ] exp [ z u i exp ( z u i ) ] exp [ exp ( z u i ) ] exp [ exp ( z v i ) ] if i I ,
where
z u i = log ( u i ) x i β ^ 1 exp ( w i β ^ 2 ) and z v i = log ( v i ) x i β ^ 1 exp ( w i β ^ 2 ) .
However, the martingale residuals are asymmetrical and have values less than or equal to one. In contrast, [1] used an adaptation of the deviance residuals [17] for interval-censored data. Based on a combination of these residuals, we define the modified deviance residuals by
r D i = sgn ( r M i ) 2 r M i + log ( 1 r M i ) 1 / 2 if i E , I sgn ( r M i ) 2 r M i 1 / 2 if i R ,
where r M i are the martingale residuals given by (6).

Simulation Studies

  • Deviance residual
A first simulation study with 1000 replications analyzed the empirical distribution of the residuals (7) by combining n = 30 , 50, 100 and 300 with the interval censoring percentages 100%, 90%, and 70%. For each combination, the generated data follow the proposal by [1]:
(a)
The coefficients are fixed at β 10 = 3.00 , β 11 = 0.72 , β 20 = 0.71 and β 21 = 0.60 ;
(b)
The explanatory variable x 1 is generated from the binomial distribution with a success probability equal to 0.5 and 1, and μ i = β 10 + β 11 x 1 and σ i = exp ( β 20 + β 21 x 1 ) are computed;
(c)
The variable z = log [ log ( 1 u ) ] is generated from u U ( 0 , 1 ) , and the logarithm time from (4);
(d)
The logarithm censoring time is generated from a uniform distribution [ 0 , τ ] by fixing τ until right-censoring percentages of 0, 10%, or 30%;
(e)
The interval length is generated from a uniform U ( 1 , 3 ) distribution;
(f)
The limits of the log interval time are l i = y + ( 1 + k ) / 2 and l s = y ( 1 + k ) / 2 , where k is the length of the randomly chosen interval.
For each fit, we calculate the residuals r D i and construct the QQ plots displayed in Figure 2. Table 1 reports the averages of the MLEs of the parameters and mean square errors (MSEs) of the Weibull regression, respectively.
The numbers in Table 1 indicate that the MSEs of the estimates decay toward zero when n increases. However, for the estimate of σ , although its MSE is decreasing, the estimated bias does not change, which may be influenced by the presence of interval censoring. We can note in Figure 2, regardless of n, that there is a greater displacement of the residuals when the percentage of interval censorship decreases. This is because the sample is being contaminated by interval censoring.
2.
Estimating the survival function
We construct a second simulation study with 200 replicates to estimate the survival functions with covariables on the parameters μ and σ simultaneously and on the parameter μ .
For the simulation scenarios, we set the sample size n = 100 combined with the same censoring percentages of the previous study. The data are also generated in the same way given in the item a. However, using the same generated data, the estimation is conducted as follows:
(a)
Effects of the covariables on the parameters μ and σ simultaneously:
i
Initial values β 10 = 3.00 , β 11 = 0.72 , β 20 = 0.71 and β 21 = 0.60 ;
ii
Estimate the survival function by
S ^ ( y i ) = exp exp y i ( β ^ 10 + β ^ 11 x 1 i ) exp ( β ^ 20 + β ^ 21 x 1 i ) , i = 1 , 100 ,
where x 1 is generated from a binomial distribution with a success probability 0.5 and 1.
(b)
Effects of the covariables on the parameter μ :
i
Initial values β 10 = 3.00 , β 11 = 0.72 , and σ = 0.6129 , where the initial value for σ is the estimate given in Table 2;
ii
Estimate the survival function by
S ^ ( y i ) = exp exp y i ( β ^ 10 + β ^ 11 x 1 i ) σ ^ , i = 1 , 100 ,
where x 1 is generated from a binomial distribution with a success probability equal to 0.5 and 1.
Figure 3 shows that the systematic components for both parameters μ and σ can model the non-proportional hazards well. Figure 4 clearly reveals that we cannot model the non-proportional hazards by taking only the systematic component for μ .

5. Applications

We present two applications in different areas. The first application refers to experiments in veterinary science and the second in medicine.

5.1. Regression for the Supplementation Animal Data

The dataset refers to a study of dairy herds developed by the Department of Veterinary Medicine of the Federal University of Lavras. The objective was to verify if the supplementation offered to the herd was influencing the ovulation time of the animals. The experiment was carried out with fifty dairy cows divided into two groups: the control group corresponds to the animals without supplementation and the treatment group corresponds to the animals treated with supplementation to induce ovulation.
The response variable is the time (in days) after delivery until the first ovulation, but only for some animals was it possible to know the exact time of the first ovulation. For the other animals, the only information is the exact time that failure occurred in a time interval, which characterizes the presence of interval censoring. The following variables are considered (for i = 1 , , 50 ):
  • y i : Logarithm time after delivery until the first ovulation;
  • x i 1 : Treatment (0 = control, 1 = supplementation);
  • x i 2 : Ovary (0 = right, 1 = left);
  • x i 3 : Number of pups (0 = two pups, 1 = two more pups).
The dataset is formed by dichotomous covariables. The estimated survival curves in Figure 5 created using the Turnbull method examine the behavior of the covariables in relation to the logs of ovulation times. Figure 5 indicates that the assumption of proportional risks is not satisfied for the current dataset. So, we adopt the regression in Equation (4) to explain these data.
Consider the log-Weibull regression
y i = μ i + σ i z i ,
for the current data, where z i has the density function (3) and
μ i = β 10 + β 11 x i 1 + β 12 x i 2 + β 13 x i 3 and σ i = exp ( β 20 + β 21 x i 1 + β 22 x i 2 + β 23 x i 3 ) .
The MLEs from the two regressions are reported in Table 3. The p-value of 0.003 for the estimate of β 21 indicates a significant difference between the levels of the treatment to explain the variability of the logs of the failure times.
Figure 6a displays the random residuals within the interval ( 3 , 3 ) . The QQ plot with a generated envelope is reported in Figure 6b to verify the response distribution. Both plots support that the two components of the fitted regression are necessary to explain these data.
Plots of the empirical and estimated survival functions and the estimated hazard rates are given in Figure 7 and Figure 8, respectively. We note an increasing curve for the ovulation time data and the non-proportionality of the hazards, which supports the new regression with two components.
Interpretations
  • The levels of the control and supplementation of the treatment are different, explaining the variability in the log ovulation time.
  • From Figure 7b, we note that before exp ( 3.5 ) = 33 days (approximately), the treatment control level has a longer survival time than the supplementation level.
  • After 33 days, we note the opposite, i.e., the survival time of the supplementation level is longer than the control level in relation to the time of ovulation.
  • Thus, if the supplement is applied at longer intervals, the supplement level would have a longer survival time compared with the control.
  • We can also note that this change in 33 days is captured by the systematic part of the parameter σ .

5.2. Regression for Breast Cancer Data

We investigate the log-Weibull regression in the presence of interval-censored data [19] when proportional risks are not satisfied. These data are taken from a retrospective study reported by [20,21] to compare the cosmetic effects of radiotherapy alone versus radiotherapy and adjuvant chemotherapy on women with early breast cancer.
In this study, we consider the following variables (for i = 1 , , 94 ):
  • y i 1 : Logarithm of time (in months) to first appearance of moderate or severe breast retraction;
  • x i 1 : Type of treatment (0 = radiotherapy and chemotherapy, 1 = radiotherapy).
The dataset is composed of a dichotomous covariable. Figure 9 displays the estimated survival curves obtained using the Turnbull method to verify the behavior of this covariable in relation to the log retraction time.
The proposed regression for the breast cancer data takes the form
y i = μ i + σ i z i ,
where z i has the density function (3) and the parameters are
μ i = β 10 + β 11 x i 1 and σ i = exp ( β 20 + β 21 x i 1 ) .
Table 2 provides some results from the fitted regressions. For a 5% significance level, the retraction time has significantly different effects from radiotherapy and radiotherapy plus chemotherapy, considering both location and dispersion parameters.
Figure 10a gives the plots of the modified deviance residuals against the index. Figure 10b provides the QQ plot and generated envelope. These plots support the wider log-Weibull regression for the current data.
In Figure 11, we present the plots for the empirical and estimated survival functions for the log-Weibull regression. Figure 11a considers only the systematic component for μ , whereas Figure 11b considers the systematic components for μ and σ .
The estimated hrf displayed in Figure 12 indicates increasing shapes for the ovulation time. Figure 12a refers to just one component for μ , whereas Figure 12b refers to two components for μ and σ . These plots support the non-proportionality of the hrf and a regression with two components for a better fit to these data.
Interpretation for μ
  • There is a significant difference between the levels of radiotherapy and chemotherapy and radiotherapy in terms of the covariable treatment in relation to the log time of the first appearance of moderate or severe breast retraction.
Interpretations of σ
  • There is a significant difference between the levels of radiotherapy and chemotherapy and radiotherapy in terms of the covariable treatment in relation to the variability of the logarithm of the time of the first appearance of moderate or severe breast retraction.
  • We note from Figure 11b that before exp ( 2.5 ) = 12 months (approximately), the radiotherapy and chemotherapy level of treatment has a longer survival time than the radiotherapy level, but this difference is not significant.
  • After 12 months, we note the opposite, i.e., the survival time of the radiotherapy level is longer than that of the radiotherapy and chemotherapy level in relation to the time of the first appearance of moderate or severe breast retraction.
  • So, we note that 12 months of applying radiotherapy and chemotherapy to the patient makes them less immune.
  • We can also note that this change after 12 months is captured by the systematic part of σ .

6. Conclusions

We define and study a new log-Weibull regression for interval-censored data with two systematic components for the location and scale parameters, whose risks are not proportional. The parameters are estimated by maximum likelihood and some Monte Carlo simulations are used to investigate the accuracy of the estimates and the normal approximation for the deviance residuals. We show significant differences between two treatments for the supplementation of dairy cows. We emphasize the utility of the log-Weibull regression in two applications to real data. The datasets can be obtained by contacting the main author. Several future works can be considered, based on the assumption of non-proportional hazards; for example, the research by Hashimoto et al. [1] referring to the regression model based on the log-exponentiated Weibull distribution for interval-censored data can be generalized considering the assumption of non-proportional hazards. Analogously, we can extend the research presented by Hashimoto et al. [22] and, in this case, the extension will be related to regression models with a cure fraction for interval-censored data and non-proportional hazards. The study presented by Yang et al. [23] can be extended to non-proportional hazards models for interval-censored data considering two systematic components. Other future works may include, for example, the use of regression models with random effects for interval-censored data with non-proportional hazards in the form of group structures or correlated data, and, finally, the use of regression models for interval-censored data under the assumption of hazards not being proportional to informative censorship.

Author Contributions

All the authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Stated in the text.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hashimoto, E.M.; Ortega, E.M.; Cancho, V.G.; Cordeiro, G.M. The log-exponentiated Weibull regression model for interval-censored data. Comput. Stat. Data Anal. 2010, 54, 1017–1035. [Google Scholar] [CrossRef]
  2. Betensky, R.A.; Lindsey, J.C.; Ryan, L.M.; Wand, M. A local likelihood proportional hazards model for interval censored data. Stat. Med. 2002, 21, 263–275. [Google Scholar] [CrossRef] [PubMed]
  3. Calle, M.L.; Gómez, G. A semiparametric hierarchical method for a regression model with an interval-censored covariate. Aust. N. Z. J. Stat. 2005, 47, 351–364. [Google Scholar] [CrossRef]
  4. Komárek, A.; Lesaffre, E. The regression analysis of correlated interval-censored data: Illustration using accelerated failure time models with flexible distributional assumptions. Stat. Model. 2009, 9, 299–319. [Google Scholar] [CrossRef]
  5. Rodrigues, A.S.; Calsavara, V.F.; Silva, F.I.; Alves, F.A.; Vivas, A.P. Use of interval-censored survival data as an alternative to Kaplan-Meier survival curves: Studies of oral lesion occurrence in liver transplants and cancer recurrence. Appl. Cancer Res. 2018, 38, 16. [Google Scholar] [CrossRef]
  6. Finkelstein, D.M. A proportional hazards model for interval-censored failure time data. Biometrics 1986, 42, 845–854. [Google Scholar] [CrossRef] [PubMed]
  7. Heller, G. Proportional hazards regression with interval censored data using an inverse probability weight. Lifetime Data Anal. 2011, 17, 373–385. [Google Scholar] [CrossRef] [PubMed]
  8. Ma, L.; Hu, T.; Sun, J. Cox regression analysis of dependent interval-censored failure time data. Comput. Stat. Data Anal. 2016, 103, 79–90. [Google Scholar] [CrossRef]
  9. Ahn, S.; Lim, J.; Paik, M.C.; Sacco, R.L.; Elkind, M.S. Cox model with interval-censored covariate in cohort studies. Biom. J. 2018, 60, 797–814. [Google Scholar] [CrossRef] [PubMed]
  10. Gurm, H.S.; Yadav, J.S.; Fayad, P.; Katzen, B.T.; Mishkel, G.J.; Bajwa, T.K.; Ansel, G.; Strickman, N.E.; Wang, H.; Cohen, S.A.; et al. Long-term results of carotid stenting versus endarterectomy in high-risk patients. N. Engl. J. Med. 2008, 358, 1572–1579. [Google Scholar] [CrossRef] [PubMed]
  11. Robert, C.; Dummer, R.; Gutzmer, R.; Lorigan, P.; Kim, K.B.; Nyakas, M.; Arance, A.; Liszkay, G.; Schadendorf, D.; Cantarini, M.; et al. Selumetinib plus dacarbazine versus placebo plus dacarbazine as first-line treatment for BRAF-mutant metastatic melanoma: A phase 2 double-blind randomised study. Lancet Oncol. 2013, 14, 733–740. [Google Scholar] [CrossRef] [PubMed]
  12. Li, Y.; Zhang, Q. A Weibull multi-state model for the dependence of progression-free survival and overall survival. Stat. Med. 2015, 34, 2497–2513. [Google Scholar] [CrossRef] [PubMed]
  13. Czarnecka, A.M.; Brodziak, A.; Sobczuk, P.; Dendek, C.; Labochka, D.; Korniluk, J.; Bartnik, E.; Szczylik, C. Metastatic tumor burden and loci as predictors of first line sunitinib treatment efficacy in patients with renal cell carcinoma. Sci. Rep. 2019, 9, 7754. [Google Scholar] [CrossRef] [PubMed]
  14. Lawless, J.F. Statistical Models and Methods for Lifetime Data; John Wiley & Sons: Hoboken, NJ, USA, 2011; Volume 362. [Google Scholar]
  15. Leiva, V.; Barros, M.; Paula, G.A.; Galea, M. Influence diagnostics in log-Birnbaum–Saunders regression models with censored data. Comput. Stat. Data Anal. 2007, 51, 5694–5707. [Google Scholar] [CrossRef]
  16. Ortega, E.M.; Paula, G.A.; Bolfarine, H. Deviance residuals in generalised log-gamma regression models with censored observations. J. Stat. Comput. Simul. 2008, 78, 747–764. [Google Scholar] [CrossRef]
  17. Therneau, T.M.; Grambsch, P.M.; Fleming, T.R. Martingale-based residuals for survival models. Biometrika 1990, 77, 147–160. [Google Scholar] [CrossRef]
  18. Farrington, C. Residuals for proportional hazards models with interval-censored survival data. Biometrics 2000, 56, 473–482. [Google Scholar] [CrossRef] [PubMed]
  19. Klein, J.P.; Moeschberger, M.L. Survival Analysis: Techniques for Censored and Truncated Data; Springer: Berlin/Heidelberg, Germany, 2003; Volume 1230. [Google Scholar]
  20. Beadle, G.F.; Silver, B.; Botnick, L.; Hellman, S.; Harris, J.R. Cosmetic results following primary radiation therapy for early breast cancer. Cancer 1984, 54, 2911–2918. [Google Scholar] [CrossRef] [PubMed]
  21. Beadle, G.F.; Come, S.; Henderson, I.C.; Silver, B.; Hellman, S.; Harris, J.R. The effect of adjuvant chemotherapy on the cosmetic results after primary radiation treatment for early stage breast cancer. Int. J. Radiat. Oncol. Biol. Phys. 1984, 10, 2131–2137. [Google Scholar] [CrossRef] [PubMed]
  22. Hashimoto, E.M.; Ortega, E.M.; Cordeiro, G.M.; Cancho, V.G. A new long-term survival model with interval-censored data. Sankhya B 2015, 77, 207–239. [Google Scholar] [CrossRef]
  23. Yang, C.; Diao, L.; Cook, R. Survival trees for current status data. In Proceedings of the Survival Prediction-Algorithms, Challenges and Applications, Palo Alto, CA, USA, 22–24 March 2021; pp. 83–94. [Google Scholar]
Figure 1. Estimated survival curve by Turbull method: (a) For control and new supplementation. (b) By radiotherapy and radiotherapy + chemotherapy treatments.
Figure 1. Estimated survival curve by Turbull method: (a) For control and new supplementation. (b) By radiotherapy and radiotherapy + chemotherapy treatments.
Stats 06 00041 g001
Figure 2. QQ plots for the residuals.
Figure 2. QQ plots for the residuals.
Stats 06 00041 g002
Figure 3. Estimated survival function with covariables on the parameters μ and σ : (a) Interval-censored 100%. (b) Interval-censored 90%. (c) Interval-censored 70%.
Figure 3. Estimated survival function with covariables on the parameters μ and σ : (a) Interval-censored 100%. (b) Interval-censored 90%. (c) Interval-censored 70%.
Stats 06 00041 g003
Figure 4. Estimated survival function with covariables on the parameter μ : (a) Interval-censored 100%. (b) Interval-censored 90%. (c) Interval-censored 70%.
Figure 4. Estimated survival function with covariables on the parameter μ : (a) Interval-censored 100%. (b) Interval-censored 90%. (c) Interval-censored 70%.
Stats 06 00041 g004
Figure 5. Estimated survival curve by the Turbull method for supplementation animal data: (a) Treatment ( x 1 ). (b) Ovary ( x 2 ). (c) Number of pups ( x 3 ).
Figure 5. Estimated survival curve by the Turbull method for supplementation animal data: (a) Treatment ( x 1 ). (b) Ovary ( x 2 ). (c) Number of pups ( x 3 ).
Stats 06 00041 g005
Figure 6. (a) Index plot of the residuals. (b) QQ plot with an envelope.
Figure 6. (a) Index plot of the residuals. (b) QQ plot with an envelope.
Stats 06 00041 g006
Figure 7. Estimated survival curve by the Turbull method for ovulation time data: (a) Systematic component for μ . (b) Systematic components for μ and σ .
Figure 7. Estimated survival curve by the Turbull method for ovulation time data: (a) Systematic component for μ . (b) Systematic components for μ and σ .
Stats 06 00041 g007
Figure 8. Estimated hazard rate curve for ovulation time data: (a) Systematic component for μ . (b) Systematic components for μ and σ .
Figure 8. Estimated hazard rate curve for ovulation time data: (a) Systematic component for μ . (b) Systematic components for μ and σ .
Stats 06 00041 g008
Figure 9. Estimated survival curve by the Turbull method for the breast cancer data by treatment: radiotherapy and radiotherapy + chemotherapy.
Figure 9. Estimated survival curve by the Turbull method for the breast cancer data by treatment: radiotherapy and radiotherapy + chemotherapy.
Stats 06 00041 g009
Figure 10. (a) Index plot of the residuals. (b) QQ plot with envelope.
Figure 10. (a) Index plot of the residuals. (b) QQ plot with envelope.
Stats 06 00041 g010
Figure 11. Estimated survival curve by the Turbull method for breast cancer: (a) One component for μ . (b) Two components for μ and σ .
Figure 11. Estimated survival curve by the Turbull method for breast cancer: (a) One component for μ . (b) Two components for μ and σ .
Stats 06 00041 g011
Figure 12. Estimated hazard rate curves for breast cancer: (a) One component for μ . (b) Two components for μ and σ .
Figure 12. Estimated hazard rate curves for breast cancer: (a) One component for μ . (b) Two components for μ and σ .
Stats 06 00041 g012
Table 1. Averages of the MLEs and MSE (in parentheses) for the parameters of the Weibull regression model for interval-censored data.
Table 1. Averages of the MLEs and MSE (in parentheses) for the parameters of the Weibull regression model for interval-censored data.
nParametersActual Values0%10%30%
30 β 10 3.002.9158 (0.0284)2.9313 (0.0279)2.9354 (0.0352)
β 11 0.720.7444 (0.0879)0.7286 (0.0965)0.7275 (0.1245)
β 20 −0.71−1.1733 (0.8250)−1.2731 (1.3701)−1.4154 (2.2010)
β 21 0.600.8623 (0.7849)0.9578 (1.3332)1.0628 (2.2225)
50 β 10 3.002.9260 (0.0114)2.9304 (0.0132)2.9411 (0.0172)
β 11 0.720.7416 (0.0456)0.7427 (0.0573)0.7375 (0.0714)
β 20 −0.71−1.0239 (0.1120)−1.0507 (0.2259)−1.0612 (0.3456)
β 21 0.600.7701 (0.1679)0.7848 (0.2642)0.7950 (0.3919)
100 β 10 3.002.9401 (0.0055)2.9443 (0.00964)2.9571 (0.0086)
β 11 0.720.7408 (0.0216)0.7372 (0.0275)0.7302 (0.0353)
β 20 −0.71−0.9714 (0.0390)−0.9680 (0.0382)−0.9461 (0.0429)
β 21 0.600.7502 (0.0554)0.7479 (0.0597)0.7319 (0.0678)
300 β 10 3.002.9437 (0.0018)2.9468 (0.0022)2.9603 (0.0028)
β 11 0.720.7438 (0.0075)0.7436 (0.0086)0.7398 (0.0117)
β 20 −0.71−0.9381 (0.0098)−0.9263 (0.0102)−0.8997 (0.0112)
β 21 0.600.7335 (0.0156)0.7257 (0.0170)0.7101 (0.0204)
Table 2. Some findings from the fitted regressions.
Table 2. Some findings from the fitted regressions.
One Component for μ Two Components for μ and σ
ParameterEstimateSE p -ValueBEstimateSE p -Value
β 10 3.33110.1053<0.0001 β 10 3.33290.0852<0.0001
β 11 0.56480.17380.0016 β 11 0.71930.24180.0037
σ 0.61290.0721- β 20 −0.70880.1407<0.0001
β 21 0.59520.25580.0221
2 l ( θ ) = 286.0AIC = 292.0 2 l ( θ ) = 280.2AIC = 288.2
Table 3. Some findings from the fitted log-Weibull regression.
Table 3. Some findings from the fitted log-Weibull regression.
Only One Component for μ Two Components for μ and σ
ParameterEstimateSE p -ValueParameterEstimateSE p -Value
β 10 3.6490.159<0.001 β 10 3.6010.105<0.001
β 11 0.1230.1460.401 β 11 -0.0020.1540.991
β 12 0.0450.1450.759 β 12 0.2300.1310.085
β 13 −0.1860.1440.202 β 13 −0.1320.1310.320
σ 0.4970.062- β 20 −1.3040.275<0.001
β 21 0.8670.2740.003
β 22 −0.3040.2760.276
β 23 0.2930.3130.353
2 l ( θ ^ ) = 157.700AIC = 167.700 2 l ( θ ^ ) = 148.000AIC = 164.000
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Prataviera, F.; Hashimoto, E.M.; Ortega, E.M.M.; Savian, T.V.; Cordeiro, G.M. Interval-Censored Regression with Non-Proportional Hazards with Applications. Stats 2023, 6, 643-656. https://doi.org/10.3390/stats6020041

AMA Style

Prataviera F, Hashimoto EM, Ortega EMM, Savian TV, Cordeiro GM. Interval-Censored Regression with Non-Proportional Hazards with Applications. Stats. 2023; 6(2):643-656. https://doi.org/10.3390/stats6020041

Chicago/Turabian Style

Prataviera, Fábio, Elizabeth M. Hashimoto, Edwin M. M. Ortega, Taciana V. Savian, and Gauss M. Cordeiro. 2023. "Interval-Censored Regression with Non-Proportional Hazards with Applications" Stats 6, no. 2: 643-656. https://doi.org/10.3390/stats6020041

APA Style

Prataviera, F., Hashimoto, E. M., Ortega, E. M. M., Savian, T. V., & Cordeiro, G. M. (2023). Interval-Censored Regression with Non-Proportional Hazards with Applications. Stats, 6(2), 643-656. https://doi.org/10.3390/stats6020041

Article Metrics

Back to TopTop