Next Article in Journal
Cross-Country Assessment of Socio-Ecological Drivers of COVID-19 Dynamics in Africa: A Spatial Modelling Approach
Previous Article in Journal
Direct and Indirect Effects of Environmental and Socio-Economic Factors on COVID-19 in Africa Using Structural Equation Modeling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimating Time-to-Death and Determining Risk Predictors for Heart Failure Patients: Bayesian AFT Shared Frailty Models with the INLA Method

by
Tafese Ashine
1,
Habte Tadesse Likassa
2,3 and
Ding-Geng Chen
3,4,*
1
Department of Statistics, Assosa University, Assosa 5220, Ethiopia
2
Department of Statistics, Addis Ababa University, Addis Ababa 1176, Ethiopia
3
College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
4
Department of Statistics, University of Pretoria, Pretoria 0028, South Africa
*
Author to whom correspondence should be addressed.
Stats 2024, 7(3), 1066-1083; https://doi.org/10.3390/stats7030063
Submission received: 15 August 2024 / Revised: 11 September 2024 / Accepted: 20 September 2024 / Published: 23 September 2024
(This article belongs to the Section Survival Analysis)

Abstract

:
Heart failure is a major global health concern, especially in Ethiopia. Numerous studies have analyzed heart failure data to inform decision-making, but these often struggle with limitations to accurately capture death dynamics and account for within-cluster dependence and heterogeneity. Addressing these limitations, this study aims to incorporate dependence and analyze heart failure data to estimate survival time and identify risk factors affecting patient survival. The data, obtained from 497 patients at Jimma University Medical Center in Ethiopia were collected between July 2015 and January 2019. Residence was considered as the clustering factor in the analysis. We employed the Bayesian accelerated failure time (AFT), and Bayesian AFT shared gamma frailty models, comparing their performance using the Deviance Information Criterion (DIC) and Watanabe–Akaike Information Criterion (WAIC). The Bayesian log-normal AFT shared gamma frailty model had the lowest DIC and WAIC, with well-capturing cluster dependency that was attributed to unobserved heterogeneity between patient residences. Unlike other methods that use Markov-Chain Monte-Carlo (MCMC), we applied the Integrated Nested Laplace Approximation (INLA) to reduce computational load. The study found that 39.44% of patients died, while 60.56% were censored, with a median survival time of 34 months. Another interesting finding of this study is that adding frailty into the Bayesian AFT models boosted the performance in fitting the heart failure dataset. Significant factors reducing survival time included age, chronic kidney disease, heart failure history, diabetes, heart failure etiology, hypertension, anemia, smoking, and heart failure stage.

1. Introduction

Heart failure (HF) is becoming one of the major health problems across the globe [1]. It is a leading cause of death and poses a rapidly growing public health challenge, currently affecting over 64 million individuals [2,3]. With the prevalence of HF increasing with age, the number of patients is rising swiftly in both developed and developing countries with aging populations [4]. Additionally, HF is a major contributor to cardiovascular diseases and imposes a significant economic burden on individuals and healthcare systems worldwide [2,5]. A prospective cohort study of 5823 patients revealed an overall mortality rate of 16.5%, with the highest rates in Africa (34%) and India (23%), intermediate rates in Southeast Asia (15%), and the lowest rates in China (7%), South America (9%), and the Middle East (9%) [6].
Heart failure (HF) poses a significant public health challenge, particularly in developing countries where mortality rates are alarmingly high [7,8]. In sub-Saharan Africa (SSA), including Ethiopia, HF is increasingly a leading cause of death, exacerbated by inadequate resources and weak healthcare systems [7]. Unlike in the US and Europe, HF in SSA often affects younger individuals due to preventable conditions, such as hypertension (43.9%), rheumatic heart disease (15%), and dilated cardiomyopathy (19.5%) [9,10]. In Ethiopia, HF-related hospital deaths account for 24.4% of total fatalities [11,12]. Despite this, there is a notable gap in research regarding the survival rates and duration of HF patients [11,12]. While advanced survival models like Cox regression [13] and Bayesian approaches have been employed to analyze HF survival distributions [14], these methods often fall short in addressing heterogeneity and within-cluster dependencies.
Various survival methods, including Cox regression and AFT models such as exponential, Weibull, log-normal, and log-logistic, are used to estimate median survival times and identify prognostic factors [14,15]. While parametric AFT models can offer a more accurate representation of survival data compared to Cox-PH models when the hazard shape is known [16,17,18], they often lack flexibility in capturing unobserved heterogeneity and associations. Most studies fail to incorporate random effects or frailty terms into these models, which limits their ability to account for cluster-level variability [14,16]. Frailty models address this limitation by incorporating random effects to handle unobserved heterogeneity and are an extension of the Cox-PH model [17,19]. Shared frailty models, where frailties are common within clusters, are particularly suitable for clustered survival time data, offering a more nuanced understanding of survival dynamics within groups [19,20].
Traditional survival methods, such as Cox regression and parametric AFT models, often struggle with heterogeneity and within-cluster dependence, leading to potential inaccuracies. These methods may also suffer from computational inefficiencies and convergence issues, particularly MCMC approaches [21,22,23].
In contrast, Bayesian analysis with parametric AFT models [24,25,26], enhanced by INLA, overcomes these limitations. The Bayesian approach effectively integrates prior knowledge (prior probability) with data (likelihood), improving model accuracy and handling unobserved heterogeneity [24,27]. INLA offers faster and more accurate computation than MCMC, making it highly efficient for complex survival data analysis [21]. Thus, Bayesian AFT models with INLA provide a more robust and efficient framework for analyzing survival times [24,28,29]; Bayesian survival analysis with INLA was suggested by [30], addressing the drawbacks of traditional methods.
In this study, we employed Bayesian AFT shared gamma frailty models using the INLA method to estimate the survival time of HF patients. Unlike the classical survival approaches, our method enjoys the advantage of INLA especially in largely reducing the issue of time complexity. We chose this approach due to the increasing prevalence of HF in hospital settings and the gaps identified in various studies. The proposed method better estimates the parameters and captures the heterogeneity and dependence within clusters, ensuring a more accurate and comprehensive analysis and leading to more effective interventions.
The results from this study can significantly aid medical professionals in identifying high-risk patients and adjusting their treatment plans. Analyzing the survival time of HF patients is crucial for addressing public health issues by identifying factors associated with the survival time in time-to-death-based analysis. Additionally, the findings can raise awareness about the factors that contribute to HF patient deaths.
This study can also provide valuable scientific information to the Ministry of Health in Ethiopia, helping policymakers to educate the public about factors that increase the probability of death due to HF. Heart failure is preventable and curable if screened and treated in its early stages with appropriate treatment. By enhancing awareness of these aspects, the study ultimately contributes to reducing mortality rates and improving overall public health outcomes.
Overall, the major contributions of this study are as follows:
(1) Data: This study applies Bayesian AFT shared frailty models using data that incorporate recent developments in survival methods for estimating survival time and identifying predictors in heart failure patients. These data can be publicly available to supplement the development of other methods and this study would contribute to the quality of survival data analysis and improve parameter estimation.
(2) Frailty Model: Unlike the classical survival methods, the proposed frailty model captures unobserved heterogeneity within clusters and further leverages its advantages.
(3) Novel INLA Implementation: Addressing the high complexity of computation, this study makes use of advanced INLA implementation to accurately estimate the parameters of the survival models.
(4) Improved Parameter Estimation: Integrating Bayesian methods into AFT models improves parameter estimation during the heart failure patient data by incorporating prior knowledge and providing a full probability distribution, further enhancing the accuracy and reliability of results. It also offers flexibility in handling random effects and frailty terms, performs better with small samples, and provides a more comprehensive analysis of complex survival data. With this advantage, the proposed method enjoys in handling the issue of heterogeneity and survival analysis based on the heart failure patients.
(5) Model Flexibility: The log-normal Bayesian AFT shared gamma frailty model excels in flexibly modeling the survival data with both individual and group-level random effects, effectively handling unobserved heterogeneity and providing robust estimates. Our results reveal that it outperforms its competitors.
The rest of this paper is structured as follows. Section 2 describes the nature of data and methods and Section 3 presents and discusses the data analysis results. Finally, we provide concluding remarks in Section 4.

2. Data and Methods

2.1. Heart Failure Dataset and Study Design

The retrospective study, which was conducted at Jimma University Medical Center (JUMC), Jimma, Ethiopia, focused on collecting data from HF patients over a period of 3 years and 6 months, specifically from 1 July 2015 to 1 January 2019. The primary objective was to gather comprehensive information on these patients by reviewing records from the registration logbook and patient registration cards. Any incomplete data encountered during this process were verified against patient files and excluded if inadequate for analysis. The study aimed to include all HF patients who had received treatment at least once during the study period. Patients were considered eligible if they had complete information on the study variables in the registration book or patient card. Those with insufficient data or who did not commence treatment at the hospital were excluded, as were patients who died due to accidents. Ethical permission was obtained from JUMC prior to data collection. The data extraction was carried out by a team that included a statistician, public health officer, and trained enumerator using a predefined checklist (data extraction form). This rigorous approach ensured that the data collected were reliable and suitable for subsequent analysis. Jimma University Medical Center is a pioneering and great hospital, that provides essential services to many patients. This study at JUMC aimed to contribute valuable insights into the management and outcomes of HF patients within the institution during the specified time-frame.

2.2. Study Variables and Definitions

  • Outcome variable
The response variable of the study was the time-to-death of HF patients from the time of diagnosis (measured in months). In this study, we assume that the data follow the right-censored, where patients who die as a result of HF are said to have experienced the event of interest and all others (death not related to HF, dropouts, stopped, and lost to follow-up) are said to be censored (have not experienced the event of interest). To better understand the data scheme that better represents the binary outcome variable  Y i j  the status variable was coded as 0 for censored and 1 for death. Let  Y i j  be a binary indicator variable such that:
Y i j = 0 if i t h patient j t h residence ( censored ) 1 if i t h patient j t h residence ( death )
The data scheme represents a binary outcome variable,  Y i j , used in survival analysis or similar studies, indicating the event status for the  i t h  patient at the  j t h  time point or residence. A value of 0 means the patient was censored, meaning they were still alive or the event of interest (e.g., death) had not occurred by the last observation, resulting in incomplete data. A value of 1 indicates that the event (e.g., death) occurred, and the outcome was observed for the patient at the  j t h  time point.
  • Predictor variables
The covariates that are considered in this study were as follows: sex, age, smoking cigarettes, diabetes mellitus, hypertension, place of residence, alcohol consumption, history of HF, anemia, treatments taken, etiology of HF, chronic kidney disease, and stages of HF.
The findings of the study will be intended to guide future healthcare interventions and to be well prepared for reduce the influence of heart failure patients.

2.3. Survival Data Analysis

The description of survival data analysis employs non-parametric methods to compare the survival functions of two or more groups. In non-parametric survival analysis, the data are summarized via estimates of the survival function and hazard function. The Kaplan–Meier plots are employed for this purpose [31]. When comparing groups, starting with a graphical display of the data for each group is helpful. A frequency distribution table was employed to summarize the data obtained from the patient registration book based on the study variables at Jimma University Medical Center.
Survival analysis is the statistical method used to model and analyze data in which the time until an event of interest occurs is of primary concern. This technique is well suited for HF datasets which are very common in medical research since studies in medical areas have a special feature that follow-up studies could start at a certain observation time and could end before all experimental units had experienced an event [17,32]. Survival data are considered censored in the sense that they did not provide complete information since subjects of the study may not have experienced the event of interest. The most common is right censoring, which occurs when an individual has not experienced the event by the end of the study, so their last observed follow-up time is less than their time to event [32,33]. In this study, we assume that the data follow the right-censoring. The Cox PH model is widely used in survival regression analysis [34,35]. However, Wei [36] and Cox [37] have highlighted that the AFT model can provide a more realistic interpretation and alternative to the Cox PH model. The AFT model is an alternative to the Cox PH and parametric models when the assumptions of the Cox PH model do not hold [38]. Finally, the key differences between the Cox PH and the AFT models are baseline hazard function and ways of estimating coefficients [39,40].

2.4. Shared Frailty AFT Models

The term ‘frailty’ was first introduced by [41] for univariate data and later extended to multivariate data by [42]. A natural extension of the univariate frailty model would be a multivariate frailty model which allows individuals in the same cluster to share the same frailty. To account for unobserved heterogeneity, Hougaard [43] introduced the frailty term as an extension of the Cox PH model. Frailty models extend traditional survival models by incorporating random effects (frailties) that act multiplicatively on the baseline hazard function [17,19,44]. The concept of frailty provides a convenient way to introduce random effects in the survival model to account for association and unobserved heterogeneity (frailty) [45]. A shared frailty model is a survival model that includes a random effect term, where the frailties are common or shared among individuals within specific groups [17,46,47]. The shared frailty model is a conditional independence model in which frailty is common to all subjects in a cluster [20]. AFT frailty models extend the standard AFT model by incorporating random effects [48].
Suppose that there are n clusters and that the  i th  cluster has  n i  individuals and is associated with an unobserved frailty  u i . The log-linear representation of the shared frailty AFT model for the random variable,  T i j , that is associated with the survival time of the  j th  individual in the  i th  group or cluster (patient residence). Frailties or random effects  u i  are assumed to be identically and independently distributed random variables with a common density function  f ( u , η ) , where  η  is a parameter of the frailty distribution. Shared frailty models enable individuals within the same group to share a common frailty [20]. This study focused on HF patients in different parts of Ethiopia (urban and rural). The shared frailty model was applied based on their residence.
The shared frailty AFT model can be written as follows [48]:
log T i j = μ + x i j T β + u i + δ ϵ i j , i = 1 , 2 , , n , j = 1 , 2 , , n i
where  T ij  denotes the time,  μ  and  δ  are unknown location and scale parameters,  x ij  are vector of covariates,  β  is a parameter vector,  u i  is the frailty term, and  ϵ i j  are independent identically distributed random errors.
The conditional hazard function for a given frailty variable,  u i , at time  t > 0  is,
λ ( t | x ij , u i ) = u i λ 0 ( t ) exp ( x ij T β )
where  λ ( t | x ij , u i )  is the hazard function at time t for a subject with a set of covariates  x ij λ 0 ( t )  is the baseline hazard function,  x ij  are vector of covariates and  β = ( β 1 , β 2 , , β p ) T  is a vector of regression parameters for the fixed covariates  x ij .
The conditional survival function for a given frailty  u i  at time  t > 0  is,
S ( t | x ij , u i ) = exp u i H 0 ( t ) exp ( x ij T β )
where  S ( t | x ij , u i )  is the survival function at time t for a subject with a set of covariates  x ij H 0 ( t )  is the cumulative hazard function and  x i j  are the vector of covariates.

2.5. Bayesian AFT Shared Gamma Frailty Models

The Bayesian approach is preferred over the frequentist approach in survival analysis because the power of information obtained from the Bayesian approach is much better, as it combines likelihood data with prior information regarding the distribution of the parameters. The Bayesian approach is more useful in clinical data analysis than the frequentist approach and is a more appropriate data analysis technique for clinical researchers [49]. The main reasons to prefer Bayesian statistics include the ability to estimate complex models that traditional statistics cannot, a preference for the Bayesian definition of probability, the incorporation of prior and background knowledge into the analysis, and Bayesian statistics do not rely on large sample sizes (i.e., the central limit theorem). Furthermore, Bayesian statistics allow for incorporating uncertainty about a parameter and updating this knowledge via the prior distribution. The Bayesian approach considers the parameters of the model as random variables and requires specifying prior distributions for them, while the data are considered fixed [50]. In this paper, we chose a normal prior for regression coefficients because it effectively captures both positive and negative effects without introducing bias and is both mathematically convenient and interpretable. The gamma prior for frailty parameters is well-suited for non-negative values, effectively modeling unobserved heterogeneity in survival data, and providing flexibility in shape due to its definition for positive values [24].
Let us start with the data likelihood function  L ( y θ ) , which is a function that gives the probability of observing the sample data given the current parameters. For a set of unknown parameters in the presence of right censoring, it can be written as:
L ( y θ ) = i = 1 n j = 1 n i f ( t i j y i j ; θ ) δ i j S ( t i j y i j ; θ ) 1 δ i j
where  δ i j  is the censoring indicator ( 0 =  censored and  1 =  death) and  f ( t i j y i j ; θ )  and  S ( t i j y i j ; θ )  are the probability density and survival distributions, respectively [51].
In the Bayesian framework, the prior distribution is a probability distribution that expresses uncertainty about the unknown parameters before incorporating the data. It represents the prior information associated with the parameter of interest. We assumed normal priors with mean and variance for all regression coefficients and a gamma prior with shape and scale parameters for the frailty parameters [24,52]. The posterior probability density function, which summarizes our beliefs about a particular parameter, is obtained using Bayes’ rule. The distribution of a parameter can be updated by combining its prior distribution with the likelihood function, resulting in the posterior density of the parameter.
So, if  L ( y θ )  is the likelihood function and  p ( θ )  is the prior density of the parameter, then the posterior density function,  π ( θ y )  is proportional to the likelihood multiplied by the prior:
π ( θ y ) p ( θ ) L ( y θ )
In our case, the posterior density function,  π ( θ y )  [53] is given by:
π ( θ y ) = p ( θ ) × L ( y θ ) p ( θ ) × L ( y θ ) d θ
The quantity  m ( y ) = p ( θ ) × L ( y θ ) d θ  in the denominator is the normalizing constant of  π ( θ y )  and is often called the marginal distribution of the data or the prior predictive distribution.
Parametric survival models play an important role in Bayesian survival analysis since many Bayesian analyses in practice are carried out using AFT models (Exponential, Weibull, Log-Normal, Log-Logistic) [24,53,54]. We considered shared gamma frailty as the gamma distribution is commonly used for modeling the frailty effect. Gamma frailty is very well suited to failure data in terms of computational and analytical points of view. The most common reason for using the gamma distribution is its mathematical convenience [24,52,55,56]. The mean and variance are  E η ( u i ) = 1  and  Var η ( u i ) = η . Hence, the variance parameter  η  represents the amount of heterogeneity in the risk of the event.
The assumptions in Bayesian inference are that prior knowledge about parameters is expressed through a prior distribution, and the relationship between the observed data and parameters is modeled using a likelihood function. The posterior distribution updates prior beliefs with data. Additionally, a loss function is used to quantify the cost of prediction errors, guiding decisions to minimize expected loss based on the posterior distribution [29].

2.6. Integrated Nested Laplace Approximation Method

Survival models can be challenging to fit, especially when dealing with complex censoring schemes. Using the Gibbs sampler and other MCMC techniques, fitting complex survival models becomes relatively straightforward, and the availability of software significantly simplifies the implementation [24]. However, MCMC methods have limitations, such as the time burden in approximating the posterior and convergence problems [22,23,30]. The INLA method provides a flexible and fast technique for approximating complex Bayesian models. The Bayesian approach using the INLA method focuses on providing an accurate approximation to the posterior marginal distributions of the parameters in the model [21].
The INLA method was used to estimate the parameters in the Bayesian survival models. According to [21], INLA computes the posterior marginal for each component in the model, from which the posterior expectations and standard deviations can be derived. Additionally, INLA provides both extremely fast and highly accurate approximations to the posterior marginal via the clever use of Laplace approximations and advanced numerical methods, and it can be adapted to fit survival models [57].

2.7. Data Analysis Procedures

The entire data analysis was conducted using R software to estimate the heart failure patients’< median survival time while understanding the dataset’s unique characteristics and ensuring methodological precision. First, summary statistics were used to better present the characteristics of the data and Kaplan–Meier curves were fitted to learn the death dynamics, followed by a validation of the basic assumptions of the Cox model and fitting various survival models followed to identify the most suitable one.
For Bayesian models, the DIC was used to compare Bayesian survival models [58]. An alternative is the WAIC [59], which assumes a more fully Bayesian approach to construct a criterion. According to [27], the WAIC is preferable to the DIC. The preferable model is the one with the lowest values of the DIC and WAIC. A lower value indicates a better fit of the model. The most common ways of checking the goodness of fit are the Bayesian Cox–Snell residual plot and predictive distribution. Model-checking and adequacy play an important role in models for survival data. In Bayesian analysis, Chaloner [60] defined the Bayesian version of the residuals.

3. Results and Discussions

In this section, we begin by presenting descriptive results to better understand the nature of our dataset for readers and reviewers of this manuscript. Following this, we present results to clearly demonstrate the advantages of the proposed method. In doing so, we emphasize comparisons with other existing approaches in the literature. Finally, we discuss the analysis results, highlighting the benefits of the method and its broader implications in combating the potential problem of heart failure patients.

3.1. Results

The data for this study were collected from 497 patients who received treatments for HF at least once at Jimma University Medical Center in Jimma, Ethiopia, between July 2015 and January 2019.

3.1.1. Descriptive Results

As summarized in Table 1, the minimum and maximum event time observed from HF patient follow-up was 6 and 42 months, respectively. Among those HF patients, about 60.56% were censored and the remaining 39.44% have died. Fifty percent of HF patients who received treatment survived 34 months or above.
Almost half, 53.92%, of the HF patients, were female, and the remaining were male in the follow-up study. However, the survival time of male patients seems lower. Most HF patients, about 72.03%, live in rural areas, while the remainder reside in urban areas; however, survival times appear to be shorter for those living in rural areas. The survival time of HF patients seems less as they get older. Among HF patients, 21.93% have ischemic heart disease, 22.54% have rheumatic valvular heart disease, 23.54% have cardiomyopathy, 24.14% have hypertensive heart disease, and 7.85% have other diseases. Regarding smoking status, 79.28% of HF patients were non-smokers and the remaining were smokers. Additionally, 63.38% of HF patients did not use alcohol, while 36.62% were alcohol users.
Moreover, about 18.31% of HF patients were treated in the hospital with a combination of two or more treatments, and 19.72% of HF patients took digoxin. In addition, the remaining 23.14%, 27.56%, and 11.27% of HF patients were treated with spironolactone, atorvastatin, and other treatments, respectively. Regarding diabetes, 56.94% of HF patients are non-diabetic, 15.09% have type I diabetes mellitus, and 27.97% have type II diabetes mellitus. Regarding chronic kidney disease among HF patients, approximately 31.19% have chronic kidney disease, while 68.81% do not. HF patients with chronic kidney disease generally have shorter survival times. Most HF patients, 60.16%, do not have hypertension, while the remaining patients have hypertension.
Finally, regarding the stage at which HF patients go to the hospital for treatment, approximately 35.41% are in stage IV, 25.96% in stage III, 21.73% in stage II, and 16.90% in stage I. Most HF patients who pass away, about 54.08%, are HF patients who go for treatment in the hospital at a later stage, and their survival times are lower at this advanced stage.

3.1.2. Kaplan–Meier Estimate for Selective Covariates

The survival distributions of time-to-death of the HF patients were estimated for each group using the Kaplan–Meier (KM) method to compare the survival curves of two or more groups as seen in Figure 1.
As seen in Figure 1, the overall survival rate at the end of the first year was almost 92.6%, and the overall survival rate at the end of 42 months in this study was 19.2%, 95% confidence interval was (12.8%, 28.8%). Heart failure patients whose age was below 49 years had a higher probability of surviving than patients whose ages were 49 to 65 years and those aged 65 years or older. The probability of surviving becomes less for patients aged 65 years or older. The HF patients with stage I had a higher chance of surviving than other stages. The survival curve for patients with stage I was above the survival curve for patients with stage II, stage III and stage IV. The probability of surviving becomes less for HF patients with stage IV. Additionally, HF patients who live in urban areas had a better chance of survival than HF patients who live in rural areas.

3.1.3. Bayesian AFT Shared Gamma Frailty Models

As a comparison, we employed Bayesian AFT without frailty and Bayesian AFT with shared gamma frailty models for the HF dataset.
The multivariable survival models of the exponential, Weibull, lognormal, and log-logistic for the baseline hazard function by assuming gamma frailty distributions were fitted for the time-to-death of HF patients. In univariable analysis at a 5% level of significance, except for sex, treatment, and alcohol consumption, all variables were significant. The covariates significant in univariable analysis entered into multivariable analysis. We assume that all these coefficients have a normal prior with mean 0 and variance of 1000 in the Bayesian modeling. In analyzing the frailty parameter (u) via the Bayesian approach, we assume that a gamma prior was applied to the Weibull, log-normal, and log-logistic distributions with shape parameter 1 and inverse scale parameter 0.001 [24].
Table 2 shows the analysis of the HF dataset for model comparison to compare the Bayesian AFT models with frailty with the Bayesian AFT models without frailty. To compare the efficiency of these various models, DIC and WAIC were used, and the one with the smallest value and the best fit was chosen. We noted that adding frailty to the Bayesian AFT models boosted the performance in all AFT models as compared with the Bayesian AFT models without frailty. Then, the Bayesian log-normal shared gamma frailty model (DIC = 1608.812; WAIC = 1607.838) was found to be the best model for the survival time of HF patients from the given alternative.
Table 3 shows the final results for the Bayesian log-normal shared gamma frailty model, and as this result shows, the survival time of HF patients is statistically significantly affected by age, chronic kidney disease, diabetes mellitus, history of HF, aetiology of HF, hypertension, anemia, smoking, and stages of HF.
From Table 3, the results of the Bayesian log-normal shared gamma frailty model show that, keeping the effect of other factors constant, the estimated acceleration factor for HF patients aged 49 to 65 years and those aged 65 years or older is 0.7357 [95% CrI: 0.580, 0.931] and 0.6969 [95% CrI: 0.554, 0.873], respectively. Thus, the expected survival time of HF patients decreases by 26.43% and 30.31% for those aged 49 to 65 and 65 or older, respectively, compared to HF patients aged 49 or younger (reference group). The 95% CrI for the acceleration factors in both age groups does not include one, implying that both age groups significantly impact the survival time of HF patients.
For chronic kidney disease, while controlling for other factors, the estimated acceleration factor for HF patients with chronic kidney disease is 0.6630 [95% CrI: 0.569, 0.769], indicating that the expected survival time decreases by 33.70% compared to HF patients without chronic kidney disease. The 95% CrI for the acceleration factor of HF patients with chronic kidney disease does not include one, showing a significant impact on survival time.
Regarding the history of HF, keeping the effect of other factors constant, the estimated acceleration factors for HF patients in medical OPD are 1.2751 [95% CrI: 1.046, 1.559]. Thus, the expected survival time increases by 27.51% for those in medical OPD compared to new HF patients (reference group). The 95% CrI for this acceleration factor does not include one, showing a significant impact on survival time.
For hypertension, keeping the effect of other factors constant, the estimated acceleration factor for HF patients with hypertension is 0.7320 [95% CrI: 0.626, 0.855], indicating a 26.80% decrease in expected survival time compared to HF patients without hypertension (reference group). The 95% CrI for the acceleration factor of HF patients with hypertension does not include one, implying a significant effect on survival time. Moreover, keeping the effect of other factors constant, the estimated acceleration factor for HF patients who smoked cigarettes is 0.8395 [95% CrI: 0.721, 0.976], indicating a 16.05% decrease in expected survival time compared to non-smokers. The 95% CrI for the acceleration factor of HF patients who smoked cigarettes does not include one, implying that smoking has a significant effect on survival time.
Regarding the etiologies of heart failure and keeping the effect of other factors constant, the estimated acceleration factors for HF patients with rheumatic valvular heart disease, hypertensive heart disease, cardiomyopathy, and other heart diseases are 0.7211 [95% CrI: 0.568, 0.913], 0.7558 [95% CrI: 0.602, 0.948], 0.7416 [95% CrI: 0.586, 0.936], and 0.6750 [95% CrI: 0.487, 0.933], respectively. Thus, the expected survival time of HF patients decreases by 27.89% for those with rheumatic valvular heart disease, 24.42% for those with hypertensive heart disease, 25.84% for those with cardiomyopathy, and 32.50% for those with other heart diseases, compared to those with ischemic heart disease (reference group). The 95% CrI for these acceleration factors does not include one, implying that HF patients with these aetiologies have a significant impact on survival time.
On the other hand, for diabetes mellitus and keeping the effect of other factors constant, the estimated acceleration factors for HF patients with type I diabetes and type II diabetes are 0.6421 [95% CrI: 0.509, 0.806] and 0.5764 [95% CrI: 0.468, 0.706], respectively. Thus, the expected survival time of HF patients decreases by 35.79% for type I diabetics and 42.36% for type II diabetics compared to non-diabetic HF patients (reference group). The 95% CrI for the acceleration factors of HF patients with both types of diabetes does not include one, implying that diabetes has a significant effect on survival time. Regarding anemia, while controlling for other factors, the estimated acceleration factor for HF patients with anemia is 0.8445 [95% CrI: 0.729, 0.974], indicating that the expected survival time decreases by 15.55% compared to HF patients without anemia. The 95% CrI for the acceleration factor of HF patients with anemia does not include one, implying that anemia significantly affects survival time.
Finally, observing the stages of HF patients while keeping the effect of other factors constant, the estimated acceleration factors for stages II, III, and IV of HF patients are 0.6332 [95% CrI: 0.449, 0.888], 0.7033 [95% CrI: 0.509, 0.968], and 0.6402 [95% CrI: 0.467, 0.875], respectively. This result indicates that the expected survival time decreases by 36.68%, 29.67%, and 35.98% for stages II, III, and IV, respectively, compared to stage I (reference group). The 95% CrI for the acceleration factors of HF patients in stages II, III, and IV does not include one, implying that these stages significantly impact the survival time of HF patients.
From Table 3, the 95% CrI of the acceleration factors for covariates that do not include one is significant at the 5% significance level. The 95% CrI [1.0268, 95%: 1.00598, 1.625] of the frailties in the HF dataset suggests that this variability is significant. The frailty variance ( η ) = 0.0264 indicates heterogeneity between residence regions. Additionally, a frailty variance ( η ) of 0.0264 less than 1 suggests that HF patients living in rural areas have a higher risk of death (shorter survival time) compared to HF patients living in urban areas. Kendall’s tau ( τ  = 0.013) indicates a moderate amount of dependence in the clusters or unobserved heterogeneity between groups (residence region). Therefore, the frailty term had a significant contribution to the model. The Kullback–Leibler Divergence (KLD) values for all significant parameters in the Bayesian log-normal shared gamma frailty model were 0, indicating that the posterior distribution was well approximated by a normal distribution. The most efficient algorithm used was a simplified Laplace approximation, which improved efficiency and resulted in faster computation speed.

3.1.4. Bayesian Model Diagnostics

By observing the Bayesian Cox–Snell residual plots in Figure 2, we can see that the Bayesian log-normal shared gamma frailty model best fit the HF dataset from the given alternative models, since the plot of Cox–Snell residuals against cumulative hazard function of residuals was approximately a straight line with slope one and Bayesian Cox-Snell residual plot for Bayesian log-normal shared gamma frailty model was nearest to the line through the origin. In addition, the plot also indicated that the Bayesian log-normal shared gamma frailty model describes the HF dataset well.
Figure 3 includes a 95% credibility interval by observing the posterior density for the parameter’s normal distribution in the HF dataset. In addition, the histograms of the cross-validated Probability Integral Transform (PIT) values show that the posterior predictive p-values are somewhat closer to a uniform distribution, with some outliers observed in the HF dataset. The Conditional Predictive Ordinate (CPO) values are significantly smaller (order of magnitude smaller) than the others, so the observed values would be considered surprising concerning the Bayesian log-normal shared gamma frailty model because the sum of the observations associated with failure flags is equal to zero in the HF dataset. The KLD is a diagnostic that measures the accuracy of the INLA approximation since Table 3 shows that the KLD values for all significant parameters in the Bayesian log-normal shared gamma frailty model were 0.

3.2. Discussion

This study employed the Bayesian AFT shared gamma frailty models on the HF dataset obtained from Jimma University Medical Center. Heart failure is a growing global issue, with its prevalence increasing among the adult population in developing countries with aging populations [4]. The study considered 497 HF patients to attain the proposed objectives. The overall survival time observed from HF patient’s follow-up ranged from 6 months to 42 months. The result shows that the estimated median survival time of HF patients who received treatment was 34 months or above. Moreover, among those HF patients, about 301 (60.56%) were censored and the remaining 196 (39.44%) died. This finding was supported by other studies conducted by [13,14].
In the study, among HF patients, 21.93% have ischemic heart disease, 22.54% have rheumatic valvular heart disease, 23.54% have cardiomyopathy, 24.14% have hypertensive heart disease, and 7.85% have other diseases. This finding was consistent with a study conducted by [6,7,11,13,14]. Most HF patients, about 72.03%, live in rural areas, while the remainder reside in urban areas; however, survival times appear to be shorter for those living in rural areas. This finding was consistent with a study conducted by [13,14].
Furthermore, our study results indicate heterogeneity among clusters or groups (residences) at the time of death of HF patients, suggesting residence is a significant clustering factor. In line with our current work, several authors have explored Bayesian frailty models [52,55], including parametric models with baseline distributions such as exponential, Weibull, log-normal, and log-logistic, incorporating gamma frailty. Bayesian approaches have been applied to AFT-shared gamma frailty models, and efficiency comparisons of different AFT models have utilized DIC and WAIC [58,59]. The Bayesian log-normal AFT shared gamma frailty model emerged as the most effective for describing the HF dataset among the alternatives. Similar findings were reported in previous studies [14]. However, our study, using the INLA method, reveals that age, chronic kidney disease, history of HF, diabetes mellitus, etiology of HF, hypertension, anemia, smoking, and HF stage all significantly impact the survival time of HF patients.
The findings of this study reveal that age significantly impacts the survival time of HF patients, with older individuals (65 years and older) experiencing shorter survival times. This trend is consistent with results from several other studies [13,14,61]. Additionally, HF patients without hypertension tend to have longer survival times compared to those with hypertension, underscoring the significant effect of hypertension on survival. These results are corroborated by studies conducted by [12,13,14,61].
On the other hand, the survival time of HF patients who smoke is shorter compared to nonsmokers, consistent with findings from studies by [12,13,14]. Both types of diabetes mellitus significantly impact HF patients’ survival, with those having either type of diabetes experiencing shorter expected survival times compared to HF patients without diabetes. These findings align with studies conducted by [12,13,14]. Additionally, chronic kidney disease significantly affects survival time, with HF patients without chronic kidney disease having longer survival times compared to those with the condition. This result is corroborated by studies from [13,14,61].
According to findings from [13,14], anemia significantly impacts the survival time of HF patients, with those having anemia experiencing shorter expected survival times compared to non-anemic HF patients. This study aligns with those results. Additionally, studies [13,14] support the finding that HF patients with conditions such as rheumatic valvular heart disease, hypertensive heart disease, cardiomyopathy, and other heart diseases have significantly reduced survival times. The stages of HF also markedly influence survival, with increased stage correlating with decreased survival time. This conclusion is supported by studies [13,14], which demonstrate that advancing HF stages significantly impact survival time.
To assess model adequacy, we plotted cumulative hazard plots for the Bayesian Cox–Snell residuals of the Exponential, Weibull, Log-normal, and Log-logistic models, as shown in Figure 2. The Bayesian log-normal shared gamma frailty model approached the line more closely, indicating it was the most suitable for the HF dataset, consistent with previous research [14]. We also used conditional predictive ordinates and probability integral transforms to evaluate the model. Before a graphical assessment, we confirmed that no numerical issues arose in the computation of conditional predictive ordinates, as indicated by zero failures. Histograms and scatter plots of the probability integral transform showed that the predictive residuals were reasonably uniformly distributed with some outliers, supporting a good match between the predictive distribution and actual data. These results are in line with studies by [14,57,62], highlighting the robustness of our model.
Diagnostic plots for the Bayesian log-normal shared gamma frailty model, including a 95% credibility interval, show that the posterior density for parameters is normally distributed. The Kullback–Leibler divergence values for all significant parameters were zero, demonstrating that the model offers both speed and high accuracy. These findings align with results from [14,57,62]. However, the major limitations of the Bayesian AFT shared frailty model with INLA are sensitivity to assumptions, complex interpretation, challenges with large datasets, struggles with model validation, and lack of flexibility in capturing non-linear relationships. Despite limitations, such as reliance on potentially incomplete secondary data and missing prognostic factors, this study is pioneering in Ethiopia for its rigorous analysis and critical insights into heart failure patient survival, highlighting the robustness and advantages of the proposed method.

4. Conclusions

The increasing application of survival methods to estimate median survival time and identify associated factors is crucial. However, traditional methods often struggle with overfitting or underfitting, resulting in large standard errors in regression coefficients. They also frequently fail to account for within-cluster dependence and handle heterogeneity effectively. To address these shortcomings, we analyzed the survival times of heart failure patients treated at Jimma University Medical Center. This study employed various advanced survival methods to estimate survival time and identify risk factors for heart failure patients in this region. To achieve the proposed objectives, we employed Bayesian AFT and Bayesian AFT shared frailty models, comparing their performance using the heart failure patient dataset based on DIC and WAIC criteria. Notably, the Bayesian AFT models with shared gamma frailty significantly outperformed the Bayesian AFT models without frailty across all AFT model classes considered in this research.
Among the Bayesian AFT shared frailty models, the log-normal model emerged as the best performer. This finding highlights the importance of incorporating frailty to enhance model accuracy and parameter estimation. Subsequently, we then conducted an extensive data analysis using the model that demonstrated the best performance. This is due to the fact that the log-normal Bayesian AFT shared gamma frailty model provides flexible survival data modeling by incorporating both individual-level and group-level random effects, effectively handling unobserved heterogeneity and delivering robust survival estimates. This model consistently outperforms alternatives such as exponential, Weibull and log-logistic AFT models. In this study, 50% of heart failure patients who received treatment survived for 34 months or more. Significant factors affecting survival included age, chronic kidney disease, history of heart failure, diabetes mellitus, etiology, hypertension, anemia, smoking, and heart failure stages. Additionally, survival times varied between rural and urban areas, highlighting the model’s ability to capture important heterogeneity.
More specifically, the study findings suggest that age groups (49 to 65 years and ≥65 years), etiology of heart failure (rheumatic valvular heart disease, cardiomyopathy, hypertensive heart disease, and other conditions), hypertension, anemia presence, history of HF (medical OPD), chronic kidney disease presence, smoking, diabetes mellitus (type I and type II), and heart failure stages (II, III, and IV) reduced the survival time of heart failure patients. Furthermore, residence differences between rural and urban areas significantly influenced HF patient survival times, indicating heterogeneity and highlighting the need for frailty models.
Our results underscore the superior performance of the Bayesian log-normal shared gamma frailty model compared to current state-of-the-art methods.

Author Contributions

This study was drafted and the data were analyzed by T.A. under the supervision of H.T.L. and D.-G.C., H.T.L. and D.-G.C. edited and revised the entire manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work is partially based upon research supported by the National Research Foundation, South Africa (South Africa DST-NRF-SAMRC SARChI Research Chair in Biostatistics, Grant number 114613).

Institutional Review Board Statement

The Institutional Research Ethics Review Board of Jimma University provided ethical clearance for the study. The dataset was collected after an official letter was obtained from Jimma University and sent to the Hospital for permission. Our study protocol complied with the ethical guidelines of the 1975 Declaration of Helsinki. All the enrolled patients provided informed consent. The ethics committee of Jimma University approved this form of consent due to the registry-based nature of the study.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets and R code used in this study are available from the corresponding author upon reasonable request.

Acknowledgments

We thank Jimma University Medical Center for allowing us to collect the dataset from the hospital. We would like to express our sincere gratitude to the Department of Statistics at the University of Pretoria, South Africa, for their valuable support and collaboration in this project.

Conflicts of Interest

The authors declare no competing interests.

Abbreviations

The following abbreviations are used in this manuscript:
HFHeart Failure
AFTAccelerated Failure Time
Cox PHCox Proportional Hazard
MCMCMarkov Chain Monte Carlo
INLAIntegrated Nested Laplace Approximation
JUMCJimma University Medical Center
DICDeviance Information Criteria
WAICWatanabe Akaike Information Criterion
KLDKullback–Leibler Divergence
OPDOut-Patient Department.

References

  1. Ambrosy, A.P.; Fonarow, G.C.; Butler, J.; Chioncel, O.; Greene, S.J.; Vaduganathan, M.; Nodari, S.; Lam, C.S.; Sato, N.; Shah, A.N.; et al. The global health and economic burden of hospitalizations for heart failure: Lessons learned from hospitalized heart failure registries. J. Am. Coll. Cardiol. 2014, 63, 1123–1133. [Google Scholar] [CrossRef] [PubMed]
  2. Yan, T.; Zhu, S.; Yin, X.; Xie, C.; Xue, J.; Zhu, M.; Weng, F.; Zhu, S.; Xiang, B.; Zhou, X.; et al. Burden, trends, and inequalities of heart failure globally, 1990 to 2019: A secondary analysis based on the global burden of disease 2019 study. J. Am. Heart Assoc. 2023, 12, 027852. [Google Scholar] [CrossRef] [PubMed]
  3. Martin, S.S.; Aday, A.W.; Almarzooq, Z.I.; Anderson, C.A.; Arora, P.; Avery, C.L.; Baker-Smith, C.M.; Barone, G.B.; Beaton, A.Z.; Boehme, A.K.; et al. 2024 heart disease and stroke statistics: A report of us and global data from the american heart association. Circulation 2024, 149, 347–913. [Google Scholar]
  4. Pratley, R.; Guan, X.; Moro, R.J.; Lago, R. The burden of heart failure. Am. J. Med. 2024, 137, 3–8. [Google Scholar] [CrossRef]
  5. Heidenreich, P.; Sandhu, A. Advances in management of heart failure. BMJ 2024, 385, e077025. [Google Scholar] [CrossRef]
  6. Dokainish, H.; Teo, K.; Zhu, J.; Roy, A.; AlHabib, K.F.; ElSayed, A.; Palileo-Villaneuva, L.; Lopez-Jaramillo, P.; Karaye, K.; Yusoff, K.; et al. Global mortality variations in patients with heart failure: Results from the international congestive heart failure (inter-chf) prospective cohort study. Lancet Glob. Health 2017, 5, 665–672. [Google Scholar] [CrossRef]
  7. Gtif, I.; Bouzid, F.; Charfeddine, S.; Abid, L.; Kharrat, N. Heart failure disease: An african perspective. Arch. Cardiovasc. Dis. 2021, 114, 680–690. [Google Scholar] [CrossRef]
  8. Keates, A.K.; Mocumbi, A.O.; Ntsekhe, M.; Sliwa, K.; Stewart, S. Cardiovascular disease in africa: Epidemiological profile and challenges. Nat. Rev. Cardiol. 2017, 14, 273–293. [Google Scholar] [CrossRef]
  9. Damasceno, A.; Mayosi, B.M.; Sani, M.; Ogah, O.S.; Mondo, C.; Ojji, D.; Dzudie, A.; Kouam, C.K.; Suliman, A.; Schrueder, N.; et al. The causes, treatment, and outcome of acute heart failure in 1006 africans from 9 countries: Results of the sub-saharan africa survey of heart failure. Arch. Intern. Med. 2012, 172, 1386–1394. [Google Scholar] [CrossRef]
  10. Sliwa, K.; Mayosi, B.M. Recent advances in the epidemiology, pathogenesis and prognosis of acute heart failure and cardiomyopathy in africa. Heart 2013, 99, 1317–1322. [Google Scholar] [CrossRef]
  11. Asfaw, E. Five years clinical characteristics and in hospital outcome of acute heart failure at tertiary care hospital in ethiopia. Ethiop. Med. J. 2020, 58. [Google Scholar]
  12. Tirfe, M.; Nedi, T.; Mekonnen, D.; Berha, A.B. Treatment outcome and its predictors among patients of acute heart failure at a tertiary care hospital in ethiopia: A prospective observational study. BMC Cardiovasc. Disord. 2020, 20, 1–10. [Google Scholar] [CrossRef] [PubMed]
  13. Moyehodie, Y.A.; Muluneh, M.W.; Belay, A.T.; Fenta, S.M. Time to death and its determinant factors among patients with chronic heart failure in northwest ethiopia: A retrospective study at selected referral hospitals. Front. Cardiovasc. Med. 2022, 9, 817074. [Google Scholar] [CrossRef]
  14. Ashine, T.; Muleta, G.; Tadesse, K. Assessing survival time of heart failure patients: Using Bayesian approach. J. Big Data 2021, 8, 156. [Google Scholar] [CrossRef]
  15. Kalbfleisch, J.D.; Prentice, R.L. The Statistical Analysis of Failure Time Data; John Wiley and Sons: Hoboken, NJ, USA, 2011. [Google Scholar]
  16. Khanal, S.P.; Sreenivas, V.; Acharya, S.K. Accelerated failure time models: An application in the survival of acute liver failure patients in india. Int. J. Sci. Res. 2014, 3, 161–166. [Google Scholar]
  17. Collett, D. Modelling Survival Data in Medical Research; CRC Press: Boca Raton, FL, USA, 2023. [Google Scholar]
  18. Lee, K.H.; Rondeau, V.; Haneuse, S. Accelerated failure time models for semicompeting risks data in the presence of complex censoring. Biometrics 2017, 73, 1401–1412. [Google Scholar] [CrossRef]
  19. Hanagal, D.D. Modeling Survival Data Using Frailty Models; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
  20. Hougaard, P.; Hougaard, P. Analysis of multivariate survival data; Springer: Berlin/Heidelberg, Germany, 2000; p. 564. [Google Scholar]
  21. Rue, H.; Martino, S.; Chopin, N. Approximate Bayesian inference for latent gaussian models by using integrated nested laplace approximations. J. Royal Stat. Soc. Ser. Stat. Methodol. 2009, 71, 319–392. [Google Scholar] [CrossRef]
  22. Berger, J.O. Statistical Decision Theory and Bayesian Analysis; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
  23. Brooks, S.P.; Gelman, A. General methods for monitoring convergence of iterative simulations. J. Comput. Graph. Stat. 1998, 7, 434–445. [Google Scholar] [CrossRef]
  24. Ibrahim, J.G.; Chen, M.-H.; Sinha, D.; Ibrahim, J.; Chen, M. Bayesian Survival Analysis; Springer: Berlin/Heidelberg, Germany, 2001; p. 2. [Google Scholar]
  25. Ibrahim, J.G.; Zhu, H.; Tang, N. Bayesian local influence for survival models. Lifetime Data Anal. 2011, 17, 43–70. [Google Scholar] [CrossRef]
  26. Carroll, R.L.; Andrew, B.; Zhao, S. Temporally dependent accelerated failure time model for capturing the impact of events that alter survival in disease mapping. Biostatistics 2017, 20, 666–680. [Google Scholar] [CrossRef]
  27. Gelman, A.; Hwang, J.; Vehtari, A. Understanding predictive information criteria for Bayesian models. Stat. Comput. 2014, 24, 997–1016. [Google Scholar] [CrossRef]
  28. Aslanidou, H.; Dey, D.K.; Sinha, D. Bayesian analysis of multivariate survival data using monte carlo methods. Can. J. Stat. 1998, 26, 33–48. [Google Scholar] [CrossRef]
  29. Congdon, P.D. Applied Bayesian Hierarchical Methods; CRC Press: Boca Raton, FL, USA, 2010. [Google Scholar]
  30. Abdu, J.; Muhammad, I.B. Bayesian Survival Analysis with Missing Data Using Integrated Nested Laplace Approximation; Newcastle University: Newcastle upon Tyne, UK, 2020. [Google Scholar]
  31. Kaplan, E.L.; Meier, P. Nonparametric estimation from incomplete observations. J. Am. Stat. Assoc. 1958, 53, 457–481. [Google Scholar] [CrossRef]
  32. Aalen, O.; Borgan, O.; Gjessing, H. Survival and Event History Analysis: A Process Point of View; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
  33. Chen, D.G.; Lio, Y. Comparative studies on frailties in survival analysis. Commun. Stat. Simul. Comput. 2008, 37, 1631–1646. [Google Scholar] [CrossRef]
  34. Cox, D.R. Regression models and life-tables. J. R. Stat. Ser. (Methodol.) 1972, 34, 187–202. [Google Scholar] [CrossRef]
  35. Bender, R.; Augustin, T.; Blettner, M. Generating survival times to simulate cox proportional hazards models. Stat. Med. 2005, 24, 1713–1723. [Google Scholar] [CrossRef]
  36. Wei, L.J. The accelerated failure time model: A useful alternative to the cox regression model in survival analysis. Stat. Med. 1992, 11, 1871–1879. [Google Scholar] [CrossRef]
  37. Cox, D.R. Some remarks on the analysis of survival data. In Proceedings of the First Seattle Symposium in Biostatistics: Survival Analysis; Springer: Berlin/Heidelberg, Germany, 1997; pp. 1–9. [Google Scholar]
  38. Pan, W. Using frailties in the accelerated failure time model. Lifetime Data Anal. 1997, 7, 55–64. [Google Scholar] [CrossRef]
  39. Kleinbaum, D.G.; Klein, M. Survival Analysis a Self-Learning Text; Springer: Berlin/Heidelberg, Germany, 1996. [Google Scholar]
  40. Kats, L.; Gorfine, M. An accelerated failure time regression model for illness–death data: A frailty approach. Biometrics 2023, 79, 3066–3081. [Google Scholar] [CrossRef]
  41. Vaupel, J.W.; Manton, K.G.; Stallard, E. The impact of heterogeneity in individual frailty on the dynamics of mortality. Demography 1979, 16, 439–454. [Google Scholar] [CrossRef]
  42. Clayton, D.G. A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence. Biometrika 1978, 65, 141–151. [Google Scholar] [CrossRef]
  43. Hougaard, P. Modelling heterogeneity in survival data. J. Applied Probab. 1997, 28, 695–701. [Google Scholar] [CrossRef]
  44. Hougaard, P. Frailty models for survival data. Lifetime Data Anal. 1995, 1, 255–273. [Google Scholar] [CrossRef] [PubMed]
  45. Wienke, A. Frailty Models in Survival Analysis; CRC Press: Boca Raton, FL, USA, 2010. [Google Scholar]
  46. Duchateau, L.; Janssen, P. The Frailty Model; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
  47. Zhang, J.; Peng, Y. An alternative estimation method for the accelerated failure time frailty model. Comput. Stat. Data Anal. 2007, 51, 4413–4423. [Google Scholar] [CrossRef]
  48. Lambert, P.; Collett, D.; Kimber, A.; Johnson, R. Parametric accelerated failure time models with random effects and an application to kidney transplant survival. Stat. Med. 2004, 23, 3177–3192. [Google Scholar] [CrossRef]
  49. Bhattacharjee, A. Application of Bayesian approach in cancer clinical trial. World J. Oncol. 2014, 5, 109. [Google Scholar] [CrossRef]
  50. Depaoli, S. The impact of inaccurate “informative” priors for growth parameters in Bayesian growth mixture modeling. Struct. Equ. Model. Multidiscip. J. 2014, 21, 239–252. [Google Scholar] [CrossRef]
  51. Ganjali, M.; Baghfalaki, T. Bayesian analysis of unemployment duration data in the presence of right and interval censoring. J. Reliab. Stat. 2012, 17–32. [Google Scholar]
  52. Hanagal, D.D.; Dabade, A.D. Bayesian estimation of parameters and comparison of shared gamma frailty models. Commun.-Stat.-Simul. Comput. 2013, 42, 910–931. [Google Scholar] [CrossRef]
  53. Christensen, R.; Johnson, W.; Branscum, A.; Hanson, T.E. Bayesian Ideas and Data Analysis: An Introduction for Scientists and Statisticians; CRC Press: Boca Raton, FL, USA, 2010. [Google Scholar]
  54. Santos, C.; Achcar, J. A Bayesian analysis for multivariate survival data in the presence of covariates. J. Stat. Theory Appl. 2010, 9, 233–253. [Google Scholar]
  55. Yin, G.; Ibrahim, J.G. A class of Bayesian shared gamma frailty models with multivariate failure time data. Biometrics 2005, 61, 208–216. [Google Scholar] [CrossRef] [PubMed]
  56. Hougaard, P. Life table methods for heterogeneous populations: Distributions describing the heterogeneity. Biometrika 1984, 71, 75–83. [Google Scholar] [CrossRef]
  57. Akerkar, R.; Martino, S.; Rue, H. Implementing approximate Bayesian inference for survival analysis using integrated nested laplace approximations. Prepr. Stat. Nor. Univ. Sci. Technol. 2010, 1, 1–38. [Google Scholar]
  58. Spiegelhalter, D.J.; Abrams, K.R.; Myles, J.P. Bayesian Approaches to Clinical Trials and Health-Care Evaluation; CRC Press: Boca Raton, FL, USA, 2004. [Google Scholar]
  59. Watanabe, S.; Opper, M. Asymptotic equivalence of bayes cross validation and widely applicable information criterion in singular learning theory. J. Mach. Learn. Res. 2010, 11. [Google Scholar]
  60. Chaloner, K. Bayesian residual analysis in the presence of censoring. Biometrika 1991, 78, 637–644. [Google Scholar] [CrossRef]
  61. L’opez-Vilella, R.; Guerrero, C.B.; Donoso, T.V.; Martínez, D.L.; Almenar, B.L. Clinical profiling of patients admitted with acute heart failure: A comprehensive survival analysis. Front. Cardiovasc. Med. 2024, 11, 1381514. [Google Scholar] [CrossRef]
  62. Martino, S.; Akerkar, R.; Rue, H. Approximate Bayesian inference for survival models. Scand. J. Stat. 2011, 38, 514–528. [Google Scholar] [CrossRef]
Figure 1. Kaplan–Meier estimates of the survival curves of HF dataset for the Overall survivor function for HF patients, Age group, Stages and Place of residence.
Figure 1. Kaplan–Meier estimates of the survival curves of HF dataset for the Overall survivor function for HF patients, Age group, Stages and Place of residence.
Stats 07 00063 g001
Figure 2. Bayesian Cox–Snell residual plots for baseline distribution employed to fit the heart failure dataset.
Figure 2. Bayesian Cox–Snell residual plots for baseline distribution employed to fit the heart failure dataset.
Stats 07 00063 g002
Figure 3. Plots of Histogram, PIT, CPO, and 95% credible intervals used to fit the heart failure dataset.
Figure 3. Plots of Histogram, PIT, CPO, and 95% credible intervals used to fit the heart failure dataset.
Stats 07 00063 g003aStats 07 00063 g003b
Table 1. Descriptive summaries of patients for HF dataset.
Table 1. Descriptive summaries of patients for HF dataset.
CovariatesCategoriesNo. of Censored (%)No. of Death (%)Total
SexFemale180 (59.80)88 (44.90)268 (53.92)
Male121 (40.20)108 (55.10)229 (46.08)
Age≤49119 (39.54)18 (9.19)137 (27.56)
49–65101 (33.55)61 (31.12)162 (32.60)
≥6581 (26.91)117 (59.69)198 (39.84)
AlcoholNo202 (67.11)113 (57.65)315 (63.38)
Yes99 (32.89)83 (42.35)182 (36.62)
ResidenceUrban94 (31.23)45 (22.96)139 (27.97)
Rural207 (68.77)151 (77.04)358 (72.03)
History of HFNew128 (42.53)64 (32.65)192 (38.63)
HF patient before82 (27.24)61 (31.12)143 (28.77)
Medical OPD91 (30.23)71 (36.23)162 (32.60)
Chronic kidney diseaseNo270 (89.70)72 (36.73)342 (68.81)
Yes31 (10.30)124 (63.27)155 (31.19)
HypertensionNo248 (82.39)51 (26.02)299 (60.16)
Yes53 (17.61)145 (73.98)198 (39.84)
AnemiaNo256 (85.05)73 (37.24)329 (66.20)
Yes45 (14.95)123 (62.76)168 (33.80)
Diabetes mellitusNot247 (82.06)36 (18.37)283 (56.94)
Type I21 (6.98)54 (27.55)75 (15.09)
Type II33 (10.96)106 (54.08)139 (27.97)
Etiology of HFIHD86 (28.57)23 (12.19)109 (21.93)
RVHD59 (19.60)53 (26.83)112 (22.54)
Cardiomyopathy65 (21.59)52 (25.61)117 (23.54)
HHD64 (21.26)56 (29.88)120 (24.14)
Others27 (8.97)12 (5.49)39 (7.85)
SmokingNo280 (93.02)114 (58.16)394 (79.28)
Yes21 (6.98)82 (41.84)103 (20.72)
TreatmentsDigoxin60 (19.94)38 (19.39)98 (19.72)
Spironolactone76 (25.25)39 (19.90)115 (23.14)
Atorvastatin77 (25.58)60 (30.61)137 (27.56)
Others37 (12.29)19 (9.69)56 (11.27)
Combination  2 51 (16.94)40 (20.41)91 (18.31)
Stages of HFI78 (25.91)6 (3.06)84 (16.90)
II81 (26.91)27 (13.78)108 (21.73)
III72 (23.99)57 (29.08)129 (25.96)
IV70 (23.26)106 (54.08)176 (35.41)
No.: Number; %: Percent; IHD: Ischemic heart disease; RVHD: Rheumatic valvular heart disease and HHD: Hypertensive heart disease. Source: JUMC, Jimma, Ethiopia; from 1 July 2015–1 January 2019.
Table 2. Comparison of Bayesian AFT models without and with gamma frailty.
Table 2. Comparison of Bayesian AFT models without and with gamma frailty.
DistributionsBayesian AFT ModelsBayesian AFT Models with Frailty
DICWAICDICWAIC
Exponential1808.0931801.9661799.0991790.696
Log-normal1616.0991615.1841608.8121607.838
Weibull1642.9711642.4861616.2101617.387
Log-logistic1654.0361651.1651619.7311616.926
Table 3. Results for Bayesian log-normal shared gamma frailty model.
Table 3. Results for Bayesian log-normal shared gamma frailty model.
CovariatesCategoriesPmean ( β )SD   ϕ CrI for  ϕ KLD
  μ Intercept5.140.456 0
Age≤49Ref
49–65−0.3070.1210.7357[0.580, 0.931] *0
≥65−0.3610.1160.6969[0.554, 0.873] *0
History of HFNewRef
HF patient0.1830.1031.2008[0.983, 1.469]0
Medical OPD0.2430.1011.2751[1.046, 1.559] *0
CKDNoRef
Yes−0.4110.0770.6630[0.569, 0.769] *0
HypertensionNoRef
Yes−0.3120.0790.7320[0.626, 0.855] *0
Etiology of HFIHDRef
RVHD−0.3270.1210.7211[0.568, 0.913] *0
Cardiomyopathy−0.2800.1160.7558[0.602, 0.948] *0
HHD−0.2990.1190.7416[0.586, 0.936] *0
Others−0.3930.1660.6750[0.487, 0.933] *0
SmokingNoRef
Yes−0.1750.0770.8395[0.721, 0.976] *0
Stages of HFIRef
II−0.4570.1730.6332[0.449, 0.888] *0
III−0.3520.1640.7033[0.509, 0.968] *0
IV−0.4460.1600.6402[0.467, 0.875] *0
Diabetes mellitusNotRef
Type I−0.4430.1170.6421[0.509, 0.806] *0
Type II−0.5510.1050.5764[0.468, 0.706] *0
AnemiaNoRef
Yes−0.1690.0730.8445[0.729, 0.974] *0
Pre lognormal survShape3.450.36431.5[15.958, 67.356] *
Pre ResFrailty0.02640.01991.0268[1.00598, 1.625] *
* indicates statistically significant.  ϕ : Acceleration factor;  β : Coefficient of parameters; Pmean: Posterior Mean; CKD: Chronic Kidney Disease; IHD: Ischemic Heart Disease; RVHD: Rheumatic Valvular Heart Disease; HHD: Hypertensive Heart Disease; SD: Standard Deviation; CrI: Credible Interval for  ϕ ; KLD: Kullback–Leibler Divergence. Source: JUMC, Jimma, Ethiopia; from 1 July 2015, to 1 January 2019.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ashine, T.; Tadesse Likassa, H.; Chen, D.-G. Estimating Time-to-Death and Determining Risk Predictors for Heart Failure Patients: Bayesian AFT Shared Frailty Models with the INLA Method. Stats 2024, 7, 1066-1083. https://doi.org/10.3390/stats7030063

AMA Style

Ashine T, Tadesse Likassa H, Chen D-G. Estimating Time-to-Death and Determining Risk Predictors for Heart Failure Patients: Bayesian AFT Shared Frailty Models with the INLA Method. Stats. 2024; 7(3):1066-1083. https://doi.org/10.3390/stats7030063

Chicago/Turabian Style

Ashine, Tafese, Habte Tadesse Likassa, and Ding-Geng Chen. 2024. "Estimating Time-to-Death and Determining Risk Predictors for Heart Failure Patients: Bayesian AFT Shared Frailty Models with the INLA Method" Stats 7, no. 3: 1066-1083. https://doi.org/10.3390/stats7030063

APA Style

Ashine, T., Tadesse Likassa, H., & Chen, D. -G. (2024). Estimating Time-to-Death and Determining Risk Predictors for Heart Failure Patients: Bayesian AFT Shared Frailty Models with the INLA Method. Stats, 7(3), 1066-1083. https://doi.org/10.3390/stats7030063

Article Metrics

Back to TopTop