1. Introduction
The novel coronavirus (SARS-CoV-2) is associated with high morbidity and mortality, especially among those with underlying health conditions. COVID-19 is an infectious disease that is caused by a novel enveloped RNA virus, resulting in severe pneumonia [
1]. Elevated serum ferritin and D-dimer levels are found in patients with cytokine storm syndrome (CSS) due to COVID-19 infection [
2,
3]. This article seeks to identify optimal serum cutoff levels for ferritin and D-dimer with trajectory plots of repeated values obtained during hospitalization as biomarkers for COVID-19 related invasive ventilator dependence (IVD) and in-hospital mortality. No optimal serum biomarker exists for COVID-19 infection.
Serum ferritin is an iron storage protein with the primary role of regulating cellular oxygen metabolism. Ferritin is composed of two different subunits, H and L. Previous studies have suggested H-ferritin acts as an immune modulatory molecule with both proinflammatory and immunosuppressive functions [
4]. Increased ferritin levels could be indicative of a strong inflammatory reaction related to viral entry into the human body and its impact on iron metabolism [
4]. According to Eloseily et al., an elevated ferritin value (e.g., >700 ng/mL) should alert clinicians to additional diagnostic work-up so that therapeutic approaches can be considered without significant delay. Early institution of treatment for CSS has been proven to lead to better patient outcomes [
5]. Elevated ferritin serum levels have been found to significantly correlate with disease severity in COVID-19 infected patients [
6]. A prior study found that, compared to moderate cases of COVID-19, severe cases had higher ferritin levels [
7].
Several recent studies have also found associations between D-dimer levels and the severity of COVID-19 infection and mortality [
2]. D-dimer is a degradation product of cross-linked fibrin, indicating increased thrombin generation and fibrin dissolution by plasmin. High D-dimer levels are common in acutely ill individuals with a number of infectious and inflammatory diseases [
8]. Studies have indicated that the coagulation system is active in critically ill patients, and D-dimer levels correlate with activation of the proinflammatory cytokine cascade leading to CSS [
9]. It is believed that coagulation abnormalities related to elevated levels of D-dimer cause venous thromboembolism, which may contribute to respiratory deterioration related to COVID-19 infection [
10].
Most of these studies show significant elevation in serum ferritin and D-dimer in COVID-19 infected patients, yet they fail to identify an optimal serum cutoff level that can reliably predict the development of CSS resulting in acute lung injury with IVD and in-hospital mortality. Optimal serum cutoff level for ferritin and D-dimer along with trajectory plots will help with allocation of hospital resources based on individual clinical course.
2. Materials and Methods
2.1. Settings
We used data from the Cerner COVID-19 de-identified data cohort, which is a subset of the entire Cerner Real-World Data cohort.
“Cerner Real-World Data is extracted from the EMR of hospitals in which Cerner has a data use agreement. Encounters may include pharmacy, clinical and microbiology laboratory, admission, and billing information from affiliated patient care locations. All admissions, medication orders and dispensing, laboratory orders and specimens are date and time stamped, providing a temporal relationship between treatment patters and clinical information. Cerner Corporation has established Health Insurance Portability and Accountability Act-compliant operating policies to establish deidentification for Cerner Real-World Data” [
11].
Only patients with a confirmed COVID-19 diagnosis or recent positive lab test from January through June 2020 and valid ferritin or fibrin D-dimer laboratory results were included. These values were identified by Logical Observation Identifiers Names and Codes (LOINC) codes (
Supplemental Table S1) and were considered valid if they were associated with encounters showing a diagnosis or recent positive lab test of COVID-19. Additionally, an independent dataset from July to September 2020 was used for validation analyses. The University of Utah Institutional Review Board (IRB #136696) has determined that this study does not meet the definitions of Human Subjects Research for using secondary data with no intervention or interaction with an individual, and for not having identifiable private information in the data.
2.2. Measurements
The outcomes of interest involved two different indications of clinical complications in COVID-19 patients: IVD, and in-hospital mortality. IVD was a binary indication (yes or no) representing whether a patient ever had a diagnosis, procedure, or encounter result that signified reliance on an invasive ventilator. The list of codes is found in
Supplemental Table S2. These were kept separate from indications of less severe ventilator dependence such as continuous positive airway pressure (CPAP) and bilevel positive airway pressure (BiPAP) machines. In-hospital mortality was a binary indication (yes or no) of whether a patient died at hospital discharge.
The predictors of interest were ferritin and fibrin D-dimer lab results. These values were continuous, and all ferritin results were converted to be on the same unit-of-measure scale of nanograms per milliliter (ng/mL). All D-dimer results were converted to be on the same scale of milligrams, fibrinogen equivalent units, per liters (mg {FEU}/L). Measurements were sorted by date to provide a trajectory of results across the patient’s hospital stay.
Other predictors included gender and an indication of comorbidity based on the weighted Charlson comorbidity index (CCI) [
12]. The CCI measures patient comorbidity by calculating a risk-assessment score based on ICD-10 diagnosis codes (
Supplemental Table S3) that are associated with 17 chronic diseases. These specific diseases are listed in
Table 1 [
13]. With access to disease histories of patients as far back as January 1, 2015, the CCI considered all registered disease codes that fell in this time frame, using which to provide an index by categorizing numerical scores into the following categories: 0, 1–2, 3–4, and ≥5 [
14]. The level of “0” is indicative of no disease burden, whereas “≥5” is indicative of maximal disease burden. Other demographic and clinical characteristics were included for descriptive analysis, including a continuous predictor (age in years), and categorical predictors (gender, race/ethnicity, insurance, and zip-code region).
2.3. Statistical Analysis
Categorical variables were presented with frequencies and percentages, while continuous variables were presented with medians and interquartile ranges (IQR = Q1–Q3). Primary analyses were conducted separately for the ferritin and D-dimer cohorts. In each cohort, mixed-effects logistic regression models were fit with the respective laboratory values (ferritin or D-dimer) as a predictor, and the patient identifier as a random effect to allow modeling the trajectory of laboratory results within patients. For each patient, we have considered their trajectory of values instead of a single measurement, such as most recent or maximal measurement, to improve models’ predictability power. Lab values were transformed to the log-scale for violating normality. Results were presented back on their original scale by exponentiation.
Assuming each patient as a cluster with correlated observations, we used logistic random-effects models [
15] to calculate sensitivities and specificities, which in turn helped us identify the optimal cutoff level of the continuous predictor (ferritin or D-dimer lab) that discriminated those with the response of interest (ventilator dependent or in-hospital deceased) from those without the response of interest (non-ventilator dependent or alive). The optimal cutoff was found by either maximizing the Youden index or minimizing the Euclidean distance of sensitivity and specificity [
16]. The predictive ability of the cutoff level was assessed by using the receiver operating characteristic (ROC) curve and calculating the area under the curve (AUC). AUC measures of 0.5 are poor, and measures of 1 are perfect. Odds ratios (ORs) with 95% confidence intervals (CIs) were reported alongside the intraclass correlations (ICCs). Here, ICC is the proportion of total variance in the outcome that is explained by the clustering of similar laboratory results within each patient. To assess differences between levels of gender and comorbidity, the same modeling techniques were repeated by stratifying ferritin and D-dimer cohorts by gender as well as by CCI groups (0, 1–2, 3–4, ≥5).
To assess the generalizability of the results, cross-validations (CVs) were conducted where 20% of patients (and their repeated measurements) from the complete dataset were randomly sampled to be the testing set, and the remaining 80% of patients were retained as the training set. Models were fit on the training set and obtained optimal cutoffs were used to predict outcomes for the testing set. The mean percentage of correct classifications (MPCC) with corresponding standard errors (SE) were reported. Additionally, for validation purposes, optimal cutoff results were tested and reported on a completely independent dataset comprised of patient encounters from July to September 2020. Models applied the same optimal cutoff levels, which were calculated from the previous analysis, to these data. Percentages of correct classification (PCC) were reported. All of these validation analyses were conducted separately for the two predictors (ferritin and D-dimer) and the two outcomes (IVD and in-hospital mortality).
To investigate the difference in lab-level trajectory over time between patients eventually dying and patients remaining alive, mixed-effects exponential regression models were constructed. The predictor was the interaction between elapsed time (time in days since each patient’s first appointment) and a binary indication of death. This provided a lab-level trend over time for those who died as well as for those who remained alive. To account for the inherent differences between each patient, repeated lab measurements were clustered within each patient. Exponentiated coefficients, relating to the percentage change in lab value, with 95% CIs were calculated, along with coefficient of determination (R2) values to estimate the percent of variation in ferritin and D-dimer lab levels as explained by the model predictors. Figures were constructed by using model-predicted ferritin and D-dimer levels from a sample of patients. Lab levels were plotted against time for each patient and stratified by death status.
As a final sensitivity analysis, the optimal cutoff modeling schema was conducted again under two scenarios: (1) without clustering on each patient, and (2) using only the maximum lab result per patient. This was conducted to compare results with that from the full repeated patient lab measurements with the random effect. All statistical tests were two-sided at a significance level of 5% and were conducted using R version 3.6.1 (R Foundation for Statistical Computing).
4. Discussion
Due to limited hospital resources, it is critical to understand threshold patterns of serum biomarkers that can predict CSS, which would aid in judicious allocation of resources, bridging the time for the development of an effective vaccine and medical treatment. Here, we identified optimal cutoffs of serum ferritin and D-dimer levels in COVID-19 patients that predict in-hospital mortality and IVD. These cutoffs, for some CCI categories, seem to be comparable to values obtained from initial retrospective studies in Wuhan, China. In a study of 21 patients, Chen et al. found 11 patients with severe COVID-19 infection to have a serum ferritin higher than 800 µg/L [
17]. In another retrospective study of 191 patients at the Jinyintan Hospital and Wuhan Pulmonary Hospital, a serum level of >300 µg/L was present in 96% of non-survivors [
18].
In this study, female patients with COVID-19 have an overall lower optimal serum ferritin and D-dimer cutoff for in-hospital mortality and IVD compared to males. The ratio between genders was almost 1:3 (or 270.0: 860.4) for IVD, which is higher than that among healthy adults, where females have a lower serum ferritin range (12–150 ng/mL) compared to males (12–300 ng/mL) [
19]. When examining the trajectory of repeated ferritin and D-dimer levels on subsequent days of illness, we found persistently elevated, time-spaced repeated measurements of ferritin and D-dimer to be highly predictive of patients developing CSS, with imminent need of ventilation, and eventual in-hospital death. We uncovered two distinct trajectory patterns, each representing mortality or survival. The first trajectory pattern consists of rapid elevation, in which repeated serum levels of ferritin and D-dimer continue to increase over time, a pattern that highly correlates with mortality. The second trajectory pattern witnesses decreasing or unchanged subsequent levels of ferritin and D-dimer, which highly correlates with survival. Among patients with low initial ferritin and D-dimer values, but increasing subsequent time-spaced repeated values, there is a high chance of developing CSS, needing invasive ventilation, and experiencing in-hospital mortality. However, patients with high initial ferritin and D-dimer levels, and stable or decreasing subsequent time-spaced repeated values, have a low probability for developing CSS, needing ventilation, and experiencing death. This might indicate distinct biological and physiological feedback mechanisms during SARS-CoV-2 infections among certain susceptible patient groups, since genetic susceptibility factors could contribute to CSS development [
20].
Among COVID-19 patients, older age and comorbidities such as diabetes or hypertension are predictors of increased COVID-19 related morbidity and mortality [
21]. We hypothesized that COVID-19 patients without any comorbidities would possess high serum ferritin and D-dimer levels because they have adequate cardiac and pulmonary reserves and would require a strong COVID-19 related CSS to develop respiratory distress resulting in ventilation and in-hospital mortality. Conversely, in patients with increasing comorbidities, we predicted that a modest increase in serum ferritin and D-dimer due to COVID-19 would result in multiorgan failure, owing to pre-existing limited cardiopulmonary and immune reserves. However, we did not find a clear pattern for optimal cutoff in relation to CCI. As hypothesized, serum ferritin and D-dimer should have the highest optimal cutoff in patients with no comorbidities. However, in our study using CCI, patients with no comorbidities had the lowest serum ferritin and D-dimer optimal cutoff for IVD, and the lowest serum ferritin cutoff for in-house mortality. This finding may represent additional unknown factors (e.g., male sex, blood type A, and other genetic and exposure susceptibility factors, including the triggering of a previous viral infection), or immune responses toward CSS that result in mortality and IVD in relation to COVID-19 infection [
20,
22]. Furthermore, individuals with no comorbidities likely activate thrombotic responses early after SARS-CoV-2 infection [
23], which demonstrates the importance of D-dimer tests being conducted early after admission, especially among younger, generally healthy patients.
Another important finding of our study is that optimal ferritin and D-dimer cutoffs alone are not sufficient information, but must be paired with knowledge of the trajectory of repeated ferritin and D-dimer levels drawn on subsequent days of illness for COVID-19 patients. ICCs indicated exceptionally high correlation of lab values within each patient (ICC ≈ 0.96), and either ignoring this aspect or collapsing down to one value per patient would eliminate this valuable information. The information lost is demonstrated by the stark difference in AUCs between
Table 3 (AUC ≈ 0.99) and
Supplemental Tables S4 and S5 (AUC ≈ 0.60–0.75). The increase in AUC when considering the entire trajectory of values could partly be due to the increase in the number of observations [
15]. Our findings in
Supplemental Tables S4 and S5 are supported by a recent study published in August 2020 on 942 patients from a large New York City health system [
24]. Using a single measurement, the authors obtained poor to fair accuracy (AUC ≈ 0.60–0.75) for ferritin as a biomarker when they collapsed repeated measurements down. We demonstrated in our study, using logistic random-effects modeling, that considering the entire trajectory of measurements optimizes the accuracy of ferritin as a biomarker (see
Table 3 versus
Supplemental Tables S4 and S5). Using such methodology, we assumed that sensitivity and specificity were centered around a common mean across the patients (clusters) with some level of variability. Further details on using logistic random-effects models to adjust for correlations between subjects’ measurements when calculating sensitivity and specificity have been shown elsewhere [
15].
Our optimal cutoffs for ferritin (714.3 ng/mL for in-hospital mortality with AUC = 0.997; and 501.6 for IVD with AUC = 0.996), and D-dimer (2.1 ng/mL for in-hospital mortality with AUC = 0.997; and 2.0 for IVD with AUC = 0.998) were validated by using CV on a testing set, and by using an independent large cohort (
Table 4), demonstrating substantial performance.
4.1. Limitations
One of the main limitations of studying serum ferritin and D-dimer is working with non-standardized data from blood samples, which lack uniform measurement units due to the variability of various lab tests (see
Supplemental Table S1). Furthermore, patients entered this cohort at different timepoints, when experimental treatments and guidelines for COVID-19 care had been changing rapidly, which might have induced some residual confounding.
4.2. Recommendations
In line with the identification protocol of acute coronary syndrome, which uses time-spaced troponin I levels, we propose a similar approach to classify COVID-19 infection. We recommend the development of a standardized protocol with identification of appropriate time gap between repeated values for ferritin and D-dimer in COVID-19 patients. We hypothesize that at least three repeated values of ferritin and D-dimer, spaced over 8 to 24 h, will help identify the appropriate trajectory pattern, similar to serum troponin collection during acute coronary syndrome. The ferritin and D-dimer pneumogram should consider the start point of COVID-19 infection as the development of a fever.