Next Article in Journal
Home and Out-of-Hospital Therapy with COVID-19 Convalescent Plasma in Europe
Previous Article in Journal
Autocatalytic Sets Arising in a Combinatorial Model of Chemical Evolution
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Brief Report

Longitudinal Cluster Analysis of Hemodialysis Patients with COVID-19 in the Pre-Vaccination Era

1
Department of Internal Medicine, University of Genoa, 16132 Genova, Italy
2
Unit of Nephrology, Dialysis and Transplantation, IRCCS Ospedale Policlinico San Martino, 16132 Genova, Italy
3
Dipartimento di Matematica (MIDA), Università di Genova, 16132 Genova, Italy
4
Biotherapy Unit, IRCCS Ospedale Policlinico San Martino, 16132 Genova, Italy
5
Department of Internal Medicine-Centre of Excellence for Biomedical Research, University of Genova, 16132 Genova, Italy
6
IRCCS Ospedale Policlinico San Martino, 16132 Genova, Italy
7
Unit of Nephrology, Dialysis and Transplantation, Fondazione IRCCS Policlinico San Matteo, 27100 Pavia, Italy
8
Life Science Computational Laboratory (LISCOMP), IRCCS Ospedale Policlinico San Martino, 16132 Genova, Italy
*
Author to whom correspondence should be addressed.
Life 2022, 12(11), 1702; https://doi.org/10.3390/life12111702
Submission received: 27 September 2022 / Revised: 22 October 2022 / Accepted: 23 October 2022 / Published: 26 October 2022
(This article belongs to the Section Epidemiology)

Abstract

:

Simple Summary

Clinical presentation and outcomes of Coronavirus disease 2019 (COVID-19) are very heterogeneous. Among the different populations affected by COVID-19, special attention should be given to patients undergoing maintenance hemodialysis. Indeed, these patients present some peculiar characteristics that may influence disease course, leading to elevated morbidity and mortality. Furthermore, in hemodialysis patients, clinical presentations and disease severity may vary widely. Therefore, the identification of clinical and laboratory factors useful to stratify the risk of these patients could be of help in guiding clinical decision making. Starting from this observation, in this study, we tested and validated, in two cohorts of hemodialysis patients with COVID-19, an innovative analytical procedure that combines linear mixed effect modeling and cluster analysis on longitudinal data. The application of this strategy allowed patient stratification from simple and widely available data. Our results could contribute to improving COVID-19 management and supporting the implementation of longitudinal cluster analysis strategy in other clinical settings.

Abstract

Coronavirus disease 2019 (COVID-19) in hemodialysis patients (HD) is characterized by heterogeneity of clinical presentation and outcomes. To stratify patients, we collected clinical and laboratory data in two cohorts of HD patients at COVID-19 diagnosis and during the following 4 weeks. Baseline and longitudinal values were used to build a linear mixed effect model (LME) and define different clusters. The development of the LME model in the derivation cohort of 17 HD patients (66.7 ± 12.3 years, eight males) allowed the characterization of two clusters (cl1 and cl2). Patients in cl1 presented a prevalence of females, higher lymphocyte count, and lower levels of lactate dehydrogenase, C-reactive protein, and CD8 + T memory stem cells as a possible result of a milder inflammation. Then, this model was tested in an independent validation cohort of 30 HD patients (73.3 ± 16.3 years, 16 males) assigned to cl1 or cl2 (16 and 14 patients, respectively). The cluster comparison confirmed that cl1 presented a milder form of COVID-19 associated with reduced disease activity, hospitalization, mortality rate, and oxygen requirement. Clustering analysis on longitudinal data allowed patient stratification and identification of the patients at high risk of complications. This strategy could be suitable in different clinical settings.

1. Introduction

The heterogeneity of clinical presentation and outcomes is a common feature of Coronavirus disease 2019 (COVID-19) [1]. This observation is also valid for maintenance hemodialysis (HD) patients, representing a distinct subgroup of COVID-19 patients [2,3]. Indeed, these patients have some features that may influence and differentiate disease course, including distinct renal failure etiologies, a high prevalence of comorbidities, frailty, and specific assets of the immune system. Moreover, the variability of dialysis programs and therapeutic regimens, such as logistical factors related to the organization of dialysis facilities, may significantly impact the dissemination and evolution of the infection in this setting [4]. Taking advantage of these peculiarities, we investigated the presence of different COVID-19 patient subtypes in this population using an innovative analytical procedure that combines linear mixed effect modeling and cluster analysis. Cluster analysis is an unsupervised learning algorithm that works by organizing patients into groups based on common characteristics [5]. This analysis aimed to stratify the patients, thus providing information to predict outcomes and address clinical management. Here, we present our experience with the application of cluster analysis on a small group of HD patients affected by COVID-19. We decided to consider only the first pandemic waves of 2020 to evaluate the natural history of the infection without potential confounding factors, such as vaccinations and antivirals.

2. Materials and Methods

2.1. Patients

For this study, we recruited 3-weekly maintenance hemodialysis patients with confirmed COVID-19 infection, from 16 March to 30 April 2020 (validation cohort) and from 1 October to 1 December 2020 (derivation cohort), enrolled at San Martino University Hospital of Genoa, Italy. To avoid selection bias, we enrolled consecutive patients in each period. Nasopharyngeal swabs for severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) were performed on HD patients presenting recent contact with COVID-19-positive patients and/or with fever or respiratory or gastrointestinal symptoms suspected of COVID-19. The diagnosis was confirmed by positive real-time reverse transcriptase (RT-PCR) assay for SARS-CoV-2. After the diagnosis, clinical management decisions were left to the attending physicians. Maintenance HD patients who tested negative for SARS-CoV-2 PCR constituted the control group.

2.2. Data Collection

We retrospectively collected clinical data, general laboratory, cytokine determinations, and cytofluorimetric analysis of activation status and detailed maturation of CD4+ and CD8+ lymphocyte subpopulations (Table S1) [6] at the time of COVID-19 diagnosis. Disease presentation severity was scored as 0 (asymptomatic patients), 1 (mildly symptomatic, i.e., only one symptom, including fever, cough, dysgeusia, or diarrhea), or 2 (fully symptomatic, i.e., fever + one additional symptom). In the derivation cohort, data were also collected in the first 4 weeks after the diagnosis (days 7, 14, 21, and 28), with each subject having at least two time points regarding the acquisition of each biomarker.

2.3. Cytokine Determinations

For cytokine determinations, blood samples collected in polypropylene tubes were centrifuged at 3200 rpm for 10 min. Plasma was separated by centrifugation and stored at −20 °C until assayed. Then, circulating cytokine levels were determined by an Ella automated Immunoassay platform (Protein Simple, Minneapolis, MN, USA).

2.4. Immunofluorescence Analyses

Immunofluorescence analyses were performed as previously described [7]. Prior to analysis, 100 µL of peripheral blood was incubated with specific fluorochrome-conjugated monoclonal antibodies (purchased from BD Biosciences), as indicated in Table S2. The samples were analyzed by a BD Fortessa X20 flow cytometer (BD Biosciences San Diego, CA, USA) using the BD FACS Diva™ software version 8.0 (BD Biosciences). Briefly, the differential expression of these markers allows the identification of six subsets in the human peripheral blood: naive (TN), stem cell memory (TSCM), central memory (TCM), transitional memory (TTM), effector memory (TEM), and terminal effector (TTE).

2.5. General Statistical Methods

Data are presented as mean ± standard deviation (SD), or interquartile ranges (IQR) if not normally distributed (evaluated by Shapiro test). A Mann–Whitney test was used to assess the differences among patients affected by COVID-19 and the control group, and among patients of different clusters. Proportions for categorical variables were compared using Fisher’s test. A two-tailed p value < 0.05 was considered statistically significant.

2.6. Statistical Methods for Clustering Analysis

To investigate the presence of different COVID-19 HD subtypes, we used an innovative model that combines linear mixed effect modeling and cluster analysis. The model was implemented in R (https://www.rstudio.com/ (accessed on 11 May 2022)). First, in the derivation cohort, we performed a feature reduction procedure by selecting the 10 features whose distribution showed, at the same time, the highest intergroup difference (COVID-19 HD and HD without COVID-19) and the lowest intragroup variability using a Mann–Whitney test [8], and for which at least 90% of COVID-19 HD patients had a baseline acquisition. Then, we used a linear mixed effect model (LME) [9,10] with two fixed and two random effects (both slope and intercept) on the COVID-19 HD longitudinal data. For each individual and each of the 10 selected features, the estimated fixed and random effects describe the feature progression on the overall cohort (the fixed effect) and the individual variation (the random effect). Then, to identify potential different clusters of COVID-19 HD patients, we performed a k-means clustering analysis [11] using as the input the 20 features returned by the LME model (slope and intercept for each of the 10 features) and distinguish the different clusters, following a silhouette analysis with Dice distance.
Finally, the most significant clinical and laboratory parameters discriminating patients of the derivation cohort were used in a validation cohort to compute the Euclidean distances between each validation subject and the cluster centroids to assign each patient to different clusters, which are compared in the following section.

3. Results

3.1. Longitudinal Clustering of the Derivation Cohort

The derivation cohort (DC) constituted 17 HD patients (66.7 ± 12.3 years, eight males) with a molecular diagnosis of COVID-19 during the second pandemic wave of 2020 (October–December). In this cohort, overall, we collected 51 longitudinal observations over the four time points (days 7, 14, 21, and 28) on the 17 subjects. First, we performed a feature reduction procedure. Specifically, the baseline values of COVID-19 HD were compared with a sex–age-matched control group of six HD patients (70.0 ± 9.4 years, three males) (Table S3). As explained in detail in the Materials and Methods section, according to the baseline evaluation, we selected the 10 most significant variables to build the cluster analysis model. These variables included: C-reactive protein (CRP), white blood cell, neutrophil, and lymphocyte counts; albumin and ferritin serum levels; and interleukin (IL)-1β, IL-8, IL-6, and tumor necrosis factor (TNF)-α circulating levels. Then, we performed clustering analysis using an LME on the longitudinal data of all 17 patients of the DC (Figure S1 shows the individual LME fits). This analysis returned two well-balanced (seven vs. nine patients) clusters (Cluster 1 and Cluster 2, respectively, Figure 1A) and one cluster consisting of one subject, which was excluded from further analysis. The centroid profiles of Clusters 1 and 2 (Figure 1B) showed a significant difference in the value of lymphocyte count (p = 0.012). The comparison between the two clusters showed that within Cluster 1, there were more female patients (p = 0.01) presenting significantly higher lymphocyte counts (Cluster 1, 1.1 ± 0.5 vs. Cluster 2, 0.4 ± 0.2 x109/L; p = 0.012) and lower lactate dehydrogenase (LDH), CRP (Cluster 1, 22.2 ± 16.3 vs. Cluster 2, 72.6 ± 44.1 mg/dL; p = 0.017), and CD8 + T memory stem cells (Cluster 1, 0.6 ± 0.4 vs. Cluster 2, 2.3 ± 1.7%; p = 0.038, Figure S2) (Table 1).

3.2. Validation Cohort

We then tested our clustering strategy for predicting clinical outcomes in a validation cohort (VC), constituted of 30 HD patients (73.3 ± 16.3 years, 16 males) affected by COVID-19 during the first pandemic wave of 2020 (March–April).
According to those found in DC, we selected the most significant clinical and laboratory parameters that could be used to discriminated between the two clusters.
These factors included sex, age, neutrophil and lymphocyte percentages, CRP, procalcitonin, and LDH levels. CD8 + T memory stem cell count was not included at this stage because this analysis is not widely available.
As a standard analysis in cluster validation, and as described in the Materials and Methods section, we used these variables to assign each new patient to Cluster 1 or 2. Hence, in the VC, Cluster 1 or 2 constituted 16 and 14 subjects, respectively. The two clusters were comparable for general characteristics, even though in Cluster 1 there was a higher prevalence of female patients. Moreover, within Cluster 1, both biochemical profile and disease severity presentation were significantly milder than in Cluster 2 (clinical severity score distribution Cluster 1 vs. Cluster 2, p = 0.018). Similarly, we found significant differences in patient outcomes. In particular, in Cluster 1, three patients required hospitalization (18%), one patient died (6.5%), and none required high-flow oxygen therapy, whereas, in Cluster 2, nine patients were hospitalized (64%), five died (35%), and four patients (28%) required high-flow oxygen supply (p = 0.02, p = 0.07, and p = 0.02, respectively). Finally, there were no significant differences in the duration of the infection (Table 2). The positive predictive values (PPV) and negative predictive values (NPV) associated with key clinical parameters are shown in Table 3.

4. Discussion

In this study, we found that cluster analysis on longitudinal data provided information about the differentiation of baseline characteristics and overall disease progression in HD patients with COVID-19. In particular, the application of this analysis strategy in our DC resulted in the identification of two distinct clusters: Cluster 1 and Cluster 2. The comparison of these clusters showed that patients included in Cluster1 presented a better biochemical profile, as proven by a lower incidence of lymphopenia and lower levels of inflammatory and immune activation markers, such as CRP and CD8 + T memory stem cells.
Interestingly, within Cluster 1, there was also a higher prevalence of female patients, who, according to data from several studies, often develop a less severe form of COVID-19 [12]. Moreover, when the model developed on the DC was tested in the VC, we found confirmation that biochemical differences between the two clusters had relevant clinical implications, as also shown by the calculation of the predictive values. Thus, we confirmed that the patients included in Cluster 1 had milder disease severity and better outcomes in terms of rate of hospitalization, death, and oxygen requirement.
Regarding the inflammatory parameters, the analysis of serum albumin levels deserves special consideration. It is well known that serum albumin concentration is negatively influenced by acute diseases, and in COVID-19 patients, low albumin levels have been related to poor prognosis [13]. In our cohort, we observed that while albumin level was lower in COVID-19 patients than in controls, no differences were found when comparing the patients assigned to the different clusters.
Possible explanations for these findings may be that inflammation is not the only determinant of serum albumin, and nutritional status and dialysis adequacy may influence it [14]. Therefore, COVID-19 might not have impacted all these aspects. However, it cannot be ruled out that these data may result from the small size of our cohort.
Cluster analysis has been already applied to the general population of patients affected by COVID-19 in different clinical contexts. San-Cristobal R et al. used this methodology to predict disease severity within 72 h of hospital admission in patients with confirmed COVID-19 [15]. By collecting routine clinical and laboratory variables, the authors were able to describe three distinct clusters that showed a strong association with different clinical outcomes. Similar results were also observed in larger studies where more complex and multiple factors, including clinical, biochemical, and immune parameters, were considered. Additionally in these cases, cluster analysis resulted in a relevant capacity to discriminate distinct patient profiles associated with different clinical presentations and outcomes [16,17,18]. Interestingly, some authors found that among complex parameters, simple indicators, such as standard laboratory examinations (CRP, LDH, etc.), and symptoms, such as fever, fatigue, as well as, nausea and vomiting, may have a role in indicating the disease severity and prognosis [16].
Beyond clinical studies, analytical models based on cluster analysis in the setting of COVID-19 have also been used in the interpretation of big data collected at both a national and international level, taking advantage of methods developed in computer science and machine learning. In these cases, cluster analysis was extended to evaluate health system performances and the social and economic impact of the pandemic [19,20,21]. Overall, these studies, despite different patient populations and designs, have shown the feasibility and utility of cluster analysis in the management of data recorded during the COVID-19 pandemic. In addition to that already published, here we report the first example of the application of cluster analysis on longitudinal data collected from COVID-19 patients undergoing maintenance HD. The rationale for testing this strategy in this population is that HD patients constitute a peculiar subgroup of COVID-19 patients. Indeed, these patients present some distinct characteristics, including a high prevalence of comorbidity, specific assets of the immune system, and factors related to the organization of dialysis facilities that make them susceptible to severe infections [22,23]. COVID-19 was not an exception, and many reports have shown that HD patients were burdened by significantly higher morbidity and mortality when compared with the general population [24,25].
However, in HD patients, clinical presentation and severity of COVID-19 may vary widely. Therefore, the characterization of clinical and laboratory factors useful to stratify these patients could help the clinical decision process. Our results show that this is possible using cluster analysis, which may be functional in identifying the patients at high risk of complications deserving special attention. Most importantly, this methodology, by the initial analysis of a complex dataset, allowed us to build a final predictive model based on simple clinical and laboratory parameters.
We are aware of the limitations of this study, mainly due to the small number of subjects evaluated that probably influenced our ability to discriminate laboratory and clinical differences between the clusters. On the other hand, this aspect could be partially compensated by the fact that we focused our analysis on a specific and well-characterized population. Moreover, we know that COVID-19 is a rapidly evolving disease and, as for the general population, including HD patients, the introduction of vaccinations and antiviral drugs and the emergence of new virus variants have significantly changed the natural history of the disease [26]. Thus, conceivably, the model presented here is not directly applicable in the current context. Nevertheless, we think that beyond rough data, our findings prove the effectiveness of a methodology that may have a wide application.
In particular, the implementation of cluster analysis may be valuable, both for scientific and clinical purposes. Indeed, it may reveal hidden associations among laboratory, instrumental, demographic, and clinical factors, thus improving our understanding of disease pathophysiology, such as inflammation in COVID-19 patients. On the other hand, the capacity to assign patients to “low-risk” and “high-risk” categories may influence clinical management and therapeutic choices. Finally, all these effects may assist in the allocation of resources [27].

5. Conclusions

Cluster analysis on longitudinal data was effective at stratifying HD patients with COVID-19 starting from the analysis of a few laboratory and clinical variables. These findings support the application of this technique as a suitable tool to translate research findings into clinical practice, possibly in large cohorts, not only in the management of COVID-19, but also in other clinical settings.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/life12111702/s1. Figure S1: Individual linear mixed model fits. Figure S2: Representative examples of the analysis strategy of CD8 + T cell differentiation. Table S1: Full dataset of baseline clinical and laboratory parameters collected from COVID-19-positive and COVID-19-negative HD patients. Table S2: Reagents used for immunofluorescence analyses. Table S3: Comparisons between baseline characteristics of COVID-19-positive and COVID-19-negative HD patients.

Author Contributions

P.E., S.G. and D.F. (Daniela Fenoglio) designed the study; L.C., A.P., T.V., T.A. and F.G. collected the data; I.C., C.C. and D.F. (Diego Franciotta) analyzed the data; F.V., G.F. and M.P. drafted and revised the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was performed according to the Declaration of Helsinki and was approved by the local Ethics Committee (N. Registro CER Liguria: 135/2020).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data underlying this article are available in Harvard Dataverse, at https://doi.org/10.7910/DVN/KDKR4V (accessed on 16 September 2022).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Wiersinga, W.J.; Rhodes, A.; Cheng, A.C.; Peacock, S.J.; Prescott, H.C. Pathophysiology, Transmission, Diagnosis, and Treatment of Coronavirus Disease 2019 (COVID-19): A Review. JAMA 2020, 324, 782–793. [Google Scholar] [CrossRef] [PubMed]
  2. Lusczek, E.R.; Ingraham, N.E.; Karam, B.S.; Proper, J.; Siegel, L.; Helgeson, E.S.; Lotfi-Emran, S.; Zolfaghari, E.J.; Jones, E.; Usher, M.G.; et al. Characterizing COVID-19 clinical phenotypes and associated comorbidities and complication profiles. PLoS ONE 2021, 16, e024895. [Google Scholar] [CrossRef] [PubMed]
  3. Hsu, C.M.; Weiner, D.E.; Aweh, G.; Miskulin, D.C.; Manley, H.J.; Stewart, C.; Ladik, V.; Hosford, J.; Lacson, E.C.; Johnson, D.S.; et al. COVID-19 Among US Dialysis Patients: Risk Factors and Outcomes from a National Dialysis Provider. Am. J. Kidney Dis. 2021, 77, 748–756.e1. [Google Scholar] [CrossRef] [PubMed]
  4. Lano, G.; Braconnier, A.; Bataille, S.; Cavaille, G.; Moussi-Frances, J.; Gondouin, B.; Bindi, P.; Nakhla, M.; Mansour, J.; Halin, P.; et al. Risk factors for severity of COVID-19 in chronic dialysis patients from a multicentre French cohort. Clin. Kidney J. 2020, 13, 878–888. [Google Scholar] [CrossRef]
  5. Sinharay, S. International Encyclopedia of Education, 3rd ed.; Elsevier Science: Amsterdam, The Netherlands, 2010. [Google Scholar]
  6. Mahnke, Y.D.; Brodie, T.M.; Sallusto, F.; Roederer, M.; Lugli, E. The who’s who of T-cell differentiation: Human memory T-cell subsets. Eur. J. Immunol. 2013, 43, 2797–2809. [Google Scholar] [CrossRef]
  7. Fenoglio, D.; Dentone, C.; Parodi, A.; Di Biagio, A.; Bozzano, F.; Vena, A.; Fabbi, M.; Ferrera, F.; Altosole, T.; Bruzzone, B.; et al. Characterization of T lymphocytes in severe COVID-19 patients. J. Med. Virol. 2021, 93, 5608–5613. [Google Scholar] [CrossRef]
  8. Rey, D.; Neuhäuser, M. Wilcoxon-Signed-Rank Test. In International Encyclopedia of Statistical Science; Lovric, M., Ed.; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
  9. Fox, J. Applied Regression Analysis and Generalized Linear Models, 2nd ed.; Sage Publications: Thousand Oaks, CA, USA, 2008. [Google Scholar]
  10. Galecki, A.; Burzykowski, T. Linear Mixed-Effects Models Using R; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
  11. Jin, X.; Han, J. K-Means Clustering. In Encyclopedia of Machine Learning; Sammut, C., Webb, G.I., Eds.; Springer: Boston, MA, USA, 2011. [Google Scholar]
  12. Noor, F.M.; Islam, M.M. Prevalence and Associated Risk Factors of Mortality Among COVID-19 Patients: A Meta-Analysis. J. Community Health 2020, 45, 1270–1282. [Google Scholar] [CrossRef]
  13. Li, J.; Li, M.; Zheng, S.; Li, M.; Zhang, M.; Sun, M.; Li, X.; Deng, A.; Cai, Y.; Zhang, H. Plasma albumin levels predict risk for nonsurvivors in critically ill patients with COVID-19. Biomark. Med. 2020, 14, 827–837. [Google Scholar] [CrossRef]
  14. Thijssen, S.; Wystrychowski, G.; Usvyat, L.; Kotanko, P.; Levin, N.W. Determinants of serum albumin concentration analyzed in a large cohort of patients on maintenance hemodialysis. J. Ren. Nutr. 2007, 17, 70–74. [Google Scholar] [CrossRef]
  15. San-Cristobal, R.; Martín-Hernández, R.; Ramos-Lopez, O.; Martinez-Urbistondo, D.; Micó, V.; Colmenarejo, G.; Villares Fernandez, P.; Daimiel, L.; Martínez, J.A. Longwise Cluster Analysis for the Prediction of COVID-19 Severity within 72 h of Admission: COVID-DATA-SAVE-LIFES Cohort. J. Clin. Med. 2022, 11, 3327. [Google Scholar] [CrossRef] [PubMed]
  16. Han, L.; Shen, P.; Yan, J.; Huang, Y.; Ba, X.; Lin, W.; Wang, H.; Huang, Y.; Qin, K.; Wang, Y.; et al. Exploring the Clinical Characteristics of COVID-19 Clusters Identified Using Factor Analysis of Mixed Data-Based Cluster Analysis. Front. Med. 2021, 8, 644724. [Google Scholar] [CrossRef]
  17. Bergamaschi, L.; Mescia, F.; Turner, L.; Hanson, A.L.; Kotagiri, P.; Dunmore, B.J.; Ruffieux, H.; De Sa, A.; Huhn, O.; Morgan, M.D.; et al. Longitudinal analysis reveals that delayed bystander CD8+ T cell activation and early immune pathology distinguish severe COVID-19 from mild disease. Immunity 2021, 54, 1257–1275.e8. [Google Scholar] [CrossRef] [PubMed]
  18. Kenny, G.; McCann, K.; O’Brien, C.; Savinelli, S.; Tinago, W.; Yousif, O.; Lambert, J.S.; O’Broin, C.; Feeney, E.R.; De Barra, E.; et al. Identification of Distinct Long COVID Clinical Phenotypes Through Cluster Analysis of Self-Reported Symptoms. Open Forum Infect. Dis. 2022, 9, ofac060. [Google Scholar] [CrossRef] [PubMed]
  19. Nicholson, C.; Beattie, L.; Beattie, M.; Razzaghi, T.; Chen, S. A machine learning and clustering-based approach for county-level COVID-19 analysis. PLoS ONE 2022, 17, e0267558. [Google Scholar] [CrossRef] [PubMed]
  20. Pérez-Ortega, J.; Almanza-Ortega, N.N.; Torres-Poveda, K.; Martínez-González, G.; Zavala-Díaz, J.C.; Pazos-Rangel, R. Application of Data Science for Cluster Analysis of COVID-19 Mortality According to Sociodemographic Factors at Municipal Level in Mexico. Mathematics 2022, 10, 2167. [Google Scholar] [CrossRef]
  21. Sadeghi, B.; Cheung, R.C.Y.; Hanbury, M. Using hierarchical clustering analysis to evaluate COVID-19 pandemic preparedness and performance in 180 countries in 2020. BMJ Open 2021, 11, e049844. [Google Scholar] [CrossRef]
  22. Betjes, M.G.; Meijers, R.W.; Litjens, N.H. Loss of renal function causes premature aging of the immune system. Blood Purif. 2013, 36, 173–178. [Google Scholar] [CrossRef]
  23. Sepe, V.; Gregorini, M.; Rampino, T.; Esposito, P.; Coppo, R.; Galli, F.; Libetta, C. Vitamin e-loaded membrane dialyzers reduce hemodialysis inflammaging. BMC Nephrol. 2019, 20, 412. [Google Scholar] [CrossRef]
  24. El Karoui, K.; De Vriese, A.S. COVID-19 in dialysis: Clinical impact, immune response, prevention, and treatment. Kidney Int. 2022, 101, 883–894. [Google Scholar] [CrossRef]
  25. Goffin, E.; Candellier, A.; Vart, P.; Noordzij, M.; Arnol, M.; Covic, A.; Lentini, P.; Malik, S.; Reichert, L.J.; Sever, M.S.; et al. COVID-19-related mortality in kidney transplant and haemodialysis patients, a comparative, prospective registry-based study. Nephrol. Dial. Transplant. 2021, 36, 2094–2105. [Google Scholar] [CrossRef]
  26. Esposito, P.; Picciotto, D.; Cappadona, F.; Russo, E.; Falqui, V.; Conti, N.E.; Parodi, A.; Mallia, L.; Cavagnaro, S.; Battaglia, Y.; et al. The Evolving Scenario of COVID-19 in Hemodialysis Patients. Int. J. Environ. Res. Public Health 2022, 19, 10836. [Google Scholar] [CrossRef] [PubMed]
  27. Loftus, T.J.; Shickel, B.; Balch, J.A.; Tighe, P.J.; Abbott, K.L.; Fazzone, B.; Anderson, E.M.; Rozowsky, J.; Ozrazgat-Baslanti, T.; Ren, Y.; et al. Phenotype clustering in health care: A narrative review for clinicians. Front. Artif. Intell. 2022, 5, 842306. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Cluster analysis of longitudinal data performed on the derivation cohort. Top panel (A): result of cluster analysis. Bottom panel (B): centroid profiles of Clusters 1 and 2. Abbreviations: white blood cell count, WBC; C-reactive protein, CRP; interleukin, IL; tumor necrosis factor-alfa, TNF-α; intercept, i; slope, s.
Figure 1. Cluster analysis of longitudinal data performed on the derivation cohort. Top panel (A): result of cluster analysis. Bottom panel (B): centroid profiles of Clusters 1 and 2. Abbreviations: white blood cell count, WBC; C-reactive protein, CRP; interleukin, IL; tumor necrosis factor-alfa, TNF-α; intercept, i; slope, s.
Life 12 01702 g001
Table 1. Baseline characteristics of the derivation cohort of COVID-19-positive HD patients according to cluster assignment.
Table 1. Baseline characteristics of the derivation cohort of COVID-19-positive HD patients according to cluster assignment.
Cluster 1Cluster 2p
N78 *
Age, years68.1 ± 15.664.4 ± 10.70.7
Sex, M/F1/67/10.01
Dialysis vintage, months32.6 ± 19.672.8 ± 56.30.1
WBC, ×109/L5.4 ± 1.34.4 ± 1.50.33
Lymphocytes, ×109/L1.1 ± 0.50.4 ± 0.20.012
Neutrophils, ×109/L3.8 ± 1.43.5 ± 1.51
Lymphocytes, (% WBC)21.0 ± 7.511.8 ± 4.90.03
Neutrophils, (% WBC)66.6 ± 13.878.6 ± 6.20.06
CD8 + TSCM/CD8+0.6 ± 0.42.3 ± 1.70.038
Albumin, g/L34.6 ± 3.533.1 ± 4.20.6
LDH, U/L201.2 ± 42.3323.0 ± 88.80.012
Procalcitonin, ng/ml0.9 ± 0.53.3 ± 3.60.02
CRP, mg/L22.2 ± 16.372.6 ± 44.10.017
* Baseline data were available for 8/9 COVID-19 HD patients included in Cluster 2. Abbreviations: hemodialysis, HD; white blood cells, WBC; C-reactive protein, CRP; lactate dehydrogenase, LDH; T memory stem cells, TSCM.
Table 2. Validation cohort of cluster analysis performed using COVID-19-positive HD patients: General characteristic and comparison between patients assigned to different clusters.
Table 2. Validation cohort of cluster analysis performed using COVID-19-positive HD patients: General characteristic and comparison between patients assigned to different clusters.
All PatientsCluster 1Cluster 2p
Cluster 1 vs. 2
N301614
Age, years 73.3 ± 16.375.7 ± 15.570.4 ± 17.30.4
Sex, M/F (M%)16/14 (53)6/10 (37)10/4 (71)0.08
Dialysis vintage, months 36 (14–71)43 (28–75)27 (7–64)0.1
WBC, ×109/L6.5 ± 4.95.4 ±5.17.7± 4.60.02
Lymphocytes, ×109/L0.7 ± 0.30.9± 0.30.6 ± 0.30.03
Neutrophils, ×109/L5.0 ± 4.33.6 ± 3.96.6± 4.40.01
Lymphocytes, (% WBC)15.4 ± 9.521.2 ± 9.59.1 ± 4.3<0.0001
Neutrophils, (% WBC)70.1 ± 19.759.3 ± 20.882.5 ± 6.9<0.0001
LDH, U/L234.0 ± 72.1207.9 ± 49.1260.1 ± 83.40.09
Albumin, g/L33.0 ±4 233.9 ± 232.8 ± 2.20.3
Procalcitonin, ng/ml1.4 (0.6–4.6)0.8 (0.4–1.6)3.6 (1.2–9.5)0.07
CRP, mg/L14.7 (4.0–39.0)8.0 (2.8–20.1)32.6 (12.1–50.1)0.01
Clinical severity score > 0, n (%)27 (90)13 (80)14 (100)0.04
Clinical severity score, n
- 0
- 1
- 2
3
20
7
3
12
1
0
8
6
0.018
Outcomes
Hospitalization, n (%)12 (40)3 (18)9 (64)0.02
Death, n (%)6 (20)1 (6.2)5 (35)0.07
High-flow oxygen therapy, n (%)4 (13)04 (28)0.02
Duration of SARS-CoV-2 infection, days21 (13–28)22 (14.5–35)20 (11–31)0.38
Abbreviations: hemodialysis, HD; white blood cells, WBC; C-reactive protein, CRP; lactate dehydrogenase, LDH.
Table 3. The positive predictive values (PPV) and negative predictive values (NPV) associated with the key clinical parameters in the validation cohort.
Table 3. The positive predictive values (PPV) and negative predictive values (NPV) associated with the key clinical parameters in the validation cohort.
NPVPPV
Clinical severity score > 00.19 (3/16)1 (14/14)
Hospitalization0.81 (13/16)0.64 (9/14)
Death0.94 (15/16)0.36 (5/14)
High-flow oxygen therapy1 (16/16)0.29 (4/14)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Esposito, P.; Garbarino, S.; Fenoglio, D.; Cama, I.; Cipriani, L.; Campi, C.; Parodi, A.; Vigo, T.; Franciotta, D.; Altosole, T.; et al. Longitudinal Cluster Analysis of Hemodialysis Patients with COVID-19 in the Pre-Vaccination Era. Life 2022, 12, 1702. https://doi.org/10.3390/life12111702

AMA Style

Esposito P, Garbarino S, Fenoglio D, Cama I, Cipriani L, Campi C, Parodi A, Vigo T, Franciotta D, Altosole T, et al. Longitudinal Cluster Analysis of Hemodialysis Patients with COVID-19 in the Pre-Vaccination Era. Life. 2022; 12(11):1702. https://doi.org/10.3390/life12111702

Chicago/Turabian Style

Esposito, Pasquale, Sara Garbarino, Daniela Fenoglio, Isabella Cama, Leda Cipriani, Cristina Campi, Alessia Parodi, Tiziana Vigo, Diego Franciotta, Tiziana Altosole, and et al. 2022. "Longitudinal Cluster Analysis of Hemodialysis Patients with COVID-19 in the Pre-Vaccination Era" Life 12, no. 11: 1702. https://doi.org/10.3390/life12111702

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop