Next Article in Journal
Fragmented QRS in Lateral Leads on Electrocardiography Is Associated with Cardiac Dysfunction and Left Ventricular Dilation in Duchenne Muscular Dystrophy
Next Article in Special Issue
Evaluation of suPAR as a Key Prognostic Biomarker in Patients with SARS-CoV-2 Pneumonia
Previous Article in Journal
Electroacupuncture Attenuates Intestinal Barrier Disruption via the α7nAChR-Mediated HO-1/p38 MAPK/NF-κB Pathway in a Mouse Model of Metabolic Dysfunction-Associated Fatty Liver Disease: A Randomized, Single-Blind, Controlled Trial
Previous Article in Special Issue
Retrospective Clinical Investigation into the Association Between Abnormal Blood Clotting, Oral Anticoagulant Therapy, and Medium-Term Mortality in a Cohort of COVID-19 Patients
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Effect of Naturally Acquired Immunity on Mortality Predictors: A Focus on Individuals with New Coronavirus

1
Autoimmunity and Inflammation Research Group, Río Hortega University Hospital, 47012 Valladolid, Spain
2
Cooperative Research Network Focused on Health Results—Advanced Therapies (RICORS TERAV), 28220 Madrid, Spain
3
Medical Analysis Expert Group, Institute of Technology, University of Castilla-La Mancha, 13001 Cuenca, Spain
4
Medical Analysis Expert Group, Instituto de Investigación Sanitaria de Castilla-La Mancha (IDISCAM), 45071 Toledo, Spain
5
Internal Medicine, Río Hortega University Hospital, 47012 Valladolid, Spain
*
Authors to whom correspondence should be addressed.
Biomedicines 2025, 13(4), 803; https://doi.org/10.3390/biomedicines13040803
Submission received: 18 February 2025 / Revised: 19 March 2025 / Accepted: 24 March 2025 / Published: 27 March 2025

Abstract

:
Background/Objectives: The spread of the COVID-19 pandemic has spurred the development of advanced healthcare tools to effectively manage patient outcomes. This study aims to identify key predictors of mortality in hospitalized patients with some level of natural immunity, but not yet vaccinated, using machine learning techniques. Methods: A total of 363 patients with COVID-19 admitted to Río Hortega University Hospital in Spain between the second and fourth waves of the pandemic were included in this study. Key characteristics related to both the patient’s previous status and hospital stay were screened using the Random Forest (RF) machine learning technique. Results: Of the 19 variables identified as having the greatest influence on predicting mortality, the most powerful ones could be identified at the time of hospital admission. These included the assessment of severity in community-acquired pneumonia (CURB-65) scale, age, the Glasgow Coma Scale (GCS), and comorbidities, as well as laboratory results. Some variables associated with hospitalization and intensive care unit (ICU) admission (acute renal failure, shock, PRONO sessions and the Acute Physiology and Chronic Health Evaluation [APACHE-II] scale) showed a certain degree of significance. The Random Forest (RF) method showed high accuracy, with a precision of >95%. Conclusions: This study shows that natural immunity generates significant changes in the evolution of the disease. As has been shown, machine learning models are an effective tool to improve personalized patient care in different periods.

1. Introduction

When a sufficient number of individuals are immunized against a pathogen, the probability of transmission between the infected and susceptible population decreases due to the interruption of the chain of transmission [1]. In the case of a contagious, direct contact-transferable pathogen that induces long-term immunity, this is maximized if the population has a random pattern of interaction [2]. Diseases such as rubella, measles or pertussis are under control, and others such as smallpox are even eradicated today thanks to herd immunity, either by natural means or through vaccination strategies [3].
Natural immunity, which develops after infection with COVID-19, and vaccine-induced immunity, achieved through controlled exposure to viral antigens, are key mechanisms for protection against the virus. While natural immunity depends on factors like the initial viral load and may be less predictable, vaccines offer more uniform protection and mitigate the risks associated with severe disease [4]. Moreover, recent studies suggest that the combination of both types of immunity can provide more robust and long-lasting protection [5].
This concept gained significant visibility during the coronavirus disease-2019 (COVID-19) pandemic, as the need to achieve herd immunity became a critical objective to minimize the clinical and societal impact of the health emergency. Early estimates indicated that 70% of the population would need immunization to achieve this goal [6]. Data from populations with full vaccination schedules suggest that this threshold was reached between the third and fourth waves of the pandemic (June–September 2021) in regions, such as Europe, Oceania, the Middle East, and North America, while Asia and Africa reached these rates later, between late 2021 and mid-2022 [7].
Well before vaccination campaigns began, however, a substantial percentage of the population had already acquired natural immunity. Studying the characteristics of individuals infected during this period—before vaccination but after the first wave—is of profound clinical interest. Understanding their natural immune responses, the variability in disease severity, treatment protocols, and virus epidemiology provides invaluable insights for managing future outbreaks or similar health crises.
Severe cases of COVID-19 are characterized by an inflammatory response and a cytokine storm that affects all cells of the immune system [8], especially lymphocytes. This has been shown to cause a dysregulation that can lead to an uncontrolled immune response, lung tissue damage or even multi-organ failure [9]. The initial symptoms of the disease, as reported at the onset of the pandemic, included fever, dyspnea, pneumonia and a dry cough [10,11,12]. Nevertheless, differences in clinical presentation across pandemic waves have been observed, such as reduced intensive care unit (ICU) admission rates [13] or decreased mortality [14,15]. It is evident that this progression can be attributed, firstly, to a heightened comprehension of the disease, facilitated by scientific research and the implementation of social policies. However, it is also associated with the modification of immunological characteristics at both the individual and population levels, owing to the progressive development of immunity, as a growing number of individuals recover from the infection and achieve immunological protection.
Comparing COVID-19 with earlier respiratory infections caused by coronaviruses, such as severe acute respiratory syndrome coronavirus (SARS-CoV) (2002–2003) and Middle Eastern respiratory syndrome coronavirus (MERS-CoV) (2012), offers additional perspective. Both SARS-CoV and MERS-CoV share similarities with SARS-CoV-2 in transmission routes [16] and clinical features, though with distinct differences in transmission rates and global impacts [17]. Lessons learned from studies on these viruses provide a framework for analyzing COVID-19 data and underscore the utility of advanced analytical techniques such as machine learning (ML).
In the context of data analysis from a limited sample of patients during a pandemic, advanced data analysis techniques are increasingly being used. Among artificial intelligence (AI) tools, machine learning (ML) has seen a surge in utilization in studies dealing with voluminous datasets. This branch of AI and computational analysis entails the employment of data and algorithms that emulate human information processing capabilities while enhancing efficiency [18]. Its efficacy has been demonstrated in several areas of knowledge, including biology [19] or medicine [20,21,22]. Furthermore, these machine learning tools have been extensively utilized in research related to the novel coronavirus, SARS-CoV-2, particularly in the study of mortality [23,24] and/or severity [25,26], the study of populations with specific characteristics (smokers [27], cancer patients [28,29], etc.) and biomarker analysis [30,31].
A notable benefit of leveraging ML models over conventional statistical tools is their capacity to generate precise predictions while exhibiting high levels of scalability and adaptability. This capability enables the identification of patterns within voluminous datasets. This is particularly relevant in the context of a public health problem that was not only unknown in its early stages but has shown a remarkable evolution over different time periods. Consequently, the objective of this study is to identify key predictors of mortality risk in a cohort of patients hospitalized for COVID-19 during a period when vaccines were not yet available but immunity obtained by natural means was present.
For this purpose, the specific ML model to be validated is the Random Forest (RF). This method might be utilized systematically as a risk evaluation tool in any population, enabling the derivation of conclusions in a relatively brief time frame with data from a limited number of patients (150–300). This approach would enable the development of customized protocols for diverse populations, such as health centers or cities, with the objective of optimizing resource utilization and ensuring a highly personalized healthcare experience.

2. Materials and Methods

2.1. Data Source and Description

The clinical data used in this study were taken from the electronic medical record system of the Río Hortega University Hospital in Valladolid (Spain). Data from 363 patients hospitalized with polymerase chain reaction (PCR)-confirmed COVID-19 infection between the beginning of the second wave (2 May 2020) and the end of the fourth wave (10 June 2021) were obtained from this platform. The information used in this study corresponds to the patients’ hospital stay from the time of admission to the emergency department until discharge from the hospital. Each patient was given an anonymous code to protect their privacy, and all patients gave informed consent. This study was conducted according to the principles of Helsinki and was approved by the Ethics Committee of the University Hospital of Rio Hortega.
Data were collected, retrospectively reviewed, and manually entered into a predesigned database. These data included demographics, comorbidities, chronic treatments, date of admission, date of discharge or death and cause, symptoms on admission, date of COVID-19 diagnosis and virus variant, if available, chest X-ray on admission, community acquired pneumonia severity scales (CURB-65), sequential organ failure assessment (SOFA), acute physiology and chronic health disease classification system (APACHE-II), Glasgow Coma Scale (GCS), vital signs, laboratory data, need for intensive care unit (ICU) admission and date (if applicable), need and type of ventilatory support, co-infections, nosocomial infections (specimen typing), complications during hospitalization, previous COVID-19 episodes, COVID-19 specific treatment, and number and type of COVID-19 vaccines received.
The laboratory tests whose results were used in this study were performed at the same hospital center using the following instruments: DXH900 Beckman Coulter Diagnosis (Brea, CA, USA) for whole blood samples, AU5820 Beckman Coulter Diagnosis for serum biochemistry samples and Gem5000 Werfen (Barcelona, Spain) for blood gasometry analysis. The data were entered into the aforementioned hospital electronic data storage system prior to their use in this study. The laboratory parameters considered relevant to this study were the following: leukocytes, neutrophils, lymphocytes, monocytes, eosinophils, basophils, erythrocytes, hemoglobin, hematocrit, mean corpuscular volume (M.C.V.), platelets, D-dimer, prothrombin activity (PT), international normalized ratio (I.N.R.), activated partial thromboplastin time (aPTT), aPTT ratio, derived fibrinogen, sodium, potassium, chloride, glucose, urea, creatinine, estimated glomerular filtration rate (CKD-EPI 2009), alanine aminotransferase (ALT/GPT), aspartate aminotransferase (AST/GOT), gamma glutamyl transferase (GGT), total bilirubin, alkaline phosphatase, lactate dehydrogenase (LDH), phosphate, C-reactive protein, procalcitonin, albumin, pH, pCO2, pO2, HCO3, FIO2, pO2/FIO2, O2 gradient Aa and lactate.
These variables were selected after a thorough literature review to identify the most critical factors during the pandemic. The focus was on those that could significantly aid in evaluating disease severity, immune response, and associated risk factors. Additionally, the chosen parameters were those that could be quickly and routinely measured both at the time of hospital admission and throughout the patient’s stay in the hospital.

2.2. Machine Learning Methods

In this study, the Random Forest algorithm was developed, an ensemble method that employs the bagging aggregation approach to construct multiple decision trees independently, thereby reducing variance and enhancing the robustness of the model. Random Forest is based on bootstrap sampling of the training data to build a set of independent trees, where each tree is trained on a random sample of the original dataset. Furthermore, at each tree node, a random subset of features is selected instead of considering all variables, introducing an additional source of randomness and minimizing correlation between the trees, thereby increasing the accuracy of the ensemble [32,33].
During the training process, each tree makes decisions independently, and the results are combined using a majority voting scheme for classification. Feature importance is calculated by analyzing the decrease in accuracy or Gini index when the values of a feature are randomly permuted, thereby identifying the most relevant variables for the model. Finally, the model’s performance was evaluated using specific metrics such as accuracy, sensitivity, specificity, and the area under the receiver operating characteristic (ROC) curve (AUC), enabling validation of its predictive capability and generalization to unseen data.
In the proposed Random Forest algorithm, a set of decision trees {T1, T2, …, Tm} is constructed using a bagging aggregation approach. To build each tree Ti, the following steps were performed:
Bootstrap sampling: Given a training dataset with n observations and p features, a subset of data Di is generated by selecting n random samples with replacement from the original dataset. This sampling technique allows certain data points to appear multiple times in Di, while others may not appear at all.
Random feature selection: At each node of every tree, instead of evaluating all p features, a random subset of k features is selected, where k = p . This reduces the correlation between individual trees, thereby increasing the model’s generalization ability.
Splitting criterion at each node: Each node is split according to an impurity reduction criterion, which can be either entropy or the Gini index for classification. In our study, we use the Gini index. The Gini impurity G of a node with a proportion pk of elements belonging to class k is defined as:
G = 1 k = 1 K p k 2
Combination of trees: Once the trees are trained, the prediction of the Random Forest is obtained through aggregation. For a set of trees {T1, T2, …, Tm}, the final prediction y ^ is calculated using majority voting:
y ^ = m o d e { T 1 x ,   T 2 x , , T m x }
Feature importance: The importance of each feature was measured by evaluating the change in the splitting criterion when the feature was randomly permuted in the dataset. For importance based on the Gini index, if permuting a specific feature increased the impurity of the tree nodes, that feature was considered important.
Model optimization and tuning: In order to improve the predictive performance of the Random Forest model and avoid overfitting, hyperparameter tuning was performed using a Bayesian optimization strategy. This approach was preferred over grid search due to its efficiency in navigating large parameter spaces with fewer evaluations. The optimization was carried out with a fivefold cross-validation strategy applied to the training data. The hyperparameters included the number of trees in the forest (n_estimators), the maximum depth of the trees (max_depth), the minimum number of samples required to split an internal node (min_samples_split), and the number of features considered at each split (max_features). The optimization objective was to maximize the average cross-validated AUC. This procedure allowed us to determine an optimal configuration that balances model complexity and generalization capacity, minimizing the risk of overfitting.
Model evaluation: The model’s performance was evaluated using metrics, such as accuracy, sensitivity, specificity, and the AUC.
In this study, the proposed method underwent extensive evaluation, comparing it with various machine learning techniques for classifying COVID-19 patients. The algorithms included in the comparative analysis were Gaussian Naive Bayes (GNB) [34,35], k-Nearest Neighbors (KNNs) [36,37], Bayesian Linear Discriminant Analysis (BLDA) [38,39], Support Vector Machines (SVM) [40], decision trees (DT) [41,42], and the novel system developed in this study, Random Forest (RF) [32,43,44]. The implementation and evaluation of the models were conducted using MATLAB’s Statistics and Machine Learning Toolbox (version 2024a).
To mitigate overfitting, a fivefold cross-validation strategy was applied. The data were split into two subsets, assigning 70% for training and 30% for testing, ensuring the independence of patient groups in each set. Figure 1 schematically illustrates the study workflow, which begins with patient selection and database creation, followed by the training and validation phases of the machine learning models.

3. Results

A total cohort of 363 patients was studied, corresponding to the second (135 patients), third (115 patients) and fourth (113 patients) waves of the pandemic, when mass vaccination campaigns had not yet begun. The study identified 40 deceased patients, resulting in an overall mortality rate of 11%.
The medical treatments and protocols used during the second to fourth waves of COVID-19 were stable during this period and followed hospital, local health and ministry of health protocols, based on WHO indications.
The information obtained is related to the patient’s previous condition, the time of admission, and also the evolution of the patient during the hospital stay. More detailed data on demographics, comorbidities, chronic treatments, symptoms on admission, ICU stay, need for ventilatory support or complications during hospital stay, among others, are shown in Table 1, Table 2, Table 3 and Table 4.
Several ML methods were used to identify risk patterns in a population with confirmed COVID-19 infection. The goal was to determine which algorithm provided the best predictive results. This study presents performance metrics for these ML methods, including balanced accuracy, recall, specificity, precision, Mathew’s correlation coefficient (MCC), F1 score, kappa, AUC, and degenerated Youden’s index (DYI), as shown in Table 5 and Table 6.
The data clearly indicate that the proposed method, RF, is the one with the highest acquisition and recall value. The RF algorithm consistently provides a positive predictive value of greater than 95%, demonstrating consistent performance.
The model training subset and the test subset both present high scores for all the metrics, with slightly lower scores for the test subset. This consistency is attributed to the algorithm reaching an optimal level of training without overfitting or underfitting. As shown in the radar plots in Figure 2, the RF model covers a larger area compared to the other methods tested, which is an example of a well-balanced model with high generalization capability, ensuring accurate outputs for new inputs.
Furthermore, the ROC curve was generated by plotting the sensitivity and specificity measures for each threshold to evaluate the classification capabilities of the different ML algorithms. The results are displayed in Figure 3. Again, the proposed RF-based system covers a larger area, indicating superior predictive accuracy.
Figure 4 shows the most clinically relevant parameters contributing to mortality in patients hospitalized for COVID-19 after the first wave and before mass vaccination, according to the proposed RF model. Listed in descending order of relevance, the aforementioned parameters are age, CURB-65 scale on admission, urea, procalcitonin, estimated glomerular filtrate, lymphocytes, C-reactive protein, acute renal failure, albumin, PRONO sessions, oxygen requirements, Glasgow Coma Scale, ischemic heart disease, invasive mechanical ventilation (IMV), creatinine, shock, noninvasive mechanical ventilation (NIMV), APACHE-II scale and chronic renal failure.

4. Discussion

Despite significant progress in the fight against COVID-19 and the time we have been coexisting with the virus after the global emergency, the pandemic remains a global concern. The emergence of new variants and the increase in cases during certain periods underscore the need to continue to research and improve our response strategies. In addition, the pandemic has highlighted weaknesses in health systems and the importance of researching more effective interventions and protocols to provide higher-quality care.
In this study, the analysis of the most relevant parameters for predicting the risk of mortality in hospitalized COVID-19 patients is organized into different categories for clarity and understanding. These categories include demographic factors, clinical features observed on admission, laboratory parameters, and variables monitored during hospitalization. This categorization highlights the complementary roles these parameters play in predicting mortality and enhances the study’s contribution to clinical decision making.
The most relevant parameters for predicting the risk of mortality in this group of hospitalized patients with COVID-19 during the period from the second to the fourth wave of the pandemic were identified. At that time, there was some immunity following infections in previous waves, and the population had not yet been vaccinated with more than one dose or had not yet been vaccinated at all. Among the key variables identified, the most powerful parameters were those that could be obtained at the time of hospital admission, either previous comorbidities or the clinical features assessed in the emergency department.
Thus, the strongest predictor among demographic factors is age, a conclusion common to many studies [45,46]. It should be emphasized that, although the majority of the patients in this cohort were older adults, this does not diminish the importance of age as a predictive factor. Rather than merely reflecting the higher mortality commonly associated with older populations, age itself plays a crucial role in the progression and outcome of COVID-19. This is supported by extensive evidence in the medical literature highlighting the physiological and immunological changes associated with aging that increase susceptibility to severe disease. To ensure the robustness of the analysis, statistical techniques were employed, including multivariate models, to isolate the independent contribution of age and confirm its predictive power while accounting for the influence of other clinical and pathologic parameters.
Similar trends were observed in SARS-CoV and MERS, where older age was also a significant risk factor for mortality, although the overall case fatality rates differed markedly, SARS-CoV at ~10% and MERS-COV at ~35%, compared to COVID-19′s lower fatality rate but higher transmissibility [47]. However, Philipps and Carver [48] concluded that it is not so much the age itself but what comes with advanced age that is a risk factor for mortality, namely comorbidities, chronic treatments and social situation. The results of this study support their theory, in that some of the factors that emerge as strong predictors of mortality are related to chronic diseases, such as ischemic heart disease and chronic renal failure.
The CURB-65 scale, a hallmark in the results, includes age as a variable. This scale has been evaluated in other studies as an adequate predictor of mortality in patients prior to hospitalization with COVID-19 pneumonia [49,50,51], and it is also commonly used to determine the severity in patients with any type of pneumonia. In addition to demographic factors, clinical features such as those included in the CURB-65 scale, such as urea levels or low level of consciousness, have considerable predictive power for mortality. In this study, the CURB-65 scale is a good parameter, and since it can be determined at the time of admission, this information can be used to perform rapid and efficient triage. Furthermore, a comparison with other viruses reveals that this scale, which focuses on clinical severity, has also been used to effectively manage pneumonia-like symptoms [52,53].
Laboratory parameters obtained at the time of hospitalization, such as urea levels, C-reactive protein, procalcitonin, albumin and/or lymphocyte count, are also good predictors of mortality according to the model used. Mohammadi et al. studied more than 1000 patients and concluded that elevated creatinine and urea levels were associated with poor prognosis in patients with COVID-19 [54]. Other studies, such as those of Al-Shajlawi et al. [55] or Singh and Singh [56], also include elevated C-reactive protein and procalcitonin levels, low albumin levels and low lymphocyte count as good predictors of mortality. Elevated urea levels, together with low lymphocyte levels, may be related to a severe systemic inflammatory response and consequent inflammation. C-reactive protein and procalcitonin are known markers of inflammation and sepsis, and their elevation reflects the severity of the inflammatory response. On the other hand, albumin, a protein produced by the liver, is normally decreased in states of chronic inflammation and physiological stress due to its role in the acute-phase response. A low lymphocyte count, or lymphopenia, may indicate immune dysfunction, which is common in severe viral infections such as COVID-19. All of these data would be consistent with a state of advanced inflammation and organ damage, explaining the association with an increased risk of mortality.
Moreover, this study shows that certain in-hospital monitoring variables, including acute renal failure and shock, have predicting significance. These belong to the category of monitoring parameters, which track the disease progression during hospitalization and provide critical insights into mortality risk. The study by Yüksel et al. [57] associates the aforementioned imbalances in laboratory parameters with the development of acute renal failure. At the same time, studies such as that by Qureshi et al. [58] have shown that non-surviving patients have a higher incidence of septic shock, among other pathologies. This underscores the importance of continuous monitoring and early intervention in COVID-19 patients with these alterations.
Additional monitored variables, such as the need for oxygen therapy, prone positioning sessions (PRONO), or mechanical ventilation, have also shown predictive significance for mortality risk as the disease progresses. Studies show they hold predictive significance for mortality risk as the disease progresses during ICU stay [59]. In turn, the APACHE-II scale has been shown to be a good predictor of mortality in some studies, although there are contradictions in this regard [59,60,61,62]. These include physiologic and biochemical parameters assessed on admission to the ICU that reflect the systemic response to severe physiologic stress common in severe cases of COVID-19. The higher relevance of APACHE-II in our study may be related to its ability to better capture multiorgan dysfunction in the early stages, providing a more robust prediction of mortality.
The data management approach used in this study involved an RF machine learning model, which was based on either bootstrap aggregation or bagging. This model has exhibited several advantageous characteristics, including high generalizability, stability and interpretability, along with a low risk of overfitting. Other studies have used the same method to analyze diseases, such as cardiovascular disease [63], interstitial lung disease [64], breast [65] or pancreatic cancer [66], autoimmune diseases [63] or neurological diseases [67] or neurodegenerative diseases [68]. The model can, therefore, be used as a strategic tool to quickly address clinical questions with relatively little data.
The application of the aforementioned model in the analysis of the variables with the most significant influence on the prediction of mortality in the cohort of hospitalized patients with COVID-19 during the second and third waves reveals a discrepancy in the results compared to those of analogous studies conducted in the first wave [11,12,13,69,70,71]. In these studies, patients had not yet developed any form of immunity. In contrast, the present study demonstrates that once a population achieves a certain level of immunity, additional predictive variables can be obtained at the time of admission. Furthermore, during the patient’s hospital stay, variables related to the stay itself were identified, and these variables can be used in decision making.

5. Conclusions

After the first wave of the recent COVID-19 pandemic, the size of the population that has overcome the infection and developed natural immunity is increasing. Therefore, it is crucial to study how natural immunity affects the evolution of the disease in order to improve general knowledge about it and improve preparedness for potential new health challenges.
The present study demonstrates that predictive variables for mortality risk, and, therefore, severity, can be identified in a given population at the time of hospital admission using an RF machine learning model. In this cohort, the CURB-65 scale and age were the strongest predictors of mortality, along with other laboratory tests. These results underscore the need to emphasize not only the early patient assessment but also the personalized analysis of the population and situation, given their capacity for change.
The present study shows that the increase in the number of immunized patients, even in the absence of herd immunity, generates a significant modification in the evolution of the disease, as well as the mortality risk parameters. This underscores the necessity for a more customized approach to patient care, employing distinct protocols tailored to specific groups and temporal periods. In this regard, the employment of machine learning tools emerges as a highly valuable and efficient resource.

Author Contributions

Conceptualization, M.Q. and J.B.; methodology, M.Q., J.B., A.M.T. and J.M.; software, A.M.T. and J.M.; validation, M.Q., J.B. and J.M.; formal analysis, M.Q., J.B., A.M.T. and J.M.; investigation, M.Q., J.B., A.M.T. and J.M.; writing—original draft, M.Q., J.B., A.M.T. and J.M.; writing—review and editing, J.B., A.M.T. and J.M.; visualization, M.Q., A.M.T. and J.M.; supervision, J.M.; project administration, J.M. All authors have read and agreed to the published version of the manuscript.

Funding

Research was funded by the Institute of Technology (University of Castilla-La Mancha, Spain), the Río Hortega University Hospital (Valladolid, Spain), and the University of Valladolid [60-195041] (Spain).

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of the Río Hortega University Hospital (Ref.: 23-PI186).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The datasets employed and analyzed in the current study are accessible upon reasonable request from the corresponding author. We do not have the patients’ permission to publish the data collected in this study in open access.

Acknowledgments

Chair of Artificial Intelligence, sponsored by Bayer.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
COVID-19Coronavirus Disease 19
ICUIntensive Care Unit
PCRPolymerase Chain Reaction
AIArtificial Intelligence
MLMachine learning
RFRandom Forest
CURBConfusion, Urea, Respiratory rate, Blood pressure
SOFASequential Organ Failure Assessment
APACHE-IIAcute Physiology and Chronic Health Evaluation
MCVMean Corpuscular Volume
PTProthrombin time
INRInternational normalized Ratio
aPTTActivated Partial Thromboplastin Time
CKD-EPIChronic Kidney Disease Epidemiology Collaboration
ALT/GPTAlanine Aminotransferase/Glutamate Pyruvate Transaminase
AST/GOTAspartate Aminotransferase/Glutamate Oxaloacetate Transaminase
GGTGamma-Glutamyl Transferase
LDHLactate Dehydrogenase
ROCReceiver Operating Characteristic
AUCArea Under the Curve
GNBGaussian Naïve Bayes
KNNK-Nearest Neighbors
BLDABayesian Linear Discriminant Analysis
SMVSupport Vector Machine
DTDecision Tree
COPDChronic Obstructive Pulmonary Disease
IMVInvasive Mechanical Ventilation
HFNCHigh-Flow Nasal Cannula
NIVNon-Invasive Mechanical Ventilation
ECMOExtracorporeal Membrane Oxygenation
ARDSAcute Respiratory Distress Syndrome
SDStandard Deviation
CRPC-Reactive Protein
SBPSystolic Blood Pressure
DBPDiastolic Blood Pressure
MCCMathew’s Correlation Coefficient
DYIDegenerated Youden’s Index

References

  1. Suryawanshi, Y.N.; Biswas, D.A. Herd Immunity to Fight Against COVID-19: A Narrative Review. Cureus 2023, 15. [Google Scholar] [CrossRef]
  2. Clemente-Suárez, V.J.; Hormeño-Holgado, A.; Jiménez, M.; Benitez-Agudelo, J.C.; Navarro-Jiménez, E.; Perez-Palencia, N.; Maestre-Serrano, R.; Laborde-Cárdenas, C.C.; Tornero-Aguilera, J.F. Dynamics of Population Immunity Due to the Herd Effect in the COVID-19 Pandemic. Vaccines 2020, 8, 236. [Google Scholar] [CrossRef] [PubMed]
  3. Fine, P.; Eames, K.; Heymann, D.L. “Herd Immunity”: A Rough Guide. Clin. Infect. Dis. 2011, 52, 911–916. [Google Scholar] [CrossRef] [PubMed]
  4. Hussain, A.; Yang, H.; Zhang, M.; Liu, Q.; Alotaibi, G.; Irfan, M.; He, H.; Chang, J.; Liang, X.-J.; Weng, Y.; et al. mRNA Vaccines for COVID-19 and Diverse Diseases. J. Control. Release 2022, 345, 314–333. [Google Scholar] [CrossRef]
  5. Zhang, M.; Hussain, A.; Yang, H.; Zhang, J.; Liang, X.-J.; Huang, Y. mRNA-Based Modalities for Infectious Disease Management. Nano Res. 2023, 16, 672–691. [Google Scholar] [CrossRef]
  6. Omer, S.B.; Yildirim, I.; Forman, H.P. Herd Immunity and Implications for SARS-CoV-2 Control. JAMA 2020, 324, 2095. [Google Scholar] [CrossRef]
  7. Mathieu, E.; Ritchie, H.; Rodés-Guirao, L.; Appel, C.; Gavrilov, D.; Giattino, C.; Hasell, J.; Macdonald, B.; Dattani, S.; Beltekian, D.; et al. Total COVID-19 Vaccine Doses Administered. In COVID-19 Pandemic; Data Adapted from Official Data Collated by Our World in Data; World Health Organization: Geneva, Switzerland, 2020. [Google Scholar]
  8. Ruan, Q.; Yang, K.; Wang, W.; Jiang, L.; Song, J. Clinical Predictors of Mortality Due to COVID-19 Based on an Analysis of Data of 150 Patients from Wuhan, China. Intensive Care Med. 2020, 46, 846–848. [Google Scholar] [CrossRef]
  9. Fajgenbaum, D.C.; June, C.H. Cytokine Storm. N. Engl. J. Med. 2020, 383, 2255–2273. [Google Scholar] [CrossRef]
  10. Iftimie, S.; López-Azcona, A.F.; Vallverdú, I.; Hernández-Flix, S.; De Febrer, G.; Parra, S.; Hernández-Aguilera, A.; Riu, F.; Joven, J.; Andreychuk, N.; et al. First and Second Waves of Coronavirus Disease-19: A Comparative Study in Hospitalized Patients in Reus, Spain. PLoS ONE 2021, 16, e0248029. [Google Scholar] [CrossRef]
  11. Chen, N.; Zhou, M.; Dong, X.; Qu, J.; Gong, F.; Han, Y.; Qiu, Y.; Wang, J.; Liu, Y.; Wei, Y.; et al. Epidemiological and Clinical Characteristics of 99 Cases of 2019 Novel Coronavirus Pneumonia in Wuhan, China: A Descriptive Study. Lancet 2020, 395, 507–513. [Google Scholar] [CrossRef]
  12. Soriano, V.; Ganado-Pinilla, P.; Sanchez-Santos, M.; Gómez-Gallego, F.; Barreiro, P.; De Mendoza, C.; Corral, O. Main Differences between the First and Second Waves of COVID-19 in Madrid, Spain. Int. J. Infect. Dis. 2021, 105, 374–376. [Google Scholar] [CrossRef]
  13. Buttenschøn, H.N.; Lynggaard, V.; Sandbøl, S.G.; Glassou, E.N.; Haagerup, A. Comparison of the Clinical Presentation across Two Waves of COVID-19: A Retrospective Cohort Study. BMC Infect. Dis. 2022, 22, 423. [Google Scholar] [CrossRef] [PubMed]
  14. The MO-COVID19 Working Group; Coloretti, I.; Farinelli, C.; Biagioni, E.; Gatto, I.; Munari, E.; Dall’Ara, L.; Busani, S.; Meschiari, M.; Tonelli, R.; et al. Critical COVID-19 Patients through First, Second, and Third Wave: Retrospective Observational Study Comparing Outcomes in Intensive Care Unit. J. Thorac. Dis. 2023, 15, 3218–3227. [Google Scholar] [CrossRef] [PubMed]
  15. San Martín-López, J.V.; Mesa, N.; Bernal-Bello, D.; Morales-Ortega, A.; Rivilla, M.; Guerrero, M.; Calderón, R.; Farfán, A.I.; Rivas, L.; Soria, G.; et al. Seven Epidemic Waves of COVID-19 in a Hospital in Madrid: Analysis of Severity and Associated Factors. Viruses 2023, 15, 1839. [Google Scholar] [CrossRef]
  16. Wu, Z.; Harrich, D.; Li, Z.; Hu, D.; Li, D. The Unique Features of SARS-CoV-2 Transmission: Comparison with SARS-CoV, MERS-CoV and 2009 H1N1 Pandemic Influenza Virus. Rev. Med. Virol. 2021, 31, e2171. [Google Scholar] [CrossRef]
  17. Zhou, H.; Yang, J.; Zhou, C.; Chen, B.; Fang, H.; Chen, S.; Zhang, X.; Wang, L.; Zhang, L. A Review of SARS-CoV2: Compared with SARS-CoV and MERS-CoV. Front. Med. 2021, 8, 628370. [Google Scholar] [CrossRef]
  18. Han, J.; Kamber, M.; Pei, J. Data Mining: Concepts and Techniques; Morgan Kaufmann Publishers: Burlington, VT, USA, 2022. [Google Scholar]
  19. Hesami, M.; Alizadeh, M.; Jones, A.M.P.; Torkamaneh, D. Machine Learning: Its Challenges and Opportunities in Plant System Biology. Appl. Microbiol. Biotechnol. 2022, 106, 3507–3530. [Google Scholar] [CrossRef]
  20. Haug, C.J.; Drazen, J.M. Artificial Intelligence and Machine Learning in Clinical Medicine, 2023. N. Engl. J. Med. 2023, 388, 1201–1208. [Google Scholar] [CrossRef] [PubMed]
  21. Wu, H.-T.; Liao, C.-C.; Peng, C.-F.; Lee, T.-Y.; Liao, P.-H. Exploring the Application of Machine Learning to Identify the Correlations between Phthalate Esters and Disease: Enhancing Nursing Assessments. Health Inf. Sci. Syst. 2024, 13, 10. [Google Scholar] [CrossRef]
  22. Rueda, R.; Fabello, E.; Silva, T.; Genzor, S.; Mizera, J.; Stanke, L. Machine Learning Approach to Flare-up Detection and Clustering in Chronic Obstructive Pulmonary Disease (COPD) Patients. Health Inf. Sci. Syst. 2024, 12, 50. [Google Scholar] [CrossRef]
  23. Barough, S.S.; Safavi-Naini, S.A.A.; Siavoshi, F.; Tamimi, A.; Ilkhani, S.; Akbari, S.; Ezzati, S.; Hatamabadi, H.; Pourhoseingholi, M.A. Generalizable Machine Learning Approach for COVID-19 Mortality Risk Prediction Using on-Admission Clinical and Laboratory Features. Sci. Rep. 2023, 13, 2399. [Google Scholar] [CrossRef]
  24. Kourmpanis, N.; Liaskos, J.; Zoulias, E.; Mantas, J. Predicting Mortality in COVID-19 Patients Using 6 Machine Learning Algorithms. In Studies in Health Technology and Informatics; Mantas, J., Gallos, P., Zoulias, E., Hasman, A., Househ, M.S., Charalampidou, M., Magdalinou, A., Eds.; IOS Press: Nieuwe Hemweg, Amsterdam, The Netherlands, 2023; ISBN 978-1-64368-400-0. [Google Scholar]
  25. Alballa, N.; Al-Turaiki, I. Machine Learning Approaches in COVID-19 Diagnosis, Mortality, and Severity Risk Prediction: A Review. Inform. Med. Unlocked 2021, 24, 100564. [Google Scholar] [CrossRef]
  26. Liu, L.; Song, W.; Patil, N.; Sainlaire, M.; Jasuja, R.; Dykes, P.C. Predicting COVID-19 Severity: Challenges in Reproducibility and Deployment of Machine Learning Methods. Int. J. Med. Inform. 2023, 179, 105210. [Google Scholar] [CrossRef]
  27. Sharifi-Kia, A.; Nahvijou, A.; Sheikhtaheri, A. Machine Learning-Based Mortality Prediction Models for Smoker COVID-19 Patients. BMC Med. Inform. Decis. Mak. 2023, 23, 129. [Google Scholar] [CrossRef]
  28. Navlakha, S.; Morjaria, S.; Perez-Johnston, R.; Zhang, A.; Taur, Y. Projecting COVID-19 Disease Severity in Cancer Patients Using Purposefully-Designed Machine Learning. BMC Infect. Dis. 2021, 21, 391. [Google Scholar] [CrossRef]
  29. Jung, Y.J.; Ahn, J.; Park, S.; Sun, J.-M.; Lee, S.-H.; Ahn, J.S.; Ahn, M.-J.; Cho, S.Y.; Jung, H.A. Machine Learning Prediction of the Case-Fatality of COVID-19 and Risk Factors for Adverse Outcomes in Patients with Non-Small Cell Lung Cancer. Transl. Cancer Res. 2024, 13, 2587–2595. [Google Scholar] [CrossRef]
  30. Subramanian, D.; Vittala, A.; Chen, X.; Julien, C.; Acosta, S.; Rusin, C.; Allen, C.; Rider, N.; Starosolski, Z.; Annapragada, A.; et al. Stratification of Pediatric COVID-19 Cases Using Inflammatory Biomarker Profiling and Machine Learning. J. Clin. Med. 2023, 12, 5435. [Google Scholar] [CrossRef]
  31. Cerbulescu, T.; Ignuta, F.; Rayudu, U.S.; Afra, M.; Rosca, O.; Vlad, A.; Loredana, S. Inflammatory Markers and Severity in COVID-19 Patients with Clostridioides Difficile Co-Infection: A Retrospective Analysis Including Subgroups with Diabetes, Cancer, and Elderly. Biomedicines 2025, 13, 227. [Google Scholar] [CrossRef]
  32. Schonlau, M.; Zou, R.Y. The Random Forest Algorithm for Statistical Learning. Stata J. Promot. Commun. Stat. Stata 2020, 20, 3–29. [Google Scholar] [CrossRef]
  33. Rhodes, J.S.; Cutler, A.; Moon, K.R. Geometry- and Accuracy-Preserving Random Forest Proximities. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 10947–10959. [Google Scholar] [CrossRef]
  34. Jayachitra, S.; Prasanth, A. Multi-Feature Analysis for Automated Brain Stroke Classification Using Weighted Gaussian Naïve Bayes Classifier. J. Circuit syst. Comp. 2021, 30, 2150178. [Google Scholar] [CrossRef]
  35. Balaji, V.R.; Suganthi, S.T.; Rajadevi, R.; Krishna Kumar, V.; Saravana Balaji, B.; Pandiyan, S. Skin Disease Detection and Segmentation Using Dynamic Graph Cut Algorithm and Classification through Naive Bayes Classifier. Measurement 2020, 163, 107922. [Google Scholar] [CrossRef]
  36. Uddin, S.; Haque, I.; Lu, H.; Moni, M.A.; Gide, E. Comparative Performance Analysis of K-Nearest Neighbour (KNN) Algorithm and Its Different Variants for Disease Prediction. Sci. Rep. 2022, 12, 6256. [Google Scholar] [CrossRef]
  37. Zhang, S.; Li, X.; Zong, M.; Zhu, X.; Wang, R. Efficient kNN Classification with Different Numbers of Nearest Neighbors. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 1774–1785. [Google Scholar] [CrossRef] [PubMed]
  38. Zhou, W.; Liu, Y.; Yuan, Q.; Li, X. Epileptic Seizure Detection Using Lacunarity and Bayesian Linear Discriminant Analysis in Intracranial EEG. IEEE Trans. Biomed. Eng. 2013, 60, 3375–3381. [Google Scholar] [CrossRef] [PubMed]
  39. Yuan, S.; Zhou, W.; Li, J.; Wu, Q. Sparse Representation-Based EMD and BLDA for Automatic Seizure Detection. Med. Biol. Eng. Comput. 2017, 55, 1227–1238. [Google Scholar] [CrossRef]
  40. Pérez, A.; Larrañaga, P.; Inza, I. Supervised Classification with Conditional Gaussian Networks: Increasing the Structure Complexity from Naive Bayes. Int. J. Approx. Reason. 2006, 43, 1–25. [Google Scholar] [CrossRef]
  41. Patel, H.H.; Prajapati, P. Study and Analysis of Decision Tree Based Classification Algorithms. Int. J. Comput. Sci. Eng. 2018, 6, 74–78. [Google Scholar] [CrossRef]
  42. Charbuty, B.; Abdulazeez, A. Classification Based on Decision Tree Algorithm for Machine Learning. J. Appl. Sci. Technol. Trends 2021, 2, 20–28. [Google Scholar] [CrossRef]
  43. Jiang, C.; Lv, W.; Li, J. Protein-Protein Interaction Sites Prediction Using Batch Normalization Based CNNs and Oversampling Method Borderline-SMOTE. IEEE/ACM Trans. Comput. Biol. Bioinf. 2023, 20, 2190–2199. [Google Scholar] [CrossRef]
  44. Alshammari, T.S. Applying Machine Learning Algorithms for the Classification of Sleep Disorders. IEEE Access 2024, 12, 36110–36121. [Google Scholar] [CrossRef]
  45. Andrew, M.K.; Godin, J.; LeBlanc, J.; Boivin, G.; Valiquette, L.; McGeer, A.; McElhaney, J.E.; Hatchette, T.F.; ElSherif, M.; MacKinnon-Cameron, D.; et al. Older Age and Frailty Are Associated with Higher Mortality but Lower ICU Admission with COVID-19. Can. Geriatr. J. 2022, 25, 183–196. [Google Scholar] [CrossRef] [PubMed]
  46. Conway, B.J.; Kim, J.W.; Brousseau, D.C.; Conroy, M. Heart Disease, Advanced Age, Minority Race, and Hispanic Ethnicity Are Associated with Mortality in COVID-19 Patients. WMJ Off. Publ. State Med. Soc. Wis. 2021, 120, 152–155. [Google Scholar]
  47. Petrosillo, N.; Viceconte, G.; Ergonul, O.; Ippolito, G.; Petersen, E. COVID-19, SARS and MERS: Are They Closely Related? Clin. Microbiol. Infect. 2020, 26, 729–734. [Google Scholar] [CrossRef]
  48. Phillips, S.P.; Carver, L.F. Greatest Risk Factor for Death from COVID-19: Older Age, Chronic Disease Burden, or Place of Residence? Descriptive Analysis of Population-Level Canadian Data. Int. J. Environ. Res. Public Health 2023, 20, 7181. [Google Scholar] [CrossRef]
  49. Satici, C.; Demirkol, M.A.; Sargin Altunok, E.; Gursoy, B.; Alkan, M.; Kamat, S.; Demirok, B.; Surmeli, C.D.; Calik, M.; Cavus, Z.; et al. Performance of Pneumonia Severity Index and CURB-65 in Predicting 30-Day Mortality in Patients with COVID-19. Int. J. Infect. Dis. 2020, 98, 84–89. [Google Scholar] [CrossRef] [PubMed]
  50. Bradley, J.; Sbaih, N.; Chandler, T.R.; Furmanek, S.; Ramirez, J.A.; Cavallazzi, R. Pneumonia Severity Index and CURB-65 Score Are Good Predictors of Mortality in Hospitalized Patients with SARS-CoV-2 Community-Acquired Pneumonia. Chest 2022, 161, 927–936. [Google Scholar] [CrossRef]
  51. Alegría-Baños, J.A.; Rosas-Alvarado, M.A.; Jiménez-López, J.C.; Juárez-Muciño, M.; Méndez-Celis, C.A.; Enríquez-De Los Santos, S.T.; Valdez-Vázquez, R.R.; Prada-Ortega, D. Sociodemographic, Clinical and Laboratory Characteristics and Risk Factors for Mortality of Hospitalized COVID-19 Patients at Alternate Care Site: A Latin American Experience. Ann. Med. 2023, 55, 2224049. [Google Scholar] [CrossRef] [PubMed]
  52. Ilg, A.; Moskowitz, A.; Konanki, V.; Patel, P.V.; Chase, M.; Grossestreuer, A.V.; Donnino, M.W. Performance of the CURB-65 Score in Predicting Critical Care Interventions in Patients Admitted with Community-Acquired Pneumonia. Ann. Emerg. Med. 2019, 74, 60–68. [Google Scholar] [CrossRef]
  53. Kaal, A.G.; Op De Hoek, L.; Hochheimer, D.T.; Brouwers, C.; Wiersinga, W.J.; Snijders, D.; Rensing, K.L.; Van Dijk, C.E.; Steyerberg, E.W.; Van Nieuwkoop, C. Outcomes of Community-Acquired Pneumonia Using the Pneumonia Severity Index versus the CURB-65 in Routine Practice of Emergency Departments. ERJ Open Res. 2023, 9, 00051–02023. [Google Scholar] [CrossRef]
  54. Mohammadi, Z.; Faghih Dinevari, M.; Vahed, N.; Ebrahimi Bakhtavar, H.; Rahmani, F. Clinical and Laboratory Predictors of COVID-19-Related In-Hospital Mortality; a Cross-Sectional Study of 1000 Cases. Arch. Acad. Emerg. Med. 2022, 10, e49. [Google Scholar] [CrossRef] [PubMed]
  55. Al-Shajlawi, M.; Alsayed, A.R.; Abazid, H.; Awajan, D.; Al-Imam, A.; Basheti, I. Using Laboratory Parameters as Predictors for the Severity and Mortality of COVID-19 in Hospitalized Patients. Pharm. Pract. 2022, 20, 2721. [Google Scholar] [CrossRef]
  56. Singh, S.; Singh, K. Blood Urea Nitrogen/Albumin Ratio and Mortality Risk in Patients with COVID-19. Indian J. Crit. Care Med. 2022, 26, 626–631. [Google Scholar] [CrossRef] [PubMed]
  57. Yüksel, E.; Arac, S.; Deniz Altintaş, D. Predictors of Mortality in COVID-19 Induced Acute Kidney Injury. Ther. Apher. Dial. 2022, 26, 897–907. [Google Scholar] [CrossRef] [PubMed]
  58. Qureshi, M.A.; Toori, K.U. Raja Mobeen Ahmed Predictors of Mortality in COVID-19 Patients: An Observational Study. Pak. J. Med. Sci. 2022, 39. [Google Scholar] [CrossRef]
  59. Muñoz Lezcano, S.; Armengol De La Hoz, M.Á.; Corbi, A.; López, F.; García, M.S.; Reiz, A.N.; González, T.F.; Zlatkov, V.Y. Predictors of Mechanical Ventilation and Mortality in Critically Ill Patients with COVID-19 Pneumonia. Med. Intensiv. (Engl. Ed.) 2023, S2173572723001303. [Google Scholar] [CrossRef]
  60. Arunachala, S.; Parthasarathi, A.; Basavaraj, C.K.; Kaleem Ullah, M.; Chandran, S.; Venkataraman, H.; Vishwanath, P.; Ganguly, K.; Upadhyay, S.; Mahesh, P.A. The Validity of the ROX Index and APACHE II in Predicting Early, Late, and Non-Responses to Non-Invasive Ventilation in Patients with COVID-19 in a Low-Resource Setting. Viruses 2023, 15, 2231. [Google Scholar] [CrossRef]
  61. Vandenbrande, J.; Verbrugge, L.; Bruckers, L.; Geebelen, L.; Geerts, E.; Callebaut, I.; Gruyters, I.; Heremans, L.; Dubois, J.; Stessel, B. Validation of the Acute Physiology and Chronic Health Evaluation (APACHE) II and IV Score in COVID-19 Patients. Crit. Care Res. Pract. 2021, 2021, 1–9. [Google Scholar] [CrossRef]
  62. Onuk, S.; Sipahioğlu, H.; Karahan, S.; Yeşiltepe, A.; Kuzugüden, S.; Karabulut, A.; Beştepe Dursun, Z.; Akın, A. Cytokine Levels and Severity of Illness Scoring Systems to Predict Mortality in COVID-19 Infection. Healthcare 2023, 11, 387. [Google Scholar] [CrossRef]
  63. He, J.; Tang, D.; Liu, D.; Hong, X.; Ma, C.; Zheng, F.; Zeng, Z.; Chen, Y.; Du, J.; Kang, L.; et al. Serum Proteome and Metabolome Uncover Novel Biomarkers for the Assessment of Disease Activity and Diagnosing of Systemic Lupus Erythematosus. Clin. Immunol. 2023, 251, 109330. [Google Scholar] [CrossRef]
  64. Huang, Y.; Ma, S.-F.; Oldham, J.M.; Adegunsoye, A.; Zhu, D.; Murray, S.; Kim, J.S.; Bonham, C.; Strickland, E.; Linderholm, A.L.; et al. Machine Learning of Plasma Proteomics Classifies Diagnosis of Interstitial Lung Disease. Am. J. Respir. Crit. Care Med. 2024, 210, 444–454. [Google Scholar] [CrossRef]
  65. Safia, S.N. Prediction of Breast Cancer Through Random Forest. Curr. Med. Imaging 2023, 19, e300922209414. [Google Scholar] [CrossRef]
  66. Sidiropoulos, T.; Dovrolis, N.; Katifelis, H.; Michalopoulos, N.V.; Kokoropoulos, P.; Arkadopoulos, N.; Gazouli, M. Dysbiosis Signature of Fecal Microbiota in Patients with Pancreatic Adenocarcinoma and Pancreatic Intraductal Papillary Mucinous Neoplasms. Biomedicines 2024, 12, 1040. [Google Scholar] [CrossRef] [PubMed]
  67. Founta, K.; Dafou, D.; Kanata, E.; Sklaviadis, T.; Zanos, T.P.; Gounaris, A.; Xanthopoulos, K. Gene Targeting in Amyotrophic Lateral Sclerosis Using Causality-Based Feature Selection and Machine Learning. Mol. Med. 2023, 29, 12. [Google Scholar] [CrossRef] [PubMed]
  68. Kelly, J.; Moyeed, R.; Carroll, C.; Luo, S.; Li, X. Blood Biomarker-Based Classification Study for Neurodegenerative Diseases. Sci. Rep. 2023, 13, 17191. [Google Scholar] [CrossRef]
  69. Khamis, F.; Memish, Z.; Bahrani, M.A.; Dowaiki, S.A.; Pandak, N.; Bolushi, Z.A.; Salmi, I.A.; Al-Zakwani, I. Prevalence and Predictors of In-Hospital Mortality of Patients Hospitalized with COVID-19 Infection. J. Infect. Public Health 2021, 14, 759–765. [Google Scholar] [CrossRef]
  70. Aly, M.H.; Rahman, S.S.; Ahmed, W.A.; Alghamedi, M.H.; Al Shehri, A.A.; Alkalkami, A.M.; Hassan, M.H. Indicators of Critical Illness and Predictors of Mortality in COVID-19 Patients. Infect. Drug Resist. 2020, 13, 1995–2000. [Google Scholar] [CrossRef]
  71. Tian, W.; Jiang, W.; Yao, J.; Nicholson, C.J.; Li, R.H.; Sigurslid, H.H.; Wooster, L.; Rotter, J.I.; Guo, X.; Malhotra, R. Predictors of Mortality in Hospitalized COVID-19 Patients: A Systematic Review and Meta-analysis. J. Med. Virol. 2020, 92, 1875–1883. [Google Scholar] [CrossRef]
Figure 1. The figure illustrates the training and validation stages used in this study.
Figure 1. The figure illustrates the training and validation stages used in this study.
Biomedicines 13 00803 g001
Figure 2. The figure shows the radar plots comparing multiple performance metrics for specific models. Each axis represents a key performance metric. The radial axis ranges from 0 (minimum relevance) to 100% (maximum relevance). (A) Training phase. (B) Test phase.
Figure 2. The figure shows the radar plots comparing multiple performance metrics for specific models. Each axis represents a key performance metric. The radial axis ranges from 0 (minimum relevance) to 100% (maximum relevance). (A) Training phase. (B) Test phase.
Biomedicines 13 00803 g002
Figure 3. The figure shows the ROC curves for the six machine learning predictors. This curve shows the trade-off between sensitivity (true positive rate) and 1-specificity (false positive rate) for the ML models mentioned: RF, KNN, SVM, DT, BLDA and GNB.
Figure 3. The figure shows the ROC curves for the six machine learning predictors. This curve shows the trade-off between sensitivity (true positive rate) and 1-specificity (false positive rate) for the ML models mentioned: RF, KNN, SVM, DT, BLDA and GNB.
Biomedicines 13 00803 g003
Figure 4. The histogram shows the 21 most relevant parameters contributing to the mortality in COVID-19 hospitalized patients on the y-axis. The x-axis represents the statistical weight or importance assigned of each parameter in predicting mortality. More details about the units in which the parameters are presented in Table 1, Table 2, Table 3 and Table 4.
Figure 4. The histogram shows the 21 most relevant parameters contributing to the mortality in COVID-19 hospitalized patients on the y-axis. The x-axis represents the statistical weight or importance assigned of each parameter in predicting mortality. More details about the units in which the parameters are presented in Table 1, Table 2, Table 3 and Table 4.
Biomedicines 13 00803 g004
Table 1. Parameters related to patient history and demographics.
Table 1. Parameters related to patient history and demographics.
Deceased
(40 Patients, 11%)
Alive
(323 Patients, 89%)
Global
(363 Patients, 100%)
(n)(%)(n)(%)(n)(%)
Sex
Men33831986123164
Women7181253913236
Comorbidities
Hypertension24601444516846
Diabetes mellitus102568217821
Ischemic cardiopathy718175247
Cardiac insufficiency136272
COPD (Chronic emphysema/bronchitis)13175185
Asthma13175185
Dyslipidemia16401003111632
Coagulopathies71834114111
Renal insufficiency513196247
Active tumors13165175
Immune-mediated diseases2538124011
Overweight134151
Obesity38309339
Chronic treatments32801835721559
Antihypertensives21531153613637
Beta-blockers102538124813
Diuretics82037114512
Antidiabetics82058186618
Anti-aggregants112833104412
Anticoagulants61534114011
Lipid lowering agents1538902810529
Chemotherapy132131
Immunosuppressive chronic treatment38217247
Antiretrovirals001010
Antivirals001010
Vaccination status
1 dose25155175
2 doses006262
3 doses000000
4 doses000000
Table 2. Clinical characteristics of patients at the time of hospital admission.
Table 2. Clinical characteristics of patients at the time of hospital admission.
Deceased
(40 Patients, 11%)
Alive
(323 Patients, 89%)
Global
(363 Patients)
(n)(%)(n)(%)(n)(%)
Symptoms on admission
Dyspnea19481725319153
Chest discomfort1344144512
Cough21532026322361
Rhinorrhea00103103
Loss of smell (anosmia)0049154913
Loss of taste (ageusia)2548155014
Odynophagia00124123
Myalgia41057186117
Fever26652257025169
Dysthermia38278308
Headache2533103510
Nausea/Vomiting51347155214
Diarrhea3865206819
Asthenia7181013110830
Confusion92382175
Dizziness25196216
Sputum61545145114
Diagnosis on admission
Respiratory distress615217277
Acute respiratory insufficiency20501273914740
Multiorgan failure250021
Table 3. Patient characteristics associated with hospitalization.
Table 3. Patient characteristics associated with hospitalization.
Deceased
(40 Patients, 11%)
Alive
(323 Patients, 89%)
Global
(363 Patients)
(n)(%)(n)(%)(n)(%)
ICU admission215361198223
Invasive mechanical ventilation (IMV)194856177521
High Flow Nasal Cannula (HFNC)410299339
Non-invasive mechanical ventilation (NIV)12303094212
Prone sessions12302894011
Extracorporeal membrane oxygenation (ECMO)134151
At least one previous episode of COVID132131
Nosocomial infections
Viral002121
Bacterial11283094111
Fungal383162
Complications during hospital stay
Acute renal failure1333165298
Cardiac923155247
Arrhythmias923155247
Gastrointestinal513227277
Increased transaminases513103154
Ileus009392
Mesenteric ischemia001010
Subocclusion002121
Neurological38196226
Delirium13155164
Encephalopathy130010
Peripheral neuropathy134151
Coagulopathies923144236
Deep vein thrombosis002121
Pulmonary thromboembolism3872103
Stroke132131
Bleeding5133182
Respiratory distress (ARDS)153836115114
Shock5131062
Treatment for COVID-19 during hospital stay
Oxygen32801735420556
Corticosteroids26661424416846
Rendesivir25268288
Ceftriaxone21531424416345
Azithromycin15381113412635
Heparin8201123512033
Cause of death (if applicable)
Multiorgan failure174300175
Respiratory distress130010
Respiratory failure112800113
Table 4. Numerical baseline clinical characteristics of patients. Mean and standard deviation (SD) were calculated.
Table 4. Numerical baseline clinical characteristics of patients. Mean and standard deviation (SD) were calculated.
Deceased
(40 Patients, 11%)
Alive
(323 Patients, 89%)
Global
(363 Patients)
MeanSDMeanSDMeanSD
Age79.5310.9264.4016.1866.0816.37
Leucocytes (×103 µL)7.995.126.964.867.074.90
Lymphocytes (×103 µL)0.710.361.535.601.435.28
Neutrophils (×103 µL)6.425.235.053.805.204.00
Monocytes (×103 µL)0.500.290.610.810.600.77
Eosinophils (×103 µL)0.030.090.020.070.020.07
Basophils (×103 µL)0.140.740.020.040.030.25
Erythrocytes (×106 µL)4.400.684.720.704.690.70
Hemoglobin (g/dL)13.411.8713.791.8313.751.83
Hematocrit (%)39.627.0041.525.3641.305.59
M.C.V. (fL)90.789.9987.817.2588.157.64
Platelets (×103 µL)163.5280.76186.6279.00184.0279.42
Glucose (mg/dL)144.8362.46138.8462.98139.5062.86
Urea (mg/dL)71.7240.2043.5726.3746.7729.60
Creatinine (mg/dL)1.370.761.375.331.375.02
Estimated glomerular filtrate (CKD-EPI 2009) (mL/min/1.73 m2)56.8823.8673.7920.2271.8421.33
Sodium (mmol/L)135.854.04134.913.39135.023.48
Potassium (mmol/L)4.140.584.202.824.192.66
Chloride (mmol/L)101.853.77101.135.70101.215.52
Total bilirubin (mg/dL)0.760.421.649.601.549.05
Aspartate aminotransferase (AST/GOT) (U/L)72.77100.6848.4836.1551.2048.22
Alanine aminotransferase (ALT/GPT) (U/L)51.8375.9942.1740.4543.2145.59
Lactate dehydrogenase (LDH) (U/L)411.46201.74325.42128.12335.40140.97
Albumin (g/dL)3.530.403.820.383.790.39
C-reactive protein (mg/dL)115.4770.7178.2369.8582.2270.79
Procalcitonin (ng/mL)0.812.930.250.920.311.30
D-dimer (ng/mL)1500.512568.301155.002318.821192.102345.08
Fibrinogen (Derived) (mg/dL)688.53171.63667.34161.70669.71162.71
Ratio (TP)2.034.621.907.061.916.83
Ratio (TTPA)29.833.8730.785.2430.675.11
pH7.410.077.430.077.420.07
pCO2 (mmHg)36.919.1634.757.0535.057.39
pO2 (mmHg)64.2024.8360.8023.2861.2723.48
Bicarbonate (CO3H) (mmol/L)23.014.6123.073.1023.063.34
FIO2 (%)26.8610.0824.1811.3224.5711.16
pO2/FIO2256.1086.47279.83109.36276.21106.32
Arterial/alveolar O2 gradient (mmHg)72.1955.0165.1376.8566.1174.11
Lactate (mmol/L)1.891.031.721.161.741.15
Days in hospital20.5917.4414.9218.3415.5418.30
Days elapsed between PCR and hospital admission3.135.804.004.753.914.87
Number of consolidations2.571.272.471.552.481.51
Number of opacities3.411.763.181.693.201.69
Curb 65 Scale value1.630.810.740.810.840.86
Temperature (°C)36.871.0836.700.9436.720.96
Systolic blood pressure (SBP)135.4123.07131.9422.62132.3222.66
Diastolic blood pressure (DBP)63.3116.1071.9216.5870.9816.72
Heart rate87.5619.8190.9017.5190.5417.78
Respiratory rate25.327.6723.016.9523.347.07
Glasgow Coma Scale value14.431.3214.860.7814.810.87
SOFA scale Value4.451.234.071.634.171.54
APACHE-II scale value12.804.698.313.239.534.16
Number of COVID-19 vaccine doses1.000.001.290.461.260.45
Table 5. The final results of accuracy, recall, precision, specificity and kappa are shown in the table.
Table 5. The final results of accuracy, recall, precision, specificity and kappa are shown in the table.
Accuracy (%)RecallPrecisionSpecificityKappa
SVM83.3283.4282.7383.2273.39
BLDA80.3780.4779.7880.2771.41
DT85.5285.6284.9885.4275.46
GNB74.8274.9174.3574.7366.67
KNN89.2489.3189.1489.1879.01
RF95.8395.9295.1595.7386.32
Table 6. The final results of F1 score, MCC, DYI, AUC and AUC (%) are shown in the table.
Table 6. The final results of F1 score, MCC, DYI, AUC and AUC (%) are shown in the table.
F1 ScoreMCCDYIAUCAUC (%)
SVM83.0773.9383.32820.82
BLDA80.1271.3180.37790.79
DT85.2975.9485.52840.84
GNB74.6365.8374.82740.74
KNN89.2379.4889.24890.89
RF95.5386.8395.83950.95
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Queipo, M.; Mateo, J.; Torres, A.M.; Barbado, J. The Effect of Naturally Acquired Immunity on Mortality Predictors: A Focus on Individuals with New Coronavirus. Biomedicines 2025, 13, 803. https://doi.org/10.3390/biomedicines13040803

AMA Style

Queipo M, Mateo J, Torres AM, Barbado J. The Effect of Naturally Acquired Immunity on Mortality Predictors: A Focus on Individuals with New Coronavirus. Biomedicines. 2025; 13(4):803. https://doi.org/10.3390/biomedicines13040803

Chicago/Turabian Style

Queipo, Mónica, Jorge Mateo, Ana María Torres, and Julia Barbado. 2025. "The Effect of Naturally Acquired Immunity on Mortality Predictors: A Focus on Individuals with New Coronavirus" Biomedicines 13, no. 4: 803. https://doi.org/10.3390/biomedicines13040803

APA Style

Queipo, M., Mateo, J., Torres, A. M., & Barbado, J. (2025). The Effect of Naturally Acquired Immunity on Mortality Predictors: A Focus on Individuals with New Coronavirus. Biomedicines, 13(4), 803. https://doi.org/10.3390/biomedicines13040803

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop